I have some questions about the group tokens.
It seems that `group_token` is initialized to `None` at line 839 of ./models/group_vit.py.
So how is the group token made learnable? Why not define it as a learnable parameter?
Also, how do we get the segmentation mask from the model? It seems that the `GroupViT` class in ./models/group_vit.py only returns the image features.
I find this confusing.
Could you please help clarify?
Sorry for the late reply. The group token is initialized here.
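For readers following along, here is a minimal sketch of a common PyTorch pattern that fits the question: the learnable tokens live on a layer as an `nn.Parameter`, while a `group_token = None` inside `forward` is only a local placeholder passed between stages. This is not the repository's code. `ToyStage`, `ToyModel`, and all sizes are hypothetical names and values chosen for illustration, and whether line 839 follows this pattern is an assumption.

```python
import torch
import torch.nn as nn


class ToyStage(nn.Module):
    """Hypothetical stage that may own learnable group tokens."""

    def __init__(self, dim, num_group_tokens):
        super().__init__()
        if num_group_tokens > 0:
            # Registered as nn.Parameter, so an optimizer will update it.
            self.group_token = nn.Parameter(torch.zeros(1, num_group_tokens, dim))
            nn.init.trunc_normal_(self.group_token, std=0.02)
        else:
            self.group_token = None
        self.proj = nn.Linear(dim, dim)

    def forward(self, x, prev_group_token=None):
        if self.group_token is None:
            # No tokens of its own: pass the previous ones through unchanged.
            return self.proj(x), prev_group_token
        # Broadcast the learnable tokens over the batch and append them.
        tokens = self.group_token.expand(x.shape[0], -1, -1)
        out = self.proj(torch.cat([x, tokens], dim=1))
        num_patches = x.shape[1]
        return out[:, :num_patches], out[:, num_patches:]


class ToyModel(nn.Module):
    def __init__(self, dim=8):
        super().__init__()
        self.stages = nn.ModuleList([ToyStage(dim, 4), ToyStage(dim, 0)])

    def forward(self, x):
        group_token = None  # local placeholder only, not the parameter
        for stage in self.stages:
            x, group_token = stage(x, group_token)
        return x, group_token


model = ToyModel()
# Names of parameters that hold group tokens.
learnable = [name for name, _ in model.named_parameters() if "group_token" in name]
```

In this toy setup, `learnable` lists the token parameter of the first stage, which shows that a `None` local variable in `forward` does not prevent the tokens themselves from being trained.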
You may refer to this file for how to get the segmentation mask.
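As a rough illustration of the general idea only, the sketch below turns group assignments into a mask: each patch takes the class of the group it matches best. It assumes you already have per-patch group scores and per-group class scores; the function name `groups_to_mask`, its arguments, and the toy numbers are all hypothetical, and the file referenced above remains the authoritative implementation.

```python
def groups_to_mask(patch_to_group_scores, group_to_class_scores, grid_hw):
    """Assign each patch the class of its best-matching group.

    patch_to_group_scores: list of [num_groups] scores, one row per patch
    group_to_class_scores: list of [num_classes] scores, one row per group
    grid_hw: (height, width) of the patch grid
    """
    def argmax(values):
        return max(range(len(values)), key=values.__getitem__)

    # Best class for every group.
    group_class = [argmax(row) for row in group_to_class_scores]
    # Best group for every patch, mapped to that group's class.
    patch_class = [group_class[argmax(row)] for row in patch_to_group_scores]
    h, w = grid_hw
    # Reshape the flat patch labels into a 2D grid.
    return [patch_class[r * w:(r + 1) * w] for r in range(h)]


# Toy 2x2 patch grid, 2 groups, 3 classes (all values made up).
patch_scores = [[0.9, 0.1], [0.2, 0.8], [0.7, 0.3], [0.4, 0.6]]
class_scores = [[0.1, 0.2, 0.7], [0.6, 0.3, 0.1]]
mask = groups_to_mask(patch_scores, class_scores, (2, 2))
```

A real pipeline would additionally upsample the patch-level grid to the image resolution; that step is omitted here.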
This is excellent work, and thank you for publishing the source code. I had trouble downloading the CC12M and CC3M data: the vast majority of the images cannot be downloaded. Could you please publish these datasets on Hugging Face so that we can download them? Thank you very much.