You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi, Did you ever try with distilling the VGGT knowledge to the visual tokens just after ViT saying visual encoder siglip, instead of the LLM final visual hidden states saying inside LLM layers?
Hi, Did you ever try with distilling the VGGT knowledge to the visual tokens just after ViT saying visual encoder siglip, instead of the LLM final visual hidden states saying inside LLM layers?
thx