You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I am working with the Llama 3.2 Vision model, which is a multimodal large model consisting of an image_encoder and a causal model, similar to the LLaVA model structure. I plan to freeze the Llama 3.2 Vision model and add an additional causal model to encode the image tokens extracted by the image_encoder, and train this additional causal model.
how do i train this modified model by ur repo?
The text was updated successfully, but these errors were encountered:
thx for ur repo
I am working with the Llama 3.2 Vision model, which is a multimodal large model consisting of an image_encoder and a causal model, similar to the LLaVA model structure. I plan to freeze the Llama 3.2 Vision model and add an additional causal model to encode the image tokens extracted by the image_encoder, and train this additional causal model.
how do i train this modified model by ur repo?
The text was updated successfully, but these errors were encountered: