Replies: 2 comments
-
Thank you for the suggestion. I just tried it out and it seems pretty good, although I did notice some hallucination. Unfortunately, LLaVA has not yet been added to the Transformers library (unlike BLIP-2), so it is difficult to integrate it into TagGUI. |
Beta Was this translation helpful? Give feedback.
0 replies
-
LLaVA has been added in v1.9.0. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I have been looking at LLaVA woudl be nice if someday it could be integrated like BLIP2 has.
LLaVA: Large Language and Vision Assistant
https://github.com/haotian-liu/LLaVA#evaluation
More info:
https://arxiv.org/abs/2304.08485
Paper which shows its comparison to BLIP2
https://arxiv.org/pdf/2310.03744.pdf
Beta Was this translation helpful? Give feedback.
All reactions