Has there been any consideration for implementing IImageToTextService for the Ollama connector?
I can only find the Hugging Face implementation in the codebase, but I might have missed it if it exists elsewhere. I noticed that OllamaSharp does include support for this feature.
One observation: OllamaSharp supports passing a prompt along with the image, but the current IImageToTextService interface has no way to supply one. Prompt support could be valuable across connectors, enabling more flexible image analysis, from controlling the response structure to asking specific questions about image content.
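To illustrate what a prompt adds, here is a rough sketch of the request body that Ollama's REST endpoint `/api/generate` takes when an image is sent together with a question. It is written in Python against the raw HTTP API, not against OllamaSharp or Semantic Kernel. The helper names `build_generate_payload` and `describe_image`, the model name `llava`, and the default local URL are assumptions for illustration only.

```python
import base64
import json
import urllib.request


def build_generate_payload(image_bytes: bytes, prompt: str, model: str = "llava") -> dict:
    """Build a request body for Ollama's /api/generate endpoint.

    Hypothetical helper: the model name is an assumption; any multimodal
    model available in the local Ollama instance could be used instead.
    """
    return {
        "model": model,
        # The prompt is the part the current interface has no slot for.
        "prompt": prompt,
        # Ollama expects images as a list of base64-encoded strings.
        "images": [base64.b64encode(image_bytes).decode("ascii")],
        # Ask for a single JSON object instead of a stream of chunks.
        "stream": False,
    }


def describe_image(image_bytes: bytes, prompt: str,
                   base_url: str = "http://localhost:11434") -> str:
    """Send the payload to a locally running Ollama server (assumed URL)."""
    payload = build_generate_payload(image_bytes, prompt)
    request = urllib.request.Request(
        f"{base_url}/api/generate",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        # With streaming disabled, the generated text is in the "response" field.
        return json.loads(response.read())["response"]
```

With something like this, the same image could be sent with "Describe this image in one sentence" or "List the objects as JSON", which is the kind of flexibility a prompt parameter on the interface would expose.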
I've been experimenting with a local implementation and was wondering if this is something that might be valuable to the project. If there's interest or existing plans for this feature, I'd be happy to learn more or potentially contribute to the effort.
Looking forward to any thoughts or guidance on this.