Skip to content

Index and search images based on descriptions generated by a local multimodal LLM

License

Notifications You must be signed in to change notification settings

WHOIGit/ollama-chromadb-image-search

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Overview

Index and search images based on descriptions generated by a local multimodal LLM.

This application makes a directory of images searchable with text queries. It does this by using a local multimodal LLM (e.g., llama3.2-vision) via the ollama API to generate descriptions of images, which it then writes to a semantic database (chromadb).

The text embeddings used by chromadb allow for querying the images with text prompts.

Prerequisites

Use

To index data, run main.py

python main.py --directory /path/to/images

To query the index, use --query

python main.py --query "buoy"

About

Index and search images based on descriptions generated by a local multimodal LLM

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages