This repository is an application that uses LangChain to run various computer vision models through a chat interface. Check out the demo.
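To make the idea concrete, here is a minimal sketch of the tool-dispatch pattern behind "vision models through chat". It is not this repository's code and it does not use LangChain: the `VisionTool` class, the `fake_caption` and `fake_detect` functions, and the keyword-based `route` function are all hypothetical stand-ins. In the real application an LLM agent would choose the tool from its description, and each tool would wrap an actual model.

```python
from dataclasses import dataclass
from typing import Callable, Dict


@dataclass
class VisionTool:
    """A named vision capability the chat layer can call (hypothetical)."""
    name: str
    description: str
    run: Callable[[str], str]  # takes an image path, returns a text result


def fake_caption(image_path: str) -> str:
    # Placeholder for a real image captioning model.
    return f"caption for {image_path}"


def fake_detect(image_path: str) -> str:
    # Placeholder for a real object detection model.
    return f"objects in {image_path}"


TOOLS: Dict[str, VisionTool] = {
    tool.name: tool
    for tool in (
        VisionTool("caption", "Describe the image in one sentence.", fake_caption),
        VisionTool("detect", "List the objects in the image.", fake_detect),
    )
}


def route(message: str, image_path: str) -> str:
    """Pick a tool by keyword match. An LLM agent would make this choice instead."""
    text = message.lower()
    for name, tool in TOOLS.items():
        if name in text:
            return tool.run(image_path)
    return "No matching vision tool."
```

Calling `route("Please caption this photo", "cat.png")` runs the captioning placeholder. Keeping each model behind a small named interface like this is what lets a chat agent treat models as interchangeable tools.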
TODO:
- Improve results in specialized fields by training a vision model on a custom, field-specific dataset
- Move vision model inference from running the models directly in the application to calling an API (with NVIDIA Triton Inference Server)
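
For the Triton item, a rough sketch of what the client side might look like: Triton's HTTP endpoint follows the KServe v2 inference protocol, where a request is a JSON body posted to `/v2/models/<model>/infer`. The helper below only builds the URL and body and does not send anything. The function name `build_infer_request`, the model name, the input tensor name, and the `FP32` datatype are assumptions for illustration; the real values depend on the deployed model's configuration.

```python
import json
from typing import List, Tuple


def build_infer_request(
    base_url: str,
    model_name: str,
    input_name: str,
    shape: List[int],
    data: List[float],
) -> Tuple[str, str]:
    """Return (url, json_body) for a KServe v2 style inference call."""
    # The flattened data must contain exactly prod(shape) values.
    expected = 1
    for dim in shape:
        expected *= dim
    if expected != len(data):
        raise ValueError(f"shape {shape} needs {expected} values, got {len(data)}")

    url = f"{base_url.rstrip('/')}/v2/models/{model_name}/infer"
    body = {
        "inputs": [
            # FP32 is an assumption; use the datatype the model expects.
            {"name": input_name, "shape": shape, "datatype": "FP32", "data": data}
        ]
    }
    return url, json.dumps(body)


# Hypothetical model and tensor names, for illustration only.
url, body = build_infer_request(
    "http://localhost:8000", "my_vision_model", "input__0", [1, 4], [0.1, 0.2, 0.3, 0.4]
)
```

The resulting `url` and `body` could then be sent with any HTTP client, with `Content-Type: application/json`.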