Chipper provides a web interface, CLI, and a modular, hackable, and lightweight architecture for RAG pipelines, document chunking, web scraping, and query workflows, enhancing generative AI models with advanced information retrieval capabilities. It can also function as a proxy between an Ollama client, such as Enchanted or Open WebUI, and an Ollama instance. Built with Haystack, Ollama, Hugging Face, Docker, TailwindCSS, and ElasticSearch, it runs as a fully containerized service.
This project started as a personal tool to help my girlfriend with her book, using local RAG and LLMs to explore characters and creative ideas while keeping her work private and off cloud services like ChatGPT. What began as a few handy scripts soon grew into a fully dockerized, extensible service and along the way, it became a labor of love. Now, I'm excited to share it with the world.
If you find Chipper useful, leaving a star would be lovely and will help others discover Chipper too.
Live Demo: https://demo.chipper.tilmangriesel.com/
- Local & Cloud Model Support — Run models locally with Ollama or connect to remote models via the Hugging Face API.
- ElasticSearch Integration — Store and retrieve vectorized data efficiently with scalable indexing.
- Document Chunking — Process and split documents into structured segments.
- Web Scraping — Extract and index content from web pages.
- Audio Transcription — Convert audio files to text.
- CLI & Web UI — Access Chipper via a command-line tool or a lightweight, self-contained web interface.
- Dockerized Deployment — Run in a fully containerized setup with minimal configuration.
- Customizable RAG Pipelines — Adjust model selection, query parameters, and system prompts as needed.
- Ollama API Proxy — Extend Ollama with retrieval capabilities, enabling interoperability with clients like Enchanted and Open WebUI.
- API Security — Proxy the Ollama API with API key-based and Baerer token service authentication.
- Offline Web UI — Works without an internet connection using vanilla JavaScript and TailwindCSS.
Visit the Chipper project website for detailed setup instructions.
Note: This is just a research project, so it's not built for production.
At the heart of this project lies my passion for education and exploration. I believe in creating tools that are both approachable for beginners and helpful for experts. My goal is to offer you a well-thought-out service architecture, and a stepping stone for those eager to learn and innovate.
This project wants to be more than just a technical foundation, for educators, it provides a framework to teach AI concepts in a manageable and practical way. For explorers, tinkerers and companies, it offers a playground where you can experiment, iterate, and build upon a versatile platform.
Feel free to improve, fork, copy, share or expand this project. Contributions are always very welcome!
Use Chipper's built-in web interface to set up and customize RAG pipelines with ease. Built with vanilla JavaScript and TailwindCSS, it works offline and doesn't require any framework-specific knowledge. Run the /help
command to learn how to switch models, update the embeddings index, and more.
Automatic syntax highlighting for popular programming languages in the web interface.
For models like DeepSeek-R1, Chipper suppresses the "think" output in the UI while preserving the reasoning steps in the console output.
Full support for the Ollama CLI and API, including reflection and proxy capabilities, with API key route decorations.
Enhance every third-party Ollama client with server-side knowledge base embeddings, allowing server side model selection, query parameters, and system prompt overrides. Enable RAG for any Ollama client or use Chipper as a centralized knowledge base.
- Basic Functionality
- CLI
- Web UI
- Docker
- Enhanced Web UI (better mobile support)
- Improved Linting and Formatting
- Docker Hub Registry Images
- Edge Inference TTS
- Mirror Ollama Chat API to enable Chipper as a drop-in middleware
- Baerer token support
- Distributed Processing — Chain multiple Chipper instances together for workload distribution and extended processing.
- Requires
ChatPromptBuilder
andOllamaChatGenerator
implementation
- Requires
- Automated Unit Tests
- Smart Document Chunking and Embedding
- React-Based Web Application
Be sure to visit the Chipper project website for detailed setup instructions and more information.