Chipper provides a web interface, CLI, and a modular, hackable, and lightweight architecture for RAG pipelines, document chunking, web scraping, and query workflows, enhancing generative AI models with advanced information retrieval capabilities. It can also function as a proxy between an Ollama client, such as Enchanted or Open WebUI, and an Ollama instance. Built with Haystack, Ollama, Hugging Face, Docker, TailwindCSS, and ElasticSearch, it runs as a fully containerized service.

This project started as a personal tool to help my girlfriend with her book, using local RAG and LLMs to explore characters and creative ideas while keeping her work private and off cloud services like ChatGPT. What began as a few handy scripts soon grew into a fully dockerized, extensible service and along the way, it became a labor of love. Now, I'm excited to share it with the world.

If you find Chipper useful, leaving a star would be lovely and will help others discover Chipper too.

Live Demo: https://demo.chipper.tilmangriesel.com/

Features

Local & Cloud Model Support — Run models locally with Ollama or connect to remote models via the Hugging Face API.
ElasticSearch Integration — Store and retrieve vectorized data efficiently with scalable indexing.
Document Chunking — Process and split documents into structured segments.
Web Scraping — Extract and index content from web pages.
Audio Transcription — Convert audio files to text.
CLI & Web UI — Access Chipper via a command-line tool or a lightweight, self-contained web interface.
Dockerized Deployment — Run in a fully containerized setup with minimal configuration.
Customizable RAG Pipelines — Adjust model selection, query parameters, and system prompts as needed.
Ollama API Proxy — Extend Ollama with retrieval capabilities, enabling interoperability with clients like Enchanted and Open WebUI.
API Security — Proxy the Ollama API with API key-based and Baerer token service authentication.
Offline Web UI — Works without an internet connection using vanilla JavaScript and TailwindCSS.

Installation and Setup

Visit the Chipper project website for detailed setup instructions.

Note: This is just a research project, so it's not built for production.

Philosophy

At the heart of this project lies my passion for education and exploration. I believe in creating tools that are both approachable for beginners and helpful for experts. My goal is to offer you a well-thought-out service architecture, and a stepping stone for those eager to learn and innovate.

This project wants to be more than just a technical foundation, for educators, it provides a framework to teach AI concepts in a manageable and practical way. For explorers, tinkerers and companies, it offers a playground where you can experiment, iterate, and build upon a versatile platform.

Feel free to improve, fork, copy, share or expand this project. Contributions are always very welcome!

Demos

Web Interface

Use Chipper's built-in web interface to set up and customize RAG pipelines with ease. Built with vanilla JavaScript and TailwindCSS, it works offline and doesn't require any framework-specific knowledge. Run the /help command to learn how to switch models, update the embeddings index, and more.

Code Output

Automatic syntax highlighting for popular programming languages in the web interface.

Reasoning

For models like DeepSeek-R1, Chipper suppresses the "think" output in the UI while preserving the reasoning steps in the console output.

CLI Interface

Full support for the Ollama CLI and API, including reflection and proxy capabilities, with API key route decorations.

Third-Party Clients

Enhance every third-party Ollama client with server-side knowledge base embeddings, allowing server side model selection, query parameters, and system prompt overrides. Enable RAG for any Ollama client or use Chipper as a centralized knowledge base.

Roadmap

Done

Basic Functionality
CLI
Web UI
Docker
Enhanced Web UI (better mobile support)
Improved Linting and Formatting
Docker Hub Registry Images
Edge Inference TTS
Mirror Ollama Chat API to enable Chipper as a drop-in middleware
Baerer token support

Todo

Distributed Processing — Chain multiple Chipper instances together for workload distribution and extended processing.
- Requires ChatPromptBuilder and OllamaChatGenerator implementation
Automated Unit Tests
Smart Document Chunking and Embedding
React-Based Web Application

Acknowledgments

Haystack
Ollama
Hugging Face
Elastic
Elasticvue
Docker
VitePress
Sherpa ONNX
Sherpa ONNX Wasm

Star History

Be sure to visit the Chipper project website for detailed setup instructions and more information.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Features

Installation and Setup

Philosophy

Demos

Web Interface

Code Output

Reasoning

CLI Interface

Third-Party Clients

Roadmap

Done

Todo

Acknowledgments

Star History

Files

README.md

Latest commit

History

README.md

File metadata and controls

Features

Installation and Setup

Philosophy

Demos

Web Interface

Code Output

Reasoning

CLI Interface

Third-Party Clients

Roadmap

Done

Todo

Acknowledgments

Star History