Chat RAG: Interactive Coding Assistant

Overview

Chat RAG is an advanced interactive coding assistant that leverages Retrieval-Augmented Generation (RAG) to provide informed responses to coding queries. Built with a user-friendly Gradio interface, it allows users to interact with various language models, customize model parameters, and upload context files from local directories or GitHub repositories for more accurate assistance.

Features

Multiple Model Providers: Support for Ollama, HuggingFace, NVIDIA NIM, OpenAI, and Anthropic models. (If you don't see all of these providers make sure you have all the environment variables set in the .env file!)
Wide Range of Language Models: Choose from models like Codestral, Mistral-Nemo, LLaMA3.1, DeepSeek Coder v2, Gemma2, and CodeGemma.
Dynamic Model Switching: Seamlessly switch between different language models.
Customizable Model Parameters: Adjust temperature, max tokens, top-p, and context window size.
Interactive Chat Interface: Easy-to-use chat interface for asking coding questions.
RAG-powered Responses: Utilizes uploaded documents or enter a GitHub repository to provide context-aware answers.
Chat With Files: Support for uploading additional context files.
Chat with a GitHub Repo: Support for using a GitHub repositories files as context for the model.
Chat With a Database: Support of connecting a new or existing database. (Coming Soon)
Custom Prompts: Ability to set custom system prompts for the chat engine.
Enhanced Memory Management: Dynamically manage chat memory for different models.
Streaming Responses: Real-time response generation for a more interactive experience.
Model Quantization: Options for 2-bit(Double 4 Bit Quant), 4-bit, and 8-bit quantization for HuggingFace models.
Parsing Advanced File Types: Parsing with Llama Parse for .pdf, .csv, .xlsx, .docx, .xml.

Setup and Usage

Clone the repository.
Install the required dependencies.

Set up your .env file with the following:

GRADIO_TEMP_DIR="YourPathTo/Chat-RAG/data"
GRADIO_WATCH_DIRS="YourPathTo/Chat-RAG"
HUGGINGFACE_HUB_TOKEN="YOUR HF TOKEN HERE"
NVIDIA_API_KEY="YOUR NVIDIA API KEY HERE"
OPENAI_API_KEY="YOUR OpenAI API KEY HERE"
ANTHROPIC_API_KEY="YOUR Anthropic API KEY HERE"
GITHUB_PAT="YOUR GITHUB PERSONAL ACCESS TOKEN HERE"
LLAMA_CLOUD_API_KEY="YOUR LLAMA_CLOUD_API_KEY"

Run the application:

gradio chatrag.py

or

python app.py

The app will automatically open a new tab and launch in your browser.
Select a Model Provider.
Select a language model from the dropdown menu.
(Optional) Upload relevant files for additional context.
Type your coding question in the text box and press enter.
The model will stream the response to your query back to you in the chat window.

Project Structure

app.py: If you don't want to run it in gradio live reload, use this file.
chatrag.py: Main application file with Gradio UI setup.
chat.py: Utilities for document loading and chat engine creation.
gr_utils.py: Gradio-specific utility functions for UI interactions.
model_utils.py: Model management and configuration utilities.
utils.py: General utilities for embedding, LLM setup, and chat memory.

Pictures

Start State of the App

Dropdown Menu in Action

Query Example

RAG Query Example

Contributing

Contributions are welcome! Please feel free to submit a Pull Request or Fork the Repository.

Coming in Future Updates

Video of the program in action.
Add the ability to load an existing Neo4j DB into the model
The ability to add models to the list for different model providers.

Need Help or Have Feature Suggestions?

Feel free to reach out to me through GitHub, LinkedIn, or through email. All of those are available on my website JFCoded.

Name		Name	Last commit message	Last commit date
Latest commit History 178 Commits
pics		pics
.env		.env
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
app.py		app.py
chat_utils.py		chat_utils.py
chatrag.py		chatrag.py
config.py		config.py
gradio_utils.py		gradio_utils.py
model_utils.py		model_utils.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Chat RAG: Interactive Coding Assistant

Overview

Features

Setup and Usage

Project Structure

Pictures

Start State of the App

Dropdown Menu in Action

Query Example

RAG Query Example

Contributing

Coming in Future Updates

Need Help or Have Feature Suggestions?

About

Releases

Packages

Languages

License

yougotlucky/Chat-RAG

Folders and files

Latest commit

History

Repository files navigation

Chat RAG: Interactive Coding Assistant

Overview

Features

Setup and Usage

Project Structure

Pictures

Start State of the App

Dropdown Menu in Action

Query Example

RAG Query Example

Contributing

Coming in Future Updates

Need Help or Have Feature Suggestions?

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages