We built a RAG system that runs locally on the CPU in a fully offline mode. It uses open-source large language models to perform retrieval-augmented generation over your documents.
- FAISS (Facebook AI Similarity Search): fast, efficient, and scalable vector search for document retrieval.
- BAAI/bge-reranker-base: a reranking model that reorders the retrieved results so the most relevant and accurate passages are returned.
- Minimal CPU and RAM usage
- Runs locally, even in a fully offline environment (for PDFs and other documents)
- Uses a highly efficient, quantized model
- Multilingual support for over 29 languages, including Chinese
- Fast inference
- Intuitive UI
- Add new documents to the system without a complete reindexing pass, so new knowledge is integrated dynamically and flexibly (see the sketch after this list).
- Built with a focus on minimizing memory usage: the system relies on lightweight retrieval structures such as FAISS (or alternatives like inverted indices) to manage large document collections without excessive memory consumption.
- Low Latency
- Total memory usage: 338 MB (model) + 121 MB (embeddings)
- The 1.1 GB reranking model is loaded lazily: only when it is actually needed, and only once per session.
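Adding new documents without reindexing (mentioned above) amounts to another `add` call on the existing FAISS index; nothing is rebuilt. A minimal sketch under the same assumptions as the earlier snippet, reusing its hypothetical `embedder`, `index`, and `docs` objects:

```python
import numpy as np  # embedder, index, and docs carry over from the previous sketch


def add_documents(new_docs, embedder, index, docs):
    """Embed new documents and append them to the existing FAISS index
    without rebuilding it; `docs` keeps the id -> text mapping in sync."""
    vecs = embedder.encode(new_docs, normalize_embeddings=True)
    index.add(np.asarray(vecs, dtype="float32"))  # incremental add, no reindexing
    docs.extend(new_docs)


add_documents(["A newly uploaded PDF page about GPUs."], embedder, index, docs)
print(index.ntotal)  # total number of vectors now in the index
```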
NVIDIA GPUs with compute capability 5.0+ can be used for acceleration, because the system runs its models through Ollama, and that is the minimum GPU compute capability Ollama supports.
> git clone https://github.com/ParamThakkar123/Secure-Local-Offline-Rag-System.git
> cd Secure-Local-Offline-Rag-System
> pip install -r requirements.txt
Download the Ollama app and run it
> ollama pull qwen2:0.5b-instruct-q3_K_S
> ollama pull nextfire/paraphrase-multilingual-minilm:l12-v2
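These two models cover embeddings and generation respectively. A hedged sketch of how they might be called from Python, assuming the `ollama` client package (`pip install ollama`) and that the Ollama app is running; this is illustrative, not the app's actual code:

```python
import ollama

# Embedding model pulled above: turns a chunk of text into a vector
# that can be stored in the FAISS index.
emb = ollama.embeddings(
    model="nextfire/paraphrase-multilingual-minilm:l12-v2",
    prompt="FAISS stores document vectors for similarity search.",
)
print(len(emb["embedding"]))  # embedding dimensionality

# Generation model pulled above: answers the question given retrieved context.
reply = ollama.chat(
    model="qwen2:0.5b-instruct-q3_K_S",
    messages=[{
        "role": "user",
        "content": "Using this context: <retrieved chunks>, answer: What does FAISS do?",
    }],
)
print(reply["message"]["content"])
```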
> streamlit run app.py

Alternatively, if the `streamlit` command is not on your PATH:

> python -m streamlit run app.py