nsrCodes / local-RAG-poc


Just an attempt to create a local RAG system that makes it easier to understand complex documents.

Run:

  • Clone the code and install the dependencies specified in requirements.txt
  • Add a PDF that you want to chat with
  • Have Ollama running (preferably with the specified Qwen model for the best results)
  • Run `python conversation.py <path to pdf>`

Specifications:

  • embedding model: mxbai-embed-large
  • inference model: qwen2.5:7b-instruct-q8_0 (also tried mixtral large, phi3, and llama3 at q4 and q8, wherever available; the Qwen models felt the best)
  • the models are served through Ollama (running them directly from code while keeping them easy to swap is hard at this POC stage; might pick that up later)
  • mostly built with LangChain (no strong preference, just the first framework that got suggested); a rough sketch of such a pipeline is shown below
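
For reference, here is a minimal sketch of what a pipeline with these specifications could look like. It is not the actual conversation.py: the choice of PyPDFLoader, Chroma, and RetrievalQA is an assumption (which pulls in the pypdf and chromadb packages), and the exact module paths depend on your LangChain version.

```python
# Minimal sketch of a LangChain + Ollama RAG pipeline with the models listed
# above. Assumes Ollama is running locally and that pypdf/chromadb are
# installed; the real conversation.py may be structured differently.
import sys

from langchain.chains import RetrievalQA
from langchain_community.chat_models import ChatOllama
from langchain_community.document_loaders import PyPDFLoader
from langchain_community.embeddings import OllamaEmbeddings
from langchain_community.vectorstores import Chroma
from langchain_text_splitters import RecursiveCharacterTextSplitter

pdf_path = sys.argv[1]

# Load the PDF and split it into overlapping chunks for embedding.
docs = PyPDFLoader(pdf_path).load()
chunks = RecursiveCharacterTextSplitter(chunk_size=1000, chunk_overlap=100).split_documents(docs)

# Both models are served by the locally running Ollama instance.
embeddings = OllamaEmbeddings(model="mxbai-embed-large")
llm = ChatOllama(model="qwen2.5:7b-instruct-q8_0")

# Embed the chunks and expose the vector store as a retriever.
store = Chroma.from_documents(chunks, embeddings)
qa = RetrievalQA.from_chain_type(llm=llm, retriever=store.as_retriever())

print(qa.invoke({"query": "Summarise this document in three sentences."})["result"])
```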

Features:

  • keeps the chat history as context for follow-up questions
  • does not re-embed a document if its embeddings already exist (see the sketch below)
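
A hypothetical sketch of how these two features could be wired up: a persisted Chroma index keyed on a hash of the PDF (so an already-embedded document is just reopened), plus LangChain's ConversationBufferMemory to carry the chat history. The hashing scheme, directory layout, and chain classes are illustrative assumptions, not necessarily what conversation.py does.

```python
# Illustrative sketch of the two features above; assumptions as noted in the text.
import hashlib
import os
import sys

from langchain.chains import ConversationalRetrievalChain
from langchain.memory import ConversationBufferMemory
from langchain_community.chat_models import ChatOllama
from langchain_community.document_loaders import PyPDFLoader
from langchain_community.embeddings import OllamaEmbeddings
from langchain_community.vectorstores import Chroma
from langchain_text_splitters import RecursiveCharacterTextSplitter

pdf_path = sys.argv[1]
embeddings = OllamaEmbeddings(model="mxbai-embed-large")
llm = ChatOllama(model="qwen2.5:7b-instruct-q8_0")

# "Does not re-embed": key the persisted index on a hash of the PDF bytes and
# only build it when that directory does not exist yet.
with open(pdf_path, "rb") as f:
    doc_id = hashlib.sha256(f.read()).hexdigest()[:16]
persist_dir = os.path.join(".chroma", doc_id)

if os.path.isdir(persist_dir):
    # Embeddings for this document already exist on disk -- just reopen the store.
    store = Chroma(persist_directory=persist_dir, embedding_function=embeddings)
else:
    docs = PyPDFLoader(pdf_path).load()
    chunks = RecursiveCharacterTextSplitter(chunk_size=1000, chunk_overlap=100).split_documents(docs)
    store = Chroma.from_documents(chunks, embeddings, persist_directory=persist_dir)

# "Keeps chat history": the memory object replays previous turns into the chain.
memory = ConversationBufferMemory(memory_key="chat_history", return_messages=True)
chain = ConversationalRetrievalChain.from_llm(llm, store.as_retriever(), memory=memory)

while True:
    question = input("you> ")
    print(chain.invoke({"question": question})["answer"])
```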

requirements.txt was generated with `pip3 freeze > requirements.txt`.