This project lets you run a fully local Retrieval-Augmented Generation (RAG) chatbot over your own PDFs or web content. Ask questions in natural language and get answers grounded in the actual contents of your documents.
It uses the following tools:
- LangChain for orchestration
- FAISS for semantic vector search
- Ollama to run open-source LLMs locally
- Streamlit for an easy-to-use chat interface
- 📄 Upload PDFs or URLs as your data source
- 🧠 Store document chunks as embeddings in a FAISS vector store
- 🔍 Retrieve relevant content using semantic similarity search
- 💬 Generate context-aware answers via local LLM
- 💻 All running 100% locally
Make sure you have Ollama installed and a compatible model (e.g. granite3.3) downloaded.
```bash
# Install Ollama
curl -fsSL https://ollama.com/install.sh | sh

ollama pull granite3.3
ollama run granite3.3
```
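To confirm the model is also reachable from Python (not just from the CLI), a quick check like the one below works. It assumes the `langchain-ollama` integration package is installed, which is a typical dependency for this stack:

```python
# Optional sanity check: ask the local model a question from Python.
# Assumes the langchain-ollama package is installed and the Ollama server is running.
from langchain_ollama import ChatOllama

llm = ChatOllama(model="granite3.3")  # talks to the local Ollama server
print(llm.invoke("Say hello in one sentence.").content)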
```bash
git clone https://github.com/yourname/rag-chatbot.git
cd rag-chatbot
```
This project uses Python 3.9+.
```bash
pip install -r requirements.txt
```
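The exact dependency pins live in `requirements.txt`; a stack like this typically pulls in packages along these lines (illustrative list, not the authoritative file):

```text
langchain
langchain-community
langchain-ollama
faiss-cpu
streamlit
pypdf
```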
```bash
streamlit run app.py
```
Once the app is running:
- Upload one or more PDF files in the sidebar.
- Ask natural-language questions about their content in the chat interface.
- The chatbot retrieves the relevant sections and answers using that context.
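Under the hood, the chat page follows the standard Streamlit chat pattern: replay the stored messages, read a new question, and append the model's answer. A simplified sketch, where `handle_question` is a stand-in for the RAG handler described below:

```python
import streamlit as st

def handle_question(question: str) -> str:
    """Placeholder for the RAG handler; in the real app this retrieves
    context from FAISS and calls the local LLM."""
    return f"(stub answer for: {question})"

# PDF upload lives in the sidebar, as in the real app.
uploaded_pdfs = st.sidebar.file_uploader("Upload PDFs", type="pdf", accept_multiple_files=True)

# Keep the running conversation in session state and replay it on each rerun.
if "messages" not in st.session_state:
    st.session_state.messages = []
for msg in st.session_state.messages:
    st.chat_message(msg["role"]).write(msg["content"])

# Read a new question, answer it, and record both turns.
if question := st.chat_input("Ask something about your documents"):
    st.chat_message("user").write(question)
    answer = handle_question(question)
    st.chat_message("assistant").write(answer)
    st.session_state.messages += [
        {"role": "user", "content": question},
        {"role": "assistant", "content": answer},
    ]
```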
- ChatUI: The user interface is built with Streamlit, using its built-in chat_message components to create a conversational layout. Users can upload documents in the sidebar and interact with the chatbot in real time.
- LLMRAGHandler: This is the main component that connects everything. It is implemented using LangChain and is responsible for managing the conversation flow, retrieving relevant context from the vector store, formatting prompts using a custom template, calling the LLM, and caching chat history (see the retrieval sketch below this list).
- Vector Store: Stores the documents as vector embeddings in FAISS, a high-speed similarity search library, and retrieves the relevant context for each query.
- LLM: The chatbot runs the Granite 3.3 model locally using Ollama. This means easy setup and prototyping, easy model switching, and full control over your data (everything stays local).
- Conversation Store: To make the chatbot stateful, we store the conversation history in a local file (e.g. JSON). This allows the chat to resume where you left off - even after refreshing the browser (see the persistence sketch below this list).
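To make the component descriptions above concrete, here is a minimal sketch of the ingest-retrieve-answer flow using the same building blocks (LangChain, FAISS, Ollama). The file name, prompt wording, and the `nomic-embed-text` embedding model are assumptions for illustration, not the project's exact code:

```python
# Minimal ingest -> retrieve -> answer sketch (illustrative, not the project's exact code).
from langchain_community.document_loaders import PyPDFLoader
from langchain_text_splitters import RecursiveCharacterTextSplitter
from langchain_community.vectorstores import FAISS
from langchain_ollama import OllamaEmbeddings, ChatOllama
from langchain_core.prompts import ChatPromptTemplate

# 1. Load and chunk the PDF.
docs = PyPDFLoader("my_document.pdf").load()
chunks = RecursiveCharacterTextSplitter(chunk_size=1000, chunk_overlap=200).split_documents(docs)

# 2. Embed the chunks and store them in FAISS.
#    "nomic-embed-text" is an assumed local embedding model (pull it with Ollama first).
embeddings = OllamaEmbeddings(model="nomic-embed-text")
vector_store = FAISS.from_documents(chunks, embeddings)

# 3. Retrieve the most relevant chunks for a question.
question = "What is the main conclusion of the report?"
context_docs = vector_store.similarity_search(question, k=4)
context = "\n\n".join(doc.page_content for doc in context_docs)

# 4. Format a prompt with the retrieved context and call the local LLM.
prompt = ChatPromptTemplate.from_template(
    "Answer the question using only the context below.\n\n"
    "Context:\n{context}\n\nQuestion: {question}"
)
llm = ChatOllama(model="granite3.3")
answer = llm.invoke(prompt.format_messages(context=context, question=question))
print(answer.content)
```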
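The conversation store needs nothing more elaborate than reading and writing a JSON file. A sketch of that idea (file name and message format are illustrative):

```python
# Sketch of persisting chat history to a local JSON file so the
# conversation survives a browser refresh (names are illustrative).
import json
from pathlib import Path

HISTORY_FILE = Path("chat_history.json")

def load_history() -> list[dict]:
    """Return the stored messages, or an empty history on first run."""
    if HISTORY_FILE.exists():
        return json.loads(HISTORY_FILE.read_text(encoding="utf-8"))
    return []

def save_history(messages: list[dict]) -> None:
    """Write the full message list back to disk after every turn."""
    HISTORY_FILE.write_text(json.dumps(messages, indent=2), encoding="utf-8")

# Example usage: append one exchange and persist it.
history = load_history()
history.append({"role": "user", "content": "What does chapter 2 say about costs?"})
history.append({"role": "assistant", "content": "…"})
save_history(history)
```

Running this load/save pair around each chat turn is what lets a browser refresh simply replay the stored messages.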
- Initial PDF parsing and embedding may take a few seconds for large files.
- Latency depends on the chosen LLM.
- Evaluation of answers is qualitative — no scoring function included.
- Runs only locally for easier development
- Use agentic RAG (history-aware retrievers, dynamic tool-calling)
- Tool Calling
- Other Data Sources (Google Drive, Notion, ...)
- Cloud deployment
- UI enhancements and document summarization
MIT License. See LICENSE for details.