This repository contains an example of building a Retrieval-Augmented Generation (RAG) pipeline using Haystack and Ollama. The pipeline retrieves relevant documents from a dataset and uses a local LLM (provided by Ollama) to generate context-aware answers without relying on external APIs.
- RAG Approach: Combines an LLM with retrieved documents to produce more accurate, contextually grounded answers.
- Local Inference with Ollama: Run models locally to avoid expensive API calls and maintain privacy.
- Simple Setup: Uses an in-memory DocumentStore and Sentence Transformers for embedding queries and documents.
- Adaptable: Easily switch out datasets, models, or vector databases as needed.
- Python 3.10+
- Pip and basic knowledge of Python environments
- Ollama installed and running locally. For installation instructions, see the Ollama documentation.
- Clone the Repository:

  ```bash
  git clone https://github.com/shreyashag/simple-qa-rag-with-haystack.git
  cd simple-qa-rag-with-haystack
  ```
- Create a Virtual Environment (Recommended):

  ```bash
  python3 -m venv venv
  source venv/bin/activate
  ```

  On Windows:

  ```bash
  python -m venv venv
  venv\Scripts\activate
  ```
- Install Dependencies:

  ```bash
  pip install --upgrade pip
  pip install -r requirements.txt
  ```
- Set Environment Variables: Create a `.env` file in the project root, following `.env.sample` for the expected keys:

  ```bash
  echo 'export OLLAMA_ENDPOINT="http://localhost:11434"' >> .env
  echo 'export OLLAMA_MODEL="qwen2.5-coder"' >> .env
  ```

  Make sure Ollama is running. For details, see the Ollama documentation.
- Run the Script:

  ```bash
  source .env && python qa_pipeline_with_retrieval_augmentation.py
  ```

  You should see answers printed to the console for the sample questions.
- Data Loading:
  The code fetches the "Seven Wonders of the Ancient World" dataset from Hugging Face and wraps each entry in a Haystack `Document` object.
- Embedding and Indexing:
  Using `SentenceTransformersDocumentEmbedder`, we create embeddings for each document and store them in an `InMemoryDocumentStore`. A sketch of this indexing step appears after this list.
- Pipeline Construction:
  - TextEmbedder: Converts user queries into embeddings.
  - Retriever: Uses the query embedding to find relevant documents.
  - PromptBuilder: Constructs a prompt that includes both the user's query and the retrieved documents.
  - OllamaGenerator: Passes the prompt to Ollama's locally running LLM to generate the final answer.
- Querying:
  Feeding a question through the pipeline returns a fact-based answer grounded in the retrieved documents. A sketch of the query pipeline also appears below.
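As a rough illustration of the data loading and indexing steps above, here is what they typically look like with Haystack 2.x. This is a sketch, not the repository's exact code: the dataset ID (`bilgeyucel/seven-wonders`, the one used in the Haystack tutorials) and the embedding model name are assumptions.

```python
# Indexing sketch: load the dataset, wrap each row in a Haystack Document,
# embed the documents, and write them to the in-memory store.
from datasets import load_dataset
from haystack import Document
from haystack.components.embedders import SentenceTransformersDocumentEmbedder
from haystack.document_stores.in_memory import InMemoryDocumentStore

# Assumed Hugging Face dataset ID for the Seven Wonders data.
dataset = load_dataset("bilgeyucel/seven-wonders", split="train")
docs = [Document(content=row["content"], meta=row["meta"]) for row in dataset]

document_store = InMemoryDocumentStore()

# Assumed embedding model; the repository may configure a different one.
doc_embedder = SentenceTransformersDocumentEmbedder(
    model="sentence-transformers/all-MiniLM-L6-v2"
)
doc_embedder.warm_up()

docs_with_embeddings = doc_embedder.run(docs)["documents"]
document_store.write_documents(docs_with_embeddings)
```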
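A comparable sketch of the query-side pipeline, continuing from the indexing sketch above (it reuses `document_store`). The component names, prompt template, and sample question are illustrative; the Ollama model and endpoint are read from the environment variables set up earlier.

```python
import os

from haystack import Pipeline
from haystack.components.builders import PromptBuilder
from haystack.components.embedders import SentenceTransformersTextEmbedder
from haystack.components.retrievers.in_memory import InMemoryEmbeddingRetriever
from haystack_integrations.components.generators.ollama import OllamaGenerator

# Illustrative Jinja2 template: PromptBuilder fills in the retrieved
# documents and the user's question.
template = """
Answer the question using only the context below.

Context:
{% for document in documents %}
{{ document.content }}
{% endfor %}

Question: {{ question }}
Answer:
"""

rag = Pipeline()
rag.add_component("text_embedder", SentenceTransformersTextEmbedder())
rag.add_component("retriever", InMemoryEmbeddingRetriever(document_store))
rag.add_component("prompt_builder", PromptBuilder(template=template))
rag.add_component(
    "generator",
    OllamaGenerator(
        model=os.environ.get("OLLAMA_MODEL", "qwen2.5-coder"),
        url=os.environ.get("OLLAMA_ENDPOINT", "http://localhost:11434"),
    ),
)

# Wire the components: query embedding -> retrieval -> prompt -> generation.
rag.connect("text_embedder.embedding", "retriever.query_embedding")
rag.connect("retriever.documents", "prompt_builder.documents")
rag.connect("prompt_builder.prompt", "generator.prompt")

question = "What does the Colossus of Rhodes look like?"  # sample question
result = rag.run(
    {"text_embedder": {"text": question}, "prompt_builder": {"question": question}}
)
print(result["generator"]["replies"][0])
```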
- Change the Model: Update `OLLAMA_MODEL` in the `.env` file to try different models.
- Use Another Document Store: Replace `InMemoryDocumentStore` with a vector database such as FAISS or Weaviate for larger-scale projects.
- Adjust Prompts: Modify the prompt template in the code to alter the behavior and style of generated responses; a hypothetical variant is sketched below.
- Ensure Ollama is running and accessible at the URL in `.env` (a quick connectivity check is sketched below).
- Make sure all Python dependencies are installed and that you're using a supported Python version.
- If embedding fails, confirm that the `sentence-transformers` models are downloading correctly and that your internet connection is stable.
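As a quick sanity check, the snippet below is a sketch that assumes `OLLAMA_ENDPOINT` points at Ollama's base URL, which normally answers plain HTTP GET requests.

```python
# Check whether the Ollama endpoint configured in .env is reachable.
import os
import urllib.request

endpoint = os.environ.get("OLLAMA_ENDPOINT", "http://localhost:11434")
try:
    with urllib.request.urlopen(endpoint, timeout=5) as response:
        print(f"Ollama reachable at {endpoint}: HTTP {response.status}")
except OSError as exc:
    print(f"Could not reach Ollama at {endpoint}: {exc}")
```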
This project is licensed under the MIT License - see the LICENSE file for details.