Features

PDF Chat Application

Interact with your PDF files in a conversational way! This Streamlit-based application allows you to upload PDFs, process their content, and ask questions about the text extracted       from the documents. Using advanced AI models from Google Generative AI, this tool provides detailed, accurate, and context-aware responses.

Features

PDF Upload and Processing: Upload multiple PDF files. Extract text from PDFs efficiently using PyPDF2.
Text Chunking: Split large text into manageable chunks for better context handling.
AI-Powered Q&A: Leverage Google Generative AI (gemini-pro) for question-answering. Responses are generated based on the uploaded PDF content.
Vector Embedding and Search: Use FAISS to create and store vector embeddings for similarity-based context retrieval.
Streamlit Interface: User-friendly interface for PDF upload, processing, and querying.

How It Works

Upload PDFs: Drag and drop one or more PDF files into the app.
Text Extraction: Text is extracted from the uploaded PDFs using PyPDF2.
Text Chunking: The extracted text is split into smaller chunks using LangChain's
RecursiveCharacterTextSplitter to ensure efficient context handling.
Vector Store Creation: Text chunks are converted into vector embeddings using Google Generative AI. These embeddings are stored in a FAISS index for similarity search.
Question-Answering: Users type a question based on the content of the uploaded PDFs. Relevant chunks are retrieved using similarity search and passed to Google's Generative AI model to generate a detailed response.
Response Display: The app displays the AI's response in a clear and concise manner.

Tech Stack

Frontend: Streamlit: Interactive user interface for PDF upload and Q&A.
Backend: LangChain: Text processing, embedding generation, and conversational chain
setup. PyPDF2: Extract text from PDF files. FAISS: Efficient similarity search for vector embeddings. Google Generative AI: AI models for embedding and conversational tasks.
Utilities: dotenv: Manage environment variables securely.

Setup Instructions

Prerequisites 1. Python 3.8 or higher. 2. API Key for Google Generative AI. 3. Basic knowledge of Python and Streamlit.

Installation

Clone the repository:

git clone https://github.com/your-username/pdf-chat-app.git
cd pdf-chat-app

Create a virtual environment:

python -m venv venv
source venv/bin/activate  # For Linux/Mac
venv\Scripts\activate     # For Windows

Install dependencies:
```
pip install -r requirements.txt
```
Set up environment variables:

Create a .env file in the project root and add your Google API key:
```
GOOGLE_API_KEY=your-google-api-key
```
Run the application:
```
streamlit run app.py
```
Open your browser and navigate to the local URL displayed by Streamlit.

Project Structure

```bash
📂 pdf-chat-app
├── app.py                  # Main application script
├── requirements.txt        # Python dependencies
├── .env                    # Environment variables file
├── README.md               # Project documentation
└── vector_index/           # Saved FAISS index files
```

Usage

Upload PDFs: Use the sidebar to upload one or more PDFs. Click "Process PDFs" to extract and index the content.
Ask Questions: Enter your question in the text input box. The AI will generate a response based on the PDF content.
View Responses: The response is displayed on the main page below the input box.
Future Enhancements Add support for more file types (e.g., Word, Excel). Implement caching for processed PDFs to avoid reprocessing. Integrate support for multiple language queries. Add options for summarizing entire PDFs.

Contributing

Contributions are welcome! If you'd like to contribute, please fork the repository, create a new branch, and submit a pull request. Ensure your code adheres to the project's style       and guidelines.

License

This project is licensed under the MIT License. See the LICENSE file for more details.

Acknowledgements

1. Streamlit for an easy-to-use web app framework.
2. Google Generative AI for powerful AI embeddings and models.
3. LangChain for simplifying AI integration.
4. FAISS for efficient similarity search.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

PDF Chat Application

Features

How It Works

Tech Stack

Setup Instructions

Installation

Project Structure

Usage

Contributing

License

Acknowledgements

About

Uh oh!

Releases

Packages

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 164 Commits
venv		venv
.env		.env
LICENSE		LICENSE
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt

License

piyush06singhal/PDF_chat_application

Folders and files

Latest commit

History

Repository files navigation

PDF Chat Application

Features

How It Works

Tech Stack

Setup Instructions

Installation

Project Structure

Usage

Contributing

License

Acknowledgements

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages