AI-Powered Document Intelligence System 📄🤖

Welcome to the AI-Powered Document Intelligence System! This project is a beginner-friendly, full-stack application that uses AI to automatically extract key information from documents like invoices, receipts, contracts, and more. It combines modern web technologies with AI APIs to create a powerful document processing tool. 🚀

📖 Project Overview

Imagine uploading a document (PDF, JPG, or PNG) and having an AI magically extract details like company names, dates, amounts, addresses, and more, presenting them in a neat table or downloadable CSV. That's what this system does! It's perfect for automating data entry tasks and learning about AI integration, full-stack development, and API usage.

Key Features

📤 Document Upload: Upload images or PDFs through a drag-and-drop interface.
🔍 Text Extraction: Uses OCR (Optical Character Recognition) to read text from documents.
🧠 AI Analysis: Leverages OpenAI to extract structured data (e.g., company names, dates, amounts).
📊 Results Display: Presents extracted information in a user-friendly table.
⬇️ Downloadable Results: Export results as a CSV file.
🌐 Responsive UI: Clean and modern frontend built with HTML, CSS, and JavaScript.

Demo Screenshots

Upload Interface	Processing Display

🛠️ Technologies Used

Frontend: HTML, CSS, JavaScript (Vanilla JS)
Backend: Python, FastAPI, Uvicorn
AI & OCR APIs:
- OpenAI for intelligent data extraction
- OCR.Space for text extraction from images
Libraries:
- Flask, FastAPI, Uvicorn, Requests, Pandas, Python-Dotenv, Pillow
Environment: Python 3.9+, VS Code/PyCharm

🚀 Getting Started

Follow these steps to set up and run the project locally.

Prerequisites

Python 3.9+: Download from python.org.
VS Code: Recommended code editor, download from code.visualstudio.com.
API Keys:
- OpenAI API Key (free $5 credit for new users).
- OCR.Space API Key (free 25,000 requests/month).

Installation

Clone the Repository:

git clone https://github.com/kenreddy23/document-intelligence-system.git
cd document-intelligence-system

Set Up a Virtual Environment (optional but recommended):

python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

Install Required Libraries:
```
pip install requirements.txt
```
Configure Environment Variables: Create a .env file in the root directory and add your API keys:
```
OPENAI_API_KEY=your_openai_api_key_here
OCR_SPACE_API_KEY=your_ocr_space_api_key_here
```
Important: Add .env to your .gitignore to keep API keys secure.

Project Structure:

document-intelligence/
├── backend/
│   ├── main.py
│   ├── document_processor.py
│   └── requirements.txt
├── frontend/
│   ├── index.html
│   ├── style.css
│   └── script.js
├── uploads/
├── results/
├── .env
└── README.md

Running the Application

Start the Backend:

cd document-intelligence
python -m uvicorn backend.main:app --reload --port 8000

Access the Frontend: Open your browser and go to http://localhost:8000/static/index.html.
Test the System:
- Upload a document (e.g., invoice, receipt, or business card).
- Watch the AI extract key information and display it in a table.
- Download the results as a CSV file.

🧪 Testing

Test the system with various document types:

Invoice: Should extract company name, date, total amount.
Receipt: Should extract store name, items, total.
Business Card: Should extract name, phone, email.
Contract: Should extract parties, dates, key terms.

🌍 Deployment

To make your application live, you can deploy it to platforms like:

Option 1: Railway (Recommended)

Push your code to a GitHub repository.
Connect Railway to your GitHub repo.
Add environment variables (OPENAI_API_KEY, OCR_SPACE_API_KEY) in the Railway dashboard.
Deploy automatically.

Option 2: Heroku

Create a Procfile in the root directory:

web: uvicorn backend.main:app --host=0.0.0.0 --port=${PORT:-5000}

Push to Heroku.
Add environment variables in the Heroku dashboard.

🔮 Next Steps & Enhancements

Here are some ideas to take the project further:

🌐 Multi-Language Support: Add support for different OCR languages.
📂 Batch Processing: Allow uploading multiple documents at once.
🛠️ Custom Templates: Create extraction rules for specific document types.
💾 Database Integration: Store processed documents and results.
🔒 User Authentication: Add a login system.
⚡ Performance Improvements:
- Caching to avoid reprocessing.
- Queue system for handling multiple uploads.
- Image preprocessing for better OCR accuracy.

📚 Learning Outcomes

By building this project, you'll gain skills in:

API Integration: Working with OpenAI and OCR.Space APIs.
Full-Stack Development: Building frontend and backend components.
AI Prompt Engineering: Crafting effective prompts for AI.
User Experience Design: Creating intuitive interfaces.
Error Handling: Managing API errors and edge cases.
Data Processing: Structuring and exporting data.

🤝 Contributing

Contributions are welcome! To contribute:

Fork the repository.
Create a new branch (git checkout -b feature/your-feature).
Make your changes and commit (git commit -m 'Add your feature').
Push to the branch (git push origin feature/your-feature).
Open a Pull Request.

Please ensure your code follows the project's style guidelines and includes tests where applicable.

📜 License

This project is licensed under the Apache 2.0. See the LICENSE file for details.

🙌 Acknowledgments

OpenAI for their powerful AI models.
OCR.Space for free OCR services.
The open-source community for providing amazing tools and libraries.

⭐ Star this repository if you found it helpful!
Feel free to reach out with questions or suggestions. Happy coding! 😄

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

AI-Powered Document Intelligence System 📄🤖

📖 Project Overview

Key Features

Demo Screenshots

🛠️ Technologies Used

🚀 Getting Started

Prerequisites

Installation

Running the Application

🧪 Testing

🌍 Deployment

Option 1: Railway (Recommended)

Option 2: Heroku

🔮 Next Steps & Enhancements

📚 Learning Outcomes

🤝 Contributing

📜 License

🙌 Acknowledgments

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
backend		backend
frontend		frontend
LICENSE		LICENSE
README.md		README.md
example_env_file.txt		example_env_file.txt
requirements.txt		requirements.txt

License

kenreddy23/document-intelligence-system

Folders and files

Latest commit

History

Repository files navigation

AI-Powered Document Intelligence System 📄🤖

📖 Project Overview

Key Features

Demo Screenshots

🛠️ Technologies Used

🚀 Getting Started

Prerequisites

Installation

Running the Application

🧪 Testing

🌍 Deployment

Option 1: Railway (Recommended)

Option 2: Heroku

🔮 Next Steps & Enhancements

📚 Learning Outcomes

🤝 Contributing

📜 License

🙌 Acknowledgments

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages