Retrieval Augmented Generation for EIC based document retriever #6

karthik18495 · 2024-10-02T17:48:14Z

karthik18495
Oct 2, 2024
Maintainer

The Project

Welcome to the project where we aim to build a scalable RAG based document retriever for the upcoming Electron Ion Collider.

The project was first proposed at the AI4EIC workshop in 2023 held at Catholic University of America from November 29 to December 1 2023. The project evolved from there to a web application in early March 2024 with a proceeding (https://arxiv.org/abs/2403.15729)[here].

Retrieval Augmented Generation for EIC

This is a project that is currently being developed to build a RAG based system for the upcoming EIC.

There are three main parts to the RAG pipeline.

Ingestion

Ingestion in Retrieval-Augmented Generation (RAG) is a crucial process that involves the preparation and organization of data to be used by the model. This process can be broken down further into three main steps: chunking of information, embedding models, and storing it in a vector database.

Chunking
Encoding chunked information into a vector using a embedding model (e.g. BERT, seq2seq, text2vec)
Storing the encoded information in a vector database.

Chunking

This is the first step in the ingestion process. The raw data can come in various forms. which could be a large corpus of text, is divided into manageable chunks or segments. The size of these chunks can vary depending on the specific requirements of the task at hand. Chunking helps in reducing the complexity of the data and makes it easier for the model to process the information.

Retrieval

Content Fusion and Generation

Types of RAG system

A very recent survey paper. summarizes the types of RAG system¹. There are three types of RAG architecture broadly based on where LLM being used in the pipeline

Project Milestones

Building a Naive RAG for EIC using the 200 papers from arxiv on EIC. ✅
- Backend is a relatively straight forward RAG architecture. Where ingestion of data is done using PyPDF.
- Frontend is a simple web interface that allows for the user to upload a PDF and get back a list of papers that are relevant to the input.
- Report evaulated RAGAS metrics for the built architecture.
- Publish this in the proceeding for AI4EIC-2023. 🧑‍🏭
Going beyong Naive RAG. Towards building a RAG architecture with Testable Evaulation Metrics. 🧑‍🏭
- This requires going beyond
Multi modal output as a Proof of concept.
- Storing meta data information about table etc.
- Using Agents in Langchain to build a latex report.

References

Types of RAG ↩

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Retrieval Augmented Generation for EIC based document retriever #6

Uh oh!

{{title}}

Uh oh!

Replies: 0 comments

Select a reply

Uh oh!

Retrieval Augmented Generation for EIC based document retriever #6

Uh oh!

karthik18495 Oct 2, 2024 Maintainer

The Project

Retrieval Augmented Generation for EIC

Ingestion

Chunking

Retrieval

Content Fusion and Generation

Types of RAG system

Project Milestones

References

Footnotes

Replies: 0 comments

karthik18495
Oct 2, 2024
Maintainer