Skip to content

ErenDurali/Agentic_PDF_RAG

Repository files navigation

Agentic RAG for PDF Processing

This project is an Agentic Retrieval-Augmented Generation (RAG) system that processes PDFs using Gemini as the model, LangChain for agent-based orchestration, and ChromaDB for vector storage and Streamlit for deploying an interactive web application. The project is built with GitHub Copilot assistance.

Features

  • Extracts relevant information from PDFs.
  • Uses LangChain to create an intelligent agent.
  • Stores and retrieves document embeddings with ChromaDB.
  • Employs Gemini for natural language understanding and response generation.
  • Provides an interactive web interface using Streamlit.

Tech Stack

  • Model: Gemini
  • Framework: LangChain
  • Database: ChromaDB
  • Web Application: Streamlit
  • Assistant: GitHub Copilot

Usage

  • Add PDFs to the data/ folder.
  • Run the script to process and query them.
  • Customize the agent’s behavior in agentic-rag.py.

References

This project was inspired and guided by the following resource:

  • An Improved Langchain RAG Tutorial (v2) by pixegami: This tutorial provided valuable insights into implementing a Retrieval-Augmented Generation system using LangChain and local LLMs.

About

Agentic RAG system that processes PDFs using Gemini, LangChain, and ChromaDB. Built with GitHub Copilot assistance.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages