An AI-powered application that enables users to ask natural language questions about annual reports and receive meaningful answers. Built for the AIS HackAI 2025 LTIMindtree Hackathon Challenge.
- PDF annual report ingestion and processing
- Natural language question answering
- Intelligent information extraction from financial documents
- Interactive web interface
- Support for tables, charts, and text analysis
- Clone this repository
- Install dependencies:
pip install -r requirements.txt- Create a
.envfile with your OpenAI API key:
OPENAI_API_KEY=your_api_key_here- Run the application:
streamlit run src/app.pysrc/: Source code directoryapp.py: Main Streamlit applicationpdf_processor.py: PDF processing utilitiesqa_chain.py: Question-answering chain implementationvector_store.py: Vector database management
data/: Directory for storing processed documentsrequirements.txt: Project dependencies
- Launch the application
- Upload an annual report PDF
- Wait for the document to be processed
- Ask questions in natural language about the report
- Receive AI-generated answers based on the report's content
- LangChain for AI orchestration
- OpenAI for natural language processing
- ChromaDB for vector storage
- Streamlit for web interface
- PyPDF2 and pdfplumber for PDF processing