This repository contains a Proof of Concept (PoC) for a Retrieval-Augmented Generation (RAG) application designed to answer user questions about their codebase by combining vector-based document retrieval with large language models (LLMs). It provides the backend and UI for users to ask questions and receive responses in real time.
The system consists of two main parts:
- Embedding vector creation and indexing
- Question-Answering (QA) system
- Vector Database: A Qdrant vector database that indexes the embeddings generated for each file in the code base.
- Embeddings Model: The pre-trained `BAAI/bge-large-en-v1.5` model, run locally via the Hugging Face `SentenceTransformers` library, generates embeddings for each file in the code base.
- Langchain: Langchain is used exclusively for embedding vector creation, indexing, and vector retrieval. It is not used in the QA system.
- LLM Model: The `OpenAI/gpt-4o` model generates responses to user questions.
- FastAPI: FastAPI provides a REST API for the QA system. It uses chunk streaming to deliver responses to the user in real time.
- React: React is used to build the web interface that allows users to interact with the system.
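At the core of the retrieval component is nearest-neighbor search over embedding vectors. The sketch below illustrates the idea with plain cosine similarity over a tiny in-memory index; the file names and 2-dimensional vectors are made up for illustration, and in the real system Qdrant performs this search over `bge-large-en-v1.5` embeddings.

```python
import math

def cosine(a, b):
    # Cosine similarity: dot product divided by the product of the norms.
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb)

def top_k(query_vec, index, k=2):
    # index: list of (doc_id, vector) pairs; return the k closest doc ids.
    scored = sorted(index, key=lambda p: cosine(query_vec, p[1]), reverse=True)
    return [doc_id for doc_id, _ in scored[:k]]

# Toy index: each entry stands in for one embedded file.
index = [("readme.md", [1.0, 0.0]), ("qa.py", [0.8, 0.6]), ("ui.tsx", [0.0, 1.0])]
print(top_k([1.0, 0.1], index, k=2))  # → ['readme.md', 'qa.py']
```

A vector database like Qdrant does the same ranking, but with approximate-nearest-neighbor structures so it scales far beyond a linear scan.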
The embedding vector creation and indexing component is responsible for creating embeddings for each file in the code base and indexing them in a vector store. The vector store is used by the QA system to retrieve relevant documents based on the user's question.
```mermaid
---
title: Embedding vector creation and indexing
---
flowchart LR
    B[Load Files from Filesystem] --> C[Parse Files with LanguageParser]
    C --> D[Generate Embeddings using Embedding Model]
    D --> E[Add Embeddings to VectorStore]
```
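The flow above can be sketched in a few lines. This is an illustrative stand-in, not the repository's actual code: `chunk_text` is a naive fixed-size splitter where the real pipeline uses Langchain's `LanguageParser`, and `embed` is a toy character-histogram vector where the real pipeline calls `bge-large-en-v1.5`.

```python
from pathlib import Path

def chunk_text(text, size=400, overlap=50):
    # Naive fixed-size chunking with overlap; the real pipeline splits
    # files along syntactic boundaries via LanguageParser instead.
    chunks, start = [], 0
    while start < len(text):
        chunks.append(text[start:start + size])
        start += size - overlap
    return chunks

def embed(text, dim=8):
    # Toy embedding (character histogram) standing in for the real model.
    vec = [0.0] * dim
    for ch in text:
        vec[ord(ch) % dim] += 1.0
    return vec

def index_repo(root, store):
    # store: list of (source_path, chunk, vector) records, mimicking
    # the payload + vector that Qdrant keeps per indexed chunk.
    for path in Path(root).rglob("*.py"):
        text = path.read_text(errors="ignore")
        for chunk in chunk_text(text):
            store.append((str(path), chunk, embed(chunk)))
```

The overlap between consecutive chunks is a common trick so that a statement split at a chunk boundary still appears whole in at least one chunk.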
The QA system is responsible for answering user questions based on their code base. It retrieves relevant documents from the vector store, generates a response using the LLM model, and streams the response back to the user in real time.
```mermaid
---
title: Question-Answering (QA) system
---
sequenceDiagram
    User ->> QA Class: Ask a question
    QA Class ->> Vector Store: Retrieve relevant documents
    activate Vector Store
    Vector Store -->> QA Class: Return relevant documents
    deactivate Vector Store
    QA Class ->> History: Add user message to the history
    QA Class ->> History: Add relevant documents to the history
    QA Class ->> LLM Model: Generate response using LLM Model
    activate LLM Model
    loop Response Stream
        LLM Model -->> QA Class: Stream response chunks
        QA Class -->> User: Provide response
    end
    deactivate LLM Model
    QA Class ->> History: Add response to the history
```
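The sequence above — record the question and retrieved documents in the history, stream the answer chunk by chunk, then record the full answer — can be mimicked with a plain Python generator. This is a hypothetical sketch: the hard-coded `answer` string stands in for the `gpt-4o` call, and the `history` shape is assumed, not taken from the repo.

```python
def stream_answer(question, docs, history):
    # Record the user question and the retrieved documents first,
    # mirroring the two "Add ... to the history" steps in the diagram.
    history.append({"role": "user", "content": question})
    history.append({"role": "system", "content": "\n".join(docs)})
    answer = f"Based on {len(docs)} documents: ..."  # stand-in for the LLM call
    buffer = []
    for token in answer.split():
        buffer.append(token)
        yield token + " "  # each yield is one chunk streamed to the client
    # Only after the stream completes is the full answer stored.
    history.append({"role": "assistant", "content": " ".join(buffer)})

history = []
chunks = list(stream_answer("What is a chain?", ["doc1", "doc2"], history))
print("".join(chunks))  # the full answer, reassembled from streamed chunks
```

FastAPI can serve such a generator directly through its streaming response support, which is how the backend delivers chunks to the React UI as they arrive.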
The repo comes with https://github.com/langchain-ai/langchain already parsed and indexed in the vector store. You can run the system using the following steps:
Create a `.env.docker` file in the `./backend` directory with the following environment variables:

```
OPENAI_API_KEY=xxxxxxxx
QDRANT_URL=http://talk2code-qdrant-1:6333 # URL of the Qdrant vector store for the docker container
```

Run the following command to start the system:

```shell
docker compose up --build -d
```

> **Note**
> When you first run the backend container, it downloads the embedding model. This process may take some time depending on your internet connection.
Open your browser and go to http://localhost:8080 to access the web interface.
Ask a question related to langchain and see the system in action!