Healthcare Chatbot: Fine-Tuning Llama for Medical QA

This repository contains a pipeline for fine-tuning Large Language Models (LLMs) for specialized healthcare applications. The project adapts a base Llama model into a medical assistant that classifies patient intent and answers clinical queries.

Project Overview

The core objective is to build a training and evaluation pipeline for a healthcare chatbot that hospitals use for patient onboarding. Fine-tuning Llama models on domain-specific data narrows the gap between general-purpose language understanding and specialized medical knowledge.

Key Features

Medical Data Engineering: Automated pipeline to load and preprocess the MedQuad-MedicalQnADataset.
Supervised Fine-Tuning (SFT): Uses Hugging Face’s SFTTrainer (from the TRL library) to run fine-tuning loops.
Recipe-Based Configuration: Uses TorchTune for recipe-based task configuration and data preparation.
Memory Efficiency: Applies quantization and parameter-efficient techniques such as LoRA to run large models on consumer-grade hardware.
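To make the parameter-efficiency point concrete, the sketch below counts how many parameters are trained when one weight matrix is adapted with LoRA instead of being updated in full. The layer dimensions and rank are illustrative assumptions, not values taken from this repository's configuration.

```python
def full_params(d_out: int, d_in: int) -> int:
    """Parameters updated when the whole d_out x d_in weight matrix is trained."""
    return d_out * d_in


def lora_trainable_params(d_out: int, d_in: int, rank: int) -> int:
    """Parameters in the two low-rank factors: B (d_out x rank) and A (rank x d_in)."""
    return rank * (d_out + d_in)


# Hypothetical projection layer and LoRA rank, chosen only for illustration.
d_out, d_in, rank = 4096, 4096, 8

full = full_params(d_out, d_in)
lora = lora_trainable_params(d_out, d_in, rank)

print(f"Full fine-tuning: {full:,} parameters")
print(f"LoRA (rank {rank}): {lora:,} parameters ({lora / full:.2%} of full)")
```

Because the adapter size grows with the rank rather than with the product of the layer dimensions, a small rank keeps the trainable footprint to a small fraction of the layer.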

Tech Stack

Language: Python

Core Libraries: torch, torchtune, transformers (Hugging Face)

Data Handling: datasets, pandas, PyYAML

Optimization: bitsandbytes (for 8-bit quantization)
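As a rough guide to why 8-bit quantization matters on consumer-grade hardware, the sketch below estimates the memory needed to hold model weights at different precisions. The 7-billion-parameter size is an assumed example, and the estimate covers weights only: activations, optimizer state, and quantization bookkeeping are not included.

```python
# Storage cost per parameter for each precision.
BYTES_PER_PARAM = {"float32": 4.0, "float16": 2.0, "int8": 1.0}


def weight_memory_gb(num_params: float, dtype: str) -> float:
    """Approximate weight storage in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


# Assumed model size, for illustration only.
num_params = 7e9

for dtype in BYTES_PER_PARAM:
    print(f"{dtype:>8}: {weight_memory_gb(num_params, dtype):.1f} GB")
```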

Dataset: MedQuad

The model is trained on the MedQuad-MedicalQnADataset, a collection of medical question-answer pairs. The pipeline includes:

Intent Classification: Categorizing patient queries to ensure accurate routing.

Prompt Engineering: Formatting raw medical data into instruction-based prompts for the LLM.
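The prompt-formatting step can be sketched as follows. This is a minimal illustration, not the template used in the notebook: the field names `question` and `answer`, the instruction wording, and the section markers are all assumptions.

```python
# Hypothetical instruction-style template; the notebook's actual format may differ.
PROMPT_TEMPLATE = (
    "### Instruction:\n"
    "Answer the patient's medical question clearly and accurately.\n\n"
    "### Question:\n{question}\n\n"
    "### Answer:\n{answer}"
)


def format_example(record: dict) -> str:
    """Turn one raw question-answer record into a single training prompt."""
    question = record["question"].strip()  # assumed field name
    answer = record["answer"].strip()      # assumed field name
    return PROMPT_TEMPLATE.format(question=question, answer=answer)


# Placeholder record; real rows would come from the loaded dataset.
sample = {
    "question": "  <patient question text>  ",
    "answer": "<reference answer text from the dataset>",
}
print(format_example(sample))
```

Keeping the template in one constant means the same format can be reused at inference time by leaving the answer section empty.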

Setup & Installation

Environment Configuration: A Conda environment is recommended for managing the specific versions of Torch and Transformers required:

git clone https://github.com/Joe-Naz01/SFTT_Trainer.git
cd SFTT_Trainer

conda create -n llama_ft python=3.10 -y
conda activate llama_ft
pip install -r requirements.txt
jupyter notebook
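After installation, a quick check like the one below can confirm that the libraries named in the Tech Stack are importable before opening the notebook. The module list is an assumption based on that section (PyYAML is imported as `yaml`); the authoritative list is `requirements.txt`.

```python
from importlib.util import find_spec

# Import names assumed from the Tech Stack section.
REQUIRED_MODULES = [
    "torch",
    "torchtune",
    "transformers",
    "datasets",
    "pandas",
    "yaml",
    "bitsandbytes",
]


def missing_modules(names: list[str]) -> list[str]:
    """Return the module names that cannot be found in the active environment."""
    return [name for name in names if find_spec(name) is None]


missing = missing_modules(REQUIRED_MODULES)
if missing:
    print("Missing modules:", ", ".join(missing))
else:
    print("All required modules are available.")
```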

Skills Demonstrated

LLM Orchestration: Building end-to-end pipelines from raw data to model evaluation.

Resource Management: Implementing memory-efficient training strategies for large-scale models.

Domain Adaptation: Specializing general AI models for high-stakes industries like healthcare.

