AI-Powered Live Captioning Tool A Real-Time Speech Transcription & Audio Caption Generator built using OpenAI Whisper, Google Speech Recognition, and Streamlit. Designed for meetings, accessibility tools, and audio summarization.
π§ Upload audio files (.mp3, .wav, .m4a) and generate captions using Whisper
π€ Real-time speech-to-text from microphone input using Google STT
π Chunk-wise live transcription for meetings
π¬ Full transcript generated at the end
β FFmpeg-integrated backend for audio processing
Python 3.8+
FFmpeg (must be added to system PATH)
Install dependencies: pip install -r requirements.txt
Tool Purpose Whisper Audio transcription (offline) Google STT Real-time mic transcription Streamlit UI for web interface FFmpeg Audio processing backend
Clone the repository: git clone https://github.com/your-username/your-repo-name.git cd your-repo-name
pip install -r requirements.txt Make sure FFmpeg is installed and added to your system PATH.
streamlit run App.py
π your-project/ β βββ App.py # Main Streamlit app βββ requirements.txt # All Python dependencies βββ README.md # You're reading this!
This project was built as part of the 3Percent Hacks - Building with AI hackathon. Designed to bridge accessibility gaps and support real-time captioning for audio-based communication.