whisper-asr

Here are 8 public repositories matching this topic...

mangodxd / youtube-auto-dub

AI-powered YouTube video dubbing pipeline. Automatically transcribes (Whisper), translates (Google), and generates neural dubbing (Edge-TTS) with smart audio-video synchronization and background music preservation.

machine-translation video-translation speech-to-speech youtube-automation ai-dubbing edge-tts automatic-subtitles whisper-asr multilingual-video automated-dubbing

Updated Apr 1, 2026
Python

ventura8 / Whisper-Pro-ASR

A high-performance Docker container that runs OpenAI's Whisper model. Optimized for CPU, Intel NPU, Intel Arc/iGPU, and NVIDIA CUDA GPUs.

docker cuda speech-to-text hardware-acceleration whisper asr uvr openvino bazarr huggingface vocal-isolation faster-whisper ctranslate2 media-automation whisper-asr intel-npu

Updated Feb 1, 2026
Python

DSinghania13 / SpeechSync

A real-time Speech-to-Speech translation pipeline (ASR ➡️ NMT ➡️ TTS) using OpenAI Whisper, MarianMT, and gTTS. Features a Flask backend and a responsive web UI for low-latency multilingual communication and audio synthesis.

multilingual nlp flask text-to-speech real-time translation deep-learning machine-translation python3 speech-recognition audio-processing gtts huggingface-transformers speech-to-speech marianmt openai-whisper whisper-asr

Updated Feb 3, 2026
HTML

tahaabbas / dictator

Dictator – Supercharge Cursor Chat with voice-to-text, custom AI prompts, and workflow automation. Speak your ideas, inject templates instantly, and code faster with AI-powered assistance.

productivity transformers microphone speech-recognition developer-tools power-tools cursor transcription whisper webgpu voice-to-text speaking dictate onnx ai-tools transcription-tool cursor-ai whisper-asr cursor-extension

Updated Aug 1, 2025
JavaScript

Santhanu7Z / mci_detection

Listening Between the Lines: An explainable multimodal framework for MCI detection from spontaneous speech. Leverages Selective State Space Models (Mamba) and Gated Fusion to integrate linguistic disfluencies and eGeMAPS biomarkers across multi-corpus benchmarks (Pitt, ADReSS, TAUKADIAL).

speech-analysis multimodal-fusion clinical-nlp mild-cognitive-impairment whisper-asr mamba-ssm

Updated Mar 30, 2026
Python

asmarufoglu / roboaudio

Analysis of the impact of robot ego-noise on ASR models (Whisper), with signal-processing solutions.

robotics signal-processing audio-processing whisper-asr

Updated Dec 20, 2025
Python

SunnyYadav16 / audio-streaming-poc

A real-time audio streaming POC featuring Voice Activity Detection (VAD), Faster-Whisper ASR, NLLB-200 translation, and Piper TTS. Built with FastAPI and React to demonstrate a low-latency, end-to-end speech-to-speech pipeline.

text-to-speech machine-translation websockets audio-streaming whisper-asr llama4 nllb-200

Updated Mar 1, 2026
TypeScript

Riyaz18 / abacus-ai-tutor

A specialized AI-powered educational tool for mastering mental arithmetic. Features local LLM integration (Llama 3), real-time voice transcription (Whisper), and an interactive Canvas-based Soroban abacus.

python artificial-intelligence edtech speech-recognition abacus fastapi soroban ollama llama3 whisper-asr

Updated Apr 8, 2026
HTML
