speech-processing

Here are 794 public repositories matching this topic...

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

Updated Mar 26, 2026
Python

pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

pytorch pretrained-models speaker-recognition speaker-verification speech-processing speaker-diarization voice-activity-detection speech-activity-detection speaker-change-detection speaker-embedding overlapped-speech-detection

Updated Mar 26, 2026
Jupyter Notebook

snakers4 / silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

voice-commands speech pytorch voice-recognition vad voice-control speech-processing voice-detection voice-activity-detection onnx onnxruntime onnx-runtime

Updated Mar 26, 2026
Python

pliang279 / awesome-multimodal-ml

Reading list for research topics in multimodal machine learning

machine-learning natural-language-processing reinforcement-learning computer-vision deep-learning robotics healthcare reading-list representation-learning speech-processing multimodal-learning

Updated Aug 20, 2024

microsoft / torchscale

Foundation Architecture for (M)LLMs

machine-learning natural-language-processing translation computer-vision transformer speech-processing multimodal pretrained-language-model

Updated Apr 11, 2024
Python

linto-ai / whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Updated Sep 9, 2025
Python

r9y9 / wavenet_vocoder

WaveNet vocoder

python speech pytorch speech-synthesis wavenet speech-processing wavenet-vocoder neural-vocoder

Updated Jul 29, 2023
Python

resemble-ai / resemble-enhance

AI powered speech denoising and enhancement

speech-processing denoise speech-enhancement speech-denoising

Updated Dec 3, 2024
Python

DigitalPhonetics / IMS-Toucan

Controllable and fast Text-to-Speech for over 7000 languages!

text-to-speech deep-learning toolkit speech pytorch tts speech-synthesis speech-processing

Updated Jan 25, 2026
Python

TEN-framework / ten-vad

Voice Activity Detector (VAD): low-latency, high-performance, and lightweight

audio real-time voice-commands speech voice-recognition vad automatic-speech-recognition speech-processing conversational-ai voice-activity-detection voice-agent silero-vad

Updated Feb 2, 2026
C

r9y9 / deepvoice3_pytorch

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

python machine-learning end-to-end pytorch tts speech-synthesis speech-processing multi-speaker

Updated Dec 19, 2023
Python

wq2012 / awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

machine-learning awesome deep-learning speech-recognition awesome-list speech-processing speaker-diarization

Updated Jul 22, 2025

coqui-ai / open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

text-to-speech tts speech-synthesis voice-recognition speech-recognition speech-to-text stt speech-processing voice-activity-detection speech-separation speech-emotion-recognition voice-cloning

Updated Jun 6, 2024

HenryNdubuaku / maths-cs-ai-compendium

Become a cracked AI/ML Research Engineer

python nlp computer-science machine-learning statistics reinforcement-learning computer-vision deep-learning math algorithms linear-algebra machine-learning-algorithms probability mathematics artificial-intelligence speech-processing multimodal-learning jax ai-textbook

Updated Feb 26, 2026
JavaScript

haoheliu / voicefixer

General Speech Restoration

speech tts speech-synthesis super-resolution speech-processing vocoder speech-analysis denoise mel speech-enhancement dereverberation declipping

Updated Feb 17, 2025
Python

ictnlp / StreamSpeech

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

Updated Jun 29, 2025
Python

mravanelli / SincNet

SincNet is a neural architecture for efficiently processing raw audio samples.

Updated Apr 28, 2021
Python

midas-research / audino

Open source audio annotation tool for humans

python machine-learning datasets speech-processing audio-processing annotation-tool audio-annotation

Updated Feb 3, 2026
TypeScript

X-LANCE / SLAM-LLM

A Framework for Speech, Language, Audio, Music Processing with Large Language Model

speech-processing audio-processing peft music-processing large-language-model multimodal-large-language-models

Updated Jan 15, 2026
Python

nyrahealth / CrisperWhisper

Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection

audio recognition detection speech speech-recognition filler transcription whisper speech-processing asr timestamps verbatim

Updated Jun 3, 2025
Python
