Skip to content
Change the repository type filter

All

    Repositories list

    • InspireMusic: A Unified Framework for Music, Song, Audio Generation.
      Python
      Apache License 2.0
      136000Updated May 9, 2025May 9, 2025
    • PDMX

      Public
      PDMX: A Large-Scale Public Domain MusicXML Dataset for Symbolic Music Processing
      Python
      MIT License
      9000Updated Oct 2, 2024Oct 2, 2024
    • lycon

      Public
      Python
      1000Updated Sep 1, 2024Sep 1, 2024
    • diarizers

      Public
      Python
      22000Updated Jun 14, 2024Jun 14, 2024
    • Awesome speech/audio LLMs, representation learning, and codec models
      73000Updated Apr 13, 2024Apr 13, 2024
    • Zero-Shot Speech Editing and Text-to-Speech in the Wild
      Jupyter Notebook
      Other
      796000Updated Mar 29, 2024Mar 29, 2024
    • Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along wit…
      Python
      MIT License
      2.6k000Updated Jun 25, 2023Jun 25, 2023
    • Port of OpenAI's Whisper model in C/C++
      C
      MIT License
      5.4k000Updated Feb 18, 2023Feb 18, 2023
    • Audio generation using diffusion models, in PyTorch.
      Python
      MIT License
      180000Updated Aug 17, 2022Aug 17, 2022
    • LEAF is a learnable alternative to audio features such as mel-filterbanks, that can be initialized as an approximation of mel-filterbanks, and then be trained f…
      Python
      Apache License 2.0
      53000Updated Mar 1, 2022Mar 1, 2022
    • NATSpeech

      Public
      A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
      Python
      MIT License
      102000Updated Feb 17, 2022Feb 17, 2022
    • AutoEq

      Public
      Automatic headphone equalization from frequency responses
      Jupyter Notebook
      MIT License
      2.5k000Updated Dec 2, 2021Dec 2, 2021
    • soundata

      Public
      Python library for downloading, loading & working with sound datasets
      Python
      BSD 3-Clause "New" or "Revised" License
      27000Updated Nov 24, 2021Nov 24, 2021
    • TTS

      Public
      🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
      Jupyter Notebook
      Mozilla Public License 2.0
      6k000Updated Sep 17, 2021Sep 17, 2021
    • praudio

      Public
      Audio preprocessing framework for Deep Learning audio applications
      Python
      MIT License
      10000Updated Aug 27, 2021Aug 27, 2021
    • Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.
      Python
      MIT License
      115000Updated Jun 9, 2021Jun 9, 2021
    • Command-line tools for speech and intent recognition on Linux
      Python
      MIT License
      66000Updated May 21, 2021May 21, 2021
    • The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging…
      HTML
      31000Updated Mar 18, 2021Mar 18, 2021
    • A C++ standalone library for machine learning
      C++
      Other
      503000Updated Mar 4, 2021Mar 4, 2021
    • Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
      Python
      MIT License
      100000Updated Feb 24, 2021Feb 24, 2021
    • Spokestack is a library that allows a user to easily incorporate a voice interface into a Python application.
      Python
      Apache License 2.0
      15000Updated Jan 27, 2021Jan 27, 2021
    • pyttsx3

      Public
      Offline Text To Speech synthesis for python
      Python
      GNU General Public License v3.0
      359000Updated Sep 30, 2020Sep 30, 2020
    • espnet

      Public
      End-to-End Speech Processing Toolkit
      Python
      Apache License 2.0
      2.4k000Updated Jun 5, 2020Jun 5, 2020
    • An implementation of a Convolutional Neural Network to Classify Music Genres
      Python
      MIT License
      9000Updated Sep 5, 2019Sep 5, 2019
    • implementation of music transformer with tensorflow-2.0 (ICLR2019)
      Python
      MIT License
      78000Updated Aug 12, 2019Aug 12, 2019
    • lmms

      Public
      Cross-platform music production software
      C++
      GNU General Public License v2.0
      1.2k000Updated Apr 17, 2019Apr 17, 2019
    • snickery

      Public
      Hybrid speech synthesiser
      Python
      Apache License 2.0
      6000Updated Nov 6, 2018Nov 6, 2018
    • Keras implementation of ‘’Deep Speaker: an End-to-End Neural Speaker Embedding System‘’ (speaker recognition)
      Python
      80000Updated Oct 5, 2018Oct 5, 2018
    • pytheory

      Public
      Music Theory for Humans.
      Python
      82000Updated Sep 10, 2018Sep 10, 2018
    • amodem

      Public
      Audio MODEM Communication Library in Python
      Python
      Other
      134000Updated Jun 18, 2018Jun 18, 2018
    ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.