Skip to content
Change the repository type filter

All

    Repositories list

    • Self-supervised Speaker Diarization Interspeech 2022 Implementation
      Python
      0800Updated Feb 2, 2026Feb 2, 2026
    • Python
      1000Updated Jan 27, 2026Jan 27, 2026
    • Python
      MIT License
      1000Updated Sep 12, 2025Sep 12, 2025
    • Causal streaming adaptation of OpenAI Whisper for real-time transcription on small audio chunks.
      Python
      Other
      13000Updated Aug 21, 2025Aug 21, 2025
    • StructED

      Public
      Risk Minimization Algorithms in Structured Prediction
      Java
      Other
      2000Updated Jun 13, 2025Jun 13, 2025
    • DIFFAR

      Public
      Denoising Diffusion Autoregressive Model for Raw Speech Waveform Generation
      Python
      MIT License
      3000Updated Jun 7, 2025Jun 7, 2025
    • Jupyter Notebook
      2000Updated Jun 7, 2025Jun 7, 2025
    • Formant Tracking & Estimation
      Python
      MIT License
      178262Updated Dec 15, 2024Dec 15, 2024
    • Python
      0400Updated Oct 6, 2024Oct 6, 2024
    • DDKtor

      Public
      Python
      MIT License
      1200Updated Aug 8, 2024Aug 8, 2024
    • Python
      3611Updated Jul 27, 2024Jul 27, 2024
    • 0000Updated Jun 9, 2024Jun 9, 2024
    • Python
      MIT License
      61511Updated Apr 1, 2024Apr 1, 2024
    • Dr.VOT

      Public
      Dr.VOT is an a software package for automatic measurement of voice onset time (VOT).
      Python
      103212Updated Jul 25, 2023Jul 25, 2023
    • GradSeg

      Public
      Python
      MIT License
      3410Updated Mar 13, 2023Mar 13, 2023
    • .github

      Public
      0000Updated Dec 31, 2022Dec 31, 2022
    • DeepFry

      Public
      Python
      MIT License
      7000Updated Dec 8, 2022Dec 8, 2022
    • PiMOD

      Public
      Pitch Estimation by Multiple Octave Decoders
      Python
      MIT License
      1000Updated Oct 30, 2022Oct 30, 2022
    • DSegKNN

      Public
      Python
      2100Updated Oct 25, 2022Oct 25, 2022
    • SpeechYOLO Interspeech 2019
      Python
      MIT License
      124630Updated Aug 16, 2022Aug 16, 2022
    • UnsupSeg

      Public
      Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation (INTERSPEECH 2020)
      Python
      MIT License
      32000Updated Aug 5, 2022Aug 5, 2022
    • CRP

      Public
      Training with constant perturbations against adversarial attacks.
      Python
      0000Updated Feb 17, 2021Feb 17, 2021
    • Python
      12000Updated Jan 17, 2021Jan 17, 2021
    • Python
      0000Updated Nov 22, 2020Nov 22, 2020
    • TeX
      1000Updated Apr 12, 2020Apr 12, 2020
    • Machine learning-based tools for fine grained phonetic measurements
      HTML
      Other
      0000Updated Nov 24, 2019Nov 24, 2019
    • This project implements the paper: "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"
      Python
      0100Updated Jul 8, 2019Jul 8, 2019
    • 0040Updated Nov 3, 2018Nov 3, 2018
    • Watermarking Deep Neural Networks (USENIX 2018)
      Python
      33100Updated Oct 8, 2018Oct 8, 2018
    • A software package for automatic extraction of pre-aspiration from speech segments in audio files, using a trainable algorithm.
      C++
      GNU Lesser General Public License v3.0
      1200Updated Jun 24, 2017Jun 24, 2017
    ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.