@kyutai-labs

kyutai

Kyutai - Open Science AI Lab

Popular repositories

  1. moshi Public

    Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

    Python 9.8k 915

  2. pocket-tts Public

    A TTS that fits in your CPU (and pocket)

    Python 3.6k 401

  3. delayed-streams-modeling Public

    Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.

    Python 2.9k 300

  4. hibiki Public

    Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation, where one waits for the end of the source utterance to start translating, H…

    Rust 1.4k 111

  5. unmute Public

    Make text LLMs listen and speak

    Python 1.2k 215

  6. moshi-finetune Public

    Python 411 61

Repositories

Showing 10 of 26 repositories
  • dactory Public
    Python 49 Apache-2.0 6 0 0 Updated Mar 12, 2026
  • pocket-tts Public

    A TTS that fits in your CPU (and pocket)

    Python 3,562 MIT 401 27 (7 issues need help) 4 Updated Mar 12, 2026
  • flash-attn3-jax Public

    JAX bindings for the FlashAttention 3 kernels

    C++ 19 BSD-3-Clause 1 0 0 Updated Mar 9, 2026
  • casa Public

    A vision-language model with an improved cross-attention mechanism for scalable streaming inference

    Python 28 MIT 3 3 0 Updated Mar 9, 2026
  • unmute Public

    Make text LLMs listen and speak

    Python 1,225 MIT 215 27 (3 issues need help) 1 Updated Mar 6, 2026
  • moshi Public

    Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

    Python 9,837 Apache-2.0 915 66 14 Updated Mar 4, 2026
  • invincible-voice Public

    To bring back voice to those who lost it

    TypeScript 67 MIT 7 6 (1 issue needs help) 1 Updated Mar 3, 2026
  • yomikomi Public

    A small Rust-based data loader

    Rust 36 Apache-2.0 2 1 1 Updated Feb 20, 2026
  • hibiki-zero Public

    A real-time and multilingual speech translation model

    Python 213 MIT 21 2 0 Updated Feb 13, 2026
  • flashy Public

    Framework for writing deep learning training loops. Lightweight, and retaining full freedom to design as you see fit. It handles checkpointing, logging, distributed training, compatibility with Dora, and more!

    Python 5 MIT 0 0 0 Updated Feb 4, 2026
