Voice Chat

Overview

Voice chat models are designed to enable voice-based interactions with AI systems. Unlike traditional text-only assistants, these models can generate natural-sounding speech as responses, creating a more immersive and human-like conversational experience. Some models accept text input and respond with voice, while others can process both speech and text, allowing users to talk directly to the model or type messages depending on the use case.

Depending on the model, you may be able to set the bitrate, choose the output audio format (often including lossless options), switch between streaming and non-streaming modes, and select from a variety of voices, along with ways to customize or modify them.

All Available Voice Chat Models

| Model ID | Developer | Context | Model Card |
| --- | --- | --- | --- |
| elevenlabs/v3_alpha | ElevenLabs | | Eleven v3 Alpha |
| minimax/speech-2.5-turbo-preview | MiniMax | | MiniMax Speech 2.5 Turbo |
| minimax/speech-2.5-hd-preview | MiniMax | | MiniMax Speech 2.5 HD |
| minimax/speech-2.6-turbo | MiniMax | | MiniMax Speech 2.6 Turbo |
| minimax/speech-2.6-hd | MiniMax | | MiniMax Speech 2.6 HD |
| minimax/speech-2.8-turbo | MiniMax | | Speech 2.8 Turbo |
| minimax/speech-2.8-hd | MiniMax | | Speech 2.8 HD |
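
As a rough illustration of how a model ID from the table might be combined with the settings mentioned in the overview, here is a minimal sketch that only assembles a request payload locally. The function name `build_speech_request` and every field name (`text`, `audio_format`, `bitrate`, `stream`, `voice`) are assumptions made for this example, as are the default values; the actual parameters vary by model, so check the relevant model card before relying on any of them.

```python
import json

# Model IDs copied from the table above.
VOICE_MODELS = {
    "elevenlabs/v3_alpha",
    "minimax/speech-2.5-turbo-preview",
    "minimax/speech-2.5-hd-preview",
    "minimax/speech-2.6-turbo",
    "minimax/speech-2.6-hd",
    "minimax/speech-2.8-turbo",
    "minimax/speech-2.8-hd",
}


def build_speech_request(model, text, *, audio_format="mp3",
                         bitrate=128000, stream=False, voice=None):
    """Assemble a hypothetical request payload for a voice chat model.

    All field names and defaults here are illustrative assumptions,
    not a documented schema.
    """
    if model not in VOICE_MODELS:
        raise ValueError(f"unknown voice chat model: {model}")
    if not text:
        raise ValueError("text must not be empty")

    payload = {
        "model": model,
        "text": text,
        "audio_format": audio_format,  # assumed field name
        "bitrate": bitrate,            # assumed field name, bits per second
        "stream": stream,              # assumed field name
    }
    if voice is not None:
        payload["voice"] = voice       # assumed field name
    return payload


if __name__ == "__main__":
    request = build_speech_request("minimax/speech-2.6-turbo", "Hello!")
    print(json.dumps(request, indent=2))
```

Validating the model ID before building the payload catches typos early, which is useful because the IDs mix hyphens, underscores, and version suffixes.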

Several models that were originally listed in our Text Models (LLM) section should also be included in this category: