| description | Overview of the available speech model providers. |
|---|---|
| icon | waveform |
With our API you are able to synthesize speech and transform speech into text.
We support multiple voice/speech models. You can find the complete list along with API reference links at the end of the page.
{% content-ref url="speech-to-text/" %} speech-to-text {% endcontent-ref %}
{% content-ref url="text-to-speech/" %} text-to-speech {% endcontent-ref %}
{% content-ref url="voice-chat/" %} voice-chat {% endcontent-ref %}
| Model ID + API Reference link | Developer | Context | Model Card |
|---|---|---|---|
| aai/slam-1 | Assembly AI | Slam 1 | |
| aai/universal | Assembly AI | Universal | |
| #g1_nova-2-automotive | Deepgram | Deepgram Nova-2 | |
| #g1_nova-2-conversationalai | Deepgram | Deepgram Nova-2 | |
| #g1_nova-2-drivethru | Deepgram | Deepgram Nova-2 | |
| #g1_nova-2-finance | Deepgram | Deepgram Nova-2 | |
| #g1_nova-2-general | Deepgram | Deepgram Nova-2 | |
| #g1_nova-2-medical | Deepgram | Deepgram Nova-2 | |
| #g1_nova-2-meeting | Deepgram | Deepgram Nova-2 | |
| #g1_nova-2-phonecall | Deepgram | Deepgram Nova-2 | |
| #g1_nova-2-video | Deepgram | Deepgram Nova-2 | |
| #g1_nova-2-voicemail | Deepgram | Deepgram Nova-2 | |
| #g1_whisper-tiny | OpenAI | - | |
| #g1_whisper-small | OpenAI | - | |
| #g1_whisper-base | OpenAI | - | |
| #g1_whisper-medium | OpenAI | - | |
| #g1_whisper-large | OpenAI | Whisper | |
| openai/gpt-4o-transcribe | OpenAI | GPT-4o Transcribe | |
| openai/gpt-4o-mini-transcribe | OpenAI | GPT-4o Mini Transcribe |
| Model ID | Developer | Context | Model Card |
|---|---|---|---|
| elevenlabs/v3_alpha | ElevenLabs | Eleven v3 Alpha | |
| minimax/speech-2.5-turbo-preview | MiniMax | MiniMax Speech 2.5 Turbo | |
| minimax/speech-2.5-hd-preview | MiniMax | MiniMax Speech 2.5 HD | |
| minimax/speech-2.6-turbo | MiniMax | MiniMax Speech 2.6 Turbo | |
| minimax/speech-2.6-hd | MiniMax | MiniMax Speech 2.6 HD | |
| minimax/speech-2.8-turbo | MiniMax | Speech 2.8 Turbo | |
| minimax/speech-2.8-hd | MiniMax | Speech 2.8 HD |