VOICE TO TEXT

Privacy-first. Blazing fast. Hold a key, speak, release — text appears where you need it.

Built Different

Hold → Speak → Release

Global hotkey captures audio instantly. Release to transcribe. Text injected into your active app.

Your Machine. Your Data.

All transcription runs locally. Audio and logs never leave your device unless you explicitly opt in.

1.5s for 30s Audio

Written in native code with GPU acceleration. The fastest local transcription engine available.

Optional Online Services

Use your own API key for cloud LLMs. Enable or disable at any time — your choice, always.

Backend Flexibility

Auto-detect, CPU-only, or GPU acceleration. Choose the model size that fits your hardware.

LLM-Powered Accuracy

Route audio through Gemini, GPT, or specialized models for ultra-accurate transcription of technical content.

Raw Speed

30 seconds of English audio

OiPer Desktop1.5s
Lemonfox API3.27s
Python Faster-Whisper3.55s
OpenAI Whisper 1 API6.46s

Privacy Is Not Optional

Every transcription runs on your hardware. Your audio never leaves your machine. Activity logs stay local. Online services are available — but only when you choose, with your own API keys.

Local Transcription

Runs entirely on your CPU or GPU

No Telemetry

Zero data collection by default

Your API Keys

Online features use your own credentials