Free Voice-to-Text for Windows: No Subscription Required
Record voice → Get text instantly → Paste anywhere
VoiceGrab is a free, open-source alternative to Dragon NaturallySpeaking ($300) and Windows Voice Typing. It uses the Groq Whisper API for professional-grade transcription in 57 languages.
Not a browser extension. Works system-wide: VS Code, Cursor, Slack, Word, ChatGPT, Telegram, and any other Windows app.
| You get | Why it matters |
|---|---|
| 100% Free Tier | Groq's free API = ~8 hours/day of dictation |
| Speak 3x faster than typing | Voice prompts for ChatGPT, Claude, Copilot |
| Auto-Translation | Speak Russian → Type English (DeepL/Groq) |
| Smart Cleaning | Removes "um", "uh" and Whisper hallucinations |
| 5 Dev Modes | Prompts for coding, docs, notes, chat |
Note: Gemini 2.0 integration is experimental (in development). Primary stable engines: Groq (transcription) + DeepL (translation).
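The "Smart Cleaning" row above can be illustrated with a minimal sketch. This is not VoiceGrab's actual filter: the function name `clean_transcript`, the filler list, and the example hallucination phrases are all assumptions chosen for illustration.

```python
import re

# Hypothetical filler list, based on the "um"/"uh" examples in the table.
FILLERS = ("um", "uh")
# Hypothetical examples of phrases a speech model may emit on silence.
HALLUCINATIONS = ("thanks for watching", "subscribe to my channel")

# Match a whole filler word plus an optional trailing comma/period and spaces.
_FILLER_RE = re.compile(
    r"\b(?:" + "|".join(map(re.escape, FILLERS)) + r")\b[,.]?\s*",
    flags=re.IGNORECASE,
)


def clean_transcript(text: str) -> str:
    """Drop known hallucinated phrases, then filler words, then tidy spacing."""
    for phrase in HALLUCINATIONS:
        text = re.sub(re.escape(phrase) + r"[.!]?", "", text, flags=re.IGNORECASE)
    text = _FILLER_RE.sub("", text)
    text = re.sub(r"\s{2,}", " ", text)
    return text.strip()
```

Word boundaries keep ordinary words such as "umbrella" intact. A word like "like" is left out of this sketch because it is also a normal word and needs more context to remove safely.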
Voice-to-Text with Real-time Translation:
demo.mp4
(15 sec: Dictating in Russian → Auto-translates to English → Pastes into MS Word)
- Download Latest Release (ZIP)
- Get Free API Key → console.groq.com/keys
- Run `VoiceGrab.bat`
- Enter key → Install Deps → Run
- Press Right Ctrl to dictate!
git clone https://github.com/consulfedor/VoiceGrab.git
cd VoiceGrab

| Action | How |
|---|---|
| Start/Stop Recording | Right Ctrl |
| Switch Mode | Click ◀ Mode ▶ on indicator |
| Settings | Right-click tray → Settings |
Tip: The text is always copied to the clipboard, so you can press Ctrl+V to paste it anywhere.
- One-Click Recording: Press Right Ctrl (configurable)
- Unlimited Recording: Auto-segments every 3 min
- Auto-Paste: Text types directly into active window
- Groq Whisper API: Fast, accurate, FREE tier
- Filler Cleanup: Removes "um", "uh", "like"
- Hallucination Filter: Removes AI "ghost" phrases
- Auto-Translate: Off / Replace / Append modes
- DeepL Integration: Quality translation
- Groq AI Translate: Fast AI translation
- System Tray: Runs in background
- Floating Indicator: Shows mode + recording status
- Dark Settings Panel: All options per mode
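The "Auto-segments every 3 min" behaviour in the list above could look roughly like the sketch below. It assumes the recording is held as a flat sequence of samples; the function name `split_into_segments` and its parameters are hypothetical, and VoiceGrab may segment audio differently (for example, while recording rather than afterwards).

```python
from typing import Iterator, Sequence

SEGMENT_SECONDS = 180  # "every 3 min", taken from the feature list


def split_into_segments(
    samples: Sequence[int],
    sample_rate: int,
    segment_seconds: int = SEGMENT_SECONDS,
) -> Iterator[Sequence[int]]:
    """Yield consecutive slices holding at most segment_seconds of audio."""
    if sample_rate <= 0 or segment_seconds <= 0:
        raise ValueError("sample_rate and segment_seconds must be positive")
    step = sample_rate * segment_seconds
    for start in range(0, len(samples), step):
        yield samples[start:start + step]
```

Each slice can then be sent for transcription on its own, which keeps individual uploads small no matter how long the dictation runs.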
| Mode | Use Case |
|---|---|
| AI Chat | ChatGPT prompts, AI conversations |
| Code | Programming, technical terms |
| Docs | Documentation, formal writing |
| Notes | Quick notes, ideas |
| Chat | Casual conversation |
- Language (57 options)
- Prompt (context hint for Whisper)
- Filler/Hallucination filters
- Auto-Translate (Off/Replace/Append)
- Translation engine (Groq/DeepL)
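As an illustration of the three Auto-Translate settings listed above, here is a small sketch. The function name `apply_translation` is hypothetical, and placing the translation on a new line in Append mode is an assumption, not confirmed behaviour.

```python
def apply_translation(original: str, translated: str, mode: str = "off") -> str:
    """Combine the original and translated text according to the chosen mode."""
    mode = mode.lower()
    if mode == "off":
        return original  # translation is ignored
    if mode == "replace":
        return translated  # only the translation is kept
    if mode == "append":
        # Assumption: the translation follows the original on a new line.
        return f"{original}\n{translated}"
    raise ValueError(f"unknown auto-translate mode: {mode!r}")
```

For example, `apply_translation("Привет", "Hello", "Replace")` returns `"Hello"`.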
- Windows 10/11
- Python 3.10+ (Download)
  Check "Add Python to PATH" during installation!
- Microphone
- Config Protection: Prevents saving empty/NULL config
- NULL Eval Fix: Fixed startup error
- Settings Fix: All fields save correctly
- Single Default Config: `config_default.json`
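The idea behind Config Protection can be sketched as a guard in front of the write. The function name `save_config` and the example key are hypothetical; the real check in VoiceGrab is not shown in this README and may be stricter.

```python
import json
from pathlib import Path
from typing import Any


def save_config(config: Any, path: Path) -> bool:
    """Write config as JSON, refusing None, non-dict, or empty values.

    Returns True if the file was written, False if the save was rejected.
    """
    if not isinstance(config, dict) or not config:
        return False  # keep whatever is already on disk
    path.write_text(json.dumps(config, indent=2), encoding="utf-8")
    return True
```

Rejecting the save instead of raising means a broken settings form cannot overwrite a working configuration file.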
Older versions
- Custom Mode Names
- Tab Tooltips
- Mode Priority System
- Auto-Translate (Groq/DeepL)
- Translator Tool
- Batch Transcription
- Document Converter
- 57 Languages support
Is it really free?
Yes! Groq offers a generous FREE tier:
- ~10 requests per minute
- ~25,000 audio seconds per day (~8 hours)
- No credit card required
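To stay inside these limits, a client could track its own usage. The sketch below uses the approximate figures quoted above; the class and function names are hypothetical, and the real limits are set by Groq and may change.

```python
from dataclasses import dataclass

REQUESTS_PER_MINUTE = 10        # approximate figure from the list above
AUDIO_SECONDS_PER_DAY = 25_000  # approximate figure from the list above


@dataclass
class DailyBudget:
    """Tracks how many audio seconds have been sent today."""

    used_seconds: float = 0.0

    def remaining(self) -> float:
        return max(0.0, AUDIO_SECONDS_PER_DAY - self.used_seconds)

    def try_spend(self, clip_seconds: float) -> bool:
        """Reserve clip_seconds if they fit; return False when over budget."""
        if clip_seconds < 0:
            raise ValueError("clip_seconds must not be negative")
        if clip_seconds > self.remaining():
            return False
        self.used_seconds += clip_seconds
        return True


def min_seconds_between_requests(rpm: int = REQUESTS_PER_MINUTE) -> float:
    """Spacing between requests that keeps the rate at or below rpm."""
    return 60.0 / rpm
```

With the figures above, spacing requests at least 6 seconds apart keeps a client under the per-minute limit.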
Which languages are supported?
57 languages including: Russian, English, Ukrainian, Turkish, Spanish, French, German, Chinese, Japanese.
Does it work offline?
No, it requires an internet connection. Audio is processed in the cloud and deleted immediately.
| Issue | Solution |
|---|---|
| Python not found | Reinstall, check "Add to PATH" |
| No microphone | Check Windows sound settings |
| Rate limit | Wait 1 min, or use Turbo model |
- Core: Python 3.10+
- UI: PowerShell + WebView2 (HTML/CSS)
- APIs: Groq (Whisper), DeepL, Gemini (dev)
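For readers curious about the Groq (Whisper) part of the stack, here is a minimal sketch of a transcription call using the official `groq` Python package (`pip install groq`). It is not VoiceGrab's own code: the helper names, the `GROQ_API_KEY` environment variable, and the choice of the `whisper-large-v3` model are assumptions.

```python
import os
from typing import Optional


def build_transcription_kwargs(
    language: Optional[str] = None,
    prompt: Optional[str] = None,
    model: str = "whisper-large-v3",  # assumption: the README does not name the model
) -> dict:
    """Map per-mode settings (language, prompt) to request parameters."""
    kwargs = {"model": model, "response_format": "json"}
    if language:
        kwargs["language"] = language  # ISO 639-1 code such as "ru" or "en"
    if prompt:
        kwargs["prompt"] = prompt  # context hint for Whisper
    return kwargs


def transcribe(path: str, **options) -> str:
    """Send one audio file to Groq and return the recognized text."""
    from groq import Groq  # third-party package, imported lazily

    client = Groq(api_key=os.environ["GROQ_API_KEY"])
    with open(path, "rb") as audio:
        result = client.audio.transcriptions.create(
            file=(os.path.basename(path), audio.read()),
            **build_transcription_kwargs(**options),
        )
    return result.text
```

A call such as `transcribe("note.wav", language="ru", prompt="Python, asyncio")` would need a valid API key and network access.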
MIT License: free for personal and commercial use.
Star this repo if you find it useful!