Handy is a cross-platform desktop application built with Tauri (Rust + React/TypeScript) that provides offline speech-to-text transcription directly into any text field. Using either Whisper models (Small/Medium/Turbo/Large with GPU acceleration) or CPU-optimized Parakeet V3, it filters silence via Silero VAD and pastes transcribed text into your active application.
Completely free and open-source, Handy prioritizes privacy by keeping all voice data on your local machine. It includes configurable keyboard shortcuts with toggle or push-to-talk modes, external editor support, manual model installation for restricted networks, and comprehensive debug logging. Available for macOS, Windows, and Linux.
Use Cases:
Discord (https://img.shields.io/badge/Discord-%235865F2.svg?style=for-the-badge&logo=discord&logoColor=white)
A free, open source, and extensible speech-to-text application that works completely offline.
Handy is a cross-platform desktop application built with Tauri (Rust + React/TypeScript) that provides simple, privacy-focused speech transcription. Press a shortcut, speak, and have your words appear in any text field—all without sending your voice to the cloud.
Handy was created to fill the gap for a truly open source, extensible speech-to-text tool. As stated on handy.computer (https://handy.computer):
Handy isn't trying to be the best speech-to-text app—it's trying to be the most forkable one.
The process is entirely local:
For detailed build instructions including platform-specific requirements, see BUILD.md.
Handy is built as a Tauri application combining:
whisper-rs: Local speech recognition with Whisper modelstranscription-rs: CPU-optimized speech recognition with Parakeet modelscpal: Cross-platform audio I/Ovad-rs: Voice Activity Detectionrdev: Global keyboard shortcuts and system eventsrubato: Audio resamplingHandy includes an advanced debug mode for development and troubleshooting. Access it by pressing:
Cmd+Shift+DCtrl+Shift+DThis project is actively being developed and has some known issues (https://github.com/cjpais/Handy/issues). We believe in transparency about the current state:
Whisper Model Crashes:
Wayland Support (Linux):
Clipboard (CTRL+V) / Clipboard (Shift+Insert)) copy the transcription once, then try to run wtype (https://github.com/atx/wtype) (preferred) or dotool (https://sr.ht/~geb/dotool/) to fire the paste keystroke. Install one of these tools to let Handy drive the compositor-friendly paste shortcut; otherwise it falls back to Enigo-generated key events, which may not work on Wayland.Terminal-based AI coding assistant with multi-provider LLM support, session management, LSP integration, and interactive TUI for developers.
AI-powered video translation and dubbing tool supporting 100 languages with voice cloning, automated subtitle generation, and platform-optimized output for global content distribution.
All-in-one AI content marketing platform for creating, publishing, and monetizing across 14+ social channels with automation, trend tracking, and engagement tools.
Privacy-first AI meeting assistant with local transcription, speaker diarization, and automated summarization running entirely on your infrastructure without cloud dependencies.
Modern cross-platform system monitor built with Rust offering real-time CPU and memory tracking with beautiful UI, process management, and advanced search capabilities.
Ultra-efficient large language model achieving 3x faster reasoning generation on end devices with hybrid sparse attention and extensive hardware acceleration support.