Voice-Operated eXecution — "Talk to your terminal. Hear what matters."
macOS menu bar app that monitors your development tools and speaks smart summaries. Supports 6 apps (Terminal, iTerm2, Claude Desktop, Cursor, VS Code, Windsurf), auto-detects language, and offers 3 TTS engines including local AI summaries via Ollama. Everything on-device, no cloud, no telemetry.
VOX overkill for your setup? This is the lightweight alternative: audio feedback for Claude Code via hooks. Hear when Claude is done, makes an error, or needs your attention — with swappable sound packs (Warcraft Peon, Beavis & Butthead, macOS system sounds, or your own pack). Multiple random variants per event and per-terminal pack setting via env var. No app needed, pure Node.js + hooks.
VOX monitors your development tools and speaks smart summaries of what happened.
Use Terminal, iTerm2, Claude Desktop, Cursor, VS Code, or Windsurf as usual.
VOX detects new output via AppleScript and processes it with local AI (Ollama) or heuristics.
A smart summary is spoken in the detected language using your chosen TTS engine.
VOX never executes commands — it only reads terminal output. No accessibility permissions, no shell access, no risk.
Compact, powerful, and fully on-device.
Monitors Terminal.app, iTerm2, Claude Desktop, Cursor, VS Code, and Windsurf. Automatic background detection.
Auto-detects Dutch, English, and German content. Speaks summaries in the matching language with appropriate voice.
macOS native (NSSpeechSynthesizer), Edge TTS (Microsoft voices), and ElevenLabs. Choose quality vs. speed.
Ollama-powered on-device summarization. No cloud required. Falls back to heuristic summaries when Ollama is unavailable.
MyInstants integration for downloadable sound effects. Custom event sounds for completions, errors, and alerts.
Searchable, filterable log of all monitored commands and responses. Persistently stored as JSON.
Choose how much detail you want to hear. The Summary level uses smart heuristics: git status shows file counts, build output detects success/failure, errors automatically increase verbosity.
| Level | Name | What you hear |
|---|---|---|
| 0 | Silent | Nothing (visual indicator only) |
| 1 | Ping | "Done." or "Error occurred." |
| 2 | Summary | Heuristic 1-2 sentence summary (default) |
| 3 | Full | Complete response read aloud |
VOX is built with privacy as its foundation. Everything runs locally on your Mac.
All processing happens on-device. No server, no API calls.
Zero analytics, zero tracking, zero data collection.
Voice audio is never stored. Hex handles all input.
VOX reads output via AppleScript, never executes commands.
No accessibility or microphone permissions required.
Full source code is public and auditable.
Native macOS app built with modern Swift.
Swift 6 with SwiftUI for the UI
macOS native, Edge TTS, ElevenLabs
Ollama (local, on-device) + heuristic fallback
AppleScript via /usr/bin/osascript
macOS 14+ (Apple Silicon optimized)
XCTest unit tests
Build from source with Swift Package Manager.
Overview of the app architecture.
From MVP to multi-app voice companion.
VOX is open source and welcomes contributions from everyone.
License: MIT · Tags: voice-control, macos, swift, developer-tools, speech-to-text