Question 1

How does it work without internet?

Accepted Answer

Detto runs two AI models locally on your Mac. Parakeet-TDT v3 handles speech recognition on the Neural Engine. Llama 3.2 3B handles text refinement on the GPU via MLX. After a one-time model download (~3 GB), dictation and transcription work entirely offline.

Question 2

What Mac do I need?

Accepted Answer

Any Mac with Apple Silicon (M1 or later) running macOS 26 or later, with at least 8 GB of RAM. The models need approximately 3 GB of disk space. Intel Macs are not supported.
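
If you want to check a machine against these requirements before downloading, here is a minimal sketch in Python. It is not part of Detto and does not reflect how Detto itself checks compatibility. The function names `check_requirements` and `read_local_machine` are hypothetical, and the thresholds are copied from the answer above. One caveat: a Python interpreter running under Rosetta reports `x86_64` even on Apple Silicon, and some Python builds may report a compatibility version number for macOS, so treat the local readings as a hint only.

```python
import os
import platform
import shutil

# Thresholds copied from the FAQ answer above.
MIN_MACOS_MAJOR = 26
MIN_RAM_GB = 8
MIN_FREE_DISK_GB = 3


def check_requirements(arch, macos_major, ram_gb, free_disk_gb):
    """Return a list of unmet requirements; an empty list means the Mac qualifies."""
    problems = []
    if arch != "arm64":
        problems.append("Apple Silicon (M1 or later) is required; Intel Macs are not supported")
    if macos_major < MIN_MACOS_MAJOR:
        problems.append(f"macOS {MIN_MACOS_MAJOR} or later is required")
    if ram_gb < MIN_RAM_GB:
        problems.append(f"at least {MIN_RAM_GB} GB of RAM is required")
    if free_disk_gb < MIN_FREE_DISK_GB:
        problems.append(f"about {MIN_FREE_DISK_GB} GB of free disk space is needed for the models")
    return problems


def read_local_machine():
    """Best-effort readings for the current machine (hypothetical helper)."""
    version = platform.mac_ver()[0]  # e.g. "26.0"; empty string when not on macOS
    macos_major = int(version.split(".")[0]) if version else 0
    ram_bytes = os.sysconf("SC_PAGE_SIZE") * os.sysconf("SC_PHYS_PAGES")
    free_bytes = shutil.disk_usage("/").free
    return {
        "arch": platform.machine(),  # "arm64" on Apple Silicon
        "macos_major": macos_major,
        "ram_gb": ram_bytes / 1024**3,
        "free_disk_gb": free_bytes / 1024**3,
    }
```

To run the check on your own Mac, call `check_requirements(**read_local_machine())` and print the result. An empty list means all four conditions are met.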

Question 3

How accurate is it compared to cloud transcription?

Accepted Answer

Parakeet-TDT v3 is competitive with cloud ASR for English in professional contexts. The local refinement step cleans up filler words, grammar, and punctuation, but a small local model won't match a frontier cloud model on polish. That's the trade-off for keeping your voice on your own Mac.

Question 4

Does dictation work with any app?

Accepted Answer

Yes. Detto injects text at your cursor in whatever app has focus: Slack, Zoom chat, VS Code, Notion, email, browser forms, Terminal. If you can type in it, Detto works with it.

Question 5

What languages does it support?

Accepted Answer

Speech recognition (ASR) auto-detects 25 European languages. Text refinement is tuned for English, so other languages transcribe fine but get less polish on punctuation and capitalization.

Question 6

What permissions does it need?

Accepted Answer

Three macOS permissions: Microphone for all modes, Screen Recording for call capture (system audio), and Accessibility for dictation (text injection). All three are requested during onboarding.

Question 7

Is it really free?

Accepted Answer

Yes. Detto is source-available under the Business Source License 1.1. You can use it personally, build from source, and modify it. The license converts to MIT in 2030. The speech engine (GrembleVoice) is open source under Apache 2.0.

Your voice stays on your Mac.

Three ways to capture. One place it lands.

Capture with context.

Plain files you own.

Speech recognition on the Neural Engine

Text refinement on the GPU

No server. No account.

Privacy is architecture, not a toggle.

Questions

Your voice. Your Mac. No one listening.