Question 1

What is Doing?

Accepted Answer

Doing is a voice transcription app for macOS that runs entirely on your Mac. Hold a hotkey, speak, and your words are transcribed and pasted into whatever app you're using. It uses NVIDIA's Parakeet model for on-device transcription at 150x realtime speed — a 60-second recording transcribes in about 400 milliseconds.

Question 2

Does Doing work offline?

Accepted Answer

Yes. The default transcription engine (Parakeet) runs completely on-device with no internet connection required. Your audio never leaves your Mac. Cloud-based engines (OpenAI Whisper, Gemini, AssemblyAI) are also available if you prefer, but they're optional and require your own API keys.

Question 3

What is the best voice transcription app for Mac?

Accepted Answer

Doing is designed to be the fastest and most private voice transcription app for macOS. It processes audio locally at 150x realtime using NVIDIA's Parakeet model, costs $49 once with no subscription, and works in any text field system-wide. Unlike cloud-based alternatives, no audio is ever uploaded to a server. See how Doing compares to other voice transcription tools at doing.tools/compare.

Question 4

Is Doing a subscription?

Accepted Answer

No. Doing is a one-time purchase of $49. There are no monthly fees, no account required, and no usage limits after purchase. You get free updates and can activate on up to 3 Macs. A free trial of 100 transcriptions is included with no credit card required.

Question 5

How is Doing different from macOS Dictation?

Accepted Answer

macOS Dictation is built into the system but optimized for real-time typing. Doing is optimized for voice-first workflows — hold a hotkey, speak naturally, and the full transcription is pasted at once when you release. Doing also supports AI post-processing (Skills) to clean up, formalize, or transform your text before pasting, and includes custom dictionaries for technical jargon.

Question 6

What apps does Doing work with?

Accepted Answer

Doing works with any macOS app that accepts text input. It's particularly popular with AI tools like Claude, ChatGPT, Cursor, and Claude Code, as well as productivity apps like Slack, Notion, Gmail, and code editors. It operates at the system level — wherever you can type, you can speak.

Question 7

What are Skills in Doing?

Accepted Answer

Skills are AI post-processing prompts that transform your transcription before it's pasted. For example, the Cleanup skill removes filler words, the Formalize skill rewrites casual speech for professional contexts, and you can create custom Skills for any transformation. Skills can run on-device with Apple Intelligence (macOS 26+) or with your own API keys for OpenAI, Claude, or Gemini.

Question 8

What Mac do I need to run Doing?

Accepted Answer

Doing requires macOS 14 (Sonoma) or later and an Apple Silicon Mac (M1 or newer). The Parakeet transcription model downloads once (~480 MB) and runs locally on your Mac's Neural Engine. No GPU or special hardware beyond Apple Silicon is needed.

Talk to Claude,
don't type.

Hold fn Talk. Done.

Your voice never leaves your Mac.

Add visual context, instantly.

Own your tools, stop renting.

Transcription with zero wait.

Level up with YOLO Mode.

Works with every AI tool and app.

Keep your music playing.

Customize with Skills.

Choose your engine with data, not marketing.

I built the voice tool I wanted.

Frequently asked questions

Want the full picture?

Local. Fast. Yours.

Talk to Claude,don't type.