← Back to Speechcap
Private by designLast updated 2026-06-01

Offline dictation for Mac that's actually offline.

Speechcap is the only Mac dictation app where both Whisper transcription AND AI cleanup run on your Mac on Pro — your voice never leaves the device. Push-to-talk, 89 languages, from $3/month. Works on a plane, in a SCIF, behind a firewall. Same experience as online.

Download Speechcap· for macOSSee the full app roundup
Your Mac
Capture
Audio captured into memory. Never written to disk. Never uploaded.
On-device
Transcribe
Whisper Large v3 runs locally on Apple Silicon.
On-device
Clean up
On-device LLM (IBM Granite 4 Micro) strips fillers, fixes punctuation, applies context-aware formatting.
On-device
Type
Pasted at your cursor in whichever Mac app was focused.
On-device
Cloud · audio · transcripts · prompts0 bytes sent

What “offline” actually means in 2026

Most Mac dictation apps that advertise “offline mode” in 2026 mean one of two things: either transcription runs on your Mac but the AI cleanup pass routes through a cloud LLM, or the whole product still needs an internet connection but caches gracefully when you're briefly offline. Both are reasonable engineering tradeoffs. Neither is fully offline.

Fully offline means a specific architectural shape: audio is captured into RAM only, transcribed by a model running on your CPU/GPU, processed by an on-device LLM for cleanup, and typed into your editor — all without a single packet leaving the machine. You can turn the Wi-Fi off, fly to another country, or stand inside a Faraday cage, and dictation keeps working with the same quality.

The distinction matters because privacy policies are promises and architecture is a guarantee. A promise can change with a new TOS revision, a data breach, a subpoena, an acquisition, or a quiet misconfiguration. Architecture stays put. For work that touches client privilege, patient data, regulated information, or NDA-bound source code, the architectural question is the one the compliance team actually asks.

Which Mac dictation apps actually run fully offline?

As of the 2026 mid-year landscape, the answer is uncomfortably short:

AppTranscription on-deviceAI cleanup on-deviceTruly offline?
SpeechcapYes (Pro)Yes (Pro) · IBM Granite 4 MicroYes — both stages
SuperwhisperYesNo — typically routes through OpenAIPartial
MacWhisperYes (file transcription)Partial (post-process pass)Partial · different category
Wispr FlowNo — cloudNo — cloudNo
Willow VoiceNo — primarily cloudNo — cloudNo
Apple DictationOptional (lower quality)No AI cleanup at allPartial · accuracy drops

What stays on your Mac on Pro

With Speechcap Pro's on-device engine selected (Settings → Transcription → On-device), the following data classes never reach our servers:

The only network calls Speechcap makes on Pro on-device mode are: license verification (an intermittent ping with your account ID — no content), and optional crash diagnostics if you haven't disabled them. Both are toggleable in Settings.

Who this is for

Offline dictation isn't about being a stereotypical privacy maximalist. It's about specific workflows where the architectural answer is the only acceptable one.

Lawyers and legal professionals

Attorney-client privilege survives only if privileged content isn't shared with third parties. Dictating draft contracts, depositions, or case notes through a cloud dictation app risks waiving privilege depending on jurisdiction and the cloud provider's subprocessor chain. On-device dictation removes that exposure entirely.

Healthcare and medical practice

We're not yet HIPAA-certified — that's a Q3 2026 target for Speechcap Pro. But the architecture (no audio uploaded, no PHI transcripts in our systems) is the foundation HIPAA compliance will eventually attest to. Practitioners who can't wait for the certification often use Speechcap today on the basis that no PHI reaches our servers in the first place.

Journalists and investigative reporters

Source protection means assuming any cloud you touch is subpoena-able. Reporters dictating notes from a sensitive interview, drafts of a story with an anonymous source, or even just internal editorial messages benefit from architecture that doesn't create discoverable artifacts. Offline dictation produces nothing for a future legal demand to discover.

Security-conscious developers and founders

Dictating draft commit messages that mention unannounced features, PR descriptions for an unfiled patent, founder notes for a future fundraise — these are routine and routinely sensitive. On-device dictation lets you keep voice-as-input as a daily tool without growing the surface area of where your unannounced work exists.

The technical architecture, briefly

Two models do the work, both running on your Mac:

Whisper Large v3 for transcription

Speechcap uses OpenAI's open-source Whisper Large v3 model for speech recognition. On M-series Apple Silicon, transcription runs at 4–8× realtime — your 30-second dictation transcribes in roughly 4 seconds, end-to-end. The model is ~1.5 GB on disk, memory-mapped at runtime, unloaded when the app is idle. Accuracy on typical English (including accented variants) lands at 95–98% per our internal testing across the supported language set.

IBM Granite 4 Micro for cleanup

The AI cleanup pass — removing fillers, fixing punctuation, applying context-aware formatting, handling in-flight transforms — runs through a 3-billion-parameter on-device LLM. We ship IBM's Granite 4 Micro because it has the best speed-to-quality tradeoff we've measured for the cleanup task on consumer Apple hardware. The model is ~2 GB, loaded on first use, cached for the session.

Hardware requirements

Both models run comfortably on any Apple Silicon Mac (M1 and newer). On Intel Macs, performance is noticeably slower — usable but not snappy. Memory: 16 GB Macs run both models in parallel without pressure; 8 GB Macs work but you'll notice the swap pressure if you have many other apps open. Storage: ~4 GB for both models combined; downloaded once on first launch.

What you trade by going fully offline

Honest tradeoffs, because no architectural choice is free:

For most workflows the tradeoffs are imperceptible. The privacy upside is permanent.

Pricing

On-device dictation isn't a separate product — it's a setting inside Speechcap Pro. Pro starts at $3/month with PPP-localised pricing in 89 markets ($6/month is the top tier). The free tier (2,000 words/month, cloud transcription) is generous enough to try the workflow before committing. There's also a 14-day Pro trial with no credit card.

Compared to other Mac dictation apps, Speechcap Pro is roughly half the price of Wispr Flow and Willow Voice ($12–15/month) and undercuts Superwhisper's monthly tier ($8.49). The architectural difference (full on-device) plus the pricing difference is the central pitch.

Try Speechcap Pro free for 14 days — no card.
Start the trial

Frequently asked questions

What does "offline dictation" actually mean for a Mac app?

It depends on the app. The honest definition: every stage of the dictation pipeline — audio capture, transcription, AI cleanup, and text injection into your editor — runs on the user's Mac with no cloud round-trip. Some apps run transcription locally but still send the transcript to a cloud LLM for cleanup; that's partially offline, not fully offline. Speechcap Pro runs both stages on your Mac; nothing leaves the device.

Which Mac dictation apps work without an internet connection?

On Pro, Speechcap runs entirely on your Mac — full Whisper transcription and AI cleanup both local. Superwhisper runs transcription locally but their AI cleanup pass typically routes through a cloud LLM (OpenAI). MacWhisper runs file transcription on-device but has limited live-dictation features. Wispr Flow, Willow Voice, and Apple Dictation's cloud mode all require an internet connection. Apple Dictation has an on-device option but at noticeably lower accuracy.

Is on-device dictation as accurate as cloud-based dictation in 2026?

On modern Apple Silicon Macs, yes. Whisper Large v3 running locally on M1 / M2 / M3 / M4 produces 95–98% accuracy on typical English — comparable to cloud transcription. The accuracy gap that justified cloud-only dictation in 2019 has effectively closed. The remaining quality gaps are about cleanup quality (small on-device LLMs vs frontier cloud models), not transcription itself.

How much disk space and RAM does on-device dictation use?

Whisper Large v3 is about 1.5 GB on disk. The on-device cleanup model (Speechcap uses IBM Granite 4 Micro, ~2 GB) adds another ~2 GB. At runtime, models are memory-mapped and use ~3 GB of RAM when actively dictating. Models are unloaded when idle. On an 8 GB Mac that's tight; 16 GB is comfortable; 24 GB or more leaves room for everything else.

Can I dictate on a plane, in a SCIF, or behind a corporate firewall?

Yes — that's the whole point of full-offline mode. Toggle Wi-Fi off, hold your push-to-talk key, speak. Speechcap will transcribe, AI-clean, and type the result into whatever app you're focused on without any network access. The only thing you lose offline is cloud-synced custom vocabulary; the local list still works.

Why does on-device matter if the company has a privacy policy?

Privacy policies are promises. On-device architecture is a guarantee. A privacy-policy promise can be broken by a future policy change, a data breach, a subpoena, a misconfigured server, or an acquiring company with different priorities. On-device processing removes the data from those failure modes by design. For attorney-client privileged work, patient notes, NDA'd source code, executive communications — the architectural answer matters more than the policy answer.

Speechcap Labs · 2026-06-01All Mac dictation apps compared →