Skip to main content

Short utterances and the hidden cost of handshakes

For 5–15 second notes, network setup time can outweigh everything else. On‑device avoids the detours.

You say “Thanks.” The network says “Hold on.”

TL;DR

  • For short clips, DNS/TLS and upload setup can be most of the wait.
  • On-device avoids the handshake tax entirely.
  • The “feel” of dictation is dominated by stop-to-text latency.

Cloud flows typically include multiple handshakes (TLS/DNS) and at least one remote hop. For very short phrases, this setup time can dominate. On‑device dictation avoids the detours entirely: your audio stays local, text appears immediately, and there’s nothing to upload.

See the effect in the interactive demo (choose Short and try different networks): /blog/latency-demo

Related: Offline stays fast · Long sessions

FreshnessUpdated Dec 25, 2025

This article is reviewed against current product behavior, macOS guidance, and linked references. If a workflow changed after Dec 25, 2025, check the latest product docs and Apple guidance before relying on older steps or screenshots.

Try Voice Type

Dictate into any Mac text field without waiting on uploads.

Voice Type fits people who want local dictation, custom vocabulary, and a faster stop-to-text loop. The trial is the quickest way to see how it behaves on your own setup.

Freshly reviewed·7-day trial·one-time purchase