Cleaner input, cleaner transcripts: audio conditioning for accuracy - Voice Type blog Skip to main content Voice Type Pricing Learn Enterprise Trust Blog Blog Cleaner input, cleaner transcripts: audio conditioning for accuracy Normalized loudness and gentle filtering help the recognizer hear what you meant, not the room. ← Back to Blog | Home 30 Sept 2025 If the input is messy, the output will be too. Voice Type normalizes loudness to a consistent target and applies a light high pass filter to reduce low frequency rumble. Combined with noise aware voice activity detection, this gives the model input closer to the audio it was trained on which leads to fewer garbles and better punctuation. We avoid heavy “prompt fixes” that can make transcripts look confident but less faithful. Instead, we improve the signal before recognition. Previous Punctuation that sticks: what we fixed and how to get best results Next Long sessions: uploads vs 30‑second windowing Related articles Product Punctuation that sticks: what we fixed and how to get best results A straight look at punctuation and spacing. What we shipped to make it better, how to test it, and a few habits that help on macOS. Performance Long sessions: uploads vs 30‑second windowing Why streaming on‑device and finalizing only the last ~30 seconds keeps long dictations responsive. Voice Type Learn All guides Voice Type vs Apple Dictation Dragon alternatives For writers For developers Notion on Mac Latency demo Press kit Company Enterprise Trust Center Pricing Blog Company Terms of service Privacy policy Contact us © 2025 Careless Whisper Inc.