AI text cleanup
An optional per-mode step sends your transcript to an AI model to drop filler and format it. A deterministic filler removal runs first, no key needed.
3 min read
Plain transcription gives you exactly what you said, ums included. The optional cleanup step, set per mode, rewrites that raw transcript into finished text: it drops filler, fixes the order, and formats for where the text is going.
How cleanup works
- You speak and wispa transcribes the raw text.
- The transcript is sent to the AI model in your mode, with your instructions for how to style it.
- The cleaned text comes back and is inserted at your cursor.
Filler-word removal
Separately, wispa can strip filler words like um and uh with a deterministic rule that runs before any AI step. It needs no key, works with local models, and saves tokens when you do use cleanup. Turn it on in the Audio settings.
FAQ
Questions and answers
Does cleanup need an internet connection?
Yes, because it uses a cloud AI model with your key. Plain transcription and the deterministic filler removal both work offline with a local model.
Will the AI answer my transcript instead of formatting it?
No. A guardrail marks your transcript as data to format, not a prompt to answer, so questions or commands in your speech are cleaned up rather than acted on.
Can I use cleanup without a big cloud bill?
Yes. The default Claude Haiku is inexpensive, and turning on filler removal first trims the text so the AI step handles fewer tokens.