Modes and cleanup

AI text cleanup

An optional per-mode step sends your transcript to an AI model to drop filler and format it. A deterministic filler removal runs first, no key needed.

3 min read

Plain transcription gives you exactly what you said, ums included. The optional cleanup step, set per mode, rewrites that raw transcript into finished text: it drops filler, fixes the order, and formats for where the text is going.

How cleanup works

You speak and wispa transcribes the raw text.
The transcript is sent to the AI model in your mode, with your instructions for how to style it.
The cleaned text comes back and is inserted at your cursor.

Filler-word removal

Separately, wispa can strip filler words like um and uh with a deterministic rule that runs before any AI step. It needs no key, works with local models, and saves tokens when you do use cleanup. Turn it on in the Audio settings.

FAQ

Questions and answers

Does cleanup need an internet connection?

Yes, because it uses a cloud AI model with your key. Plain transcription and the deterministic filler removal both work offline with a local model.

Will the AI answer my transcript instead of formatting it?

No. A guardrail marks your transcript as data to format, not a prompt to answer, so questions or commands in your speech are cleaned up rather than acted on.

Can I use cleanup without a big cloud bill?

Yes. The default Claude Haiku is inexpensive, and turning on filler removal first trims the text so the AI step handles fewer tokens.

AI text cleanup

How cleanup works

Filler-word removal

Questions and answers

Does cleanup need an internet connection?

Will the AI answer my transcript instead of formatting it?

Can I use cleanup without a big cloud bill?

Related articles

Dictation modes

Setting up a cloud key

Supported providers

Start dictating in minutes