The quality of every transcription, speaker separation, and text analysis depends directly on the audio quality of the recording. That’s why you should use good, unobtrusive, portable microphones for every session.
Each stage of this pipeline amplifies or accumulates errors from the previous stage. Poor input audio quality degrades all subsequent outputs— regardless of the quality of the models used.
Poor audio quality at the input stage → errors in transcription → incorrect speaker assignments → unreliable documentation → limited AI quality.
The requirements in therapeutic settings differ fundamentally from those in meetings or call centers.
Technology must not dominate the therapeutic relationship
Large, conspicuous recording devices on the table can affect the atmosphere of the session. Many clients feel as though they are being watched or “recorded” — which undermines the natural openness of the conversation.
Small, body-worn microphones become unnoticeable after just a few minutes. The meeting remains the focus — not the technology.
Software-based diarization alone is not sufficient for clinical practice
Modern diarization models are powerful — but they have a crucial limitation: They can only separate what is acoustically separable. In typical therapy rooms without single-track recordings, software-only diarization fails to:
Separate wireless microphones solve this problem physically — not algorithmically. Each speaker has their own signal. The result: precise speaker labels, accurate attributions in the transcript, and reliable documentation.
For clinical use, we recommend the following wireless systems — one transmitter per person, worn on the body (clip-on/lavalier).
All of the devices listed are recognized as separate audio sources and can be selected directly in the app's Recording tab.
The following configurations do not produce clinically useful transcriptions and are not supported by us.
Six simple steps for consistent recording quality.
The therapist and client each wear their own transmitter. This is the only reliable way to distinguish between their voices.
Ideal placement: breast pocket, lapel, or shirt collar — about 20–30 cm from the mouth, away from any friction caused by clothing.
Check the charge level just before you start. Most systems display the status via an LED or an app.
Air conditioning, fans, and street noise noticeably reduce transcription quality. A quiet room is the most cost-effective improvement.
A scarf, jacket, or loose fabric covering the microphone can cause unwanted noise. Check briefly before starting the recording.
On the Recording tab, select the correct audio device from the drop-down menu — not the laptop's default microphone.
Audio data is the most sensitive data in the system — and we treat it accordingly.