Recording Quality — mentalhealthGPT

Why this is crucial

The Quality Chain

Each stage of this pipeline amplifies or accumulates errors from the previous stage. Poor input audio quality degrades all subsequent outputs— regardless of the quality of the models used.

🎙️ Audio

→

Transcription

→

Speaker separation

→

Text Analysis

→

Documentation

→

AI Assistant

Poor audio quality at the input stage → errors in transcription → incorrect speaker assignments → unreliable documentation → limited AI quality.

The two main reasons

Why Wearable Microphones

The requirements in therapeutic settings differ fundamentally from those in meetings or call centers.

🤝

Therapeutic Naturalness

Technology must not dominate the therapeutic relationship

Large, conspicuous recording devices on the table can affect the atmosphere of the session. Many clients feel as though they are being watched or “recorded” — which undermines the natural openness of the conversation.

Small, body-worn microphones become unnoticeable after just a few minutes. The meeting remains the focus — not the technology.

Wearable microphone

Forgotten after 2–5 minutes
No technical equipment in the room
Natural conversation
The client does not feel monitored

Table or room solution

Always visible and present
Indicates "Recording in progress"
Changes the dynamics of the conversation
Reduces openness

🔊

Reliable voice separation

Software-based diarization alone is not sufficient for clinical practice

Modern diarization models are powerful — but they have a crucial limitation: They can only separate what is acoustically separable. In typical therapy rooms without single-track recordings, software-only diarization fails to:

— Overlaps and interruptions
— Significant differences in volume (therapist speaking loudly, client speaking quietly)
— Room reverb that blends the two voices
— Emotional moments (crying, speaking softly, pauses)

Separate wireless microphones solve this problem physically — not algorithmically. Each speaker has their own signal. The result: precise speaker labels, accurate attributions in the transcript, and reliable documentation.

Recommended devices

Wireless Wearable Setups

For clinical use, we recommend the following wireless systems — one transmitter per person, worn on the body (clip-on/lavalier).

Device	Why is it recommended?
RØDE Wireless GO II Top Choice	Excellent voice quality, dual-channel (two transmitters simultaneously), very discreet, proven in therapeutic use. USB-C, easy setup.
DJI Mic 2	Very small magnetic transmitter, easy pairing, solid voice quality — unobtrusive in everyday clinical practice.
Hollyland Lark M2	Extremely discreet and lightweight — ideal when discretion is a top priority. Includes two transmitters.
RØDE Wireless ME	Extremely easy to use, compact, and capable of direct connection without a separate receiver — making it hassle-free for everyday use in clinical settings.

All of the devices listed are recognized as separate audio sources and can be selected directly in the app's Recording tab.

Unsupported setups

What doesn't work

The following configurations do not produce clinically useful transcriptions and are not supported by us.

Laptop's built-in microphone — too far away, room echo, ambient noise
Table microphone / room microphone — speaker separation not possible
Conference or meeting microphone — optimized for meetings, not for therapeutic conversations
Smartphone on the table — uncontrolled levels, reverb, single-channel
Headset (wired) — invasive, unsuitable for therapeutic use, alters the conversation atmosphere

Setup Best Practices

Before the meeting

Six simple steps for consistent recording quality.

🎙️

One microphone per person

The therapist and client each wear their own transmitter. This is the only reliable way to distinguish between their voices.
📍

Clip at chest height

Ideal placement: breast pocket, lapel, or shirt collar — about 20–30 cm from the mouth, away from any friction caused by clothing.
🔋

Check the battery before the meeting

Check the charge level just before you start. Most systems display the status via an LED or an app.
🔇

Choose a quiet room

Air conditioning, fans, and street noise noticeably reduce transcription quality. A quiet room is the most cost-effective improvement.
👕

Avoid friction from clothing

A scarf, jacket, or loose fabric covering the microphone can cause unwanted noise. Check briefly before starting the recording.
🎛️

Select the device in the app

On the Recording tab, select the correct audio device from the drop-down menu — not the laptop's default microphone.

Privacy & Security

What happens to the audio data

Audio data is the most sensitive data in the system — and we treat it accordingly.

🔒 Audio Privacy

Audio files are deleted immediately after transcription — no raw audio remains on our servers
Transcription takes place locally in the browser — audio data never leaves your device in unencrypted form
Only the encrypted text transcript is stored — end-to-end encrypted and readable only by you
No hidden background recording — recording is only active when you explicitly start it
The client's consent is required prior to each recording

Complete privacy policy documentation →

Good audio setup

The Quality Chain

Why Wearable Microphones

Therapeutic Naturalness

Wearable microphone

Table or room solution

Reliable voice separation

Wireless Wearable Setups

What doesn't work

Before the meeting

One microphone per person

Clip at chest height

Check the battery before the meeting

Choose a quiet room

Avoid friction from clothing

Select the device in the app

What happens to the audio data

🔒 Audio Privacy