| PII/PHI tagging | PII/PHI redaction | |
|---|---|---|
| How to use | pii_phi_tagging=true on any STT endpoint | Dedicated Redaction API |
| Transcript | Sensitive spans wrapped in entity tags | Sensitive spans replaced with entity tags |
| Audio | Original audio unchanged | Sensitive audio ranges silenced |
| Use when | You need the transcript cleaned but keep audio | You need both transcript and audio sanitized |
Batch
Send a complete audio file. The response ismultipart/form-data with two parts: metadata (JSON transcript) and audio (redacted MP3).
Install
requests-toolbelt to decode the multipart response: pip install requests-toolbelt.Expected metadata response
Expected metadata response
[FIRSTNAME], [SSN], [PHI], etc.) correspond directly to the silenced redaction_ranges in the audio.
Streaming (WebSocket)
Connect over WebSocket and receive redacted utterances and silenced MP3 clips as each utterance completes. The stream delivers two message types interleaved:- JSON text frames —
utterancemessages with the redacted transcript text - Binary frames — MP3 clips with the silenced audio for each utterance
Example messages received
Example messages received
API reference
- Redaction Batch — full parameter and response schema
- Redaction Streaming — WebSocket protocol, message format, close codes