Deepfake Detection

Synthetic voice detection (SVD) returns per-frame deepfake scores for an audio source. Choose batch for files you already have, or streaming for live audio over WebSocket.

	Batch	Streaming
Use case	Deepfake detection on a file	Real-time deepfake detection
Protocol	HTTP POST	WebSocket
Audio formats	AAC, AIFF, FLAC, MOV, MP3, MP4, OGG, Opus, WAV, WebM	Raw PCM and container formats
Synthetic voice scoring	Per-frame	Per-frame

For a side-by-side comparison with the other Modulate capabilities, see Which API should I use?.

Authentication

Batch uses the X-API-Key header. Streaming uses an api_key query parameter at connection time. See Authentication and rate limits.

Speech-to-Text Streaming English

Deepfake Detection Batch

⌘I

​Authentication

Authentication