Velma-2 synthetic voice detection (SVD) returns per-frame deepfake scores for an audio source. Choose batch for files you already have, or streaming for live audio over WebSocket.Documentation Index
Fetch the complete documentation index at: https://docs.modulate.ai/llms.txt
Use this file to discover all available pages before exploring further.
| Batch | Streaming | |
|---|---|---|
| Use case | Deepfake detection on a file | Real-time deepfake detection |
| Protocol | HTTP POST | WebSocket |
| Audio formats | AAC, AIFF, FLAC, MOV, MP3, MP4, OGG, Opus, WAV, WebM | Raw PCM and container formats |
| Synthetic voice scoring | Per-frame | Per-frame |
Authentication
Batch uses theX-API-Key header. Streaming uses an api_key query parameter at connection time. See Authentication and rate limits.