Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.modulate.ai/llms.txt

Use this file to discover all available pages before exploring further.

Velma-2 synthetic voice detection (SVD) returns per-frame deepfake scores for an audio source. Choose batch for files you already have, or streaming for live audio over WebSocket.
BatchStreaming
Use caseDeepfake detection on a fileReal-time deepfake detection
ProtocolHTTP POSTWebSocket
Audio formatsAAC, AIFF, FLAC, MOV, MP3, MP4, OGG, Opus, WAV, WebMRaw PCM and container formats
Synthetic voice scoringPer-framePer-frame
For a side-by-side comparison with the other Velma-2 capabilities, see Which API should I use?.

Authentication

Batch uses the X-API-Key header. Streaming uses an api_key query parameter at connection time. See Authentication and rate limits.