Skip to main content
POST
/
api
/
velma-2-synthetic-voice-detection-batch
Detect synthetic voice in an audio file
curl --request POST \
  --url https://platform.modulate.ai/api/velma-2-synthetic-voice-detection-batch \
  --header 'Content-Type: multipart/form-data' \
  --header 'X-API-Key: <api-key>' \
  --form upload_file='@example-file'
{
  "filename": "recording.wav",
  "frames": [
    {
      "start_time_ms": 0,
      "end_time_ms": 3000,
      "verdict": "synthetic",
      "confidence": 0.9732
    }
  ],
  "duration_ms": 34500
}

Authorizations

X-API-Key
string
header
required

API key used for authentication and usage tracking.

Body

multipart/form-data
upload_file
file
required

Audio file to analyse. Must be non-empty. Supported formats: .3gp, .3gpp, .aac, .aiff, .amr, .au, .flac, .m4a, .mov, .mp3, .mp4, .ogg, .opus, .wav, .webm, .wma. Maximum file size: 100 MB.

Response

Detection completed successfully.

filename
string | null
required

Original filename of the uploaded audio.

Example:

"recording.wav"

frames
object[]
required

Ordered list of per-frame detection results covering the full audio duration.

duration_ms
integer
required

Total duration of the audio in milliseconds.

Required range: x >= 0
Example:

34500