Deepfake Detection Batch

POST

api

velma-2-synthetic-voice-detection-batch

Detect synthetic voice in an audio file

curl --request POST \
  --url https://platform.modulate.ai/api/velma-2-synthetic-voice-detection-batch \
  --header 'Content-Type: multipart/form-data' \
  --header 'X-API-Key: <api-key>' \
  --form upload_file='@example-file'

{
  "filename": "recording.wav",
  "frames": [
    {
      "start_time_ms": 0,
      "end_time_ms": 3000,
      "verdict": "synthetic",
      "confidence": 0.9732
    }
  ],
  "duration_ms": 34500
}

Authorizations

X-API-Key

string

header

required

API key used for authentication and usage tracking.

Body

multipart/form-data

upload_file

file

required

Audio file to analyse. Must be non-empty. Supported formats: .3gp, .3gpp, .aac, .aiff, .amr, .au, .flac, .m4a, .mov, .mp3, .mp4, .ogg, .opus, .wav, .webm, .wma. Maximum file size: 100 MB.

Response

Detection completed successfully.

filename

string | null

required

Original filename of the uploaded audio.

Example:

"recording.wav"

frames

object[]

required

Ordered list of per-frame detection results covering the full audio duration.

Show child attributes

duration_ms

integer

required

Total duration of the audio in milliseconds.

Required range: x >= 0

Example:

34500

Deepfake Detection

Deepfake Detection Streaming

⌘I

Detect synthetic voice in an audio file

curl --request POST \
  --url https://platform.modulate.ai/api/velma-2-synthetic-voice-detection-batch \
  --header 'Content-Type: multipart/form-data' \
  --header 'X-API-Key: <api-key>' \
  --form upload_file='@example-file'

{
  "filename": "recording.wav",
  "frames": [
    {
      "start_time_ms": 0,
      "end_time_ms": 3000,
      "verdict": "synthetic",
      "confidence": 0.9732
    }
  ],
  "duration_ms": 34500
}