POST /api/velma-2-language-detection-batch
Identify the spoken language of an audio file
curl --request POST \
  --url https://modulate-developer-apis.com/api/velma-2-language-detection-batch \
  --header 'Content-Type: multipart/form-data' \
  --header 'X-API-Key: <api-key>' \
  --form upload_file='@example-file'
{
  "predicted_language": "English",
  "predicted_language_code": "en",
  "confidence": 0.9847,
  "duration_ms": 14253
}
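
The sample response above could be consumed along these lines. This is a minimal sketch, not an official client: the `LanguageDetection` dataclass and `parse_detection` helper are hypothetical names introduced for illustration, and the range checks simply mirror the ranges documented for `confidence` and `duration_ms`.

```python
import json
from dataclasses import dataclass


@dataclass(frozen=True)
class LanguageDetection:
    """Hypothetical container for the four documented response fields."""
    predicted_language: str
    predicted_language_code: str
    confidence: float
    duration_ms: int


def parse_detection(payload: str) -> LanguageDetection:
    """Parse a JSON response body and check the documented value ranges."""
    data = json.loads(payload)
    # All four fields are documented as required, so a missing key raises KeyError.
    result = LanguageDetection(
        predicted_language=str(data["predicted_language"]),
        predicted_language_code=str(data["predicted_language_code"]),
        confidence=float(data["confidence"]),
        duration_ms=int(data["duration_ms"]),
    )
    if not 0.0 <= result.confidence <= 1.0:
        raise ValueError(f"confidence out of range: {result.confidence}")
    if result.duration_ms < 0:
        raise ValueError(f"negative duration_ms: {result.duration_ms}")
    return result


# The sample response shown above, as a JSON string.
sample = (
    '{"predicted_language": "English", "predicted_language_code": "en", '
    '"confidence": 0.9847, "duration_ms": 14253}'
)
detection = parse_detection(sample)
print(detection.predicted_language_code, detection.confidence)
```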

Authorizations

X-API-Key
string
header
required

API key used for authentication and usage tracking.

Body

multipart/form-data
upload_file
file
required

Audio file to analyze. Must be non-empty and of a supported format. Maximum file size: 100 MB.
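
As a rough illustration of the upload constraints, the sketch below checks a file client-side and builds the multipart request using only the Python standard library. The `build_request` helper is a hypothetical name, the part's `application/octet-stream` content type is an assumption, and "100 MB" is interpreted conservatively as 100,000,000 bytes because the source does not say whether it means MB or MiB. Supported formats are not listed in this page, so no format check is attempted.

```python
import urllib.request
import uuid

API_URL = "https://modulate-developer-apis.com/api/velma-2-language-detection-batch"
# Assumption: "100 MB" is read as 100,000,000 bytes (the stricter interpretation).
MAX_UPLOAD_BYTES = 100_000_000


def build_request(filename: str, audio: bytes, api_key: str) -> urllib.request.Request:
    """Validate the audio bytes and wrap them in a multipart/form-data POST."""
    if not audio:
        raise ValueError("audio file is empty")
    if len(audio) > MAX_UPLOAD_BYTES:
        raise ValueError("audio file exceeds the 100 MB limit")
    if any(ch in filename for ch in '"\r\n'):
        raise ValueError("filename must not contain quotes or line breaks")

    boundary = uuid.uuid4().hex
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="upload_file"; filename="{filename}"\r\n'
        "Content-Type: application/octet-stream\r\n"
        "\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")

    return urllib.request.Request(
        API_URL,
        data=head + audio + tail,
        method="POST",
        headers={
            "Content-Type": f"multipart/form-data; boundary={boundary}",
            "X-API-Key": api_key,
        },
    )


# Sending the request is left commented out so the sketch has no network side effects:
# with open("example-file", "rb") as f:
#     request = build_request("example-file", f.read(), "<api-key>")
# with urllib.request.urlopen(request) as response:
#     body = response.read().decode("utf-8")
```

Building the body by hand avoids a third-party dependency; an HTTP client library with built-in multipart support would serve equally well.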

Response

Language detection completed successfully.

predicted_language
string
required

Human-readable language name (e.g. "English", "French", "Mandarin"). Suitable for display to end users.

Example:

"English"

predicted_language_code
string
required

Lowercase ISO 639-1 language code (e.g. "en", "fr", "zh"). Suitable for routing, locale switching, or use as the language subtag of a BCP-47 tag.

Example:

"en"

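One way the language code might be turned into a BCP-47 tag is sketched below. The `to_bcp47` helper is hypothetical, and the region is supplied by the caller because the API returns only a language code. For simplicity the sketch accepts two-letter regions only, although BCP-47 also permits three-digit region subtags.

```python
import re
from typing import Optional

_LANGUAGE_RE = re.compile(r"[a-z]{2}")     # lowercase ISO 639-1 code, as documented
_REGION_RE = re.compile(r"[A-Za-z]{2}")    # two-letter region (simplifying assumption)


def to_bcp47(language_code: str, region: Optional[str] = None) -> str:
    """Combine a detected language code with an optional region subtag."""
    if not _LANGUAGE_RE.fullmatch(language_code):
        raise ValueError(f"not a lowercase two-letter code: {language_code!r}")
    if region is None:
        # A bare language subtag is already a valid BCP-47 tag.
        return language_code
    if not _REGION_RE.fullmatch(region):
        raise ValueError(f"not a two-letter region: {region!r}")
    # By convention, region subtags are written in uppercase.
    return f"{language_code}-{region.upper()}"


print(to_bcp47("en"))
print(to_bcp47("fr", "ca"))
```
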
confidence
number<double>
required

Probability assigned to the predicted language, in the range 0.0 to 1.0 inclusive. Higher values indicate greater confidence.

Required range: 0 <= x <= 1
Example:

0.9847
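
A caller might gate downstream behavior on this value. The sketch below is only an illustration: the `accept_language` helper is hypothetical, and the 0.80 default threshold is an arbitrary assumption rather than a value recommended by the API.

```python
from typing import Optional

# Assumption: 0.80 is an illustrative cutoff, not a documented recommendation.
DEFAULT_MIN_CONFIDENCE = 0.80


def accept_language(
    code: str,
    confidence: float,
    min_confidence: float = DEFAULT_MIN_CONFIDENCE,
    fallback: Optional[str] = None,
) -> Optional[str]:
    """Return the detected code if confidence meets the threshold, else the fallback."""
    if not 0.0 <= confidence <= 1.0:
        raise ValueError(f"confidence out of range: {confidence}")
    return code if confidence >= min_confidence else fallback


print(accept_language("en", 0.9847))
print(accept_language("fr", 0.42, fallback="en"))
```

Tuning the threshold against real traffic is advisable, since the right cutoff depends on how costly a wrong language choice is for the application.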

duration_ms
integer
required

Total duration of the decoded audio in milliseconds. Only the first 30 seconds of audio are analyzed, regardless of this value.

Required range: x >= 0
Example:

14253
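
Because only the first 30 seconds are analyzed, a client may want to know how much of a file the prediction actually covers. The following is a small sketch under that stated limit; `analyzed_ms` and `analyzed_fraction` are hypothetical helper names.

```python
ANALYSIS_WINDOW_MS = 30_000  # the documented 30-second analysis limit


def analyzed_ms(duration_ms: int) -> int:
    """Milliseconds of audio covered by the analysis."""
    if duration_ms < 0:
        raise ValueError(f"negative duration_ms: {duration_ms}")
    return min(duration_ms, ANALYSIS_WINDOW_MS)


def analyzed_fraction(duration_ms: int) -> float:
    """Share of the file covered by the analysis (0.0 for zero-length audio)."""
    if duration_ms == 0:
        return 0.0
    return analyzed_ms(duration_ms) / duration_ms


print(analyzed_ms(14253))
print(analyzed_fraction(60_000))
```

A low fraction suggests the prediction reflects only the opening of a long recording, which matters if the speaker or language changes later in the file.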