Skip to main content
Velma-2 language detection identifies the spoken language of an audio file and returns a confidence-scored result. It is a pure classification endpoint — no transcription, diarization, or enrichment data is produced.
Batch
Use caseIdentify the spoken language of an audio file
ProtocolHTTP POST
Languages100 spoken languages
Max file size100 MB
Audio analyzedFirst 30 seconds only
OutputISO 639-1 code, display name, confidence score
For a side-by-side comparison with the other Velma-2 capabilities, see Which API should I use?.

Authentication

Uses the X-API-Key header. See Authentication and rate limits.