Only the first 30 seconds of audio are analyzed. Longer files are accepted but the additional audio is ignored. For best results, ensure at least 3–5 seconds of clear speech in the first 30 seconds.
Make a request
Expected response
Expected response
| Field | Type | Description |
|---|---|---|
predicted_language | string | Human-readable language name — suitable for display to end users. |
predicted_language_code | string | Lowercase ISO 639-1 code (e.g. "en", "fr", "zh") — suitable for routing or locale switching. |
confidence | float | Probability for the predicted language, 0.0–1.0. Higher means more certain. |
duration_ms | integer | Total audio duration in milliseconds. Only the first 30 seconds are analyzed regardless of this value. |
Working with confidence scores
Theconfidence field is a probability — values close to 1.0 mean the model is highly certain, values close to 0.0 mean it could not commit to any language. A common pattern is to set a threshold below which you fall back to a default behavior:
confidence.
Supported languages
100 spoken languages are recognized: Afrikaans, Albanian, Amharic, Arabic, Armenian, Assamese, Azerbaijani, Bashkir, Basque, Belarusian, Bengali, Bosnian, Breton, Bulgarian, Cantonese, Catalan, Chinese, Croatian, Czech, Danish, Dutch, English, Estonian, Faroese, Finnish, French, Galician, Georgian, German, Greek, Gujarati, Haitian Creole, Hausa, Hawaiian, Hebrew, Hindi, Hungarian, Icelandic, Indonesian, Italian, Japanese, Javanese, Kannada, Kazakh, Khmer, Korean, Lao, Latin, Latvian, Lingala, Lithuanian, Luxembourgish, Macedonian, Malagasy, Malay, Malayalam, Maltese, Maori, Marathi, Mongolian, Myanmar, Nepali, Norwegian, Nynorsk, Occitan, Pashto, Persian, Polish, Portuguese, Punjabi, Romanian, Russian, Sanskrit, Serbian, Shona, Sindhi, Sinhala, Slovak, Slovenian, Somali, Spanish, Sundanese, Swahili, Swedish, Tagalog, Tajik, Tamil, Tatar, Telugu, Thai, Tibetan, Turkish, Turkmen, Ukrainian, Urdu, Uzbek, Vietnamese, Welsh, Yiddish, Yoruba.API reference
- Language Detection Batch — full parameter and response schema