Music Detection Batch
Classify music and speech in an audio file. Returns frame-level probabilities, a primary label, and percentage breakdowns of content type.
Authorizations
API key used for authentication and usage tracking.
Body
Audio file to analyse. Must be non-empty and of a supported format. Maximum file size: 100 MB.
Response
Detection completed successfully.
Name of the submitted audio file. Empty string if no filename was provided in the upload.
"my_audio.wav"
Total duration of the analysed audio in seconds.
x >= 05.76
Overall classification of the clip:
music- music covers at least as much of the clip as speech, and more than zero.speech- speech covers more of the clip than music, and more than zero.neither- neither music nor speech reached the dominant threshold for any portion of the clip.unknown- no frames could be produced from the audio.
music, speech, neither, unknown "speech"
Percentage of the clip classified as containing music.
0 <= x <= 1000
Percentage of the clip classified as containing speech.
0 <= x <= 10086.7
End-to-end inference time in milliseconds.
x >= 01243.5
Ordered list of per-frame classification results covering the full duration of the clip.