primary_verdict of ai-vocal-music, ai-instrumental, or not-ai-music.
This is distinct from music detection, which classifies audio as music, speech, or neither. AI music detection answers a different question: is this music AI-generated?
| Batch | Streaming | |
|---|---|---|
| Use case | Classify a complete audio file | Real-time per-window classification |
| Protocol | HTTP POST | WebSocket |
| Output | Clip-level verdict plus per-window breakdown | Per-window vocal AI results emitted progressively, final clip-level summary on completion |
| Instrumental AI detection | Included per window and clip-level | Clip-level only, in the final done message |
Authentication
Batch uses theX-API-Key header. Streaming uses an api_key query parameter at connection time. See Authentication and rate limits.
Performance notes
- Per-window results can be less accurate than the clip-level verdict and its confidence. Rely on the clip-level result when judging a whole song or segment.
- Heavily processed or high-production tracks are sometimes mislabeled as AI-generated. This is a known gap targeted by future model updates.