Build with Velma
These are just examples. If you can describe it, Velma can detect it.
Fraud Prevention
Detect account impersonation, vishing, and social engineering in real time. Monitor agent SOP compliance on the same call.
Trust and Safety
Deploy Velma’s full safety catalog against live audio. Flag harmful content the moment it happens, not after the damage is done.
Full Call Analysis
The whole conversation analyzed. Summary, sentiment, speakers, behaviors, and risks in one response.
Batch or streaming?
| | Velma-2 Batch | Velma-2 Streaming |
|---|---|---|
| Use case | Analyze a complete recording | Analyze a live or in-progress conversation |
| Protocol | HTTP POST | WebSocket |
| Response | Single JSON response | Stream of typed events |
| Best for | Post-call QA, compliance review, offline processing | Live monitoring, real-time alerting, in-call coaching |
What Velma produces
Behavior detection
Per-behavior verdicts with confidence scores and the specific clips that triggered each detection.
Conversation type
Classification of the conversation against the types you define — customer support, sales, interview, and more.
Participant roles
Per-speaker role assignments drawn from the roles you configure, resolved in real time as speakers are identified.
Topics
An aggregated list of the subjects discussed across the full conversation.
Topic sentiment
Per-speaker sentiment scores for each extracted topic, ranging from −1 (negative) to +1 (positive).
Summary
A free-form narrative summary of the conversation, generated at the end of the stream.
How it works
Velma-2 Batch is an HTTP POST endpoint at /api/velma-2-batch. Submit a complete audio file as multipart/form-data: the upload_file field carries the audio (up to 100 MB), and the config field carries a JSON-encoded BatchConfig or the literal string "default" to use Velma’s built-in defaults.
Velma processes the full recording and returns a single JSON response containing all clips, role assignments, behavior detections, topics, topic sentiment scores, and a summary.
Use batch when you have a finished recording and want a complete analysis in one call, such as post-call QA, compliance review, or processing an uploaded file.
How Velma differs from the other endpoints
| | Velma-2 | STT | SVD |
|---|---|---|---|
| Primary output | Conversation intelligence | Transcript | Deepfake scores |
| Behavior detection | ✓ (configurable) | — | — |
| Conversation classification | ✓ | — | — |
| Topic and sentiment analysis | ✓ | — | — |
| Summarization | ✓ | — | — |
| Transcription | ✓ (via STT options) | ✓ | — |
| Batch and streaming | ✓ | ✓ | ✓ |
Velma includes transcription as part of its output via STT options in your configuration. If transcription alone is what you need, the STT endpoints are the right choice.
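To make the batch flow from "How it works" more concrete, here is a minimal Python sketch of the multipart/form-data request. Only the /api/velma-2-batch path, the upload_file and config field names, the "default" literal, and the 100 MB limit come from the text above. The host name, the helper names build_batch_request and analyze_recording, the reading of 100 MB as 100 × 1024 × 1024 bytes, and the absence of authentication headers are all assumptions made for illustration.

```python
import json
import uuid
import urllib.request

# Assumption: placeholder host. Only the /api/velma-2-batch path is documented above.
BASE_URL = "https://velma.example.com"
# Assumption: "100 MB" is read here as 100 * 1024 * 1024 bytes.
MAX_UPLOAD_BYTES = 100 * 1024 * 1024


def build_batch_request(audio: bytes, filename: str, config="default",
                        base_url: str = BASE_URL) -> urllib.request.Request:
    """Build (but do not send) a multipart/form-data POST for Velma-2 Batch."""
    if len(audio) > MAX_UPLOAD_BYTES:
        raise ValueError("audio exceeds the 100 MB upload limit")

    # config is either the literal string "default" or a JSON-encoded BatchConfig.
    config_text = config if isinstance(config, str) else json.dumps(config)

    boundary = uuid.uuid4().hex
    head = (
        f"--{boundary}\r\n"
        'Content-Disposition: form-data; name="config"\r\n\r\n'
        f"{config_text}\r\n"
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="upload_file"; filename="{filename}"\r\n'
        "Content-Type: application/octet-stream\r\n\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")

    return urllib.request.Request(
        url=f"{base_url}/api/velma-2-batch",
        data=head + audio + tail,
        method="POST",
        headers={"Content-Type": f"multipart/form-data; boundary={boundary}"},
    )


def analyze_recording(path: str, config="default") -> dict:
    """Send a finished recording and return the single JSON response as a dict."""
    with open(path, "rb") as f:
        request = build_batch_request(f.read(), path.rsplit("/", 1)[-1], config)
    # Assumption: no authentication is shown; add whatever headers your deployment requires.
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

Calling analyze_recording("call.wav") would submit the file with Velma's built-in defaults; passing a dict as config sends it JSON-encoded instead. Building the request separately from sending it keeps the multipart layout easy to inspect before any audio leaves your machine.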
Next steps
Capabilities
Explore all of Velma’s analysis outputs and configuration options in detail.
Behaviors
Learn how to define the signals you want Velma to detect.