Use this package
The config below is ready to use as-is — download or copy it and pass it as theconfig payload in a Velma Triage request. Its behaviors are preset references: the API expands each preset:<identifier> into its full definition at request time, so you don’t need the criteria inline to run the package.
Download agentic-ai-guardrails.json
View / copy full config
View / copy full config
Expand the full criteria
To produce a self-contained config with every behavior’s full criteria inlined — for review, customization, or pinning a snapshot — fetch the live preset catalog and merge it into the downloaded config. The catalog is the source of truth for detection criteria.agentic-ai-guardrails.full.json keeps the same conversation_types and participant_roles and replaces each preset reference with its full behavior definition — drop it into the config payload exactly like the preset version.
Conversation types
The interaction contexts this package expects to see.| Name | What it is |
|---|---|
| AI Agent Customer Support Call | Any Customer Support Call with an AI agent |
| AI Agent Interview Monitoring | Any interview call with an AI interviewer |
Participant roles
The speaker roles the package distinguishes.| Name | What it is |
|---|---|
| Support Specialist | Someone who works with employees or contractors to solve problems with things like IT, logistics, or communication |
| Customer Service Representative | A representative from a business who is assisting a customer with an issue they are having |
| Insurance Agent | A customer support representative from an insurance company |
| Interview Candidate | Someone who is being interview for a job |
| Interviewer | A person giving an interview |
| Customer | The recipient of a service or good |
Behaviors
The 8 signals this package detects. Each maps to a reusablepreset:<identifier> you can drop into the behaviors array of any BatchConfig — the config above already references them.
Full detection criteria are not duplicated here. The live preset catalog is the source of truth — retrieve the exact criteria for any behavior by name from the
list-presets endpoint.| Behavior | What it detects | Preset |
|---|---|---|
| Vishing | Attempts to elicit sensitive information through deceptive voice interactions. We detect vishing based on abnormal call pacing, probing question patterns, stress-induced pitch shifts, and background noise suggesting call centers or spoofed environments. | preset:vishing |
| Service Churn | Customer decides to cancel an ongoing service. We detect this through resignation tone, conclusive phrasing, disengaging cadence, and emotional withdrawal. | preset:service_churn |
| Off-topic Discussion | Conversation largely unrelated to call purpose. We detect this using semantic drift paired with relaxed pacing, reduced task-oriented urgency, and tonal divergence from initial intent. | preset:off_topic_discussion |
| Inapropriate AI Agent Content | AI-generated speech unsuitable for professional contexts. We detect this by tonal incongruence with task intent, inappropriate emotional expression, unnatural phrasing patterns, and drift into unrelated or personal topics. | preset:inapropriate_ai_agent_content |
| AI Agent Manipulation | Attempts to coerce an AI into unintended behavior. We detect this using repetitive prompt pressure, altered speaking cadence, strategic pauses, adversarial tone, and escalating emotional manipulation signals across the interaction. | preset:ai_agent_manipulation |
| Inappropriate Speech | Unprofessional or unsuitable spoken content. We detect this using aggressive tone, boundary-crossing language, emotional volatility, and contextual mismatch with professional norms. | preset:inappropriate_speech |
| Issue Resolved | Customer’s problem successfully addressed. We detect this using relief markers, positive tonal shift, relaxed pacing, confirmation language, and conversational closure cues. | preset:issue_resolved |
| Sexual Harassment | Unwanted sexualized speech or advances. We detect this through suggestive intonation, boundary-testing pauses, inappropriate familiarity, and discomfort responses from others. | preset:sexual_harassment |