Demo Guide

Your Complete Guide to Running the Voice Agent Demo

This guide walks you through setting up and using the ART Voice Agent Accelerator demo—from creating your first demo profile to testing advanced multi-agent conversations.


Choose Your Path

Basic

Time: 10-15 minutes | Goal: Get talking to an AI agent

  1. Create a demo profile
  2. Start a voice conversation
  3. Explore basic agent interactions

Best for: First-time users, quick demos, stakeholder presentations

Intermediate

Time: 30-45 minutes | Goal: Customize and understand the system

Everything in Basic, plus:

  1. Create custom agents with the Agent Builder
  2. Configure tools and handoffs
  3. Test both orchestration modes

Best for: Developers evaluating the platform, solution architects

Advanced

Time: 1-2 hours | Goal: Deep platform mastery

Everything in Intermediate, plus:

  1. VAD tuning and latency optimization
  2. Custom tool development
  3. Multi-agent orchestration patterns
  4. Performance monitoring and telemetry

Best for: Technical leads, platform engineers, production planning


Step 1: Create a Demo Profile

Before you can have a personalized conversation with the AI agents, you need to create a demo profile. This generates synthetic customer data that agents use for context-aware interactions.

Create Your Profile

  1. Open the frontend application at http://localhost:5173 (or your deployed URL)

  2. Click the profile icon (👤) in the bottom navigation bar

  3. Click "Create Demo Profile" to open the profile creation form

  4. Fill in the required fields:

| Field | Description | Example |
| --- | --- | --- |
| Full Name | Your display name for the demo | John Smith |
| Email | Use your real work email if testing MFA features | john.smith@yourcompany.com |
| Phone Number (optional) | E.164 format for SMS demos | +14155551234 |
| Preferred Channel | MFA delivery preference | email or sms |

Important: Use Your Real Email for MFA Testing

If you want to test tools that involve multi-factor authentication (like transaction verification), you must provide your actual email address. The system will send real 6-digit verification codes to this email that you'll need to read back to the agent during the conversation. Email delivery is handled by Email Communication Services, integrated via Azure Communication Services.

  1. Click "Create Profile" — The system generates:
    • A unique client_id for your session
    • Synthetic financial data (account balances, transaction history)
    • Customer intelligence data (communication preferences, relationship history)
    • Demo-safe verification codes

Profile Expiration

Demo profiles automatically expire after 24 hours and are purged from the system. All data is synthetic and safe for demos.
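If you'd rather script profile creation than click through the UI, the frontend drives it through the backend API. The sketch below is a hedged illustration; the endpoint path and payload keys are assumptions, so check the backend's route definitions for the real contract:

```bash
# Hypothetical profile-creation request -- endpoint and field names are
# illustrative; verify the actual route in the backend before relying on it.
curl -X POST http://localhost:8010/api/v1/demo-profiles \
  -H "Content-Type: application/json" \
  -d '{
        "full_name": "John Smith",
        "email": "john.smith@yourcompany.com",
        "phone_number": "+14155551234",
        "preferred_channel": "email"
      }'
```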

What Gets Generated

Your demo profile includes rich context that agents use for personalized interactions:

Sample Demo Profile: john_smith_cfs

Institution: Contoso Financial Services
Relationship Tier: Platinum
Client Since: 2019

| Category | Data |
| --- | --- |
| Current Balance | $542,000 |
| YTD Transaction Volume | $8.2M |
| Risk Tolerance | Moderate |
| Account Health Score | 94 |

Customer Intelligence:

  • Communication Style: Direct/Business-focused
  • Preferred Resolution: Fast, efficient solutions
  • Known Preferences: Quick summaries over detail

Verification Codes (for demo MFA):

  • SSN Last 4: 7823
  • Phone Last 4: 1234
  • Employee ID: 4521

Step 2: Create and Configure Agents (Intermediate)

The Agent Builder lets you create custom AI agents directly from the frontend without editing YAML files.

Open the Agent Builder

  1. Click the robot icon (🤖) in the bottom navigation bar
  2. The Agent Builder panel opens on the left side

Understanding Agent Configuration

| Configuration | Description | Example Values |
| --- | --- | --- |
| Name | Unique identifier for the agent | CustomerSupportAgent |
| Description | What this agent specializes in | Handles billing inquiries |
| System Prompt | Instructions that define agent behavior | You are a helpful billing specialist... |
| Greeting | First message when agent activates | Hello! I'm here to help with billing. |
| Return Greeting | Message when returning to this agent | Welcome back! Where were we? |
| Voice | Neural voice for TTS | en-US-JennyNeural |
| Temperature | Response creativity (0.0-1.0) | 0.7 |
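These settings map onto an agent's YAML definition in the agent store. Below is a minimal sketch; the exact key names are assumptions, so mirror an existing file under apps/artagent/backend/registries/agentstore/ for the authoritative schema:

```yaml
# Hypothetical agent.yaml -- key names are illustrative; copy an existing
# agent definition for the real schema.
name: CustomerSupportAgent
description: Handles billing inquiries
system_prompt: |
  You are a helpful billing specialist...
greeting: "Hello! I'm here to help with billing."
return_greeting: "Welcome back! Where were we?"
voice: en-US-JennyNeural
temperature: 0.7
```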

Configure Tools

Tools give agents capabilities to take actions. Each agent's tools are defined in their agent.yaml file:

```yaml
# Example from concierge/agent.yaml
tools:
  - verify_client_identity
  - get_account_summary
  - handoff_card_recommendation
  - handoff_investment_advisor
  - transfer_call_to_call_center
```
| Tool Category | Examples | Use Case |
| --- | --- | --- |
| Account Tools | get_account_summary, get_recent_transactions | Financial inquiries |
| Identity Tools | verify_client_identity, get_user_profile | Customer verification |
| MFA Tools | send_verification_code, verify_code | Security verification |
| Transfer Tools | transfer_call_to_call_center | Live agent escalation |
| Handoff Tools | handoff_fraud_agent, handoff_concierge | Agent-to-agent routing |

Handoff Tools vs Scenario Routing

While agents declare which handoff tools they can use, the scenario configuration determines the actual routing behavior (discrete vs announced) and validates the agent graph.

Understanding Scenarios

Scenarios define the complete multi-agent orchestration graph. Each scenario specifies:

  1. Which agents are included in the conversation flow
  2. The starting agent for the scenario
  3. Handoff routes — directed edges defining agent-to-agent transfers
  4. Handoff behavior: discrete (silent) or announced (target agent greets)

Scenarios are defined in YAML files under apps/artagent/backend/registries/scenariostore/:

| Scenario | File | Description |
| --- | --- | --- |
| banking | banking/orchestration.yaml | Private banking with card and investment specialists |
| insurance | insurance/scenario.yaml | Insurance claims with auth and fraud agents |
| default | default/scenario.yaml | All agents available, announced handoffs |
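Putting the pieces together, a scenario file declares the agent list, the starting agent, and the handoff edges. The sketch below is illustrative; the agents: and handoffs: keys are referenced elsewhere in this guide, while the remaining key names are assumptions, so compare against banking/orchestration.yaml:

```yaml
# Hypothetical minimal scenario -- agents: and handoffs: match the keys
# referenced in this guide; other key names are illustrative.
name: banking
start_agent: BankingConcierge
agents:
  - BankingConcierge
  - CardRecommendation
  - InvestmentAdvisor
handoffs:
  - from: BankingConcierge
    to: CardRecommendation
    tool: handoff_card_recommendation
    type: discrete
```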

Handoff Configuration (Scenario-Level)

Handoffs are now configured at the scenario level, not per-agent. Each handoff is a directed edge in the agent graph:

```yaml
# Example from banking/orchestration.yaml
handoffs:
  - from: BankingConcierge
    to: CardRecommendation
    tool: handoff_card_recommendation
    type: discrete            # Silent transition
    share_context: true

  - from: CardRecommendation
    to: BankingConcierge
    tool: handoff_concierge
    type: discrete            # Seamless return
```

Handoff Types:

| Type | Behavior | Use Case |
| --- | --- | --- |
| discrete | Silent handoff, conversation continues naturally | Same-team specialists, returns |
| announced | Target agent greets/announces the transfer | Sensitive topics (fraud), new context |
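The YAML examples above are both discrete; an announced edge looks the same with a different type. A sketch, assuming a fraud route like the one described later in this guide:

```yaml
# Hypothetical announced handoff -- the FraudAgent route is illustrative.
handoffs:
  - from: BankingConcierge
    to: FraudAgent
    tool: handoff_fraud_agent
    type: announced           # FraudAgent greets and acknowledges the transfer
    share_context: true
```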

Example: Banking Scenario Graph:

```mermaid
flowchart LR
    subgraph Banking["🏦 Banking Scenario"]
        BC["BankingConcierge"]
        CR["CardRecommendation"]
        IA["InvestmentAdvisor"]
    end
    BC -->|"handoff_card_recommendation"| CR
    BC -->|"handoff_investment_advisor"| IA
    CR <-->|"cross-specialist"| IA
    CR -->|"handoff_concierge"| BC
    IA -->|"handoff_concierge"| BC
```

Selecting a Scenario

Scenarios are loaded based on configuration. To switch scenarios:

  1. Set the SCENARIO_NAME environment variable (e.g., banking, insurance, default)
  2. Or configure via the frontend's scenario selector (if available)

Each scenario defines:

  • Which agents are available
  • The starting agent
  • How handoffs behave between agents
  • Shared template variables (company name, industry, etc.)

Configure VAD (Voice Activity Detection)

VAD settings control how the system detects when you're speaking:

| Setting | Description | Recommended |
| --- | --- | --- |
| Threshold | Sensitivity to speech (0.0-1.0) | 0.5 (balanced) |
| Prefix Padding (ms) | Audio to keep before speech | 300 |
| Silence Duration (ms) | Silence before turn ends | 500 |

VAD Tuning Tips

  • Noisy environment? Increase threshold to 0.6-0.7
  • Quick responses needed? Reduce silence duration to 300-400ms
  • Agent cutting you off? Increase silence duration to 600-800ms
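In Voice Live mode, these three settings correspond to the Realtime API's server-side VAD turn detection. A minimal sketch of that configuration shape, assuming the backend passes the values through unchanged:

```json
{
  "turn_detection": {
    "type": "server_vad",
    "threshold": 0.5,
    "prefix_padding_ms": 300,
    "silence_duration_ms": 500
  }
}
```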

Voice Configuration

Region-Dependent Voices

Available neural voices depend on your Azure Speech Services region. Not all voices are available in all regions. An API-based retrieval of available voices will be added in a future release.

Popular voice options:

| Voice | Style | Best For |
| --- | --- | --- |
| en-US-JennyNeural | Conversational | General customer service |
| en-US-SaraNeural | Friendly | Casual interactions |
| en-US-GuyNeural | Professional | Business contexts |
| en-US-AriaNeural | Expressive | Dynamic conversations |

Step 3: Start a Voice Conversation

Launch the Voice Interface

  1. Click the person icon (👤) in the bottom navigation to access conversation mode
  2. You'll see two orchestration options:

Choose Your Orchestration Mode

The system supports two orchestration modes. Select based on your latency and customization needs:

Voice Live Mode

Audio streams directly to OpenAI's Realtime API:

🎤 Your Voice  →  OpenAI Realtime API  →  🔊 Audio Response

| Metric | Typical Value |
| --- | --- |
| End-to-end latency | ~200-400ms |
| Barge-in handling | Automatic |
| Audio processing | Server-managed VAD |
| Voice options | OpenAI voices |

Best for: Lowest latency requirements, simple demos, when Azure Speech customization isn't needed.


Cascade Mode

Audio flows through Azure Speech services with separate STT and TTS:

🎤 Your Voice  →  Azure STT  →  LLM  →  Azure TTS  →  🔊 Audio

| Metric | Typical Value |
| --- | --- |
| End-to-end latency | ~400-800ms |
| Barge-in handling | Custom VAD |
| Audio processing | Azure Speech SDK |
| Voice options | Azure Neural Voices |

Best for: Custom VAD/segmentation control, Azure Neural Voice selection, phrase list customization, fine-grained audio control.


Switching Modes

Set ACS_STREAMING_MODE in your .env file:

  • VOICE_LIVE — Use OpenAI Realtime API
  • MEDIA — Use Cascade (Azure Speech)
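For example, to flip a local setup from Voice Live to Cascade (a sketch assuming GNU sed; on macOS use sed -i ''):

```bash
# Point the backend at the Cascade pipeline, then restart the backend so the
# new streaming mode is picked up at startup.
sed -i 's/^ACS_STREAMING_MODE=.*/ACS_STREAMING_MODE=MEDIA/' .env
```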

Start Speaking

  1. Allow microphone access when prompted
  2. Wait for the greeting — the active agent introduces itself
  3. Start your conversation!

Step 4: Understanding the Conversation Flow

What to Observe During Conversations

```mermaid
flowchart LR
    subgraph Input["🎤 You Speak"]
        MIC["Microphone"]
    end
    subgraph Processing["⚙️ System Processing"]
        VAD["VAD\nDetection"]
        STT["Speech-to-Text\n(Transcription)"]
        LLM["LLM\nInference"]
        TTS["Text-to-Speech"]
    end
    subgraph Output["🔊 Agent Responds"]
        AUDIO["Audio Output"]
    end
    MIC --> VAD
    VAD --> STT
    STT --> LLM
    LLM --> TTS
    TTS --> AUDIO
    VAD -.->|"Barge-in\nDetection"| TTS
```

Key Metrics to Watch

| Metric | What It Means | Healthy Range |
| --- | --- | --- |
| VAD Latency | Time to detect speech start/stop | < 100ms |
| STT Latency | Time to transcribe your speech | < 500ms |
| LLM TTFT | Time to first token from LLM | < 300ms |
| TTS TTFB | Time to start audio playback | < 200ms |
| Total Turn Time | End-to-end response time | < 1.5s |
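As a sanity check, the healthy upper bounds roughly sum to the turn-time budget: 100 + 500 + 300 + 200 ms ≈ 1.1 s, which leaves about 400 ms of headroom under the 1.5 s total-turn target for network and orchestration overhead.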

Handoff Behavior

When an agent hands off to another agent, behavior depends on the handoff type defined in the scenario:

| Handoff Type | Behavior |
| --- | --- |
| Announced | Target agent greets the customer, acknowledging the transfer |
| Discrete | Silent handoff — conversation continues naturally without explicit transition |

Context is always transferred:

  1. handoff_context — Includes reason, summary, and relevant data
  2. previous_agent — Target agent knows who handed off
  3. share_context: true — Preserves full conversation history (configurable)
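Conceptually, the data handed to the target agent looks like the sketch below. Only the three fields above are documented; everything else here is an illustrative assumption, not the project's actual schema:

```python
# Hypothetical handoff payload -- handoff_context, previous_agent, and
# share_context are documented above; the rest is illustrative.
from dataclasses import dataclass, field

@dataclass
class HandoffPayload:
    previous_agent: str         # target agent knows who handed off
    handoff_context: dict       # reason, summary, and relevant data
    share_context: bool = True  # preserve full conversation history
    history: list = field(default_factory=list)  # filled when share_context

payload = HandoffPayload(
    previous_agent="BankingConcierge",
    handoff_context={
        "reason": "card_recommendation",
        "summary": "Client wants a new card; identity already verified.",
    },
)
```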

Example: Announced Handoff (Fraud)

```mermaid
sequenceDiagram
    participant U as 👤 You
    participant C as 🎧 Concierge
    participant F as 🔒 FraudAgent
    U->>C: "I think someone stole my credit card"
    C->>C: Detects fraud concern
    Note over C,F: type: announced
    C-->>F: Handoff with context
    F->>U: "I'm the fraud specialist. I understand you're concerned about potential unauthorized activity. Let me help you secure your account immediately."
```

Example: Discrete Handoff (Same-team specialist)

```mermaid
sequenceDiagram
    participant U as 👤 You
    participant BC as 🏦 BankingConcierge
    participant CR as 💳 CardRecommendation
    U->>BC: "I want a new credit card"
    BC->>BC: Routes to card specialist
    Note over BC,CR: type: discrete (silent)
    BC-->>CR: Handoff with context
    CR->>U: "I'd be happy to help you find the perfect card. What do you typically spend the most on - travel, dining, or groceries?"
```

Watch for Anomalies

| Anomaly | Possible Cause | Solution |
| --- | --- | --- |
| Agent cuts off mid-sentence | VAD silence too short | Increase silence_duration_ms |
| Long pauses before response | High LLM latency | Check Azure OpenAI quotas |
| Echo or feedback | Microphone picking up speaker | Use headphones |
| Transcription errors | Background noise | Adjust VAD threshold |
| Handoff not triggering | Missing handoff tool | Check agent tool configuration |

Demo Scenarios to Try

Basic Scenarios

| Scenario | What to Say | Expected Behavior |
| --- | --- | --- |
| Account Inquiry | "What's my account balance?" | Agent retrieves synthetic balance |
| Transaction History | "Show me my recent transactions" | Agent lists demo transactions |
| Profile Info | "What's my relationship tier?" | Agent confirms Platinum/Gold tier |

MFA Scenarios (Requires Real Email)

| Scenario | What to Say | Expected Behavior |
| --- | --- | --- |
| Verification Request | "I need to verify a large transaction" | Agent sends code to your email |
| Code Verification | Read the 6-digit code aloud | Agent verifies and proceeds |

Handoff Scenarios

| Scenario | What to Say | Expected Behavior |
| --- | --- | --- |
| Card Interest | "I want a new credit card" | Handoff to CardRecommendation (discrete) |
| Investment Question | "I want to discuss my portfolio" | Handoff to InvestmentAdvisor (discrete) |
| Fraud Concern | "I think my card was stolen" | Handoff to FraudAgent (announced) |
| Return to Concierge | "That's all I needed for investments" | Handoff back to BankingConcierge (discrete) |

Barge-in Testing

| Scenario | What to Do | Expected Behavior |
| --- | --- | --- |
| Interrupt Agent | Start speaking while agent talks | Agent stops, processes your input |
| Quick Follow-up | Speak immediately after agent | Barge-in triggers if within window |

Troubleshooting

Quick Diagnostics

```bash
# Check backend health
curl http://localhost:8010/health

# Check readiness (all dependencies)
curl http://localhost:8010/readiness

# View available agents
curl http://localhost:8010/api/v1/agents

# View available scenarios
ls apps/artagent/backend/registries/scenariostore/

# Check scenario configuration
cat apps/artagent/backend/registries/scenariostore/banking/orchestration.yaml
```

Environment Variables Checklist

Ensure your root .env file contains:

```bash
# ===== Required for Voice =====
AZURE_SPEECH_REGION=eastus          # Your Speech Services region
AZURE_SPEECH_KEY=<your-key>         # Or use managed identity

# ===== Required for AI =====
AZURE_OPENAI_ENDPOINT=https://<your-aoai>.openai.azure.com
AZURE_OPENAI_KEY=<your-key>
AZURE_OPENAI_DEPLOYMENT=gpt-4o      # Your deployment name

# ===== For Demo Profiles =====
AZURE_COSMOS_CONNECTION_STRING=<cosmos-conn-string>
AZURE_COSMOS_DATABASE_NAME=financial_services_db

# ===== For Session State =====
REDIS_HOST=<redis-host>
REDIS_PORT=6380
REDIS_PASSWORD=<redis-password>

# ===== Orchestration Mode =====
ACS_STREAMING_MODE=VOICE_LIVE       # or MEDIA for Cascade

# ===== Dev Tunnel (local dev) =====
BASE_URL=https://<tunnel-url>       # From devtunnel host
```
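A quick way to confirm nothing required is missing: load the .env into the shell and loop over the names. A minimal sketch; trim the list to the features you actually use:

```bash
# Report any required variables that are unset or empty after loading .env.
set -a; source .env; set +a
for var in AZURE_SPEECH_REGION AZURE_SPEECH_KEY \
           AZURE_OPENAI_ENDPOINT AZURE_OPENAI_KEY AZURE_OPENAI_DEPLOYMENT \
           AZURE_COSMOS_CONNECTION_STRING REDIS_HOST ACS_STREAMING_MODE; do
  [ -z "${!var}" ] && echo "MISSING: $var"
done
```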

Common Issues

No audio response from agent

Check:

  1. Microphone permissions granted in browser
  2. AZURE_SPEECH_KEY and AZURE_SPEECH_REGION are set
    • If using RBAC (managed identity) instead of a key, AZURE_SPEECH_REGION and AZURE_SPEECH_RESOURCE_ID are required
  3. Backend is running and healthy (curl http://localhost:8010/health)
  4. WebSocket connection is established (check browser console)

Profile not loading in conversation

Check:

  1. Redis & CosmosDB are running and accessible
  2. Demo profile was created successfully (check for success message)
  3. Session ID matches between frontend and backend
  4. Try creating a new profile

Handoffs not working

Check:

  1. Handoff tool is in the source agent's tool list (agent.yaml)
  2. Target agent exists and is loaded
  3. Scenario includes both agents in the agents: list
  4. Handoff route is defined in scenario's handoffs: section
  5. Check backend logs for handoff events
```bash
# Verify scenario handoff configuration
cat apps/artagent/backend/registries/scenariostore/banking/orchestration.yaml
```

High latency responses

Check:

  1. Azure OpenAI quota not exceeded
  2. Network connectivity to Azure services
  3. Try Voice Live mode for lower latency
  4. Check for cold start (first request is slower)

Transcription errors

Check:

  1. Microphone quality and positioning
  2. Background noise levels
  3. Add domain terms to phrase lists (Cascade mode; see the sketch after this list)
  4. Adjust VAD threshold settings
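Phrase lists bias Azure STT toward domain vocabulary the base model may otherwise mis-hear. A minimal sketch with the Azure Speech SDK for Python (standalone for illustration; in this accelerator the recognizer lives inside the backend's Cascade pipeline):

```python
# Bias speech recognition toward domain terms using a phrase list.
import azure.cognitiveservices.speech as speechsdk

speech_config = speechsdk.SpeechConfig(subscription="<your-key>", region="eastus")
recognizer = speechsdk.SpeechRecognizer(speech_config=speech_config)

phrase_list = speechsdk.PhraseListGrammar.from_recognizer(recognizer)
for term in ("Contoso", "Platinum tier", "relationship tier"):
    phrase_list.addPhrase(term)  # these terms now transcribe more reliably
```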

Log Locations

| Component | Location | What to Look For |
| --- | --- | --- |
| Backend | Terminal running uvicorn | WebSocket events, tool calls, handoffs |
| Frontend | Browser DevTools Console | Connection status, errors |
| Azure | Application Insights | Distributed traces, latency |

Key Configuration Locations

| Component | Location | Purpose |
| --- | --- | --- |
| Agents | apps/artagent/backend/registries/agentstore/ | Agent definitions (agent.yaml + prompt.jinja) |
| Scenarios | apps/artagent/backend/registries/scenariostore/ | Multi-agent orchestration graphs |
| Tools | apps/artagent/backend/registries/toolstore/ | Tool schemas and executors |
| Defaults | apps/artagent/backend/registries/agentstore/_defaults.yaml | Shared agent defaults |