Home
Real-Time Voice Agent Documentation HubΒΆ
Welcome to the Complete Guide
This documentation covers everything you need to deploy, operate, and extend the Azure-based Real-Time Voice Agent with Python 3.11 + FastAPI and enterprise-grade Azure integrations.
Quick StartΒΆ
Start here for basic setup and deployment:
- Getting Started Guide - Installation & basic usage
- Local Development - Development workflow
- Deployment Guide - Azure deployment with azd
Understand the system design:
- Architecture Overview - System architecture
- Data Flows - Redis & Cosmos DB architecture
- ACS Integration - Three-thread voice processing
Deploy and monitor in production:
- Production Deployment - Production checklist
- Monitoring Guide - Application Insights setup
- Troubleshooting - Common issues & solutions
Navigation GuideΒΆ
Guide | Description |
---|---|
Quick Start Guide | Complete setup and basic usage examples |
Local Development | Local development setup and testing |
Configuration Guide | Advanced configuration options |
Deployment Guide | Complete Azure deployment with Terraform/azd |
Architecture Overview | System architecture and design decisions |
Troubleshooting | Common issues and solutions |
By TopicΒΆ
Microsoft Learn Integration
Documentation includes comprehensive Microsoft Learn references with validated links to official Azure documentation, samples, and best practices.
Core System Design
- Architecture Overview - Enterprise Azure infrastructure & logical design
- ACS Flows - Three-thread voice processing architecture
- Data Flows - Redis & Cosmos DB three-tier storage
- Cross-Cloud Integration - Azure/AWS integration patterns
- LLM Orchestration - AI model routing & conversation flows
Production Deployment
- Deployment Guide - Complete Azure deployment with
azd
- Production Checklist - Security, scaling & monitoring
- CI/CD Pipeline - Automated deployment workflows
- Monitoring & Observability - Application Insights integration
- Troubleshooting - Diagnostic guides & solutions
- Load Testing - Performance validation strategies
- Testing Framework - Comprehensive testing approach
Development Resources
- Getting Started - Quick setup & basic usage
- Local Development - Development environment
- Configuration Guide - Environment & service setup
- API Reference - Complete REST & WebSocket API documentation
- Interactive API Docs - OpenAPI specification with testing
Supporting Documentation
- Speech Synthesis - Azure Speech TTS integration
- Speech Recognition - Azure Speech STT capabilities
- Streaming Modes - Audio processing pipelines
- Utilities & Tools - Helper services & infrastructure
- Repository Structure - Codebase organization
- Authentication Guide - Security & session management
Domain-Specific Guides
- Healthcare Solutions - HIPAA-compliant voice applications
- Samples & Examples - Implementation examples & tutorials
Diagram HighlightsΒΆ
- Production reference: Architecture Overview β Production Deployment (image:
assets/RTAudio.v0.png
) - Data lifecycle: Data Flows β Call Lifecycle with interactive Mermaid sequence diagrams
- Contact center routing: ACS Flows featuring step-by-step diagrams and Mermaid flows
- Authentication flows: Authentication Guide detailing OAuth and shared access tokens
Architecture OverviewΒΆ
Enterprise-Grade Voice AI Platform
Built on Azure Communication Services, Azure OpenAI, and Azure Speech Services with FastAPI backend architecture.
Call Automation & Media Streaming] Speech[π£οΈ Azure Speech Services
STT/TTS + Real-time Processing] OpenAI[π§ Azure OpenAI
GPT-4o + Realtime API] Redis[β‘ Azure Cache for Redis
Session State & Coordination] Cosmos[ποΈ Azure Cosmos DB
Conversation History] end subgraph "ποΈ Application Platform" Apps[π¦ Azure Container Apps
FastAPI Backend + React Frontend] Monitor[π Azure Monitor
Application Insights & Tracing] end Phone --> ACS Browser --> ACS Teams --> ACS ACS <--> Speech ACS <--> Apps Speech <--> Apps Apps <--> OpenAI Apps <--> Redis Apps <--> Cosmos Apps --> Monitor classDef communication fill:#e1f5fe,stroke:#01579b,stroke-width:2px,color:#000 classDef azure fill:#fff3e0,stroke:#e65100,stroke-width:2px,color:#000 classDef platform fill:#f3e5f5,stroke:#4a148c,stroke-width:2px,color:#000 class Phone,Browser,Teams communication class ACS,Speech,OpenAI,Redis,Cosmos azure class Apps,Monitor platform
Microsoft Learn Resources
- Azure Communication Services Architecture - Real-time media streaming concepts
- Azure Developer CLI Templates - Deployment automation with
azd up
- Azure Container Apps - Serverless container platform
Next StepsΒΆ
Choose Your Learning Path
Select the path that matches your role and experience level:
New to the project? Start here:
- Getting Started Guide - Complete setup walkthrough
- Architecture Overview - Understand the system design
- Deployment Guide - Deploy with
azd up
in 15 minutes - API Reference - Explore the REST and WebSocket APIs
Infrastructure and operations focus:
- Production Deployment - Enterprise deployment checklist
- Monitoring Setup - Application Insights configuration
- Security Guide - Authentication & session management
- Troubleshooting - Diagnostic playbooks
Integration and customization:
- Local Development - Dev environment setup
- Cross-Cloud Integration - Azure/AWS patterns
- Healthcare Solutions - Domain-specific implementations
- Speech Services - Advanced voice capabilities
Microsoft Learn Learning Paths
Complement this documentation with official Microsoft learning resources:
- Azure Communication Services Learning Path - Comprehensive ACS training
- Azure Developer CLI Fundamentals - Master
azd
deployment workflows - Azure Container Apps - Container orchestration on Azure