

Real-Time Voice Agent Documentation Hub

Welcome to the Complete Guide

This documentation covers how to deploy, operate, and extend the Azure-based Real-Time Voice Agent, which is built on Python 3.11 and FastAPI with enterprise-grade Azure integrations.

Quick Start

Start here for basic setup and deployment:

  1. Getting Started Guide - Installation & basic usage
  2. Local Development - Development workflow
  3. Deployment Guide - Azure deployment with azd

Understand the system design:

  1. Architecture Overview - System architecture
  2. Data Flows - Redis & Cosmos DB architecture
  3. ACS Integration - Three-thread voice processing

Deploy and monitor in production:

  1. Production Deployment - Production checklist
  2. Monitoring Guide - Application Insights setup
  3. Troubleshooting - Common issues & solutions

Guide                   Description
Quick Start Guide       Complete setup and basic usage examples
Local Development       Local development setup and testing
Configuration Guide     Advanced configuration options
Deployment Guide        Complete Azure deployment with Terraform/azd
Architecture Overview   System architecture and design decisions
Troubleshooting         Common issues and solutions

By Topic

Microsoft Learn Integration

The documentation includes Microsoft Learn references, with validated links to official Azure documentation, samples, and best practices.

Core System Design

Production Deployment

Development Resources

Supporting Documentation

Domain-Specific Guides

Diagram Highlights

Architecture Overview

Enterprise-Grade Voice AI Platform

Built on Azure Communication Services, Azure OpenAI, and Azure Speech Services, with a FastAPI backend.

graph TB
    subgraph "📞 Communication Layer"
        Phone[📱 Phone/PSTN]
        Browser[🌐 Web Browser]
        Teams[👥 MS Teams]
    end

    subgraph "⚡ Azure Services"
        ACS[🔗 Azure Communication Services<br/>Call Automation & Media Streaming]
        Speech[🗣️ Azure Speech Services<br/>STT/TTS + Real-time Processing]
        OpenAI[🧠 Azure OpenAI<br/>GPT-4o + Realtime API]
        Redis[⚡ Azure Cache for Redis<br/>Session State & Coordination]
        Cosmos[🗄️ Azure Cosmos DB<br/>Conversation History]
    end

    subgraph "🏗️ Application Platform"
        Apps[📦 Azure Container Apps<br/>FastAPI Backend + React Frontend]
        Monitor[📊 Azure Monitor<br/>Application Insights & Tracing]
    end

    Phone --> ACS
    Browser --> ACS
    Teams --> ACS
    ACS <--> Speech
    ACS <--> Apps
    Speech <--> Apps
    Apps <--> OpenAI
    Apps <--> Redis
    Apps <--> Cosmos
    Apps --> Monitor

    classDef communication fill:#e1f5fe,stroke:#01579b,stroke-width:2px,color:#000
    classDef azure fill:#fff3e0,stroke:#e65100,stroke-width:2px,color:#000
    classDef platform fill:#f3e5f5,stroke:#4a148c,stroke-width:2px,color:#000

    class Phone,Browser,Teams communication
    class ACS,Speech,OpenAI,Redis,Cosmos azure
    class Apps,Monitor platform
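
The diagram assigns session state and coordination to Azure Cache for Redis and conversation history to Azure Cosmos DB. The Python sketch below illustrates that split using in-memory stand-ins only. It does not call any Redis or Cosmos DB SDK, and the names `SessionStore`, `HistoryStore`, `handle_turn`, the `turn_count` field, and the 300-second TTL are all hypothetical choices for illustration, not taken from the project.

```python
from __future__ import annotations

import time
from dataclasses import dataclass, field


@dataclass
class SessionStore:
    """In-memory stand-in for the Redis role: short-lived per-call state with a TTL."""

    ttl_seconds: float = 300.0  # assumed value, not from the project
    _data: dict[str, tuple[float, dict]] = field(default_factory=dict)

    def put(self, call_id: str, state: dict, now: float | None = None) -> None:
        now = time.monotonic() if now is None else now
        # Store a copy alongside its expiry time.
        self._data[call_id] = (now + self.ttl_seconds, dict(state))

    def get(self, call_id: str, now: float | None = None) -> dict | None:
        now = time.monotonic() if now is None else now
        entry = self._data.get(call_id)
        if entry is None:
            return None
        expires_at, state = entry
        if now >= expires_at:
            # Expired state is dropped, much as a cache TTL would do.
            del self._data[call_id]
            return None
        return state


@dataclass
class HistoryStore:
    """In-memory stand-in for the Cosmos DB role: append-only conversation turns."""

    _turns: dict[str, list[dict]] = field(default_factory=dict)

    def append(self, call_id: str, role: str, text: str) -> None:
        self._turns.setdefault(call_id, []).append({"role": role, "text": text})

    def transcript(self, call_id: str) -> list[dict]:
        return list(self._turns.get(call_id, []))


def handle_turn(
    sessions: SessionStore,
    history: HistoryStore,
    call_id: str,
    user_text: str,
    reply_text: str,
) -> dict:
    """Record one exchange: refresh the live session state, then persist both turns."""
    state = sessions.get(call_id) or {"turn_count": 0}
    state["turn_count"] += 1
    sessions.put(call_id, state)  # re-putting also renews the TTL
    history.append(call_id, "user", user_text)
    history.append(call_id, "assistant", reply_text)
    return state
```

In this sketch the session entry disappears once its TTL lapses, while the transcript stays available, which mirrors the short-lived versus durable roles the diagram gives the two services.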

Microsoft Learn Resources

Next Steps

Choose Your Learning Path

Select the path that matches your role and experience level:

New to the project? Start here:

  1. Getting Started Guide - Complete setup walkthrough
  2. Architecture Overview - Understand the system design
  3. Deployment Guide - Deploy with azd up in 15 minutes
  4. API Reference - Explore the REST and WebSocket APIs

Infrastructure and operations focus:

  1. Production Deployment - Enterprise deployment checklist
  2. Monitoring Setup - Application Insights configuration
  3. Security Guide - Authentication & session management
  4. Troubleshooting - Diagnostic playbooks

Integration and customization:

  1. Local Development - Dev environment setup
  2. Cross-Cloud Integration - Azure/AWS patterns
  3. Healthcare Solutions - Domain-specific implementations
  4. Speech Services - Advanced voice capabilities

Microsoft Learn Learning Paths

Complement this documentation with official Microsoft learning resources: