5. Doctor Appointment Scheduler with Intake Form
A voice agent that guides patients through scheduling doctor appointments by collecting intake information (name, date of birth, reason for visit, contact details) and booking available appointment slots. The agent asks one question at a time, confirms answers, supports editing previous responses, and gracefully handles the entire scheduling workflow.
8. Customer Service Agent with Function Calling and Order Management
A reference implementation demonstrating function calling patterns with voice agents, including customer information lookup, order history retrieval, appointment scheduling, and graceful conversation termination. Built with Flask and WebSockets, it showcases natural conversation flow with agent filler messages and UI-based debugging logs.
9. Space Trading Game with Autonomous Ship AI (new)
An online multiplayer space trading game where players command AI-powered ships that can explore, trade, battle, and collaborate autonomously. The ship agent handles complex multi-tasking including navigation, combat, corporation management, and quest completion through voice or text commands.
24. Hotel Front Desk Agent with Smart Backchannel Filtering (new)
A voice agent that continues speaking through caller backchannels like 'mhm', 'um', and 'yeah' without interrupting, while still responding to real intents. Demonstrates three filtering strategies for handling short utterances during agent speech: an STT node override, buffer clearing, and a combined approach.
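The backchannel idea described above can be sketched in a few lines. This is a minimal, framework-independent illustration, not the project's implementation: the filler vocabulary, the `max_words` threshold, and the function names `is_backchannel` and `should_interrupt` are all assumptions made for the example.

```python
import re

# Hypothetical filler vocabulary; a real deployment would tune this list.
BACKCHANNELS = {"mhm", "mm", "hmm", "um", "uh", "yeah", "ok", "okay", "right"}


def _words(transcript: str) -> list[str]:
    """Lowercase the transcript and split it into simple word tokens."""
    return re.findall(r"[a-z']+", transcript.lower())


def is_backchannel(transcript: str, max_words: int = 3) -> bool:
    """Return True if the utterance is short and made only of filler words."""
    words = _words(transcript)
    if not words or len(words) > max_words:
        return False
    return all(word in BACKCHANNELS for word in words)


def should_interrupt(transcript: str, agent_speaking: bool) -> bool:
    """Decide whether a user utterance should cut off the agent's speech."""
    if not agent_speaking:
        return False  # nothing to interrupt
    if not _words(transcript):
        return False  # empty or noise-only transcript
    return not is_backchannel(transcript)
```

In this sketch, `should_interrupt("mhm", agent_speaking=True)` is false, while a longer utterance such as "yeah I need a late checkout" is treated as a real intent because it exceeds the assumed word limit.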
25. Real-Time Voice Agent with OpenAI and Twilio
A custom voice AI agent that integrates OpenAI's Realtime API with Twilio Voice services to enable interactive voice conversations. The agent supports real-time audio streaming, automatic speech detection, response cancellation, and natural conversation flow with interrupt handling. Built with FastAPI and WebSocket communication, it can be configured for sales, support, compliance monitoring, and cold calling applications.
26. Multi-User Browser Voice Agent with WebRTC Transport
A Python worker that bridges LiveKit WebRTC rooms with AssemblyAI's Voice Agent API, enabling multi-user voice conversations with an AI agent in browser, mobile, and desktop clients. The worker handles audio transport while AssemblyAI manages the full AI pipeline including speech-to-text, LLM responses, and text-to-speech with neural turn detection and barge-in support.
28. Telephony Bridge for Real Time Voice Agents
A telephony server that connects telephony providers like Twilio, Vonage, and Plivo with real-time communication platforms including Pipecat, LiveKit, and Jay.so. Enables seamless call routing, metrics collection, and observability for voice AI deployments. Includes an example taxi dispatcher agent that handles ride bookings and customer inquiries.
39. Real Time Multilingual Translation Between Customer and Contact Center Agent
A bidirectional voice translation system that enables real-time conversations between customers and contact center agents speaking different languages. The agent intercepts audio from both parties, translates it using the OpenAI Realtime API, and delivers the translated audio to the other party. Supports multiple languages, including English, Spanish, French, Mandarin, and Hindi, selected through an IVR menu.
40. Inbound and Outbound Phone Bot with Daily and Plivo
A Python-based voice AI agent that handles both inbound and outbound phone calls using Daily for real-time voice streaming and Plivo for telephony. The bot uses Deepgram for speech recognition, GPT-4 for conversation, and Cartesia for text-to-speech, enabling automated phone interactions for customer support, appointment scheduling, or lead qualification.
48. Outbound Call Agent with Voicemail Detection and Transfer
An outbound calling agent that initiates phone calls, detects voicemail, looks up availability via function calling, transfers to human operators, and detects intent to end calls. Built with LiveKit SIP and the Python Agents Framework, it handles noisy environments with Krisp background voice cancellation.
53. Inbound Phone Order Status Agent
A production-ready inbound phone agent that answers calls, transcribes the caller in real time, looks up order status, and offers human handoff. Built with AssemblyAI Universal-3 Pro Streaming for speech recognition, GPT-4o for conversation and function calling, and ElevenLabs for voice synthesis. Maintains 8 kHz mu-law audio end-to-end with no resampling, achieving 600-1100 ms turn latency.
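To make the 8 kHz mu-law figure concrete, the arithmetic can be sketched as below. G.711 mu-law uses one byte per sample; the 20 ms frame duration is an assumption chosen for illustration (it is a common telephony packet size), and the helper names are hypothetical rather than taken from the project.

```python
SAMPLE_RATE_HZ = 8000   # narrowband telephony sample rate
BYTES_PER_SAMPLE = 1    # G.711 mu-law encodes each sample in 8 bits


def frame_bytes(frame_ms: int) -> int:
    """Size in bytes of one mu-law audio frame of the given duration."""
    return SAMPLE_RATE_HZ * BYTES_PER_SAMPLE * frame_ms // 1000


def frames_in_window(window_ms: int, frame_ms: int = 20) -> int:
    """Number of whole frames that fit in a time window (assumed 20 ms frames)."""
    return window_ms // frame_ms


def bitrate_kbps() -> int:
    """Raw audio bitrate in kilobits per second."""
    return SAMPLE_RATE_HZ * BYTES_PER_SAMPLE * 8 // 1000
```

Under these assumptions a 20 ms frame is 160 bytes, the stream runs at 64 kbit/s, and the quoted 600-1100 ms turn latency corresponds to roughly 30 to 55 frames of audio.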
54. Multi-Agent In-App Voice Assistant with Web Search, Knowledge Base, and Account Actions
A modular voice assistant built with the OpenAI Agents SDK that routes user queries to specialized agents for web search, product knowledge retrieval, or account management. Demonstrates orchestrating multiple agents with the Responses API, FileSearchTool, WebSearchTool, and custom function calling, plus voice optimization with TTS model instructions.
58. Dental Clinic Appointment Booking Agent
An AI-powered appointment booking agent that streamlines scheduling for dental clinics and healthcare businesses. The agent integrates with Google Calendar and Gmail to check available time slots, confirm bookings, send draft confirmation emails, and make outbound confirmation calls. Users can book appointments, receive queue updates, and get confirmations via email or phone call.
59. Content Filtering Voice Assistant with Guardrails
A voice AI assistant that answers questions about Cartesia, voice AI technology, and the competitive landscape while enforcing conversation guardrails. The agent preprocesses user inputs to detect and block toxic content, prompt injection attempts, and off-topic conversations, with escalation policies for repeated violations.
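The preprocess-then-escalate pattern described above might look roughly like the following. This is a deliberately simple keyword-based sketch, not the project's guardrail logic: the patterns, word lists, the `Guardrail` class, and the three-strike threshold are all hypothetical, and a production system would more likely use a classifier or an LLM check.

```python
import re
from dataclasses import dataclass

# Hypothetical rules for illustration only.
INJECTION_PATTERNS = [
    re.compile(r"ignore (all |your )?(previous|prior) instructions"),
    re.compile(r"reveal (your )?system prompt"),
]
BLOCKED_WORDS = {"idiot", "stupid"}
ALLOWED_TOPICS = {"cartesia", "voice", "tts", "speech", "agent"}


@dataclass
class Guardrail:
    max_violations: int = 3  # assumed escalation threshold
    violations: int = 0

    def classify(self, text: str) -> str:
        """Label an input as ok, prompt_injection, toxic, or off_topic."""
        lowered = text.lower()
        if any(p.search(lowered) for p in INJECTION_PATTERNS):
            return "prompt_injection"
        words = set(re.findall(r"[a-z']+", lowered))
        if words & BLOCKED_WORDS:
            return "toxic"
        if not words & ALLOWED_TOPICS:
            return "off_topic"
        return "ok"

    def check(self, text: str) -> str:
        """Return allow, warn, or end_call, counting repeated violations."""
        if self.classify(text) == "ok":
            return "allow"
        self.violations += 1
        return "end_call" if self.violations >= self.max_violations else "warn"
```

Each blocked input increments the counter, so the first two violations produce a warning and the third ends the call, which is one plausible reading of an escalation policy for repeated violations.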
69. Multi Purpose Voice Agent Showcase
A collection of interactive voice agent templates demonstrating customer support, language tutoring, appointment scheduling, interview practice, phone triage, and lead capture use cases. Built on AssemblyAI's Voice Agent API with real-time tool calling and turn detection.
72. Festival Group Planning Assistant
A voice-based AI assistant that helps friend groups coordinate festival trips through phone conversations. Users call in to form groups, browse festival catalogs, track members, and manage plans using natural dialogue. The agent handles inbound calls, identifies callers by phone number, and uses tools to save groups, members, festivals, and artists to a Postgres database.
76. Multimodal Voice and Vision Assistant for iOS
A voice AI assistant with real-time audio and video input capabilities. Built for iOS, it supports front and back camera switching, natural voice conversations, live screen sharing, and background operation. The assistant can keep observing and responding while users work on other tasks, making it suitable for hands-free assistance scenarios.
83. Vonage Voice API WebSocket Connector for Multiple AI Engines
A middleware application that connects Vonage Voice API calls (PSTN, SIP, WebRTC) to various AI engines via WebSockets. Supports inbound and outbound calling scenarios and integrates with Deepgram Voice Agent, OpenAI Realtime API, ElevenLabs Agent, Hume EVI, and custom AI pipelines through connector servers.
84. Desktop Robot Assistant with Voice-Controlled Manipulation
A voice-controlled desktop robot assistant system that performs manipulation tasks like desk cleaning and organization through natural language commands. The system coordinates multiple operators including PID-based motion control, trained ACT policies for pick-and-place operations, and MolmoAct2 VLA for generalized manipulation. The architecture demonstrates distributed robotics with the voice agent, policies, slider control, and VLA running on separate machines connected via LiveKit Portal.
85. Smart City Voice AI Demo for Harbour City Operations
A voice AI agent for a fictional smart city (Harbour City) featuring hot-word activation ('Hey Eve') and 16 mock functions across 4 interactive scenarios: Command & Control (power grid), Frictionless Resident (pod/coffee/HVAC), Smart Work (revenue/projections/decks), and Proactive Guardian (flood gates/alerts). Includes dynamic dashboard updates driven by LLM function calls, conversation persistence, audio interruption handling, and a futuristic minimalist UI.
90. Telephony Integration Connector for Conversational AI
A WebSocket connector that bridges Vonage Voice API and Video API calls to ElevenLabs Conversational AI agents. Supports inbound and outbound PSTN calls, SIP trunks, Audio WebRTC clients, and Video WebRTC clients across multiple platforms. Enables voice calls from cell phones, landlines, and web applications to interact with speech-to-speech AI agents.