← All templatestemplates / general-purpose-voice-agent-with-open-source-nvidia-models
Pipecatupdated Jun 1, 2025 · other · other
General Purpose Voice Agent with Open Source NVIDIA Models
A low-latency voice agent implementation using NVIDIA's open source models (Nemotron Speech ASR, Nemotron-3 Nano LLM, and Magpie TTS). Features adaptive TTS streaming, buffered LLM with 100% KV cache reuse, and multiple deployment options including local GPU execution on DGX Spark/RTX 5090 or cloud deployment via Modal and Pipecat Cloud.
telephonyWeb Only
speech-to-textDeepgram Nova-3
llmLlama 3.3 70B
text-to-speechCartesia Sonic-3
No prompt published.
nvidiaopen-sourcelow-latencyself-hostedgpudgx-sparkrtx-5090websocketadaptive-streamingkv-cache
Voice Notes
Voice AI recipes, picks, and analysis.
Get the useful new templates plus the occasional teardown of what’s working in production voice AI.