ai agents Introducing VideoSDK Agent Cloud: Deploy Voice Agents at Production Scale In this launch, VideoSDK introduces Deployments, a purpose-built infrastructure layer for running AI voice agents reliably in production, designed to handle scaling, concurrency, latency, and uptime out of the box.
Product Updates Product Updates - January 2026: Managed Inference, Expanded AI Ecosystem, and Advanced SDK Improvements Kick off 2026 with VideoSDK’s biggest updates yet. January brings VideoSDK-managed inference for AI Agents, expanded AI integrations, new agent evaluation tools, advanced video optimizations, and deeper support for IoT voice experiences.
AI voice agent Announcing VideoSDK Inference: One Magic API for Every Voice AI Model We’re thrilled to announce Inferencing in VideoSDK AI Voice Agents, a unified way to run STT, LLM, TTS, and Realtime models directly inside your voice pipeline, through the Agent Runtime Dashboard and Python Agents SDK, without managing multiple accounts.
ai agents Introducing VideoSDK Phone Numbers: Build AI Call Agents in 60 seconds Today, we’re launching VideoSDK Phone Numbers, a first-party telephony capability that lets you connect AI voice agents directly to the phone network.
ai agents Introducing the Ultravox Realtime Plugin in VideoSDK Learn more about building real-time voice agents with Ultravox and VideoSDK Agents, where listening, reasoning, and speaking happen together for ultra-low latency, natural conversations.
ai agents Introducing xAI Grok Real-Time Speech-to-Speech Plugin for VideoSDK Agents Build real-time voice and text agents with xAI’s Grok, now natively integrated into VideoSDK Agents, for multimodal, context-aware AI experiences.
ai agents Introducing the Nvidia Speech to Text Plugin in VideoSDK Learn how to integrate NVIDIA STT with the VideoSDK Agents SDK to generate fast, accurate, and production-ready transcriptions.
ai agents Introducing the MurfAI Text To Speech Plugin in VideoSDK Learn how to integrate Murf AI Text-to-Speech with VideoSDK Agents to generate natural, expressive, and low-latency voice output for AI agents.
ai agents Introducing the Nvidia Text to Speech Plugin in VideoSDK Learn how to integrate NVIDIA Riva TTS with the VideoSDK Agents SDK to deliver real-time, low-latency speech that makes AI voice agents sound natural, responsive, and production-ready.
plugins Introducing the Gladia Speech to Text Plugin in VideoSDK We’re introducing the Gladia Speech-to-Text plugin for VideoSDK. With multilingual support, instant partial results, and handling of mixed languages, it provides a reliable speech input layer for voice-driven applications.
ai agents Introducing Testing and Evaluation in AI Voice Agents Learn how to run testing and evaluation for AI voice agents using the VideoSDK Agent SDK, including STT, LLM, and TTS benchmarking, latency metrics, and LLM-based response judging.
ai agents How to Build an AI Voice System Using Real-Time Multi-Agent Switching In this blog, you'll learn how to build an AI system with multi-agent switching that intelligently transfers control between specialized agents. Keep conversations natural, tasks organized, and users engaged by letting each agent focus on what it does best.
ai telephony How to enable Voice Mail Detection in AI Voice Agents Learn how Voice Mail Detection improves outbound calling by identifying voicemail systems automatically. Instead of wasting time speaking into silence, your agent can leave a message or end the call smoothly, helping save call time, reduce costs, and improve overall calling efficiency.
Product Updates Featured Product Updates - December 2025: New Billing & Pricing, AI Agents with Graphs & Fallback, and More! Announcing our new transparent billing system and pricing for 2026! This month's update also gives your AI agents a "brain" with Conversational Graphs, makes them unstoppable with Provider Fallback, and adds powerful new video optimization features across all core SDKs.
ANNOUNCEMENT Featured Announcing New Pricing: Free Credits, Simpler Pricing, and a Smarter Billing Dashboard VideoSDK introduces a new Pay-As-You-Go On-Demand model and a redesigned Billing Dashboard featuring a $20 free balance, prepaid wallet, and real-time usage controls.
ai telephony How to Transfer Calls in AI Voice Agents Using SIP Telephony Learn how to automatically transfer an ongoing SIP call to another phone number, without disconnecting or redialing, using Call Transfer in SIP Telephony.
ai telephony How to enable DTMF Events in Telephony AI Agent Learn how DTMF input powers reliable, menu-driven voice interactions. This blog explores common use cases and shows how VideoSDK voice agents process real-time keypad events to drive precise call flows.
preemptive-response How to enable preemptive response in AI Voice Agents Learn how Preemptive Response reduces voice AI latency by streaming partial transcripts to the LLM, enabling faster and more natural conversational agents.
Product Updates Featured Product Updates - November 2025: Agent Runtime, WHIP/WHEP and Realtime Data Store This month, we're changing the game for AI development. Introducing the Agent Runtime, our new no-code/low-code agent builder! Plus, explore the WHIP/WHEP docs, sync data with the new Realtime Store, and check out our full suite of new quickstart guides.
Product Updates Featured Product Updates - October 2025: Supercharged AI Agents, New SDK Features & More This month's update is all about AI! We're unveiling Namo, our powerful in-house turn detection model for truly natural conversations, and a new WhatsApp AI Voice Agent Quickstart. Plus, get the details on major Android video control features and new React monitoring hooks.
ai agents How to handle speech in AI Voice Agents with Namo Turn Detection Model Learn how to make your AI Voice Agents sound natural and interruption-aware using the Namo Turn Detection model, a semantic, transformer-based system that replaces silence timers with true speech understanding.
ai agents How to Build an AI Voice Agent Using the RAG Pipeline and VideoSDK Learn how to build an AI Voice Agent with Retrieval-Augmented Generation (RAG). This guide walks through ingestion, embeddings, retrieval, and real-time voice response with complete code examples.
Namo-Turn-Detection-v1: Semantic Turn Detection for AI Voice Agents Turn-taking, the ability to know exactly when a user has finished speaking, is the invisible force behind natural human conversation. Yet most voice agents today rely on Voice Activity Detection (VAD) or fixed silence timers, leading to premature cut-offs or long, robotic pauses. We introduce state of the art NAMO
Developer Blog Build an AI Phone Agent for Inbound & Outbound SIP Calls Build and deploy an AI phone agent with VideoSDK: a complete step-by-step guide from coding to live phone integration.
Developer Blog Top AI Voice Agent Use Cases Across Industries in 2025 Explore AI voice agent use cases across industries in 2025, showcasing how conversational AI boosts efficiency, customer satisfaction, and business growth.