Ultravox Realtime enables you to build voice AI agents that work with your choice of telephony provider, with web or native apps, or in custom ways by using websockets and our native protocol. Built on our best-in-class open-weight model, Ultravox understands speech directly without relying on traditional ASR pipelines.
While the console is great for getting started, Ultravox is fundamentally an API-first platform. You should expect to write code to integrate voice agents into your applications.Our REST API and SDKs give you complete control over:
Dynamic agent configuration
Custom tool integration
Advanced call flows
Production deployments
Integrating voice AI into phone calls, web apps, and native apps
No ASR Pipeline: Unlike traditional, component model voice AI systems, Ultravox understands speech directly. There’s no automatic speech recognition (ASR) stage, making conversations faster and more context-aware. Context matters. We want Ultravox to hear the world as we hear it. This makes Ultravox faster and better at understanding than other systems that rely on ASR and speech to text.
Ultravox is designed as a bring-your-own-telephony platform, giving you complete flexibility in how you connect voice AI to your users (inbound or outbound). Whether you’re using SIP trunking, Twilio, or any other telephony provider, Ultravox seamlessly integrates with your existing infrastructure.Learn More →
Everything your agents do is based on the prompt instructions you give them. While it’s tempting to write verbose prompts, focused instructions yield better results.Remember:
Tool names and descriptions are visible to the model
Complex interactions may need multiple call stages
Less is often more when it comes to instruction clarity
Ultravox includes built-in tools and you can create custom tools. Tools (AKA function calling) give your agents superpowers—from accessing databases to making API calls. They’re versatile, powerful, and straightforward to implement. Whether you’re building customer support bots or sales agents, tools connect your AI to the real world.At their core, tools are functions that agents can invoke to perform actions or retrieve information. Any functionality you can encapsulate in a function can be exposed to your agents as a tool. Addtionally, Ultravox automatically calls the underlying function so you don’t have to sweat gluing things together.Learn More →
Voice AI only works when conversations feel natural and fluid. No awkward pause. No lag. Just smooth back-and-forth dialogue that feels human. Ultravox Realtime doesn’t just meet this standard—it sets it.
Don’t take our word for it. See the numbers yourself for comparisons between Ultravox Realtime and other leading platforms. Our benchmarks tell a clear story: when it comes to real-time voice AI, speed matters, and we deliver.
At just $0.05 per minute, Ultravox Realtime delivers enterprise-grade performance at consumer prices. Why? Because we believe groundbreaking technology should come with groundbreaking pricing. You can pay-as-you-go if you have commitment issues. We also have paid plans that remove all call concurrency caps so you can scale.No hidden fees. Just straightforward rates that make premium voice AI accessible to everyone.