Built-in filtering to focus on the primary speaker in multi-speaker environments.
Ultravox Realtime provides built-in support for handling background speakers and multi-speaker environments. The system is designed to focus on the primary speaker while filtering out cross-talk and unwanted voice interactions.
Automatic FilteringBackground speaker filtering is enabled by default for all Ultravox calls. This helps your AI agent focus on the intended speaker even in challenging multi-speaker scenarios.
Ultravox employs sophisticated techniques to handle these challenging scenarios:Model Training → The Ultravox model distinguishes between speech and noise/unintelligible speech.Speaker Tracking → Advanced algorithms analyze voical power levels and patterns to identify the primary speaker and filter out background voices.Real-time Processing → All speaker detection and filtering happens in real-time without adding latency to the conversation.The result is cleaner voice interactions where your AI agent responds to the intended speaker, reducing confusion and improving conversation quality in complex acoustic environments.