Introducing Gemini 2.0 Flash Live API Client and ElevenLabs Streaming TTS integration

2025-03-31 09:37:02

Google has just launched its new Gemini 2.0 Flash model, featuring seamless voice-to-voice conversation capabilities. We wasted no time integrating it into our platform, where it fits squarely within our Conversation Agents Connector initiative.

With Voximplant handling all audio processing and providing a serverless architecture, developers can now connect telephony to Gemini-powered agents in minutes. Just write a simple JavaScript scenario to manage prompts, tool calling, and real-time message handling with Gemini.
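As a rough illustration of the tool-calling part of such a scenario, the sketch below routes a tool call requested by the model to a local handler and builds a response object. Everything in it is an assumption made for illustration: the `toolHandlers` registry, the `get_order_status` tool, the `handleToolCall` function, and the `{ id, name, args }` call shape are hypothetical and are not taken from the Voximplant or Gemini APIs. See the Gemini Live API Client Guide for the actual events and methods.

```javascript
// Hypothetical registry of tools the agent may call. The tool name and its
// canned result are illustrative placeholders, not real API objects.
const toolHandlers = {
  get_order_status: async ({ orderId }) => ({ orderId, status: "shipped" }),
};

// Resolve one tool call requested by the model into a response object.
// Assumed input shape: { id, name, args }.
async function handleToolCall(call) {
  const handler = toolHandlers[call.name];
  if (!handler) {
    // Report unknown tools back to the model instead of throwing.
    return {
      id: call.id,
      name: call.name,
      response: { error: `Unknown tool: ${call.name}` },
    };
  }
  try {
    const result = await handler(call.args || {});
    return { id: call.id, name: call.name, response: { result } };
  } catch (err) {
    // A failing handler also becomes an error payload for the model.
    return {
      id: call.id,
      name: call.name,
      response: { error: String((err && err.message) || err) },
    };
  }
}
```

Returning an error payload rather than throwing lets the conversation continue: the model can apologize or retry instead of the call dropping silently.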

Switching between voice and text output is effortless. And for text-to-speech, our ElevenLabs Streaming integration offers a vast library of high-quality voices for real-time TTS synthesis, giving developers plenty of options.

To experience the Gemini 2.0 Flash Live API Client in action, call 1-888-927-7255 and try our demo.

ElevenLabs has launched its WebSockets API and the ultra-fast Flash v2.5 model, enabling low-latency streaming speech synthesis, which is well suited to giving voice to LLM responses. Because text chunks are streamed in real time, audio generation can begin before the full response is available, which keeps delays barely noticeable.
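The chunking idea can be sketched in a few lines of plain JavaScript. The helper below buffers text deltas as they arrive from an LLM and emits a chunk each time a sentence ends, so synthesis can start on the first sentence while the rest is still being generated. `createSentenceChunker` and its `onChunk` callback are hypothetical names used for illustration only; they are not part of the ElevenLabs or VoxEngine APIs, and a real scenario would forward each chunk to the streaming TTS connection described in the Realtime Speech Synthesis Guide.

```javascript
// Hypothetical helper: buffers LLM text deltas and emits sentence-sized
// chunks that a streaming TTS connection could start synthesizing at once.
function createSentenceChunker(onChunk) {
  let buffer = "";
  return {
    // Feed one text delta as it arrives from the LLM.
    push(delta) {
      buffer += delta;
      // A sentence counts as complete once its closing punctuation is
      // followed by whitespace, so "3.14" is not split mid-number.
      let match;
      while ((match = buffer.match(/^([\s\S]*?[.!?]+)\s+/)) !== null) {
        onChunk(match[1].trim());
        buffer = buffer.slice(match[0].length);
      }
    },
    // Call once the LLM response ends to emit whatever text is left.
    flush() {
      const rest = buffer.trim();
      buffer = "";
      if (rest) onChunk(rest);
    },
  };
}

// Usage: print each chunk; a scenario would send it to the TTS stream.
const demoChunker = createSentenceChunker((chunk) => console.log(chunk));
["Thanks for cal", "ling. Your order ", "has shipped."].forEach((d) =>
  demoChunker.push(d)
);
demoChunker.flush();
```

Splitting on sentence boundaries is one reasonable trade-off: smaller chunks lower latency further, but may give the synthesizer less context for natural intonation.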

You can use it in VoxEngine scenarios, either standalone or alongside the OpenAI or Gemini integrations.

Learn more in our Gemini Live API Client Guide and Realtime Speech Synthesis Guide.

Recommendations

Voximplant now supports Inworld Realtime API

Voximplant now supports Inworld's Realtime API, so you can bring Inworld's expressive, conversation-aware agents into real phone calls, SIP, and WhatsApp without custom media infrastructure.

Grok Voice Agent API now available in Voximplant

Voximplant now includes a native Grok module that connects any Voximplant call to xAI’s Grok Voice Agent API for real-time, speech-to-speech conversations. With a single VoxEngine scenario, you can route audio from phone numbers, SIP trunks and infrastructure, WhatsApp Business, or WebRTC into Grok, all without building custom media gateways or WebSocket streaming infrastructure.

Compliant speech-to-speech voice AI with Realtime API

Use the Realtime API in VoxEngine to build speech-to-speech voice AI with data residency control. Set baseUrl to any OpenAI Realtime-compatible endpoint, including Azure EU deployments, to control where call audio is processed.
