
OpenAI Client update: gpt-realtime GA alignment
OpenAI has recently announced GA version of their Realtime API that Voximplant now fully supports

OpenAI has recently announced GA version of their Realtime API that Voximplant now fully supports

OpenAI has launched its beta Realtime API, revolutionizing voice assistants with speech-to-speech interactions, ultra-low latency, and realistic voices. Voximplant’s integration makes it easy to connect calls to OpenAI's models, enabling seamless, human-like conversations with minimal setup.

How many times a day do you talk to a computer? We’re not referring to the exasperated exclamation you direct at your laptop when it overheats and crashes. We want you to think about the moments you speak to a device and it actually listens.

Voximplant Kit will soon have a new and improved IVR block with faster and more accurate speech synthesis and recognition. Learn more about the improvements.

62% Word Error Rate (WER) improvement for US English

Following Google’s release of new Speech API, we are happy to announce improved quality of call records transcription.

We are happy to announce the high quality speech recognition for both audio call records transcription and real-time recognition scenarios.

Introducing the Text-to-Speech functionality integrated into VoxEngine.

Unlock the Full Power of Neural Text to Speech Sounds human-like. Power your applications with lifelike speech. Our low latency models are designed to enhance user interactions, making every conversation more engaging and realistic.

Connect any Voximplant call to ElevenLabs Conversational AI agents

Learn how a Voice AI Orchestration Platform connects LLMs, STT/TTS, turn‑taking, and telephony (PSTN, SIP, WebRTC) to build reliable real‑time voice agents. See benefits, architecture, and how Voximplant helps.

Voximplant now includes a native Cartesia module for streaming, low-latency text-to-speech (TTS). You can use a single VoxEngine API to synthesize speech in real time, connect it to any call (PSTN, SIP, WebRTC, WhatsApp) and control playback from a Large Language Model (LLM) or other source, all inside VoxEngine.

Check out the latest useful Voximplant Kit updates — we developed chat analytics, improved call history, added new tools for supervisors, expanded scenario capabilities, and updated the softphone. Below is a brief overview of the essential enhancements.

Boost your food tech app in 2024! Learn 12 in-app content tricks from a study of 5000+ stories. Personalize, gamify, and use cross-channel messaging for user retention.

Chili Piper is popular, but is it the best for you? This article compares it to competitors like Dashly, Calendly, and others, examining features, pricing, and ideal use cases. Discover the right scheduling tool for your team's needs.

New integrations for Voice AI have arrived: Google's Gemini 2.0 Flash model, featuring seamless voice-to-voice conversation capabilities and ElevenLabs low-latency streaming speech synthesis are now available for Voximplant developers