
What is Voice SDK?
Voximplant’s Voice SDK helps you provide a high-quality voice calling experience for your staff and customers.

Unity developers can now use the SDK to embed real-time voice and video communication into VR/AR apps and games in minutes, while we take care of the complexity and infrastructure.

If a call is made in non-P2P mode, its media stream goes through our media servers, and we can record it if required.

Yep!, an app for making friends all around the world, is now using Voximplant!

Calls take place right inside a chat session, giving sales managers and customers an instant voice connection.

Boost your food tech app in 2024! Learn 12 in-app content tricks from a study of 5000+ stories. Personalize, gamify, and use cross-channel messaging for user retention.

Discover the future of tech at LEAP 2025 in Riyadh! Join Voximplant as we dive into the latest AI innovations, startup ecosystems, and groundbreaking technologies shaping tomorrow. Don’t miss this chance to network, learn, and transform your business.

New Features in Voximplant Kit: Update overview. We are constantly working to improve our product to make it easier to use and more effective for you. In this update, we have added several useful features. Here’s what’s new:

New integrations for Voice AI have arrived: Google's Gemini 2.0 Flash model, featuring seamless voice-to-voice conversation capabilities, and ElevenLabs low-latency streaming speech synthesis are now available for Voximplant developers.

In this digest, we bring you the latest updates to Voximplant Kit. We have added support for outbound WhatsApp messages, mobile chats, ElevenLabs neural voices, and new automated campaign settings.

Learn how a Voice AI Orchestration Platform connects LLMs, STT/TTS, turn‑taking, and telephony (PSTN, SIP, WebRTC) to build reliable real‑time voice agents. See benefits, architecture, and how Voximplant helps.

Voximplant now includes a native Cartesia module for streaming, low-latency text-to-speech (TTS). You can use a single VoxEngine API to synthesize speech in real time, connect it to any call (PSTN, SIP, WebRTC, WhatsApp) and control playback from a Large Language Model (LLM) or other source, all inside VoxEngine.

Voximplant now includes a native Deepgram module that connects any Voximplant call to Deepgram’s Voice Agent API for real-time, speech‑to‑speech conversations. You can stream audio from phone numbers, SIP trunks, WhatsApp, or WebRTC into Deepgram’s unified agent environment—combining STT, LLM reasoning, and TTS—and play responses via Voximplant’s serverless runtime with minimal latency.