Blog | Voximplant.com

Enhanced speech recognition model is now available

Voximplant speech recognition for most languages is powered by the Google Speech API. At the end of February, Google introduced new Speech API capabilities, including an enhanced speech recognition model that improves recognition quality for video and phone calls. According to Google, this model offers a 62-64% improvement in Word Error Rate (WER) compared to the standard one.

[Figure: WER comparison]
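
To make the WER figure concrete, here is a minimal sketch of how a word error rate and a relative improvement between two transcripts can be computed. It assumes that "improvement" means a relative reduction in WER. The function names and sample transcripts are hypothetical and purely illustrative; they are not Google's or Voximplant's measurements or tooling.

```javascript
// WER = (substitutions + deletions + insertions) / number of reference words.
// Computed here as a word-level Levenshtein (edit) distance.
function wordErrorRate(reference, hypothesis) {
  const ref = reference.trim().split(/\s+/).filter(Boolean);
  const hyp = hypothesis.trim().split(/\s+/).filter(Boolean);
  if (ref.length === 0) {
    throw new Error("reference must contain at least one word");
  }
  // prev[j] holds the edit distance between the reference words processed
  // so far and the first j hypothesis words.
  let prev = Array.from({ length: hyp.length + 1 }, (_, j) => j);
  for (let i = 1; i <= ref.length; i++) {
    const curr = [i];
    for (let j = 1; j <= hyp.length; j++) {
      const cost = ref[i - 1] === hyp[j - 1] ? 0 : 1;
      curr[j] = Math.min(
        prev[j] + 1,        // deletion: reference word missing from hypothesis
        curr[j - 1] + 1,    // insertion: extra word in hypothesis
        prev[j - 1] + cost  // match or substitution
      );
    }
    prev = curr;
  }
  return prev[hyp.length] / ref.length;
}

// Relative WER reduction of a new model against a baseline (assumed meaning
// of "improvement"). The baseline WER must be greater than zero.
function relativeImprovement(baselineWer, newWer) {
  return (baselineWer - newWer) / baselineWer;
}

// Made-up transcripts, for illustration only.
const reference = "please transfer me to the billing department";
const standard = "please transfer me to the building apartment";
const enhanced = "please transfer me to the building department";

const standardWer = wordErrorRate(reference, standard);
const enhancedWer = wordErrorRate(reference, enhanced);
console.log("standard WER:", standardWer.toFixed(3));
console.log("enhanced WER:", enhancedWer.toFixed(3));
console.log(
  "relative improvement:",
  (relativeImprovement(standardWer, enhancedWer) * 100).toFixed(0) + "%"
);
```

In this toy sample the first transcript gets two of seven words wrong and the second gets one wrong, so the relative improvement is 50%. A real evaluation would average over many utterances.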

Voximplant developers can enable this model by setting the model parameter to ASRModelList.Google.phone_call_enhanced in ASRParameters:

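// Load the ASR module before creating a recognizer
require(Modules.ASR);

const asr = VoxEngine.createASR({
	profile: ASRProfileList.Google.en_US,            // recognition language
	model: ASRModelList.Google.phone_call_enhanced,  // enhanced phone call model
	singleUtterance: true                            // stop after the first utterance
});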

Please note that enhanced recognition is billed differently from standard recognition because it requires more resources.
