Tag: speech-to-text

Blog
>
Tag: speech-to-text

ASR speech recognition speech-to-text

Enhanced speech recognition model is now available

62% Word Error Rate (WER) improvement for US English

ASR speech-to-text

Hot Summer Speech-to-Text Updates

Following Google’s release of new Speech API, we are happy to announce improved quality of call records transcription.

TTS streaming gemini elevenlabs voice agent

Introducing Gemini 2.0 Flash Live API Client and ElevenLabs Streaming TTS integration

New integrations for Voice AI have arrived: Google's Gemini 2.0 Flash model, featuring seamless voice-to-voice conversation capabilities and ElevenLabs low-latency streaming speech synthesis are now available for Voximplant developers

Voximplant Kit updates. December 2024

In this digest, we will bring you the latest updates to Voximplant Kit. We have added support for outbound WhatsApp messages, Mobile chats, support for ElevenLabs neural voices, and new automated campaign settings.

elevenlabs voice agent voice ai conversational ai

Introducing integration with ElevenLabs Conversational AI

Connect any Voximplant call to ElevenLabs Conversational AI agents

Voximplant Kit updates. April 2025

Check out the latest useful Voximplant Kit updates — we developed chat analytics, improved call history, added new tools for supervisors, expanded scenario capabilities, and updated the softphone. Below is a brief overview of the essential enhancements.

Voximplant Kit updates. January 2025

New Features in Voximplant Kit: Update overview We are constantly working to improve our product to make it easier to use and more effective for you. In this update, we have added several useful features. Here’s what’s new:

Grok Voice Agent API now available in Voximplant

Voximplant now includes a native Grok module that connects any Voximplant call to xAI’s Grok Voice Agent API for real-time, speech-to-speech conversations. With a single VoxEngine scenario, you can interact via audio with Grok over phone numbers, SIP trunks and infrastructure, WhatsApp Business, or WebRTC into Grok — all without building custom media gateways or WebSocket streaming infrastructure.

What Is a Voice AI Orchestration Platform?

Learn how a Voice AI Orchestration Platform connects LLMs, STT/TTS, turn‑taking, and telephony (PSTN, SIP, WebRTC) to build reliable real‑time voice agents. See benefits, architecture, and how Voximplant helps.

Voximplant adds enhanced pipeline options for Voice AI

Voximplant now lets developers build full-cascade voice AI pipelines in VoxEngine without sacrificing turn-taking quality.

voximplant kit podcast voximplant-kit-cc-news product management voximplant-kit-automation-news web sdk webrtc video kit-updates call center ios sdk sip voximplant pstn api

Tag: speech-to-text

Enhanced speech recognition model is now available

Hot Summer Speech-to-Text Updates

Sign Up for a free Voximplant developer account or talk to our experts

Introducing Gemini 2.0 Flash Live API Client and ElevenLabs Streaming TTS integration

Voximplant Kit updates. December 2024

Introducing integration with ElevenLabs Conversational AI

Voximplant Kit updates. April 2025

Voximplant Kit updates. January 2025

Grok Voice Agent API now available in Voximplant

What Is a Voice AI Orchestration Platform?

Voximplant adds enhanced pipeline options for Voice AI

Sign Up for a free Voximplant developer account or talk to our experts

Tag: speech-to-text

Sign Up for a free Voximplant developer account or talk to our experts

Sign Up for a free Voximplant developer account or talk to our experts

Contact Us