Buyer testimonials
Google Cloud customers We’re already utilizing Gemini’s native audio capabilities to drive actual enterprise outcomes, from processing mortgages to calling prospects.
- “Customers usually overlook they’re speaking to an AI inside a minute of utilizing Sidekick, and in some circumstances, they even thank the bot after a protracted chat…New Reside API AI capabilities provided by Gemini [2.5 Flash Native Audio] Empower our sellers to win. ” – David Wurtz, VP of Merchandise, Shopify
- “By integrating the Gemini 2.5 Flash Native Audio mannequin, we’ve considerably enhanced the capabilities of Mia since its launch in Could 2025. This highly effective mixture has enabled us to generate over 14,000 loans for our dealer companions.“ – Jason Bressler, Chief Expertise Officer, United Wholesale Mortgage (UWM)”
- “Working with the Gemini 2.5 Flash native audio mannequin by Vertex AI, Newo.ai AI receptionists can ship unparalleled conversational intelligence. They will establish the primary speaker even in noisy environments, change languages mid-conversation, and sound extremely pure and expressive.” – David Yang, co-founder of Newo.ai
dwell voice translation
Gemini now natively helps a brand new dwell voice-to-speech translation characteristic designed to deal with each steady listening and two-way dialog.
With steady listening, Gemini routinely interprets audio spoken in a number of languages right into a single goal language. This lets you put in your headphones and listen to the world round you in your individual language.
For 2-way conversations, Gemini’s Reside Voice Translator processes translations between two languages in real-time and routinely switches the output language based mostly on who’s talking. For instance, for those who converse English and need to chat with somebody who speaks Hindi, you may hear the English translation in actual time by your headphones, and while you’re executed talking, your cellphone will broadcast the Hindi.
Gemini’s dwell voice translation has a number of necessary options which are helpful in the true world.
- Supported languages: Translate audio in over 70 languages and a couple of,000 language pairs by combining the world data and multilingual capabilities of Gemini fashions with native audio capabilities.
- Fashion switch: Captures the nuances of human speech and preserves the speaker’s intonation, tempo, and pitch to make sure translations sound pure.
- Multilingual enter: Perceive a number of languages concurrently in a single session, so you possibly can comply with multilingual conversations with out having to fiddle with language settings.
- Computerized detection: It identifies the language being spoken and begins the interpretation, so you do not even have to know the language being spoken to start out translating.
- noise immunity: Eliminates surrounding noise and means that you can have a cushty dialog even in noisy out of doors environments.

