New Gemini 2.5 capabilities
Native audio output and Live API improvements
Today, the Live API is introducing preview versions of audio-visual input and native audio output dialogue, so you can build conversational experiences directly with a more natural and expressive Gemini.
It also lets users steer the model's tone, accent, and style of speaking. For example, you can tell the model to use a dramatic voice when telling a story. It also supports tool use, so it can search on your behalf.
You can try out a set of early features, including:
- Affective dialogue, in which the model detects emotion in the user's voice and responds appropriately.
- Proactive audio, in which the model ignores background conversations and knows when to respond.
- Thinking in the Live API, in which the model uses Gemini's thinking capabilities to support more complex tasks.
We're also releasing new previews of text-to-speech in 2.5 Pro and 2.5 Flash. These have initial support for multiple speakers, enabling text-to-speech with two voices via native audio output.
Like native audio dialogue, text-to-speech is expressive and can capture very subtle nuances, such as whispers. It works in over 24 languages and switches seamlessly between them.
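To make the two-voice idea concrete, here is a minimal sketch of what a two-speaker text-to-speech request body might look like. It only builds a JSON payload and does not call any service. The helper name `build_two_speaker_tts_request`, the speaker labels, and the voice names are illustrative assumptions, and the field names reflect one reading of the public REST request shape, so check them against the current API reference before relying on them.

```python
import json


def build_two_speaker_tts_request(transcript, voices):
    """Build a request body for two-speaker text-to-speech.

    `voices` maps a speaker label used in the transcript to a voice name.
    The field names below are an assumption about the REST request shape,
    not a confirmed schema.
    """
    if len(voices) != 2:
        # The preview described above covers two voices, so enforce that here.
        raise ValueError("expected exactly two speaker-to-voice mappings")

    speaker_configs = [
        {
            "speaker": speaker,
            "voiceConfig": {"prebuiltVoiceConfig": {"voiceName": voice}},
        }
        for speaker, voice in voices.items()
    ]

    return {
        "contents": [{"parts": [{"text": transcript}]}],
        "generationConfig": {
            # Ask for audio rather than text in the response.
            "responseModalities": ["AUDIO"],
            "speechConfig": {
                "multiSpeakerVoiceConfig": {
                    "speakerVoiceConfigs": speaker_configs
                }
            },
        },
    }


if __name__ == "__main__":
    # Hypothetical transcript; style hints such as "whispering" go in the text.
    transcript = (
        "Narrator: (dramatically) It was a dark and stormy night.\n"
        "Guest: (whispering) Did you hear that?"
    )
    # Voice names are placeholders for whatever voices the preview offers.
    request = build_two_speaker_tts_request(
        transcript, {"Narrator": "Kore", "Guest": "Puck"}
    )
    print(json.dumps(request, indent=2))
```

The speaker labels in the transcript must match the keys in the voice mapping, which is how each line of dialogue gets paired with its voice in this sketch.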

