New Gemini 2.5 capabilities
Native audio output and Live API improvements
Today, the Live API is introducing preview versions of audio-visual input and native audio output dialogue, so you can build conversational experiences directly with a more natural and expressive Gemini.
It also lets users steer the model's tone, accent and speaking style. For example, you can instruct the model to use a dramatic voice when telling a story. It also supports tool use, so it can search on your behalf.
You can try out a set of early features, including:
- Affective Dialogue, in which the model detects emotion in the user's voice and responds appropriately.
- Proactive Audio, in which the model ignores background conversations and knows when to respond.
- Thinking in the Live API, in which the model uses Gemini's thinking capabilities to support more complex tasks.
We're also releasing new previews of text-to-speech in 2.5 Pro and 2.5 Flash. These offer early support for multiple speakers, enabling text-to-speech with two voices via native audio output.
Like native audio dialogue, text-to-speech is expressive and can capture very subtle nuances, such as whispers. It works in more than 24 languages and switches seamlessly between them.
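To make the two-voice idea concrete, here is a minimal sketch of what a two-speaker text-to-speech request body might look like. It only builds and prints a JSON payload and sends nothing. The field names (`responseModalities`, `speechConfig`, `multiSpeakerVoiceConfig`, `speakerVoiceConfigs`, `prebuiltVoiceConfig`) reflect our understanding of the Gemini API's REST shape and should be checked against the current reference. The helper `build_two_speaker_tts_request`, the speaker names, and the voice names `Kore` and `Puck` are assumptions for illustration.

```python
import json


def build_two_speaker_tts_request(script, voices):
    """Build a JSON-serializable body for a two-speaker TTS request.

    `voices` maps each speaker name used in the script to a voice name.
    Hypothetical helper: the field names are assumed, not confirmed here.
    """
    if len(voices) != 2:
        # The preview described above is for two voices, so the sketch
        # rejects any other speaker count.
        raise ValueError("this sketch assumes exactly two speakers")

    speaker_configs = [
        {
            "speaker": speaker,
            "voiceConfig": {"prebuiltVoiceConfig": {"voiceName": voice}},
        }
        for speaker, voice in voices.items()
    ]
    return {
        "contents": [{"parts": [{"text": script}]}],
        "generationConfig": {
            # Ask for audio rather than text in the response.
            "responseModalities": ["AUDIO"],
            "speechConfig": {
                "multiSpeakerVoiceConfig": {
                    "speakerVoiceConfigs": speaker_configs
                }
            },
        },
    }


# Style directions (whispering, a dramatic voice) go in the prompt itself.
script = (
    "Read this exchange. Ana whispers; Ben speaks dramatically.\n"
    "Ana: Did you hear that?\n"
    "Ben: It came from the attic!"
)
request = build_two_speaker_tts_request(script, {"Ana": "Kore", "Ben": "Puck"})
print(json.dumps(request, indent=2))
```

The speaker names in the mapping are meant to match the names used in the script, so that each line of dialogue can be assigned to the intended voice.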

