Modelwire
Subscribe

Gemini 3.1 Flash TTS: the next generation of expressive AI speech

Google DeepMind released Gemini 3.1 Flash TTS, an audio model featuring granular audio tags that enable fine-grained control over expressive speech synthesis. The capability allows developers to direct AI-generated audio with unprecedented precision for creative and commercial applications.

MentionsGoogle DeepMind · Gemini 3.1 Flash TTS

Modelwire summarizes — we don’t republish. The full article lives on deepmind.google. If you’re a publisher and want a different summarization policy for your work, see our takedown page.

Related

Gemini Robotics-ER 1.6: Powering real-world robotics tasks through enhanced embodied reasoning

Gemini can now pull from Google Photos to generate personalized images

Gemini can now create personalized AI images by digging around in Google Photos

Gemini 3.1 Flash TTS: the next generation of expressive AI speech · Modelwire