🔥 Google DeepMind launched Gemini 3.1 Flash TTS, a text-to-speech model for precise audio control.
Its "Audio Tags" feature lets users set vocal style, emotion, and pacing via text. Supporting over 70 languages with native SynthID watermarking, the model is available in preview on the Gemini API and Google AI Studio, rolling out to enterprises on Vertex AI, and integrating into Google Vids for consumers.
Source.
aipost 🏴
Its "Audio Tags" feature lets users set vocal style, emotion, and pacing via text. Supporting over 70 languages with native SynthID watermarking, the model is available in preview on the Gemini API and Google AI Studio, rolling out to enterprises on Vertex AI, and integrating into Google Vids for consumers.
Source.
aipost 🏴