How Nano Banana Pro helps you bring any idea or design to life

Nano Banana Pro can help you visualize any idea and design anything, from prototypes, to representing data as infographics, to turning handwritten notes into diagrams. With Nano Banana Pro, you can now: Generate more accurate, context-rich visuals based on enhanced reasoning,…
With 2K and 4K resolution available, you can ensure outputs meet the resolution standards required for professional production. Effortlessly create cohesive advertisements by combining diverse elements such as product images, logos, and references. Achieve consistent resemblance for up to five individuals, integrate six high-fidelity shots, or blend as many as fourteen standard inputs into a single,…
Today, we’re introducing Gemini 3.1 Flash TTS, the latest text-to-speech model that delivers improved controllability, expressivity and quality, empowering developers, enterprises and everyday users to build the next generation of AI-speech applications. Starting today, 3.1 Flash TTS is rolling out:

Improved speech quality and controllability

We’ve improved the overall speech quality of Gemini 3.1…
What’s next

This launch builds on our history of providing context about images in Google Search and exploring new research innovations like Backstory from Google DeepMind. Looking ahead, we will continue to invest in more ways to empower you to determine the origin and history of content online. Soon, we’ll expand SynthID verification to support…
At the edge, our E2B and E4B models redefine on-device utility, prioritizing multimodal capabilities, low-latency processing and seamless ecosystem integration over raw parameter count.

Powerful, accessible, open

To power the next generation of pioneering research and products, we've sized the Gemma 4 models specifically to run and fine-tune efficiently on hardware, from billions of…
Providing new places to generate music

High-quality music generation should be accessible wherever creativity happens. Whether you are an app developer, a business or music professional, or a creator, these integrations allow you to use Lyria’s advanced musical awareness to scale your production. Vertex AI: Lyria 3 Pro is now in public preview on Vertex…
Today, we’re advancing Gemini’s real-time dialogue capabilities with Gemini 3.1 Flash Live, our highest-quality audio and voice model yet. It delivers the speed and natural rhythm needed for the next generation of voice-first AI, offering a more intuitive experience for developers, enterprises and everyday users. 3.1 Flash Live is available across Google products: For developers:…
To understand AI capabilities across these cognitive abilities, we propose a three-stage evaluation protocol that benchmarks system performance in relation to human capabilities:

1. Evaluate AI systems across a broad suite of cognitive tasks covering each ability, using held-out test sets to prevent data contamination
2. Collect human baselines for the same tasks from a demographically representative…
Introducing our National Partnerships for AI and collaboration in India

We believe AI will be the most transformative technology in human history and that it should be deployed in ways that benefit all of humanity. This requires deep, strategic collaboration between frontier AI labs, governments, academia, and civil society. To fully realise AI’s potential, Google…
Catalyzing breakthroughs in science

By proving it could navigate the massive search space of a Go board, AlphaGo demonstrated the potential for AI to help us better understand the vast complexities of the physical world. We started by attempting to solve the protein folding problem, a 50-year grand challenge of predicting the 3D structure of…