Google Says Its New Gemini 3 Flash AI Model Is Better and Faster Than 2.5 Pro
The new Gemini 3 Flash AI model from Google is built for faster output at a lower cost, and functions as well as previous powerful reasoning models, Google said in a press release on Wednesday.
According to benchmark tests, Gemini 3 Flash achieved PhD-level reasoning in the GPQA Diamond test, with a score of 90.4%, and it achieved a score of 33.7% (without tools) in Humanity's Last Exam. These are very difficult tests designed to push AI models and require expert-level knowledge. Gemini 3 Pro achieved scores of 91.9% and 37.5%, respectively, on the tests. Google said Gemini 3 Flash outperforms Gemini 2.5 Pro, which was the top Google model when it released earlier this year, at three times the speed.
Don't miss any of our unbiased tech content and lab-based reviews. Add CNET as a preferred Google source.
Gemini 3 Flash is available in Google AI Studio and Gemini CLI for developers. For general consumers, it is rolling out in the Gemini app, the new Antigravity and AI Mode in Google Search. For enterprise users, Gemini 3 Flash is available in Vertex AI and Gemini Enterprise.
Google said Gemini 3 Flash can be a handy agent for customer support or in-game support, tasks that require fast responses. "This is a model that is smart and fast," Josh Woodward, vice president of Google Labs and Google Gemini, told CNET in an interview. "A lot of times you have to make a trade off. You go for a big model that's slow or a fast model that's dumb. And this one is right at that point on the Pareto frontier, when we look at quality and speed, that is a very interesting spot."
Think of the Pareto front as finding a solution with the best possible compromises. For example, when looking to buy a car that is both fast and cheap, there will need to be some tradeoffs in body material or seat quality. The Pareto front in engineering looks to find the most efficient solution with the fewest consequences.
Woodward sees Gemini 3 Flash as a model to tackle everyday tasks, like answering questions about travel, shopping or education. Interestingly, Google will introduce a "thinking" version for Gemini 3 Flash, which will take a bit more time to formulate its responses. It is the first time Google is doing this and Woodward is curious to see how consumers will react. Google is also experimenting with an "auto" mode, one in which the model intuitively switches between fast and thinking versions based on the query.
Gemini 3 Pro and Nano Banana, the Google image model, are landing in AI Mode in Search. They will be accessible via a dropdown menu and only available to AI Pro and Ultra subscribers. For those who want to stick to the free tier, Gemini 3 Flash will be in AI Mode and users can choose the thinking model as well for better, if slightly slower, output.
Gemini 3 Flash comes as Google is starting to gain a lead against ChatGPT maker OpenAI. The release of Gemini 3 in November reportedly pushed OpenAI into declaring a "code red." OpenAI has since fought back with the release of ChatGPT-5.2 last week. Anthropic also released a new version of its most powerful model, Claude Opus 4.5, in recent weeks.
What's Your Reaction?
Like
0
Dislike
0
Love
0
Funny
0
Angry
0
Sad
0
Wow
0