

Google has introduced updates throughout its Gemini 2.5 household of reasoning fashions, together with making Gemini 2.5 Professional and Flash usually obtainable and introducing a preview of Gemini 2.5 Flash-Lite.
In accordance with Google, no adjustments have been made to Professional and Flash for the reason that final preview, aside from the pricing for Flash is completely different. When these fashions have been first introduced, there was separate considering and non-thinking pricing, however Google mentioned that separation led to confusion amongst builders.
The brand new pricing for two.5 Flash is identical for each considering and non-thinking modes. The costs at the moment are $0.30/1 million enter tokens for textual content, picture, and video, $1.00/ 1 million enter tokens for audio, and $2.50/1 million output tokens for all. This represents a rise in enter value and a lower in output value.
“Whereas we try to take care of constant pricing between preview and steady releases to attenuate disruption, this can be a particular adjustment reflecting Flash’s distinctive worth, nonetheless providing one of the best cost-per-intelligence obtainable,” Google wrote in a weblog publish.
Google additionally launched a preview of Gemini 2.5 Flash-Lite, which has the bottom latency and value among the many 2.5 fashions. The corporate sees this as a cheap improve from 1.5 and a pair of.0 Flash, with higher efficiency throughout most evaluations, decrease time to first token, and better tokens per second decode.
Gemini 2.5 Flash-Lite additionally permits customers to manage the considering finances by way of an API parameter. Because the mannequin is designed for value and velocity effectivity, considering is turned off by default.
The brand new mannequin additionally helps Google’s native instruments together with Grounding with Google Search, Code Execution, URL Context, and performance calling.
The pricing for Gemini 2.5 Flash-Lite is $0.10/1 million enter tokens for textual content, picture, and video, $0.50/ 1 million enter tokens for audio, and $.40/1 million output tokens for all.