Google Unveils Gemini 3.5 Flash, Promising Faster AI Performance
Quick Look
- Google announced Gemini 3.5 Flash, its new default AI model, at Google I/O 2026.
- It offers faster performance than Gemini 3.1 Pro for agentic and coding tasks at a lower cost, with enhanced safeguards against harmful content.
AI-generated summary
Why It Matters
Google announced Gemini 3.5 Flash at Google I/O 2026, positioning it as their new default AI model. This model aims to provide faster performance for agentic and coding tasks compared to previous versions, while also being more cost-effective.
Google has unveiled Gemini 3.5, starting with the Gemini 3.5 Flash model that promises to outperform Gemini 3.1 Pro in real-world agentic and coding tasks. "3.5 Flash delivers frontier-level intelligence at exceptional speed β proving you no longer have to trade quality for latency."
Announced at Google I/O 2026, this will be Google's default AI model (not to be confused with Flash-Lite), designed to deliver better speed than the current Gemini Pro models at a more affordable price. The tradeoff is lower performance than the 3.5 Pro model (coming next month) in tasks that require deep reasoning and high-context understanding.
However, Google has reduced the compromise between the Pro and Flash models, saying Gemini 3.5 Flash "delivers intelligence that rivals large flagship models on multiple dimensions." It outperforms the current Gemini 3.1 Pro model on coding and agentic benchmarks like Terminal-Bench 2.1 (76.2 percent), MCP Atlas scaled tool use (83.6 percent) and multimodal understanding with 84.2 percent on CharXiv Reasoning. From an output tokens-per-second standpoint, it's four times faster than other frontier models, according to Google.
All of that makes 3.5 Flash ideal for long-horizon agentic tasks, completing what used to take weeks in "a fraction of the time," Google wrote. "Under supervision, it can reliably execute multi-step workflows and coding tasks while sustaining frontier performance." It added that partners, including banks and fintechs, have used it to automate multi-week workflows.
Google noted that 3.5 Flash is now the default model for the Gemini app and for AI Mode in Search worldwide. The personal AI agent Gemini Spark, rolling out to testers today, also runs on 3.5 Flash. At the same time, Google said it strengthened Gemini 3.5's cyber and CBRN (chemical, biological, radiological, nuclear) safeguard so it's less likely to generate harmful content (or mistakenly refuse to answer safe queries). Google detailed the new model's enhanced agentic coding capabilities in a new blog post.
Gemini 3.5 Flash is now "generally available via Google Antigravity, the Gemini API in Google AI Studio and Android Studio, Gemini Enterprise Agent Platform and Gemini Enterprise. It's also now available to everyone in the Gemini app and AI Mode in Search," Google said.
There's no word yet on pricing for paid tiers. As for Gemini 3.5 Pro, Google said: "It's already being used internally, and we look forward to rolling it out next month."
What to Watch
AI outlook β possibilities, not facts
Gemini 3.5 Pro will be rolled out next month.
Very likely Β· Within months
Open Questions
- What is the exact pricing for paid tiers of Gemini 3.5 Flash?
- What are the specific benchmarks and performance metrics for Gemini 3.5 Pro?
- What are the detailed technical specifications of the enhanced cyber and CBRN safeguards?
- What specific banks and fintechs have used the model to automate workflows?






