Skip to main content
  1. Articles/

Google I/O 2026: Firebase Studio Is Live, Jules Goes Free, and the Agentic Race Gets a Third Contender

·1583 words·8 mins·
Author
Florent Clairambault
CTO & Software engineer

The preview article I published six days ago said to watch three things at Google I/O 2026: whether Gemini 4’s context window advantage translated into better coding outcomes, whether Firebase Studio became a real agent-native platform or a rebranding exercise, and whether Jules V2 had a credible answer to Claude Code Routines.

The keynote delivered answers. Not all of them were what Google needed — but some were.

What Actually Shipped
#

Google’s I/O 2026 developer story is cleaner than most expected. Three products moved from preview or beta to shipped:

  • Firebase Studio launched as an agent-native full-stack development environment
  • Jules exited beta and became available to all users on free and paid AI Pro and Ultra tiers
  • Gemini Code Assist hit general availability for individuals and GitHub users, powered by Gemini 2.5

Supporting these launches: Gemini Intelligence — Google’s integrated AI suite across Android, ChromeOS, Wear OS, Android Auto, and Android XR — and Googlebooks, the first Aluminium OS laptops from Acer, ASUS, Dell, HP, and Lenovo arriving this fall.

Google did not ship Gemini 4 as a standalone model with a clean benchmark story. The model capability layer is table stakes now; the developer tooling story is what differentiated I/O 2026.

Firebase Studio: The Real Announcement
#

Firebase Studio is the most significant thing Google shipped today for developers who care about agentic workflows. It is not Project IDX with a new name. It is a substantively different product.

The architecture: a Code OSS IDE environment in the browser, a no-code prototyping layer for non-developer stakeholders, and an agent mode capable of executing multi-step development tasks autonomously. Figma integration means a design file becomes an application prototype in Firebase Studio without manual handoff. Google Cloud backend provisioning is automated — Cloud Run, Firebase Hosting, and related services are available without configuration.

The intended workflow is: prototype in Google AI Studio → build in Firebase Studio → deploy to Google Cloud. For teams already in the Google ecosystem, this is a credible end-to-end pipeline with fewer seams than anything Google has shipped before.

The comparison to Claude Code is the obvious one and Google knows it. Firebase Studio’s thesis is that the browser is where more developers live, that cloud-native development removes the local environment complexity, and that Figma-to-deployment in a single environment lowers the barrier for teams that currently have a designer-to-developer handoff problem.

Claude Code’s thesis is that the terminal provides the access, flexibility, and tooling depth that truly autonomous agents require — and that browser-based environments introduce platform constraints that limit what agents can do. Both theses are coherent. They’re not targeting exactly the same developer.

Where Firebase Studio wins: Google Cloud-native integration depth is real and unmatched. If you are deploying to Cloud Run, using BigQuery, or running on Firebase Hosting, the one-click deployment and native service wiring saves hours of configuration that Claude Code requires separately via MCP servers or custom scripts.

Where Claude Code wins: environment ownership. Claude Code agents can run arbitrary shell commands, modify system configs, install toolchains, and manage processes in ways that a browser IDE cannot. For the kind of spec-driven, multi-agent, CI-integrated autonomous development that Claude Code Routines enables, Firebase Studio’s browser-native architecture is a constraint, not an advantage.

Firebase Studio is a real product that will earn real adoption. It is not Claude Code.

Jules Goes Free: The KPI-Driven Bet
#

Jules exiting beta is significant for one reason: it is now available to all users, including the free tier. That means any developer can queue an async task on Google’s infrastructure, walk away, and come back to a pull request.

The architectural story has not changed from the Jules deep dive published in March. Jules integrates with GitHub, runs on Google infrastructure, creates multi-step plans, executes them asynchronously, and presents results as a diff with reasoning attached. Audio changelogs of its work are available. The CI loop closes automatically.

What is new is Project Jitro — the Jules V2 approach that changes the input model. Instead of telling Jules what to do (fix this bug, refactor this module), you tell it what to achieve: raise test coverage to 80%, reduce p95 API latency by 30 milliseconds, resolve all accessibility violations in the component library. Jitro maps the goal to the required code changes, runs asynchronously, and delivers a pull request targeting the metric, not the task.

KPI-driven development is a genuinely interesting framing. It is also harder to evaluate than task-driven development because the rubric for success is embedded in the goal definition. If you tell an agent “raise test coverage to 80%,” the most efficient path is to write trivial tests that cover lines without exercising real behavior. Whether Google has solved that evaluation problem is not yet clear from today’s launch.

Claude Code’s analog is Managed Agents Outcomes, announced at Code with Claude SF on May 6: a separate rubric-based grader that runs in its own context window, evaluates whether the agent’s output meets defined criteria, and triggers re-runs if it doesn’t. The grader runs independently of the agent, which is structurally different from building the goal evaluation into the task itself. Neither approach has published failure rate data at production scale.

Jules free tier changes who can evaluate these tools. The comparison is now available to any developer without a budget commitment.

Gemini Code Assist: GA With Caveats
#

Gemini Code Assist reached general availability for individuals and GitHub users. Gemini 2.5 powers the assistant. A 2 million token context window is announced as coming soon, not yet live.

The “coming soon” caveat matters. A 2M token context window for Gemini Code Assist would change the competitive comparison with Claude Code’s 1M context window significantly for whole-codebase tasks. But it is not shipping today. GA means the product is available and supported — it does not mean the context window feature announced for the future is present now.

At current Gemini 2.5 Pro performance levels, the model quality gap versus Opus 4.7 on SWE-bench Pro is approximately ten percentage points (54% vs 64.3%). Gemini Code Assist’s competitive advantage today is not model quality — it is Google Cloud native integration and price. The free tier (700,000+ VS Code installs) gives Google enormous distribution. If Gemini 4 substantially closes the model quality gap, that distribution becomes a moat.

The Platform Layer: Gemini Intelligence and Googlebooks
#

Two announcements from today are not directly about developer tooling but matter for the longer arc.

Gemini Intelligence — the integrated AI suite across Android, ChromeOS, Wear OS, Android Auto, and Android XR — represents Google’s bet that the agent layer lives in the OS, not just in development tools. Features like proactive task automation, custom widget generation, and the Rambler speech-to-text assistant are not developer tools. They are consumer surfaces that normalize agentic behavior for users who will eventually consume agentic software. Google is building the audience for agentic applications at the OS level while simultaneously building the tools to create them.

Googlebooks — the first Aluminium OS laptops from major OEMs — is the physical manifestation of the Android-ChromeOS merger that developers have been tracking for two years. Aluminium OS arrives in fall 2026. For Android and web developers, it is a new primary development and consumption target.

The Three-Way Race
#

I/O 2026 establishes something that was not clearly true twelve months ago: the agentic coding market has three serious competitors.

Claude Code leads on model quality (Opus 4.7, 64.3% SWE-bench Pro), terminal-native autonomy, Managed Agents platform depth, and enterprise infrastructure (Cowork, Analytics API, Code Review GA). It is the benchmark that everyone else is chasing.

OpenAI’s Codex is the credible cost alternative, leveraging GPT-5.5 (82.7% Terminal-Bench 2.0, 58.6% SWE-bench Pro) with async execution, a mobile supervision layer, and pricing that enterprise procurement finds easier to defend than Claude Code’s per-token costs.

Google now has Firebase Studio (agent-native platform for Google Cloud deployments), Jules (free-tier async agent with KPI-driven V2 approach), and Gemini Code Assist (2M context window incoming, 700K VS Code installs). Google’s stack wins on distribution, integration depth within the Google ecosystem, and price. It loses on autonomous execution depth and current model quality.

The developer who builds on Google Cloud, has Figma in their design workflow, and wants an integrated environment for new projects now has a real choice that Firebase Studio represents. The developer doing spec-driven multi-agent development at team scale with CI integration, production CLAUDE.md invariants, and autonomous overnight coding runs has not had a reason to switch today.

The benchmark that matters: when Gemini Code Assist ships the 2M context window and if Google closes the SWE-bench Pro gap with whatever model ships next, the model-quality argument for Claude Code’s premium pricing weakens. Until then, Google has narrowed the tooling gap without closing the quality gap.

That is real progress. It is also still a gap.


Sources:

Related