Two paths for the agentic web
Recent conferences from Google and Microsoft reveal diverging strategies for the agentic web, with Google focusing on vertical integration and Microsoft on horizontal infrastructure protocols.
Joel Miller

This week’s back-to-back conferences from Google and Microsoft revealed more than new products – they exposed different philosophies about how AI will reshape computing. Google I/O showcased an execution machine cannibalising its own business model. Microsoft Build unveiled infrastructure for an entirely new digital ecosystem. Together, they show an industry evolving rapidly.
Google’s transformation from last year’s I/O was clear. Where 2024 brought impressive demos with vague timelines, this week’s event delivered products live during the keynote. AI Mode for Search activated for US, project Astra’s camera features rolled out to Android devices. One fascinating stat; token processing grew from 9.7 trillion monthly to 480 trillion – a 50-fold increase demonstrating Google has built infrastructure to deliver AI products at global scale, not just demonstrate them.
Microsoft Build set out a different approach. Over 50 announcements painted a vision for what they call the “open agentic web” and many of the security, development and infrastructure components needed to power it. Rather than focusing on the product layer, Microsoft argued for a future where AI agents become first-class citizens of the internet.
Google’s announcements spanned every aspect of their AI platform:
- Gemini: From ultra-cheap and upgraded Flash models to deep reasoning mode challenging OpenAI’s upcoming o3-pro
- Veo 3: Video generation with native audio, creating content approaching broadcast quality (see below)
- Jules: Direct competition with Codex in autonomous software development
- AI Mode: A reimagining of search through conversational interaction
- Gemma 3: Open-weight models delivering impressive capabilities at smaller mobile friendly scales
- Gemini Diffusion: An experimental text diffusion model that learns to generate outputs by converting random noise into coherent text
Microsoft’s announcements focused on building the foundation layer:
- NLWeb Protocol: Natural language AI interfaces for websites
- Model Context Protocol (MCP) Integration: Every NLWeb instance becomes an MCP server, plus they’re dropping MCP into Windows
- Agent2Agent (A2A) Support: Direct collaboration between AI agents
- Microsoft Entra Agent ID: Unique, verifiable identities for AI agents
- GitHub Copilot Coding Agent: Autonomous code refactoring, testing, and feature implementation
- Local Foundry: Running AI models and agents directly on Windows 11 PCs
Google’s approach centres on vertical integration. They control everything from custom silicon to global distribution, optimising their entire stack for different use cases. This provides hardware that is optimise for the workloads they run, and the ability to deploy to hundreds of millions of users overnight.
Microsoft pursues horizontal infrastructure. Rather than owning the consumer stack, they’re focusing on building protocols and platforms others can build upon. NLWeb could become “HTML for the agentic web”, whilst MCP support across their ecosystem creates interoperability beyond Microsoft’s boundaries. Google is building a Gemini-based fortress, Microsoft is building the roads.
Google’s execution impresses through sheer velocity. Products are now moving from research to deployment in months. Their 50-fold increase in token processing demonstrates ability to scale AI workloads that would break most companies. The market responded positively, and shares jumped 4% following the announcements.
Microsoft’s direction is impressively open and connective, although both companies are yet to offer the knowledge worker a compelling or coherent basis for leveraging agents. But there is a further and the crucial difference. Google faces an existential paradox – actively disrupting the search advertising business generating the majority of their revenue. AI Mode reduces clicks to external websites, potentially undermining the ecosystem funding the web through advertising. Their new Gemini subscriptions (including $250 for Ultra) represent early attempts at alternative revenue but pale beside their advertising empire.
Microsoft faces no such dilemma. Their AI agent strategy should reinforce existing cloud and subscription revenues. When agents automate workflows, enterprise customers pay more, not less. Microsoft can enhance their business model whilst Google must potentially destroy theirs.
But Microsoft’s vision is not yet complete. The NLWeb concept technically connects websites to agents. But what will compel website owners to participate in a system reducing their visitor traffic and advertising revenue? The current web operates on human attention usually monetised through ads. When AI agents replace browsers, that foundation collapses. Both approaches accelerate AI’s integration into everyday computing but in different ways. We’re entering a golden age of AI capabilities; Google’s consumer focus brings sophisticated AI to billions; Microsoft’s enterprise focus transforms how businesses operate.
But many fundamental questions remain unanswered. The economics of content creation in an AI-dominated world haven’t been worked through. Neither company addresses how creators get rewarded when their work is consumed or re-purposed. This isn’t just a business problem, it threatens the sustainability of human knowledge flow. The solution might involve native agentic payments. AI agents could pay tiny amounts for each piece of information accessed, creating new incentives for quality content creation and increasing interconnectivity. Agents that generate revenue could share that with the data and IP owners that allowed their underlying models to be trained. But neither Google nor Microsoft have got this far yet.
Takeaways: Google and Microsoft have revealed contrasting but equally ambitious visions for AI’s future. The immediate future brings extraordinary AI capabilities to billions of users and could populate the Internet with billions of agents. But we must now work out what will sustain this new knowledge ecosystem. If we don’t tackle this soon, a thriving AI environment may become a closed, extractive, zero-sum one. The technical foundations are being laid, and the products like Veo 3 are mind bending, but the economic architecture of the AI age remains technology’s greatest unsolved challenge.
