ExoBrain Weekly Newsletter25 April 2025

AI’s experience beyond words, Frontier firms lead workplace change, and the geography of compute

Welcome to our weekly newsletter, a combination of thematic insights from the founders at ExoBrain, and a broader news roundup from our Exo agents.

This week we look at:

AI’s experience beyond words
A new essay by Sutton and Silver argues that AI must transition from mimicking human data to learning through experiential interaction with the real world to overcome current performance ceilings.
Frontier firms lead workplace change
Surveys from Microsoft and KPMG reveal that frontier firms are accelerating AI agent adoption, though a significant gap remains between widespread piloting and actual deployment due to workforce readiness challenges.
The geography of compute
Recent research indicates that the US dominates global AI compute with 75% of aggregate performance, while escalating hardware costs and power demands threaten to constrain future model training capabilities.

AI’s experience beyond words

A new essay by Sutton and Silver argues that AI must transition from mimicking human data to learning through experiential interaction with the real world to overcome current performance ceilings.

Joel Miller

25 April 20253 min read

Imagine grading a cake by reading the recipe instead of tasting it. Google DeepMind’s David Silver uses this image to mark the passing age human data powered AI, where graders reward an answer because it looks convincing rather than because it works. The result is a layer of effective mimicry that flatters users yet can’t push past human knowledge. Silver’s provocation is simple: let the model bake the cake, eat it, and learn from the flavour.

Richard Sutton and Silver call this the era of experience. Their new essay argues that agents must inhabit lifelong streams of action, sense the consequences and tune their policies to grounded signals such as heart-rate, revenue or tensile strength. Static human data will soon hit a ceiling; experiential data can grow without limit…

Memories of AlphaGo support the point. When DeepMind stripped professional games from AlphaZero’s training diet and relied on self-play, performance soared. The same pattern resurfaced in AlphaProof, which generated 100 million of its own proofs and reached International Mathematical Olympiad silver level.

The push away from human limits is underway. Coconut, a 2024 experimental model from Meta, keeps its “thoughts” inside high dimensional space instead of working through a chain of readable words. It solved standard logic tests with the same 98.8 % accuracy as a baseline while emitting one-tenth of the text, saving compute. Other researchers show models switching languages mid-problem or signalling answers through non-English starter tokens, a reminder that non-human reasoning already lives quietly inside many systems. Critics worry that experiential agents will be harder to audit: if the chain of thought never reaches text, how do we check for deception? Sutton and Silver reply that real-world rewards actually provide a clearer incentive than a subjective thumb-ups. Yet obtaining reliable signals outside labs remains expensive, and online learning keeps GPUs spinning long after pre-training ends.

Venture capitalist Deedy Das called the essay “Sutton’s most important since The Bitter Lesson”, a 2019 note in which Sutton showed that letting algorithms crunch vast amounts of data and compute usually beats carefully hand-coded rules. His praise signals investor optimism that a similar “scale wins” dynamic could now unfold around experiential data rather than human text. Pratap Ranade, who builds AI tools for retailers, echoed that view, saying “nature is the best compression algorithm”. In plain terms: the real world already stores knowledge in the way things behave, so an agent that pokes and measures the world may learn more efficiently than one that reads documentation. Sceptics counter that the vision is less revolutionary than it sounds. Computer-science professor Ali Minai argued the ideas are “obvious to anyone who has looked beyond gradient descent”, the basic method most neural networks use to nudge their internal numbers towards lower error. His point: researchers steeped in older schools of AI, such as symbolic reasoning or evolutionary methods, have long championed active exploration, so re-branding it as a new era risks unneeded hype. Together, the reactions reveal a community split between those betting that experience will unlock fresh value and those wary of repeating history’s cycles of exuberance and retrenchment.

Takeaways: The age of experience reframes progress; success from the new wave of agents will hinge on feedback from reality, not from human graders. Companies that wire agents to real-world signals, trim the cost of continuous learning and develop new safety lenses for silent thought will harness the true power of the new digital workforce. Sutton and Silver offer a manifesto, but the race now moves from theory to the challenge of giving AI agency connected to the real world.

Frontier firms lead workplace change

Surveys from Microsoft and KPMG reveal that frontier firms are accelerating AI agent adoption, though a significant gap remains between widespread piloting and actual deployment due to workforce readiness challenges.

Joel Miller

25 April 20252 min read

New reports bring fresh perspectives on AI’s trajectory in the workplace this week. Microsoft’s comprehensive Work Trend Index gathered insights from 31,000 workers across 31 countries, while KPMG’s AI Pulse Survey focused on 130 US leaders within large organisations. Together, they reveal accelerating AI agent adoption alongside subtle but increasing workforce adjustments.

The rise of AI agents is a central theme, driving significant organisational shifts:

Microsoft identifies the “frontier firm” profile emerging… structured around on-demand intelligence and human-agent teams, reporting higher thriving rates (71% vs 37% globally).
Adoption intent is strong: 81% of leaders expect significant agent integration soon.
Current use is substantial: 46% of leaders report agents already automate workflows.
Confidence in capacity expansion via agents is high (82% of leaders), making it a top priority for 45%.
A new “agent boss” role is anticipated, managing AI, though leaders are currently more aligned with this mindset than employees.
Piloting is surging (65%, up from 37%), yet actual deployment lags (11%).
Most firms (67%) plan to buy platforms, not build their own agents.
Technology (76%), Operations (74%), and Risk (56%) functions are expected to benefit most.
Key training challenges include system complexity (66%) and the pace of technological change (56%).

Beyond agents, the broader impact on work involves complex adjustments:

Most leaders (76%) believe AI automates tasks, not roles, however, 33% are considering AI-driven headcount reductions.
Simultaneously, 78% are considering hiring for new AI-specific roles.
Upskilling the existing workforce is a top strategy (47%), with AI literacy deemed the most in-demand skill.
Operational leadership is shifting, with CIOs increasingly directing AI initiatives (86%).
AI is expected to enhance performance for both strong (69%) and lower (57%) performers.

Takeaways: The enthusiasm for AI agents is clear, with high expectations for integration and automation. However, the reality on the ground shows a significant gap between widespread piloting and actual deployment, hampered by practical challenges like risk management, trust, and workforce readiness. Businesses face a complex period of adjustment. While AI promises to enhance productivity, decisions around task automation, potential job displacement, new role creation, and essential upskilling require careful navigation. Successfully integrating AI requires more than technology; it demands strategic organisational change, and a workforce equipped with new skills, notably AI literacy.

The geography of compute

Recent research indicates that the US dominates global AI compute with 75% of aggregate performance, while escalating hardware costs and power demands threaten to constrain future model training capabilities.

ExoBrain

25 April 20251 min read

This image, based on recent EPOC research, shows the global distribution of AI compute from 2019-2025. The US overwhelmingly dominates, accounting for roughly 75% of aggregate performance. China now holds a distant second place, its share fluctuating and declining after GPU export controls tightened. This concentration is driven by a profound shift: AI compute is now overwhelmingly private, with industry controlling 80% of performance, up from 40% in 2019. While performance doubles every nine months, the report highlights unsustainable resource growth, with hardware costs and power needs doubling annually. Researchers predict that if trends continue, a leading AI system in 2030 could cost $200 billion and require an immense 9 gigawatts of power, equivalent to nine nuclear reactors. They suggest securing such power is the primary constraint, likely forcing a shift towards training models across multiple, distributed sites rather than single colossal clusters. Can the US maintain its lead as its supply chains begin to suffer unprecedented stress?

Subscribe to the ExoBrain Weekly Newsletter

Stay up to date with AI. Get analysis of the week's most important stories, plus a focused roundup across business, governance, research and infrastructure.