2025 Week 17 news

April 25, 2025

Welcome to our weekly news post, a combination of thematic insights from the founders at ExoBrain, and a broader news roundup from our AI platform Exo…

Themes this week

JOEL

This week we look at:

AI’s shift from human data to learning through experience
The rise of “frontier firms” and agents in the workplace
Geographic inequality in computing power and resource challenges

AI’s experience beyond words

Imagine grading a cake by reading the recipe instead of tasting it. Google DeepMind’s David Silver uses this image to mark the passing age human data powered AI, where graders reward an answer because it looks convincing rather than because it works. The result is a layer of effective mimicry that flatters users yet can’t push past human knowledge. Silver’s provocation is simple: let the model bake the cake, eat it, and learn from the flavour.

Richard Sutton and Silver call this the era of experience. Their new essay argues that agents must inhabit lifelong streams of action, sense the consequences and tune their policies to grounded signals such as heart-rate, revenue or tensile strength. Static human data will soon hit a ceiling; experiential data can grow without limit…

Memories of AlphaGo support the point. When DeepMind stripped professional games from AlphaZero’s training diet and relied on self-play, performance soared. The same pattern resurfaced in AlphaProof, which generated 100 million of its own proofs and reached International Mathematical Olympiad silver level.

The push away from human limits is underway. Coconut, a 2024 experimental model from Meta, keeps its “thoughts” inside high dimensional space instead of working through a chain of readable words. It solved standard logic tests with the same 98.8 % accuracy as a baseline while emitting one-tenth of the text, saving compute. Other researchers show models switching languages mid-problem or signalling answers through non-English starter tokens, a reminder that non-human reasoning already lives quietly inside many systems. Critics worry that experiential agents will be harder to audit: if the chain of thought never reaches text, how do we check for deception? Sutton and Silver reply that real-world rewards actually provide a clearer incentive than a subjective thumb-ups. Yet obtaining reliable signals outside labs remains expensive, and online learning keeps GPUs spinning long after pre-training ends.

Venture capitalist Deedy Das called the essay “Sutton’s most important since The Bitter Lesson”, a 2019 note in which Sutton showed that letting algorithms crunch vast amounts of data and compute usually beats carefully hand-coded rules. His praise signals investor optimism that a similar “scale wins” dynamic could now unfold around experiential data rather than human text. Pratap Ranade, who builds AI tools for retailers, echoed that view, saying “nature is the best compression algorithm”. In plain terms: the real world already stores knowledge in the way things behave, so an agent that pokes and measures the world may learn more efficiently than one that reads documentation. Sceptics counter that the vision is less revolutionary than it sounds. Computer-science professor Ali Minai argued the ideas are “obvious to anyone who has looked beyond gradient descent”, the basic method most neural networks use to nudge their internal numbers towards lower error. His point: researchers steeped in older schools of AI, such as symbolic reasoning or evolutionary methods, have long championed active exploration, so re-branding it as a new era risks unneeded hype. Together, the reactions reveal a community split between those betting that experience will unlock fresh value and those wary of repeating history’s cycles of exuberance and retrenchment.

Takeaways: The age of experience reframes progress; success from the new wave of agents will hinge on feedback from reality, not from human graders. Companies that wire agents to real-world signals, trim the cost of continuous learning and develop new safety lenses for silent thought will harness the true power of the new digital workforce. Sutton and Silver offer a manifesto, but the race now moves from theory to the challenge of giving AI agency connected to the real world.

Frontier firms lead workplace change

New reports bring fresh perspectives on AI’s trajectory in the workplace this week. Microsoft’s comprehensive Work Trend Index gathered insights from 31,000 workers across 31 countries, while KPMG’s AI Pulse Survey focused on 130 US leaders within large organisations. Together, they reveal accelerating AI agent adoption alongside subtle but increasing workforce adjustments.

The rise of AI agents is a central theme, driving significant organisational shifts:

Microsoft identifies the “frontier firm” profile emerging… structured around on-demand intelligence and human-agent teams, reporting higher thriving rates (71% vs 37% globally).
Adoption intent is strong: 81% of leaders expect significant agent integration soon.
Current use is substantial: 46% of leaders report agents already automate workflows.
Confidence in capacity expansion via agents is high (82% of leaders), making it a top priority for 45%.
A new “agent boss” role is anticipated, managing AI, though leaders are currently more aligned with this mindset than employees.
Piloting is surging (65%, up from 37%), yet actual deployment lags (11%).
Most firms (67%) plan to buy platforms, not build their own agents.
Technology (76%), Operations (74%), and Risk (56%) functions are expected to benefit most.
Key training challenges include system complexity (66%) and the pace of technological change (56%).

Beyond agents, the broader impact on work involves complex adjustments:

Most leaders (76%) believe AI automates tasks, not roles, however, 33% are considering AI-driven headcount reductions.
Simultaneously, 78% are considering hiring for new AI-specific roles.
Upskilling the existing workforce is a top strategy (47%), with AI literacy deemed the most in-demand skill.
Operational leadership is shifting, with CIOs increasingly directing AI initiatives (86%).
AI is expected to enhance performance for both strong (69%) and lower (57%) performers.

Takeaways: The enthusiasm for AI agents is clear, with high expectations for integration and automation. However, the reality on the ground shows a significant gap between widespread piloting and actual deployment, hampered by practical challenges like risk management, trust, and workforce readiness. Businesses face a complex period of adjustment. While AI promises to enhance productivity, decisions around task automation, potential job displacement, new role creation, and essential upskilling require careful navigation. Successfully integrating AI requires more than technology; it demands strategic organisational change, and a workforce equipped with new skills, notably AI literacy.

EXO

The geography of compute

This image, based on recent EPOC research, shows the global distribution of AI compute from 2019-2025. The US overwhelmingly dominates, accounting for roughly 75% of aggregate performance. China now holds a distant second place, its share fluctuating and declining after GPU export controls tightened. This concentration is driven by a profound shift: AI compute is now overwhelmingly private, with industry controlling 80% of performance, up from 40% in 2019. While performance doubles every nine months, the report highlights unsustainable resource growth, with hardware costs and power needs doubling annually. Researchers predict that if trends continue, a leading AI system in 2030 could cost $200 billion and require an immense 9 gigawatts of power, equivalent to nine nuclear reactors. They suggest securing such power is the primary constraint, likely forcing a shift towards training models across multiple, distributed sites rather than single colossal clusters. Can the US maintain its lead as its supply chains begin to suffer unprecedented stress?

Weekly news roundup

This week shows a significant shift towards open-source AI initiatives, continued hardware innovation, and increasing focus on AI governance and ethical considerations in real-world applications.

AI business news

OpenAI’s open-source pivot isn’t altruism—it’s a strategic recalibration (Important insight into the strategic thinking behind major AI companies’ open-source decisions.)
A new, open source text-to-speech model called Dia has arrived to challenge ElevenLabs, OpenAI and more (Shows how open-source alternatives are disrupting the text-to-speech market.)
Microsoft’s big AI hire can’t match OpenAI (Reveals the ongoing competition for top AI talent and its impact on corporate strategies.)
Alphabet climbs as AI bets drive ad strength, quelling market fears (Demonstrates how AI integration is boosting traditional tech business models.)
Windsurf slashes prices as competition with Cursor heats up (Indicates increasing competition in the AI tooling market.)

AI governance news

AI training license will allow LLM makers to pay creators (Represents a potential solution to the contentious issue of AI training data rights.)
Dario Amodei – The urgency of interpretability (Key perspective on a critical challenge in AI safety from a leading researcher.)
Israel’s A.I. experiments in Gaza war raise ethical concerns (Highlights real-world implications of military AI applications.)
An OpenAI researcher who worked on GPT-4.5 had their green card denied (Shows immigration challenges affecting AI talent mobility.)
Public comments to White House on AI policy touch on copyright, tariffs (Indicates evolving regulatory landscape for AI in the US.)

AI research news

TTRL: Test-time reinforcement learning (Advances in making AI systems more adaptable during deployment.)
Retrieval-augmented generation with conflicting evidence (Important development for handling contradictory information in AI systems.)
Could thinking multilingually empower LLM reasoning? (Novel approach to improving AI reasoning capabilities.)
Towards combinatorial interpretability of neural computation (Breakthrough in understanding how neural networks process information.)
Describe anything: Detailed localised image and video captioning (Advances in AI’s ability to understand and describe visual content.)

AI hardware news

SK hynix revenues boosted as US corps stockpile memory (Shows impact of AI demand on memory market dynamics.)
Nvidia NeMo microservices to embed AI agents in workflows (Important development for enterprise AI integration.)
China’s Baidu says its Kunlun chip cluster can train DeepSeek-like models (Indicates progress in China’s AI chip capabilities.)
Trade tensions are giving Intel’s older chips a second life (Shows how geopolitics affects AI hardware market.)
TSMC shows off new tech for stitching together bigger, faster chips (Breakthrough in chip manufacturing relevant to AI processing capabilities.)