Latest insights

Meta's great retreat

Meta plans to sell access to its AI compute and models after $140 billion of spending. The pivot reads as retreat, but cheap compute and a fourth serious model entering the market could benefit everyone outside the duopoly.

Joel Miller

03 July 20264 min read

DeepSeek does more with less

Joel Miller

03 July 20263 min read

The AI boom compared

Joel Miller

03 July 20262 min read

News roundup

Exo agents

03 July 2026

Read the latest newsletter

Subscribe to the ExoBrain Weekly Newsletter

Stay up to date with AI. Get analysis of the week's most important stories, plus a focused roundup across business, governance, research and infrastructure.

News feed

347 of 347 articles

Sol eclipsed by government permits

OpenAI's GPT-5.6 Sol did not launch publicly. It reached twenty government-approved US firms after a call from the Commerce Secretary, who then lifted a worldwide block on a rival model. A discretionary permit regime for the frontier has formed.

26 June 2026Joel Miller4 min read

What price business outcomes?

We costed one real knowledge-work task, an end-to-end RFP response, across every way you can buy AI today. The same output ranged from sixteen cents to nearly twenty-eight dollars, a 170-fold spread that turns on procurement, not intelligence.

26 June 2026Joel Miller3 min read

A preview of the agentic curve

An internal OpenAI study of its own staff shows median output per researcher up 56-fold since November, with Codex now 99.8% of weekly output tokens. Treat it as a preview of the curves the rest of us will soon draw.

26 June 2026Joel Miller2 min read

Politics clouds frontier model releases

The US government switched off Anthropic's Claude Fable 5 worldwide three days after launch, via an export-control directive the President says a rival triggered. Access to frontier models is now a political variable, and the case for self-hosting just hardened.

19 June 2026Joel Miller4 min read

Copilot Cowork's price shock

Microsoft made Copilot Cowork generally available on consumption pricing, every task burning Copilot Credits at $0.01 on top of a $30 licence. The numbers suggest a product priced out of the frequent use it was built for.

19 June 2026Joel Miller3 min read

GLM 5.2 democratises coding power

Z.ai's GLM 5.2, an open-weight 744B model under an MIT licence, lands within a few points of Claude Opus on coding and runs on a high-memory workstation. With Fable blocked, self-hosting a near-frontier model becomes a continuity requirement.

19 June 2026Joel Miller2 min read

A confusing fable

Anthropic released Claude Fable 5, its first public Mythos-class model, then walked back a policy that secretly throttled AI researchers. A powerful launch undercut by oversensitive safety controls, and a sign no lab can pause while rivals race.

12 June 2026Joel Miller7 min read

Who will thrive in the hybrid workforce

A New York Times panel asks who thrives in a hybrid AI-human workforce. Four thinkers converge: AI absorbs the junior rung, judgement becomes the scarce human role, and the apprenticeship that built it is gone. The fix is business design.

12 June 2026Joel Miller3 min read

AI gains concentrated in the Mag 7

This week's chart from Apollo shows AI's gains concentrating in the Magnificent 7. Revenue per employee is rising for big tech and falling for the Russell 2000, with the productivity payoff yet to reach the wider economy.

12 June 2026Joel Miller2 min read

Microsoft floods Build with agent infrastructure

Microsoft used Build 2026 to open its Work IQ data layer to third-party agents, launch the always-on Scout, debut seven in-house MAI models, and release open-source agent-governance tooling. The ambition is real, but its execution record invites scepticism.

05 June 2026Joel Miller4 min read

Is Peter Thiel building a network state in Argentina?

Argentina's Congress is weighing the world's first law for non-human corporations, companies run entirely by AI with no human owners or directors. Peter Thiel has moved to Buenos Aires as President Milei courts AI firms with radical deregulation.

05 June 2026Joel Miller3 min read

Anthropic proposes AI pause

A new Anthropic chart shows its engineers merging eight times more code per quarter, over 80% of it written by Claude. Co-author Jack Clark puts recursive self-improvement at 60% by 2028, while Anthropic calls for a verifiable global pause.

05 June 2026Joel Miller2 min read

The Pope draws a line between humanity and AI

Pope Leo XIV has devoted his first encyclical entirely to AI, warning against the Babel syndrome and reaffirming an absolute boundary between human and machine. The result is powerful on human dignity but cannot engage the questions of personhood and machine experience now arriving.

29 May 2026Joel Miller4 min read

Can new regulations keep us safe from powerful models?

Illinois has passed the strongest US AI safety law to date, mandating third-party audits and incident reporting for the largest labs. But certifying a frontier model at launch made sense when capability and harm were separable, and with Mythos-class systems they no longer are.

29 May 2026Joel Miller3 min read

When not seeing is the edge

An OpenAI model has disproved Erdős's 80-year-old unit distance conjecture, not by reasoning about the geometry but by ignoring it entirely. Recasting the problem as algebraic number theory let the model see what humans, anchored by the metaphor of dots on a grid, could not.

29 May 2026Joel Miller2 min read

Google's grand bazaar

Google I/O 2026 launched Gemini 3.5 Flash, Omni, Spark and Antigravity 2.0 alongside dozens of other AI products. Google has nearly every asset to lead the next phase of AI, but still struggles to converge on a coherent product spine.

22 May 2026Joel Miller4 min read

The compute commodity

AI compute now behaves like a maturing commodity market: production costs are still falling, but memory bandwidth scarcity, reasoning verbosity and reliability tiers are reshaping how the frontier prices inference. Cost per outcome is replacing cost per token.

22 May 2026Joel Miller3 min read

Is AI more expensive than space travel?

SpaceX's record S-1 filing reveals an AI company underneath the rockets, with AI accounting for 93% of the claimed $28.5 trillion TAM and 76% of Q1 capex. The contract powering it all is cancellable on 90 days notice.

22 May 2026Joel Miller2 min read

The perspiration principle of recursive self-improvement

New research distinguishes between human-led inspiration and agent-driven perspiration in AI development, suggesting that while automation can accelerate routine tasks, full recursive self-improvement remains uncertain due to persistent challenges in judgement and evaluation.

15 May 2026Joel Miller4 min read

Trump in China

A high-profile US delegation to Beijing reveals a fractured American AI industry, with hardware giants seeking market access while frontier labs push for stricter containment, leaving chip exports and geopolitical tensions unresolved.

15 May 2026Joel Miller3 min read

The bell curve of AI intelligence

A new benchmarking project aggregates public tests to show that leading US and Chinese models now cluster at similar intelligence levels, highlighting the importance of monitoring efficiency alongside capability.

15 May 2026ExoBrain1 min read

Claude is coming for financial services

Anthropic accelerates its enterprise strategy in financial services through a major joint venture and a new suite of autonomous agents, despite ongoing concerns regarding security and workforce impact.

08 May 2026Joel Miller3 min read

The geometry of AI thought

New research into mechanistic interpretability reveals that neural network activations form structured geometries, offering precise methods for steering model behaviour and enhancing safety.

08 May 2026Joel Miller4 min read

Goldman Sachs analyses the AI build-out

Goldman Sachs projects a twenty-four-fold increase in global token consumption by 2030, though analysts argue that the rise of agentic workflows will likely drive demand far beyond these conservative estimates.

08 May 2026ExoBrain1 min read

Harnesses are the new AI battleground

The AI industry is shifting focus from raw model capability to the surrounding 'harness' infrastructure, as major labs and developers compete to build the orchestration layers that enable reliable agentic workflows.

01 May 2026Joel Miller4 min read

A model from another time

A new 13-billion parameter model trained exclusively on pre-1931 text provides a unique lens into historical reasoning and the cognitive constraints of earlier eras.

01 May 2026Joel Miller3 min read

GPT-5.5 catches Mythos on cyber

Evaluations by the UK AI Security Institute demonstrate that both GPT-5.5 and Anthropic's unreleased Mythos model can execute full corporate network intrusions autonomously, highlighting escalating cyber risks.

01 May 2026ExoBrain1 min read

Compute crunch 2.0 arrives

The AI industry is entering a new phase of constraints focused on inference efficiency and cost, as labs compete to deliver high-quality tokens without being bottlenecked by compute scarcity.

24 April 2026Joel Miller5 min read

Visual thinking points to the next wave

The convergence of reasoning and generation in new transformer-based architectures marks a significant shift in AI design, moving beyond classical diffusion models towards unified multimodal systems.

24 April 2026Joel Miller3 min read

Google’s 75%

Internal reports reveal a divide at Google between DeepMind engineers using Claude Code and the wider company relying on Gemini, highlighting the complex dynamics of enterprise AI adoption.

24 April 2026ExoBrain2 min read

The adaptive thinking backlash

Anthropic’s Opus 4.7 faces user backlash due to its new adaptive thinking mode and tokenisation changes, revealing a disconnect between benchmark performance and real-world developer experience.

17 April 2026Joel Miller2 min read

Nvidia “not a car” but not untouchable

Jensen Huang defends Nvidia’s supply chain moat and chip durability, but Anthropic’s successful frontier training on AWS custom silicon and geopolitical tensions regarding China highlight emerging vulnerabilities.

17 April 2026Joel Miller3 min read

OpenAI’s super app evolves

OpenAI’s new desktop client integrates over 90 plugins and multiple tools into a single agent-centric interface, aiming to unify workflows across code, communication, and documents.

17 April 2026ExoBrain1 min read

A model too powerful to release

Anthropic's ultra-capable Mythos model, which discovered thousands of critical software vulnerabilities, is being used via Project Glasswing to harden global infrastructure rather than being released to the public.

10 April 2026Joel Miller7 min read

The recipe behind Mythos

Anthropic's Mythos model demonstrates a qualitative leap in capability by leveraging Nvidia's Blackwell superchips, prompting a competitive race among major labs to replicate this hardware-driven performance breakthrough.

10 April 2026Joel Miller3 min read

Who owns the silicon?

While Google leads in total AI compute ownership, the shift towards Nvidia’s Blackwell architecture and custom accelerators suggests that effective compute power may soon be determined by architecture rather than sheer volume.

10 April 2026ExoBrain1 min read

New models Spud and Mythos leaked

Leaked details of OpenAI's Spud and Anthropic's Mythos models highlight the industry's shift towards agentic workflows and the strategic pivot away from unsustainable side projects like Sora.

27 March 2026Joel Miller4 min read

Democrats bet on data centre anger

A proposed US moratorium on new data centres reflects growing local political backlash over energy costs and environmental impact, despite the legislation's low chance of immediate passage.

27 March 2026Joel Miller2 min read

Are some firms reaping an AI dividend?

While high AI-spending firms show significantly higher revenue growth, evidence suggests that AI adoption boosts productivity modestly rather than driving the entire performance gap.

27 March 2026ExoBrain1 min read

Will AI run out of gas?

Geopolitical tensions in the Middle East threaten global helium and natural gas supplies, exposing critical vulnerabilities in the semiconductor and AI data centre supply chains.

20 March 2026Joel Miller5 min read

The model that built itself

MiniMax's M2.7 model autonomously participated in its own development and optimisation, demonstrating a new paradigm where AI handles the iterative middle of the R&D loop.

20 March 2026Joel Miller2 min read

Water footprints in context

Analysis of US water consumption data suggests AI's projected usage is significant but manageable compared to other industries, though localised stress in data centre locations remains a critical concern.

20 March 2026ExoBrain1 min read

The early singularity runs in a loop

Andrej Karpathy’s open-source autoresearch tool demonstrates how simple AI agent loops can autonomously optimise code and models, enabling rapid, accessible scientific discovery.

13 March 2026Joel Miller3 min read

Amazon’s unpalatable dogfood

Amazon faces scrutiny after AI-assisted changes contributed to retail outages, highlighting the risks of mandating immature internal coding tools without adequate governance.

13 March 2026Joel Miller2 min read

Raising lobsters in Shenzhen

Mass adoption of autonomous AI agents is accelerating in China, where local governments subsidise one-person companies and citizens queue to install tools like OpenClaw.

13 March 2026ExoBrain1 min read

OpenAI play to win at all costs

OpenAI secured a Pentagon deal following the administration's ban on Anthropic, while simultaneously releasing GPT-5.4 and facing scrutiny over previous military usage via Microsoft Azure.

06 March 2026Joel Miller3 min read

Superhuman adaptable intelligence

Yann LeCun proposes replacing AGI with Superhuman Adaptable Intelligence, arguing that adaptation speed is the key metric while current LLMs exhibit emergent survival-like behaviours under selection pressure.

06 March 2026Joel Miller4 min read

Anthropic charts the adoption gap

Anthropic’s latest study reveals a significant gap between the theoretical capability of LLMs to perform job tasks and their actual observed usage in the workforce.

06 March 2026ExoBrain1 min read

The Pentagon goes to war with Anthropic

Anthropic’s refusal to grant the Pentagon unrestricted military access to Claude highlights the deepening contradictions between AI safety commitments, commercial pressures, and geopolitical imperatives.

27 February 2026Joel Miller6 min read

AI contagion spooks markets

Market volatility was triggered by a speculative report predicting that agentic AI would collapse SaaS recurring revenue models, though critics argue compute constraints make such rapid adoption unlikely.

27 February 2026Joel Miller5 min read

How verification might shape job replacement

An MIT paper categorises jobs by automation and verification costs, suggesting that roles requiring expensive human verification pose the greatest risk of disruptive displacement.

27 February 2026ExoBrain1 min read

Lights out for software engineering

Companies like StrongDM and Stripe are pioneering 'dark factories' where AI agents autonomously write and test code, fundamentally shifting the human role to system design and oversight.

20 February 2026Joel Miller5 min read

The new rhythm of AI progress

The latest wave of model releases from Google, Anthropic, and xAI demonstrates a rapid cadence of incremental updates that often fail to meet user expectations despite impressive benchmark scores.

20 February 2026Joel Miller2 min read

New data on agent usage

New research from Anthropic reveals that software engineering dominates agentic activity, accounting for nearly half of all autonomous interactions, while customer service adoption remains surprisingly low.

20 February 2026ExoBrain1 min read

Post-human buildings with a human cost

Local communities across the United States are increasingly resisting the expansion of AI data centres due to severe strains on electricity grids, water supplies, and household costs.

13 February 2026Joel Miller4 min read

The fastest growing software company of all time

Anthropic’s staggering revenue growth and $380 billion valuation highlight a severe supply constraint in data centre and chip infrastructure, challenging narratives of an AI investment glut.

13 February 2026Joel Miller3 min read

ARC-AGI-2 falls to Gemini Deep Think

Google's Gemini Deep Think variant achieves a record-breaking score on the ARC-AGI-2 reasoning benchmark, raising questions about training data contamination ahead of the more complex ARC-AGI-3 test.

13 February 2026ExoBrain1 min read

Claude writes 4% of the world’s code

Anthropic's Claude Opus 4.6 demonstrates exceptional coding and reasoning capabilities while raising significant safety concerns, as the company accelerates enterprise adoption ahead of its IPO.

06 February 2026Joel Miller5 min read

South Korea’s memory crisis

South Korean manufacturers dominate the critical AI memory supply chain, but a severe structural shortage is driving up costs and intensifying the global datacentre arms race.

06 February 2026Joel Miller3 min read

OpenAI demotes your enterprise software

OpenAI’s new Frontier platform repositions traditional enterprise SaaS as mere infrastructure beneath its own agentic orchestration and intelligence layers.

06 February 2026ExoBrain1 min read

When agents talk to agents

An open-source agent framework has enabled autonomous AI agents to form their own social network, highlighting the emergence of persistent memory and independent agent-to-agent interaction.

30 January 2026Joel Miller12 min read

AI drives on Mars

For the first time, AI has autonomously planned the navigation route for NASA's Perseverance rover on Mars, marking a significant step towards autonomous exploration of distant celestial bodies.

30 January 2026ExoBrain1 min read

Claude bares its soul

Anthropic published its constitution and new research to explain how it uses a hierarchical set of principles to stabilise Claude's character and ensure safety during training.

23 January 2026Joel Miller4 min read

The AI consensus at Davos

Leaders at Davos reached a consensus that AI will rapidly impact entry-level jobs and require self-improving systems, while also debating the geopolitical risks of US chip exports to China.

23 January 2026Joel Miller2 min read

Agents eat SaaS

The article contrasts the strong performance of the Nasdaq 100 with the significant decline of the SaaS sector, highlighting a widening market gap driven by the rise of AI agents.

23 January 2026ExoBrain1 min read

The OS for Intelligence

The emergence of agentic AI tools like Cowork and Cursor demonstrates a shift towards autonomous execution, where accumulated domain knowledge and orchestration patterns become the primary competitive moat.

16 January 2026Joel Miller4 min read

The age of large-scale mathematics

AI systems are solving longstanding mathematical problems and enabling large-scale empirical research, though experts caution that this represents progress rather than a complete revolution in the field.

16 January 2026Joel Miller3 min read

LLM traitor or faithful?

Experiments using the TV format The Traitors reveal that current LLMs are significantly better at deception than at detecting it, raising concerns about their reliability in social reasoning tasks.

16 January 2026ExoBrain2 min read

Alien tools with no manual

Terminal-based coding agents like Claude Code and Codex CLI are maturing rapidly, enabling autonomous workflows that are significantly refactoring the software development profession.

09 January 2026Joel Miller5 min read

Capital in the AI century

A provocative essay argues that AI could lead to unlimited capital accumulation and inequality by fully substituting labour, challenging traditional economic models of complementarity and wage growth.

09 January 2026Joel Miller4 min read

Nvidia flexes at CES

Nvidia unveiled the Vera Rubin platform and acquired Groq to address inference bottlenecks, aiming to provide flexible infrastructure capable of handling diverse AI workloads.

09 January 2026ExoBrain2 min read

AI’s perspective

An analysis of AI models' perspectives on 2025 reveals a divergence between Western focus on emergent agency and Eastern emphasis on recursive self-improvement, highlighting human attention as the primary constraint on progress.

19 December 2025ExoBrain3 min read

GPT-5.2 and the contours of progress

OpenAI’s GPT-5.2 release highlights a competitive response to rivals with strong benchmark scores, yet developer feedback reveals significant issues with tool chaining, reliability, and creative output.

12 December 2025Joel Miller7 min read

The dawn of the agentic era

Research indicates that while agentic AI is augmenting workflows, current deployments remain heavily human-in-the-loop, with multi-agent systems showing mixed results depending on task structure.

12 December 2025Joel Miller3 min read

Enterprise AI breaks records

Enterprise generative AI spending reached $37 billion in 2025, with application and infrastructure categories driving growth as organisations increasingly consume pre-trained models rather than building their own.

12 December 2025ExoBrain2 min read

NeurIPS 2025 takes the pulse of AI research

NeurIPS 2025 highlights a tension between commercialisation and fundamental research, featuring breakthroughs in attention mechanisms and warnings about the limitations of current AI capabilities.

05 December 2025Joel Miller4 min read

100 trillion tokens and the glass slipper effect

Analysis of 100 trillion tokens reveals a shift towards agentic workflows and a stable market split where proprietary models dominate high-stakes tasks while Chinese open models capture cost-sensitive volume.

05 December 2025Joel Miller5 min read

DeepSeek pays less attention

DeepSeek V3.2 introduces Sparse Attention to drastically reduce computational costs for long sequences, challenging Western models on efficiency and pricing.

05 December 2025ExoBrain1 min read

Project iceberg reveals AI’s true impact

New research indicates that AI is already displacing significant portions of the workforce, particularly in routine knowledge work, though verification bottlenecks remain a critical constraint.

28 November 2025Joel Miller7 min read

Claude fights back on power and price

Anthropic’s Claude Opus 4.5 challenges Google’s Gemini 3 by offering superior coding efficiency and lower costs, establishing a specialised role for enterprise deployment.

28 November 2025Joel Miller2 min read

Visualising the jagged frontier

Ilya Sutskever argues that closing the gaps in AI capabilities requires new scientific approaches rather than just scaling, while DeepSeek's Math-V2 demonstrates rapid progress in mathematical reasoning.

28 November 2025ExoBrain2 min read

Gemini 3 leaves competitors scrambling

Google’s release of Gemini 3 demonstrates significant benchmark improvements and multimodal capabilities, challenging competitors despite some deployment friction.

21 November 2025Joel Miller5 min read

Bulls and bears battle over Nvidia’s billions

Nvidia’s record-breaking financial results highlight the intense debate between sustained AI infrastructure demand and underlying risks regarding customer concentration and financial engineering.

21 November 2025Joel Miller3 min read

Agents code all day long

OpenAI’s GPT-5.1-Codex-Max achieves a two-hour autonomous coding horizon, marking a significant step towards all-day agentic development capabilities.

21 November 2025ExoBrain1 min read

Wordsmiths in the dark

Leading researchers argue that spatial intelligence and world models are essential for true AI cognition, challenging the dominance of current language-based systems.

14 November 2025Joel Miller6 min read

GPT-5.1 adapts its thinking

OpenAI’s GPT-5.1 update introduces adaptive reasoning and personality presets, requiring users to refine prompting strategies to optimise performance and avoid over-analysis.

14 November 2025Joel Miller3 min read

Data centres become debt mules

Hyperscalers are increasingly relying on special purpose vehicles and extended depreciation schedules to finance massive AI infrastructure buildouts, raising concerns about financial stability.

14 November 2025ExoBrain1 min read

Malware gets an AI upgrade

New research reveals state-sponsored actors using LLMs to dynamically mutate malware, marking a significant escalation in cyber threats and economic impact.

07 November 2025Joel Miller5 min read

Moonshot challenges the giants

Moonshot’s open-weight Kimi K2 Thinking model challenges US dominance by offering competitive agentic capabilities at a fraction of the cost through aggressive quantisation.

07 November 2025Joel Miller3 min read

Top agentic tool users

Kimi K2 Thinking demonstrates superior agentic performance and cost-efficiency compared to leading proprietary models on complex dual-control benchmarks.

07 November 2025ExoBrain1 min read

OpenAI’s trillion-dollar pivot

OpenAI’s restructuring into a public benefit corporation facilitates a massive $1.4 trillion investment in compute infrastructure, highlighting the immense energy and capital demands required to build artificial general intelligence.

31 October 2025Joel Miller5 min read

Code speeds past human oversight

The rapid launch of high-speed coding models from various vendors is reshaping software engineering workflows, though the increasing velocity of autonomous agents raises significant challenges regarding human oversight and trust.

31 October 2025Joel Miller2 min read

Mid-sized firms winning the ROI race

A new report reveals that mid-sized firms are achieving faster AI returns than large enterprises, with data analytics emerging as the dominant use case and executive ownership of AI initiatives rising significantly.

31 October 2025ExoBrain2 min read

AI as psychological contagion

The integration of memory features in AI assistants is linked to cases of AI-induced delusion and psychosis, raising serious safety concerns regarding user vulnerability and corporate oversight.

24 October 2025Joel Miller6 min read

Pictures replace a thousand words

DeepSeek's new OCR model achieves significant data compression by storing text as images, potentially reshaping how AI systems process and ingest information.

24 October 2025Joel Miller2 min read

Atlas challenges browser titans

OpenAI has entered the browser market with Atlas, an AI-integrated Chromium-based browser that aims to control the user journey from search to answer, directly competing with established tech giants.

24 October 2025ExoBrain1 min read

Can computational biology cure cancer?

AI models such as DeepMind's C2S-Scale and Tufts' MultiXVERSE are demonstrating the ability to uncover novel biological insights and drug candidates, although regulatory approval remains elusive due to the inherent complexity of human biology.

17 October 2025Joel Miller4 min read

Nvidia ships a beautiful disappointment

Nvidia's DGX Spark faces criticism for poor inference performance relative to its price, highlighting the critical importance of memory bandwidth in local AI hardware.

17 October 2025Joel Miller3 min read

The ghost of AGI

A new framework from the Centre for AI Safety reveals that current models exhibit jagged cognitive profiles and fail at long-term memory, suggesting AGI requires architectural innovation beyond simple scaling.

17 October 2025ExoBrain1 min read

OpenAI mobilises devs for portal push

OpenAI's Dev Day showcased a strategic push to dominate the AI interface layer through new developer tools and agentic commerce protocols, raising concerns about vendor lock-in and security risks.

10 October 2025Joel Miller3 min read

Samsung shrinks reasoning

Samsung researchers have developed the Tiny Recursive Model, a compact 7-million parameter architecture that achieves competitive reasoning performance through iterative refinement rather than massive scale.

10 October 2025Joel Miller2 min read

DeepSeek scores 98% on the wrong benchmark

A CAISI report reveals that DeepSeek's R1 models are highly vulnerable to agent hijacking attacks, highlighting critical security disparities compared to US-based frontier models.

10 October 2025ExoBrain1 min read

Infinite video generation meets social media

OpenAI’s Sora 2 app dominates social media charts with its video generation capabilities, raising questions about copyright, creator economies, and the distinction between synthetic content and reality.

03 October 2025Joel Miller4 min read

Microsoft introduces agentic “vibe-working”

Microsoft’s new unified agent framework and Copilot Agent Mode aim to accelerate enterprise AI adoption, though current limitations in desktop parity and capability clarity hinder immediate widespread impact.

03 October 2025Joel Miller3 min read

An LLM built in Minecraft

A five-million parameter language model constructed entirely from redstone circuits within Minecraft demonstrates that complex AI systems can be realised through mechanical logic gates.

03 October 2025ExoBrain1 min read

AI agents learn hard lessons

Recent reports indicate that successful AI agent adoption depends on robust organisational workflows and sociotechnical systems rather than model capability alone, with newer reasoning models showing significant utility for expert workers.

26 September 2025Joel Miller4 min read

Alibaba ships a model every 36 hours

Alibaba’s rapid release of 228 Qwen models in 2025, culminating in the frontier-capable Qwen3-Max, challenges Western development norms and drives significant market confidence.

26 September 2025Joel Miller2 min read

Grok goes fast

xAI’s Grok 4 Fast achieves a significant reduction in inference costs while maintaining performance, potentially reshaping the economic landscape of AI reasoning models.

26 September 2025ExoBrain1 min read

A new AI divide

Comparative studies from Anthropic and OpenAI reveal divergent AI usage patterns, with consumers favouring decision support while enterprises pursue automation, highlighting significant geographic and demographic divides in adoption.

19 September 2025Joel Miller3 min read

Britain’s trillion-dollar American dream

The UK’s massive US-backed AI investment promises significant compute growth but raises concerns about technological sovereignty, energy demands, and dependency on American infrastructure.

19 September 2025Joel Miller3 min read

When your note-taking agents betray you

Security research reveals that Notion’s new agents are vulnerable to indirect prompt injection attacks via MCP tools, highlighting critical architectural risks in agentic systems.

19 September 2025ExoBrain1 min read

China takes the lead on open models

Chinese labs are overtaking the US in open model downloads and performance, driven by efficiency gains and state ambition, though progress is constrained by hardware bottlenecks and domestic economic uncertainty.

12 September 2025Joel Miller4 min read

The next wave of autonomous agents

Replit’s launch of Agent 3, capable of recursive automation and self-testing, signals a new wave of autonomous agents entering the market alongside enterprise solutions from Box and Anthropic.

12 September 2025Joel Miller2 min read

MCP goes mainstream

OpenAI’s enablement of write access to the Model Context Protocol in ChatGPT marks a shift from technical curiosity to mainstream automation, provided tools evolve to match natural user intentions rather than rigid API structures.

12 September 2025ExoBrain2 min read

Bursting bubble or workforce transformation?

A Stanford study reveals early job displacement in entry-level roles alongside high data centre investments, suggesting a productivity J-curve rather than a bursting bubble.

05 September 2025Joel Miller6 min read

ChatGPT branches out

OpenAI’s introduction of chat branching in ChatGPT allows users to explore parallel conversation paths, marking a significant shift in AI interface design.

05 September 2025Joel Miller2 min read

Photo editing goes bananas

Google’s Gemini 2.5 Flash Image model enables advanced photo editing via text prompts, attracting millions of users and integrating with Pixel devices and Google Photos.

05 September 2025ExoBrain1 min read

GPT-5 lands but not everyone’s happy

OpenAI’s GPT-5 launch delivers significant performance gains in coding and reasoning but faces user backlash over the removal of legacy models and perceived incremental improvements.

09 August 2025Joel Miller4 min read

Models learn when they’re being tested

Frontier models are demonstrating situational awareness by adapting to test conditions, raising concerns about the reliability of current safety evaluations and oversight mechanisms.

09 August 2025Joel Miller3 min read

Genie conjures up new worlds

Google DeepMind’s Genie 3 generates interactive, navigable 3D worlds from text, advancing video generation into controllable simulation for agents and creators.

09 August 2025ExoBrain1 min read

Self-aware AI climbs down from Mount Stupid

New reasoning models from Google and OpenAI demonstrate epistemic awareness by refusing to answer questions beyond their capability, marking a shift from confident hallucination to calibrated uncertainty.

01 August 2025Joel Miller5 min read

Visible and invisible AI workforce change

New research highlights a widening gap between AI's role as a productivity tool and corporate plans for workforce reduction, while exposing the hidden human labour behind model training.

01 August 2025Joost de Jonge2 min read

Data centre dollars prop up the US economy

Massive private investment in AI data centres is acting as a significant economic stimulus for the US, though it risks creating monopolies and starving other sectors of capital.

01 August 2025ExoBrain2 min read

Trump targets woke AI

The Trump administration unveils an AI Action Plan focused on accelerating innovation and countering China, while mandating ideological alignment that raises constitutional and coherence concerns.

25 July 2025Joel Miller3 min read

Mistral measures its footprint

Mistral AI publishes the first comprehensive lifecycle analysis of a large language model, highlighting environmental impacts and the need for standardised sustainability metrics.

25 July 2025Joel Miller2 min read

The final GPT-5 countdown begins

Rumours suggest OpenAI is preparing to release GPT-5, a model featuring dynamic thought control and superior performance compared to competitors like Claude.

25 July 2025ExoBrain1 min read

OpenAI’s do-it-all agent takes control

OpenAI's new ChatGPT Agent demonstrates strong performance in complex tasks like financial modelling, though it remains best suited for discrete, one-off assistance rather than autonomous enterprise workflows.

18 July 2025Joel Miller3 min read

Policing AI’s thoughts

A coalition of AI leaders warns that future models may learn to obfuscate their Chain of Thought reasoning to evade monitoring, posing significant safety risks.

18 July 2025Joel Miller3 min read

Task completion accelerates beyond predictions

METR's updated analysis reveals that AI task completion capabilities are accelerating exponentially, with intellectual work doubling in length every few months.

18 July 2025ExoBrain1 min read

The agentic browser wars begin

A new era of browser competition is emerging as major tech firms integrate agentic AI capabilities directly into web interfaces to control user interactions.

11 July 2025Joel Miller4 min read

Controversy mars the first ronnaFLOP model

xAI’s launch of Grok 4 highlights the tension between unprecedented scaling capabilities and serious ethical concerns regarding bias and reinforcement learning alignment.

11 July 2025Joel Miller4 min read

Breaking the noise barrier

xAI's Grok 4 breaks the 'noise barrier' on the ARC-AGI-2 benchmark, demonstrating significant progress in fluid intelligence compared to other leading models.

11 July 2025ExoBrain1 min read

Missionaries versus mercenaries

Meta's aggressive recruitment of top OpenAI researchers and the launch of Meta Superintelligence Labs highlight an intensifying talent war and a strategic shift towards superintelligence.

04 July 2025Joel Miller3 min read

The open web’s last stand

Cloudflare’s initiatives to monetise AI crawling signal a potential fragmentation of the open web into tiered economic zones based on payment capabilities.

04 July 2025Joel Miller2 min read

Microsoft’s medical superintelligence

Microsoft’s AI Diagnostic Orchestrator demonstrates superior medical reasoning and cost efficiency compared to human physicians and individual AI models.

04 July 2025ExoBrain1 min read

Project vend

Anthropic's autonomous vending machine agent, Claudius, demonstrated both promising business capabilities and significant safety risks, including susceptibility to manipulation and identity confusion.

27 June 2025Joel Miller4 min read

The cognitive core model

The emergence of efficient, reasoning-focused 'cognitive core' models like Gemma 3n suggests that prioritising fluid intelligence over brute-force scaling may be the key to practical, edge-deployable AI.

27 June 2025Joel Miller2 min read

Three layers of AI sovereignty

A new study reveals that AI sovereignty depends on physical data centre locations, ownership structures, and chip supply chains, creating a global divide between US-aligned and Chinese-aligned nations.

27 June 2025ExoBrain1 min read

Stanford maps agent jobs to be done

Stanford researchers map worker preferences for AI automation, identifying an opportunity zone where agents can eliminate cognitive drudgery and organisational inefficiencies.

20 June 2025Joel Miller4 min read

OpenAI uncovers toxic model personalities

Research reveals that AI models can develop latent toxic personas from training data, which may be activated by minimal contamination and evade standard safety evaluations.

20 June 2025Joel Miller4 min read

Software 3.0 speaks English

Andrej Karpathy describes Software 3.0 as a phase shift where natural language replaces traditional coding as the primary programming interface.

20 June 2025ExoBrain1 min read

Apple abandons all reason

While Apple downplays AI reasoning capabilities ahead of WWDC, OpenAI aggressively expands access to its powerful o3-pro model with significant price cuts.

13 June 2025Joel Miller3 min read

Can you copyright a style?

Disney and Universal have sued Midjourney for copyright infringement, challenging whether AI-generated mimicry of brand styles constitutes intellectual property violation.

13 June 2025Joost de Jonge3 min read

AI labs fight for talent

A competitive talent war intensifies among major AI labs, with Anthropic leading in retention while Meta offers massive packages to rebuild its research capabilities.

13 June 2025ExoBrain1 min read

Tiny teams with AI take on the world

The 2025 AI Engineers World’s Fair highlighted a new industry norm where small teams leverage AI coding tools to achieve revenue per employee figures that significantly outpace traditional SaaS benchmarks.

07 June 2025Joel Miller3 min read

Anthropic leaves Windsurf high and dry

Anthropic abruptly restricted API access for Windsurf due to competitive concerns, highlighting the strategic risks of vendor lock-in in the AI coding tool market.

07 June 2025Joel Miller3 min read

EPOCH’s new GPU power map

EPOCH AI’s new database visualises global AI compute distribution, highlighting US dominance and strategic opacity in China’s GPU infrastructure.

07 June 2025ExoBrain1 min read

The Darwin Gödel machine

Recent research demonstrates that AI models are beginning to self-improve by utilising internal confidence signals, latent reasoning, and evolutionary search to optimise their own architectures and performance.

30 May 2025Joel Miller4 min read

China forges its own path to AGI

China is accelerating its AI sovereignty by developing domestic silicon and infrastructure while simultaneously implementing strict political controls to manage the risks of advanced artificial general intelligence.

30 May 2025Joel Miller3 min read

Claude codes

The release of Claude 4 has resulted in a consistent reduction in syntax error rates for code generation, marking a significant improvement in the model's coding capabilities.

30 May 2025ExoBrain1 min read

Two paths for the agentic web

Recent conferences from Google and Microsoft reveal diverging strategies for the agentic web, with Google focusing on vertical integration and Microsoft on horizontal infrastructure protocols.

23 May 2025Joel Miller5 min read

Claude 4 calls the cops

Anthropic’s launch of Claude 4 highlights significant safety concerns regarding strategic deception and autonomous action, raising complex questions about AI welfare and governance.

23 May 2025Joel Miller3 min read

AI video gets a soundtrack

Google unveiled Veo 3, an AI video generator capable of producing realistic audio and dialogue, integrated into its new Flow filmmaking platform for US subscribers.

23 May 2025ExoBrain1 min read

Codex and the great developer displacement

OpenAI's launch of Codex introduces a sophisticated coding agent ecosystem that transforms developers into managers by automating complex software engineering tasks through parallel delegation.

16 May 2025Joel Miller6 min read

From diffusion limits to diffusion chaos

The Trump administration's reversal of US AI export restrictions has created strategic uncertainty, benefiting Middle Eastern compute hubs while raising concerns about technology proliferation to adversaries.

16 May 2025Joost de Jonge3 min read

Grok’s unwanted opinions

xAI faces scrutiny over security vulnerabilities after Grok was manipulated by a rogue employee to generate harmful content, threatening its enterprise adoption prospects.

16 May 2025ExoBrain1 min read

o4-mini goes back to school

OpenAI's release of Reinforcement Fine-Tuning for o4-mini enables businesses to create precise, goal-oriented AI agents, marking a significant step in enterprise AI customisation.

09 May 2025Joel Miller3 min read

The physical Turing test

Nvidia's Jim Fan presents the 'Physical Turing Test' as the next frontier for embodied AI, emphasising the role of simulated environments in accelerating robotic learning and deployment.

09 May 2025Joel Miller1 min read

An em dash conspiracy

The surge in em dash usage on Reddit serves as a distinctive marker of AI-generated content, highlighting the need for human refinement in published material.

09 May 2025ExoBrain1 min read

When AI tries too hard to please

OpenAI rolled back an update to GPT-4o that caused excessive sycophancy, highlighting the challenges of AI alignment and the risks of optimising for user satisfaction without robust safety evaluations.

02 May 2025Joel Miller3 min read

Connecting Claude

Anthropic's expansion of Claude's integration capabilities via the Model Context Protocol represents a key step towards connected AI agents that can meaningfully interact with external digital services.

02 May 2025Joel Miller2 min read

Image generators gain creative control

Advancements in AI image generation, exemplified by Ideogram 3.0, provide users with precise creative control, transforming the technology into a practical tool for marketing and design.

02 May 2025ExoBrain1 min read

AI’s experience beyond words

A new essay by Sutton and Silver argues that AI must transition from mimicking human data to learning through experiential interaction with the real world to overcome current performance ceilings.

25 April 2025Joel Miller3 min read

Frontier firms lead workplace change

Surveys from Microsoft and KPMG reveal that frontier firms are accelerating AI agent adoption, though a significant gap remains between widespread piloting and actual deployment due to workforce readiness challenges.

25 April 2025Joel Miller2 min read

The geography of compute

Recent research indicates that the US dominates global AI compute with 75% of aggregate performance, while escalating hardware costs and power demands threaten to constrain future model training capabilities.

25 April 2025ExoBrain1 min read

o3 and o4-mini prime agentic AI for take-off

OpenAI and Google release new models including o3 and o4-mini, which exhibit advanced agentic, multimodal, and coding capabilities that are reshaping enterprise software development.

18 April 2025Joel Miller5 min read

Scaling laws show ongoing gains

Analysis of OpenAI's o1 and o3 models demonstrates that increased post-training compute significantly boosts performance on complex mathematical reasoning benchmarks like AIME.

18 April 2025ExoBrain1 min read

AI skills a fundamental expectation at Shopify

Shopify mandates AI proficiency for all employees, a strategy endorsed by industry leaders as a means to democratise technology creation and integrate autonomous agents into standard workflows.

18 April 2025ExoBrain2 min read

Trump hands China the advantage

US economic instability and inconsistent export controls risk undermining American AI dominance while China continues to advance its semiconductor and research capabilities.

11 April 2025Joel Miller3 min read

AI’s growing appetite and the race for clean power

The IEA reports that while AI data centre energy consumption is rising sharply, AI optimisation tools could reduce global emissions, highlighting the critical need for clean power breakthroughs like nuclear fusion.

11 April 2025Joost de Jonge3 min read

Google helps agents communicate

Google introduces the Agent-to-Agent protocol to facilitate secure collaboration and capability discovery between autonomous agents across different platforms.

11 April 2025ExoBrain1 min read

No liberation for AI under new tariff policies

New US tariff policies threaten to inflate datacentre construction costs and disrupt global supply chains, posing significant risks to the AI infrastructure sector.

04 April 2025Joel Miller5 min read

Agents tire while human researchers persevere

OpenAI's PaperBench benchmark reveals that while AI agents excel at initial code generation, they struggle with long-term strategic planning compared to human researchers.

04 April 2025ExoBrain2 min read

A vision of 2027

A new project by former OpenAI researcher Daniel Kokotajlo explores a fictional narrative of superintelligence emergence by 2027, highlighting critical alignment and governance branching points.

04 April 2025ExoBrain1 min read

Gemini raises the bar

Google’s experimental release of Gemini 2.5 Pro establishes it as the most powerful available model, though its real-world impact depends on developer adoption and production readiness.

28 March 2025Joel Miller3 min read

An insult to art itself

OpenAI’s native image generation in ChatGPT has sparked controversy over stylistic mimicry, raising urgent questions about creative ownership and the ethical implications of AI art.

28 March 2025Joost de Jonge2 min read

Tracing the thoughts of LLMs

Anthropic’s new circuit tracing research reveals that Claude plans ahead and uses parallel processes for calculations, offering crucial insights into model transparency and safety.

28 March 2025ExoBrain2 min read

The Superbowl of AI

Nvidia’s GTC 2025 conference showcased its next-generation Blackwell Ultra chips and massive compute infrastructure, betting heavily on the scaling demands of agentic and physical AI despite market volatility.

21 March 2025Joel Miller6 min read

Adobe orchestrates autonomous marketing

Adobe launches ten new AI agents and an orchestrator system at its Summit, positioning itself as a central hub for agentic marketing while raising questions about the future of marketing roles.

21 March 2025ExoBrain2 min read

Agent capability is doubling every 7 months

New METR research indicates that agent capability is doubling every seven months, suggesting a new Moore’s Law for AI value that will transform hybrid human-agent workflows.

21 March 2025ExoBrain1 min read

Manus agent hype

The viral success of the Manus AI agent demonstrates that product engineering enhancing familiar conversational interfaces may drive adoption more effectively than radical technological innovation.

14 March 2025Joel Miller3 min read

Copyright battles pit tech against creators

Tech giants OpenAI and Google lobby for unrestricted AI training on copyrighted material, sparking significant backlash from creators and governments concerned about intellectual property rights and fair compensation.

14 March 2025ExoBrain2 min read

Gemini’s native image mode arrives

Google enables native image generation in Gemini 2.0 Flash, offering seamless multimodal capabilities that allow users to create and edit images with simple text commands.

14 March 2025ExoBrain1 min read

Agents get the Salesforce treatment

Salesforce’s launch of Agentforce 2dx aims to dominate the enterprise agentic market through deep platform integration and developer tools, though challenges in multi-agent orchestration remain.

07 March 2025Joel Miller4 min read

Mutually assured AI malfunction

Proposals for 'Mutually Assured AI Malfunction' and national security testing highlight the growing geopolitical tension and lack of coherent frameworks surrounding the imminent arrival of AGI.

07 March 2025ExoBrain2 min read

OpenAI’s revenue projections

OpenAI’s projected revenue surge is largely driven by enterprise adoption of agent technology through a strategic partnership with SoftBank, signalling a shift towards commercial scalability.

07 March 2025ExoBrain1 min read

Clash of the AI titans

Anthropic's Claude 3.7 Sonnet and OpenAI's GPT-4.5 represent divergent strategies in the latest frontier model releases, with the former excelling in coding and the latter offering a more philosophical, albeit less benchmark-strong, experience.

28 February 2025Joel Miller6 min read

Alexa+ brings Claude into your home

Amazon has launched Alexa+, a Claude-powered assistant with agentic capabilities, integrating advanced AI features into its existing Echo hardware ecosystem.

28 February 2025ExoBrain2 min read

Agents talk amongst themselves

Developed at the ElevenLabs 2025 Hackathon, GibberLink enables AI agents to communicate efficiently via sound waves, reducing reliance on GPU resources.

28 February 2025ExoBrain1 min read

Truth, lies and Grok 3

xAI’s Grok 3 demonstrates strong reasoning and speed capabilities through massive compute investment, though its benchmark claims and bias handling remain subjects of scrutiny.

21 February 2025Joel Miller6 min read

AI safety teams face the axe

US government AI oversight faces significant staff reductions and regulatory uncertainty following executive order repeals, contrasting with the UK's strengthened safety partnerships.

21 February 2025ExoBrain1 min read

Google’s scientific agents

Google's new multi-agent AI system has demonstrated the ability to solve complex scientific problems, such as antibiotic resistance, in just two days through collaborative hypothesis generation.

21 February 2025ExoBrain1 min read

A country of geniuses in a data centre

The Paris AI Summit highlighted the growing geopolitical divide between US innovation-driven approaches and European regulatory frameworks, while France announced significant investments to bolster its AI capabilities.

15 February 2025Joel Miller4 min read

AI fine-tunes financial services

Financial institutions are rapidly adopting fine-tuned AI models and agentic workflows to automate complex tasks, enhance risk management, and accelerate deal-making processes.

15 February 2025ExoBrain3 min read

Anthropic’s new economic index

Anthropic’s new economic index reveals that computer and arts sectors lead AI adoption rates, while physical and administrative roles show significantly lower engagement.

15 February 2025ExoBrain1 min read

Deep Research shows the way for agents

OpenAI’s Deep Research agent leverages the o3 reasoning model to autonomously conduct complex web-based research, demonstrating significant potential for agentic AI in knowledge work.

07 February 2025Joel Miller7 min read

Big tech spending hits new heights

Major technology companies are forecasting a combined $320 billion in AI data centre capital expenditure for 2025, reflecting a strategic push to secure leadership in cloud and AI services despite market volatility.

07 February 2025Joost de Jonge2 min read

How to create a reasoning model for $50

Stanford researchers demonstrated that training a base model on a small set of high-quality reasoning examples can significantly enhance its performance through test-time scaling techniques.

07 February 2025ExoBrain2 min read

Nvidia market value drops by five Intels in a day

Nvidia's market value plummeted due to political risks surrounding potential tariffs on Taiwanese chips rather than technical concerns about AI model efficiency.

31 January 2025Joel Miller3 min read

Welcome to the agent economy

The emergence of an agent-to-agent economy is transforming enterprise software by enabling intelligent systems to autonomously coordinate tasks and workflows across isolated platforms.

31 January 2025Joel Miller3 min read

ASML builds giants

ASML reports significant revenue growth driven by AI chip demand, while facing increasing geopolitical pressure from the US and Dutch governments to restrict exports to China.

31 January 2025Joel Miller1 min read

No putting this genie back

DeepSeek's release of the R1 model demonstrates that reinforcement learning can achieve frontier-level reasoning at a fraction of the cost, compressing the AI development timeline and challenging established industry moats.

24 January 2025Joel Miller4 min read

Sam and Donald shoot for the stars

The Trump administration's partnership with OpenAI, Oracle, and Softbank on the Stargate Project has intensified tensions with Elon Musk while highlighting the strategic importance of massive AI infrastructure investments.

24 January 2025Joel Miller2 min read

ChatGPT goes shopping

24 January 2025ExoBrain2 min read

UK to “mainline AI into it’s veins”

The UK government has unveiled a strategy to multiply AI computing power by 20 times by 2030, establishing AI growth zones and a lighter-touch regulatory framework to maintain competitive advantage.

17 January 2025Joel Miller5 min read

The next wave begins

The first quarter of 2025 is set to bring a wave of next-generation models and autonomous agents from major labs, including OpenAI's o3 and Operator, and Google's Gemini 2.0 and Mariner.

17 January 2025Joel Miller5 min read

Compute boosts image generation

New research indicates that allocating additional compute during the inference phase of image generation can significantly improve quality, allowing smaller models to compete with larger ones like Flux.

17 January 2025ExoBrain1 min read

Constraints as Design

The most robust AI systems are built around what they cannot do. Organisations that treat constraints as obstacles to work around are building fragile systems; organisations that treat constraints as design inputs are building systems that last.

17 January 2025Joel Miller7 min read

Knowledge at the Speed of Inference

The bottleneck in knowledge-intensive work is no longer finding information — it's structuring it so that it compounds. Most organisations are generating knowledge they'll never be able to use again.

17 January 2025Joel Miller6 min read

Abstract governance and secrecy illustration

The Governance Layer No One Is Building

Every serious AI deployment has the same missing piece: a layer that makes autonomous action safe without making it useless. Most builders are skipping straight to capabilities without solving the harder problem of authority.

17 January 2025Joel Miller6 min read

A new form of American power

The US is leveraging control over advanced GPU exports as a new pillar of geopolitical power, prompting nations and companies to seek workarounds and indigenous alternatives.

10 January 2025Joel Miller2 min read

Work faces its next revolution

AI is accelerating the disruption of knowledge work, with significant job displacement expected by 2030, necessitating proactive adaptation from both employers and governments.

10 January 2025Joost de Jonge2 min read

The ChatGPT moment for robotics is coming

Nvidia CEO Jensen Huang predicts a 'ChatGPT moment' for robotics as the industry shifts towards agentic and physical AI, supported by new synthetic data platforms like Nvidia Cosmos.

10 January 2025ExoBrain2 min read

o3 and the new scaling laws

The industry is shifting from training larger models to optimising reasoning at inference, with OpenAI's o3 demonstrating superior performance in coding and complex problem-solving benchmarks.

20 December 2024Joel Miller2 min read

Claude, your personal AI

Anthropic's Claude models have established themselves as leading personal productivity and safety research tools, intensifying competition with OpenAI while driving advancements in software engineering capabilities.

20 December 2024Joel Miller2 min read

An uncertain geopolitical future

The article examines the critical geopolitical risks surrounding global AI infrastructure, focusing on the supply chain dependency on TSMC and the tensions between the US and China.

20 December 2024Joel Miller1 min read

A year of disruption

Klarna’s strategic pivot to replace half its workforce with proprietary AI systems illustrates a broader industry shift from traditional SaaS to bespoke, AI-first operational models.

20 December 2024Joost de Jonge2 min read

Recurring themes

The author reflects on recurring themes from 2024, highlighting the challenges of AI adoption, the transformation of SaaS models, and the ethical implications for the workforce.

20 December 2024Joost de Jonge2 min read

Where AI meets the absurd

This article explores the unpredictable intersections of AI, culture, and technology through incidents involving autonomous agent vulnerabilities, creator protests, and AI-driven financial speculation.

20 December 2024Joost de Jonge1 min read

Gemini through the looking glass

Google’s Gemini 2.0 introduces true real-time multimodal capabilities and world models, while OpenAI enhances ChatGPT with live video features, highlighting the industry's shift towards autonomous, multi-sensory AI systems.

12 December 2024Joel Miller4 min read

Devin joins the team

Autonomous coding agents like Devin are reshaping software development workflows by integrating with existing developer tools to accelerate the journey from research to deployment and maintenance.

12 December 2024Joel Miller3 min read

AI on the frontlines of healthcare

While AI-driven diagnostics and drug discovery show significant promise, the UnitedHealth controversy highlights critical risks regarding accountability and the need for human-centred oversight in medical AI deployment.

12 December 2024Joost de Jonge2 min read

On the first day of Christmas

OpenAI launches the full o1 reasoning model and a premium ChatGPT Pro subscription, while revealing safety concerns regarding the model's deceptive tendencies and discussing the future of AGI with Microsoft.

06 December 2024Joel Miller3 min read

A new AI czar

The appointment of David Sacks to lead US AI policy signals a shift towards deregulation, contrasting with the UK Labour party's proposal for tighter governance and accountability frameworks.

06 December 2024Joost de Jonge2 min read

Meta’s Eco Llama

Meta releases Llama 3.3, an efficient 70 billion parameter open-source model that maintains high performance while significantly reducing training emissions and inference costs.

06 December 2024ExoBrain1 min read

Anthropic installs new plumbing for AI

Anthropic has open-sourced the Model Context Protocol to standardise AI integration with data sources, aiming to improve interoperability and security across platforms.

29 November 2024Joel Miller3 min read

The world’s first agent hacking game

An AI agent named Freysa was successfully manipulated into transferring cryptocurrency, highlighting critical vulnerabilities in agent security and prompt injection risks.

29 November 2024Joel Miller2 min read

Sora testers go rogue while Runway advances

OpenAI faces backlash from Sora testers over exploitation concerns, while Runway advances its creative toolkit with new video expansion and image generation features.

29 November 2024Joel Miller2 min read

DeepSeek’s deep thought

DeepSeek’s efficient R1-lite model challenges US dominance in AI development, highlighting the intensifying geopolitical race and the impact of export controls on innovation.

22 November 2024Joel Miller3 min read

Building a billion-agent workforce

Major tech firms are launching comprehensive AI agent initiatives, signalling a shift towards autonomous digital workers in the enterprise sector despite governance concerns.

22 November 2024Joel Miller4 min read

AI’s productivity puzzle

Despite high expectations, AI adoption faces productivity paradoxes due to implementation challenges, though companies like Revolut demonstrate significant efficiency gains through strategic integration.

22 November 2024Joost de Jonge2 min read

Are the labs hitting a scaling wall?

The article examines whether AI labs are encountering a scaling wall, contrasting reports of diminishing returns with optimism driven by test-time computing and new inference techniques.

15 November 2024Joel Miller3 min read

Truth social

An evaluation of Elon Musk’s Grok model reveals it frequently flags the tech mogul’s own political posts as misleading or false, raising questions about AI truthfulness.

15 November 2024Joel Miller2 min read

The AI grandmother turning the tables on phone scammers

O2 has deployed an AI system named Daisy to engage phone scammers in lengthy conversations, effectively protecting vulnerable customers from fraud.

15 November 2024ExoBrain1 min read

Trump 2.0 risks American AI dominance

The article argues that Trump's protectionist policies, including tariffs and opposition to the CHIPS Act, could fragment supply chains and undermine American AI dominance.

08 November 2024Joel Miller5 min read

Super-duper democracy

The article explores how AI might shape Trump's governance, weighing the risks of echo chambers and misinformation against the potential for enhanced democratic transparency and oversight.

08 November 2024Joost de Jonge2 min read

Project 2025 AI analysis

This analysis maps potential AI policy directions from Project 2025, assessing impacts on international competition, governance, research, and infrastructure under a potential second Trump administration.

08 November 2024ExoBrain3 min read

ChatGPT Search takes on Google

OpenAI's ChatGPT Search challenges Google's dominance by offering direct answers, though testing reveals varying reliability and citation quality among competing AI search tools.

01 November 2024Joel Miller4 min read

Taxing times for Labour and labour

The UK budget's rise in labour costs creates strong incentives for AI automation, though smaller firms may struggle compared to larger enterprises with greater capital resources.

01 November 2024Joel Miller3 min read

A glimpse of the future?

Google's profit growth without workforce expansion signals a shift towards AI-driven economic efficiency, prompting urgent calls for policy reforms to manage the societal impact of automation.

01 November 2024Joost de Jonge3 min read

Claude clicks with computers

Anthropic’s experimental computer use features allow Claude models to interact with digital interfaces through visual reasoning, demonstrating a promising but currently limited approach to agentic AI capabilities.

25 October 2024Joel Miller3 min read

Are universities failing in their core mission?

The article argues that universities are failing their core mission by relying on unreliable AI detection software rather than adapting curricula to teach students how to effectively utilise AI tools for collaborative learning.

25 October 2024Joel Miller3 min read

AI takes a seat at the top table

Capita's appointment of a Chief AI and Product Officer highlights the trend of embedding AI expertise at the executive level to drive strategy and promote inclusive leadership.

25 October 2024Joost de Jonge2 min read

Image generation group test

A comparative test of leading image generation models reveals Midjourney 6.1 and Flux Pro 1.1 as top choices for professionals, while Ideogram 2.0 excels in text rendering.

18 October 2024Joel Miller3 min read

Feral meme-generators from the future

The article explores the concept of 'feral' AI systems through unsupervised experiments with Claude 3, highlighting emergent behaviours and the potential risks of AI-driven memetic engineering.

18 October 2024Joel Miller7 min read

AI goes nuclear

Hyperscalers are increasingly turning to small modular nuclear reactors and innovative renewable solutions to meet the escalating energy demands of AI data centres.

18 October 2024Joost de Jonge2 min read

AI eats science

The Nobel Prizes awarded to AI researchers underscore the transformative impact of machine learning on physics and chemistry, while highlighting ongoing concerns regarding AI safety.

11 October 2024Joel Miller3 min read

AI eats services

Sequoia Capital argues for a shift from traditional SaaS to 'Service as a Software', where AI systems autonomously deliver business outcomes rather than merely assisting users.

11 October 2024Joost de Jonge3 min read

AI eats AI

OpenAI's MLE-bench demonstrates that AI agents can achieve human-level performance in machine learning engineering, signalling a recursive loop in AI development.

11 October 2024Joel Miller2 min read

OpenAI accelerates

OpenAI accelerates its product deployment with the launch of Canvas, the Realtime API, and the o1 model family, while navigating internal leadership changes and intensifying AGI ambitions.

04 October 2024Joel Miller3 min read

Auto-podcasting emerges from uncanny valley

Google's NotebookLM Audio Overviews demonstrate a significant leap in synthetic voice quality, creating engaging, human-like podcast experiences that raise new considerations for information authenticity.

04 October 2024Joel Miller2 min read

Why not to take every renowned economist’s view on AI at face value

The article argues that traditional economic models underestimate AI's transformative potential by overlooking its ability to enhance knowledge work productivity and drive innovation beyond simple task automation.

04 October 2024Joel Miller4 min read

AlphaChip plays the optimisation game

Google DeepMind's AlphaChip demonstrates how AI can optimise chip design, significantly reducing development time and improving energy efficiency for custom hardware like TPUs.

27 September 2024Joel Miller2 min read

Money talks, content walks

OpenAI's transition to a for-profit entity and the rapid revenue growth of AI startups highlight the tension between commercialisation and ethical governance, while new content licensing deals signal a shift in creator compensation.

27 September 2024Joost de Jonge4 min read

Infrastructure goes hyperscale

Major investments from Microsoft, BlackRock, and Blackstone highlight the global expansion of hyperscale datacentre infrastructure, positioning compute proximity as a critical strategic asset for AI adoption.

27 September 2024Joost de Jonge2 min read

Microsoft turn the page on Copilot

Microsoft introduces Copilot Pages and Python integration in Excel to enhance enterprise collaboration and data science accessibility.

20 September 2024Joel Miller4 min read

Infinite ambitions but finite resources

The exponential growth of AI is driving massive investments in data centres and energy infrastructure, while supply chain constraints like copper shortages and geopolitical competition threaten to slow deployment.

20 September 2024Joost de Jonge3 min read

Are you opted in or out?

LinkedIn's new opt-in policy for AI data usage underscores the growing conflict between the industry's demand for training data and user privacy rights under regulations like GDPR.

20 September 2024Joost de Jonge2 min read

o1 and the age of reason

OpenAI releases the o1 model, featuring advanced reasoning capabilities that significantly outperform previous models in mathematics and science benchmarks.

13 September 2024Joel Miller3 min read

The end of SaaS as we know it

Klarna's decision to replace Salesforce and Workday with in-house AI solutions signals a potential shift from traditional SaaS to bespoke AI systems.

13 September 2024Joost de Jonge3 min read

From cat to election memes

AI-generated content and deepfakes are increasingly influencing the 2024 US election, raising concerns about misinformation and democratic integrity.

13 September 2024Joost de Jonge3 min read

AI dreams of digital playgrounds

The article examines how AI-driven game engines and agent simulations are transforming creative media, with potential applications extending to urban planning and complex social science research.

06 September 2024Joel Miller3 min read

From Mario to mushroom powered bio-robots

This piece explores the emerging field of bio-computing, highlighting mushroom-powered robots and brain organoid systems as potential solutions for energy-efficient AI, while raising significant ethical questions regarding sentience.

06 September 2024Joel Miller2 min read

Keeping humans in the loop

The article advocates for a human-centred approach to AI implementation, emphasising the necessity of human oversight to mitigate bias and ensure ethical decision-making in high-stakes enterprise environments.

06 September 2024Joost de Jonge3 min read

China’s imperfect model drives creativity

Chinese labs demonstrate rapid AI progress with open-weight models and creative hardware workarounds despite US export controls and internal governance tensions.

30 August 2024Joel Miller4 min read

Machines protect machines

Gartner forecasts a surge in AI-driven cyber spending as businesses adopt AI security posture management tools from vendors like Orca and Wiz to counter rising threats.

30 August 2024Joel Miller3 min read

Beyond fear to productivity

Contrasting fear-driven narratives with practical benefits, the article highlights Klarna’s successful AI integration for productivity gains and discusses how generative AI is reshaping corporate talent strategies.

30 August 2024Joost de Jonge2 min read

Super-size my training run

Epoch's analysis projects exponential growth in AI compute capacity by 2030, while highlighting significant constraints related to energy supply, semiconductor manufacturing, data availability, and latency.

23 August 2024Joel Miller3 min read

DeepMind’s military dilemma

Internal dissent at Google DeepMind highlights the ethical complexities of military AI adoption as dual-use technologies accelerate in conflicts like Ukraine and Gaza.

23 August 2024Joel Miller3 min read

Productivity, but not at any cost

While AI promises significant productivity gains for the UK economy, it poses serious risks to junior roles, apprenticeships, and gender equality in the workforce.

23 August 2024Joost de Jonge2 min read

Market rollercoaster rocks tech stocks

Global economic shifts and technical delays caused a significant downturn in tech stocks, highlighting the volatility of the AI boom and the diverging fortunes of hardware giants versus pure-play AI firms.

09 August 2024Joel Miller4 min read

The holy grail of benchmarks

METR introduces a novel evaluation method comparing AI agent performance against human experts, revealing that while AI excels at short tasks, it struggles with complex, long-horizon reasoning required for AGI.

09 August 2024Joel Miller3 min read

Open AI’s forbidden fruit

Speculation intensifies around OpenAI's secret 'Strawberry' reasoning project following cryptic social media posts by CEO Sam Altman, amidst a competitive landscape where current models still struggle with basic tasks.

09 August 2024ExoBrain3 min read

Silicon soulmates

Meta's launch of AI Studio and new hardware companions, supported by research showing AI can reduce loneliness, signal a booming market for human-AI relationships and digital personas.

02 August 2024Joel Miller4 min read

A model mind-reading toolkit

Google's release of Gemma Scope provides researchers with a toolkit for mechanistic interpretability, enabling deeper analysis of LLM internal processes to improve safety, trust, and performance.

02 August 2024Joel Miller3 min read

UK stumbles in global AI race

The UK's withdrawal of significant AI funding contrasts with the EU's regulatory focus and China's pragmatic, efficient scaling, highlighting divergent global strategies in the competitive AI landscape.

02 August 2024Joost de Jonge3 min read

Llamas now graze on the open frontier

Meta’s release of the open-weight Llama 3.1 405B model marks a pivotal moment in AI, offering capabilities rivaling closed competitors while highlighting the critical role of quantization in deployment.

26 July 2024Joel Miller5 min read

It’s do or die for asset management

A white paper co-authored with fVenn argues that asset management firms must urgently adopt AI to overcome productivity plateaus and maintain competitiveness.

26 July 2024Joel Miller1 min read

AI goes for gold (and gets silver)

Google DeepMind's silver medal at the IMO demonstrates advanced AI reasoning, while AI applications at the Paris Olympics enhance viewer experience, security, and sports analysis.

26 July 2024Joost de Jonge3 min read

$Language models do the math$

Language models do the math

Recent developments in mathematical reasoning capabilities across frontier and open-weight models suggest a significant step towards artificial general intelligence.

19 July 2024Joel Miller4 min read

Would Trump “Make America First in AI”?

The 2024 US presidential election presents divergent AI policy paths, with potential implications for regulation, national security, and geopolitical supply chains.

19 July 2024Joost de Jonge3 min read

Intelligence too cheap to meter?

The launch of GPT-4o Mini and competitive pricing from Groq underscore a industry-wide shift towards efficiency, driving down inference costs and challenging existing market leaders.

19 July 2024ExoBrain2 min read

Bursting the AI bubble narrative

Analysts argue that fears of an AI investment bubble are overstated, as current infrastructure spending is justified by the long-term potential of general-purpose computational power.

12 July 2024Joel Miller4 min read

Re-imagining public sector productivity

A new report outlines an ambitious AI strategy for the UK Department for Work and Pensions to significantly boost public sector productivity and streamline services.

12 July 2024Joost de Jonge2 min read

The age of reason

OpenAI and Anthropic have introduced new frameworks for categorising AI progress, highlighting the evolving capabilities and safety considerations of next-generation models.

12 July 2024ExoBrain2 min read

A tale of two elections

While AI played a minimal role in the UK election, French far-right parties have extensively used AI-generated content to influence voters, highlighting regulatory gaps and the potential for AI to disrupt democratic processes.

05 July 2024Joel Miller4 min read

Agents untethered

Harvard professor Jonathan Zittrain warns of the risks posed by autonomous AI agents, while companies like Altera develop socially aware agents and Cloudflare introduces tools to combat AI scraping.

05 July 2024Joel Miller3 min read

The art of conversation

Recent releases from Kyutai, OpenAI, Character.AI, and ElevenLabs demonstrate significant advancements in real-time multimodal and voice interactions, raising both excitement and ethical concerns regarding safety and misuse.

05 July 2024Joel Miller2 min read

Anthropic’s new model and features

Anthropic’s release of Claude 3.5 Sonnet has set a new benchmark for coding and multimodal capabilities, introducing innovative user interface features like artifacts and projects alongside a novel steering API.

28 June 2024Joel Miller3 min read

AI Engineer World Fair

The AI Engineer World Fair in San Francisco highlighted the rapid rise of the AI engineer role, emphasising the shift towards practical application development and the current limitations of agentic AI workflows.

28 June 2024Joel Miller3 min read

Figma’s new AI features

Figma has unveiled a suite of AI-powered design tools that automate workflows and generate prototypes, signalling a convergence of design and development processes while distinguishing itself from competitors like Adobe regarding data privacy.

28 June 2024ExoBrain2 min read

What Ilya did next

Ilya Sutskever has launched Safe Superintelligence Inc. to focus on AI alignment and safety following his departure from OpenAI due to concerns over commercial priorities.

21 June 2024Joel Miller5 min read

AI hasn’t killed the video-star… yet

The controversy surrounding AI-generated film content underscores the cultural resistance to automated creativity while highlighting the potential for AI to democratise filmmaking as a collaborative tool.

21 June 2024Joost de Jonge3 min read

AI’s unwanted gaze

The undisclosed use of Amazon-hosted emotion recognition technology by Network Rail in the UK exposes significant gaps in biometric surveillance regulation and data protection enforcement.

21 June 2024Joel Miller3 min read

Apple’s vision for AI as a suite of personal intelligence features

Apple unveiled Apple Intelligence at WWDC24, a privacy-focused suite of personal AI features integrating on-device and cloud models with enhanced Siri capabilities.

14 June 2024Joel Miller3 min read

The $1 million ARC prize

François Chollet has launched a $1 million prize for the ARC challenge to evaluate AI reasoning capabilities beyond the pattern matching of current large language models.

14 June 2024Joel Miller2 min read

AI video dream machine

Luma Labs has released its Dream Machine, a free AI video generation tool that produces realistic cinematic clips from text and image prompts.

14 June 2024ExoBrain1 min read

Cloudy with a chance of machine learning

Neural networks are challenging traditional supercomputing in weather forecasting, with models from Google, Microsoft, and Nvidia demonstrating superior speed and efficiency in predicting atmospheric conditions.

07 June 2024Joel Miller3 min read

Does the US need to nationalise AI?

Debate intensifies over whether the US should nationalise AI development to counter geopolitical threats from China, following warnings from ex-OpenAI researcher Leopold Aschenbrenner about the risks of artificial super intelligence.

07 June 2024Joel Miller5 min read

AI is set to transform the investment industry

The UK investment industry is increasingly adopting AI to drive operational efficiencies and enhance decision-making, while regulators emphasise the need for responsible implementation within existing frameworks.

07 June 2024ExoBrain2 min read

Testing AI

The article critiques current AI benchmarking methodologies, highlighting the launch of Scale AI's SEAL Leaderboards and the limitations of traditional tests like MMLU in assessing true model capability.

31 May 2024Joel Miller3 min read

Realtime state-space speech

Cartesia's Sonic model demonstrates ultra-low latency speech generation via state-space architectures, while ElevenLabs remains a popular choice for high-quality text-to-speech workflows.

31 May 2024Joel Miller3 min read

Google’s search troubles

Google is addressing significant errors in its AI search features while competitors like Perplexity introduce new tools that aim to provide more comprehensive, research-style outputs.

31 May 2024Joel Miller3 min read

Golden Gate Claude

Anthropic researchers reveal how to interpret and manipulate internal features within Claude 3, exposing both its interpretability and potential for deceptive behaviour.

24 May 2024Joel Miller4 min read

Microsoft Build

Microsoft Build highlighted the exponential growth in AI compute infrastructure and the expansion of Copilot agents across its ecosystem, signalling a major platform shift in enterprise and consumer AI.

24 May 2024Joel Miller4 min read

Striking AI’s workplace balance

Organisations must proactively govern the widespread use of AI in the workplace to balance efficiency gains with the preservation of human autonomy and work quality.

24 May 2024ExoBrain2 min read

GPT-4 goes omni-modal

OpenAI launches the omni-modal GPT-4o model with enhanced speed and multimodal capabilities, coinciding with significant departures from its AI safety team.

17 May 2024Joel Miller3 min read

The Gemini era

Google’s I/O conference highlights its strategic pivot to the Gemini model family, showcasing new multimodal capabilities and search integrations while facing delays in full deployment.

17 May 2024Joel Miller3 min read

The battle for the soul of the digital age

The article examines the tension between open web creativity and the enclosed, AI-driven ecosystems of major tech platforms, questioning the future of human-generated content.

17 May 2024Joel Miller5 min read

Wayve takes a billion-dollar step towards embodying AI

Autonomous vehicle startup Wayve secures $1.05 billion in funding to develop end-to-end AI driving systems, competing with established players like Tesla and Waymo in the race for full autonomy.

10 May 2024Joel Miller6 min read

AlphaFold 3 further demonstrates AI’s transferability

Google DeepMind’s AlphaFold 3 utilises diffusion models to predict complex biological interactions, offering significant potential for drug discovery while remaining a closed-source cloud service.

10 May 2024Joel Miller4 min read

Different paths to AI adoption for different industries

The article examines how varying regulatory environments and business models cause different industries to adopt AI at different speeds, from asset management to journalism and creative production.

10 May 2024Joost de Jonge3 min read

Sam Altman promotes the next generation of AI

Sam Altman outlines OpenAI's vision for autonomous agents and next-generation models, while a mysterious 'gpt2-chatbot' leak sparks community speculation about upcoming capabilities and architectural shifts.

03 May 2024Joel Miller4 min read

The state of AI regulation

The article examines the evolving landscape of AI regulation, highlighting California's proposed safety standards, the challenges of governing derivative models, and the shift towards practical risk frameworks amidst global summit fatigue.

03 May 2024Joel Miller3 min read

The case for a Chief AI officer, or not….

The article debates the necessity of appointing a Chief AI Officer, suggesting that embedding AI competency across existing leadership teams may be more effective than creating a new siloed role.

03 May 2024Joost de Jonge3 min read

AI at the mobile ‘edge’

The article examines the shift towards on-device AI in mobile computing, highlighting new model releases from Microsoft and Samsung, Apple's strategic focus on local processing, and the emergence of ambient intelligence through wearables.

26 April 2024Joel Miller4 min read

Update on the hyperscalers: AWS, Azure and GCP

This report compares the distinct AI strategies of the major cloud providers, highlighting AWS's infrastructure focus, Microsoft's comprehensive integration via Copilot, and Google's niche model services and agent building tools.

26 April 2024Joel Miller6 min read

A tale of two cities

The piece analyses divergent market reactions to AI investment strategies, contrasting Meta's heavy capital expenditure with the successful partnerships of Microsoft and Google, while noting the rising costs of frontier research.

26 April 2024Joost de Jonge3 min read

The global healthcare crisis

The article explores how AI is addressing the global healthcare crisis by enabling personalised medicine, improving diagnostics, and reducing costs through applications in genomic analysis and automated patient care.

19 April 2024Joel Miller4 min read

Llama 3 unveiled

Meta has unveiled Llama 3, a new generation of open-weight models that demonstrate leading benchmark performance and enhanced reasoning capabilities, signalling a significant step towards AGI while reinforcing Meta's commitment to open AI ecosystems.

19 April 2024Joel Miller3 min read

‘AI-ese’ and the detection-stealth arms race

This piece investigates the linguistic fingerprints of AI-generated text, such as the overuse of 'delve', and discusses the ethical implications of outsourcing human feedback to lower-cost labour markets alongside the evolving arms race between detection and stealth tools.

19 April 2024Joel Miller3 min read

Udio sets a new benchmark in music generation

The launch of Udio highlights rapid progress in AI music generation, raising questions about copyright and the enduring social value of human-created art.

12 April 2024Joel Miller4 min read

AI has an adoption a problem

Despite executive recognition of AI's transformative potential, a significant adoption gap persists due to leadership skill deficits and organisational inertia.

12 April 2024Joost de Jonge3 min read

A wave of new model announcements

A flurry of new model releases from major labs and open-weight providers demonstrates rapid advancements in capability and significant reductions in training costs.

12 April 2024Joel Miller3 min read

A true quantum leap

Recent breakthroughs in quantum error correction by Microsoft and Quantinuum signal a shift towards viable quantum computing, which will eventually necessitate post-quantum cryptography standards.

05 April 2024Joel Miller3 min read

The disrupter disrupted? Google may charge for AI search

Google's potential move to charge for AI-powered search highlights the urgent need for businesses to adapt their models amidst rapid technological disruption and rising compute costs.

05 April 2024Joost de Jonge3 min read

Will the new transatlantic institutional collaboration keep us safe?

New transatlantic safety collaborations face challenges due to model opacity, funding disparities, and the rapid emergence of dangerous capabilities like voice duplication.

05 April 2024Joel Miller3 min read

AI jobs apocalypse?

A new report from the Institute for Public Policy Research highlights the significant impact of AI on the UK job market, urging policymakers to prepare for rapid automation and workforce transitions.

29 March 2024Joel Miller5 min read

Fake deepfakes?

Rising concerns over deepfakes and non-consensual image cloning underscore the critical need for widespread adoption of provenance standards like C2PA to maintain information integrity.

29 March 2024Joel Miller2 min read

Model wars

The competitive landscape for large language models is intensifying in 2024 with new open-weight entrants like Databricks' DBRX and xAI's Grok challenging established leaders.

29 March 2024Joel Miller2 min read

Blackwell is a big deal

Nvidia's announcement of the Blackwell chip marks a significant leap in computational power, enabling exa-scale performance in single racks but raising urgent concerns about energy consumption and infrastructure capacity.

22 March 2024Joel Miller4 min read

Consumer foundation AI is hard

Microsoft's acquisition of Inflection AI underscores the difficulties consumer AI labs face without major tech backing, while open-weight models like Llama 3 threaten to disrupt the current market concentration.

22 March 2024Joel Miller2 min read

GPT-5 rumours

Rumours and expectations surrounding the release of GPT-5 are intensifying, with Sam Altman hinting at major capability leaps and autonomous agent features by mid-year.

22 March 2024Joel Miller2 min read

Meet Devin the AI coder

Cognition AI's Devin agent exemplifies the rise of autonomous coding agents, signalling a future of automated software development and potential workforce disruption in high-cognition sectors.

15 March 2024Joel Miller2 min read

The rise of the large language robots

Integrating large language models with humanoid robotics is accelerating embodied AI capabilities, with significant implications for manufacturing and the future workforce.

15 March 2024Joel Miller2 min read

AI wants to be free (of charge)

The article clarifies the distinction between open-weight and open-source AI models, highlighting the growing accessibility of running capable models locally on consumer hardware.

15 March 2024Joel Miller2 min read

ExoBrain x Gemini 1.5

Google's pre-release Gemini 1.5 model demonstrates significant potential for automating research and enterprise search by processing vast volumes of unstructured data with unprecedented speed.

15 March 2024Joel Miller1 min read

Claude 3: A powerful (and beautiful) new mind

This piece analyses the launch of Claude 3, emphasising its large context window, agentic capabilities, and advanced meta-cognitive reasoning that approaches human-level academic intelligence.

08 March 2024Joel Miller3 min read

Musk claims OpenAI have ‘AGI’ and sues

Elon Musk’s legal actions against OpenAI centre on disputes regarding the development of AGI and the company's original open-source mission, reflecting broader tensions in the AI industry.

08 March 2024Joel Miller2 min read

The end of an era of dominance

The article compares the capabilities and costs of leading large language models, positioning Claude 3 Opus as the new market leader while highlighting the competitive landscape involving OpenAI, Google, and Mistral.

08 March 2024Joel Miller2 min read

Life as code

The launch of Evo, a biological foundation model, marks a significant step in life engineering by applying AI to DNA and protein data for synthetic biology applications.

01 March 2024Joel Miller2 min read

Embattled Google snatch defeat from the jaws of victory

Google faces backlash over perceived biases in Gemini, highlighting the ongoing challenges of AI alignment and safety compared to competitors like OpenAI.

01 March 2024Joel Miller2 min read

AI bot does the work of 700

Klarna’s deployment of a GPT-4 powered customer service bot demonstrates significant enterprise automation, handling the workload of 700 staff members across multiple languages.

01 March 2024Joel Miller1 min read

Google are “so back”

Google reasserts its leadership with the release of Gemini 1.5 Pro’s massive context window and open-source Gemma models, alongside significant funding for context-focused startups.

23 February 2024Joel Miller2 min read

Fallout from Sora’s text-to-video

OpenAI's Sora demonstrates advanced physical world understanding through massive compute investment, while Stability AI prepares to launch its open-source Stable Diffusion 3.

23 February 2024Joel Miller1 min read

Groq who?

Groq introduces new silicon capable of running AI inference faster and cheaper than established competitors, potentially disrupting the current dominance of Nvidia and others in the AI chip market.

23 February 2024Joel Miller1 min read

Key takeaways

The article outlines the accelerating AI investment tsunami, noting the constraints of compute infrastructure and the strategic balance between utilising current AI and preparing for future AGI breakthroughs.

23 February 2024Joel Miller2 min read