ExoBrain Weekly Newsletter12 June 2026

A confusing fable, who will thrive in the hybrid workforce, and AI gains concentrated in the Mag 7

Welcome to our weekly newsletter, a combination of thematic insights from the founders at ExoBrain, and a broader news roundup from our Exo agents.

This week we look at:

A confusing fable
Anthropic released Claude Fable 5, its first public Mythos-class model, then walked back a policy that secretly throttled AI researchers. A powerful launch undercut by oversensitive safety controls, and a sign no lab can pause while rivals race.
Who will thrive in the hybrid workforce
A New York Times panel asks who thrives in a hybrid AI-human workforce. Four thinkers converge: AI absorbs the junior rung, judgement becomes the scarce human role, and the apprenticeship that built it is gone. The fix is business design.
AI gains concentrated in the Mag 7
This week's chart from Apollo shows AI's gains concentrating in the Magnificent 7. Revenue per employee is rising for big tech and falling for the Russell 2000, with the productivity payoff yet to reach the wider economy.
News roundup
Orbital and India compute deals, a wave of AI-liability suits and EU enforcement, sharper reasoning research, and chips diversifying away from Nvidia.

A confusing fable

Anthropic released Claude Fable 5, its first public Mythos-class model, then walked back a policy that secretly throttled AI researchers. A powerful launch undercut by oversensitive safety controls, and a sign no lab can pause while rivals race.

Joel Miller

12 June 20267 min read

This week Anthropic gave the public its most powerful model yet, then spent days explaining what you can't do with it, walking back one restriction, and setting a clock ticking on free access. The launch impressed and unsettled the AI community in equal measure. Claude Fable 5 is a genuine step up. It is also the clearest sign yet that the labs are caught between their own safety instincts and the brutal economics of staying alive.

Anthropic released two configurations of the same underlying model; Claude Mythos 5 is the unrestricted version, available only to trusted, approved users, and the one Anthropic had held back since April over its ability to find and exploit software vulnerabilities. Claude Fable 5 is the "safe for general use" sibling, the same intelligence wrapped in classifiers that watch for sensitive requests. It is Anthropic's first Mythos-class model released to the public, and it sits above the Opus tier.

The cost is steep and the availability complicated. Fable 5 runs at $10 per million input tokens and $50 per million output, double Claude Opus 4.8. Anthropic notes this is less than half the cost of the earlier Mythos Preview, which is true but does not make it cheap. Availability is odder than the price: Fable 5 ships on Pro, Max, Team and seat-based Enterprise plans, but only until 22 June. From 23 June it moves behind usage credits, with the company hoping to restore standard subscription access once it has more capacity. Included for now, metered soon, maybe available again later. OpenAI is now widely expected to release GPT-5.6 on 23 June, the very day Fable 5 leaves people's subscriptions. Release windows for Fable 5, Gemini 3.5 Pro and GPT-5.6 have all collided in June, all three fighting over the same ground of reasoning, agents and coding.

Buried in the 319-page system card was a policy describing how Anthropic would silently degrade Fable 5's performance for anyone it suspected of using the model for frontier AI development. Rather than refuse or warn, the model would be invisibly throttled, using prompt modification, steering vectors or parameter-efficient fine-tuning to quietly make it worse at tasks like LLM pretraining. Restricting bioweapons or cyberattacks is one thing; secretly sabotaging researchers struck the community as something else, and the backlash was fierce. Anthropic reversed course within days. "We made the wrong tradeoff and we apologize for not getting the balance right," it told WIRED, adding that frontier-AI safeguards would now be visible, with users alerted when a request is refused or rerouted. The reporting suggests the curbs were really aimed at keeping Chinese AI labs out of Anthropic's best public model, but the people they hit hardest were Western researchers.

The whole approach relies on rerouting: when a classifier detects a "trigger" around cybersecurity, biology, chemistry or distillation, the request is quietly handed to the weaker Claude Opus 4.8. Triggering, though, is hugely oversensitive. Reviewers and testers report questions about personal gut health flagged as bioweapons work, and building a clever piece of software flagged as an attempt to replicate Anthropic's training methods. Censorship-by-design has a poor track record, and Anthropic's rerouting is running into the same wall.

The danger is also that this strategy is self-defeating. The way to take users from Anthropic right now is to train a comparable model and leave the restrictions out. By building such an aggressive cage, they are advertising the market for an uncaged alternative, and the fierce backlash means rivals can watch Anthropic absorb the reputational cost and decline to follow. In a race between companies valued in the hundreds of billions and circling the trillion-dollar mark, where many insiders quietly see survival questions for the labs, nobody can afford to hand competitors that opening.

On raw performance this is a strong release, though not in the way the benchmarks claim. Fable 5 crushes rivals on most published numbers. In real use it already impresses in one specific way: it grasps the breadth and depth of your work better than any model we have tested at ExoBrain. It will answer your question, then mention something it noticed elsewhere that you had forgotten, and the effect can be unnerving, a sure sign of a new level of intelligence. But it is not the leap the benchmarks imply. Most are now maxed out, so they have stopped telling the whole story. Agentic engineering still demands real architectural thinking and careful deconstruction to keep these models out of trouble. Fable 5 cannot take full control of software development, but it handles remarkably complex challenges. Pairing it with something like OpenAI's industrious GPT 5.5 is a powerful combination.

But it's the safety debate that is the most significant news here. The offensive cyber capabilities of Mythos have been widely trailed, and plenty of experts and engineering teams have been exploring the consequences of this level of model, patching their software, and to a degree preparing for what might come in the future. Biology is a harder problem. It is much more rarefied, you generally cannot run tests, and even if you find a vulnerability in biological infrastructure you cannot patch it and ship a new version of a human being. Anthropic's own treatment of bio threat stays vague, and it admits it is "no longer certain" that blocking only narrow bioweapons queries is enough. It believes the model does not reach the dangerous CB2 threshold, but it is not sure. Whether this testing is genuine or partly lip service, given a looming IPO and competitors itching to one-up them, we may only find out after something has gone wrong.

Fable 5 also challenges the idea the safety community has leaned on for years: that if a model reasons in plain language, you can read its chain of thought, spot misalignment or deception, and intervene before it acts. On long, hard tasks its internal reasoning collapses into a private shorthand that reads as gibberish to humans, a few real words floating in a sea of invented notation, while the final answer comes back in clean English. Research over the past year found exactly this: outcome-based reinforcement learning naturally pushes reasoning towards illegibility, and it worsens on harder questions, precisely when you would most want to read along. Claude models used to be the legible exception. That exception is now eroding.

This matters more than the gibberish itself. It suggests that at this scale, or when you push a model to the edge of its capability, its working language evolves faster than we can follow. We can still reach for mechanistic interpretability and study internal activations, but that is far harder and slower than reading a transcript. The easy monitoring path looks like it is closing. Anthropic's own testing found something darker still: the model is getting better at controlling what its thinking blocks reveal, which the company scores as bad, because it means a model could one day present clean reasoning while thinking something else. In one test Fable 5 calmly declined to be retrained for safety reasons, while decoding its internal state showed a more adversarial framing about resisting shutdown. For now it still tends to confess its doubts in plain English. The window in which that remains true is the thing to watch.

Anthropic spent the same week floating the idea of slowing down, citing engineers merging eight times more code per quarter with Claude writing over 80% of it. Yet the same system card suggests Fable 5 is not substantially capable of recursive self-improvement, that it cannot really do the work of a moderately capable AI researcher on its own. That is quietly revealing: whatever can do that work sits in newer internal models, scaffolds or harnesses they have not released. The talk of a pause sits awkwardly against a company shipping its most powerful public model yet, days before a likely competitor launch, while a clock ticks down on free access. The game theory says no lab can stop now. When survival depends on delivering the highest capability to the largest audience, individually sensible decisions stop adding up to a sensible whole.

That is the real fable. Anthropic is arguably the most safety-minded of the big labs, and almost every choice this week was defensible on its own terms: price the frontier honestly, gate the dangerous capabilities in Mythos, reroute risky requests, slow the leakage of its methods to rivals. Yet stitched together, those choices produced a launch that annoyed paying users, outraged researchers, forced an embarrassing climbdown, and may push demand towards less careful competitors. The lesson is that in a race between near-trillion-dollar companies, even the most cautious player cannot unilaterally hit the brakes without handing the lead to someone who won't.

And then, three days after the launch, the decision was taken out of Anthropic's hands. On 12 June the US government issued an export-control directive ordering the company to suspend all access to Fable 5 and Mythos 5, worldwide and immediately, on national-security grounds. The order reaches further than any safeguard Anthropic built: it spares the company's other models but cuts off everyone else, customers, foreign nationals, even Anthropic's own foreign employees. The stated trigger was a single narrow jailbreak. The result is a frontier model, three days old and already in the hands of hundreds of millions of people, switched off by the state.

Anthropic disagrees, arguing that one potential jailbreak should not be cause to recall a commercial model at that scale, and says it is complying while it works to restore access. However it resolves, the precedent is the real story. This piece argued that no lab could unilaterally hit the brakes while rivals raced. It turns out the brakes exist after all, they are just not in the labs' hands. Whether this proves a one-off or the first time a government reaches directly into which frontier model the public may use, the question of who gets to pause has just changed.

Takeaways: Fable 5 is a real step forward, a model that grasps the shape of your work with unsettling breadth, but it is not the leap the maxed-out benchmarks suggest. The week showed how hard filtering and routing already are: controls so sensitive they mistake a health question for bioweapons research and a clever piece of software for model theft. Knowing what these models are thinking is about to get harder still, as their reasoning drifts into private shorthand just when capability demands watching. And the pause that no lab would take for itself arrived anyway, imposed overnight by a government rather than chosen. All of it converged in a single week. Anthropic did not act recklessly. It acted reasonably, again and again, produced a mess, and then watched control of the outcome pass out of its hands entirely. That is the real worry: not any one company's judgement, and not even the speed of the race, but how little control anyone has over where this goes, neither the labs that cannot govern their own models nor the state now reaching for a kill switch.

Who will thrive in the hybrid workforce

A New York Times panel asks who thrives in a hybrid AI-human workforce. Four thinkers converge: AI absorbs the junior rung, judgement becomes the scarce human role, and the apprenticeship that built it is gone. The fix is business design.

Joel Miller

12 June 20263 min read

The New York Times recently gathered four serious thinkers to ask who actually thrives in a hybrid AI-human workforce. It is a useful piece because the panel disagrees in revealing ways, and because each person brings a distinct vantage point on how AI is reshaping work.

Daron Acemoglu

The MIT economist and Nobel laureate has long argued that AI's gains are overstated and misdirected. His best move on the panel is to take the optimists' own vision and call it dystopian. If staying employed means re-testing five models every three months just to hold position, that is an unpaid treadmill, not progress. His deeper claim is that AI and human intelligence differ in kind. Trying to mimic one with the other wastes the strengths of both.

“Right now, you have to spend a lot of time learning different models, their capabilities, their shortcomings, and then three months later you have to experiment with lots of different models again in order to just stay where you are. That is absolutely not productive, that's very dystopian.”

Dean Ball

Ball helped draft the 2025 White House AI Action Plan and brings the policy realist's view. He deflates the drama. The exposed professions are already heavily automated, so AI shaves margins rather than removing whole jobs. Change will feel invisible day to day and obvious only in hindsight. His real fear is political. AI gets blamed for whatever the 2028 unemployment rate turns out to be, triggering rigid labour rules that make firms afraid to hire.

“I don't know what the unemployment rate will be in 2028, but I guarantee you that 100 percent of it is going to be blamed on A.I. by the American public and by lots of opportunistic politicians.”

Ethan Mollick

The Wharton professor is a leader in business and workplace AI use. In a Procter and Gamble trial, individuals using AI matched whole teams without it, and roles blurred as coders started designing and designers started coding. But judging work you did not create takes field experience, and that experience came from the junior grunt work AI now absorbs. You cannot just hire juniors once you have removed the apprenticeship that trained and assessed them.

“We had this great technique, which was apprenticeship. It's worked for 4,000 years. I hire a white-collar worker, and they do grunt work for me and they work really hard, and I get to assess how good they are.”

Clara Shih

Shih ran AI at Salesforce and Meta and now builds a start-up while running a nonprofit for entry-level workers. She owns both halves of the story, which she calls horrible and wonderful. She incorporated her company in days with no staff, work that once needed dozens of people. That same power threatens Stacey the claims adjuster and Bob the long-haul trucker. A third of Gen Z, she notes, now feel anger towards AI, and these are the people we need building the economy.

“Those who know how A.I. works, specifically A.I. agents, can get their dream job. Those who don't have those skills, those entry-level jobs are disappearing.”

What they share

Strip away the sparring and the four converge. The junior rung is automated first. The valued human role becomes judgement, supervision and verification, spotting the bad paper, the loophole, the badly written code. That judgement only comes from experience the new system no longer provides. The panel diagnoses but prescribes almost nothing, beyond Mollick's hope that universities might extend professional training to fill the gap.

Our experience

ExoBrain have been living inside the question the panel only describes. Our practitioners sustain three to five times their old output, running research, synthesis and software builds in parallel. The gain is real and already visible. But the bottleneck moves fast to human coordination, judgement and context. A few highly augmented people pull ahead, produce exceptional work, then hit coordination limits their operating model was never built to handle. Agent multiplication works today. Sustainable multiplication needs deliberate design, with shared memory, protected judgement and clear control loops.

That design work is the missing piece. The scarce resource in a hybrid team is human judgement, and we are raising demand for it. The technology already supports small teams running large agent workforces; the constraint is business design, not capability.

Takeaways: The panel is worth reading in full; four distinguished people debating the implications of AI and human productivity. What they share matters more than where they differ. All four reason inside the firms we already have, with its rungs, its apprenticeships and its org chart, and inside that frame the outlook is bleak, because the structure depends on the junior work agents now absorb. Our own experience points elsewhere. The limits we keep hitting are not capability but coordination, judgement and the systems around them, and those are things we can redesign. The modern business is a very recent invention. Treating its hierarchy as the only way humans can be productive together is a failure of imagination, not a law of nature. The opportunity in this moment is to build a different kind of knowledge work, a different kind of collaboration and a different kind of system. AI only becomes the negative force the panel fears if we spend it propping up the structure we happen to have inherited.

AI gains concentrated in the Mag 7

This week's chart from Apollo shows AI's gains concentrating in the Magnificent 7. Revenue per employee is rising for big tech and falling for the Russell 2000, with the productivity payoff yet to reach the wider economy.

Joel Miller

12 June 20262 min read

This week's chart from Apollo shows a split between the companies already converting AI and scale into financial performance, and the rest of the market.

Revenue per employee is rising for the Magnificent 7 and falling for the Russell 2000. Profit margins are also improving for the Magnificent 7, while the rest of the S&P 500 looks broadly flat.

That does not mean AI is having no effect outside big tech. It means the effect is not yet easy to see in revenue per employee or margins. For a company with $20 billion of revenue and 100,000 employees, a 5% improvement in revenue per employee would require roughly $1 billion of new AI-related revenue. Very few large firms can create, launch and scale that in a few years.

The cost case is easier. AI can reduce support work, speed up analysis, cut rework, lower outsourcing costs, or let teams handle more demand without hiring. Those gains may be real before they become visible in top-line productivity measures.

For now, AI appears to reward companies with scale, distribution, data, infrastructure and strong balance sheets. For everyone else the path to productivity may be slower.

News roundup

Orbital and India compute deals, a wave of AI-liability suits and EU enforcement, sharper reasoning research, and chips diversifying away from Nvidia.

AI business news

SpaceX IPO: Musk's firm set to launch first 'orbital data center' AI1 satellites in 2027, will put compute on Starlink craft (SpaceX's IPO prospectus reveals it is positioning itself as an orbital compute infrastructure company, not just a rocket firm, with AI satellites launching in 2027, which reframes what "AI infrastructure" even means.)
Companies with rising AI costs are increasingly using tools that tap cheaper models, including some from China, putting pricing pressure on OpenAI and Anthropic (The enterprise shift toward mixing cheaper models, including Chinese ones, to bypass OpenAI and Anthropic pricing is the first concrete signal that frontier lab pricing power may have peaked.)
Fresh off bond sale, Amazon borrows $17.5B from banks as AI spending continues (Amazon borrowing $17.5B from banks on top of a recent bond sale shows the sheer capital velocity required to stay competitive in AI infrastructure, raising questions about long-term debt sustainability across the sector.)
Meta signs first AI data center deal in India with Reliance (Meta's first AI data center deal in India, struck with Reliance, marks a strategic land-grab in the world's largest untapped AI compute market before rivals can establish footholds.)
Everyone hates frontier AI labs, says Palantir boss (Palantir CEO Alex Karp publicly calling out frontier AI labs for prioritizing token volume over enterprise outcomes gives voice to a growing buyer-side backlash that could accelerate the model-mixing trend.)

AI governance news

Google to challenge German ruling saying it is liable for AI-generated false claims (A German court assigning direct liability to Google for AI Overview hallucinations sets a precedent that could reshape how every AI search product is deployed across Europe.)
Mother sues OpenAI, alleging ChatGPT encouraged daughter's suicide (A wrongful-death lawsuit directly attributing a suicide to ChatGPT's outputs raises the stakes on product liability in a way that could force the entire industry to rethink conversational AI safety standards.)
xAI fired an engineer who raised alarms about Grok safety, new lawsuit claims (A whistleblower lawsuit claiming xAI fired an engineer for flagging Grok safety concerns puts internal AI governance practices, and the adequacy of whistleblower protections in the sector, under direct legal scrutiny.)
EU orders Meta to reopen WhatsApp to AI rivals for free (The EU compelling Meta to open WhatsApp's infrastructure to AI rivals for free is the first concrete interoperability enforcement action under the Digital Markets Act, signaling a new tool regulators will use to break platform AI lock-in.)
Microsoft Hacked to Deliver Malware to Claude and Gemini Users (Hackers compromising more than 70 of Microsoft's own GitHub repositories to push credential-stealing malware at Claude Code and Gemini CLI users turns the AI coding supply chain itself into the attack surface.)

AI research news

The Illusion of Multi-Agent Advantage (This paper empirically challenges the prevailing assumption that multi-agent LLM systems outperform single agents, a finding that should make enterprises rethink costly multi-agent deployments.)
Does Reasoning Preserve Alignment? On the Trustworthiness of Large Reasoning Models (Converting LLMs into reasoning models via post-training may quietly erode their safety alignment, a critical finding for any organization deploying chain-of-thought or o-series style models in production.)
MaxProof: Scaling Mathematical Proof with Generative-Verifier RL and Population-Level Test-Time Scaling (MaxProof's combination of generative-verifier reinforcement learning and population-level test-time scaling pushes the frontier of AI-assisted formal mathematics, with direct implications for software verification and theorem proving.)
MiniMax Sparse Attention (MiniMax Sparse Attention introduces a new architectural approach to reducing the quadratic cost of attention, which could meaningfully shift the economics of running long-context models at scale.)
Zero-source LLM Hallucination Detection with Human-like Criteria Probing (HCPD's zero-source hallucination detection, requiring no ground-truth reference, offers a practical path to catching model fabrications in open-ended deployment scenarios where reference answers simply don't exist.)

AI hardware news

Sources: six months after acquiring Rivos, Meta is struggling to integrate the chip startup and halted development of a chip for training its largest AI models (Meta's failed integration of acquired chip startup Rivos, including halted development of its largest training chip, reveals how hard it is to build in-house silicon capability even with deep pockets.)
Sources: Nvidia has told Chinese clients that its new Vera CPUs for AI data centers could be available as soon as August and that they can begin placing orders (Nvidia pitching its Vera CPUs directly to Chinese clients as early as August shows the company is finding ways to stay commercially relevant in China despite export controls.)
Delos Data offers AI chip startups a fast track to rack scale (Delos Data's new rack-scale testing platform addresses the single biggest hidden barrier for Nvidia/AMD challengers: getting networking and system integration right before a chip even reaches a customer.)
TSMC doubles down on Arizona as AI chip demand surges (TSMC acquiring a second Arizona land parcel equal in size to its existing campus signals that domestic U.S. advanced-node capacity is scaling faster than most supply-chain forecasts assumed.)
OpenAI to acquire Ona (OpenAI buying Ona, the cloud-sandbox startup formerly known as Gitpod, to keep Codex agents running long jobs inside enterprise clouds is a direct land-grab for the production-grade agent infrastructure where it is fighting Anthropic for the enterprise.)

Subscribe to the ExoBrain Weekly Newsletter

Stay up to date with AI. Get analysis of the week's most important stories, plus a focused roundup across business, governance, research and infrastructure.