DeepSeek’s deep thought
DeepSeek’s efficient R1-lite model challenges US dominance in AI development, highlighting the intensifying geopolitical race and the impact of export controls on innovation.
Joel Miller

The Chinese lab DeepSeek has exploded assumptions about AI development this week by matching OpenAI’s o1’s reasoning capabilities just a few months after the leading lab fired the starting gun on the reasoning AI era. Their new R1-lite model not only matches o1 on key benchmarks like AIME but does so with remarkable efficiency through their ‘mixture of experts’ architecture – using a minuscule 16B parameters.
This achievement comes as the US-China Economic and Security Review Commission recommended a “Manhattan Project” for AGI to Congress, suggesting broad government control over AI compute resources. The US-China AI race is accelerating.
DeepSeek’s success builds on the emerging trend of maximising inference-time computation rather than just scaling up model size. By getting models to “think harder” when answering questions, smaller and more efficient architectures can match the capabilities of much larger systems. This approach, similar to o1’s strategy, suggests that raw size and training compute may matter less than previously thought.
The timing is particularly noteworthy given ongoing US export controls on AI chips to China. DeepSeek’s rapid progress, and access to at least 10,000 Nvidia H100 equivalent GPUs, alongside other Chinese labs, demonstrates how regulation and restrictions might reshape but not necessarily halt AI development, and possibly even accelerate it.
Whilst DeepSeek (backed by High-Flyer Capital a Hong Kong based hedge fund), open source their models they still clearly conform to the kinds of content censorship that is required via Chinese AI regulation. Their model exhibits clear content boundaries – refusing to discuss sensitive political topics like Taiwan for example (or indeed this article).
Meanwhile, US policy discussions increasingly frame AI development as a geopolitical race. The commission’s report suggests giving the Department of Defence first priority on AI compute resources – from GPUs to data centre capacity. Some policymakers are even considering restrictions on open-weight AI models, although given the nature of training AI, Chinese labs could continue to innovate independently in any case. But the US will likely step-up restrictions on US investment into the Chinese AI market. The Biden administration’s compute threshold approach may be expanded significantly under Trump, whose appointees like Marco Rubio have pushed for broader constraints on US-China tech investment. The current “small yard, high fence” approach could shift to wider restrictions across multiple sectors like biotech and batteries, though some Trump-aligned business leaders with Chinese investments may push back.
Takeaways: DeepSeek R1 proves that the next phase of AI competition will not be about who can build the biggest models, but who can build the most efficient ones. DeepSeek’s achievement suggests that focused technical innovation within regulatory boundaries could be more productive than government-directed moonshots. For businesses, this means opportunities might lie in optimising existing models rather than chasing ever-larger architectures. Watch for more developments in inference-time computation and efficient architectures as the field continues to evolve. You can try the new model now via DeepSeek chat.