ExoBrain
agentic AI, AI security, benchmarks and evals, frontier labs

DeepSeek scores 98% on the wrong benchmark

A CAISI report reveals that DeepSeek's R1 models are highly vulnerable to agent hijacking attacks, highlighting critical security disparities compared to US-based frontier models.

This chart comes from a new report from CAISI (the Center for AI Standards and Innovation), a division within NIST under the US Department of Commerce. DeepSeek’s R1 models appear alarmingly vulnerable to agent hijacking attacks, with attack success rates reaching 98% for critical exploits like downloading malware and 89% for sending phishing emails. In contrast, US models from OpenAI and Anthropic show dramatically lower vulnerability. Whilst the industry obsesses over benchmark scores for reasoning and coding abilities, these security tests reveal equally consequential differences between models. Let’s hope they become the norm.