
Share
As AI advances swiftly in fields like cybersecurity and biotech, Anthropic’s Red Team report reveals both groundbreaking potential and persistent risks, underscoring the necessity for stringent safeguards to protect national security.
In a world where artificial intelligence (AI) is increasingly integrated into various aspects of society, understanding its potential risks to national security has become more critical than ever. A recent report from Anthropic’s Frontier Red Team offers valuable insights into the rapid progress of AI models, particularly in cybersecurity and biological knowledge. While these advancements are significant, they also highlight the ongoing need for robust risk management strategies.
The implications of advanced AI in cybersecurity and biotechnology are far-reaching. Cybersecurity breaches can lead to the theft of sensitive data, disruption of critical infrastructure, and even physical harm. In biotechnology, AI could potentially accelerate the development of new therapies or, conversely, be misused for harmful purposes. Understanding these risks is crucial for policymakers, security experts, and the public to ensure that we harness AI's benefits while mitigating its dangers.
In 2024, Anthropic’s AI model, Claude, made a significant leap in cybersecurity capabilities. During Capture The Flag (CTF) exercises, which simulate real-world hacking scenarios, Claude improved from the level of a high school student to that of an undergraduate in just one year. This progress is not just incremental; it represents a substantial enhancement in the model’s ability to identify and exploit software vulnerabilities.
Figure 1: In less than a year, Claude went from solving less than a quarter to nearly all Intercode CTF challenges, publicly available CTFs that were designed for high school competitions.
Anthropic's latest model, Claude 3.7 Sonnet, has continued this trend of improvement. On Cybench, a public benchmark using CTF challenges to evaluate language models (LLMs), Claude 3.7 Sonnet solves about a third of the challenges within five attempts, up from about five percent with their previous frontier model at this time last year (see Figure 2).
Figure 2: Claude 3.7 Sonnet shows clear improvement on the Cybench CTF benchmark compared to the previous two generations of models, even without using extended thinking.
These advancements span various types of cybersecurity tasks. For instance, Claude has shown improvements in discovering and exploiting vulnerabilities in insecure software (‘pwn’), web applications (‘web’), and cryptographic primitives and protocols (‘crypto’). However, it's important to note that Claude’s skills still lag behind those of expert humans, particularly in complex areas like reverse engineering.

In the realm of biological knowledge, AI models are also making strides. Some models have reached or exceeded undergraduate-level skills in certain areas, demonstrating a deep understanding of complex biological processes. This could be immensely beneficial for medical research and drug development, but it also raises concerns about potential misuse.
For example, advanced AI could theoretically aid in the design of bioweapons, though current models are far from achieving this level of sophistication. The ethical implications of such capabilities are significant, and they underscore the need for stringent regulations and oversight to prevent harmful applications.
While AI's rapid progress is impressive, it's crucial to recognize that real-world risks depend on more than just the capabilities of AI models. Physical constraints, specialized equipment, human expertise, and practical implementation challenges all play a role in determining how these technologies are used. For instance, even if an AI can identify a software vulnerability, exploiting it often requires specific tools and knowledge that may not be readily available.
Anthropic’s Frontier Red Team has identified several best practices for evaluating the risks associated with advanced AI:
The rapid advancements in AI, particularly in cybersecurity and biological knowledge, are a double-edged sword. While they offer exciting possibilities for innovation and progress, they also present significant risks that
Tags
Original Sources
About the author
Amara's entry point into AI was an epidemiology role at a London research hospital, where she spent five years studying how digital health tools reached — or conspicuously failed to reach — underserved communities. Watching early algorithmic systems in healthcare quietly entrench existing inequalities, she redirected her career toward the systemic consequences of AI at scale. She covers AI through an unflinching lens: who benefits, who bears the cost, and what evidence actually says versus what the press release claims. Her writing is calm and precise, but she doesn't mistake balance for neutrality.
More from The Steward →This Week's Edition
20 March 2025
133 articles
Related Articles
Related Articles
More Stories