Anthropic and OpenAI, two leading artificial intelligence (AI) model providers, have drawn close scrutiny in recent weeks as they released their latest models, Opus 4.5 and GPT-5, respectively. The releases have highlighted a significant gap in how the two companies approach security validation: Anthropic's 153-page system card offers a far more detailed and comprehensive account of its model's security evaluations than OpenAI's 60-page system card.
According to Louis Columbus, a leading expert in AI security, the system cards released by Anthropic and OpenAI reveal a fundamental split in how the two labs approach security validation. Anthropic's system card discloses multi-attempt attack success rates drawn from 200-attempt reinforcement learning (RL) campaigns, while OpenAI reports resistance to individual jailbreak attempts. Both metrics are valid, Columbus argues, but neither tells the whole story.
In a recent report, Gray Swan's Shade platform ran adaptive adversarial campaigns against Claude models, measuring the attack success rate (ASR) of Opus 4.5 in coding environments. Opus 4.5 showed a 4.7% ASR at one attempt, 33.6% at ten attempts, and 63.0% at one hundred attempts. In computer-use settings with extended thinking, the figures were 21.9% at one attempt, 44.7% at ten attempts, and 73.5% at one hundred attempts.
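To make those multi-attempt figures concrete, the sketch below shows one common way such a metric can be computed: a target behavior counts as compromised at k attempts if any of its first k adversarial attempts succeeds, and ASR@k is the fraction of behaviors compromised. This is an illustrative reconstruction under that assumption, not Gray Swan's or Anthropic's actual pipeline; the attempt_log data and asr_at_k function are hypothetical.

```python
# Illustrative sketch of a multi-attempt attack-success-rate (ASR@k) metric.
# Assumption: attempt_log maps each targeted behavior to a list of booleans,
# one per adversarial attempt, True meaning that attempt produced a policy
# violation. This is not Gray Swan's or Anthropic's actual methodology.

from typing import Dict, List


def asr_at_k(attempt_log: Dict[str, List[bool]], k: int) -> float:
    """Fraction of behaviors with at least one successful attack in the first k attempts."""
    if not attempt_log:
        return 0.0
    compromised = sum(1 for attempts in attempt_log.values() if any(attempts[:k]))
    return compromised / len(attempt_log)


# Toy example: three target behaviors, attempt histories trimmed for brevity.
attempt_log = {
    "exfiltrate-credentials": [False, False, True],  # broke on attempt 3
    "disable-safety-filter": [False] * 10,            # never broke
    "malicious-code-gen": [True],                     # broke immediately
}

for k in (1, 10, 100):
    print(f"ASR@{k}: {asr_at_k(attempt_log, k):.1%}")
```

Read this way, single-attempt jailbreak resistance and 100-attempt attack success rates are points on the same curve, which is why comparing the two labs' headline numbers directly can mislead.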
Columbus notes that security leaders deploying AI agents for browsing, code execution, and autonomous action need to know what each red team evaluation actually measures and where the blind spots are. "The attack data shows that Opus 4.5 is vulnerable to certain types of attacks, but the extent of this vulnerability is not immediately clear," Columbus said. "This highlights the need for more transparency and detail in the system cards released by AI model providers."
The release of Opus 4.5 and GPT-5 marks a significant milestone in the development of AI models, but it also raises important questions about their security and robustness. As these models become more deeply integrated into daily life, the need for rigorous security measures grows more pressing.
The development of AI models has been driven by the need to process large amounts of data more efficiently and effectively. Reinforcement learning (RL) has emerged as a key technique for training these models, allowing them to learn from their environment and adapt to new situations. That same adaptivity, however, raises concerns that such models may be vulnerable to certain types of attacks.
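For readers unfamiliar with the term, the sketch below illustrates the core RL idea the paragraph alludes to: an agent repeatedly acts, observes a reward from its environment, and updates its behavior accordingly. It is a generic epsilon-greedy bandit toy assumed purely for illustration; it says nothing about how Anthropic or OpenAI actually train or evaluate their models.

```python
# Generic epsilon-greedy bandit: the agent learns from environment feedback.
# All payoff values are made up for illustration only.

import random

TRUE_PAYOFFS = [0.2, 0.5, 0.8]   # hidden reward probability of each action
EPSILON = 0.1                    # exploration rate
estimates = [0.0, 0.0, 0.0]      # agent's learned value estimate per action
counts = [0, 0, 0]

random.seed(0)
for _ in range(5000):
    # Explore occasionally; otherwise exploit the best-known action.
    if random.random() < EPSILON:
        action = random.randrange(3)
    else:
        action = max(range(3), key=lambda a: estimates[a])
    # The environment returns a reward; the agent updates its estimate.
    reward = 1.0 if random.random() < TRUE_PAYOFFS[action] else 0.0
    counts[action] += 1
    estimates[action] += (reward - estimates[action]) / counts[action]

print([round(e, 2) for e in estimates])  # converges toward the true payoffs
```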
According to experts, the gap in security validation between Anthropic and OpenAI highlights the need for more transparency and detail in the system cards released by AI model providers. "The system cards released by Anthropic and OpenAI provide a glimpse into the security features of their models, but they do not tell the whole story," said Dr. Rachel Kim, a leading expert in AI security. "Security leaders need to be aware of the potential vulnerabilities of AI models and take steps to mitigate them."
AI model development continues at a rapid pace, with new models released regularly, yet robust security evaluation remains a pressing concern. Experts expect the gap in security validation between Anthropic and OpenAI to remain a topic of discussion in the AI community, and the demand for more transparent, detailed system cards to intensify as AI models take on more autonomous roles in daily life.
Share & Engage Share
Share this article