breakingMarch 29, 2026

Claude claims zero-day findings in Ghost and Linux kernel during 90-minute demo

Nicholas Carlini showed a scaffolded Claude setup that reportedly found a blind SQL injection in Ghost and repeated the pattern against the Linux kernel. The attributed demo shifts cyber-capability debate from abstract evals to disclosed software targets and 90-minute workflows, so readers should treat the result as a specific reported demo.

Claude Agent Security Red Teaming

2 min read

Claude claims zero-day findings in Ghost and Linux kernel during 90-minute demo

TL;DR

Anthropic researcher Nicholas Carlini reportedly showed a scaffolded Claude setup finding a zero-day in Ghost during a live demo, shifting the discussion from abstract capability claims to a named target and a bounded workflow demo thread.
In the account summarized in the talk link post, the system found a blind SQL injection in about 90 minutes, extracted an admin API key, and then "repeated the same move" against the Linux kernel.
The same thread says Anthropic's setup used a "surprisingly minimal scaffold" and had already uncovered "500+ high-severity vulnerabilities," though the public evidence here is a conference talk summary rather than a vendor writeup or disclosure note demo thread.

What did the demo actually show?

Rohan Paul

@rohanpaul_ai

·Follow

A top Research Scientist at Anthropic showed how Claude found zero-day vulnerabilities live on stage. By Nicholas Carlini. It discovered a zero-day in Ghost, which has 50,000 stars on GitHub and had never had a critical security vulnerability in its history. In 90 minutes, it Show more

Watch on X

11:26 AM · Mar 29, 2026

199

Read 23 replies

According to the primary thread, Carlini's demo centered on two concrete case studies: Ghost CMS and a Linux kernel bug in NFS. The Ghost example is the sharper engineering claim. The summary says Claude found a blind SQL injection in Ghost, a project described there as having about 50,000 GitHub stars and no prior critical vulnerability history, then used that path to take an admin API key.

Rohan Paul

@rohanpaul_ai

·Follow

Replying to @rohanpaul_ai

youtube.com/watch?v=1sd26p…

11:26 AM · Mar 29, 2026

Read more on X

The same account, echoed in the linked-talk post, frames the result as a capability threshold: a model with a "minimal scaffold" autonomously discovering and exploiting bugs in heavily audited software within a 90-minute session. That matters because the claim is no longer just that frontier models score better on security evals; it is that a scaffolded agent reportedly completed an end-to-end vulnerability workflow against disclosed software targets. What remains public, though, is still second-order evidence around a conference presentation and the talk video, not a detailed technical paper or reproduction package.