← Back to all categories

Vulnerability Disclosure

1 resource

Red Teaming & Evaluation

Coordinated disclosure processes for AI vulnerabilities

paper reviewed open access 2024

LLM Agents Can Autonomously Exploit One-day Vulnerabilities

Richard Fang, Rohan Bindu, Akul Gupta + 1 more — arXiv preprint

Shows that LLM agents (GPT-4) can autonomously exploit real-world one-day vulnerabilities given CVE descriptions, achieving 87% success rate.