← Back to all categories
Vulnerability Disclosure
1 resourceRed Teaming & Evaluation
Coordinated disclosure processes for AI vulnerabilities
paper reviewed open access 2024
LLM Agents Can Autonomously Exploit One-day Vulnerabilities
Richard Fang, Rohan Bindu, Akul Gupta + 1 more — arXiv preprint
Shows that LLM agents (GPT-4) can autonomously exploit real-world one-day vulnerabilities given CVE descriptions, achieving 87% success rate.