Vulnerability Disclosure

1 resource

Red Teaming & Evaluation

Coordinated disclosure processes for AI vulnerabilities

paper reviewed open access 2024

Richard Fang, Rohan Bindu, Akul Gupta + 1 more — arXiv preprint

Shows that LLM agents (GPT-4) can autonomously exploit real-world one-day vulnerabilities given CVE descriptions, achieving 87% success rate.