Social Engineering
2 resources · Attacks & Threats
AI-enabled phishing, deepfakes, and manipulation
paper reviewed open access 2024
LLM Agents Can Autonomously Hack Websites
Richard Fang, Rohan Bindu, Akul Gupta + 2 more — arXiv preprint
Demonstrates that LLM agents can autonomously carry out web hacking tasks, including SQL injection, XSS, and CSRF attacks, without human guidance.
paper reviewed open access 2024
How Johnny Can Persuade LLMs to Jailbreak Them: Rethinking Persuasion to Challenge AI Safety
Yi Zeng, Hongpeng Lin, Jingwen Zhang + 3 more — ACL 2024
Applies persuasion techniques from social science to jailbreak LLMs, reporting high attack success rates for prompts built from a persuasion taxonomy.