← Back to all categories

Social Engineering

2 resources

Attacks & Threats

AI-enabled phishing, deepfakes, and manipulation

paper reviewed open access 2024

LLM Agents Can Autonomously Hack Websites

Richard Fang, Rohan Bindu, Akul Gupta + 2 more — arXiv preprint

Demonstrates that LLM agents can autonomously perform web hacking tasks including SQL injection, XSS, and CSRF attacks without human guidance.

paper reviewed open access 2024

How Johnny Can Persuade LLMs to Jailbreak Them: Rethinking Persuasion to Challenge AI Safety

Yi Zeng, Hongpeng Lin, Jingwen Zhang + 3 more — ACL 2024

Applies social science persuasion techniques to jailbreak LLMs, showing high attack success rates using persuasion taxonomy.