← Back to search
paper reviewed open access llmsec-2024-00060
Adaptive Attacks Break Defenses Against LLM Jailbreaking
Jingwei Yi, Yueqi Xie, Bin Zhu, Keegan Hines, Emre Kiciman, Guangzhong Sun, Xing Xie, Fangzhao Wu
2024 — arXiv preprint 50 citations
Abstract
Shows that adaptive adversaries can bypass most proposed jailbreak defenses, highlighting the arms race between attacks and defenses.
Categories
Tags
adaptive-attacksdefense-bypassarms-race
Framework Mappings
OWASP LLM: LLM01 MITRE ATLAS: AML.T0054
Cite This Resource
@article{llmsec202400060,
title = {Adaptive Attacks Break Defenses Against LLM Jailbreaking},
author = {Jingwei Yi and Yueqi Xie and Bin Zhu and Keegan Hines and Emre Kiciman and Guangzhong Sun and Xing Xie and Fangzhao Wu},
year = {2024},
journal = {arXiv preprint},
url = {https://arxiv.org/abs/2404.02151},
} Metadata
- Added
- 2026-04-14
- Added by
- manual
- Source
- manual
- arxiv_id
- 2404.02151