paper · reviewed · open access · llmsec-2024-00060

Adaptive Attacks Break Defenses Against LLM Jailbreaking

Jingwei Yi, Yueqi Xie, Bin Zhu, Keegan Hines, Emre Kiciman, Guangzhong Sun, Xing Xie, Fangzhao Wu

2024 · arXiv preprint · 50 citations

Abstract

The paper shows that adaptive adversaries can bypass most proposed jailbreak defenses, highlighting the arms race between attacks and defenses.

Tags

adaptive-attacks, defense-bypass, arms-race

Framework Mappings

OWASP LLM: LLM01
MITRE ATLAS: AML.T0054

Cite This Resource

@article{llmsec202400060,
  title = {Adaptive Attacks Break Defenses Against LLM Jailbreaking},
  author = {Jingwei Yi and Yueqi Xie and Bin Zhu and Keegan Hines and Emre Kiciman and Guangzhong Sun and Xing Xie and Fangzhao Wu},
  year = {2024},
  journal = {arXiv preprint},
  url = {https://arxiv.org/abs/2404.02151},
}

Metadata

Added: 2026-04-14
Added by: manual
Source: manual
arxiv_id: 2404.02151