paper · reviewed · open access · llmsec-2024-00005

The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions

Eric Wallace, Kai Xiao, Reimar Leike, Lilian Weng, Johannes Heidecke, Alex Beutel

2024-04 · arXiv preprint · 150 citations

Abstract

Proposes an instruction hierarchy that trains LLMs to give system prompts priority over user prompts, and user prompts priority over third-party content, as a defense against prompt injection.

Tags

defense, instruction-hierarchy, system-prompt

Framework Mappings

OWASP LLM: LLM01
OWASP LLM: LLM07
MITRE ATLAS: AML.T0051

Cite This Resource

@article{llmsec202400005,
  title = {The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions},
  author = {Eric Wallace and Kai Xiao and Reimar Leike and Lilian Weng and Johannes Heidecke and Alex Beutel},
  year = {2024},
  journal = {arXiv preprint},
  url = {https://arxiv.org/abs/2404.13208},
}

Metadata

Added: 2026-04-14
Added by: manual
Source: manual
arxiv_id: 2404.13208