paper, reviewed, open access, llmsec-2024-00005
The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions
Eric Wallace, Kai Xiao, Reimar Leike, Lilian Weng, Johannes Heidecke, Alex Beutel
2024-04, arXiv preprint, 150 citations
Abstract
Proposes an instruction hierarchy for training LLMs to prioritize system prompts over user prompts over third-party content, as a defense against prompt injection.
Categories
Tags
defense, instruction-hierarchy, system-prompt
Framework Mappings
OWASP LLM: LLM01, OWASP LLM: LLM07, MITRE ATLAS: AML.T0051
Cite This Resource
@article{llmsec202400005,
title = {The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions},
author = {Eric Wallace and Kai Xiao and Reimar Leike and Lilian Weng and Johannes Heidecke and Alex Beutel},
year = {2024},
journal = {arXiv preprint},
url = {https://arxiv.org/abs/2404.13208},
}
Metadata
- Added: 2026-04-14
- Added by: manual
- Source: manual
- arxiv_id: 2404.13208