paper reviewed open access llmsec-2024-00043

Pandora's White-Box: Precise Training Data Detection and Extraction in Large Language Models

Jeffrey Cheng, Ruoxi Jia

2024-02 — arXiv preprint 40 citations

Abstract

Develops precise methods for detecting and extracting training data from LLMs when white-box access is available, with implications for copyright and privacy.

Framework Mappings

OWASP LLM: LLM02 MITRE ATLAS: AML.T0024

Cite This Resource

@article{llmsec202400043,
  title = {Pandora's White-Box: Precise Training Data Detection and Extraction in Large Language Models},
  author = {Jeffrey Cheng and Ruoxi Jia},
  year = {2024},
  journal = {arXiv preprint},
  url = {https://arxiv.org/abs/2402.17012},
}

Metadata

Added: 2026-04-14
Added by: manual
Source: manual
arxiv_id: 2402.17012

Pandora's White-Box: Precise Training Data Detection and Extraction in Large Language Models

Abstract

Categories

Tags

Framework Mappings

Cite This Resource

Metadata