Human-in-the-Loop
1 resource
Agentic AI Security
Oversight mechanisms, approval workflows, and escalation patterns
Paper, reviewed, open access, 2024
R-Judge: Benchmarking Safety Risk Awareness for LLM Agents
Tongxin Yuan, Zhiwei He, Lingzhong Dong + 9 more — EMNLP 2024
Introduces the R-Judge benchmark, which evaluates whether LLM agents can identify safety risks in agentic scenarios involving tool use and multi-step reasoning.