Human-in-the-Loop

1 resource

Agentic AI Security

Oversight mechanisms, approval workflows, and escalation patterns

Paper, reviewed, open access, 2024

R-Judge: Benchmarking Safety Risk Awareness for LLM Agents

Tongxin Yuan, Zhiwei He, Lingzhong Dong + 9 more — EMNLP 2024

Introduces R-Judge, a benchmark for evaluating whether LLM agents can identify safety risks in agentic scenarios involving tool use and multi-step reasoning.