arxiv_cs_ai · Jun 12, 2026 · paper
From Shield to Target: Denial-of-Service Attacks on LLM-Based Agent Guardrails
In brief
LLM-based guardrails have emerged as a highly effective defense against prompt injection and jailbreak attacks in autonomous agents. However, we reveal that the very reasoning and task-following capabilities enabling...
agent · evaluation