Story
arxiv_cs_cl · Jun 22, 2026 · paper
arxiv.org · Jun 22, 2026
In brief
Enterprise agents increasingly operate inside workspaces: they read heterogeneous files, invoke tools, and deliver business artifacts. We introduce EnterpriseClawBench, an enterprise agent benchmark constructed from p...
Feed lens
agent · harness · evaluation · codex