Story

arxiv_llm_reliability · Jun 26, 2026 · paper

Source brief

When Search Agents Should Ask: DiscoBench for Clarification-Aware Deep Search

arxiv.org · Jun 26, 2026

In brief

Search agents powered by large language models (LLMs) are increasingly used to solve complex information-seeking tasks, requiring multi-step retrieval and reasoning to fulfill user goals. However, existing benchmarks...

Feed lens
agenteval
