arxiv_llm_reliability ยท Jun 18, 2026 ยท paper
arxiv.orgJun 18, 2026
original source linked
In brief
Multimodal Large Language Models (MLLMs) have demonstrated remarkable success in various Remote Sensing (RS) tasks. However, their ability to comprehend negation remains underexplored, limiting deployment in real-worl...
Feed lens
evaluation