Story
arxiv_cs_ai ยท Jun 26, 2026 ยท paper
Source brief
HAT-4D: Lifting Monocular Video for 4D Multi-Object Interactions via Human-Agent Collaboration
arxiv.orgJun 26, 2026
original source linked
In brief
Extracting dynamic 4D object interactions from massive, in-the-wild monocular videos offers a highly efficient data collection pathway for scaling Embodied AI and training VLAs. However, existing monocular 4D reconstr...
Feed lens
agenticevaluation