EgoCS-400K: An Egocentric Gameplay Dataset for World Models Paper • 2606.18180 • Published 13 days ago • 15
LectūraAgents: A Multi-Agent Framework for Adaptive Personalized AI-Assisted Learning and Embodied Teaching Paper • 2606.16428 • Published 14 days ago • 39
SpatialClaw: Rethinking Action Interface for Agentic Spatial Reasoning Paper • 2606.13673 • Published 18 days ago • 109
From Activation to Causality: Discovery of Causal Visual Representations in the Human Brain Paper • 2605.23895 • Published May 22 • 54
World Models Meet Language Models: On the Complementarity of Concrete and Abstract Reasoning Paper • 2606.03603 • Published 27 days ago • 29
K-BrowseComp: A Web Browsing Agent Benchmark Grounded in Korean Contexts Paper • 2606.02404 • Published 28 days ago • 59
OpenWebRL: Demystifying Online Multi-turn Reinforcement Learning for Visual Web Agents Paper • 2606.02031 • Published 28 days ago • 20
Where to Look: Can Foundation Models Reach a Target Viewpoint Through Active Exploration? Paper • 2606.01247 • Published 29 days ago • 31
Joint Agent Memory and Exploration Learning via Novelty Signals Paper • 2606.01528 • Published 28 days ago • 15
SkillAdaptor: Self-Adapting Skills for LLM Agents from Trajectories Paper • 2606.01311 • Published 29 days ago • 37
Seeing Isn't Knowing: Do VLMs Know When Not to Answer Spatial Questions (and Why)? Paper • 2605.30557 • Published May 28 • 12
WorldMemArena: Evaluating Multimodal Agent Memory Through Action-World Interaction Paper • 2605.29341 • Published May 28 • 18
Skill0.5: Joint Skill Internalization and Utilization for Out-of-Distribution Generalization in Agentic Reinforcement Learning Paper • 2605.28424 • Published May 27 • 32
Why Far Looks Up: Probing Spatial Representation in Vision-Language Models Paper • 2605.30161 • Published May 28 • 60
Map2World: Segment Map Conditioned Text to 3D World Generation Paper • 2605.00781 • Published May 1 • 25