DyCon: Dynamic Reasoning Control via Evolving Difficulty Modeling Paper • 2606.07108 • Published 6 days ago • 1
DyCon: Dynamic Reasoning Control via Evolving Difficulty Modeling Paper • 2606.07108 • Published 6 days ago • 1
FlashVID: Efficient Video Large Language Models via Training-free Tree-based Spatiotemporal Token Merging Paper • 2602.08024 • Published Feb 8 • 2
Less Is More, but Where? Dynamic Token Compression via LLM-Guided Keyframe Prior Paper • 2512.06866 • Published Dec 7, 2025 • 5
DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception Paper • 2505.04410 • Published May 7, 2025 • 44
Generalized Decoupled Learning for Enhancing Open-Vocabulary Dense Perception Paper • 2508.11256 • Published Aug 15, 2025
FlashVID: Efficient Video Large Language Models via Training-free Tree-based Spatiotemporal Token Merging Paper • 2602.08024 • Published Feb 8 • 2
Less Is More, but Where? Dynamic Token Compression via LLM-Guided Keyframe Prior Paper • 2512.06866 • Published Dec 7, 2025 • 5
OV-DQUO: Open-Vocabulary DETR with Denoising Text Query Training and Open-World Unknown Objects Supervision Paper • 2405.17913 • Published May 28, 2024
Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations Paper • 2510.23607 • Published Oct 27, 2025 • 181
VisionZip: Longer is Better but Not Necessary in Vision Language Models Paper • 2412.04467 • Published Dec 5, 2024 • 119