TIDE: Proactive Multi-Problem Discovery via Template-Guided Iteration Paper • 2606.04743 • Published 18 days ago • 45
OmniRetrieval: Unified Retrieval across Heterogeneous Knowledge Sources Paper • 2605.29250 • Published 24 days ago • 76
OmniRetrieval: Unified Retrieval across Heterogeneous Knowledge Sources Paper • 2605.29250 • Published 24 days ago • 76
Self-Improving Language Models with Bidirectional Evolutionary Search Paper • 2605.28814 • Published 25 days ago • 60
Learn from Weaknesses: Automated Domain Specialization for Small Computer-Use Agents Paper • 2605.28775 • Published 25 days ago • 38
Agent Explorative Policy Optimization for Multimodal Agentic Reasoning Paper • 2605.28774 • Published 25 days ago • 92
HINT-SD: Targeted Hindsight Self-Distillation for Long-Horizon Agents Paper • 2605.17873 • Published May 18 • 12
HINT-SD: Targeted Hindsight Self-Distillation for Long-Horizon Agents Paper • 2605.17873 • Published May 18 • 12
It Takes Two: Complementary Self-Distillation for Contextual Integrity in LLMs Paper • 2605.20258 • Published May 18 • 30
It Takes Two: Complementary Self-Distillation for Contextual Integrity in LLMs Paper • 2605.20258 • Published May 18 • 30
Nudging Beyond the Comfort Zone: Efficient Strategy-Guided Exploration for RLVR Paper • 2605.15726 • Published May 15 • 34