yoonkyumng

yoonkg

9 1

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

π-Bench: Evaluating Proactive Personal Assistant Agents in Long-Horizon Workflows

upvoted a paper about 2 months ago

Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling

upvoted a paper about 2 months ago

MinT: Managed Infrastructure for Training and Serving Millions of LLMs

View all activity

Organizations

None yet

upvoted a paper about 1 month ago

π-Bench: Evaluating Proactive Personal Assistant Agents in Long-Horizon Workflows

Paper • 2605.14678 • Published May 19 • 108

upvoted 3 papers about 2 months ago

Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling

Paper • 2605.13301 • Published May 13 • 165

MinT: Managed Infrastructure for Training and Serving Millions of LLMs

Paper • 2605.13779 • Published May 13 • 223

Teaching Thinking Models to Reason with Tools: A Full-Pipeline Recipe for Tool-Integrated Reasoning

Paper • 2605.06326 • Published May 7 • 26

upvoted 5 papers 9 months ago

Diversity-Incentivized Exploration for Versatile Reasoning

Paper • 2509.26209 • Published Sep 30, 2025 • 17

It Takes Two: Your GRPO Is Secretly DPO

Paper • 2510.00977 • Published Oct 1, 2025 • 32

CE-GPPO: Controlling Entropy via Gradient-Preserving Clipping Policy Optimization in Reinforcement Learning

Paper • 2509.20712 • Published Sep 25, 2025 • 20

Video models are zero-shot learners and reasoners

Paper • 2509.20328 • Published Sep 24, 2025 • 101

ExGRPO: Learning to Reason from Experience

Paper • 2510.02245 • Published Oct 2, 2025 • 83