Running 114 Unlocking On-Policy Distillation for Any Model Family 📝 114 Explore on-policy distillation visualization for any model
Running 3.9k The Ultra-Scale Playbook 🌌 3.9k The ultimate guide to training LLM on large GPU Clusters
Running on CPU Upgrade Agents Featured 1.01k Model Memory Utility 🚀 1.01k Calculate GPU memory needed for training HF Mirror models
Running on CPU Upgrade 14k Open LLM Leaderboard 🏆 14k Track, rank and evaluate open LLMs and chatbots
Running MCP 191 Recommend Similar Papers 🌖 191 Get similar paper recommendations from a HF Mirror link