Collection dedicated to all the datasets, checkpoints and any additional artifacts for Tiny Think
Bojan Jakimovski
Shekswess
AI & ML interests
AWS Ambassador | Machine Learning & Applied Research Lead | College Professor
Recent Activity
updated a dataset 2 days ago
lokahq/bioreason-rl
updated a model 2 days ago
lokahq/Trinity-Mini-DrugProt-Think
published a dataset 8 days ago
lokahq/bioreason-rl
Organizations
models 31
Shekswess/tiny-think-dpo-math-stem-apo_zero-beta0_3-lr3e-6-e1-bs8
Text Generation • 0.1B • Updated • 2
Shekswess/tiny-think-dpo-math-stem-apo_zero-beta1-lr3e-6-e1-bs8
Text Generation • 0.1B • Updated • 2
Shekswess/tiny-think-dpo-math-stem-apo_zero-beta0_5-lr3e-6-e1-bs8
Text Generation • 0.1B • Updated • 2
Shekswess/tiny-think-dpo-math-stem-dpo-beta1-lr3e-6-e1-bs8
Text Generation • 0.1B • Updated • 3
Shekswess/tiny-think-dpo-math-stem-dpo-beta1-lr5e-6-e1-bs8
Text Generation • 0.1B • Updated • 3
Shekswess/tiny-think-dpo-math-stem-dpo-beta1-lr1e-6-e1-bs8
Text Generation • 0.1B • Updated • 3
Shekswess/tiny-think-dpo-math-stem-dpo-beta2-lr2e-6-e1-bs8
Text Generation • 0.1B • Updated • 3
Shekswess/tiny-think-dpo-math-stem-dpo-beta1-lr2e-6-e1-bs8
Text Generation • 0.1B • Updated • 4
Shekswess/tiny-think-dpo-math-stem-dpo-beta0-5-lr2e-6-e1-bs8
Text Generation • 0.1B • Updated • 3
Shekswess/tiny-think-sft-math-stem-loss-nll-bf16-e2-bs8
Text Generation • 0.1B • Updated • 3
datasets 38
Shekswess/OpenR1-Math-10k-GRPO
Viewer • Updated • 10k • 13
Shekswess/OpenR1-Math-10k-SFT
Viewer • Updated • 10k • 22
Shekswess/OpenR1-Math-10k-Raw
Viewer • Updated • 10k • 42
Shekswess/fineweb-edu-700m
Viewer • Updated • 681k • 49
Shekswess/tiny-think-sft-math-n-stem
Viewer • Updated • 29.1k • 17
Shekswess/tiny-think-dpo-math-n-stem
Viewer • Updated • 2.86k • 19
Shekswess/trlm-sft-stage-1-final-2
Viewer • Updated • 58k • 19 • 1
Shekswess/trlm-sft-stage-2-final-2
Viewer • Updated • 78k • 20
Shekswess/trlm-dpo-stage-3-final-2
Viewer • Updated • 50k • 12
Shekswess/customer-support
Viewer • Updated • 1k • 20 • 1