torch tcapelle/train_ds_triton Viewer • Updated May 21, 2025 • 887 • 9 • 1 predibase/Predibase-T2T-32B-RFT 33B • Updated Mar 19, 2025 • 5 • 20 GPUMODE/KernelBook Viewer • Updated Feb 5 • 18.2k • 494 • 51
safety Running on CPU Upgrade Agents 93 LLM Safety Leaderboard 🥇 93 Explore and submit LLM benchmarks
SFT meta-math/MetaMathQA Viewer • Updated Dec 21, 2023 • 395k • 60.7k • 458 HuggingFaceH4/ultrachat_200k Viewer • Updated Oct 16, 2024 • 515k • 65.4k • 703 HuggingFaceH4/ultrafeedback_binarized Viewer • Updated Oct 16, 2024 • 187k • 14.1k • 334 openai/gsm8k Benchmark • Updated Mar 23 • 17.6k • 914k • 1.3k
Math meta-math/MetaMathQA Viewer • Updated Dec 21, 2023 • 395k • 60.7k • 458 open-r1/OpenR1-Math-220k Viewer • Updated Feb 18, 2025 • 450k • 22.8k • 744
torch tcapelle/train_ds_triton Viewer • Updated May 21, 2025 • 887 • 9 • 1 predibase/Predibase-T2T-32B-RFT 33B • Updated Mar 19, 2025 • 5 • 20 GPUMODE/KernelBook Viewer • Updated Feb 5 • 18.2k • 494 • 51
safety Running on CPU Upgrade Agents 93 LLM Safety Leaderboard 🥇 93 Explore and submit LLM benchmarks
Math meta-math/MetaMathQA Viewer • Updated Dec 21, 2023 • 395k • 60.7k • 458 open-r1/OpenR1-Math-220k Viewer • Updated Feb 18, 2025 • 450k • 22.8k • 744
SFT meta-math/MetaMathQA Viewer • Updated Dec 21, 2023 • 395k • 60.7k • 458 HuggingFaceH4/ultrachat_200k Viewer • Updated Oct 16, 2024 • 515k • 65.4k • 703 HuggingFaceH4/ultrafeedback_binarized Viewer • Updated Oct 16, 2024 • 187k • 14.1k • 334 openai/gsm8k Benchmark • Updated Mar 23 • 17.6k • 914k • 1.3k