·
AI & ML interests
None yet
Organizations
Lux0926/Qwen2-7B-SFT-CGPO-10k
Viewer
• Updated • 10.8k • 2
Lux0926/Qwen1.5-32B-SFT-CGPO-10k
Viewer
• Updated • 10.8k • 2
Lux0926/DeepSeekMath-Base-7B-SFT-CGPO-10k
Viewer
• Updated • 10.8k • 3
Lux0926/Deepseek-Coder-7B-Instruct-v1.5-CGPO-10k
Viewer
• Updated • 10.6k • 3
Lux0926/MetaMath-Llama-8B-CGPO-10k
Viewer
• Updated • 10.8k • 20
Lux0926/MetaMath-Mistral-7B-CGPO-10k
Viewer
• Updated • 10.8k • 18
Lux0926/ASPRM-BON-Evaluation-Dataset-Code
Preview
• Updated • 101
Lux0926/ASPRM-BON-Evaluation-Dataset-Math
Preview
• Updated • 130
Lux0926/ASPRM-Math-Rollout-Result
Viewer
• Updated • 215k • 8
Lux0926/ASPRM-MATHCODE-DeepSeek-Training-Dataset
Viewer
• Updated • 99.8k • 54
• 1
Lux0926/ASPRM-MATHCODE-Mistral-Training-Dataset
Viewer
• Updated • 438k • 10
Lux0926/ASPRM-D-Training-Dataset
Viewer
• Updated • 49.9k • 7
Lux0926/ASPRM-L-Training-Dataset
Viewer
• Updated • 372k • 7
Lux0926/ASPRM-D-Training-Dataset-ORM
Viewer
• Updated • 49.9k • 7
Lux0926/ASPRM-M-Training-Dataset
Viewer
• Updated • 388k • 7
Lux0926/ASPRM-Code-Rollout-Result