-
Open-Reasoner-Zero/Open-Reasoner-Zero-32B
Reinforcement Learning • Updated • 103 • 33 -
Open-Reasoner-Zero/Open-Reasoner-Zero-7B
Reinforcement Learning • 8B • Updated • 2.71k • 33 -
Open-Reasoner-Zero/Open-Reasoner-Zero-1.5B
Reinforcement Learning • 2B • Updated • 93 • 1 -
Open-Reasoner-Zero/Open-Reasoner-Zero-0.5B
Reinforcement Learning • 0.5B • Updated • 32
AI & ML interests
Scale up the Reasoner-Zero Training
Organization Card
Welcome to Open-Reasoner-Zero!
Please check our GitHub!
-
Open-Reasoner-Zero/Open-Reasoner-Zero-32B
Reinforcement Learning • Updated • 103 • 33 -
Open-Reasoner-Zero/Open-Reasoner-Zero-7B
Reinforcement Learning • 8B • Updated • 2.71k • 33 -
Open-Reasoner-Zero/Open-Reasoner-Zero-1.5B
Reinforcement Learning • 2B • Updated • 93 • 1 -
Open-Reasoner-Zero/Open-Reasoner-Zero-0.5B
Reinforcement Learning • 0.5B • Updated • 32
models 9
Open-Reasoner-Zero/ORZ-R1-Distill-Qwen-14B
15B • Updated • 3 • 2
Open-Reasoner-Zero/Open-Reasoner-Zero-Critic-32B
Reinforcement Learning • 32B • Updated • 12 • 7
Open-Reasoner-Zero/Open-Reasoner-Zero-Critic-7B
Reinforcement Learning • 7B • Updated • 18 • 1
Open-Reasoner-Zero/Open-Reasoner-Zero-Critic-0.5B
Reinforcement Learning • 0.5B • Updated • 7
Open-Reasoner-Zero/Open-Reasoner-Zero-7B
Reinforcement Learning • 8B • Updated • 2.71k • 33
Open-Reasoner-Zero/Open-Reasoner-Zero-32B
Reinforcement Learning • Updated • 103 • 33
Open-Reasoner-Zero/Open-Reasoner-Zero-0.5B
Reinforcement Learning • 0.5B • Updated • 32
Open-Reasoner-Zero/Open-Reasoner-Zero-Critic-1.5B
Reinforcement Learning • 2B • Updated • 5 • 1
Open-Reasoner-Zero/Open-Reasoner-Zero-1.5B
Reinforcement Learning • 2B • Updated • 93 • 1