Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
2
3
Yurun Yuan
PRO
RyanYr
Follow
KenCao2007's profile picture
xuanfeiren's profile picture
21world's profile picture
6 followers
·
2 following
yurun-yuan
AI & ML interests
None yet
Recent Activity
updated
a dataset
3 days ago
RyanYr/pg_sais-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B_kl_bl0_matheval
updated
a model
4 days ago
RyanYr/pg_sais-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B_kl_bl0
published
a model
4 days ago
RyanYr/pg_sais-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B_kl_bl0
View all activity
Organizations
None yet
RyanYr
's models
30
Sort: Recently updated
RyanYr/pg_sais-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B_kl_bl0
Updated
4 days ago
•
49
RyanYr/pg_sais-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B_kl_bl0_200
Updated
4 days ago
•
1
RyanYr/pg-dapo_shuffled-10_offline-grpo_qwen2.5-math-1.5B_piref_kl_behavior
Updated
4 days ago
•
26
RyanYr/pg_sais-dapo_shuffled-0_offline-grpo_qwen2.5-math-1.5B_kl
Updated
4 days ago
•
49
RyanYr/pg-dapo_shuffled-01_offline-grpo_qwen2.5-math-1.5B_piref_nokl
Updated
4 days ago
•
53
RyanYr/pg-dapo_shuffled-0_offline-grpo_qwen2.5-math-1.5B_piref_nokl
Updated
4 days ago
•
53
RyanYr/pg-dapo_shuffled-0_offline-grpo_qwen2.5-math-1.5B_piref_kl
Updated
4 days ago
•
53
RyanYr/pg-dapo_shuffled-01_offline-grpo_qwen2.5-math-1.5B_piref_kl_behavior
Updated
4 days ago
•
57
RyanYr/pg_trajis-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B_piref
Updated
4 days ago
•
55
RyanYr/pg-dapo_shuffled-0_offline-grpo_qwen2.5-math-1.5B_piref_kl_behavior
Updated
4 days ago
•
56
RyanYr/pg_sais-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B_piref
Updated
4 days ago
•
53
RyanYr/pg-dapo_shuffled-01_offline-grpo_qwen2.5-math-1.5B_piref_kl
Updated
4 days ago
•
53
RyanYr/pg_sais-dapo_shuffled-01_offline-grpo_qwen2.5-math-1.5B_kl
Updated
4 days ago
•
51
RyanYr/pg-dapo_shuffled-10_offline-grpo_qwen2.5-math-1.5B_piref_nokl
Updated
4 days ago
•
5
RyanYr/pg-dapo_shuffled-10_offline-grpo_qwen2.5-math-1.5B_piref_kl
Updated
4 days ago
•
6
RyanYr/pg-dapo_shuffled-10_offline-grpo_qwen2.5-math-1.5B_nokl
Updated
5 days ago
•
41
RyanYr/pg-dapo_shuffled-10_offline-grpo_qwen2.5-math-1.5B_kl
Updated
5 days ago
•
36
RyanYr/pg-dapo_shuffled-10_offline-grpo_qwen2.5-math-1.5B_kl_behavior
Updated
5 days ago
•
35
RyanYr/pg-dapo_shuffled-01_offline-grpo_qwen2.5-math-1.5B_nokl
Updated
5 days ago
•
39
RyanYr/pg-dapo_shuffled-0_offline-grpo_qwen2.5-math-1.5B_nokl
Updated
5 days ago
•
32
RyanYr/pg-dapo_shuffled-0_offline-grpo_qwen2.5-math-1.5B_kl
Updated
5 days ago
•
33
RyanYr/pg-dapo_shuffled-0_offline-grpo_qwen2.5-math-1.5B_kl_behavior
Updated
5 days ago
•
32
RyanYr/pg_trajis-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B
Updated
5 days ago
•
36
RyanYr/pg_sais-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B
Updated
5 days ago
•
37
RyanYr/pg-dapo_shuffled-01_offline-grpo_qwen2.5-math-1.5B_kl_behavior
Updated
5 days ago
•
45
RyanYr/pg-dapo_shuffled-01_offline-grpo_qwen2.5-math-1.5B_kl
Updated
6 days ago
•
25
RyanYr/grpo-dapo-qwen2.5-math-1.5B-n4
Updated
6 days ago
RyanYr/grpo-dapo-qwen3-1.7B-Base-mbs128-n4
Updated
19 days ago
•
35
RyanYr/grpo-dapo_offline-qwen2.5math-1.5B-base-mbs256-n8_actor
Updated
Feb 25
•
3
RyanYr/grpo-dapo-01_offline-qwen2.5math-1.5B-base-mbs256-n8_actor
Updated
Feb 25