This collection contains curriculum-RLed Olmo models.
SeanWang0027 PRO
SeanWang0027
AI & ML interests
Continual Learning
Recent Activity
updated a model about 12 hours ago
SeanWang0027/rl_warm_up_mixed_minesweeper_correct-parquet_qwen3-1.7b_epoch_1_mask published a model about 12 hours ago
SeanWang0027/rl_warm_up_mixed_minesweeper_correct-parquet_qwen3-1.7b_epoch_1_mask published a model about 12 hours ago
SeanWang0027/rl_warm_up_stitch_minesweeper_3K-parquet_qwen3-1.7b_epoch_1_mask