This collection contains all the GRPO-trained models for our paper "A Rising Tide Lifts All Boats". Please consider citing us!
Ishika Agarwal
ishikaa
AI & ML interests
active learning, reinforcement learning, reasoning, planning, NLP
Recent Activity
updated a model about 4 hours ago: ishikaa/UAS_qwen7b_only_alpaca_uniform
published a model about 4 hours ago: ishikaa/UAS_qwen7b_only_alpaca_uniform
updated a model about 6 hours ago: ishikaa/UAS_qwen7b_only_medmcqa_uniform