arxiv:2507.01949
Xiao Hu
huxiao09
AI & ML interests
Reinforcement Learning, LLM Reasoning
Recent Activity
upvoted an article 9 days ago
Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries
upvoted a paper 4 months ago
Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting
liked a model 5 months ago
Kwai-Keye/Keye-VL-671B-A37B
Organizations
None yet