arxiv:2509.22611
Junkang Wu
junkang0909
AI & ML interests
LLM alignment
Recent Activity
upvoted a paper 8 days ago
Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning upvoted a paper 8 days ago
Rubric-based On-policy Distillation upvoted a paper about 2 months ago
On the Direction of RLVR Updates for LLM Reasoning: Identification and ExploitationOrganizations
None yet