Rank-GRPO: Training LLM-based Conversational Recommender Systems with Reinforcement Learning Paper • 2510.20150 • Published Oct 23, 2025 • 7
MIMIGenRec Collection A collection of MIMIGenRec ckpt, including sft and rl model • 8 items • Updated Mar 6
MIMIGenRec Collection A collection of MIMIGenRec ckpt, including sft and rl model • 8 items • Updated Mar 6