Lei Wang

demolei

https://demoleiwang.github.io/HomePage/

AI & ML interests

LLMs

Recent Activity

upvoted a paper 1 day ago

RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards

Paper • 2605.10899 • Published 3 days ago • 66

authored 6 papers 1 day ago

Self-Rewarding Sequential Monte Carlo for Masked Diffusion Language Models

Paper • 2602.01849 • Published Feb 2 • 5

AgentSkiller: Scaling Generalist Agent Intelligence through Semantically Integrated Cross-Domain Data Synthesis

Paper • 2602.09372 • Published Feb 10 • 7

From Perception to Action: An Interactive Benchmark for Vision Reasoning

Paper • 2602.21015 • Published Feb 24 • 24

Document Reconstruction Unlocks Scalable Long-Context RLVR

Paper • 2602.08237 • Published Feb 9

MiroEval: Benchmarking Multimodal Deep Research Agents in Process and Outcome

Paper • 2603.28407 • Published Mar 30 • 70

MARS: Enabling Autoregressive Models Multi-Token Generation

Paper • 2604.07023 • Published Apr 8 • 38

upvoted a paper 2 days ago

Dynamic Skill Lifecycle Management for Agentic Reinforcement Learning

Paper • 2605.10923 • Published 3 days ago • 12

upvoted a paper 7 days ago

ARIS: Autonomous Research via Adversarial Multi-Agent Collaboration

Paper • 2605.03042 • Published 10 days ago • 112

upvoted 6 papers 21 days ago

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published Apr 8 • 324

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 628

DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models

Paper • 2603.26164 • Published Mar 27 • 364

DR-Venus: Towards Frontier Edge-Scale Deep Research Agents with Only 10K Open Data

Paper • 2604.19859 • Published 23 days ago • 51

SWE-chat: Coding Agent Interactions From Real Users in the Wild

Paper • 2604.20779 • Published 22 days ago • 14

LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model

Paper • 2604.20796 • Published 22 days ago • 240

upvoted a paper 22 days ago

Mind DeepResearch Technical Report

Paper • 2604.14518 • Published 27 days ago • 23

upvoted 3 papers 23 days ago

Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence

Paper • 2604.18292 • Published 24 days ago • 84

WebCompass: Towards Multimodal Web Coding Evaluation for Code Language Models

Paper • 2604.18224 • Published 24 days ago • 22

ClawEnvKit: Automatic Environment Generation for Claw-Like Agents

Paper • 2604.18543 • Published 24 days ago • 29

upvoted a paper 29 days ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published about 1 month ago • 94