MASCing: Configurable Mixture-of-Experts Behavior via Activation Steering Masks Paper • 2604.27818 • Published 19 days ago • 5
Heterogeneous Scientific Foundation Model Collaboration Paper • 2604.27351 • Published 19 days ago • 215
Self-Evolving LLM Memory Extraction Across Heterogeneous Tasks Paper • 2604.11610 • Published Apr 13 • 7
DiPO: Disentangled Perplexity Policy Optimization for Fine-grained Exploration-Exploitation Trade-Off Paper • 2604.13902 • Published Apr 15 • 62
CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence Paper • 2603.28032 • Published Mar 30 • 342
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization Paper • 2603.19835 • Published Mar 20 • 351