Claw-Eval-Live: A Live Agent Benchmark for Evolving Real-World Workflows Paper • 2604.28139 • Published 2 days ago • 24
FAMA: Failure-Aware Meta-Agentic Framework for Open-Source LLMs in Interactive Tool Use Environments Paper • 2604.25135 • Published 4 days ago • 8
RADIO-ViPE: Online Tightly Coupled Multi-Modal Fusion for Open-Vocabulary Semantic SLAM in Dynamic Environments Paper • 2604.26067 • Published 4 days ago • 64
Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling Paper • 2604.28185 • Published 2 days ago • 74
DV-World: Benchmarking Data Visualization Agents in Real-World Scenarios Paper • 2604.25914 • Published 4 days ago • 40
ClawGym: A Scalable Framework for Building Effective Claw Agents Paper • 2604.26904 • Published 3 days ago • 46
Mutual Forcing: Dual-Mode Self-Evolution for Fast Autoregressive Audio-Video Character Generation Paper • 2604.25819 • Published 4 days ago • 16
Programming with Data: Test-Driven Data Engineering for Self-Improving LLMs from Raw Corpora Paper • 2604.24819 • Published 5 days ago • 82
nvidia/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-NVFP4 Any-to-Any • 18B • Updated about 15 hours ago • 180k • 64
Stochastic KV Routing: Enabling Adaptive Depth-Wise Cache Sharing Paper • 2604.22782 • Published 29 days ago • 6
ClawMark: A Living-World Benchmark for Multi-Turn, Multi-Day, Multimodal Coworker Agents Paper • 2604.23781 • Published 6 days ago • 31
Stabilizing Efficient Reasoning with Step-Level Advantage Selection Paper • 2604.24003 • Published 5 days ago • 6
Rewarding the Scientific Process: Process-Level Reward Modeling for Agentic Data Analysis Paper • 2604.24198 • Published 5 days ago • 19
SketchVLM: Vision language models can annotate images to explain thoughts and guide users Paper • 2604.22875 • Published 9 days ago • 31
ReVSI: Rebuilding Visual Spatial Intelligence Evaluation for Accurate Assessment of VLM 3D Reasoning Paper • 2604.24300 • Published 5 days ago • 64
Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation Paper • 2604.24763 • Published 5 days ago • 66
Contexts are Never Long Enough: Structured Reasoning for Scalable Question Answering over Long Document Sets Paper • 2604.22294 • Published 8 days ago • 16