Learning Adaptive Reasoning Paths for Efficient Visual Reasoning Paper • 2604.14568 • Published 5 days ago • 6
Towards Autonomous Mechanistic Reasoning in Virtual Cells Paper • 2604.11661 • Published 7 days ago • 4
MM-WebAgent: A Hierarchical Multimodal Web Agent for Webpage Generation Paper • 2604.15309 • Published 5 days ago • 6
Dive into Claude Code: The Design Space of Today's and Future AI Agent Systems Paper • 2604.14228 • Published 7 days ago • 22
HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds Paper • 2604.14268 • Published 6 days ago • 101
Zero-shot World Models Are Developmentally Efficient Learners Paper • 2604.10333 • Published 10 days ago • 7
From Reasoning to Agentic: Credit Assignment in Reinforcement Learning for Large Language Models Paper • 2604.09459 • Published 8 days ago • 13
Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music Paper • 2604.10905 • Published 8 days ago • 28
Tracing the Roots: A Multi-Agent Framework for Uncovering Data Lineage in Post-Training LLMs Paper • 2604.10480 • Published 9 days ago • 20
QuanBench+: A Unified Multi-Framework Benchmark for LLM-Based Quantum Code Generation Paper • 2604.08570 • Published 27 days ago • 123
ELT: Elastic Looped Transformers for Visual Generation Paper • 2604.09168 • Published 11 days ago • 19
FORGE: Fine-grained Multimodal Evaluation for Manufacturing Scenarios Paper • 2604.07413 • Published 13 days ago • 94
Matrix-Game 3.0: Real-Time and Streaming Interactive World Model with Long-Horizon Memory Paper • 2604.08995 • Published 11 days ago • 46
Flux Attention: Context-Aware Hybrid Attention for Efficient LLMs Inference Paper • 2604.07394 • Published 13 days ago • 16
OmniJigsaw: Enhancing Omni-Modal Reasoning via Modality-Orchestrated Reordering Paper • 2604.08209 • Published 12 days ago • 25
Act Wisely: Cultivating Meta-Cognitive Tool Use in Agentic Multimodal Models Paper • 2604.08545 • Published 12 days ago • 41