MolmoAct2: Action Reasoning Models for Real-world Deployment Paper • 2605.02881 • Published 13 days ago • 327
Stream-R1: Reliability-Perplexity Aware Reward Distillation for Streaming Video Generation Paper • 2605.03849 • Published 12 days ago • 123
Video Analysis and Generation via a Semantic Progress Function Paper • 2604.22554 • Published 23 days ago • 63
Extending One-Step Image Generation from Class Labels to Text via Discriminative Text Representation Paper • 2604.18168 • Published 27 days ago • 97
Grounding World Simulation Models in a Real-World Metropolis Paper • 2603.15583 • Published Mar 16 • 153
Geometry-Guided Reinforcement Learning for Multi-view Consistent 3D Scene Editing Paper • 2603.03143 • Published Mar 3 • 145
OmniTransfer: All-in-one Framework for Spatio-temporal Video Transfer Paper • 2601.14250 • Published Jan 20 • 48
ShapeR: Robust Conditional 3D Shape Generation from Casual Captures Paper • 2601.11514 • Published Jan 16 • 24
Sharp Monocular View Synthesis in Less Than a Second Paper • 2512.10685 • Published Dec 11, 2025 • 30
Reflection Removal through Efficient Adaptation of Diffusion Transformers Paper • 2512.05000 • Published Dec 4, 2025 • 18
view article Article Diffusers welcomes FLUX-2 +6 YiYiXu, dg845, sayakpaul, OzzyGT, dn6, ariG23498, linoyts, multimodalart • Nov 25, 2025 • 191
Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation Paper • 2511.14993 • Published Nov 19, 2025 • 233
Time-to-Move: Training-Free Motion Controlled Video Generation via Dual-Clock Denoising Paper • 2511.08633 • Published Nov 9, 2025 • 56