Fashion130K: An E-commerce Fashion Dataset for Outfit Generation with Unified Multi-modal Condition Paper • 2605.10127 • Published 6 days ago • 11
UM-Text: A Unified Multimodal Model for Image Understanding Paper • 2601.08321 • Published Jan 13 • 20
Dynamic-TreeRPO: Breaking the Independent Trajectory Bottleneck with Structured Sampling Paper • 2509.23352 • Published Sep 27, 2025 • 10
Multimodal Prompt Optimization: Why Not Leverage Multiple Modalities for MLLMs Paper • 2510.09201 • Published Oct 10, 2025 • 50
Running Featured 597 Image Arena Leaderboard 📊 597 Image Generation and Image Editing Arena & Leaderboard