Submitted by akhaliq 121 Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models · 51 authors 900 4
Submitted by koalazf99 64 Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale · 5 authors 269 4
Submitted by akhaliq 14 DreamWaltz-G: Expressive 3D Gaussian Avatars from Skeleton-Guided 2D Diffusion · 6 authors 152 3
Submitted by OAOA 13 Degradation-Guided One-Step Image Super-Resolution with Diffusion Priors · 5 authors 217 5
Submitted by michaal94 13 AIM 2024 Sparse Neural Rendering Challenge: Dataset and Benchmark · 6 authors 2
Submitted by akhaliq 11 Synchronize Dual Hands for Physics-Based Dexterous Guitar Playing · 2 authors 55 2
Submitted by akhaliq 11 HyperAgent: Generalist Software Engineering Agents to Solve Coding Tasks at Scale · 3 authors 241 2
Submitted by chuanenlin 10 NoTeeline: Supporting Real-Time Notetaking from Keypoints with Large Language Models · 5 authors 3 2
Submitted by akhaliq 6 TalkinNeRF: Animatable Neural Fields for Full-Body Talking Humans · 6 authors 2
Submitted by ayshrv 6 Self-Supervised Any-Point Tracking by Contrastive Random Walks · 2 authors 55 2