DavidAU/Qwen3.6-40B-Claude-4.6-Opus-Deckard-Heretic-Uncensored-Thinking-NEO-CODE-Di-IMatrix-MAX-GGUF Image-Text-to-Text • 39B • Updated 2 days ago • 207k • 99
Running 159 The ultimate guide to RL environments: building and scaling them in the LLM era 📝 159 Building and scaling RL environments for LLM training
Controlling Multimodal LLMs via Reward-guided Decoding Paper • 2508.11616 • Published Aug 15, 2025 • 7
JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse Paper • 2503.16365 • Published Mar 20, 2025 • 41
Running Agents 42 MVBench Leaderboard 🐨 42 Submit and view model evaluation results in a leaderboard format
SRMT: Shared Memory for Multi-agent Lifelong Pathfinding Paper • 2501.13200 • Published Jan 22, 2025 • 70
Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos Paper • 2501.13826 • Published Jan 23, 2025 • 24
Video Depth Anything: Consistent Depth Estimation for Super-Long Videos Paper • 2501.12375 • Published Jan 21, 2025 • 23
3DIS-FLUX: simple and efficient multi-instance generation with DiT rendering Paper • 2501.05131 • Published Jan 9, 2025 • 37