RedHatAI/Qwen3-30B-A3B-quantized.w4a16 Text Generation • 5B • Updated May 13, 2025 • 1.7k • 7
mistralai/Voxtral-Small-24B-2507 Audio-Text-to-Text • 24B • Updated Dec 20, 2025 • 46.3k • 493
Running 3.83k The Ultra-Scale Playbook 🌌 3.83k The ultimate guide to training LLM on large GPU Clusters
Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning Paper • 2505.24726 • Published May 30, 2025 • 282
Writing in the Margins: Better Inference Pattern for Long Context Retrieval Paper • 2408.14906 • Published Aug 27, 2024 • 144