AWorld: Orchestrating the Training Recipe for Agentic AI Paper • 2508.20404 • Published Aug 28, 2025 • 39
🎯 Liquid Nanos Collection Library of task-specific models: https://www.liquid.ai/blog/introducing-liquid-nanos-frontier-grade-performance-on-everyday-devices • 26 items • Updated Apr 8 • 114
2025 LLM Papers on Hugging Face with Japanese Memos Collection 78 items • Updated Apr 29, 2025 • 2
Towards Best Practices for Open Datasets for LLM Training Paper • 2501.08365 • Published Jan 14, 2025 • 62
A Survey on Post-training of Large Language Models Paper • 2503.06072 • Published Mar 8, 2025 • 11
NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model Paper • 2508.14444 • Published Aug 20, 2025 • 48
Medical QA Datasets Collection A collection of medical question answering (QA) datasets • 23 items • Updated Feb 22, 2025 • 48
QoQ-Med: Building Multimodal Clinical Foundation Models with Domain-Aware GRPO Training Paper • 2506.00711 • Published May 31, 2025 • 1
SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution Paper • 2502.18449 • Published Feb 25, 2025 • 75
Phi-4 Collection Phi-4 family of small language, multi-modal and reasoning models. • 17 items • Updated Jul 10, 2025 • 207
DeepSeek-R1-ReDistill Collection Re-distilled DeepSeek R1 models • 4 items • Updated Jan 30, 2025 • 15
DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought Paper • 2412.17498 • Published Dec 23, 2024 • 22
view article Article Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging akjindal53244 • Aug 19, 2024 • 79