1 40 168

peng

superpeng

AI & ML interests

None yet

Recent Activity

upvoted a paper 8 days ago

AWorld: Orchestrating the Training Recipe for Agentic AI

upvoted a collection 8 days ago

Reasoning

upvoted a collection 8 days ago

Image Generation

View all activity

Organizations

None yet

upvoted a paper 8 days ago

AWorld: Orchestrating the Training Recipe for Agentic AI

Paper • 2508.20404 • Published Aug 28, 2025 • 39

upvoted 3 collections 8 days ago

upvoted a paper 8 months ago

Fast Segment Anything

Paper • 2306.12156 • Published Jun 21, 2023 • 36

upvoted 2 collections 8 months ago

🎯 Liquid Nanos

Collection

Library of task-specific models: https://www.liquid.ai/blog/introducing-liquid-nanos-frontier-grade-performance-on-everyday-devices • 26 items • Updated Apr 8 • 114

Papers to Read

Collection

208 items • Updated Aug 24, 2025 • 11

upvoted a paper 8 months ago

JudgeLRM: Large Reasoning Models as a Judge

Paper • 2504.00050 • Published Mar 31, 2025 • 61

upvoted a collection 8 months ago

2025 LLM Papers on Hugging Face with Japanese Memos

Collection

78 items • Updated Apr 29, 2025 • 2

upvoted 2 papers 8 months ago

Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published Jan 14, 2025 • 62

A Survey on Post-training of Large Language Models

Paper • 2503.06072 • Published Mar 8, 2025 • 11

upvoted a paper 9 months ago

NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

Paper • 2508.14444 • Published Aug 20, 2025 • 48

upvoted 2 collections 9 months ago

II-Medical

Collection

9 items • Updated Jul 4, 2025 • 16

Medical QA Datasets

Collection

A collection of medical question answering (QA) datasets • 23 items • Updated Feb 22, 2025 • 48

upvoted a paper 10 months ago

QoQ-Med: Building Multimodal Clinical Foundation Models with Domain-Aware GRPO Training

Paper • 2506.00711 • Published May 31, 2025 • 1

upvoted a paper about 1 year ago

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

Paper • 2502.18449 • Published Feb 25, 2025 • 75

upvoted 2 collections about 1 year ago

Phi-4

Collection

Phi-4 family of small language, multi-modal and reasoning models. • 17 items • Updated Jul 10, 2025 • 207

DeepSeek-R1-ReDistill

Collection

Re-distilled DeepSeek R1 models • 4 items • Updated Jan 30, 2025 • 15

upvoted a paper over 1 year ago

DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought

Paper • 2412.17498 • Published Dec 23, 2024 • 22

upvoted an article over 1 year ago

Article

Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging

akjindal53244

•

Aug 19, 2024

• 79

peng

AI & ML interests

Recent Activity

Organizations

superpeng's activity

Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging