Open to Work

2 6 17

Vladislav Grigorian

sleepygargoyle

AI & ML interests

None yet

Recent Activity

liked a Space 25 days ago

nanotron/ultrascale-playbook

liked a model about 1 month ago

amd/Nitro-E

upvoted an article 3 months ago

ChatML vs Harmony: Understanding the new Format from OpenAI 🔍

View all activity

Organizations

liked a Space 25 days ago

The Ultra-Scale Playbook

🌌

3.85k

The ultimate guide to training LLM on large GPU Clusters

liked a model about 1 month ago

amd/Nitro-E

Text-to-Image • Updated Nov 3, 2025 • 631 • 99

upvoted an article 3 months ago

Article

ChatML vs Harmony: Understanding the new Format from OpenAI 🔍

kuotient

•

Aug 9, 2025

• 57

updated a dataset 3 months ago

sleepygargoyle/gen_math_md

Viewer • Updated Feb 19 • 3.53k • 5

published a dataset 3 months ago

sleepygargoyle/gen_math_md

Viewer • Updated Feb 19 • 3.53k • 5

upvoted an article 3 months ago

Article

How to generate text: using different decoding methods for language generation with Transformers

patrickvonplaten

•

Mar 1, 2020

• 297

upvoted a paper 3 months ago

Length-Unbiased Sequence Policy Optimization: Revealing and Controlling Response Length Variation in RLVR

Paper • 2602.05261 • Published Feb 5 • 52

upvoted an article 4 months ago

Article

📐 Muon Optimizer: The Power of Collective Momentum

onekq

•

Nov 14, 2025

• 6

updated a model 4 months ago

NM-development/madlad400-3b-mt-ce-v0

Translation • 3B • Updated Jan 19 • 29

updated a Space 4 months ago

README

💻

updated a dataset 4 months ago

NM-development/smol_ce

Preview • Updated Jan 16 • 23

liked a model 4 months ago

google/translategemma-27b-it

Image-Text-to-Text • Updated Jan 28 • 9.7k • 371

published a dataset 4 months ago

NM-development/smol_ce

Preview • Updated Jan 16 • 23

liked a dataset 4 months ago

Agisight/google-smol-en-ru

Viewer • Updated Nov 30, 2025 • 1.82k • 28 • 3

published a model 4 months ago

NM-development/madlad400-3b-mt-ce-v0

Translation • 3B • Updated Jan 19 • 29

liked a model 4 months ago

tencent/HY-MT1.5-1.8B

Translation • Updated Jan 1 • 31.1k • 1.17k

liked a model 5 months ago

tencent/WeDLM-8B-Instruct

Text Generation • 8B • Updated Jan 1 • 672 • 312

liked 3 datasets 5 months ago

Vladislav Grigorian

AI & ML interests

Recent Activity

Organizations

sleepygargoyle's activity

The Ultra-Scale Playbook

ChatML vs Harmony: Understanding the new Format from OpenAI 🔍

How to generate text: using different decoding methods for language generation with Transformers

📐 Muon Optimizer: The Power of Collective Momentum

README