Turbo Pascal

TurboPascal

AI & ML interests

None yet

Recent Activity

upvoted a collection 1 day ago

Marco-MoE

Collection

A suite of multilingual MoE models with highly sparse architectures • 5 items • Updated 15 days ago • 16

liked a model 16 days ago

AIDC-AI/Marco-Mini-Global-Base

Text Generation • 17B • Updated 20 days ago • 511 • 5

liked a model 20 days ago

AIDC-AI/Marco-Nano-Base

Text Generation • 8B • Updated 20 days ago • 572 • 12

upvoted a paper 22 days ago

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Paper • 2603.27538 • Published 25 days ago • 144

upvoted a paper 23 days ago

QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management

Paper • 2512.12967 • Published Dec 15, 2025 • 111

liked 4 datasets 28 days ago

upvoted 2 papers about 1 month ago

GLM-5: from Vibe Coding to Agentic Engineering

Paper • 2602.15763 • Published Feb 17 • 145

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Paper • 2601.22975 • Published Jan 30 • 111

liked a model about 1 month ago

google/gemma-3-27b-it

Image-Text-to-Text • 27B • Updated Mar 21, 2025 • 497k • 1.96k

New activity in Alibaba-NLP/new-impl about 2 months ago

torch.AcceleratorError: CUDA error: device-side assert triggered

#14 opened about 2 months ago by TurboPascal

liked a model 6 months ago

HuggingFaceTB/SmolVLM-256M-Instruct

Image-Text-to-Text • 0.3B • Updated Apr 8, 2025 • 462k • 352

upvoted an article 7 months ago

Training and Finetuning Reranker Models with Sentence Transformers

Article • Mar 26, 2025 • 192

liked a model 8 months ago

ByteDance-Seed/Seed-OSS-36B-Instruct

Text Generation • Updated Aug 26, 2025 • 16.2k • 495

upvoted a collection 8 months ago

BGE

Collection

31 items • Updated Feb 4 • 156

liked a dataset 8 months ago

HuggingFaceTB/smoltalk2

Viewer • Updated Oct 31, 2025 • 8.61M • 6.01k • 153

liked 2 models 8 months ago

Alibaba-NLP/WebDancer-32B

Text Generation • Updated Jun 26, 2025 • 75 • 57

zai-org/GLM-4.5V

Image-Text-to-Text • 108B • Updated Oct 25, 2025 • 53.1k • 717