R
Romanoffalex
AI & ML interests
None yet
Recent Activity
liked a model 1 day ago
ai-sage/GigaChat-20B-A3B-base updated a collection 8 days ago
Best small models liked a model 11 days ago
huihui-ai/Huihui4-8B-A4B-v2Organizations
None yet
CV models
-
Qwen/Qwen2.5-VL-32B-Instruct
Image-Text-to-Text β’ 33B β’ Updated β’ 107k β’ 482 -
baidu/ERNIE-4.5-VL-28B-A3B-Base-PT
Image-Text-to-Text β’ 29B β’ Updated β’ 109 β’ 38 -
facebook/dinov3-vit7b16-pretrain-lvd1689m
Image Feature Extraction β’ 7B β’ Updated β’ 13.7k β’ 226 -
baidu/ERNIE-4.5-VL-28B-A3B-Thinking
Image-Text-to-Text β’ 30B β’ Updated β’ 2.07k β’ 537
Upscalers
video gan
Graphic gan
- Running on ZeroAgentsFeatured942
OminiControl
π942Generate custom images from a reference photo and text
- Running on ZeroAgentsFeatured2.08k
PuLID-FLUX
π€2.08kGenerate custom images from text and a reference photo
- RunningAgents661
PR Puppet Sora
π661Generate AI videos from text prompts
-
genmo/mochi-1-preview
Text-to-Video β’ Updated β’ 9.2k β’ β’ 1.32k
llm ru
- PausedAgents47
Saiga 13b Q4_1 llama.cpp Retrieval QA
π47Upload files and ask questions based on their content
-
Deci/DeciLM-7B
Text Generation β’ 7B β’ Updated β’ 545 β’ 226 - Running92
Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks
π92Evaluate multilingual models using FineTasks
Codex model
Audio
-
stabilityai/stable-audio-open-1.0
Text-to-Audio β’ Updated β’ 22.2k β’ 1.46k -
laion/emonet-face-binary
Preview β’ Updated β’ 50 β’ 3 -
laion/emonet-face-hq
Viewer β’ Updated β’ 2.5k β’ 97 β’ 2 - Paused240
Omnilingual ASR Media Transcription
π240Transcribe audio/video files into text instantly
3d
dataset
Llms alfa test
- Running on ZeroAgents1.13k
OOTDiffusion
π₯Ό1.13kHigh-quality virtual try-on ~ Your cyber fitting room
-
stabilityai/stable-diffusion-3-medium
Text-to-Image β’ Updated β’ 3.88k β’ β’ 4.95k -
rain1011/pyramid-flow-sd3
Text-to-Video β’ Updated β’ 839 -
nvidia/Llama-3_3-Nemotron-Super-49B-v1_5
Text Generation β’ 50B β’ Updated β’ 31.6k β’ 233
Best small models
Codex model
CV models
-
Qwen/Qwen2.5-VL-32B-Instruct
Image-Text-to-Text β’ 33B β’ Updated β’ 107k β’ 482 -
baidu/ERNIE-4.5-VL-28B-A3B-Base-PT
Image-Text-to-Text β’ 29B β’ Updated β’ 109 β’ 38 -
facebook/dinov3-vit7b16-pretrain-lvd1689m
Image Feature Extraction β’ 7B β’ Updated β’ 13.7k β’ 226 -
baidu/ERNIE-4.5-VL-28B-A3B-Thinking
Image-Text-to-Text β’ 30B β’ Updated β’ 2.07k β’ 537
Audio
-
stabilityai/stable-audio-open-1.0
Text-to-Audio β’ Updated β’ 22.2k β’ 1.46k -
laion/emonet-face-binary
Preview β’ Updated β’ 50 β’ 3 -
laion/emonet-face-hq
Viewer β’ Updated β’ 2.5k β’ 97 β’ 2 - Paused240
Omnilingual ASR Media Transcription
π240Transcribe audio/video files into text instantly
Upscalers
3d
video gan
dataset
Graphic gan
- Running on ZeroAgentsFeatured942
OminiControl
π942Generate custom images from a reference photo and text
- Running on ZeroAgentsFeatured2.08k
PuLID-FLUX
π€2.08kGenerate custom images from text and a reference photo
- RunningAgents661
PR Puppet Sora
π661Generate AI videos from text prompts
-
genmo/mochi-1-preview
Text-to-Video β’ Updated β’ 9.2k β’ β’ 1.32k
Llms alfa test
- Running on ZeroAgents1.13k
OOTDiffusion
π₯Ό1.13kHigh-quality virtual try-on ~ Your cyber fitting room
-
stabilityai/stable-diffusion-3-medium
Text-to-Image β’ Updated β’ 3.88k β’ β’ 4.95k -
rain1011/pyramid-flow-sd3
Text-to-Video β’ Updated β’ 839 -
nvidia/Llama-3_3-Nemotron-Super-49B-v1_5
Text Generation β’ 50B β’ Updated β’ 31.6k β’ 233
llm ru
- PausedAgents47
Saiga 13b Q4_1 llama.cpp Retrieval QA
π47Upload files and ask questions based on their content
-
Deci/DeciLM-7B
Text Generation β’ 7B β’ Updated β’ 545 β’ 226 - Running92
Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks
π92Evaluate multilingual models using FineTasks