Benjamim Alves Nepomuceno Neto
- RunningAgentsFeatured2.08k
Wan2.1
💻2.08kWan: Open and Advanced Large-Scale Video Generative Models
- Running on ZeroMCPFeatured1.61k
Wan2.1 Fast
🎥1.61kGenerate a video from an image with a prompt
- Runtime errorAgentsFeatured72
NAG Wan2-1-fast
🏢72Demo of Normalized Attention Guidance for 4 steps Wan2.1
- Running on ZeroMCPFeatured322
Self Forcing Wan 2.1
🎥322Real-time video generation
- RunningAgents43
Mediapipe Face Mesh 3d
👀43create 3d-gltf face-mesh from image with mediapipe
- SleepingAgents7
Mediapipe Head Pose Estimation
👁72 head pose estimation with mediapipe and trained-model
- RunningAgents10
Mediapipe 68 Points Facial Mask
⚡10create facial masks from 68 points landmark
- Running on ZeroAgentsFeatured1.1k
InfiniteYou-FLUX
📸1.1kFlexible Photo Recrafting While Preserving Your Identity
- Running on ZeroAgentsFeatured231
MatAnyone
🤡231Gradio demo for MatAnyone 1 & 2
- Running on ZeroAgentsFeatured613
Video Background Removal
📽613Remove/Change background of video.
- Running on ZeroAgentsFeatured112
SAM3 Video Segmentation
🐠112Track and label objects in videos using text prompts or clicks
- Running on ZeroAgents17
VideoMaMa
⚡17Remove video backgrounds and generate matte videos
- Build errorAgents117
Dpt Depth Estimation + 3D Voxels
🧊117Create 3D models from images using depth estimation
- Running on ZeroAgents3.26k
Hunyuan3D-2.0
🌍3.26kText-to-3D and Image-to-3D Generation
- Running on ZeroAgentsFeatured4.78k
TRELLIS
🏢4.78kScalable and Versatile 3D Generation from images
- Running on ZeroAgentsFeatured224
Video Depth Anything
👀224Generate depth video from input video
- PausedFeatured179
Manimator
👀179Transform research papers and mathematical concepts into stu
- PausedAgentsFeatured179
Gaze Demo
👀179Gaze detection using Moondream
- RunningAgents11
Metropolitan Museum
🎨11The Metropolitan Museum of Art Collection
- Running on T4Featured122
CountGD_Multi-Modal_Open-World_Counting
🚀122Count objects in images using text and example boxes
- Running on ZeroAgentsFeatured578
Midi Music Generator
🎼578Generate MIDI music from prompts
- PausedAgentsFeatured202
YuE
👩202Generate music from lyrics and genre tags
- PausedAgents51
Open SUNO
👩51Your Lyrics into Complete Songs with Vocals in Multilingual
- Running on ZeroAgentsFeatured687
Di♪♪Rhythm
🎶687Blazingly Fast and Embarrassingly Simple Song Generation
- Running on ZeroAgentsFeatured259
SD3 Long Captioner
🏃259Generate detailed captions for any image
- Runtime errorAgentsFeatured111
ChartGemma
🐨111Generate insights from charts using text prompts
- Running on ZeroAgents90
AuraFlow-v0.3 with Captioner
🖼90Generate images from prompts or images
-
openai/clip-vit-large-patch14
Zero-Shot Image Classification • 0.4B • Updated • 21M • 2k
- Runtime errorAgentsFeatured462
Omni-Zero
🧛462Restylize & repose person ID
- Running on ZeroAgents1.21k
PhotoMaker V2
📷1.21kGenerate personalized realistic portraits from your photos
- Runtime errorAgentsFeatured640
FLUX.1 [Inpainting]
🎨640
- Running on L40SFeatured1.64k
Expression Editor
🐨1.64kQuickly edit the expression of a face
- Running on ZeroAgentsFeatured1.05k
ToonCrafter
😻1.05kGenerate animated video between two cartoon images
- RunningAgentsFeatured84
NaRCan
💊84Edit your video with text prompts and style control
- PausedAgents3.74k
Live Portrait
🤪3.74kApply the motion of a video on a portrait
- Running on T4Featured20
FloodDiffusion Streaming Demo
🌊20Streaming Motion Generation (CVPR 2026 Highlight)
- Running on ZeroAgentsFeatured948
MMAudio — generating synchronized audio from video/text
🔊948Generate synchronized audio from video or text prompts
- Running on ZeroAgents326
TangoFlux
🚀326Text to Audio (Sound SFX) Generator
- Running on ZeroAgents465
Stable Audio Open Zero
🔥465Generate immersive audio from text prompts
- PausedAgentsFeatured202
YuE
👩202Generate music from lyrics and genre tags
- Running on ZeroAgents2.63k
Voice Clone
🗣2.63kGenerate speech in a cloned voice from reference audio
- PausedAgents848
Ilaria RVC
😻848Convert and separate audio using models and TTS
- Running on ZeroAgentsFeatured921
Screenshot to HTML
⚡921Convert screenshot to HTML code and preview
- Running on ZeroMCPFeatured39
NeuTTS-Nano Multilingual Collection
🌍39Generate speech with voice cloning, now in four languages!
- RunningAgents378
PDF Chatbot
🌍378Ask questions about PDFs using a chatbot
- Running on ZeroAgentsFeatured367
Video Transcription Smart Summary
⚡367Transcribe videos and generate concise summaries
- RunningAgents138
Quantized Retrieval
🔍138Efficient quantized retrieval over Wikipedia
- RunningFeatured1.33k
FineWeb: decanting the web for the finest text data at scale
🍷1.33kExplore and download the FineWeb web‑text dataset
- RunningAgents40
Anime Image Classification
📚40Analyze anime images for various attributes
- Running on ZeroAgentsFeatured171
PaintsUndo
🎨171Create videos from a single image using AI‑generated key frames
- PausedAgents160
Kolors IP-Adapter
🖼160Create images from text and reference photos
- Running on ZeroAgentsFeatured2.08k
PuLID-FLUX
🤗2.08kGenerate custom images from text and a reference photo
- Runtime errorAgentsFeatured93
Panoptic Segment Anything
🖼93
- Runtime errorAgentsFeatured396
Grounded Segment Anything
📚396
- Running on ZeroAgents200
Inspyrenet Remove Background
🏢200Remove background from images or extract a mask
- Runtime errorAgentsFeatured515
Florence2 + SAM2
🔥515Segment and caption objects in images and videos
- Runtime errorAgentsFeatured114
BigVGAN
🔊114Generate high‑quality audio from your input file with BigVGAN
- RunningAgents24
Audio Emotion Recognition
🎼24Detect emotions from audio recordings
- RunningAgentsFeatured61
SoundwaveDemo
📉61Process audio and generate text output based on instructions
- RunningAgentsFeatured71
DiffVox
🦀71Enhance vocals with professional effects using sliders
- Running on ZeroAgentsFeatured989
Tile Upscaler
🚀989Enhance and upscale images with HDR and AI details
- Runtime errorAgentsFeatured192
SeemoRe
💻192Enhance image details with super-resolution
- Running on ZeroAgents1.68k
Flux.1-dev Upscaler
🔎1.68kUpscale low‑resolution images to higher resolution
-
MIT/ast-finetuned-audioset-10-10-0.4593
Audio Classification • 86.6M • Updated • 428k • 352
- Running on ZeroAgents314
Llasa 3b Tts
🔥314Zero Shot voice cloning with llasa 3b (Unofficial Demo)
- PausedAgentsFeatured202
YuE
👩202Generate music from lyrics and genre tags
- Running on ZeroAgentsFeatured413
Zonos
🌍413Generate expressive speech audio from text with custom voice