VibeVoice Collection • Frontier Text-to-Speech Models • https://microsoft.github.io/VibeVoice/ • 8 items • Updated Mar 2 • 244
faster-qwen3-tts 🎙 • Running on A10G • Featured • 222 • Generate natural speech from text using custom or cloned voices
Parakeet-TDT-0.6b-V2 • Running on T4 • Agents • Featured • 469 • Transcribe audio files with timestamps and download transcripts
The Smol Training Playbook 📚 • Running on CPU Upgrade • Featured • 3.16k • The secrets to building world-class LLMs
Parakeet STT Progressive Transcription 🎤 • Running • Featured • 91 • Transcribe speech to text instantly with WebGPU acceleration
openai/whisper-large-v3-turbo Automatic Speech Recognition • 0.8B • Updated Oct 4, 2024 • 6.84M • 3k
SDEdit: Guided Image Synthesis and Editing with Stochastic Differential Equations Paper • 2108.01073 • Published Aug 2, 2021 • 9
Qwen3-ASR Demo 🎙 • Runtime error • Agents • Featured • 136 • Transcribe audio to text with multi-language timestamps
Omni Image Editor 🖼 • Running on CPU Upgrade • Agents • 1.66k • Image edit, text to image, image upscale, remove watermark
Qwen3-TTS Demo 🎙 • Runtime error • Agents • Featured • 1.92k • Generate speech audio from text with custom or cloned voices