AI & ML interests
Audio, Music, and AI
Organization Card
Audio, Music, and AI Lab (AMAAI)
The Audio, Music, and AI lab at Singapore University of Technology and Design focuses on cutting-edge innovations in multimodal AI, more specifically: Audio and Music AI.
More info and publications here.
Popular software:
- SonicMaster: all-in-one music restoration and mastering: code - examples - live demo
- Jam 0.5: text-to-song: code - examples- Dataset in collaboration with Declare lab
- SonicVerse: time-aware music captioning: code - live demo
- Music2Emo: emotion detection from music: code - live demo
- Mustango: text-to-music generation: code - live demo
- Video2Music: video-to-music generation: code
- Text2midi: text-to-midi generation: code
- nnAudio: on-the-fly spectrogram extraction: code
Popular Datasets:
- JamendoMaxCaps: text captions with instrumental music audio
- MusicBench: text captions with music audio
- MidiCaps: text captions with music midi (large-scale)
- SonicMaster: music with mastered / enhanced version and enhancement caption
spaces 7
Running on Zero
Agents
30
SonicMaster – Text-Guided Music Restoration & Mastering
🎧
Enhance audio quality using text prompts
Sleeping
Agents
2
MineROI Net
📊
Smart Timing for Mining: A Deep Learning Framework for Bitco
Runtime error
Agents
13
SonicVerse
🖼
Generate detailed music descriptions from audio clips
Running on Zero
Agents
18
Music2emo
📊
Towards Unified Music Emotion Recognition across Dimensional
Runtime error
Agents
21
Video2music
📚
Generate music for a video based on its content and key
models 9
amaai-lab/MineROI-Net
Updated
amaai-lab/SonicMaster
Audio-to-Audio • 0.9B • Updated • 19
amaai-lab/SonicVerse
Audio-Text-to-Text • Updated • 198 • 20
amaai-lab/MelodySim
Updated • 1
amaai-lab/music2emo
Updated • 9
amaai-lab/text2midi
Updated • 18
amaai-lab/DisfluencySpeech_BenchmarkC
Text-to-Speech • Updated • 3
amaai-lab/DisfluencySpeech_BenchmarkB
Text-to-Speech • Updated
amaai-lab/DisfluencySpeech_BenchmarkA
Text-to-Speech • Updated • 4
datasets 6
amaai-lab/SonicMasterDataset
Viewer • Updated • 166k • 1.1k • 12
amaai-lab/JamendoMaxCaps
Viewer • Updated • 344k • 1.44k • 23
amaai-lab/melodySim
Viewer • Updated • 192k • 213 • 4
amaai-lab/MusicBench
Viewer • Updated • 53.6k • 713 • 56
amaai-lab/MidiCaps
Viewer • Updated • 168k • 473 • 50
amaai-lab/DisfluencySpeech
Viewer • Updated • 5k • 196 • 20