Ha-Yeong Choi

Ha0

https://scholar.google.com/citations?user=Jw3X6UgAAAAJ&hl=ko

hayeong0

AI & ML interests

Speech Synthesis, Voice Conversion, Generative Models

Organizations

None yet

Recent Activity

upvoted a paper 6 days ago

Context Unrolling in Omni Models

Paper • 2604.21921 • Published 8 days ago • 12

upvoted a paper 8 days ago

LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model

Paper • 2604.20796 • Published 9 days ago • 237

upvoted a paper 10 days ago

Extending One-Step Image Generation from Class Labels to Text via Discriminative Text Representation

Paper • 2604.18168 • Published 11 days ago • 97

upvoted a paper 11 days ago

Qwen3.5-Omni Technical Report

Paper • 2604.15804 • Published 14 days ago • 56

liked a dataset 13 days ago

walledai/AdvBench

Viewer • Updated Jul 4, 2024 • 520 • 10.7k • 98

upvoted 2 papers 16 days ago

OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation

Paper • 2604.11804 • Published 18 days ago • 70

Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation

Paper • 2604.10098 • Published 20 days ago • 77

upvoted 2 papers 21 days ago

DMax: Aggressive Parallel Decoding for dLLMs

Paper • 2604.08302 • Published 22 days ago • 51

HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents

Paper • 2604.07430 • Published 23 days ago • 187

upvoted a paper 24 days ago

TriAttention: Efficient Long Reasoning with Trigonometric KV Compression

Paper • 2604.04921 • Published 25 days ago • 111

upvoted a paper about 1 month ago

ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling

Paper • 2603.25746 • Published Mar 26 • 155

upvoted a paper about 2 months ago

EVATok: Adaptive Length Video Tokenization for Efficient Visual Autoregressive Generation

Paper • 2603.12267 • Published Mar 12 • 13

liked a model about 2 months ago

Qwen/Qwen3.5-0.8B

Image-Text-to-Text • 0.9B • Updated Mar 2 • 3.06M • 512

upvoted a paper 3 months ago

ERNIE 5.0 Technical Report

Paper • 2602.04705 • Published Feb 4 • 268

upvoted an article 4 months ago

Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR

Article • Jan 5

upvoted 2 papers 4 months ago

Token-Level LLM Collaboration via FusionRoute

Paper • 2601.05106 • Published Jan 8 • 40

DreamOmni3: Scribble-based Editing and Generation

Paper • 2512.22525 • Published Dec 27, 2025 • 15

upvoted 2 papers 5 months ago

StereoWorld: Geometry-Aware Monocular-to-Stereo Video Generation

Paper • 2512.09363 • Published Dec 10, 2025 • 74

Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length

Paper • 2512.04677 • Published Dec 4, 2025 • 177

liked a dataset 5 months ago

yenopoya/thousand-voices-trauma

Updated Oct 24, 2025 • 16 • 4
