YM Qin's picture

YM Qin

Wakals

·

https://wakals.github.io/

AI & ML interests

Computer Vision, Vision-language Model, Generative Model

Recent Activity

upvoted a paper 4 days ago

Render-of-Thought: Rendering Textual Chain-of-Thought as Images for Visual Latent Reasoning

upvoted a paper 23 days ago

Vision-as-Inverse-Graphics Agent via Interleaved Multimodal Reasoning

upvoted a collection about 1 month ago

View all activity

Organizations

None yet

Collections 1

Papers 1

arxiv:2511.19418

models 5

Wakals/CoVT-LLaVA-13B-depth

13B • Updated Dec 5, 2025 • 10 • 2

Wakals/CoVT-7B-seg

8B • Updated Dec 5, 2025 • 14 • 1

Wakals/CoVT-7B-depth

8B • Updated Dec 5, 2025 • 6 • 2

Wakals/CoVT-7B-seg_depth_dino_edge

8B • Updated Dec 5, 2025 • 122 • 2

Wakals/CoVT-7B-seg_depth_dino

8B • Updated Dec 5, 2025 • 1.77k • 2

datasets 1

Wakals/CoVT-Dataset

Viewer • Updated Dec 5, 2025 • 1.17M • 1.19k • 12