Enrich VLMs’ vision-centric reasoning capabilities via Chain-of-Visual-Thought!
YM Qin
Wakals
AI & ML interests
Computer Vision, Vision-language Model, Generative Model
Recent Activity
upvoted a paper 23 days ago
Vision-as-Inverse-Graphics Agent via Interleaved Multimodal Reasoning upvoted a collection about 1 month ago
Qwen3.5Organizations
None yet