YM Qin

Wakals

https://wakals.github.io/

AI & ML interests

Computer Vision, Vision-language Model, Generative Model

Organizations

None yet

Recent Activity

upvoted a paper 5 days ago

Render-of-Thought: Rendering Textual Chain-of-Thought as Images for Visual Latent Reasoning

Paper • 2601.14750 • Published Jan 21 • 18

upvoted a paper 23 days ago

Vision-as-Inverse-Graphics Agent via Interleaved Multimodal Reasoning

Paper • 2601.11109 • Published Jan 16 • 3

upvoted a collection about 1 month ago

Qwen3.5

Collection

21 items • Updated Mar 9 • 1.59k

liked a dataset about 1 month ago

VisGym/visgym_data

Preview • Updated Feb 5 • 4.65k • 17

liked a dataset about 2 months ago

VisGym/inference-dataset

Updated Jan 26 • 63 • 3

upvoted a paper about 2 months ago

Emu3.5: Native Multimodal Models are World Learners

Paper • 2510.26583 • Published Oct 30, 2025 • 115

upvoted a paper 2 months ago

dLLM: Simple Diffusion Language Modeling

Paper • 2602.22661 • Published Feb 26 • 153

liked a dataset 2 months ago

DietCoke4671/BlenderBench

Viewer • Updated Feb 28 • 27 • 438 • 29

upvoted a collection 3 months ago

MMFineReason

Collection

High-quality STEM reasoning dataset for Multimodal LLM post-training. • 8 items • Updated Mar 31 • 23

upvoted a paper 3 months ago

VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents

Paper • 2601.16973 • Published Jan 23 • 40

liked a dataset 4 months ago

DietCoke4671/ToolVQA

Preview • Updated Aug 16, 2025 • 627 • 5

upvoted 2 papers 4 months ago

DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI

Paper • 2512.16676 • Published Dec 18, 2025 • 222

COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence

Paper • 2512.04563 • Published Dec 4, 2025 • 16

updated 5 models 5 months ago

updated a dataset 5 months ago

Wakals/CoVT-Dataset

Viewer • Updated Dec 5, 2025 • 1.17M • 1.19k • 12

authored a paper 5 months ago

Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens

Paper • 2511.19418 • Published Nov 24, 2025 • 29
