MF Yu

halyu

yuymf

AI & ML interests

RL

Organizations

None yet

Recent Activity

liked a model 6 days ago

DavidAU/Qwen3.6-40B-Claude-4.6-Opus-Deckard-Heretic-Uncensored-Thinking-NEO-CODE-Di-IMatrix-MAX-GGUF

Image-Text-to-Text • 39B • Updated 2 days ago • 207k • 99

liked a Space 6 days ago

AdithyaSK/rl-environments-guide

The ultimate guide to RL environments: building and scaling them in the LLM era

Building and scaling RL environments for LLM training

updated a collection 3 months ago

Games

4 items • Updated Feb 26

upvoted a paper 9 months ago

Controlling Multimodal LLMs via Reward-guided Decoding

Paper • 2508.11616 • Published Aug 15, 2025 • 7

liked a model 12 months ago

google-bert/bert-base-chinese

Fill-Mask • Updated Jul 3, 2025 • 1.1M • 1.42k

upvoted a paper about 1 year ago

JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse

Paper • 2503.16365 • Published Mar 20, 2025 • 41

liked a Space about 1 year ago

MVBench Leaderboard

Submit and view model evaluation results in a leaderboard format

liked a model about 1 year ago

sy1998/Video_XL

Updated Oct 25, 2024 • 18

updated a collection over 1 year ago

Text

5 items • Updated Feb 10, 2025

upvoted 2 papers over 1 year ago

SRMT: Shared Memory for Multi-agent Lifelong Pathfinding

Paper • 2501.13200 • Published Jan 22, 2025 • 70

Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos

Paper • 2501.13826 • Published Jan 23, 2025 • 24

updated 3 collections over 1 year ago

Text

5 items • Updated Feb 10, 2025

Videos

2 items • Updated Feb 10, 2025

Actions

4 items • Updated Feb 10, 2025

upvoted a paper over 1 year ago

GPS as a Control Signal for Image Generation

Paper • 2501.12390 • Published Jan 21, 2025 • 15

updated 2 collections over 1 year ago

Actions

4 items • Updated Feb 10, 2025

Img

3 items • Updated Jan 22, 2025

upvoted 2 papers over 1 year ago

Video Depth Anything: Consistent Depth Estimation for Super-Long Videos

Paper • 2501.12375 • Published Jan 21, 2025 • 23

3DIS-FLUX: simple and efficient multi-instance generation with DiT rendering

Paper • 2501.05131 • Published Jan 9, 2025 • 37