10 15 1

beiqing

zhangBeiQing

ZhangBeiQing

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

Covering Human Action Space for Computer Use: Data Synthesis and Benchmark

liked a Space 7 months ago

Apollo-LMMs/TimeScope

commentedon a paper 7 months ago

StreamMem: Query-Agnostic KV Cache Memory for Streaming Video Understanding

View all activity

Organizations

None yet

upvoted a paper 2 days ago

Covering Human Action Space for Computer Use: Data Synthesis and Benchmark

Paper • 2605.12501 • Published 4 days ago • 13

liked a Space 7 months ago

TimeScope

💻

Visualize accuracy curves for video models

commented a paper 7 months ago

StreamMem: Query-Agnostic KV Cache Memory for Streaming Video Understanding

Paper • 2508.15717 • Published Aug 21, 2025 • 1 •

upvoted a paper 7 months ago

StreamMem: Query-Agnostic KV Cache Memory for Streaming Video Understanding

Paper • 2508.15717 • Published Aug 21, 2025 • 1

commented a paper 7 months ago

MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding

Paper • 2404.05726 • Published Apr 8, 2024 • 23 •

upvoted a paper 7 months ago

MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding

Paper • 2404.05726 • Published Apr 8, 2024 • 23

commented a paper 7 months ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 265 •

commented a paper 8 months ago

Rethinking Memory in AI: Taxonomy, Operations, Topics, and Future Directions

Paper • 2505.00675 • Published May 1, 2025 • 3 •

upvoted a paper 8 months ago

Rethinking Memory in AI: Taxonomy, Operations, Topics, and Future Directions

Paper • 2505.00675 • Published May 1, 2025 • 3

commented a paper 8 months ago

Memory Decoder: A Pretrained, Plug-and-Play Memory for Large Language Models

Paper • 2508.09874 • Published Aug 13, 2025 • 11 •

upvoted 2 papers 8 months ago

Memory Decoder: A Pretrained, Plug-and-Play Memory for Large Language Models

Paper • 2508.09874 • Published Aug 13, 2025 • 11

Memory-R1: Enhancing Large Language Model Agents to Manage and Utilize Memories via Reinforcement Learning

Paper • 2508.19828 • Published Aug 27, 2025 • 8

commented 2 papers 9 months ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7, 2025 • 189 •

Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models

Paper • 2506.06395 • Published Jun 5, 2025 • 135 •

upvoted 4 papers 9 months ago

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published May 30, 2025 • 146

Will It Still Be True Tomorrow? Multilingual Evergreen Question Classification to Improve Trustworthy QA

Paper • 2505.21115 • Published May 27, 2025 • 144

Test-Time Scaling with Reflective Generative Model

Paper • 2507.01951 • Published Jul 2, 2025 • 108

MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization

Paper • 2507.14683 • Published Jul 19, 2025 • 136

commented a paper 9 months ago

Agentic Reinforced Policy Optimization

Paper • 2507.19849 • Published Jul 26, 2025 • 161 •

upvoted a paper 9 months ago

Agentic Reinforced Policy Optimization

Paper • 2507.19849 • Published Jul 26, 2025 • 161

beiqing

AI & ML interests

Recent Activity

Organizations

zhangBeiQing's activity

TimeScope