2 8 13

An Luo

lainmn

https://anluo1.github.io/

AI & ML interests

Agentic AI

Recent Activity

authored a paper 15 days ago

Ice Cream Doesn't Cause Drowning: Benchmarking LLMs Against Statistical Pitfalls in Causal Inference

upvoted a paper 15 days ago

Ice Cream Doesn't Cause Drowning: Benchmarking LLMs Against Statistical Pitfalls in Causal Inference

liked a model 18 days ago

deepseek-ai/DeepSeek-V4-Pro

View all activity

Organizations

None yet

upvoted a paper 15 days ago

Ice Cream Doesn't Cause Drowning: Benchmarking LLMs Against Statistical Pitfalls in Causal Inference

Paper • 2505.13770 • Published Mar 4 • 1

upvoted a paper 28 days ago

ADD for Multi-Bit Image Watermarking

Paper • 2604.11491 • Published 30 days ago • 3

upvoted a paper about 2 months ago

AgentDS Technical Report: Benchmarking the Future of Human-AI Collaboration in Domain-Specific Data Science

Paper • 2603.19005 • Published Mar 19 • 6

upvoted a paper 5 months ago

Can Agentic AI Match the Performance of Human Data Scientists?

Paper • 2512.20959 • Published Dec 24, 2025 • 1

upvoted 2 articles 7 months ago

Article

Jupyter Agents: training LLMs to reason with notebooks

baptistecolle, hannayukhymenko, lvwerra

•

Sep 10, 2025

• 64

Article

DABStep: Data Agent Benchmark for Multi-step Reasoning

eggie5, martinigoyanes, frisokingma, andreumora, lvwerra, thomwolf, m-ric

•

Feb 4, 2025

• 128

upvoted 2 papers 7 months ago

An Outlook on the Opportunities and Challenges of Multi-Agent AI Systems

Paper • 2505.18397 • Published May 23, 2025 • 1

AssistedDS: Benchmarking How External Domain Knowledge Assists LLMs in Automated Data Science

Paper • 2506.13992 • Published May 25, 2025 • 1

An Luo

AI & ML interests

Recent Activity

Organizations

lainmn's activity

Jupyter Agents: training LLMs to reason with notebooks

DABStep: Data Agent Benchmark for Multi-step Reasoning