FORGE:Fine-grained Multimodal Evaluation for Manufacturing Scenarios Paper • 2604.07413 • Published 14 days ago • 94
PulseLM: A Foundation Dataset and Benchmark for PPG-Text Learning Paper • 2603.03331 • Published Feb 10 • 2
MemoryArena: Benchmarking Agent Memory in Interdependent Multi-Session Agentic Tasks Paper • 2602.16313 • Published Feb 18 • 3