Article DeepSeek-V4: a million-token context that agents can actually use 4 days ago • 38
Article Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers 12 days ago • 65
DR^{3}-Eval: Towards Realistic and Reproducible Deep Research Evaluation Paper • 2604.14683 • Published 12 days ago • 35
ClawEnvKit: Automatic Environment Generation for Claw-Like Agents Paper • 2604.18543 • Published 8 days ago • 27
GLiNER-PII Collection PII detection models developed in collaboration with Wordcab • 5 items • Updated Jan 29 • 23
Toward Autonomous Long-Horizon Engineering for ML Research Paper • 2604.13018 • Published 14 days ago • 34
Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe Paper • 2604.13016 • Published 14 days ago • 87
KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance Paper • 2604.12627 • Published 14 days ago • 99
AgentSwing: Adaptive Parallel Context Management Routing for Long-Horizon Web Agents Paper • 2603.27490 • Published about 1 month ago • 18
VisionFoundry: Teaching VLMs Visual Perception with Synthetic Images Paper • 2604.09531 • Published 18 days ago • 8
ECHO: Efficient Chest X-ray Report Generation with One-step Block Diffusion Paper • 2604.09450 • Published 18 days ago • 22
Matrix-Game 3.0: Real-Time and Streaming Interactive World Model with Long-Horizon Memory Paper • 2604.08995 • Published 18 days ago • 48
Small Vision-Language Models are Smart Compressors for Long Video Understanding Paper • 2604.08120 • Published 19 days ago • 20
Article Multimodal Embedding & Reranker Models with Sentence Transformers 19 days ago • 51
Article How we OCR'ed 30,000 papers using Codex, open OCR models and Jobs 21 days ago • 59
CORAL: Towards Autonomous Multi-Agent Evolution for Open-Ended Discovery Paper • 2604.01658 • Published 26 days ago • 55