Marco-MoE Collection A suit of multilingual MoE models with highly-sparse architectures • 5 items • Updated 15 days ago • 16
LongCat-Next: Lexicalizing Modalities as Discrete Tokens Paper • 2603.27538 • Published 25 days ago • 144
QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management Paper • 2512.12967 • Published Dec 15, 2025 • 111
Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text Paper • 2601.22975 • Published Jan 30 • 111
view article Article Training and Finetuning Reranker Models with Sentence Transformers Mar 26, 2025 • 192