CCI4.0: A Bilingual Pretraining Dataset for Enhancing Reasoning in Large Language Models Paper • 2506.07463 • Published Jun 9, 2025 • 12
view article Article Train 400x faster Static Embedding Models with Sentence Transformers Jan 15, 2025 • 228