A Practice of Post-Training on Llama-3 70B with Optimal Selection of Additional Language Mixture Ratio • Paper • 2409.06624 • Published Sep 10, 2024
Multi-Party Supervised Fine-tuning of Language Models for Multi-Party Dialogue Generation • Paper • 2412.05342 • Published Dec 6, 2024
Learn-to-learn on Arbitrary Textual Conditioning: A Hypernetwork-Driven Meta-Gated LLM • Paper • 2605.01973 • Published 8 days ago