- Beyond Chinchilla-Optimal: Accounting for Inference in Language Model Scaling Laws
  Paper • 2401.00448 • Published • 30
- Improving Text Embeddings with Large Language Models
  Paper • 2401.00368 • Published • 82
- E^2-LLM: Efficient and Extreme Length Extension of Large Language Models
  Paper • 2401.06951 • Published • 26
- The Unreasonable Ineffectiveness of the Deeper Layers
  Paper • 2403.17887 • Published • 82
allthingsdisaggregated
lastweek
AI & ML interests
None yet
Recent Activity
upvoted a paper about 1 month ago
Attention Residuals
upvoted a paper about 1 month ago
MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild
upvoted a paper about 1 month ago
Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation
Organizations
None yet