view article Article ColPali: Efficient Document Retrieval with Vision Language Models 👀 manu • Jul 5, 2024 • 317
view article Article Accelerating LLM Inference: Fast Sampling with Gumbel-Max Trick cxdu • Oct 24, 2024 • 14
view article Article Faster Text Generation with Self-Speculative Decoding +2 ariG23498, melhoushi, pcuenq, reach-vb • Nov 20, 2024 • 65
view article Article Assisted Generation: a new direction toward low-latency text generation joaogante • May 11, 2023 • 78