Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations? Paper • 2405.05904 • Published May 9, 2024 • 6
MetaFaith: Faithful Natural Language Uncertainty Expression in LLMs Paper • 2505.24858 • Published May 30, 2025 • 17
Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities Paper • 2507.06261 • Published Jul 7, 2025 • 67
SimpleQA Verified: A Reliable Factuality Benchmark to Measure Parametric Knowledge Paper • 2509.07968 • Published Sep 9, 2025 • 14
Empty Shelves or Lost Keys? Recall Is the Bottleneck for Parametric Factuality Paper • 2602.14080 • Published Feb 15 • 21
Hallucinations Undermine Trust; Metacognition is a Way Forward Paper • 2605.01428 • Published 17 days ago • 23
Hallucinations Undermine Trust; Metacognition is a Way Forward Paper • 2605.01428 • Published 17 days ago • 23
Narrowing the Knowledge Evaluation Gap: Open-Domain Question Answering with Multi-Granularity Answers Paper • 2401.04695 • Published Jan 9, 2024 • 13
Surfacing Biases in Large Language Models using Contrastive Input Decoding Paper • 2305.07378 • Published May 12, 2023 • 1