Evaluating Audio Reasoning with Big Bench Audio Article • mhillsmith, georgewritescode • Dec 20, 2024 • 29
MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications Paper • 2409.07314 • Published Sep 11, 2024 • 56