Promoting Clinical Decision Support: Evaluating the Medical Reasoning Capabilities of OpenAI's o1-Preview Model
The assessment of large language models (LLMs) on medical tasks has traditionally relied on multiple-choice question benchmarks. However, these benchmarks are limited in ...