Angelo D'Ambrosio1, Francesco Baglivo2, Luigi De Angelis2
1European Centre for Disease Prevention and Control, Stockholm, Sweden.
We evaluated 40 large language models (LLMs) on a travel medicine quiz. Frontier models like OpenAI o3 showed high accuracy, confirming LLMs