Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Improving Translational Accuracy02:07

Improving Translational Accuracy

2.7K
2.7K
Observational Learning01:12

Observational Learning

305
Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...
305
Machines: Problem Solving II01:30

Machines: Problem Solving II

367
Machines are complex structures consisting of movable, pin-connected multi-force members that work together to transmit forces. Consider a lifting tong carrying a 100 kg load. It comprises movable sections DAF and CBG linked together with member AB.
367
Machines: Problem Solving I01:22

Machines: Problem Solving I

406
A toggle clamp is a mechanical device commonly used for holding and clamping objects in various applications, such as woodworking, metalworking, and assembly operations. Consider a toggle clamp subjected to a force of 200 N at the handle. The vertical clamping force can be calculated, provided the dimensions of the toggle clamp are known.
The toggle clamp system is a machine structure consisting of movable, pin-connected multi-force members that form a stabilized system to transmit forces. The...
406
Reasoning01:30

Reasoning

129
Reasoning is the action of thinking about something in a logical, sensible way. It is integral to problem-solving, decision-making, and critical thinking. Reasoning can be inductive or deductive. Reasoning involves transforming information into conclusions, which is essential for problem-solving, decision-making, and critical thinking.
Inductive reasoning involves deriving generalizations from specific observations. This type of reasoning helps form beliefs about the world. For example,...
129
Language and Cognition01:27

Language and Cognition

436
Language serves as a bridge between ideas and communication, influencing how individuals perceive and interact with the world. Psychologists have long debated whether language shapes thought or vice versa. This discussion gained grip with Edward Sapir and Benjamin Lee Whorf in the 1940s, who proposed that language determines thought, a concept known as linguistic determinism. They suggested that the vocabulary and structure of a language influence how its speakers think and perceive reality.
436

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Large language models perpetuate bias in palliative care: Development and analysis of the Palliative Care Adversarial Dataset (PCAD).

PLOS digital health·2026
Same author

Memorization in large language models in medicine prevalence characteristics and implications.

Nature communications·2026
Same author

The lifespan and healthspan extending effects of ellagic acid in <i>Caenorhabditis elegans</i> require an intact insulin/IGF-1 signaling pathway.

Frontiers in aging·2026
Same author

Development and Validation of a Multimodal Clinical, Pathologic, and Genomic Model for Breast Cancer Recurrence.

medRxiv : the preprint server for health sciences·2026
Same author

Histology-Derived Signatures Predict Recurrence Risk and Chemotherapy Benefit in Randomized Trials of Early Breast Cancer.

medRxiv : the preprint server for health sciences·2026
Same author

Intraoperative Optical Coherence Tomography Features of Epiretinal Human Amniotic Membrane Graft Under Different Tamponade Agents.

Journal of vitreoretinal diseases·2026
Same journal

Discoverability of Pediatric Ophthalmologists Among Online Resources.

JAMA ophthalmology·2026
Same journal

Translating AI Into the Eye Clinic-From Models to Clinical Workflow.

JAMA ophthalmology·2026
Same journal

Diagnostic Performance of Prespecified OCT Rules for Glaucomatous Optic Neuropathy in Nonpathologic Myopia.

JAMA ophthalmology·2026
Same journal

Painless Blindness With Conflicting Clues.

JAMA ophthalmology·2026
Same journal

Promise and Pitfalls of Artificial Intelligence Smart Glasses in Low Vision Care.

JAMA ophthalmology·2026
Same journal

Diabetic Retinal Disease Beginning in Childhood.

JAMA ophthalmology·2026
See all related articles

Related Experiment Video

Updated: Sep 8, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

680

Ophthalmological Question Answering and Reasoning Using OpenAI o1 vs Other Large Language Models.

Sahana Srinivasan1,2, Xuguang Ai3, Minjie Zou1

  • 1Centre for Innovation and Precision Eye Health, Department of Ophthalmology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore.

JAMA Ophthalmology
|July 31, 2025
PubMed
Summary
This summary is machine-generated.

OpenAI's o1 LLM shows high accuracy in ophthalmology but lags in text generation compared to GPT-4o and GPT-4. Expert reviews found o1 more useful and organized, suggesting potential for specialized LLMs.

More Related Videos

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems
05:47

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

565
Using Eye Movements Recorded in the Visual World Paradigm to Explore the Online Processing of Spoken Language
09:27

Using Eye Movements Recorded in the Visual World Paradigm to Explore the Online Processing of Spoken Language

Published on: October 13, 2018

10.1K

Related Experiment Videos

Last Updated: Sep 8, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

680
Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems
05:47

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

565
Using Eye Movements Recorded in the Visual World Paradigm to Explore the Online Processing of Spoken Language
09:27

Using Eye Movements Recorded in the Visual World Paradigm to Explore the Online Processing of Spoken Language

Published on: October 13, 2018

10.1K

Area of Science:

  • Artificial Intelligence
  • Ophthalmology
  • Medical Informatics

Background:

  • Large Language Models (LLMs) are increasingly evaluated for specialized medical applications.
  • OpenAI's o1 LLM, with dedicated reasoning capabilities, requires assessment in ophthalmology.
  • General LLM reasoning may not suffice for specialized medical domains, necessitating domain-specific models.

Purpose of the Study:

  • To evaluate the performance and reasoning abilities of OpenAI's o1 LLM against other leading LLMs using ophthalmology-specific questions.
  • To determine if o1's general reasoning capabilities meet the demands of specialized medical fields.
  • To inform the development and necessity of domain-specific LLMs in ophthalmology.

Main Methods:

  • Six LLMs (o1, GPT-4o, GPT-4, GPT-3.5, Llama 3-8B, Gemini 1.5 Pro) were tested on 6990 ophthalmology questions from the MedMCQA dataset.
  • Performance was measured by accuracy and macro F1 score.
  • Reasoning abilities were assessed using text-generation metrics (ROUGE-L, BERTScore, BARTScore, AlignScore, METEOR) and expert qualitative evaluation for usefulness and organization.

Main Results:

  • LLM o1 achieved the highest accuracy (0.877) and macro F1 score (0.877).
  • GPT-4o and GPT-4 outperformed o1 in BERTScore and AlignScore metrics.
  • o1 demonstrated superior performance in BARTScore and METEOR, and expert reviews rated its responses as more useful and organized than GPT-4o.

Conclusions:

  • OpenAI's o1 LLM exhibits strong performance in accuracy for ophthalmology questions but shows variability in text-generation quality.
  • While o1 offers promising clinical utility and organization, its performance suggests that domain-specialized LLMs may still be required for optimal ophthalmology applications.
  • Further targeted evaluations are recommended to fully understand LLM capabilities in specialized medical fields.