Updated: Sep 8, 2025

Ophthalmological Question Answering and Reasoning Using OpenAI o1 vs Other Large Language Models.

Sahana Srinivasan¹,², Xuguang Ai³, Minjie Zou¹

¹Centre for Innovation and Precision Eye Health, Department of Ophthalmology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore.

JAMA Ophthalmology | July 31, 2025

Summary

This summary is machine-generated.

OpenAI's o1 LLM answers ophthalmology questions with high accuracy but trails GPT-4o and GPT-4 on some text-generation metrics. Expert reviewers rated o1's responses as more useful and better organized than GPT-4o's, yet the mixed results suggest that domain-specialized LLMs may still be needed.

Area of Science:

Artificial Intelligence
Ophthalmology
Medical Informatics

Background:

Large Language Models (LLMs) are increasingly evaluated for specialized medical applications.
OpenAI's o1 LLM, with dedicated reasoning capabilities, requires assessment in ophthalmology.
General LLM reasoning may not suffice for specialized medical domains, necessitating domain-specific models.

Purpose of the Study:

To evaluate the performance and reasoning abilities of OpenAI's o1 LLM against other leading LLMs using ophthalmology-specific questions.
To determine if o1's general reasoning capabilities meet the demands of specialized medical fields.
To inform the development and necessity of domain-specific LLMs in ophthalmology.

Main Methods:

Six LLMs (o1, GPT-4o, GPT-4, GPT-3.5, Llama 3-8B, Gemini 1.5 Pro) were tested on 6990 ophthalmology questions from the MedMCQA dataset.
Performance was measured by accuracy and macro F1 score.
Reasoning abilities were assessed using text-generation metrics (ROUGE-L, BERTScore, BARTScore, AlignScore, METEOR) and expert qualitative evaluation for usefulness and organization.
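The two performance measures named above, accuracy and macro F1, can be illustrated with a minimal sketch. The answer labels A to D, the toy `gold` and `pred` lists, and the function names are assumptions made for illustration; they are not taken from the study, whose scoring code is not described here.

```python
def accuracy(gold, pred):
    """Fraction of questions where the predicted option equals the gold option."""
    if len(gold) != len(pred):
        raise ValueError("gold and pred must have the same length")
    return sum(g == p for g, p in zip(gold, pred)) / len(gold)


def macro_f1(gold, pred, labels=("A", "B", "C", "D")):
    """Unweighted mean of per-option F1 scores (each option counts equally)."""
    f1_scores = []
    for label in labels:
        pairs = list(zip(gold, pred))
        tp = sum(g == label and p == label for g, p in pairs)  # true positives
        fp = sum(g != label and p == label for g, p in pairs)  # false positives
        fn = sum(g == label and p != label for g, p in pairs)  # false negatives
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        if precision + recall:
            f1_scores.append(2 * precision * recall / (precision + recall))
        else:
            f1_scores.append(0.0)
    return sum(f1_scores) / len(f1_scores)


# Hypothetical toy data: six questions, four answer options.
gold = ["A", "B", "C", "D", "A", "B"]
pred = ["A", "B", "C", "A", "A", "C"]

print(f"accuracy = {accuracy(gold, pred):.3f}")
print(f"macro F1 = {macro_f1(gold, pred):.3f}")
```

Because macro F1 averages over options rather than over questions, an option the model never predicts correctly (D in the toy data) pulls the score down even when overall accuracy looks reasonable.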

Main Results:

o1 achieved the highest accuracy (0.877) and macro F1 score (0.877).
GPT-4o and GPT-4 outperformed o1 on BERTScore and AlignScore.
o1 demonstrated superior performance on BARTScore and METEOR, and expert reviewers rated its responses as more useful and better organized than those of GPT-4o.
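Of the text-generation metrics listed in the methods, ROUGE-L is simple enough to sketch directly: it scores a generated explanation by the longest common subsequence (LCS) of words it shares with a reference. The sketch below assumes lowercase whitespace tokenization and a balanced F-measure; the study's exact ROUGE-L configuration is not stated here, and the example sentences are hypothetical.

```python
def lcs_length(a, b):
    """Length of the longest common subsequence of two token lists."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            if x == y:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def rouge_l_f1(reference, candidate):
    """ROUGE-L as the balanced F-measure of LCS-based precision and recall."""
    ref = reference.lower().split()
    cand = candidate.lower().split()
    if not ref or not cand:
        return 0.0
    lcs = lcs_length(ref, cand)
    if lcs == 0:
        return 0.0
    precision = lcs / len(cand)  # share of candidate words on the LCS
    recall = lcs / len(ref)      # share of reference words on the LCS
    return 2 * precision * recall / (precision + recall)


# Hypothetical reference explanation and model output.
reference = "the retina detaches from the underlying tissue"
candidate = "the retina separates from the tissue"

print(f"ROUGE-L F1 = {rouge_l_f1(reference, candidate):.3f}")
```

A metric of this kind rewards word overlap in order, not clinical correctness, which is one reason rankings can differ between overlap-based scores and expert ratings.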

Conclusions:

OpenAI's o1 LLM answers ophthalmology questions with high accuracy, but its standing varies across text-generation metrics.
While o1's responses are well organized and show promising clinical usefulness, this mixed performance suggests that domain-specialized LLMs may still be required for optimal ophthalmology applications.
Further targeted evaluations are recommended to fully understand LLM capabilities in specialized medical fields.