Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Bioequivalence Experimental Study Designs: Repeated Measures, Cross-Over, Carry-Over, and Latin Square Designs01:15

Bioequivalence Experimental Study Designs: Repeated Measures, Cross-Over, Carry-Over, and Latin Square Designs

179
Body:Bioequivalence experimental study designs play a pivotal role in testing the effectiveness of various treatments. Key among these are the repeated measures, cross-over, carry-over, and Latin square designs. In the repeated measures design, each subject receives all treatments, allowing for temporal comparisons. This type of design is useful in reducing variability but requires careful planning to avoid bias.The cross-over design, an economical method, involves sequential administration of...
179
Group Design02:01

Group Design

10.2K
The most basic experimental design involves two groups: the experimental group and the control group. The two groups are designed to be the same except for one difference— experimental manipulation. The experimental group gets the experimental manipulation—that is, the treatment or variable being tested—and the control group does not. Since experimental manipulation is the only difference between the experimental and control groups, we can be sure that any differences between...
10.2K
Study Designs in Epidemiology01:20

Study Designs in Epidemiology

881
Epidemiological study designs are fundamental tools for investigating the distribution, determinants, and control of health conditions in populations. They help researchers understand the relationships between exposures and outcomes, and they broadly fall into two categories: "observational" and "experimental" studies.
Observational studies are those where the researcher does not intervene but rather observes natural variations. They include cross-sectional, cohort, and...
881
Bioequivalence Experimental Study Designs: Completely Randomized and Randomized Block Designs01:20

Bioequivalence Experimental Study Designs: Completely Randomized and Randomized Block Designs

226
Body:Bioequivalence experimental study designs are crucial methodologies used in evaluating and comparing the bioavailability of different drug products. These designs are categorized into various types: completely randomized, randomized block, repeated measures, cross and carry-over, and Latin square designs.Completely randomized designs involve randomly allocating treatments to all subjects participating in the experiment. This allocation is achieved by assigning unique random numbers to...
226
Randomized Experiments01:13

Randomized Experiments

8.8K
The randomization process involves assigning study participants randomly to experimental or control groups based on their probability of being equally assigned. Randomization is meant to eliminate selection bias and balance known and unknown confounding factors so that the control group is similar to the treatment group as much as possible. A computer program and a random number generator can be used to assign participants to groups in a way that minimizes bias.
Simple randomization
Simple...
8.8K
Observational Studies01:11

Observational Studies

10.7K
Observational studies are a type of analytical study where researchers observe events without any interventions. In other words, the researcher does not influence the response variable or the experiment's outcome.
There are three types of observational studies – Prospective, retrospective, and cross-sectional.
Prospective Study
Prospective studies, also known as longitudinal or cohort studies, are carried out by collecting future data from groups sharing similar characteristics. One...
10.7K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

AI-induced never-skilling in medical education.

Nature medicine·2026
Same author

The detectability paradox: bilingual medical report generation with open-weight models and the limits of human oversight.

Journal of the American Medical Informatics Association : JAMIA·2026
Same author

How to meaningfully evaluate AI in clinical medicine.

Nature medicine·2026
Same author

SpNeigh: spatial neighborhood and differential expression analysis for high-resolution spatial transcriptomics.

NAR genomics and bioinformatics·2026
Same author

Impact of an Ambient AI Scribe Among Clinicians and Patients: Real-World Prospective Observational Time-Motion Study.

JMIR medical informatics·2026
Same author

DILIConsult: A Multi-Agent Large Language Model Framework for Evaluating Drug-Induced Liver Injury in ICU Settings.

Pharmacotherapy·2026
Same journal

How Does That Large Language Model Make You Feel?

Journal of medical Internet research·2026
Same journal

Transformation Versus Innovation in Digital Health Care and the Future of Clinical AI.

Journal of medical Internet research·2026
Same journal

Building a Malaria Intelligence System for Real-Time Prediction and Data-Driven Intervention Planning.

Journal of medical Internet research·2026
Same journal

Therapeutic Interaction Features of AI Chatbots in Depression Interventions: Systematic Review and Meta-Analysis.

Journal of medical Internet research·2026
Same journal

Large Language Model Versus Multidisciplinary Team: Feasibility Study of Pancreatic Cancer Management Recommendations.

Journal of medical Internet research·2026
Same journal

Centers for Medicare & Medicaid Services to Launch Landmark ACCESS Program.

Journal of medical Internet research·2026
See all related articles

Related Experiment Video

Updated: Jan 15, 2026

Lexical Decision Task for Studying Written Word Recognition in Adults with and without Dementia or Mild Cognitive Impairment
06:48

Lexical Decision Task for Studying Written Word Recognition in Adults with and without Dementia or Mild Cognitive Impairment

Published on: June 25, 2019

9.7K

Large Language Models in Randomized Controlled Trials Design: Observational Study.

Liyuan Jin1, Jasmine Chiat Ling Ong1,2, Kabilan Elangovan3

  • 1Duke-NUS Medical School, 8 College Road, Singapore, 169857, Singapore, 65 66016503.

Journal of Medical Internet Research
|October 7, 2025
PubMed
Summary
This summary is machine-generated.

Large language models (LLMs) show promise in improving randomized controlled trial (RCT) design, enhancing recruitment and generalizability. While effective in intervention planning, LLMs require expert oversight for eligibility criteria and outcome measures to ensure safety and ethical standards.

Keywords:
GPT-4LLM-generated clinical trial designsclinical research ethicsclinical trial design evaluationeligibility criteriarecruitment diversitytrial failure reduction

More Related Videos

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

1.0K
Measuring Statistical Learning Across Modalities and Domains in School-Aged Children Via an Online Platform and Neuroimaging Techniques
08:05

Measuring Statistical Learning Across Modalities and Domains in School-Aged Children Via an Online Platform and Neuroimaging Techniques

Published on: June 30, 2020

8.0K

Related Experiment Videos

Last Updated: Jan 15, 2026

Lexical Decision Task for Studying Written Word Recognition in Adults with and without Dementia or Mild Cognitive Impairment
06:48

Lexical Decision Task for Studying Written Word Recognition in Adults with and without Dementia or Mild Cognitive Impairment

Published on: June 25, 2019

9.7K
Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

1.0K
Measuring Statistical Learning Across Modalities and Domains in School-Aged Children Via an Online Platform and Neuroimaging Techniques
08:05

Measuring Statistical Learning Across Modalities and Domains in School-Aged Children Via an Online Platform and Neuroimaging Techniques

Published on: June 30, 2020

8.0K

Area of Science:

  • Clinical Trial Design
  • Artificial Intelligence in Healthcare
  • Medical Research Methodology

Background:

  • Randomized controlled trials (RCTs) face significant challenges including limited generalizability, insufficient participant diversity, and high failure rates.
  • These limitations often stem from restrictive eligibility criteria and inefficient patient selection processes.
  • Large language models (LLMs) show potential in clinical applications, but their role in optimizing RCT design is largely unexplored.

Purpose of the Study:

  • To investigate the capability of LLMs, specifically GPT-4-Turbo-Preview, in assisting the design of RCTs.
  • To assess LLM's potential to improve RCT generalizability, recruitment diversity, and reduce failure rates.
  • To evaluate LLM-assisted RCT design while upholding clinical safety and ethical standards.

Main Methods:

  • An observational study analyzed 20 parallel-arm RCTs (10 completed, 10 registered) published after January 2024.
  • LLMs generated RCT designs based on provided criteria, including eligibility, recruitment, interventions, and outcomes.
  • Quantitative assessment by clinical experts and NLP metrics (BLEU, ROUGE-L, METEOR) evaluated LLM design accuracy against ClinicalTrials.gov data; qualitative assessments used Likert scales for safety, accuracy, bias, pragmatism, inclusivity, and diversity.

Main Results:

  • LLMs achieved 72% overall accuracy in replicating RCT designs, with high accuracy in recruitment (88%) and intervention (93%) design.
  • Lower accuracy was observed in designing eligibility criteria (55%) and outcomes measurement (53%).
  • Qualitative evaluations indicated strong clinical alignment, with LLM-generated designs ranking similarly to original designs in safety, accuracy, and objectivity, while enhancing diversity and pragmatism.

Conclusions:

  • LLMs demonstrate significant potential to enhance RCT design, particularly in recruitment and intervention strategies, improving generalizability and diversity.
  • Expert oversight and regulatory frameworks are crucial for ensuring patient safety and ethical compliance in LLM-assisted RCT design.
  • Further refinement of LLMs is needed to overcome limitations in eligibility criteria and outcomes measurement for broader clinical trial application.