Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Improving Translational Accuracy

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...

Improving Translational Accuracy

Improving Translational Accuracy

Language Development

Language Development

Children master language quickly and with relative ease, supported by both biological predisposition and reinforcement. B. F. Skinner (1957) proposed that language is learned through reinforcement, while Noam Chomsky (1965) argued that language acquisition mechanisms are biologically determined.
The critical period for language acquisition suggests that the ability to acquire language is at its peak early in life. As people age, this proficiency decreases. Language development begins very...

Statistical Software for Data Analysis and Clinical Trials

Statistical Software for Data Analysis and Clinical Trials

Statistical software is pivotal in data analysis and clinical trials by providing tools to analyze data, draw conclusions, and make predictions. These software packages range from simple data management applications to complex analytical platforms, supporting various statistical tests, models, and simulation techniques. Their significance lies in their ability to handle vast amounts of data with precision and efficiency, enabling researchers to validate hypotheses, identify trends, and make...

Analysis of Population Pharmacokinetic Data

Analysis of Population Pharmacokinetic Data

Analysis of population pharmacokinetic data involves studying the behavior of drugs within diverse populations to understand their pharmacokinetic parameters. Traditional pharmacokinetic methods typically involve collecting samples from a few individuals and estimating these parameters. While these methods are commonly used, they have limitations in capturing the variability in drug response among individuals or heterogeneous populations. Population pharmacokinetics is employed to address these...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

[Study on Standardization Methods of Multi-Source Heterogeneous Data from ICU Medical Devices Based on openEHR].

Zhongguo yi liao qi xie za zhi = Chinese journal of medical instrumentation·2026

Same author

Leveraging Large Language Models to Integrate Clinical Knowledge and Machine Learning Predictions for Lymph Node Metastasis Prediction: Development of a Knowledge-Augmented Framework.

JMIR medical informatics·2026

Same author

DCBM-Tri: a dual-channel bilinear mapping triplet model for early recognition of acute kidney injury in imbalanced cohorts.

Scientific reports·2026

Same author

Genome-Wide Association Study and Candidate Gene Mining for Plant Height and Main Stem Node Number in Soybean from Northwest China.

Plants (Basel, Switzerland)·2026

Same author

Challenges and Solutions in Deploying Systematized Nomenclature of Medicine-Clinical Terms in the Chinese Healthcare Context.

Health care science·2026

Same author

A Large Language Model-Powered Multiagent Framework Emulating Standardized Patients in Clinical Communication Skills Training: Development and Evaluation Study.

Journal of medical Internet research·2026

Same journal

BlockFedMed: A blockchain-federated learning framework for privacy-preserving mortality prediction across heterogeneous intensive care units.

International journal of medical informatics·2026

Same journal

Integrating clinical decision support systems in pediatric oncology: A scoping review of applications, implementation gaps, and management Implications.

International journal of medical informatics·2026

Same journal

Understanding digital health capability of allied health professionals - a mixed-methods study with content validity analysis.

International journal of medical informatics·2026

Same journal

On-premises open-source large language models for privacy-preserving multimodal depression screening.

International journal of medical informatics·2026

Same journal

Data mining methods, tasks, and algorithms for adverse drug reaction analysis in pharmacovigilance: A scoping review.

International journal of medical informatics·2026

Same journal

Development and validation of an interpretable machine learning model for predicting systemic inflammatory response syndrome after percutaneous nephrolithotomy: A multicenter study.

International journal of medical informatics·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jan 12, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

Learning from experts: A self-improving LLM framework for study population generation in clinical research.

Yaoqian Sun¹, Zikang Chen¹, Hailing Cai¹

¹College of Biomedical Engineering and Instrument Science, Zhejiang University, Zheda Road, 310027 Hangzhou, Zhejiang Province, China.

International Journal of Medical Informatics

|November 6, 2025

Summary

This summary is machine-generated.

CriteriaLLM, a novel framework, uses large language models (LLMs) with clinician feedback to generate credible study populations from clinical objectives. This expert-in-the-loop approach enhances real-world evidence generation for clinical research.

Keywords:

Clinical research Expert-in-the-loop Large language models Study population

More Related Videos

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Published on: July 22, 2025

Related Experiment Videos

Last Updated: Jan 12, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Published on: July 22, 2025

Area of Science:

Artificial Intelligence in Clinical Research
Real-World Data (RWD) and Real-World Evidence (RWE) Generation
Clinical Study Design Optimization

Background:

Electronic health records have increased real-world data (RWD) availability for real-world evidence (RWE) generation.
Large language models (LLMs) aid RWD research but struggle with interpretable and credible study population design.
Bridging study objectives and downstream analyses for RWD research remains a challenge.

Purpose of the Study:

To introduce CriteriaLLM, a framework enabling LLMs to generate eligible study populations from clinical research objectives.
To incorporate clinician feedback and an expert knowledge base for enhanced LLM-driven population design.
To improve the interpretability and credibility of LLM-generated study populations.

Main Methods:

CriteriaLLM integrates clinician feedback into LLMs for study population generation.
An expert knowledge base, inspired by after-action reviews, records LLM outputs and clinician modifications.
A dual-retrieval algorithm (disease domain relevance and lexical similarity) guides LLM generations using historical cases.
A continuous validation loop with iterative expert feedback refines model performance.

Main Results:

The framework was evaluated on 254 published clinical studies using MIMIC-III data and four LLMs (GPT-4o, Deepseek-R1, LLaMA models).
CriteriaLLM achieved a high Macro F1 score of 0.9180 in generating quality study populations.
The framework demonstrated generalizability across LLMs with varying sizes and deployment methods.

Conclusions:

CriteriaLLM enables LLMs to generate eligible study populations from clinical objectives without fine-tuning.
Structured expert feedback and retrieval guidance enhance the quality and reliability of study criteria.
The framework offers a scalable, self-improving approach for integrating AI into clinical research, ensuring clinical appropriateness and interpretability.