Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Mouse Models of Cancer Study

Mouse Models of Cancer Study

Mice have long served as models for studying human biology and pathology because of their phylogenetic and physiological similarity with humans. They are also easy to maintain and breed in the laboratory, and hence, many inbred strains are now available for research. Studies on mice have contributed immeasurably to our understanding of cancer biology.
The development of transgenic, knockout, and knock-in mice has led to an exponential increase in their use as model organisms in research,...

Cancer Survival Analysis

Cancer Survival Analysis

Cancer survival analysis focuses on quantifying and interpreting the time from a key starting point, such as diagnosis or the initiation of treatment, to a specific endpoint, such as remission or death. This analysis provides critical insights into treatment effectiveness and factors that influence patient outcomes, helping to shape clinical decisions and guide prognostic evaluations. A cornerstone of oncology research, survival analysis tackles the challenges of skewed, non-normally...

Combination Therapies and Personalized Medicine

Combination Therapies and Personalized Medicine

Combining two or more treatment methods increases the life span of cancer patients while reducing damage to vital organs or tissue from the overuse of a single treatment. Combination therapy also targets different cancer-inducing pathways, thus reducing the chances of developing resistance to treatment.
The combination of the drug acetazolamide and sulforaphane is a good example of combination therapy to treat cancer. The cells in the interior of a large tumor often die due to the hypoxic and...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Predicted Effector Gene Aggregation, Standards and Unified Schema (PEGASUS): A Community Framework for Effector Gene Reporting.

bioRxiv : the preprint server for biology·2026

Same author

Programmatic access to ICTV virus taxonomy through a public ontology API.

bioRxiv : the preprint server for biology·2026

Same author

Systemic Anticancer Therapy Timelines Extraction From Electronic Medical Records Text: Algorithm Development and Validation.

JMIR bioinformatics and biotechnology·2025

Same author

Reimagining Evidence: Artificial Intelligence Synthetic Data Generation for Cancer Research.

JCO clinical cancer informatics·2025

Same author

Cross-Site Predictions of Readmission After Psychiatric Hospitalization With Mood or Psychotic Disorders: Retrospective Study.

JMIR mental health·2025

Same author

Informatics at the Frontier of Cancer Research.

Cancer research·2025

Same journal

Readability of AI-Generated Patient Information on Glucagon-Like Peptide-1 Receptor Agonists.

JMIR bioinformatics and biotechnology·2026

Same journal

Random Survival Forest Versus Elastic-Net Regularized Cox Regression for Survival Prediction in Acute Myeloid Leukemia at Distinct Treatment Time Points: Model Performance Comparison Study.

JMIR bioinformatics and biotechnology·2026

Same journal

Temporal Reproducibility of a Genetic Algorithm-Derived Health Risk Score: Standardized Out-of-Fold Validation Framework (2021-2023).

JMIR bioinformatics and biotechnology·2026

Same journal

The AudioGene Translational Dashboard for Diagnosing Autosomal Dominant Nonsyndromic Hearing Loss: Phenotypic Data Visualization and Analysis Study.

JMIR bioinformatics and biotechnology·2026

Same journal

A Strategic Partnership to Advance AI Applications in Genomics and Bioinformatics for Health Innovation.

JMIR bioinformatics and biotechnology·2026

Same journal

Prevalence and Associated Risk Factors of Bovine Fasciolosis in Bahir Dar, Ethiopia: Cross-Sectional Study.

JMIR bioinformatics and biotechnology·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jan 9, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

Extracting Knowledge From Scientific Texts on Patient-Derived Cancer Models Using Large Language Models: Algorithm

Jiarui Yao^1,2, Zinaida Perova³, Tushar Mandloi³

¹Computational Health Informatics Program, Boston Children's Hospital, 401 Park Drive, Boston, MA, United States, 1 7813545014.

JMIR Bioinformatics and Biotechnology

|December 4, 2025

Summary

This summary is machine-generated.

Soft prompting significantly boosts performance of open large language models (LLMs) for extracting patient-derived cancer model (PDCM) entities from scientific texts, rivaling proprietary models.

Keywords:

in-context learning information extraction knowledge extraction large language models patient-derived cancer models prompt tuning soft prompting

More Related Videos

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Published on: July 22, 2025

Related Experiment Videos

Last Updated: Jan 9, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Published on: July 22, 2025

Area of Science:

Biomedical Informatics
Artificial Intelligence in Oncology
Computational Biology

Background:

Patient-derived cancer models (PDCMs) are crucial for cancer research and preclinical studies.
The volume of PDCM-related publications has surged, necessitating efficient knowledge extraction.
Large language models (LLMs) offer advanced capabilities for processing scientific literature at scale.

Purpose of the Study:

To investigate LLM-based systems for automated extraction of PDCM-related entities.
To compare direct prompting and soft prompting techniques for entity extraction.

Main Methods:

Explored direct prompting (manual prompt design) and soft prompting (trainable continuous vectors).
Evaluated both approaches across proprietary (GPT4-o) and open (LLaMA3) LLMs.
Utilized a manually annotated dataset of 100 PDCM abstracts with 15 entity types.

Main Results:

GPT4-o with direct prompting achieved F1-scores of 50.48 (exact match) and 71.36 (overlapping match).
LLaMA3 soft prompting significantly improved performance over direct prompting (exact match: 7.06 to 46.68; overlapping match: 12.0 to 71.80).
LLaMA3 soft prompting slightly outperformed GPT4-o direct prompting in the overlapping match setting.

Conclusions:

Soft prompting enhances the performance of smaller open LLMs for PDCM entity extraction.
Training soft prompts on open models can yield performance comparable to proprietary LLMs.
This approach facilitates scalable knowledge discovery in PDCM research.