Benchmarking Transformer Embedding Models for Biomedical Terminology Standardization.

Aditya Lahiri¹, Sangeeta Shukla¹, Ben Stear¹

¹The Department of Biomedical and Health Informatics, The Children's Hospital of Philadelphia, Philadelphia PA, USA.

Machine Learning with Applications

July 28, 2025

Summary

This summary is machine-generated.

Transformer-based large language model (LLM) embeddings can standardize biomedical terminology in clinical trial registries, improving data consistency. In this benchmark, embedding methods reached up to 69.4% accuracy against the WHO tumor classification, compared with at most 32.6% for traditional text-matching methods.

Keywords:

Clinical Text Standardization; Large Language Models; NIH Clinical Trials Registry; Text Embedding; WHO Tumor Classification

Area of Science:

Biomedical Informatics
Natural Language Processing
Machine Learning

Background:

Biomedical databases suffer from inconsistent terminology, hindering machine learning and data integration.
Standardizing terminology is crucial for effective use of biomedical data.

Purpose of the Study:

To evaluate the effectiveness of transformer/large language models (LLMs) for standardizing biomedical terminology in the NIH Clinical Trials Registry (CTR).
To benchmark LLM-based approaches against traditional text-matching algorithms using the World Health Organization Classification of Tumours (WHO System) as a gold standard.

Main Methods:

Developed CANTOS (Clinical Trials Automated Nomenclature and Tumor Ontology Standardization) framework to extract and standardize tumor names from the CTR.
Benchmarked 36 methods, including LLM/transformer text embeddings and traditional algorithms, against manually annotated WHO System terms.
Assessed accuracy using a sample of 1,600 CTR tumor names.
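
The two families of methods compared above can be illustrated with a minimal sketch. This is not the CANTOS implementation: the reference list is a hypothetical four-term stand-in for the WHO System, and `toy_embed` replaces a real transformer embedding with a bag of character trigrams so that the example runs with the standard library alone. The function names are assumptions made for illustration. The shared idea is that each registry name is mapped to the reference term with the highest similarity score.

```python
import math
from collections import Counter
from difflib import SequenceMatcher

# Hypothetical reference list standing in for WHO System terms.
REFERENCE_TERMS = ["glioblastoma", "medulloblastoma", "ependymoma", "osteosarcoma"]

def toy_embed(text, n=3):
    """Stand-in for a transformer embedding: counts of character trigrams."""
    padded = f"  {text.lower()} "
    return Counter(padded[i:i + n] for i in range(len(padded) - n + 1))

def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def match_by_embedding(name, terms=REFERENCE_TERMS):
    """Embedding-style matching: nearest reference term by cosine similarity."""
    query = toy_embed(name)
    return max(terms, key=lambda t: cosine(query, toy_embed(t)))

def match_by_text(name, terms=REFERENCE_TERMS):
    """Text-matching baseline: highest difflib similarity ratio."""
    return max(terms, key=lambda t: SequenceMatcher(None, name.lower(), t).ratio())

if __name__ == "__main__":
    for raw_name in ["Glioblastoma multiforme", "Ependymoma"]:
        print(raw_name, "->", match_by_embedding(raw_name), "|", match_by_text(raw_name))
```

In a real benchmark, `toy_embed` would be replaced by vectors from an actual embedding model, and the reference list by the full set of manually annotated WHO System terms.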

Main Results:

LLM/transformer-based embedding methods significantly outperformed text-matching approaches, achieving up to 69.4% accuracy.
Text-matching methods achieved a maximum accuracy of 32.6%.
A majority voting ensemble improved accuracy to 71.9%.
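
A majority voting ensemble and the accuracy metric can be sketched as follows. The source does not state how many methods voted or how ties were broken, so the tie rule here (the earliest-listed method wins) is an assumption, and the predictions are toy data invented for illustration, not results from the study.

```python
from collections import Counter

def majority_vote(predictions):
    """Return the most frequent prediction; ties go to the earliest-listed method."""
    counts = Counter(predictions)
    best = max(counts.values())
    return next(p for p in predictions if counts[p] == best)

def accuracy(predicted, gold):
    """Fraction of predictions that exactly match the gold-standard term."""
    return sum(p == g for p, g in zip(predicted, gold)) / len(gold)

# Toy gold standard and per-method predictions (hypothetical, not study data).
gold = ["glioblastoma", "ependymoma", "osteosarcoma", "medulloblastoma"]
method_a = ["glioblastoma", "ependymoma", "ependymoma", "medulloblastoma"]
method_b = ["glioblastoma", "osteosarcoma", "osteosarcoma", "medulloblastoma"]
method_c = ["medulloblastoma", "ependymoma", "osteosarcoma", "glioblastoma"]

# Vote per tumor name across the three methods.
ensemble = [majority_vote(votes) for votes in zip(method_a, method_b, method_c)]

if __name__ == "__main__":
    for label, preds in [("A", method_a), ("B", method_b), ("C", method_c), ("ensemble", ensemble)]:
        print(label, accuracy(preds, gold))
```

The toy data are arranged so that each method errs on a different name, which is the situation in which voting can exceed the best single method.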

Conclusions:

LLM/transformer embedding models are effective for standardizing biomedical terminology.
The CANTOS framework provides a reproducible method for benchmarking machine learning in biomedical data standardization.
Accurate terminology standardization enhances the utility of biomedical databases for research and machine learning.