Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Transformers with Off-Nominal Turns Ratios01:25

Transformers with Off-Nominal Turns Ratios

205
In scenarios involving parallel transformers with disparate ratings, developing per-unit models requires accommodating off-nominal turns ratios. This situation arises when the selected base voltages are not proportional to the transformer’s voltage ratings. Consider a transformer where the rated voltages are related by the term a. If the chosen voltage bases satisfy a relationship involving term b, term c is defined as the ratio of these bases. This ratio is then substituted into the...
205
Anatomical Terminology01:20

Anatomical Terminology

19.9K
Knowledge of anatomy is essential to understand human biology and medicine. Anatomists and health care professionals use standard terminology to describe the human body with more precision and no ambiguity. Anatomical terms have mostly Greek and Latin-derived roots. Because these languages are rarely used in conversation, the meaning of words remains the same. Each term is made up of a root in between the prefixes and suffixes. The root of a term often refers to an organ, tissue, or condition,...
19.9K
Improving Translational Accuracy02:07

Improving Translational Accuracy

11.9K
Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
11.9K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Study protocol for a mobile app that delivers peer-led mental health support to adults with type 1 diabetes: The REACHOUT randomized wait-list controlled trial.

Contemporary clinical trials·2026
Same author

The Common Fund Data Ecosystem (CFDE).

bioRxiv : the preprint server for biology·2026
Same author

Interfacial V<sup>5+</sup>/V<sup>4+</sup> Redox Kinetics Enabling Fe-free Photo-Fenton Catalysis in α-V<sub>2</sub>O<sub>5</sub>-ACNT Nanohybrids for Efficient Removal of Mixed CBZ and DFN.

ACS applied materials & interfaces·2026
Same author

Alternative splicing in pediatric central nervous system tumors highlights oncofetal candidate <i>CLK1</i> exon 4.

Neuro-oncology pediatrics·2026
Same author

Non-reciprocal coevolution in a fungus-gardening ant.

Molecular phylogenetics and evolution·2026
Same author

Follicular Helper T Cells and B Cell Maturation in Patients with 22q11.2 Deletion Syndrome and Recurrent Infections.

Journal of clinical immunology·2026
Same journal

Simulation and empirical evaluation of biologically-informed neural network performance.

Machine learning with applications·2026
Same journal

Regularized regression outperforms trees for predicting cognitive function in the Health and Retirement Study.

Machine learning with applications·2025
Same journal

Case-Base Neural Network: Survival analysis with time-varying, higher-order interactions.

Machine learning with applications·2025
Same journal

INSTRAS: INfrared Spectroscopic imaging-based TRAnsformers for medical image Segmentation.

Machine learning with applications·2024
Same journal

Application of deep neural networks for inferring pressure in polymeric acoustic transponders/sensors.

Machine learning with applications·2023
Same journal

An automated treatment plan alert system to safeguard cancer treatments in radiation therapy.

Machine learning with applications·2023
See all related articles

Related Experiment Video

Updated: Sep 13, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

684

Benchmarking Transformer Embedding Models for Biomedical Terminology Standardization.

Aditya Lahiri1, Sangeeta Shukla1, Ben Stear1

  • 1The Department of Biomedical and Health Informatics, The Children's Hospital of Philadelphia, Philadelphia PA, USA.

Machine Learning with Applications
|July 28, 2025
PubMed
Summary
This summary is machine-generated.

Large language models (LLMs) can standardize biomedical terminology in clinical trial registries, improving data consistency. This study benchmarks LLMs against traditional methods, showing superior accuracy for terminology standardization.

Keywords:
Clinical Text StandardizationLarge Language ModelsNIH Clinical Trials RegistryText EmbeddingWHO Tumor Classification

More Related Videos

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems
05:47

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

584
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

904

Related Experiment Videos

Last Updated: Sep 13, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

684
Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems
05:47

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

584
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

904

Area of Science:

  • Biomedical Informatics
  • Natural Language Processing
  • Machine Learning

Background:

  • Biomedical databases suffer from inconsistent terminology, hindering machine learning and data integration.
  • Standardizing terminology is crucial for effective use of biomedical data.

Purpose of the Study:

  • To evaluate the effectiveness of transformer/large language models (LLMs) for standardizing biomedical terminology in the NIH Clinical Trials Registry (CTR).
  • To benchmark LLM-based approaches against traditional text-matching algorithms using the World Health Organization Classification of Tumours (WHO System) as a gold standard.

Main Methods:

  • Developed CANTOS (Clinical Trials Automated Nomenclature and Tumor Ontology Standardization) framework to extract and standardize tumor names from the CTR.
  • Benchmarked 36 methods, including LLM/transformer text embeddings and traditional algorithms, against manually annotated WHO System terms.
  • Assessed accuracy using a sample of 1,600 CTR tumor names.

Main Results:

  • LLM/transformer-based embedding methods significantly outperformed text-matching approaches, achieving up to 69.4% accuracy.
  • Text-matching methods achieved a maximum accuracy of 32.6%.
  • A majority voting ensemble improved accuracy to 71.9%.

Conclusions:

  • LLM/transformer embedding models are effective for standardizing biomedical terminology.
  • The CANTOS framework provides a reproducible method for benchmarking machine learning in biomedical data standardization.
  • Accurate terminology standardization enhances the utility of biomedical databases for research and machine learning.