Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Improving Translational Accuracy02:07

Improving Translational Accuracy

2.6K
2.6K
Improving Translational Accuracy02:07

Improving Translational Accuracy

11.5K
Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
11.5K
Anatomical Terminology01:20

Anatomical Terminology

22.0K
Knowledge of anatomy is essential to understand human biology and medicine. Anatomists and health care professionals use standard terminology to describe the human body with more precision and no ambiguity. Anatomical terms have mostly Greek and Latin-derived roots. Because these languages are rarely used in conversation, the meaning of words remains the same. Each term is made up of a root in between the prefixes and suffixes. The root of a term often refers to an organ, tissue, or condition,...
22.0K
Genomics02:02

Genomics

35.2K
Genomics is the science of genomes: it is the study of all the genetic material of an organism. In humans, the genome consists of information carried in 23 pairs of chromosomes in the nucleus, as well as mitochondrial DNA. In genomics, both coding and non-coding DNA is sequenced and analyzed. Genomics allows a better understanding of all living things, their evolution, and their diversity. It has a myriad of uses: for example, to build phylogenetic trees, to improve productivity and...
35.2K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Temporal Annotation of German Clinical Language in Real and Synthetic Clinical Documents: Corpus Development and Baseline Tagger Validation Study.

Journal of medical Internet research·2026
Same author

GeMTeX's De-Identification in Action: Lessons Learned & Devil's Details.

Studies in health technology and informatics·2025
Same author

Clinical document corpora-real ones, translated and synthetic substitutes, and assorted domain proxies: a survey of diversity in corpus design, with focus on German text data.

JAMIA open·2025
Same author

De-Identifying GRASCCO - A Pilot Study for the De-Identification of the German Medical Text Project (GeMTeX) Corpus.

Studies in health technology and informatics·2024
Same author

Final Report on the German Clinical Reference Corpus 3000PA.

Studies in health technology and informatics·2024
Same author

Influence of Context in Transformer-Based Medication Relation Extraction.

Studies in health technology and informatics·2024
Same journal

A GenAI Pipeline for Violinist Kinematic Data Management.

Studies in health technology and informatics·2026
Same journal

AMAL-For-Qatar: A Comprehensive AI Ecosystem for Fetal Ultrasound Analysis - Project Overview and Achievements.

Studies in health technology and informatics·2026
Same journal

Longitudinal Treatment-Aware Multimodal AI for Dermatology: A Scoping Review.

Studies in health technology and informatics·2026
Same journal

Predicting Postpartum Depression Using Imbalance-Aware Machine Learning.

Studies in health technology and informatics·2026
Same journal

Validation of Deep-Learning Models for Autosegmentation of Brain Metastases.

Studies in health technology and informatics·2026
Same journal

Delay-Dependent Gating in Modular RNNs.

Studies in health technology and informatics·2026
See all related articles

Related Experiment Video

Updated: Apr 25, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

1.3K

Exploiting parallel corpora to scale up multilingual biomedical terminologies.

Johannes Hellrich1, Udo Hahn1

  • 1Jena University Language & Information Engineering (JULIE) Lab Friedrich-Schiller-Universität Jena, Jena, Germany.

Studies in Health Technology and Informatics
|August 28, 2014
PubMed
Summary
This summary is machine-generated.

This study introduces a computational method for creating biomedical terminologies using machine translation and parallel corpora. The approach successfully generated thousands of new terms and synonyms across four languages, with high expert validation rates.

More Related Videos

Cloud-Based Phrase Mining and Analysis of User-Defined Phrase-Category Association in Biomedical Publications
09:20

Cloud-Based Phrase Mining and Analysis of User-Defined Phrase-Category Association in Biomedical Publications

Published on: February 23, 2019

10.8K
Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems
05:47

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

1.9K

Related Experiment Videos

Last Updated: Apr 25, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

1.3K
Cloud-Based Phrase Mining and Analysis of User-Defined Phrase-Category Association in Biomedical Publications
09:20

Cloud-Based Phrase Mining and Analysis of User-Defined Phrase-Category Association in Biomedical Publications

Published on: February 23, 2019

10.8K
Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems
05:47

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

1.9K

Area of Science:

  • Biomedical Informatics
  • Computational Linguistics
  • Natural Language Processing

Background:

  • Biomedical terminology creation is resource-intensive, relying heavily on human domain experts.
  • Maintaining multilingual biomedical terminologies presents significant challenges due to cost and complexity.

Purpose of the Study:

  • To computationally support the creation and maintenance of biomedical terminologies.
  • To develop a machine translation-guided classification approach for automated term acquisition.
  • To evaluate the effectiveness of this method in generating new terms and synonyms for multiple languages.

Main Methods:

  • Treated term acquisition as a machine translation-guided classification problem.
  • Utilized parallel corpora for training and term generation.
  • Applied the method to UMLS-derived terminologies for French, German, Spanish, and Dutch.

Main Results:

  • Generated 18,000 new terms/synonyms for French.
  • Generated 23,000 new terms/synonyms for German.
  • Generated 19,000 new terms/synonyms for Spanish.
  • Generated 12,000 new terms/synonyms for Dutch.
  • Expert assessment of a German subset showed ~80% of new terms were bio-medically reasonable and terminologically valid.

Conclusions:

  • The machine translation-guided classification approach effectively supports automated biomedical terminology acquisition.
  • This computational method significantly reduces the resource burden associated with creating multilingual biomedical terminologies.
  • The generated terms demonstrate high bio-medical and terminological validity, as confirmed by expert evaluation.