Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Polygenic Traits01:18

Polygenic Traits

70.3K
When more than one gene is responsible for a given phenotype, the trait is considered polygenic. Human height is a polygenic trait. Studies have uncovered hundreds of loci that influence height, and there are believed to be many more. Due to the high number of genes involved, as well as environmental and nutritional factors, height varies significantly within a given population. The distribution of height forms a bell-shaped curve, with relatively few individuals in the population at the...
70.3K
Multiple Allele Traits01:49

Multiple Allele Traits

38.7K
The Concept of Multiple Allelism
38.7K
Pharmacogenetic Phenotypes: Alterations in Pharmacokinetics, Drug Targets and Biologic Milieu01:29

Pharmacogenetic Phenotypes: Alterations in Pharmacokinetics, Drug Targets and Biologic Milieu

108
Genetic variations significantly influence drug response through pharmacokinetics, receptor interactions, and biologic milieu modifications. Pharmacokinetic alterations impact drug metabolism and clearance, affecting efficacy and toxicity. Variants in drug-metabolizing enzymes, such as CYP2C9 and CYP2C19, alter drug activation and elimination. For example, CYP2C9 loss-of-function variants require lower warfarin doses to prevent excessive bleeding, while CYP2C19 variants reduce clopidogrel...
108
Genome-wide Association Studies-GWAS01:11

Genome-wide Association Studies-GWAS

16.6K
Genome-wide association studies or GWAS are used to identify whether common SNPs are associated with certain diseases. Suppose specific SNPs are more frequently observed in individuals with a particular disease than those without the disease. In that case, those SNPs are said to be associated with the disease. Chi-square analysis is performed to check the probability of the allele likely to be associated with the disease.
GWAS does not require the identification of the target gene involved in...
16.6K
Epistasis01:39

Epistasis

51.1K
In addition to multiple alleles at the same locus influencing traits, numerous genes or alleles at different locations may interact and influence phenotypes in a phenomenon called epistasis. For example, rabbit fur can be black or brown depending on whether the animal is homozygous dominant or heterozygous at a TYRP1 locus. However, if the rabbit is also homozygous recessive at a locus on the tyrosinase gene (TYR), it will have an unshaded coat that appears white, regardless of its TYRP1...
51.1K
Epistasis Analysis01:09

Epistasis Analysis

6.2K
Although Mendel chose seven unrelated traits in peas to study gene segregation, most traits involve multiple gene interactions that create a spectrum of phenotypes. When the interaction of various genes or alleles at different locations influences a phenotype, this is called epistasis. Epistasis often involves one gene masking or interfering with the expression of another (antagonistic epistasis). Epistasis often occurs when different genes are part of the same biochemical pathway. The...
6.2K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Robust AI-ECG for Predicting Left Ventricular Systolic Dysfunction in Pediatric Congenital Heart Disease.

AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science·2026
Same author

An ECG foundation model for generalizable cardiac function prediction across the lifespan.

medRxiv : the preprint server for health sciences·2026
Same author

Estimating worker reentry times following application of two generic spray polyurethane foam formulations according to ASTM Standard Practice D8445-22a.

Journal of occupational and environmental hygiene·2026
Same author

Large language models require a new form of oversight: capability-based monitoring.

NPJ digital medicine·2026
Same author

A Hybrid Language Framework for Ontology-Based Clinical Concept Extraction.

Journal of healthcare informatics research·2026
Same author

Incidence and risk factors for delayed structural changes in Pipeline Embolization Devices.

Journal of neurointerventional surgery·2026
Same journal

Sensitivity Analyses of a Scoring System for a Contraception Decision Aid.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
Same journal

Improving electronic health record processing of large language models via retrieval-augmented generation: A case study on dietary supplements.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
Same journal

Developing a User-Centered Mobile Application Prototype: Bridging Lower-Limb Fracture Care from Skilled Nursing Facility and Back to the Community.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
Same journal

KERAP: A Knowledge-Enhanced Reasoning Approach for Accurate Zero-shot Diagnosis Prediction Using Multi-agent LLMs.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
Same journal

Automating Adjudication of Cardiovascular Events Using Large Language Models.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
Same journal

Predictive Factors and State-Level Barriers to Postpartum Birth Control Usage in the United States: Insights from PRAMS Phase 8.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
See all related articles

Related Experiment Video

Updated: Mar 24, 2026

Author Spotlight: Generating Neuronal Phenotypic Profiles - A Protocol to Culture and Image Human Midbrain Dopaminergic Neurons
09:21

Author Spotlight: Generating Neuronal Phenotypic Profiles - A Protocol to Culture and Image Human Midbrain Dopaminergic Neurons

Published on: July 7, 2023

2.1K

Semi-supervised Learning for Phenotyping Tasks.

Dmitriy Dligach1, Timothy Miller1, Guergana K Savova1

  • 1Boston Children's Hospital and Harvard Medical School, Boston, MA.

AMIA ... Annual Symposium Proceedings. AMIA Symposium
|March 10, 2016
PubMed
Summary
This summary is machine-generated.

Semi-supervised learning, using Expectation Maximization (EM), offers a cost-effective alternative to manual chart review for electronic health records phenotyping. This approach significantly reduces the need for labeled data, saving time and resources.

More Related Videos

A Telemetric, Gravimetric Platform for Real-Time Physiological Phenotyping of Plant–Environment Interactions
15:30

A Telemetric, Gravimetric Platform for Real-Time Physiological Phenotyping of Plant–Environment Interactions

Published on: August 5, 2020

12.8K
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

1.4K

Related Experiment Videos

Last Updated: Mar 24, 2026

Author Spotlight: Generating Neuronal Phenotypic Profiles - A Protocol to Culture and Image Human Midbrain Dopaminergic Neurons
09:21

Author Spotlight: Generating Neuronal Phenotypic Profiles - A Protocol to Culture and Image Human Midbrain Dopaminergic Neurons

Published on: July 7, 2023

2.1K
A Telemetric, Gravimetric Platform for Real-Time Physiological Phenotyping of Plant–Environment Interactions
15:30

A Telemetric, Gravimetric Platform for Real-Time Physiological Phenotyping of Plant–Environment Interactions

Published on: August 5, 2020

12.8K
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

1.4K

Area of Science:

  • Computational biology
  • Medical informatics
  • Machine learning

Background:

  • Supervised learning is standard for electronic health records (EHR)-based phenotyping but requires expensive manual chart review.
  • Semi-supervised learning leverages both limited labeled and abundant unlabeled data to improve efficiency.

Purpose of the Study:

  • To investigate a family of semi-supervised learning algorithms based on Expectation Maximization (EM) for EHR phenotyping.
  • To evaluate the effectiveness of basic and Augmented EM algorithms in reducing the need for manual data labeling.

Main Methods:

  • Exploration of Expectation Maximization (EM) and Augmented EM algorithms for phenotyping tasks.
  • Assessment of weighting factors in Augmented EM, considering cross-validation and heuristic methods.
  • Training phenotyping models with limited labeled and extensive unlabeled EHR data.

Main Results:

  • Basic EM can yield inaccurate parameter estimates when modeling assumptions are unmet.
  • Augmented EM, with a weighting factor, mitigates inaccuracies by downweighting unlabeled data.
  • Accurate phenotyping models were achieved using only a few hundred labeled examples alongside numerous unlabeled ones.

Conclusions:

  • Semi-supervised learning, particularly Augmented EM, offers a viable and cost-effective solution for EHR phenotyping.
  • This approach can substantially decrease the manual chart review burden, enabling significant cost savings.
  • Careful selection of weighting factors, potentially using heuristic methods over cross-validation, is crucial for optimal performance.