Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Principles of Pharmacogenetics: Types of Genetic Variants01:27

Principles of Pharmacogenetics: Types of Genetic Variants

137
The human genome is over 99.9% identical between individuals, yet genetic differences exist at millions of bases. The human genome contains approximately 3 million variant positions per individual, many of which are heterozygous, contributing to genetic diversity and individual traits. Genetic variations include single-nucleotide polymorphisms (SNPs), insertions, deletions, and copy number variations (CNVs).SNPs, the most common variation, involve single-base changes in DNA. These can be...
137
Single Nucleotide Polymorphisms-SNPs01:05

Single Nucleotide Polymorphisms-SNPs

14.5K
A single nucleotide polymorphism or SNP is a single nucleotide variation at a specific genomic position in a large population. It is the most prevalent type of sequence variation found in the human genome. Point mutations that occur in more than 1% of the population qualify as SNPs. These are present once every 1000 nucleotides on an average in the human genome. Replacement of a purine with another purine (A/G) or a pyrimidine with another pyrimidine (C/T) is known as a transition. In contrast,...
14.5K
Genetic Variation01:25

Genetic Variation

1.6K
Genetic variation is the diversity in DNA sequences found among individuals of the same species. This diversity is crucial for a species' survival because it helps organisms adapt to environmental changes. Genetic variation begins with fertilization, where an egg and sperm cell merge. Each of these cells carries 23 chromosomes, up to 46 in the fertilized egg. Chromosomes are long DNA strands that contain genes, the basic units of heredity.
Genes exist in different versions called alleles,...
1.6K
Comparing Copy Number Variations and SNPs02:26

Comparing Copy Number Variations and SNPs

11.6K
Sequencing of the human genome has opened up several best-kept secrets of the genome. Scientists have identified thousands of genome variations that exist within a population. These variations can be a single nucleotide or a larger chromosomal variation.
Copy number variations or CNVs are the structural variations that cover more than 1kb of DNA sequence. The single nucleotide polymorphism (SNP), on the other hand, is a single nucleotide change or a point mutation that is found in more than 1%...
11.6K
Classification of Systems-I01:26

Classification of Systems-I

728
Linearity is a system property characterized by a direct input-output relationship, combining homogeneity and additivity.
Homogeneity dictates that if an input x(t) is multiplied by a constant c, the output y(t) is multiplied by the same constant. Mathematically, this is expressed as:
728
Human Virome01:26

Human Virome

47
The human body harbors a vast and diverse viral community known as the human virome. The virome includes bacteriophages that infect bacteria, and eukaryotic viruses that infect human cells. Transient dietary and environmental viruses also contribute to this dynamic ecosystem. Estimates suggest the human body may contain on the order of 10¹³ viral particles, though abundance varies widely by body site and detection method.Comprehensive characterization of the virome has become possible...
47

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Longest surviving patient with a homozygous splice-altering <i>EGFR</i> pathogenic variant presenting with skin autoinflammation and a Bartter-like salt-losing tubulopathy.

Journal of medical genetics·2026
Same author

Rethinking MS Therapeutics: From Disease Pathogenesis Mechanisms to AI-Driven Drug Discovery.

Journal of neuroimmune pharmacology : the official journal of the Society on NeuroImmune Pharmacology·2026
Same author

Prevalence of Novel and Recurrent Pathogenic Variants in <i>BRCA</i> Genes in a Cohort of Iranian Hereditary Breast Cancer Patients.

Advanced biomedical research·2026
Same author

Genetic Characterization of MODY in Iranian Families Using Multigenerational-Based Whole-Exome Sequencing Approach.

Journal of diabetes research·2026
Same author

A Truncating Variant in the <i>ERCC6</i> Gene With Three Different Phenotypes: Significant Effects of Modifier Genes.

Genetics research·2025
Same author

Uncovering age-specific subtypes of pediatric obesity and metabolic syndrome using machine learning algorithms.

Scientific reports·2025
Same journal

Correction: A method for supervoxel-wise association studies of age and other non-imaging variables from coronary computed tomography angiograms.

Scientific reports·2026
Same journal

Poly(bromophenol blue)/CoSn(OH)<sub>6</sub> cubic particles modified pencil graphite electrode for electrochemical determination of diphenhydramine.

Scientific reports·2026
Same journal

Dietary Chlorella, Spirulina, and acidifier modulate jejunal cytokine-related gene expression in broiler chickens.

Scientific reports·2026
Same journal

Perceived physical activity barriers in university students: associations with fatigue and eating behaviours.

Scientific reports·2026
Same journal

Refuge limitation structures habitat use in agricultural landscapes: evidence from Sunda pangolins.

Scientific reports·2026
Same journal

Lightweight stateless transaction verification with outsourced witness updates for UTXO blockchains.

Scientific reports·2026
See all related articles

Related Experiment Video

Updated: Apr 29, 2026

Determining the Likelihood of Variant Pathogenicity Using Amino Acid-level Signal-to-Noise Analysis of Genetic Variation
07:15

Determining the Likelihood of Variant Pathogenicity Using Amino Acid-level Signal-to-Noise Analysis of Genetic Variation

Published on: January 16, 2019

10.2K

MetaXVP: an interpretable machine learning framework for deep insight into variant pathogenicity and VUS

Masoud Dehghan Tezerjani1, Mohammadreza Sehhati2, Mohammad Amin Tabatabaiefar3

  • 1Department of Bioinformatics, School of Advanced Technologies in Medicine, Isfahan University of Medical Sciences, Isfahan, Iran.

Scientific Reports
|April 27, 2026
PubMed
Summary
This summary is machine-generated.

MetaXVP, an interpretable XGBoost model, accurately classifies nonsynonymous single nucleotide variants (nsSNVs) and variants of uncertain significance (VUS). It achieves high performance and provides feature explanations, aiding clinical genomics and variant interpretation.

Keywords:
Explainable AIMachine learningMedical genomicsWES

More Related Videos

In Vivo Functional Study of Disease-associated Rare Human Variants Using Drosophila
06:41

In Vivo Functional Study of Disease-associated Rare Human Variants Using Drosophila

Published on: August 20, 2019

12.4K
Targeted Next-generation Sequencing and Bioinformatics Pipeline to Evaluate Genetic Determinants of Constitutional Disease
09:34

Targeted Next-generation Sequencing and Bioinformatics Pipeline to Evaluate Genetic Determinants of Constitutional Disease

Published on: April 4, 2018

36.1K

Related Experiment Videos

Last Updated: Apr 29, 2026

Determining the Likelihood of Variant Pathogenicity Using Amino Acid-level Signal-to-Noise Analysis of Genetic Variation
07:15

Determining the Likelihood of Variant Pathogenicity Using Amino Acid-level Signal-to-Noise Analysis of Genetic Variation

Published on: January 16, 2019

10.2K
In Vivo Functional Study of Disease-associated Rare Human Variants Using Drosophila
06:41

In Vivo Functional Study of Disease-associated Rare Human Variants Using Drosophila

Published on: August 20, 2019

12.4K
Targeted Next-generation Sequencing and Bioinformatics Pipeline to Evaluate Genetic Determinants of Constitutional Disease
09:34

Targeted Next-generation Sequencing and Bioinformatics Pipeline to Evaluate Genetic Determinants of Constitutional Disease

Published on: April 4, 2018

36.1K

Area of Science:

  • Genomics
  • Bioinformatics
  • Computational Biology

Background:

  • Classifying nonsynonymous single nucleotide variants (nsSNVs) is crucial for clinical genomics but challenging, especially for variants of uncertain significance (VUS).
  • Existing computational methods for variant pathogenicity prediction often lack sufficient performance and interpretability.

Purpose of the Study:

  • To develop MetaXVP (Meta eXplainable XGBoost Variant Predictor), a novel computational tool combining high predictive accuracy with feature interpretability for nsSNV classification.
  • To provide a robust and explainable model for classifying VUS and aiding clinical decision-making.

Main Methods:

  • Developed MetaXVP using the XGBoost algorithm, trained on high-confidence pathogenic and benign nsSNVs from ClinVar.
  • Integrated 32 features from dbNSFP, selected via correlation-based feature selection.
  • Employed SHapley Additive exPlanations (SHAP) and Local Interpretable Model-agnostic Explanations (LIME) for model interpretability.
  • Evaluated performance on independent ClinVar test sets, a TP53 functional dataset, and a cancer somatic driver dataset using AUROC and AUPRC metrics.

Main Results:

  • MetaXVP achieved high classification performance across various test sets, with AUROC values of 0.991 for pathogenic/benign classification and 0.986 for VUS reclassification.
  • Demonstrated strong performance on a somatic cancer driver dataset (AUROC = 0.924).
  • SHAP analysis identified ensemble predictors (ClinPred, gMVP) and population allele frequency (gnomAD) as key influential features.

Conclusions:

  • MetaXVP offers a high-performing and interpretable solution for nsSNV classification, significantly advancing the interpretation of variants of uncertain significance.
  • The model's robustness and feature explanations provide valuable insights for clinical genomics applications.
  • Pre-computed scores for millions of nsSNVs are available to the research community.