Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Improving Translational Accuracy02:07

Improving Translational Accuracy

9.3K
Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
9.3K
Comparing Copy Number Variations and SNPs02:26

Comparing Copy Number Variations and SNPs

17.5K
Sequencing of the human genome has opened up several best-kept secrets of the genome. Scientists have identified thousands of genome variations that exist within a population. These variations can be a single nucleotide or a larger chromosomal variation.
Copy number variations or CNVs are the structural variations that cover more than 1kb of DNA sequence. The single nucleotide polymorphism (SNP), on the other hand, is a single nucleotide change or a point mutation that is found in more than 1%...
17.5K
Prediction Intervals01:03

Prediction Intervals

2.2K
The interval estimate of any variable is known as the prediction interval. It helps decide if a point estimate is dependable.
However, the point estimate is most likely not the exact value of the population parameter, but close to it. After calculating point estimates, we construct interval estimates, called confidence intervals or prediction intervals. This prediction interval comprises a range of values unlike the point estimate and is a better predictor of the observed sample value, y. 
2.2K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Harmonizing standards and resources for the medical genome.

Nature·2026
Same author

Evolutionary dynamics of Respiratory Syncytial Virus in pre-pandemic, pandemic, and post-pandemic periods in Houston, Texas, USA.

bioRxiv : the preprint server for biology·2026
Same author

Structural variant calling using Sniffles2.

Nature protocols·2026
Same author

A complete human pancreatic cancer genome.

bioRxiv : the preprint server for biology·2026
Same author

Rapid phylogenomic analysis for viral surveillance and metagenomic profiling with Omni2Tree.

bioRxiv : the preprint server for biology·2026
Same author

A computational model for quantifying instability of tandem repeats across the genome.

bioRxiv : the preprint server for biology·2026
Same journal

A pore-facing glycan constrains GABA<sub>A</sub> receptor subunit stoichiometry and gating behavior.

Communications biology·2026
Same journal

Resorantel: a dual-targeting therapeutic with potent efficacy against Staphylococcus aureus with low potential for drug resistance.

Communications biology·2026
Same journal

Rise and subsequent fall in neuro-behavioral coupling during learning a skilled reaching task is revealed by generative AI.

Communications biology·2026
Same journal

Neural effects of expectation violation generalise across sensory modalities.

Communications biology·2026
Same journal

Contraction, recombination and innovation shape the dynamic pan-plastome of Astragalus sinicus.

Communications biology·2026
Same journal

Electric fields trigger ceramide-dependent vesicle budding and boost the generation of small extracellular vesicles.

Communications biology·2026
See all related articles

Related Experiment Video

Updated: Jun 10, 2025

Rare Event Detection Using Error-corrected DNA and RNA Sequencing
10:36

Rare Event Detection Using Error-corrected DNA and RNA Sequencing

Published on: August 3, 2018

12.0K

StratoMod: predicting sequencing and variant calling errors with interpretable machine learning.

Nathan Dwarshuis1, Peter Tonner2, Nathan D Olson2

  • 1Material Measurement Laboratory, National Institute of Standards and Technology, Gaithersburg, MD, USA. njd2@nist.gov.

Communications Biology
|October 13, 2024
PubMed
Summary
This summary is machine-generated.

StratoMod predicts germline variant calling errors using machine learning, aiding pipeline design. It identifies challenging genomic regions and missed clinically relevant variants, improving variant calling accuracy.

More Related Videos

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER
14:06

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Published on: June 23, 2012

15.2K
Targeted Next-generation Sequencing and Bioinformatics Pipeline to Evaluate Genetic Determinants of Constitutional Disease
09:34

Targeted Next-generation Sequencing and Bioinformatics Pipeline to Evaluate Genetic Determinants of Constitutional Disease

Published on: April 4, 2018

33.7K

Related Experiment Videos

Last Updated: Jun 10, 2025

Rare Event Detection Using Error-corrected DNA and RNA Sequencing
10:36

Rare Event Detection Using Error-corrected DNA and RNA Sequencing

Published on: August 3, 2018

12.0K
Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER
14:06

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Published on: June 23, 2012

15.2K
Targeted Next-generation Sequencing and Bioinformatics Pipeline to Evaluate Genetic Determinants of Constitutional Disease
09:34

Targeted Next-generation Sequencing and Bioinformatics Pipeline to Evaluate Genetic Determinants of Constitutional Disease

Published on: April 4, 2018

33.7K

Area of Science:

  • Genomics
  • Bioinformatics
  • Machine Learning

Background:

  • No single variant calling pipeline is optimal for the entire human genome.
  • Assessing pipeline tradeoffs currently relies on intuition rather than data.
  • Developers, clinicians, and researchers need better tools for pipeline design.

Purpose of the Study:

  • To present StratoMod, an interpretable machine-learning classifier to predict germline variant calling errors.
  • To provide a data-driven method for assessing tradeoffs in variant calling pipelines.
  • To identify genomic regions and factors contributing to variant calling errors.

Main Methods:

  • Developed StratoMod, an interpretable machine-learning classifier.
  • Utilized a draft benchmark based on the Q100 HG002 assembly for difficult regions.
  • Assessed the impact of mapping strategies (linear vs. graph-based references) on variant calling.
  • Quantified contributions of difficult-to-map and homopolymer regions to errors.

Main Results:

  • StratoMod accurately predicts recall for different sequencing platforms (Hifi, Illumina).
  • Identified specific difficult-to-map regions where graph-based methods show significant improvement.
  • Quantified the impact of mismapping on predicted recall.
  • Demonstrated StratoMod's ability to predict missed clinically relevant variants.

Conclusions:

  • StratoMod offers a data-driven approach to optimize variant calling pipelines.
  • Its interpretability allows for precise risk-reward analyses in pipeline design.
  • StratoMod improves upon existing methods by predicting missed variants, not just filtering false positives.