Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Uncertainty: Overview00:59

Uncertainty: Overview

1.6K
In analytical chemistry, we often perform repetitive measurements to detect and minimize inaccuracies caused by both determinate and indeterminate errors. Despite the cares we take, the presence of random errors means that repeated measurements almost never have exactly the same magnitude. The collective difference between these measurements - observed values - and the estimated or expected value is called uncertainty. Uncertainty is conventionally written after the estimated or expected value.
1.6K
Uncertainty: Confidence Intervals00:54

Uncertainty: Confidence Intervals

10.2K
The confidence interval is the range of values around the mean that contains the true mean. It is expressed as a probability percentage. The interpretation of a 95% confidence interval, for instance, is that the statistician is 95% confident that the true mean falls within the interval. The upper and lower limits of this range are known as confidence limits. The confidence limits for the true mean are estimated from the sample's mean, the standard deviation, and the statistical factor...
10.2K
Mechanistic Models: Compartment Models in Individual and Population Analysis01:23

Mechanistic Models: Compartment Models in Individual and Population Analysis

249
Mechanistic models are utilized in individual analysis using single-source data, but imperfections arise due to data collection errors, preventing perfect prediction of observed data. The mathematical equation involves known values (Xi), observed concentrations (Ci), measurement errors (εi), model parameters (ϕj), and the related function (ƒi) for i number of values. Different least-squares metrics quantify differences between predicted and observed values. The ordinary least...
249
Modern Molecular Taxonomy01:29

Modern Molecular Taxonomy

590
Advancements in molecular biology have revolutionized the identification and characterization of bacteria, with multiple methods leveraging DNA sequencing for enhanced precision. As sequencing technologies improve and costs decline, these approaches are increasingly used in clinical, environmental, and evolutionary studies.Multilocus Sequence Typing (MLST) examines several housekeeping genes, essential chromosomal genes encoding cellular functions, to distinguish strains. Approximately...
590
Uncertainty in Measurement: Accuracy and Precision03:37

Uncertainty in Measurement: Accuracy and Precision

99.8K
Scientists typically make repeated measurements of a quantity to ensure the quality of their findings and to evaluate both the precision and the accuracy of their results. Measurements are said to be precise if they yield very similar results when repeated in the same manner. A measurement is considered accurate if it yields a result that is very close to the true or the accepted value. Precise values agree with each other; accurate values agree with a true value. 
99.8K
Steps in Outbreak Investigation01:18

Steps in Outbreak Investigation

492
In the ever-evolving field of public health, statistical analysis serves as a cornerstone for understanding and managing disease outbreaks. By leveraging various statistical tools, health professionals can predict potential outbreaks, analyze ongoing situations, and devise effective responses to mitigate impact. For that to happen, there are a few possible stages of the analysis:
492

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Label-free 3D virtual histology of human formalin-fixed paraffin-embedded (FFPE) prostate needle biopsies with propagation-based phase-contrast micro-CT (PBCT).

bioRxiv : the preprint server for biology·2026
Same author

PCR bias impacts microbiome ecological analyses.

PLoS computational biology·2026
Same author

Scalable Inference for Bayesian Multinomial Logistic-Normal Dynamic Linear Models.

Proceedings of machine learning research·2026
Same author

Female Anterior Cruciate Ligaments Exhibit a Muted Mechanobiological Response to Mechanical Loading.

bioRxiv : the preprint server for biology·2025
Same author

Beyond Normalization: Incorporating Scale Uncertainty in Microbiome and Gene Expression Analysis.

bioRxiv : the preprint server for biology·2024
Same author

Addressing erroneous scale assumptions in microbe and gene set enrichment analysis.

PLoS computational biology·2023
Same journal

Layered social competition coordinates reproductive hierarchy formation in ants.

bioRxiv : the preprint server for biology·2026
Same journal

Combination epigenetic-targeted therapy increases the immunogenicity of poorly immunogenic sarcomas.

bioRxiv : the preprint server for biology·2026
Same journal

Loss of LanC-like proteins delays post-injury regeneration of aging skeletal muscles.

bioRxiv : the preprint server for biology·2026
Same journal

Integrative Transfer Network: Deep Transfer Learning Across Populations and Prediction Targets.

bioRxiv : the preprint server for biology·2026
Same journal

Confidence-supported label-free metabolic imaging with FPhaS phase autofluorescence microscopy.

bioRxiv : the preprint server for biology·2026
Same journal

Sequence-encoded autoinhibition couples mRNA decapping activity to phase separation.

bioRxiv : the preprint server for biology·2026
See all related articles

Related Experiment Video

Updated: Jan 16, 2026

Tick Microbiome Characterization by Next-Generation 16S rRNA Amplicon Sequencing
07:21

Tick Microbiome Characterization by Next-Generation 16S rRNA Amplicon Sequencing

Published on: August 25, 2018

13.4K

Uncertainty Modeling Outperforms Machine Learning for Microbiome Data Analysis.

Maxwell A Konnaris1, Manan Saxena2, Nicole Lazar3

  • 1Program in Bioinformatics and Genomics, Pennsylvania State University, University Park, PA, USA.

Biorxiv : the Preprint Server for Biology
|September 26, 2025
PubMed
Summary
This summary is machine-generated.

Microbiome sequencing lacks total microbial load data. Machine learning models fail to predict load accurately, but Bayesian methods offer a reliable solution for microbiome analysis.

Keywords:
16S rRNA-seqCovariate ShiftMachine LearningMetagenomicsMicrobiome DataPartially Identified ModelsScale Reliant InferenceSequence Count DataTotal Microbial LoadUncertainty Quantification

More Related Videos

Assisted Selection of Biomarkers by Linear Discriminant Analysis Effect Size LEfSe in Microbiome Data
04:57

Assisted Selection of Biomarkers by Linear Discriminant Analysis Effect Size LEfSe in Microbiome Data

Published on: May 16, 2022

17.3K
Microbiota Analysis Using Two-step PCR and Next-generation 16S rRNA Gene Sequencing
11:22

Microbiota Analysis Using Two-step PCR and Next-generation 16S rRNA Gene Sequencing

Published on: October 15, 2019

31.1K

Related Experiment Videos

Last Updated: Jan 16, 2026

Tick Microbiome Characterization by Next-Generation 16S rRNA Amplicon Sequencing
07:21

Tick Microbiome Characterization by Next-Generation 16S rRNA Amplicon Sequencing

Published on: August 25, 2018

13.4K
Assisted Selection of Biomarkers by Linear Discriminant Analysis Effect Size LEfSe in Microbiome Data
04:57

Assisted Selection of Biomarkers by Linear Discriminant Analysis Effect Size LEfSe in Microbiome Data

Published on: May 16, 2022

17.3K
Microbiota Analysis Using Two-step PCR and Next-generation 16S rRNA Gene Sequencing
11:22

Microbiota Analysis Using Two-step PCR and Next-generation 16S rRNA Gene Sequencing

Published on: October 15, 2019

31.1K

Area of Science:

  • Microbiology
  • Bioinformatics
  • Computational Biology

Background:

  • Microbiome sequencing quantifies relative, not absolute, microbial abundances.
  • Existing normalization methods rely on assumptions that can introduce bias.
  • Direct microbial load measurements are accurate but costly and infrequent.

Purpose of the Study:

  • To evaluate the efficacy of machine learning in predicting microbial load from sequencing data alone.
  • To assess the generalizability of machine learning models across diverse microbiome studies.
  • To compare machine learning approaches with alternative methods for handling microbial load uncertainty.

Main Methods:

  • Assembled 'mutt,' the largest database of paired sequencing and microbial load measurements (35 studies, >15,000 samples).
  • Evaluated published machine learning models on the 'mutt' database and benchmark datasets.
  • Implemented and compared Bayesian partially identified models for propagating scale uncertainty.

Main Results:

  • Machine learning models demonstrated poor generalization, performing worse than a naive baseline on average.
  • Model failures were attributed to covariate shift, limited shared taxa, compositional differences, and preprocessing variations.
  • Bayesian partially identified models consistently outperformed normalization and machine learning methods across 30 benchmark datasets.

Conclusions:

  • Machine learning approaches are unreliable for predicting microbial load from microbiome sequencing data.
  • Bayesian partially identified models provide a principled and reproducible method for accounting for scale uncertainty in microbiome inference.
  • The 'mutt' database serves as a valuable resource for evaluating microbiome analysis methods.