Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Genome-wide Association Studies-GWAS01:11

Genome-wide Association Studies-GWAS

14.8K
Genome-wide association studies or GWAS are used to identify whether common SNPs are associated with certain diseases. Suppose specific SNPs are more frequently observed in individuals with a particular disease than those without the disease. In that case, those SNPs are said to be associated with the disease. Chi-square analysis is performed to check the probability of the allele likely to be associated with the disease.
GWAS does not require the identification of the target gene involved in...
14.8K
Prediction Intervals01:03

Prediction Intervals

2.5K
The interval estimate of any variable is known as the prediction interval. It helps decide if a point estimate is dependable.
However, the point estimate is most likely not the exact value of the population parameter, but close to it. After calculating point estimates, we construct interval estimates, called confidence intervals or prediction intervals. This prediction interval comprises a range of values unlike the point estimate and is a better predictor of the observed sample value, y. 
2.5K
End Point Prediction: Gran Plot01:07

End Point Prediction: Gran Plot

814
A Gran plot is used to predict the equivalence volume or endpoint of a potentiometric or acid-base titration without reaching the endpoint. Typically, titration data is collected as a function of the titrant's volume up to a point less than the equivalence volume and then transformed into a linear format. The straight line is extended to the x-axis, indicating the necessary titrant volume to achieve the equivalence point.
For potentiometric titration, the Gran plot is created by plotting...
814
Selected Data About Geographic Locations01:25

Selected Data About Geographic Locations

123
Geographic Information Systems (GIS) rely on two core types of data: spatial data and attribute data.Spatial DataSpatial data defines the physical location of features within a coordinate system, typically expressed in terms of latitude and longitude. It provides precise positioning for elements like roads, rivers, or buildings.Attribute DataAttribute data complements spatial data by adding descriptive information about these features. For example, a road's spatial data includes its start and...
123
Genomics02:02

Genomics

38.4K
Genomics is the science of genomes: it is the study of all the genetic material of an organism. In humans, the genome consists of information carried in 23 pairs of chromosomes in the nucleus, as well as mitochondrial DNA. In genomics, both coding and non-coding DNA is sequenced and analyzed. Genomics allows a better understanding of all living things, their evolution, and their diversity. It has a myriad of uses: for example, to build phylogenetic trees, to improve productivity and...
38.4K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Self-reported perception of statistical literacy: Evidence from a National Survey of U.S. Adults.

PloS one·2026
Same author

A Novel Bioinformatics Pipeline and a Machine-Learning Approach for Antimicrobial Resistance Phenotypic Prediction.

Bioinformatics and biology insights·2026
Same author

Unpacking X (formerly Twitter) discourse on fluoride and related topics during the 2024 US presidential election.

Journal of the American Dental Association (1939)·2026
Same author

Functional characterization of the 9q34.13 locus identifies <i>RAPGEF1</i> as modulating risk for melanoma and nevi via RAS activation.

bioRxiv : the preprint server for biology·2026
Same author

Identification of immune cell type-specific susceptibility genes in multiple cancers using transcriptome-wide association studies.

Journal of the National Cancer Institute·2026
Same author

Effect of exercise training on clinical and physiological variables in adults with myotonic dystrophy type 1: a systematic review and meta-analysis.

Journal of neurology·2026
Same journal

Novel variants in LSS related hypotrichosis simplex 14.

Frontiers in genetics·2026
Same journal

Network-based analysis identifies shared mechanisms between ischemic stroke and myocardial infarction and therapeutic ingredients of Buyang Huanwu Decoction.

Frontiers in genetics·2026
Same journal

GWAS analysis of a depression cohort defined by an EHR-phenotyping algorithm reveals the role of immune regulations in depression risk.

Frontiers in genetics·2026
Same journal

Ferroptosis, lipid metabolism, and genetic regulation in postoperative rehabilitation of elderly hip fractures: from molecular mechanisms to clinical translation.

Frontiers in genetics·2026
Same journal

Single-cell and pseudobulk analyses reveal hidden mitochondrial expression imbalance in gastric cancer.

Frontiers in genetics·2026
Same journal

Transcriptomic profiling and experimental validation of myeloid-cell-differentiation-related key genes in osteoarthritis.

Frontiers in genetics·2026
See all related articles

Related Experiment Video

Updated: Nov 6, 2025

Identification and Classification of Position-specific GABAA Receptor Subunit Missense Variants for Their Role In Hippocampal Pyramidal Neurons
08:04

Identification and Classification of Position-specific GABAA Receptor Subunit Missense Variants for Their Role In Hippocampal Pyramidal Neurons

Published on: June 6, 2025

733

Metagenomic Geolocation Prediction Using an Adaptive Ensemble Classifier.

Samuel Anyaso-Samuel1, Archie Sachdeva1, Subharup Guha1

  • 1Department of Biostatistics, University of Florida, Gainesville, FL, United States.

Frontiers in Genetics
|May 7, 2021
PubMed
Summary
This summary is machine-generated.

Urban microbiome samples can predict geographic origin. An ensemble classifier combining multiple algorithms achieved optimal performance in identifying source cities for unknown metagenomic samples.

Keywords:
ensemble classifiergeolocationmachine learningmetagenomicsmicrobiome

More Related Videos

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations
08:03

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations

Published on: December 7, 2021

2.6K
Constructing and Visualizing Models using Mime-based Machine-learning Framework
06:19

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Published on: July 22, 2025

1.4K

Related Experiment Videos

Last Updated: Nov 6, 2025

Identification and Classification of Position-specific GABAA Receptor Subunit Missense Variants for Their Role In Hippocampal Pyramidal Neurons
08:04

Identification and Classification of Position-specific GABAA Receptor Subunit Missense Variants for Their Role In Hippocampal Pyramidal Neurons

Published on: June 6, 2025

733
Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations
08:03

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations

Published on: December 7, 2021

2.6K
Constructing and Visualizing Models using Mime-based Machine-learning Framework
06:19

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Published on: July 22, 2025

1.4K

Area of Science:

  • Environmental microbiology
  • Bioinformatics
  • Computational biology

Background:

  • Urban environments harbor distinct microbial communities.
  • Geographic location can be inferred from microbial signatures in environmental samples.

Purpose of the Study:

  • To predict the geographic origin of unknown microbiome samples using urban microbial signatures.
  • To evaluate the performance of various classification algorithms for metagenomic sample source prediction.

Main Methods:

  • Standard bioinformatics pipelines were used to pre-process raw metagenomic data.
  • Taxonomy-dependent and taxonomy-free approaches were employed to train component classifiers.
  • Class weighting and optimal oversampling techniques addressed data imbalance.
  • An ensemble classifier was developed by combining multiple component classifiers.

Main Results:

  • Component classifiers showed varied performance in predicting sample origins.
  • The ensemble classifier consistently delivered optimal prediction performance.
  • The ensemble approach matched the performance of the best-performing individual classifier.

Conclusions:

  • Relying on a single classification algorithm for metagenomic sample source prediction is unreliable.
  • Ensemble methods, by integrating multiple classifiers, enhance prediction accuracy and robustness.
  • Microbiome analysis of urban environments offers a viable strategy for geographic source attribution.