Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Single Nucleotide Polymorphisms-SNPs01:05

Single Nucleotide Polymorphisms-SNPs

15.3K
A single nucleotide polymorphism or SNP is a single nucleotide variation at a specific genomic position in a large population. It is the most prevalent type of sequence variation found in the human genome. Point mutations that occur in more than 1% of the population qualify as SNPs. These are present once every 1000 nucleotides on an average in the human genome. Replacement of a purine with another purine (A/G) or a pyrimidine with another pyrimidine (C/T) is known as a transition. In contrast,...
15.3K
Steps in Outbreak Investigation01:18

Steps in Outbreak Investigation

155
In the ever-evolving field of public health, statistical analysis serves as a cornerstone for understanding and managing disease outbreaks. By leveraging various statistical tools, health professionals can predict potential outbreaks, analyze ongoing situations, and devise effective responses to mitigate impact. For that to happen, there are a few possible stages of the analysis:
155
Genome-wide Association Studies-GWAS01:11

Genome-wide Association Studies-GWAS

13.6K
Genome-wide association studies or GWAS are used to identify whether common SNPs are associated with certain diseases. Suppose specific SNPs are more frequently observed in individuals with a particular disease than those without the disease. In that case, those SNPs are said to be associated with the disease. Chi-square analysis is performed to check the probability of the allele likely to be associated with the disease.
GWAS does not require the identification of the target gene involved in...
13.6K
Evolutionary Relationships through Genome Comparisons02:54

Evolutionary Relationships through Genome Comparisons

5.8K
Genome comparison is one of the excellent ways to interpret the evolutionary relationships between organisms. The basic principle of genome comparison is that if two species share a common feature, it is likely encoded by the DNA sequence conserved between both species. The advent of genome sequencing technologies in the late 20th century enabled scientists to understand the concept of conservation of domains between species and helped them to deduce evolutionary relationships across diverse...
5.8K
Classification of Illness01:17

Classification of Illness

7.6K
The meaning of illness is individualized to each person who experiences an alteration in health. In contrast, disease is a medical term indicating a pathological change in the structure and function of the body or mind. It is a condition that has specific symptoms and boundaries.
An illness is a response to a disease in which the person's level of functioning is changed compared with a previous level. The general classification of illness includes acute and chronic.
Acute illness is severe...
7.6K
Modern Molecular Taxonomy01:29

Modern Molecular Taxonomy

55
Advancements in molecular biology have revolutionized the identification and characterization of bacteria, with multiple methods leveraging DNA sequencing for enhanced precision. As sequencing technologies improve and costs decline, these approaches are increasingly used in clinical, environmental, and evolutionary studies.Multilocus Sequence Typing (MLST) examines several housekeeping genes, essential chromosomal genes encoding cellular functions, to distinguish strains. Approximately...
55

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Mining-impacted rice paddies select for Archaeal methylators and reveal a putative (Archaeal) regulator of mercury methylation.

ISME communications·2023
Same author

Long-Term Tillage and Crop Rotation Regimes Reshape Soil-Borne Oomycete Communities in Soybean, Corn, and Wheat Production Systems.

Plants (Basel, Switzerland)·2023
Same author

Viral spillover risk increases with climate change in High Arctic lake sediments.

Proceedings. Biological sciences·2022
Same author

Identifying the drivers of computationally detected correlated evolution among sites under antibiotic selection.

Evolutionary applications·2020
Same author

Swift evolutionary response of microbes to a rise in anthropogenic mercury in the Northern Hemisphere.

The ISME journal·2019
Same author

Viral Long-Term Evolutionary Strategies Favor Stability over Proliferation.

Viruses·2019
Same journal

Correction: Bulatov et al. Camelpox Virus in Western Kazakhstan: Assessment of the Role of Local Fauna as Reservoirs of Infection. <i>Viruses</i> 2024, <i>16</i>, 1626.

Viruses·2026
Same journal

Correction: Franco et al. Whole Blood Volume-Based Absolute Quantification of HTLV-1 Proviral Load: A Comparative Method Evaluation Study. <i>Viruses</i> 2026, <i>18</i>, 580.

Viruses·2026
Same journal

Correction: Medkour et al. Adenovirus Infections in African Humans and Wild Non-Human Primates: Great Diversity and Cross-Species Transmission. <i>Viruses</i> 2020, <i>12</i>, 657.

Viruses·2026
Same journal

Burden of Malaria and Dengue Across Global, Asian, and Chinese Populations Based on GBD 2021 Data: A Quantitative Assessment of Importation Risks to China.

Viruses·2026
Same journal

First Report of <i>Orthonairovirus songlingense</i> in <i>Haemaphysalis concinna</i> Ticks from Russia.

Viruses·2026
Same journal

Epidemiological and Virological Characteristics of H9N2 Avian Influenza Virus in Jiangsu Province, China, 2024.

Viruses·2026
See all related articles

Related Experiment Video

Updated: Jul 25, 2025

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data
14:27

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data

Published on: June 26, 2013

15.7K

Machine Learning Algorithms Associate Case Numbers with SARS-CoV-2 Variants Rather Than with Impactful Mutations.

Matthieu Vilain1, Stéphane Aris-Brosou1,2

  • 1Department of Biology, University of Ottawa, Ottawa, ON K1N 6N5, Canada.

Viruses
|June 28, 2023
PubMed
Summary
This summary is machine-generated.

Predicting SARS-CoV-2 cases using genomic data showed high accuracy but relied on variant identity, not specific mutations. This highlights the need for better data understanding and explainability in predictive modeling.

Keywords:
COVID-19SHapley Additive exPlanation (SHAP)biasfeedforward neural networkmachine learningrandom forest

More Related Videos

Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model
07:15

Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model

Published on: August 16, 2020

6.9K
In Vivo Functional Study of Disease-associated Rare Human Variants Using Drosophila
00:06

In Vivo Functional Study of Disease-associated Rare Human Variants Using Drosophila

Published on: August 20, 2019

13.7K

Related Experiment Videos

Last Updated: Jul 25, 2025

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data
14:27

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data

Published on: June 26, 2013

15.7K
Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model
07:15

Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model

Published on: August 16, 2020

6.9K
In Vivo Functional Study of Disease-associated Rare Human Variants Using Drosophila
00:06

In Vivo Functional Study of Disease-associated Rare Human Variants Using Drosophila

Published on: August 20, 2019

13.7K

Area of Science:

  • Epidemiology
  • Genomics
  • Machine Learning

Background:

  • COVID-19 pandemic spurred efforts to predict case numbers using epidemiological data.
  • Viral genomic information, including variant-specific virulence, was largely overlooked in predictive models.
  • The Alpha and Delta variants of SARS-CoV-2 co-circulated, offering an opportunity to study genomic impacts on case prediction.

Purpose of the Study:

  • To investigate whether viral genomic sequences can improve predictions of future SARS-CoV-2 case numbers.
  • To assess the predictive performance of machine learning models trained on genomic data.
  • To analyze the explainability of these models to understand their predictive basis.

Main Methods:

  • Implemented simple predictive models using genomic sequences of SARS-CoV-2 Alpha and Delta variants.
  • Encoded viral sequences and matched them with future case numbers based on collection dates.
  • Trained two algorithms: random forests and a feed-forward neural network.

Main Results:

  • Models achieved high prediction accuracies (≥93%).
  • Explainability analysis revealed models associated predictions with specific variants (Alpha, Delta), not virulence-linked mutations.
  • The models did not identify specific mutations correlating with increased case numbers.

Conclusions:

  • While genomic data can predict case numbers, models may rely on variant identity rather than specific virulence factors.
  • Explainability analysis is crucial to ensure predictive models are not misleading.
  • Further research is needed to understand the interplay between viral genomics and epidemiological predictions.