Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Significance Testing: Overview01:04

Significance Testing: Overview

3.3K
Significance testing is a set of statistical methods used to test whether a claim about a parameter is valid. In analytical chemistry, significance testing is used primarily to determine whether the difference between two values comes from determinate or random errors. The effect of a particular change in the measurement protocol, analyst, or sample itself can cause a deviation from the expected result. In the case of a suspected deviation/outlier, we need to be able to confirm mathematically...
3.3K
Comparing Copy Number Variations and SNPs02:26

Comparing Copy Number Variations and SNPs

16.8K
Sequencing of the human genome has opened up several best-kept secrets of the genome. Scientists have identified thousands of genome variations that exist within a population. These variations can be a single nucleotide or a larger chromosomal variation.
Copy number variations or CNVs are the structural variations that cover more than 1kb of DNA sequence. The single nucleotide polymorphism (SNP), on the other hand, is a single nucleotide change or a point mutation that is found in more than 1%...
16.8K
Variability: Analysis01:11

Variability: Analysis

116
Measures of variability are statistical metrics that reveal the dispersion pattern within a dataset. They are pivotal in biostatistics, providing insights into the heterogeneity within health and biological data. Variability signifies the degree to which data points diverge from one another, helping researchers understand the potential range of values and associated uncertainty within the data.
The range is a simple measure of variability, indicating the difference between the highest and...
116
Genetic Variation01:25

Genetic Variation

239
Genetic variation is the diversity in DNA sequences found among individuals of the same species. This diversity is crucial for a species' survival because it helps organisms adapt to environmental changes. Genetic variation begins with fertilization, where an egg and sperm cell merge. Each of these cells carries 23 chromosomes, up to 46 in the fertilized egg. Chromosomes are long DNA strands that contain genes, the basic units of heredity.
Genes exist in different versions called alleles,...
239
Genome-wide Association Studies-GWAS01:11

Genome-wide Association Studies-GWAS

12.1K
Genome-wide association studies or GWAS are used to identify whether common SNPs are associated with certain diseases. Suppose specific SNPs are more frequently observed in individuals with a particular disease than those without the disease. In that case, those SNPs are said to be associated with the disease. Chi-square analysis is performed to check the probability of the allele likely to be associated with the disease.
GWAS does not require the identification of the target gene involved in...
12.1K
One-Way ANOVA: Equal Sample Sizes01:15

One-Way ANOVA: Equal Sample Sizes

3.1K
One-Way ANOVA can be performed on three or more samples with equal or unequal sample sizes. When one-way ANOVA is performed on two datasets with samples of equal sizes, it can be easily observed that the computed F statistic is highly sensitive to the sample mean.
Different sample means can result in different values for the variance estimate: variance between samples. This is because the variance between samples is calculated as the product of the sample size and the variance between the...
3.1K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

16S rRNA Gene-Based Metagenomic Analysis of Rhizosphere Soil Bacteria in Arkansas Rice Crop Fields.

Agronomy (Basel, Switzerland)·2026
Same author

A Comment on: 'Towards a universal definition of 'domestication'' (2025), by Lord et al.

Philosophical transactions of the Royal Society of London. Series B, Biological sciences·2026
Same author

ZNF16 is a nucleolar-associated protein that regulates expression of rDNA and cancer-associated genes.

Biology open·2025
Same author

ZNF16 is a nucleolar-associated protein that regulates expression of the rDNA and cancer-associated genes.

bioRxiv : the preprint server for biology·2025
Same author

A Tracts of Homozygosity Approach Identifies Methylation-Regulated CSMD1 Expression Targets in Non-Small Cell Lung Cancers Related to Smoking Behavior.

Cancer epidemiology, biomarkers & prevention : a publication of the American Association for Cancer Research, cosponsored by the American Society of Preventive Oncology·2025
Same author

Pre-trained convolutional neural networks identify Parkinson's disease from spectrogram images of voice samples.

Scientific reports·2025
Same journal

OpenIMC: an open-source platform for analyzing single-cell and spatial proteomics by imaging mass cytometry.

BMC bioinformatics·2026
Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026
Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026
Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026
Same journal

Dual channel drug-drug interactions extraction based on cross attention.

BMC bioinformatics·2026
Same journal

FeSseqdb: a curated sequence-level database and interpretable machine learning framework for identifying iron-sulfur proteins.

BMC bioinformatics·2026
See all related articles

Related Experiment Video

Updated: May 13, 2025

Three Differential Expression Analysis Methods for RNA Sequencing: limma, EdgeR, DESeq2
10:10

Three Differential Expression Analysis Methods for RNA Sequencing: limma, EdgeR, DESeq2

Published on: September 18, 2021

36.4K

Improving data interpretability with new differential sample variance gene set tests.

Yasir Rahmatallah1, Galina Glazko2

  • 1Department of Biomedical Informatics, University of Arkansas for Medical Sciences, Little Rock, AR, 72205, USA. yrahmatallah@uams.edu.

BMC Bioinformatics
|April 14, 2025
PubMed
Summary
This summary is machine-generated.

New gene set analysis methods detect differential sample variance, uncovering biological insights missed by traditional mean-focused approaches. These methods improve interpretation of complex omics data, especially in heterogeneous phenotypes.

Keywords:
Anderson–DarlingCramer-Von MisesDifferential variabilityGene set analysisMinimum spanning tree

More Related Videos

Characterization of In Vitro Differentiation of Human Primary Keratinocytes by RNA-Seq Analysis
07:29

Characterization of In Vitro Differentiation of Human Primary Keratinocytes by RNA-Seq Analysis

Published on: May 16, 2020

6.0K
Development of Compendium for Esophageal Squamous Cell Carcinoma
03:36

Development of Compendium for Esophageal Squamous Cell Carcinoma

Published on: April 12, 2024

341

Related Experiment Videos

Last Updated: May 13, 2025

Three Differential Expression Analysis Methods for RNA Sequencing: limma, EdgeR, DESeq2
10:10

Three Differential Expression Analysis Methods for RNA Sequencing: limma, EdgeR, DESeq2

Published on: September 18, 2021

36.4K
Characterization of In Vitro Differentiation of Human Primary Keratinocytes by RNA-Seq Analysis
07:29

Characterization of In Vitro Differentiation of Human Primary Keratinocytes by RNA-Seq Analysis

Published on: May 16, 2020

6.0K
Development of Compendium for Esophageal Squamous Cell Carcinoma
03:36

Development of Compendium for Esophageal Squamous Cell Carcinoma

Published on: April 12, 2024

341

Area of Science:

  • Genomics
  • Bioinformatics
  • Systems Biology

Background:

  • Current gene set analysis methods primarily detect mean expression changes in omics data.
  • Methods for detecting variance changes are underexplored but crucial for understanding biological heterogeneity.
  • Existing approaches overlook gene sets with subgroup-specific changes within phenotypes, hindering biological interpretation.

Purpose of the Study:

  • To develop and validate multivariate sample-level variance analysis methods for gene set analysis.
  • To address the limitations of existing methods in detecting heterogeneous pattern changes.
  • To provide novel tools for interpreting omics data with complex biological differences.

Main Methods:

  • Generalized univariate statistics (Cramer-Von Mises, Anderson-Darling) into multivariate methods using minimum spanning tree ranking.
  • Developed methods to detect differential sample variance and mean.
  • Evaluated detection power and Type I error rates via simulations and real-world datasets.

Main Results:

  • Applied methods to gene expression datasets from acute lymphoblastic leukemia and colorectal polyps.
  • Demonstrated that variance analysis methods identify biologically relevant gene sets missed by mean-based methods.
  • Successfully detected hallmark gene sets associated with distinct phenotypes and subtypes.

Conclusions:

  • Methods detecting differential sample variance offer significant biological interpretations for heterogeneous data.
  • These new approaches are valuable for understanding complex signaling pathways and phenotype differences.
  • Software implementation (GSAR package) is available for gene expression and other omics data.