Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Test for Homogeneity01:23

Test for Homogeneity

2.6K
The goodness–of–fit test can be used to decide whether a population fits a given distribution, but it will not suffice to decide whether two populations follow the same unknown distribution. A different test, called the test for homogeneity, can be used to conclude whether two populations have the same distribution. To calculate the test statistic for a test for homogeneity, follow the same procedure as with the test of independence. The hypotheses for the test for homogeneity can...
2.6K
One-Way ANOVA01:18

One-Way ANOVA

15.2K
One-way ANOVA analyzes more than three samples categorized by one factor. For example, it can compare the average mileage of sports bikes. Here, the data is categorized by one factor - the company. However, one-way ANOVA cannot be used to simultaneously compare the sample mean of three or more samples categorized by two factors. An example of two factors would be sports bikes from different companies driven in different terrains, such as a desert or snowy landscape. Here, two-way ANOVA is used...
15.2K
Variability: Analysis01:11

Variability: Analysis

782
Measures of variability are statistical metrics that reveal the dispersion pattern within a dataset. They are pivotal in biostatistics, providing insights into the heterogeneity within health and biological data. Variability signifies the degree to which data points diverge from one another, helping researchers understand the potential range of values and associated uncertainty within the data.
The range is a simple measure of variability, indicating the difference between the highest and...
782
Bioequivalence Data: Statistical Interpretation01:16

Bioequivalence Data: Statistical Interpretation

345
The statistical interpretation of bioequivalence data is a significant aspect of pharmaceutical research. Bioequivalence refers to the absence of any significant difference in the rate and extent to which the active ingredient in pharmaceutical products becomes available at the site of drug action when administered at the same molar dose under similar conditions. This helps determine if different drug products have similar absorption rates, ensuring their interchangeability.Statistical...
345
Friedman Two-way Analysis of Variance by Ranks01:21

Friedman Two-way Analysis of Variance by Ranks

604
Friedman's Two-Way Analysis of Variance by Ranks is a nonparametric test designed to identify differences across multiple test attempts when traditional assumptions of normality and equal variances do not apply. Unlike conventional ANOVA, which requires normally distributed data with equal variances, Friedman's test is ideal for ordinal or non-normally distributed data, making it particularly useful for analyzing dependent samples, such as matched subjects over time or repeated measures...
604
Comparing the Survival Analysis of Two or More Groups01:20

Comparing the Survival Analysis of Two or More Groups

725
Survival analysis is a cornerstone of medical research, used to evaluate the time until an event of interest occurs, such as death, disease recurrence, or recovery. Unlike standard statistical methods, survival analysis is particularly adept at handling censored data—instances where the event has not occurred for some participants by the end of the study or remains unobserved. To address these unique challenges, specialized techniques like the Kaplan-Meier estimator, log-rank test, and...
725

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Identifying Causal Genotype-Phenotype Relationships for Population-Sampled Parent-Child Trios.

Genetic epidemiology·2026
Same author

Coancestry superposed on admixed populations yields measures of relatedness at individual-level resolution.

PLoS computational biology·2025
Same author

Unearthing soil biodiversity through collaborative genomic research and education.

Nature genetics·2025
Same author

Insights into the Datasets, Tools, and Training Needs of the AnVIL Community: 2024.

bioRxiv : the preprint server for biology·2025
Same author

ipd: an R package for conducting inference on predicted data.

Bioinformatics (Oxford, England)·2025
Same author

Coancestry superposed on admixed populations yields measures of relatedness at individual-level resolution.

bioRxiv : the preprint server for biology·2025
Same journal

Genetic survey of biomarkers at early and mid-pregnancy identifies pregnancy-specialized immune regulation.

PLoS genetics·2026
Same journal

Argonaute proteins orchestrate Meiotic Sex Chromosome Inactivation and timing of the spermatogenic transcriptional program.

PLoS genetics·2026
Same journal

Genome wide association study meta-analysis of neuropathologic lesions of Alzheimer's disease and related dementias in a multi-site autopsy cohort.

PLoS genetics·2026
Same journal

Microtubule stiffening by the doublecortin-domain protein ZYG-8 contributes to mitotic spindle orientation during zygote division in Caenorhabditis elegans.

PLoS genetics·2026
Same journal

Multiple instance fine-mapping: Predicting causal regulatory variants with a deep sequence model.

PLoS genetics·2026
Same journal

Nuclear ubiquitin-conjugating enzyme TrUbc4 and F-box protein TrFwd1-mediated modification of Cre1 in Trichoderma reesei establishes a regulatory mechanism for carbon catabolite repression.

PLoS genetics·2026
See all related articles

Related Experiment Video

Updated: Apr 16, 2026

An Allele-specific Gene Expression Assay to Test the Functional Basis of Genetic Associations
10:17

An Allele-specific Gene Expression Assay to Test the Functional Basis of Genetic Associations

Published on: November 3, 2010

23.5K

Capturing heterogeneity in gene expression studies by surrogate variable analysis.

Jeffrey T Leek1, John D Storey

  • 1Department of Biostatistics, University of Washington, Seattle, Washington, USA.

Plos Genetics
|October 3, 2007
PubMed
Summary
This summary is machine-generated.

Heterogeneity in gene expression studies, caused by unmeasured factors, can reduce study power and introduce spurious signals. Surrogate variable analysis (SVA) addresses these issues, improving biological accuracy and reproducibility in genome-wide expression analyses.

More Related Videos

A Protocol for Using Gene Set Enrichment Analysis to Identify the Appropriate Animal Model for Translational Research
09:35

A Protocol for Using Gene Set Enrichment Analysis to Identify the Appropriate Animal Model for Translational Research

Published on: August 16, 2017

18.5K
Analyzing Multifactorial RNA-Seq Experiments with DiCoExpress
05:22

Analyzing Multifactorial RNA-Seq Experiments with DiCoExpress

Published on: July 29, 2022

4.1K

Related Experiment Videos

Last Updated: Apr 16, 2026

An Allele-specific Gene Expression Assay to Test the Functional Basis of Genetic Associations
10:17

An Allele-specific Gene Expression Assay to Test the Functional Basis of Genetic Associations

Published on: November 3, 2010

23.5K
A Protocol for Using Gene Set Enrichment Analysis to Identify the Appropriate Animal Model for Translational Research
09:35

A Protocol for Using Gene Set Enrichment Analysis to Identify the Appropriate Animal Model for Translational Research

Published on: August 16, 2017

18.5K
Analyzing Multifactorial RNA-Seq Experiments with DiCoExpress
05:22

Analyzing Multifactorial RNA-Seq Experiments with DiCoExpress

Published on: July 29, 2022

4.1K

Area of Science:

  • Genomics
  • Bioinformatics
  • Statistical Genetics

Background:

  • Gene expression levels are influenced by genetic, environmental, demographic, and technical factors.
  • Unaccounted sources of heterogeneity can significantly impact study results, reducing power and introducing spurious signals.
  • These issues persist even in well-designed, randomized studies.

Purpose of the Study:

  • To introduce a novel method, surrogate variable analysis (SVA), to address heterogeneity in gene expression studies.
  • To demonstrate the effectiveness of SVA in improving the accuracy and reproducibility of analyses.
  • To provide a tool applicable to various gene expression study designs.

Main Methods:

  • Development and application of surrogate variable analysis (SVA).
  • Integration of SVA with standard analysis techniques for gene expression data.
  • Testing SVA on disease class, time course, and genetics of gene expression studies.

Main Results:

  • Surrogate variable analysis (SVA) effectively captures unknown sources of variation in gene expression data.
  • Incorporating SVA mitigates the detrimental effects of heterogeneity, such as reduced power and spurious correlations.
  • SVA application led to increased biological accuracy and reproducibility across different study types.

Conclusions:

  • Surrogate variable analysis (SVA) is a robust method for handling unmeasured heterogeneity in gene expression studies.
  • SVA enhances the reliability and validity of findings in genome-wide expression analyses.
  • This approach is crucial for accurate interpretation of gene expression data in diverse research settings.