Gene set analysis using variance component tests.

Yen-Tsung Huang¹, Xihong Lin

¹Department of Epidemiology, Brown University, 121 South Main Street, Providence, RI 02912, USA.

BMC Bioinformatics

June 29, 2013

Summary

This summary is machine-generated.

We developed a new method, Test for the Effect of a Gene Set (TEGS), to analyze gene sets by accounting for gene correlations. TEGS improves statistical power in genomic research compared to existing methods like GSEA.

Area of Science:

Genomics
Statistical Genetics
Bioinformatics

Background:

Gene set analyses are crucial for understanding complex diseases driven by multiple genes.
Existing methods often overlook the inherent correlations among genes within functional sets.
Addressing gene correlation is key to enhancing statistical power in genomic research.

Purpose of the Study:

To develop a novel gene set analysis method that explicitly models correlations among genes.
To improve the statistical power of gene set analyses by incorporating gene interdependence.
To provide a more accurate approach for identifying biologically relevant gene sets.

Main Methods:

Utilized a multivariate linear regression model to analyze gene set effects, explicitly modeling gene correlations with a working covariance matrix.
Developed the Test for the Effect of a Gene Set (TEGS), a variance component test.
Calculated p-values using permutation and a scaled chi-square approximation.
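The two p-value routes listed above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the TEGS statistic from the paper: the quadratic form `s' W^{-1} s`, the function names `quad_stat` and `permutation_test`, the identity working covariance, and the simulated data are all hypothetical. The scaled chi-square step uses moment matching (choosing a scale `kappa` and degrees of freedom `nu` so that `kappa * chi2(nu)` has the same mean and variance as the null statistic), which is one common way to build such an approximation; whether the paper uses exactly this construction is an assumption.

```python
import numpy as np

def quad_stat(x, Y, W_inv):
    """Illustrative quadratic-form statistic s' W^{-1} s.

    s holds, for each gene, the cross-product between the centered
    exposure x and the centered expression column of Y.
    """
    xc = x - x.mean()
    Yc = Y - Y.mean(axis=0)
    s = Yc.T @ xc                      # one score per gene, shape (p,)
    return float(s @ W_inv @ s)

def permutation_test(x, Y, W, n_perm=999, seed=0):
    """Permutation p-value plus moment-matched scaled chi-square parameters."""
    rng = np.random.default_rng(seed)
    W_inv = np.linalg.inv(W)           # W is the working covariance (assumed invertible)
    q_obs = quad_stat(x, Y, W_inv)

    # Null distribution: shuffle the exposure labels, keep gene correlations intact.
    q_null = np.array([quad_stat(rng.permutation(x), Y, W_inv)
                       for _ in range(n_perm)])
    p_perm = (1 + np.sum(q_null >= q_obs)) / (n_perm + 1)

    # Moment matching: E[kappa*chi2_nu] = kappa*nu, Var = 2*kappa^2*nu.
    mean, var = q_null.mean(), q_null.var(ddof=1)
    kappa = var / (2 * mean)
    nu = 2 * mean ** 2 / var
    return q_obs, p_perm, kappa, nu

# Hypothetical data: 60 samples, 5 genes, binary exposure shifting every gene.
rng = np.random.default_rng(42)
n, p = 60, 5
x = np.repeat([0.0, 1.0], n // 2)
Y = rng.normal(size=(n, p)) + 0.8 * x[:, None]
W = np.eye(p)                          # simplest working covariance choice

q_obs, p_perm, kappa, nu = permutation_test(x, Y, W)
print(f"Q = {q_obs:.2f}, permutation p = {p_perm:.4f}, "
      f"kappa = {kappa:.2f}, nu = {nu:.2f}")
```

Given `kappa` and `nu`, an approximate p-value could be read from a chi-square survival function evaluated at `q_obs / kappa` with `nu` degrees of freedom, for example with `scipy.stats.chi2.sf`. In this sketch the moments are estimated from the permutation draws; an analytic approximation would derive them from the model instead.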

Main Results:

Simulations demonstrated that TEGS maintains the nominal Type I error rate across various choices of working covariance matrix.
Statistical power increased as the working covariance matrix more closely approximated the true covariance.
TEGS outperformed the global test and Gene Set Enrichment Analysis (GSEA) on both simulated data and a diabetes dataset.
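The Type I error finding can be illustrated in spirit with a small null simulation. The sketch below is not the paper's simulation design and does not implement TEGS: it uses a hypothetical quadratic-form statistic with a permutation p-value, an assumed exchangeable true covariance, and made-up sample sizes. It only shows how one might check that the rejection rate stays near the nominal level whether the working covariance `W` matches the true covariance or not.

```python
import numpy as np

def perm_pvalue(x, Y, W_inv, n_perm, rng):
    """Permutation p-value for the illustrative statistic s' W^{-1} s."""
    xc = x - x.mean()
    Yc = Y - Y.mean(axis=0)

    def q(v):
        s = Yc.T @ v                   # per-gene scores for exposure vector v
        return s @ W_inv @ s

    q_obs = q(xc)
    q_null = np.array([q(rng.permutation(xc)) for _ in range(n_perm)])
    return (1 + np.sum(q_null >= q_obs)) / (n_perm + 1)

def null_rejection_rate(W, true_cov, n=40, n_sim=200, n_perm=199,
                        alpha=0.05, seed=0):
    """Fraction of null datasets (no exposure effect) rejected at level alpha."""
    rng = np.random.default_rng(seed)
    W_inv = np.linalg.inv(W)
    p = true_cov.shape[0]
    x = np.repeat([0.0, 1.0], n // 2)  # balanced binary exposure
    rejections = 0
    for _ in range(n_sim):
        # Correlated genes, but expression does not depend on x.
        Y = rng.multivariate_normal(np.zeros(p), true_cov, size=n)
        if perm_pvalue(x, Y, W_inv, n_perm, rng) <= alpha:
            rejections += 1
    return rejections / n_sim

# Hypothetical setting: 4 genes with pairwise correlation 0.5.
p = 4
true_cov = 0.5 * np.ones((p, p)) + 0.5 * np.eye(p)

rate_identity = null_rejection_rate(np.eye(p), true_cov)   # misspecified W
rate_true = null_rejection_rate(true_cov, true_cov)        # correctly specified W
print(f"Null rejection rate, W = identity: {rate_identity:.3f}")
print(f"Null rejection rate, W = true covariance: {rate_true:.3f}")
```

Because the permutation null is valid for any fixed `W`, both rates should sit close to `alpha`; the choice of `W` would be expected to matter for power rather than for error control, which is consistent with the results summarized above.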

Conclusions:

Introduced TEGS, a gene set analysis method within a multivariate regression framework that models gene expression interdependence.
TEGS demonstrated superior performance over GSEA and the global test in simulation studies and a real-world diabetes dataset.
The developed method offers a more powerful approach for gene set analysis by accounting for gene correlations.