Asymptotic Distribution-Free Independence Test for High Dimension Data.

Zhanrui Cai¹, Jing Lei², Kathryn Roeder²

¹Faculty of Business and Economics, The University of Hong Kong.

Journal of the American Statistical Association

December 9, 2024

Summary

This summary is machine-generated.

We introduce a new framework for independence testing in high-dimensional data. This method leverages machine learning classifiers to detect sparse dependencies, offering a powerful tool for complex datasets.

Keywords:

permutation
rank sum test
sample splitting
test of independence

Area of Science:

Statistics
Machine Learning
Data Science

Background:

Independence testing is crucial for variable selection, graphical models, and causal inference.
When data are high-dimensional and the dependence signal is sparse, independence testing is very difficult without distributional or structural assumptions.

Purpose of the Study:

To propose a general and robust framework for independence testing applicable to high-dimensional, complex data.
To develop a test statistic whose null distribution is a universal, fixed Gaussian that does not depend on the underlying data distribution.

Main Methods:

Independence testing is framed as classification: a classifier is fitted to distinguish the joint distribution from the product of the marginal distributions.
Advanced classification algorithms from machine learning serve as the classifier.
A sample split is combined with a fixed permutation so that the test statistic has a fixed Gaussian null distribution.
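
The classifier-based idea can be illustrated with a small sketch. This is not the authors' implementation: a k-nearest-neighbour vote stands in for the machine learning classifier, the data are split into four disjoint blocks, and the classifier scores are compared with a standard Mann-Whitney rank sum statistic under its usual normal approximation. All function names and the block layout are hypothetical choices made for illustration, and the paper's own statistic and variance estimate may differ.

```python
import math
import random
from collections import Counter
from statistics import NormalDist


def joint_pairs(xs, ys):
    """Keep x_i with its own y_i: draws from the joint distribution."""
    return [x + y for x, y in zip(xs, ys)]


def shifted_pairs(xs, ys):
    """Pair x_i with y_{i+1} (a fixed cyclic shift) to mimic the product distribution."""
    m = len(xs)
    return [xs[i] + ys[(i + 1) % m] for i in range(m)]


def knn_score(train_pts, train_labels, point, k):
    """Fraction of the k nearest training points labelled 'joint' (stand-in classifier)."""
    nearest = sorted(range(len(train_pts)),
                     key=lambda i: math.dist(train_pts[i], point))[:k]
    return sum(train_labels[i] for i in nearest) / k


def rank_sum_z(a, b):
    """Standardized Mann-Whitney U for scores a versus b, with a tie correction."""
    m, n = len(a), len(b)
    u = sum((s > t) + 0.5 * (s == t) for s in a for t in b)
    total = m + n
    ties = sum(c**3 - c for c in Counter(a + b).values())
    var = m * n / 12 * ((total + 1) - ties / (total * (total - 1)))
    return 0.0 if var <= 0 else (u - m * n / 2) / math.sqrt(var)


def classifier_independence_test(xs, ys, k=15):
    """Return (z, one-sided p-value); xs and ys are equal-length lists of tuples."""
    q = len(xs) // 4  # assumption: four disjoint blocks, two to train and two to test
    blocks = [(xs[j * q:(j + 1) * q], ys[j * q:(j + 1) * q]) for j in range(4)]
    # Training half: label 1 for joint pairs, label 0 for shifted (product-like) pairs.
    train_pts = joint_pairs(*blocks[0]) + shifted_pairs(*blocks[1])
    train_labels = [1] * q + [0] * q
    # Test half: score held-out joint and shifted pairs with the fitted classifier.
    joint_scores = [knn_score(train_pts, train_labels, p, k)
                    for p in joint_pairs(*blocks[2])]
    product_scores = [knn_score(train_pts, train_labels, p, k)
                      for p in shifted_pairs(*blocks[3])]
    z = rank_sum_z(joint_scores, product_scores)
    return z, 1.0 - NormalDist().cdf(z)


# Toy data (made up for illustration): y_dep depends on x, y_ind does not.
rng = random.Random(0)
x = [(rng.gauss(0, 1),) for _ in range(400)]
y_dep = [(xi[0] + 0.3 * rng.gauss(0, 1),) for xi in x]
y_ind = [(rng.gauss(0, 1),) for _ in range(400)]
print(classifier_independence_test(x, y_dep))
print(classifier_independence_test(x, y_ind))
```

In this sketch the joint and shifted test pairs come from disjoint blocks, so under independence the two sets of scores are exchangeable and the usual rank sum approximation applies. That choice sacrifices sample size for simplicity. A large positive z suggests the classifier separates joint pairs from shifted pairs, which is evidence against independence.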

Main Results:

In extensive simulations, the proposed test shows advantages over existing methods.
The framework handles high-dimensional data with sparse dependence signals.
The test was applied to a single-cell sequencing dataset to test independence between measurement types.

Conclusions:

The new framework offers a powerful and flexible approach to independence testing, particularly for complex, high-dimensional data.
The method's ability to leverage machine learning enhances its applicability in modern data analysis.
The universal null distribution simplifies interpretation and broadens the scope of application.