Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Introduction to Test of Independence01:21

Introduction to Test of Independence

2.2K
In statistics, the term independence means that one can directly obtain the probability of any event involving both variables by multiplying their individual probabilities. Tests of independence are chi-square tests involving the use of a contingency table of observed (data) values.
The test statistic for a test of independence is similar to that of a goodness-of-fit test:
2.2K
Hypothesis Test for Test of Independence01:16

Hypothesis Test for Test of Independence

3.5K
The test of independence is a chi-square-based test used to determine whether two variables or factors are independent or dependent. This hypothesis test is used to examine the independence of the variables. One can construct two qualitative survey questions or experiments based on the variables in a contingency table. The goal is to see if the two variables are unrelated (independent) or related (dependent). The null and alternative hypotheses for this test are:
H0: The two variables (factors)...
3.5K
Test for Homogeneity01:23

Test for Homogeneity

1.9K
The goodness–of–fit test can be used to decide whether a population fits a given distribution, but it will not suffice to decide whether two populations follow the same unknown distribution. A different test, called the test for homogeneity, can be used to conclude whether two populations have the same distribution. To calculate the test statistic for a test for homogeneity, follow the same procedure as with the test of independence. The hypotheses for the test for homogeneity can...
1.9K
One-Way ANOVA: Equal Sample Sizes01:15

One-Way ANOVA: Equal Sample Sizes

3.2K
One-Way ANOVA can be performed on three or more samples with equal or unequal sample sizes. When one-way ANOVA is performed on two datasets with samples of equal sizes, it can be easily observed that the computed F statistic is highly sensitive to the sample mean.
Different sample means can result in different values for the variance estimate: variance between samples. This is because the variance between samples is calculated as the product of the sample size and the variance between the...
3.2K
F Distribution01:19

F Distribution

3.7K
The F distribution was named after Sir Ronald Fisher, an English statistician. The F statistic is a ratio (a fraction) with two sets of degrees of freedom; one for the numerator and one for the denominator. The F distribution is derived from the Student's t distribution. The values of the F distribution are squares of the corresponding values of the t distribution. One-Way ANOVA expands the t test for comparing more than two groups. The scope of that derivation is beyond the level of this...
3.7K
One-Way ANOVA: Unequal Sample Sizes01:15

One-Way ANOVA: Unequal Sample Sizes

5.7K
One-way ANOVA can be performed on three or more samples of unequal sizes. However, calculations get complicated when sample sizes are not always the same. So, while performing ANOVA with unequal samples size, the following equation is used:
5.7K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Emergence of Ceftazidime-Avibactam-Induced KPC Variants (KPC-25/127) in Intracranial Infection and Implications for Clinical Management.

MicrobiologyOpen·2026
Same author

[Gut-brain axis and Parkinson's disease: research progress on the gut-brain interaction].

Sheng li xue bao : [Acta physiologica Sinica]·2026
Same author

Transient Potential Profiling for Rapid Calcium Ion Quantification: Eliminating Conditioning Time in Solid-Contact Ion-Selective Electrodes.

Biosensors·2026
Same author

Fragmentation property identity card-assisted reliable structural annotation of calycosin metabolites in vivo: A strategy for isomer discrimination using mass spectrometry.

Journal of chromatography. A·2026
Same author

Non-surgical treatment of severe trachea injury due to tracheal intubation-a case report.

AME case reports·2026
Same author

Maternal vitamin A deficiency programs offspring visceral hypersensitivity through RARβ-COX-2/PGE2 signaling.

Biochimica et biophysica acta. Molecular and cell biology of lipids·2026
Same journal

Instrumental Variable Estimation of Marginal Structural Mean Models for Time-Varying Treatment.

Journal of the American Statistical Association·2026
Same journal

Semiparametric Joint Modeling for Survival Analysis with Longitudinal Covariates.

Journal of the American Statistical Association·2026
Same journal

Dimension Reduction for Large-Scale Federated Data: Statistical Rate and Asymptotic Inference.

Journal of the American Statistical Association·2026
Same journal

Facilitating Heterogeneous Effect Estimation via Statistically Efficient Categorical Modifiers.

Journal of the American Statistical Association·2026
Same journal

Nonparametric Density Estimation of a Long-Term Trend from Repeated Semicontinuous Data.

Journal of the American Statistical Association·2026
Same journal

Functional Integrative Bayesian Analysis of High-dimensional Multiplatform Clinicogenomic Data.

Journal of the American Statistical Association·2026
See all related articles

Related Experiment Video

Updated: Jun 5, 2025

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments
08:12

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

2.4K

Asymptotic Distribution-Free Independence Test for High Dimension Data.

Zhanrui Cai1, Jing Lei2, Kathryn Roeder2

  • 1Faculty of Business and Economics, The University of Hong Kong.

Journal of the American Statistical Association
|December 9, 2024
PubMed
Summary
This summary is machine-generated.

We introduce a new framework for independence testing in high-dimensional data. This method leverages machine learning classifiers to detect sparse dependencies, offering a powerful tool for complex datasets.

Keywords:
permutationrank sum testsample splittingtest of independence

More Related Videos

Combined Immunofluorescence and DNA FISH on 3D-preserved Interphase Nuclei to Study Changes in 3D Nuclear Organization
13:55

Combined Immunofluorescence and DNA FISH on 3D-preserved Interphase Nuclei to Study Changes in 3D Nuclear Organization

Published on: February 3, 2013

18.3K
Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations
12:27

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

6.9K

Related Experiment Videos

Last Updated: Jun 5, 2025

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments
08:12

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

2.4K
Combined Immunofluorescence and DNA FISH on 3D-preserved Interphase Nuclei to Study Changes in 3D Nuclear Organization
13:55

Combined Immunofluorescence and DNA FISH on 3D-preserved Interphase Nuclei to Study Changes in 3D Nuclear Organization

Published on: February 3, 2013

18.3K
Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations
12:27

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

6.9K

Area of Science:

  • Statistics
  • Machine Learning
  • Data Science

Background:

  • Independence testing is crucial for variable selection, graphical models, and causal inference.
  • High-dimensional and sparse data present significant challenges for traditional independence tests due to a lack of distributional or structural assumptions.

Purpose of the Study:

  • To propose a general and robust framework for independence testing applicable to high-dimensional, complex data.
  • To develop a test statistic with a universal, fixed Gaussian null distribution, independent of the data distribution.

Main Methods:

  • A novel framework for independence testing by fitting a classifier to distinguish joint and product distributions.
  • Utilizing advanced classification algorithms from machine learning.
  • Employing a sample split and fixed permutation strategy to ensure a fixed Gaussian null distribution.

Main Results:

  • The proposed test demonstrates advantages over existing methods in extensive simulations.
  • The framework effectively handles high-dimensional and sparse data, outperforming current approaches.
  • Successful application to a single-cell sequencing dataset for testing independence between measurement types.

Conclusions:

  • The new framework offers a powerful and flexible approach to independence testing, particularly for complex, high-dimensional data.
  • The method's ability to leverage machine learning enhances its applicability in modern data analysis.
  • The universal null distribution simplifies interpretation and broadens the scope of application.