Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Introduction to Test of Independence

Introduction to Test of Independence

In statistics, the term independence means that one can directly obtain the probability of any event involving both variables by multiplying their individual probabilities. Tests of independence are chi-square tests involving the use of a contingency table of observed (data) values.
The test statistic for a test of independence is similar to that of a goodness-of-fit test:

Degrees of Freedom

Degrees of Freedom

The degree of freedom for a particular statistical calculation is the number of values that are free to vary. Thus, the minimum number of independent numbers can specify a particular statistic. The degrees of freedom differ greatly depending on known and uncalculated statistical components.
For example, suppose there are three unknown numbers whose mean is 10; although we can freely assign values to the first and second numbers, the value of the last number can not be arbitrarily assigned.

Correlation of Experimental Data

Correlation of Experimental Data

Dimensional analysis simplifies complex physical problems and guides experimental investigations, but it does not provide complete solutions. It identifies the dimensionless groups that influence a phenomenon, but experimental data is needed to establish the specific relationships and validate theoretical predictions.
For example, a spherical particle moving through a viscous fluid experiences drag. Dimensional analysis shows that the drag force depends on the particle's diameter, velocity,...

Biostatistics: Overview

Biostatistics: Overview

Biostatistics plays a crucial role in understanding and analyzing data in healthcare and biology. Biostatisticians conduct experiments, gather evidence, and draw meaningful conclusions using statistical methods and techniques. Different variables form the foundation of biostatistical analysis, allowing researchers to understand and interpret data effectively. These variables are classified into different types, each serving a specific purpose in statistical analysis.
Discrete variables are...

Variation

Variation

An important characteristic of any set of data is the variation in the data. In some data sets, the data values are concentrated closely near the mean; in other data sets, the data values are more widely spread out from the mean. The most common measure of variation, or spread, is the standard deviation, which is the square root of variance.
When independent and dependent variables are plotted on a scatter plot, the slope of a line is a value that describes the rate of change between the two...

Hypothesis Test for Test of Independence

Hypothesis Test for Test of Independence

The test of independence is a chi-square-based test used to determine whether two variables or factors are independent or dependent. This hypothesis test is used to examine the independence of the variables. One can construct two qualitative survey questions or experiments based on the variables in a contingency table. The goal is to see if the two variables are unrelated (independent) or related (dependent). The null and alternative hypotheses for this test are:
H0: The two variables (factors)...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Ancient DNA reveals pervasive directional selection across West Eurasia.

Nature·2026

Same author

SpatialFusion: A lightweight multimodal foundation model for pathway-informed spatial niche mapping.

bioRxiv : the preprint server for biology·2026

Same author

Detecting chromatin state alterations in PBMCs associated with Type 2 Diabetes Mellitus.

Communications medicine·2026

Same author

Multimodal framework for the joint analysis of single-cell RNA and T cell receptor sequencing data predicts T cell response to cancer immunotherapy.

Nature communications·2026

Same author

Functional dissection of complex trait variants at single-nucleotide resolution.

Nature·2026

Same author

Partially shared multi-modal embedding learns holistic representation of cell state.

Nature computational science·2026

Same journal

The TaMYB55-TaSnRK1α1-TabZIP9 module confers heat stress tolerance in wheat.

Proceedings of the National Academy of Sciences of the United States of America·2026

Same journal

Superstatistics approach to turbulent circulation fluctuations.

Proceedings of the National Academy of Sciences of the United States of America·2026

Same journal

A molecular timescale for evolution of cobamide biosynthesis.

Proceedings of the National Academy of Sciences of the United States of America·2026

Same journal

Pierre Chambon, a pioneer of molecular biology and gene regulation in eukaryotes.

Proceedings of the National Academy of Sciences of the United States of America·2026

Same journal

Granulosa cell glycogen fuels the avascular corpus luteum.

Proceedings of the National Academy of Sciences of the United States of America·2026

Same journal

Synthetic essentiality of TRAIL/TNFSF10 in VHL-deficient renal cell carcinoma.

Proceedings of the National Academy of Sciences of the United States of America·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Sep 10, 2025

Diagonal Method to Measure Synergy Among Any Number of Drugs

Diagonal Method to Measure Synergy Among Any Number of Drugs

Published on: June 21, 2018

Efficiently quantifying dependence in massive scientific datasets using InterDependence Scores.

Adityanarayanan Radhakrishnan^1,2, Yajit Jain¹, Caroline Uhler^1,3

¹Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA 02142.

Proceedings of the National Academy of Sciences of the United States of America

|August 20, 2025

Summary

This summary is machine-generated.

We introduce the InterDependence Score (IDS), a new scalable method to find linear and nonlinear relationships in large scientific datasets. IDS efficiently uncovers hidden patterns in complex data, aiding scientific discovery.

Keywords:

deep learning feature learning independence testing single-cell transcriptomics

More Related Videos

Author Spotlight: Emerging Technologies and Advanced Tools for Decoding Metabolomics Data Analysis

Author Spotlight: Emerging Technologies and Advanced Tools for Decoding Metabolomics Data Analysis

Published on: November 10, 2023

Quantification of Protein Interaction Network Dynamics using Multiplexed Co-Immunoprecipitation

Quantification of Protein Interaction Network Dynamics using Multiplexed Co-Immunoprecipitation

Published on: August 21, 2019

Related Experiment Videos

Last Updated: Sep 10, 2025

Diagonal Method to Measure Synergy Among Any Number of Drugs

Diagonal Method to Measure Synergy Among Any Number of Drugs

Published on: June 21, 2018

Author Spotlight: Emerging Technologies and Advanced Tools for Decoding Metabolomics Data Analysis

Author Spotlight: Emerging Technologies and Advanced Tools for Decoding Metabolomics Data Analysis

Published on: November 10, 2023

Quantification of Protein Interaction Network Dynamics using Multiplexed Co-Immunoprecipitation

Quantification of Protein Interaction Network Dynamics using Multiplexed Co-Immunoprecipitation

Published on: August 21, 2019

Area of Science:

Computational Biology
Data Science
Bioinformatics

Background:

Modern scientific datasets are massive, featuring millions of samples and tens of thousands of variables.
Existing dependence measures like Pearson correlation are limited to linear relationships and do not scale well.
Discovering complex, nonlinear dependencies is crucial for novel insights in large-scale data.

Purpose of the Study:

Introduce the InterDependence Score (IDS), a novel, scalable measure for quantifying both linear and nonlinear dependencies.
Develop an efficient algorithm for IDS computation suitable for high-dimensional, large-scale datasets.
Demonstrate IDS's utility in identifying key variables, topics, and biological relationships.

Main Methods:

IDS is inspired by dependence measures in infinite-dimensional Hilbert spaces, capturing all dependence types.
An efficient, linear-time algorithm leveraging neural network principles is employed for computation.
The algorithm is optimized for parallel processing on GPUs, enabling analysis of billions of variable pairs.

Main Results:

IDS successfully identifies relevant variables for predictive modeling tasks.
The method effectively extracts word sets representing topics from large document corpora.
IDS reveals gene sets associated with "gene-expression programs" in massive single-cell datasets.

Conclusions:

IDS offers a scalable and effective solution for detecting diverse dependencies in large scientific datasets.
Its speed and ability to capture nonlinear relationships make it a valuable tool for data exploration and insight generation.
IDS has broad applicability across various scientific domains dealing with high-dimensional data.