Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Introduction to Test of Independence01:21

Introduction to Test of Independence

2.4K
In statistics, the term independence means that one can directly obtain the probability of any event involving both variables by multiplying their individual probabilities. Tests of independence are chi-square tests involving the use of a contingency table of observed (data) values.
The test statistic for a test of independence is similar to that of a goodness-of-fit test:
2.4K
Degrees of Freedom01:02

Degrees of Freedom

3.4K
The degree of freedom for a particular statistical calculation is the number of values that are free to vary. Thus, the minimum number of independent numbers can specify a particular statistic. The degrees of freedom differ greatly depending on known and uncalculated statistical components.
For example, suppose there are three unknown numbers whose mean is 10; although we can freely assign values to the first and second numbers, the value of the last number can not be arbitrarily assigned.
3.4K
Correlation of Experimental Data01:23

Correlation of Experimental Data

269
Dimensional analysis simplifies complex physical problems and guides experimental investigations, but it does not provide complete solutions. It identifies the dimensionless groups that influence a phenomenon, but experimental data is needed to establish the specific relationships and validate theoretical predictions.
For example, a spherical particle moving through a viscous fluid experiences drag. Dimensional analysis shows that the drag force depends on the particle's diameter, velocity,...
269
Biostatistics: Overview01:20

Biostatistics: Overview

365
Biostatistics plays a crucial role in understanding and analyzing data in healthcare and biology. Biostatisticians conduct experiments, gather evidence, and draw meaningful conclusions using statistical methods and techniques. Different variables form the foundation of biostatistical analysis, allowing researchers to understand and interpret data effectively. These variables are classified into different types, each serving a specific purpose in statistical analysis.
Discrete variables are...
365
Variation01:19

Variation

7.2K
An important characteristic of any set of data is the variation in the data. In some data sets, the data values are concentrated closely near the mean; in other data sets, the data values are more widely spread out from the mean. The most common measure of variation, or spread, is the standard deviation, which is the square root of variance.
When independent and dependent variables are plotted on a scatter plot, the slope of a line is a value that describes the rate of change between the two...
7.2K
Hypothesis Test for Test of Independence01:16

Hypothesis Test for Test of Independence

3.7K
The test of independence is a chi-square-based test used to determine whether two variables or factors are independent or dependent. This hypothesis test is used to examine the independence of the variables. One can construct two qualitative survey questions or experiments based on the variables in a contingency table. The goal is to see if the two variables are unrelated (independent) or related (dependent). The null and alternative hypotheses for this test are:
H0: The two variables (factors)...
3.7K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Ancient DNA reveals pervasive directional selection across West Eurasia.

Nature·2026
Same author

SpatialFusion: A lightweight multimodal foundation model for pathway-informed spatial niche mapping.

bioRxiv : the preprint server for biology·2026
Same author

Detecting chromatin state alterations in PBMCs associated with Type 2 Diabetes Mellitus.

Communications medicine·2026
Same author

Multimodal framework for the joint analysis of single-cell RNA and T cell receptor sequencing data predicts T cell response to cancer immunotherapy.

Nature communications·2026
Same author

Functional dissection of complex trait variants at single-nucleotide resolution.

Nature·2026
Same author

Partially shared multi-modal embedding learns holistic representation of cell state.

Nature computational science·2026
Same journal

The TaMYB55-TaSnRK1α1-TabZIP9 module confers heat stress tolerance in wheat.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same journal

Superstatistics approach to turbulent circulation fluctuations.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same journal

A molecular timescale for evolution of cobamide biosynthesis.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same journal

Pierre Chambon, a pioneer of molecular biology and gene regulation in eukaryotes.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same journal

Granulosa cell glycogen fuels the avascular corpus luteum.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same journal

Synthetic essentiality of TRAIL/TNFSF10 in VHL-deficient renal cell carcinoma.

Proceedings of the National Academy of Sciences of the United States of America·2026
See all related articles

Related Experiment Video

Updated: Sep 10, 2025

Diagonal Method to Measure Synergy Among Any Number of Drugs
12:08

Diagonal Method to Measure Synergy Among Any Number of Drugs

Published on: June 21, 2018

18.8K

Efficiently quantifying dependence in massive scientific datasets using InterDependence Scores.

Adityanarayanan Radhakrishnan1,2, Yajit Jain1, Caroline Uhler1,3

  • 1Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA 02142.

Proceedings of the National Academy of Sciences of the United States of America
|August 20, 2025
PubMed
Summary
This summary is machine-generated.

We introduce the InterDependence Score (IDS), a new scalable method to find linear and nonlinear relationships in large scientific datasets. IDS efficiently uncovers hidden patterns in complex data, aiding scientific discovery.

Keywords:
deep learningfeature learningindependence testingsingle-cell transcriptomics

More Related Videos

Author Spotlight: Emerging Technologies and Advanced Tools for Decoding Metabolomics Data Analysis
07:11

Author Spotlight: Emerging Technologies and Advanced Tools for Decoding Metabolomics Data Analysis

Published on: November 10, 2023

2.7K
Quantification of Protein Interaction Network Dynamics using Multiplexed Co-Immunoprecipitation
07:57

Quantification of Protein Interaction Network Dynamics using Multiplexed Co-Immunoprecipitation

Published on: August 21, 2019

8.5K

Related Experiment Videos

Last Updated: Sep 10, 2025

Diagonal Method to Measure Synergy Among Any Number of Drugs
12:08

Diagonal Method to Measure Synergy Among Any Number of Drugs

Published on: June 21, 2018

18.8K
Author Spotlight: Emerging Technologies and Advanced Tools for Decoding Metabolomics Data Analysis
07:11

Author Spotlight: Emerging Technologies and Advanced Tools for Decoding Metabolomics Data Analysis

Published on: November 10, 2023

2.7K
Quantification of Protein Interaction Network Dynamics using Multiplexed Co-Immunoprecipitation
07:57

Quantification of Protein Interaction Network Dynamics using Multiplexed Co-Immunoprecipitation

Published on: August 21, 2019

8.5K

Area of Science:

  • Computational Biology
  • Data Science
  • Bioinformatics

Background:

  • Modern scientific datasets are massive, featuring millions of samples and tens of thousands of variables.
  • Existing dependence measures like Pearson correlation are limited to linear relationships and do not scale well.
  • Discovering complex, nonlinear dependencies is crucial for novel insights in large-scale data.

Purpose of the Study:

  • Introduce the InterDependence Score (IDS), a novel, scalable measure for quantifying both linear and nonlinear dependencies.
  • Develop an efficient algorithm for IDS computation suitable for high-dimensional, large-scale datasets.
  • Demonstrate IDS's utility in identifying key variables, topics, and biological relationships.

Main Methods:

  • IDS is inspired by dependence measures in infinite-dimensional Hilbert spaces, capturing all dependence types.
  • An efficient, linear-time algorithm leveraging neural network principles is employed for computation.
  • The algorithm is optimized for parallel processing on GPUs, enabling analysis of billions of variable pairs.

Main Results:

  • IDS successfully identifies relevant variables for predictive modeling tasks.
  • The method effectively extracts word sets representing topics from large document corpora.
  • IDS reveals gene sets associated with "gene-expression programs" in massive single-cell datasets.

Conclusions:

  • IDS offers a scalable and effective solution for detecting diverse dependencies in large scientific datasets.
  • Its speed and ability to capture nonlinear relationships make it a valuable tool for data exploration and insight generation.
  • IDS has broad applicability across various scientific domains dealing with high-dimensional data.