Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Quantifying and Rejecting Outliers: The Grubbs Test01:02

Quantifying and Rejecting Outliers: The Grubbs Test

3.4K
Sometimes, a data set can have a recorded numerical observation that greatly  deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier.  To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...
3.4K
Expected Frequencies in Goodness-of-Fit Tests01:19

Expected Frequencies in Goodness-of-Fit Tests

6.9K
A goodness-of-fit test is conducted to determine whether the observed frequency values are statistically similar to the frequencies expected for the dataset. Suppose the expected frequencies for a dataset are equal such as when predicting the frequency of any number appearing when casting a die. In that case, the expected frequency is the ratio of the total number of observations (n)  to the number of categories (k).
6.9K
Introduction to Test of Independence01:21

Introduction to Test of Independence

2.9K
In statistics, the term independence means that one can directly obtain the probability of any event involving both variables by multiplying their individual probabilities. Tests of independence are chi-square tests involving the use of a contingency table of observed (data) values.
The test statistic for a test of independence is similar to that of a goodness-of-fit test:
2.9K
Test for Homogeneity01:23

Test for Homogeneity

2.3K
The goodness–of–fit test can be used to decide whether a population fits a given distribution, but it will not suffice to decide whether two populations follow the same unknown distribution. A different test, called the test for homogeneity, can be used to conclude whether two populations have the same distribution. To calculate the test statistic for a test for homogeneity, follow the same procedure as with the test of independence. The hypotheses for the test for homogeneity can...
2.3K
Hypothesis Test for Test of Independence01:16

Hypothesis Test for Test of Independence

7.2K
The test of independence is a chi-square-based test used to determine whether two variables or factors are independent or dependent. This hypothesis test is used to examine the independence of the variables. One can construct two qualitative survey questions or experiments based on the variables in a contingency table. The goal is to see if the two variables are unrelated (independent) or related (dependent). The null and alternative hypotheses for this test are:
H0: The two variables (factors)...
7.2K
Fisher's Exact Test01:08

Fisher's Exact Test

1.1K
Fisher's exact test is a statistical significance test widely used to analyze 2x2 contingency tables, particularly in situations where sample sizes are small. Unlike the chi-squared test, which approximates P-values and assumes minimum expected frequencies of at least five in each cell, Fisher's exact test calculates the exact probability (P-value) of observing the data or more extreme results under the null hypothesis. This feature makes it especially valuable when the assumptions of...
1.1K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Persistence of large P. aeruginosa aggregates despite reduced airway burden in people with cystic fibrosis on CFTR modulator therapy.

Journal of cystic fibrosis : official journal of the European Cystic Fibrosis Society·2026
Same author

Urolithin A-Enhanced Multi-Bioactive Formulation Mitigates Cyclophosphamide-Induced Premature Ovarian Failure Through Suppression of Oxidative-Inflammatory Stress and Preservation of Follicle Fate.

Antioxidants (Basel, Switzerland)·2026
Same author

Cross-Compartment Virome Profiling in Human Immunodeficiency Virus Infection and Substance Use Disorder Reveals Brain-CSF-Periphery Discordance and Hepatitis B Virus in Central Nervous System.

International journal of molecular sciences·2026
Same author

The effect of <i>Staphylococcus aureus</i> on the persistence of <i>Pseudomonas aeruginosa</i> due to aggregation in cystic fibrosis airway infections.

iScience·2026
Same author

Sturgeon roe protein hydrolysate alleviates cyclophosphamide-induced ovarian dysfunction in mice with reduced ovarian oxidative-inflammatory injury, granulosa cell apoptosis, and HPO axis-related endocrine imbalance.

Food & function·2026
Same author

Assessment of active and inactive sacroiliitis in axial spondyloarthritis using DCE-MRI and ZOOMit-DWI parameters: A prospective study.

Medicine·2026
Same journal

HardFlow: Hard-Constrained Sampling for Flow-Matching Models Via Trajectory Optimization.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Industrial Brain: Self-Evolving Neuro-Symbolic Autonomy with Causal Resilience for Cyber-Physical Systems.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Adaptive Hardness-Driven Dictionary Distillation for Incomplete Streaming View Clustering.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Mixture of Global and Local Experts with Diffusion Transformer for Controllable Face Generation.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Task-KV: Task-aware KV Cache Optimization via Semantic Differentiation of Attention Heads.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Achieving Text-based Person Retrieval with Any Granularity.

IEEE transactions on pattern analysis and machine intelligence·2026
See all related articles

Related Experiment Video

Updated: Jan 1, 2026

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments
08:12

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

2.9K

Estimating Feature-Label Dependence Using Gini Distance Statistics.

Silu Zhang, Xin Dang, Dao Nguyen

    IEEE Transactions on Pattern Analysis and Machine Intelligence
    |December 24, 2019
    PubMed
    Summary
    This summary is machine-generated.

    This study introduces Gini distance for measuring statistical dependence between numerical features and categorical labels in machine learning. Gini distance offers a robust alternative to traditional methods, ensuring accurate independence detection and faster convergence.

    More Related Videos

    Quantification of Information Encoded by Gene Expression Levels During Lifespan Modulation Under Broad-range Dietary Restriction in C. elegans
    09:23

    Quantification of Information Encoded by Gene Expression Levels During Lifespan Modulation Under Broad-range Dietary Restriction in C. elegans

    Published on: August 16, 2017

    8.5K
    Combined Immunofluorescence and DNA FISH on 3D-preserved Interphase Nuclei to Study Changes in 3D Nuclear Organization
    13:55

    Combined Immunofluorescence and DNA FISH on 3D-preserved Interphase Nuclei to Study Changes in 3D Nuclear Organization

    Published on: February 3, 2013

    18.9K

    Related Experiment Videos

    Last Updated: Jan 1, 2026

    A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments
    08:12

    A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

    Published on: March 1, 2022

    2.9K
    Quantification of Information Encoded by Gene Expression Levels During Lifespan Modulation Under Broad-range Dietary Restriction in C. elegans
    09:23

    Quantification of Information Encoded by Gene Expression Levels During Lifespan Modulation Under Broad-range Dietary Restriction in C. elegans

    Published on: August 16, 2017

    8.5K
    Combined Immunofluorescence and DNA FISH on 3D-preserved Interphase Nuclei to Study Changes in 3D Nuclear Organization
    13:55

    Combined Immunofluorescence and DNA FISH on 3D-preserved Interphase Nuclei to Study Changes in 3D Nuclear Organization

    Published on: February 3, 2013

    18.9K

    Area of Science:

    • Machine Learning
    • Statistical Learning Theory
    • Data Science

    Background:

    • Supervised learning fundamentally relies on identifying statistical dependence between features and labels.
    • Existing methods like Pearson correlation do not fully characterize independence.
    • There is a need for robust dependence measures in machine learning.

    Purpose of the Study:

    • To present a novel framework for estimating dependence between numerical features and categorical labels.
    • To introduce Gini distance-based measures: Gini distance covariance and Gini distance correlation.
    • To demonstrate the advantages of Gini distance over existing methods.

    Main Methods:

    • Utilizing generalized Gini distance, an energy distance within reproducing kernel Hilbert spaces (RKHS).
    • Developing Gini distance covariance and Gini distance correlation as dependence measures.
    • Deriving uniform convergence bounds and asymptotic bounds for test statistics.

    Main Results:

    • Gini distance-based measures accurately define both dependence and independence.
    • Test statistics are computationally simple and bypass the need for probability density estimation.
    • Gini distance statistics exhibit faster uniform convergence than distance covariance statistics.
    • Tighter bounds on Type I and Type II errors are achieved with Gini distance.

    Conclusions:

    • The proposed Gini distance framework provides a superior method for assessing feature-label dependence in supervised learning.
    • Gini distance offers improved statistical properties, including faster convergence and more accurate error bounds.
    • The method demonstrates strong performance in experimental evaluations.