Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Wilcoxon Signed-Ranks Test for Matched Pairs01:09

Wilcoxon Signed-Ranks Test for Matched Pairs

109
The Wilcoxon signed-rank test for matched pairs evaluates the null hypothesis by combining the ranks of differences with their signs. It essentially tests whether the median of the differences in a population of matched pairs is zero. Since the test incorporates more information than the sign test, it generally yields more trustable conclusions. This test also does not require the data to follow a normal distribution, but two conditions must be met for it to be applicable: (1) the data must...
109
Variability: Analysis01:11

Variability: Analysis

135
Measures of variability are statistical metrics that reveal the dispersion pattern within a dataset. They are pivotal in biostatistics, providing insights into the heterogeneity within health and biological data. Variability signifies the degree to which data points diverge from one another, helping researchers understand the potential range of values and associated uncertainty within the data.
The range is a simple measure of variability, indicating the difference between the highest and...
135
One-Way ANOVA: Equal Sample Sizes01:15

One-Way ANOVA: Equal Sample Sizes

3.2K
One-Way ANOVA can be performed on three or more samples with equal or unequal sample sizes. When one-way ANOVA is performed on two datasets with samples of equal sizes, it can be easily observed that the computed F statistic is highly sensitive to the sample mean.
Different sample means can result in different values for the variance estimate: variance between samples. This is because the variance between samples is calculated as the product of the sample size and the variance between the...
3.2K
Expected Frequencies in Goodness-of-Fit Tests01:19

Expected Frequencies in Goodness-of-Fit Tests

2.5K
A goodness-of-fit test is conducted to determine whether the observed frequency values are statistically similar to the frequencies expected for the dataset. Suppose the expected frequencies for a dataset are equal such as when predicting the frequency of any number appearing when casting a die. In that case, the expected frequency is the ratio of the total number of observations (n)  to the number of categories (k).
2.5K
Difference from Background: Limit of Detection01:05

Difference from Background: Limit of Detection

6.0K
The limit of detection (LOD) is the smallest amount of analyte that can be distinguished from the background noise. The LOD value corresponds to the concentration at which the analyte signal is three times larger than the standard deviation of the blank signal. Below this value, the analyte signal cannot be differentiated from the background noise. It is calculated by dividing the calibration slope by 3 times the standard deviation of the blank signals.
The LOD indicates the presence or absence...
6.0K
One-Way ANOVA: Unequal Sample Sizes01:15

One-Way ANOVA: Unequal Sample Sizes

5.7K
One-way ANOVA can be performed on three or more samples of unequal sizes. However, calculations get complicated when sample sizes are not always the same. So, while performing ANOVA with unequal samples size, the following equation is used:
5.7K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Developmentally dynamic, non-convergent transcriptomic profiles in CNV models for schizophrenia risk.

Molecular psychiatry·2026
Same author

Human brain prefrontal cortex proteomics identifies compromised energy metabolism and neuronal function in Schizophrenia.

Nature communications·2026
Same author

Keeping track of moving targets.

eLife·2025
Same author

Cell Type Specific Inference of Perturbations in Synaptic Communication with MultiNeuronChat.

bioRxiv : the preprint server for biology·2025
Same author

Genomic analyses reveal new insights into Alzheimer's disease.

medRxiv : the preprint server for health sciences·2025
Same author

Response to comment on "Identifying patterns differing between high-dimensional datasets with generalized contrastive PCA".

PLoS computational biology·2025
Same journal

Layered social competition coordinates reproductive hierarchy formation in ants.

bioRxiv : the preprint server for biology·2026
Same journal

Combination epigenetic-targeted therapy increases the immunogenicity of poorly immunogenic sarcomas.

bioRxiv : the preprint server for biology·2026
Same journal

Loss of LanC-like proteins delays post-injury regeneration of aging skeletal muscles.

bioRxiv : the preprint server for biology·2026
Same journal

Integrative Transfer Network: Deep Transfer Learning Across Populations and Prediction Targets.

bioRxiv : the preprint server for biology·2026
Same journal

Confidence-supported label-free metabolic imaging with FPhaS phase autofluorescence microscopy.

bioRxiv : the preprint server for biology·2026
Same journal

Sequence-encoded autoinhibition couples mRNA decapping activity to phase separation.

bioRxiv : the preprint server for biology·2026
See all related articles

Related Experiment Video

Updated: Jun 16, 2025

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data
14:27

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data

Published on: June 26, 2013

15.6K

Identifying patterns differing between high-dimensional datasets with generalized contrastive PCA.

Eliezyer Fermino de Oliveira1, Pranjal Garg2, Jens Hjerling-Leffler3

  • 1Dominick P. Purpura Department of Neuroscience, Albert Einstein College of Medicine, Bronx, NY.

Biorxiv : the Preprint Server for Biology
|August 16, 2024
PubMed
Summary
This summary is machine-generated.

Generalized contrastive PCA (gcPCA) offers a hyperparameter-free method for comparing high-dimensional biological datasets. This robust approach overcomes limitations of previous techniques, enabling new insights from complex biological data.

More Related Videos

A Method for Investigating Age-related Differences in the Functional Connectivity of Cognitive Control Networks Associated with Dimensional Change Card Sort Performance
09:01

A Method for Investigating Age-related Differences in the Functional Connectivity of Cognitive Control Networks Associated with Dimensional Change Card Sort Performance

Published on: May 7, 2014

10.2K
A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments
08:12

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

2.5K

Related Experiment Videos

Last Updated: Jun 16, 2025

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data
14:27

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data

Published on: June 26, 2013

15.6K
A Method for Investigating Age-related Differences in the Functional Connectivity of Cognitive Control Networks Associated with Dimensional Change Card Sort Performance
09:01

A Method for Investigating Age-related Differences in the Functional Connectivity of Cognitive Control Networks Associated with Dimensional Change Card Sort Performance

Published on: May 7, 2014

10.2K
A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments
08:12

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

2.5K

Area of Science:

  • Computational Biology
  • Bioinformatics
  • Data Science

Background:

  • High-dimensional biological data are increasingly common.
  • Comparing datasets from different conditions is crucial for biological insights.
  • Existing dimensionality reduction methods struggle with multi-dataset comparisons.

Purpose of the Study:

  • Introduce generalized contrastive PCA (gcPCA) as a hyperparameter-free solution.
  • Address limitations of traditional contrastive principal component analysis (cPCA).
  • Provide a flexible and symmetric method for comparing biological datasets.

Main Methods:

  • Developed generalized contrastive PCA (gcPCA) to eliminate hyperparameter tuning.
  • Provided theoretical analysis on why cPCA requires a hyperparameter and how gcPCA avoids it.
  • Created an open-source gcPCA toolbox with Python and MATLAB implementations.

Main Results:

  • gcPCA successfully analyzed diverse high-dimensional biological data.
  • Demonstrated unsupervised detection of hippocampal replay in neurophysiological recordings.
  • Revealed heterogeneity in type II diabetes using single-cell RNA sequencing data.

Conclusions:

  • gcPCA is a fast, robust, and user-friendly method for comparing high-dimensional datasets.
  • Facilitates deeper insights into complex biological phenomena.
  • Offers a valuable resource for biological data analysis.