Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Types of Biopharmaceutical Studies: Controlled and Non-Controlled Approaches01:23

Types of Biopharmaceutical Studies: Controlled and Non-Controlled Approaches

380
Biopharmaceutical studies constitute a vital field aiming to enhance drug delivery methods and refine therapeutic approaches, drawing upon diverse interdisciplinary knowledge. In research methodologies, the choice between controlled and non-controlled studies significantly influences the study's reliability and accuracy.
Non-controlled studies, commonly employed for initial exploration, lack a control group, rendering them susceptible to biases and external influences. In contrast,...
380
Comparing the Survival Analysis of Two or More Groups01:20

Comparing the Survival Analysis of Two or More Groups

533
Survival analysis is a cornerstone of medical research, used to evaluate the time until an event of interest occurs, such as death, disease recurrence, or recovery. Unlike standard statistical methods, survival analysis is particularly adept at handling censored data—instances where the event has not occurred for some participants by the end of the study or remains unobserved. To address these unique challenges, specialized techniques like the Kaplan-Meier estimator, log-rank test, and...
533
Contingency Table01:29

Contingency Table

3.8K
A contingency table provides a way of portraying data that can facilitate calculating probabilities. It is a method of displaying a frequency distribution as a table with rows and columns to show how two variables may be dependent (contingent) upon each other; The table helps determine conditional probabilities quite quickly and can help systematically organize, analyze and quantify data. The table displays sample values concerning two variables that may be dependent or contingent on one...
3.8K
Study Design in Statistics01:15

Study Design in Statistics

9.9K
A study design is a set of techniques that allow a researcher to collect and analyze data from different variables defined for a specific research problem. Statistics is commonly for effective study design and more robust experiments,
Does aspirin reduce the risk of heart attacks? Is one brand of fertilizer more effective at growing roses than another? Is fatigue as dangerous to a driver as the influence of alcohol? Questions like these are answered using randomized experiments with proper...
9.9K
Bias in Epidemiological Studies01:29

Bias in Epidemiological Studies

1.2K
Biases can arise at various stages of research, from study design and data collection to analysis and interpretation. Recognizing and addressing these biases is essential to ensure the validity and reliability of epidemiological findings.Broadly speaking, biases in epidemiology fall into three main categories: selection bias, information bias, and confounding. A more detailed description of possible biases is:  
1.2K
Introduction to Test of Independence01:21

Introduction to Test of Independence

2.9K
In statistics, the term independence means that one can directly obtain the probability of any event involving both variables by multiplying their individual probabilities. Tests of independence are chi-square tests involving the use of a contingency table of observed (data) values.
The test statistic for a test of independence is similar to that of a goodness-of-fit test:
2.9K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Machine Learning-Guided Discovery of Bacterial-Selective Membrane-Active Compounds Reveals Mechanistic Bias in Antibiotic Training Datasets.

bioRxiv : the preprint server for biology·2026
Same author

Contrastive Dimension Reduction: A Systematic Review.

Wiley interdisciplinary reviews. Computational statistics·2026
Same author

Using image classifiers to predict CMT2A disease-relevant mitochondrial motility phenotypes in iPSC motor neurons.

bioRxiv : the preprint server for biology·2026
Same author

<i>k</i> -spaces: Mixtures of Gaussian latent variable models.

bioRxiv : the preprint server for biology·2025
Same author

CONTRASTIVE LINEAR REGRESSION.

The annals of applied statistics·2025
Same author

<i>SPARK</i>: <i>in silico</i> simulations for benchmarking nascent RNA sequencing experiments.

bioRxiv : the preprint server for biology·2025
Same journal

Inference on summaries of a model-agnostic longitudinal variable importance trajectory with application to suicide prevention.

The annals of applied statistics·2026
Same journal

A NOVEL BAYESIAN FRAMEWORK UNCOVERING BRAIN CONNECTIVITY-TO-SHAPE RELATIONSHIP IN PRECLINICAL ALZHEIMER'S DISEASE.

The annals of applied statistics·2026
Same journal

EVALUATING MULTIPLEX DIAGNOSTIC TEST USING PARTIALLY ORDERED BAYES CLASSIFIER.

The annals of applied statistics·2026
Same journal

BRIDGING THE GAP: ENHANCING THE GENERALIZABILITY OF EPIGENETIC CLOCKS THROUGH TRANSFER LEARNING.

The annals of applied statistics·2026
Same journal

TREATMENT EFFECT HETEROGENEITY AND IMPORTANCE MEASURES FOR MULTIVARIATE CONTINUOUS TREATMENTS.

The annals of applied statistics·2026
Same journal

FEDERATED LEARNING OF ROBUST INDIVIDUALIZED DECISION RULES WITH APPLICATION TO HETEROGENEOUS MULTIHOSPITAL SEPSIS POPULATION.

The annals of applied statistics·2026
See all related articles

Related Experiment Video

Updated: Jan 8, 2026

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments
08:12

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

2.9K

PROBABILISTIC CONTRASTIVE DIMENSION REDUCTION FOR CASE-CONTROL STUDY DATA.

Didong Li1, Andrew Jones2, Barbara Engelhardt3

  • 1Department of Biostatistics, University or North Carolina at Chapel Hill.

The Annals of Applied Statistics
|December 15, 2025
PubMed
Summary
This summary is machine-generated.

Probabilistic Contrastive Principal Component Analysis (PCPCA) is a new method for analyzing case-control data. It helps identify unique biological variations in disease cases compared to controls, improving genomic data analysis.

Keywords:
High-dimensional datacontrastive modelprobabilistic principal components analysis

More Related Videos

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index
06:55

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index

Published on: January 8, 2020

15.0K
Lexical Decision Task for Studying Written Word Recognition in Adults with and without Dementia or Mild Cognitive Impairment
06:48

Lexical Decision Task for Studying Written Word Recognition in Adults with and without Dementia or Mild Cognitive Impairment

Published on: June 25, 2019

9.7K

Related Experiment Videos

Last Updated: Jan 8, 2026

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments
08:12

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

2.9K
Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index
06:55

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index

Published on: January 8, 2020

15.0K
Lexical Decision Task for Studying Written Word Recognition in Adults with and without Dementia or Mild Cognitive Impairment
06:48

Lexical Decision Task for Studying Written Word Recognition in Adults with and without Dementia or Mild Cognitive Impairment

Published on: June 25, 2019

9.7K

Area of Science:

  • Genomics
  • Biostatistics
  • Computational Biology

Background:

  • Case-control studies are crucial for hypothesis testing in biological research.
  • Identifying variations unique to cases (e.g., disease patients) versus controls is a key challenge.
  • Existing methods may lack interpretability, uncertainty quantification, or robustness.

Purpose of the Study:

  • To introduce Probabilistic Contrastive Principal Component Analysis (PCPCA), a novel dimension reduction technique.
  • To develop a method specifically designed for analyzing case-control datasets.
  • To generalize and improve upon existing PCA and contrastive PCA methods.

Main Methods:

  • Developed PCPCA, a probabilistic dimension reduction model utilizing a contrastive likelihood for inference.
  • Demonstrated that PCPCA encompasses PCA, Probabilistic PCA, and Contrastive PCA as special cases.
  • Established theoretical and practical guidelines for parameter tuning.

Main Results:

  • PCPCA offers enhanced interpretability, uncertainty quantification, and principled inference compared to related methods.
  • The model exhibits robustness to noise and missing data.
  • PCPCA can generate 'foreground-enriched' data, highlighting case-specific variations.
  • Simulations and real-world genomic data (gene expression, protein expression, images) analyses confirm PCPCA's effectiveness.

Conclusions:

  • PCPCA is a powerful and versatile tool for analyzing case-control data, particularly in genomics.
  • The method provides significant advantages in identifying disease-specific variations.
  • PCPCA advances the analysis of complex biological data by offering improved interpretability and robustness.