Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Statistical Methods to Analyze Parametric Data: ANOVA

Statistical Methods to Analyze Parametric Data: ANOVA

Analysis of Variance, or ANOVA, is a powerful statistical technique used to analyze parametric data, primarily in research and experimental studies. It's designed to compare the means of two or more groups, assisting researchers in identifying any significant differences between these group means. There are two main types of ANOVA based on the complexity of the analysis: one-way and two-way.
One-way ANOVA is applied when a single independent variable or factor is scrutinized. It compares...

Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data

Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data

Statistical inference techniques, paramount in hypothesis testing, differentiate into two broad categories: parametric and nonparametric statistics.
Parametric statistics, as the name suggests, assumes that data follow a specific distribution, often a normal distribution. This assumption enables robust hypothesis testing and estimation. Parametric methods, like the Student's t-test or Goodness-of-fit test, are frequently employed in biostatistics due to their robustness. For instance,...

Friedman Two-way Analysis of Variance by Ranks

Friedman Two-way Analysis of Variance by Ranks

Friedman's Two-Way Analysis of Variance by Ranks is a nonparametric test designed to identify differences across multiple test attempts when traditional assumptions of normality and equal variances do not apply. Unlike conventional ANOVA, which requires normally distributed data with equal variances, Friedman's test is ideal for ordinal or non-normally distributed data, making it particularly useful for analyzing dependent samples, such as matched subjects over time or repeated measures...

Introduction to Nonparametric Statistics

Introduction to Nonparametric Statistics

Nonparametric statistics offer a powerful alternative to traditional parametric methods, useful when assumptions about the population distribution cannot be made. Unlike parametric tests, which require data to follow a specific distribution with well-defined parameters (such as the mean and standard deviation), nonparametric tests do not require such constraints. This makes them particularly valuable when dealing with small sample sizes, skewed data, or ordinal and categorical variables.
One of...

One-Way ANOVA

One-Way ANOVA

One-way ANOVA analyzes more than three samples categorized by one factor. For example, it can compare the average mileage of sports bikes. Here, the data is categorized by one factor - the company. However, one-way ANOVA cannot be used to simultaneously compare the sample mean of three or more samples categorized by two factors. An example of two factors would be sports bikes from different companies driven in different terrains, such as a desert or snowy landscape. Here, two-way ANOVA is used...

Parametric Survival Analysis: Weibull and Exponential Methods

Parametric Survival Analysis: Weibull and Exponential Methods

Parametric survival analysis models survival data by assuming a specific probability distribution for the time until an event occurs. The Weibull and exponential distributions are two of the most commonly used methods in this context, due to their versatility and relatively straightforward application.
Weibull Distribution
The Weibull distribution is a flexible model used in parametric survival analysis. It can handle both increasing and decreasing hazard rates, depending on its shape parameter...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Robust discovery of mutational signatures using power posteriors.

PLoS computational biology·2026

Same author

Local graph estimation with pathwise false discovery control.

Nature communications·2026

Same author

Integrated path stability selection.

Journal of the American Statistical Association·2026

Same author

Reproducible parameter inference using bagged posteriors.

Electronic journal of statistics·2025

Same author

Model-based dimensionality reduction for single-cell RNA-seq using generalized bilinear models.

Biostatistics (Oxford, England)·2025

Same author

Group-wise normalization in differential abundance analysis of microbiome samples.

BMC bioinformatics·2025

Same journal

Probabilistic Joint and Individual Variation Explained (ProJIVE) for Data Integration.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2026

Same journal

fastkqr: A Fast Algorithm for Kernel Quantile Regression.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2026

Same journal

Empirical Bayes Covariance Decomposition, and a Solution to the Multiple Tuning Problem in Sparse PCA.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2026

Same journal

Joint Registration and Conformal Prediction for Partially Observed Functional Data.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2026

Same journal

Efficient Decision Trees for Tensor Regressions.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2026

Same journal

Distributed Nonparametric Regression with Heterogeneity Through Prediction-Based Aggregation.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2026

See all related articles

Search research articles

Related Experiment Video

Updated: May 24, 2025

Basics of Multivariate Analysis in Neuroimaging Data

Basics of Multivariate Analysis in Neuroimaging Data

Published on: July 24, 2010

Principal variables analysis for non-Gaussian data.

Dylan Clark-Boucher¹, Jeffrey W Miller¹

¹Department of Biostatistics, Harvard University.

Journal of Computational and Graphical Statistics : a Joint Publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America

|March 5, 2025

Summary

This summary is machine-generated.

Generalized Principal Variables Analysis (PVA) improves variable selection for non-Gaussian and ordinal data by using alternative correlations. This method enhances understanding of complex datasets like neurodegenerative disorders.

Keywords:

Variable selection X-linked dystonia parkinsonism non-normality ordinal data

More Related Videos

O-cresol Concentration Online Measurement Based On Near Infrared Spectroscopy Via Partial Least Square Regression

O-cresol Concentration Online Measurement Based On Near Infrared Spectroscopy Via Partial Least Square Regression

Published on: November 8, 2019

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

Related Experiment Videos

Last Updated: May 24, 2025

Basics of Multivariate Analysis in Neuroimaging Data

Basics of Multivariate Analysis in Neuroimaging Data

Published on: July 24, 2010

O-cresol Concentration Online Measurement Based On Near Infrared Spectroscopy Via Partial Least Square Regression

O-cresol Concentration Online Measurement Based On Near Infrared Spectroscopy Via Partial Least Square Regression

Published on: November 8, 2019

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

Area of Science:

Statistics
Data Science
Bioinformatics

Background:

Principal Variables Analysis (PVA) is crucial for identifying informative variables in datasets.
Traditional PVA relies on Pearson correlation, which is suboptimal for non-Gaussian data.
A need exists for flexible PVA methods applicable to diverse data types.

Purpose of the Study:

To introduce a generalized PVA (GPVA) framework.
To evaluate the performance of GPVA using Spearman, Gaussian copula, and polychoric correlations.
To compare GPVA against traditional PVA on simulated and real-world data.

Main Methods:

Developed a generalized PVA framework accommodating various correlation measures.
Conducted simulation studies with varying multivariate distributions (continuous non-Gaussian, ordinal).
Applied GPVA to a clinical dataset of 102 variables from X-linked dystonia parkinsonism (XDP) patients.

Main Results:

GPVA with Gaussian copula or Spearman correlations significantly improved performance on continuous non-Gaussian data compared to Pearson correlation.
GPVA with polychoric correlations demonstrated superior performance on ordinal data.
Application to XDP data revealed distinct principal variable sets based on correlation choice, highlighting parkinsonism metrics.

Conclusions:

Generalized PVA offers a more robust approach for variable selection, especially with non-Gaussian and ordinal data.
The choice of correlation measure in GPVA critically influences the identified principal variables.
GPVA provides valuable insights into complex clinical datasets, as demonstrated in XDP research.