Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Statistical Methods to Analyze Parametric Data: ANOVA01:12

Statistical Methods to Analyze Parametric Data: ANOVA

263
Analysis of Variance, or ANOVA, is a powerful statistical technique used to analyze parametric data, primarily in research and experimental studies. It's designed to compare the means of two or more groups, assisting researchers in identifying any significant differences between these group means. There are two main types of ANOVA based on the complexity of the analysis: one-way and two-way.
One-way ANOVA is applied when a single independent variable or factor is scrutinized. It compares...
263
Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data01:16

Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data

113
Statistical inference techniques, paramount in hypothesis testing, differentiate into two broad categories: parametric and nonparametric statistics.
Parametric statistics, as the name suggests, assumes that data follow a specific distribution, often a normal distribution. This assumption enables robust hypothesis testing and estimation. Parametric methods, like the Student's t-test or Goodness-of-fit test, are frequently employed in biostatistics due to their robustness. For instance,...
113
Friedman Two-way Analysis of Variance by Ranks01:21

Friedman Two-way Analysis of Variance by Ranks

130
Friedman's Two-Way Analysis of Variance by Ranks is a nonparametric test designed to identify differences across multiple test attempts when traditional assumptions of normality and equal variances do not apply. Unlike conventional ANOVA, which requires normally distributed data with equal variances, Friedman's test is ideal for ordinal or non-normally distributed data, making it particularly useful for analyzing dependent samples, such as matched subjects over time or repeated measures...
130
Introduction to Nonparametric Statistics01:28

Introduction to Nonparametric Statistics

648
Nonparametric statistics offer a powerful alternative to traditional parametric methods, useful when assumptions about the population distribution cannot be made. Unlike parametric tests, which require data to follow a specific distribution with well-defined parameters (such as the mean and standard deviation), nonparametric tests do not require such constraints. This makes them particularly valuable when dealing with small sample sizes, skewed data, or ordinal and categorical variables.
One of...
648
One-Way ANOVA01:18

One-Way ANOVA

7.8K
One-way ANOVA analyzes more than three samples categorized by one factor. For example, it can compare the average mileage of sports bikes. Here, the data is categorized by one factor - the company. However, one-way ANOVA cannot be used to simultaneously compare the sample mean of three or more samples categorized by two factors. An example of two factors would be sports bikes from different companies driven in different terrains, such as a desert or snowy landscape. Here, two-way ANOVA is used...
7.8K
Parametric Survival Analysis: Weibull and Exponential Methods01:14

Parametric Survival Analysis: Weibull and Exponential Methods

322
Parametric survival analysis models survival data by assuming a specific probability distribution for the time until an event occurs. The Weibull and exponential distributions are two of the most commonly used methods in this context, due to their versatility and relatively straightforward application.
Weibull Distribution
The Weibull distribution is a flexible model used in parametric survival analysis. It can handle both increasing and decreasing hazard rates, depending on its shape parameter...
322

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Robust discovery of mutational signatures using power posteriors.

PLoS computational biology·2026
Same author

Local graph estimation with pathwise false discovery control.

Nature communications·2026
Same author

Integrated path stability selection.

Journal of the American Statistical Association·2026
Same author

Reproducible parameter inference using bagged posteriors.

Electronic journal of statistics·2025
Same author

Model-based dimensionality reduction for single-cell RNA-seq using generalized bilinear models.

Biostatistics (Oxford, England)·2025
Same author

Group-wise normalization in differential abundance analysis of microbiome samples.

BMC bioinformatics·2025
Same journal

Probabilistic Joint and Individual Variation Explained (ProJIVE) for Data Integration.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2026
Same journal

fastkqr: A Fast Algorithm for Kernel Quantile Regression.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2026
Same journal

Empirical Bayes Covariance Decomposition, and a Solution to the Multiple Tuning Problem in Sparse PCA.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2026
Same journal

Joint Registration and Conformal Prediction for Partially Observed Functional Data.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2026
Same journal

Efficient Decision Trees for Tensor Regressions.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2026
Same journal

Distributed Nonparametric Regression with Heterogeneity Through Prediction-Based Aggregation.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2026
See all related articles

Related Experiment Video

Updated: May 24, 2025

Basics of Multivariate Analysis in Neuroimaging Data
06:35

Basics of Multivariate Analysis in Neuroimaging Data

Published on: July 24, 2010

16.8K

Principal variables analysis for non-Gaussian data.

Dylan Clark-Boucher1, Jeffrey W Miller1

  • 1Department of Biostatistics, Harvard University.

Journal of Computational and Graphical Statistics : a Joint Publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America
|March 5, 2025
PubMed
Summary
This summary is machine-generated.

Generalized Principal Variables Analysis (PVA) improves variable selection for non-Gaussian and ordinal data by using alternative correlations. This method enhances understanding of complex datasets like neurodegenerative disorders.

Keywords:
Variable selectionX-linked dystonia parkinsonismnon-normalityordinal data

More Related Videos

O-cresol Concentration Online Measurement Based On Near Infrared Spectroscopy Via Partial Least Square Regression
06:50

O-cresol Concentration Online Measurement Based On Near Infrared Spectroscopy Via Partial Least Square Regression

Published on: November 8, 2019

6.5K
Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach
04:35

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

3.3K

Related Experiment Videos

Last Updated: May 24, 2025

Basics of Multivariate Analysis in Neuroimaging Data
06:35

Basics of Multivariate Analysis in Neuroimaging Data

Published on: July 24, 2010

16.8K
O-cresol Concentration Online Measurement Based On Near Infrared Spectroscopy Via Partial Least Square Regression
06:50

O-cresol Concentration Online Measurement Based On Near Infrared Spectroscopy Via Partial Least Square Regression

Published on: November 8, 2019

6.5K
Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach
04:35

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

3.3K

Area of Science:

  • Statistics
  • Data Science
  • Bioinformatics

Background:

  • Principal Variables Analysis (PVA) is crucial for identifying informative variables in datasets.
  • Traditional PVA relies on Pearson correlation, which is suboptimal for non-Gaussian data.
  • A need exists for flexible PVA methods applicable to diverse data types.

Purpose of the Study:

  • To introduce a generalized PVA (GPVA) framework.
  • To evaluate the performance of GPVA using Spearman, Gaussian copula, and polychoric correlations.
  • To compare GPVA against traditional PVA on simulated and real-world data.

Main Methods:

  • Developed a generalized PVA framework accommodating various correlation measures.
  • Conducted simulation studies with varying multivariate distributions (continuous non-Gaussian, ordinal).
  • Applied GPVA to a clinical dataset of 102 variables from X-linked dystonia parkinsonism (XDP) patients.

Main Results:

  • GPVA with Gaussian copula or Spearman correlations significantly improved performance on continuous non-Gaussian data compared to Pearson correlation.
  • GPVA with polychoric correlations demonstrated superior performance on ordinal data.
  • Application to XDP data revealed distinct principal variable sets based on correlation choice, highlighting parkinsonism metrics.

Conclusions:

  • Generalized PVA offers a more robust approach for variable selection, especially with non-Gaussian and ordinal data.
  • The choice of correlation measure in GPVA critically influences the identified principal variables.
  • GPVA provides valuable insights into complex clinical datasets, as demonstrated in XDP research.