Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Cluster Sampling Method01:20

Cluster Sampling Method

Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...
Friedman Two-way Analysis of Variance by Ranks01:21

Friedman Two-way Analysis of Variance by Ranks

Friedman's Two-Way Analysis of Variance by Ranks is a nonparametric test designed to identify differences across multiple test attempts when traditional assumptions of normality and equal variances do not apply. Unlike conventional ANOVA, which requires normally distributed data with equal variances, Friedman's test is ideal for ordinal or non-normally distributed data, making it particularly useful for analyzing dependent samples, such as matched subjects over time or repeated measures from...
Pharmacokinetic Models: Comparison and Selection Criterion01:26

Pharmacokinetic Models: Comparison and Selection Criterion

Physiological and compartmental models are valuable tools used in studying biological systems. These models rely on differential equations to maintain mass balance within the system, ensuring an accurate representation of the dynamic processes at play.
Physiological models take a detailed approach by considering specific molecular processes. They can predict drug distribution, metabolism, and elimination changes, providing a comprehensive understanding of how drugs interact with the body.
Survival Tree01:19

Survival Tree

Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
 Building a Survival Tree
Constructing a survival tree begins...
One-Way ANOVA: Equal Sample Sizes01:15

One-Way ANOVA: Equal Sample Sizes

One-Way ANOVA can be performed on three or more samples with equal or unequal sample sizes. When one-way ANOVA is performed on two datasets with samples of equal sizes, it can be easily observed that the computed F statistic is highly sensitive to the sample mean.
Different sample means can result in different values for the variance estimate: variance between samples. This is because the variance between samples is calculated as the product of the sample size and the variance between the...
Quantifying and Rejecting Outliers: The Grubbs Test01:02

Quantifying and Rejecting Outliers: The Grubbs Test

Sometimes, a data set can have a recorded numerical observation that greatly  deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier.  To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This number is...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Statistics and AI - A Fireside Conversation.

Harvard data science review·2026
Same author

Incorporating external risk information with the Cox model under population heterogeneity: applications to trans-ancestry polygenic hazard scores.

Journal of the Royal Statistical Society. Series A, (Statistics in Society)·2026
Same author

The Immunological Landscape of the Tumor Microenvironment: Implications for Immunotherapy of Unresectable and Metastatic Soft Tissue Sarcomas.

Current treatment options in oncology·2026
Same author

The effects of 8 weeks of functional strength training and blood flow restriction training on lower limb muscle strength, maximal power, and movement quality in male sprinter college athletes.

Frontiers in physiology·2026
Same author

Multimodal Navigation Technology for Giant Choledochal Cyst Resection: A Precision Surgical Navigation Strategy.

Journal of gastrointestinal surgery : official journal of the Society for Surgery of the Alimentary Tract·2026
Same author

Does resistance training alone or in combination with aerobic training improve vascular function indices in adults with type 2 diabetes? A systematic review and meta-analysis of randomized controlled trials.

Frontiers in endocrinology·2026
Same journal

Fast penalized generalized estimating equations for large longitudinal functional datasets.

Biometrics·2026
Same journal

Causally-interpretable random-effects meta-analysis.

Biometrics·2026
Same journal

Statistical inference for mean function of partially observed functional time series.

Biometrics·2026
Same journal

Subgroup identification via Interaction Tree and Mixed Model for Repeated Measures with application to Alzheimer's disease.

Biometrics·2026
Same journal

Finite mixtures of linear quantile regressions with concomitant variables: a solution to endogeneity in longitudinal data modeling.

Biometrics·2026
Same journal

Discussion on "INTACT: a method for integration of longitudinal physical activity data from multiple sources" by Jingru Zhang, Erjia Cui, Hongzhe Li, and Haochang Shou.

Biometrics·2026
See all related articles

Related Experiment Video

Updated: Jun 18, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Pairwise variable selection for high-dimensional model-based clustering.

Jian Guo1, Elizaveta Levina, George Michailidis

  • 1Department of Statistics, University of Michigan, Ann Arbor, Michigan 48109, USA.

Biometrics
|November 17, 2009
PubMed
Summary
This summary is machine-generated.

This study introduces a new pairwise variable selection method for high-dimensional clustering. The approach enhances interpretability by identifying specific clusters separable by each variable, outperforming existing methods.

More Related Videos

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations
12:27

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data
05:12

ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data

Published on: January 16, 2019

Related Experiment Videos

Last Updated: Jun 18, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations
12:27

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data
05:12

ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data

Published on: January 16, 2019

Area of Science:

  • Statistics
  • Data Science
  • Machine Learning

Background:

  • Variable selection is crucial for high-dimensional data analysis, especially in model-based clustering.
  • Current methods often use a 'one-in-all-out' approach, lacking detailed cluster-specific variable information.
  • Identifying which clusters are separated by specific variables is important for deeper insights.

Purpose of the Study:

  • To propose a novel pairwise variable selection method for high-dimensional model-based clustering.
  • To address the limitation of existing methods in identifying cluster-specific variable separability.
  • To improve the interpretability of variable selection in clustering.

Main Methods:

  • Development of a new pairwise penalty for variable selection.
  • Application of the method to high-dimensional model-based clustering.
  • Comparison with existing variable selection techniques using ℓ(1) and ℓ(∞) penalties.

Main Results:

  • The proposed pairwise method demonstrates superior performance compared to alternative approaches.
  • The new method provides enhanced interpretability by specifying separable cluster pairs for each variable.
  • Effectiveness validated on both simulated and real-world datasets.

Conclusions:

  • The pairwise variable selection method offers a significant advancement for high-dimensional clustering.
  • It provides more granular insights into variable contributions to cluster separation.
  • The method enhances understanding and application of clustering in complex datasets.