Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Test for Homogeneity01:23

Test for Homogeneity

2.1K
The goodness–of–fit test can be used to decide whether a population fits a given distribution, but it will not suffice to decide whether two populations follow the same unknown distribution. A different test, called the test for homogeneity, can be used to conclude whether two populations have the same distribution. To calculate the test statistic for a test for homogeneity, follow the same procedure as with the test of independence. The hypotheses for the test for homogeneity can...
2.1K
Comparing the Survival Analysis of Two or More Groups01:20

Comparing the Survival Analysis of Two or More Groups

309
Survival analysis is a cornerstone of medical research, used to evaluate the time until an event of interest occurs, such as death, disease recurrence, or recovery. Unlike standard statistical methods, survival analysis is particularly adept at handling censored data—instances where the event has not occurred for some participants by the end of the study or remains unobserved. To address these unique challenges, specialized techniques like the Kaplan-Meier estimator, log-rank test, and...
309
Friedman Two-way Analysis of Variance by Ranks01:21

Friedman Two-way Analysis of Variance by Ranks

311
Friedman's Two-Way Analysis of Variance by Ranks is a nonparametric test designed to identify differences across multiple test attempts when traditional assumptions of normality and equal variances do not apply. Unlike conventional ANOVA, which requires normally distributed data with equal variances, Friedman's test is ideal for ordinal or non-normally distributed data, making it particularly useful for analyzing dependent samples, such as matched subjects over time or repeated measures...
311
Quantifying and Rejecting Outliers: The Grubbs Test01:02

Quantifying and Rejecting Outliers: The Grubbs Test

2.3K
Sometimes, a data set can have a recorded numerical observation that greatly  deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier.  To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...
2.3K
Variability: Analysis01:11

Variability: Analysis

196
Measures of variability are statistical metrics that reveal the dispersion pattern within a dataset. They are pivotal in biostatistics, providing insights into the heterogeneity within health and biological data. Variability signifies the degree to which data points diverge from one another, helping researchers understand the potential range of values and associated uncertainty within the data.
The range is a simple measure of variability, indicating the difference between the highest and...
196
One-Way ANOVA: Unequal Sample Sizes01:15

One-Way ANOVA: Unequal Sample Sizes

5.9K
One-way ANOVA can be performed on three or more samples of unequal sizes. However, calculations get complicated when sample sizes are not always the same. So, while performing ANOVA with unequal samples size, the following equation is used:
5.9K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

An interpretable early intensive care unit model for predicting in-hospital mortality in trauma patients combined with rib fractures: development and external validation.

Journal of thoracic disease·2026
Same author

Association of endothelial activation and stress index with all-cause and cardiovascular mortality in patients with diabetic kidney disease: a population-based prospective study.

Diabetology & metabolic syndrome·2026
Same author

Aequorea Victoria-Inspired Covalent Anchoring Enables Water-Resistant and Scalable Room-Temperature Phosphorescence in Cellulose.

Advanced materials (Deerfield Beach, Fla.)·2026
Same author

An Improved DQN Framework with Dual Residual Horizontal Feature Pyramid for Autonomous Fault Diagnosis in Strong-Noise Scenarios.

Sensors (Basel, Switzerland)·2025
Same author

Antibacterial activity and mechanism of Cinnamomum loureirii bark oil against three foodborne and opportunistic pathogens.

Letters in applied microbiology·2025
Same author

Analysis of cross-platform health communication with a network approach.

Biometrics·2025
Same journal

Comparison of Different Methods for the Meta-Analysis of Diagnostic Test Accuracy Studies-A Simulation Study.

Biometrical journal. Biometrische Zeitschrift·2026
Same journal

When to Adjust for Multiple Testing: A Unifying Guiding Principle.

Biometrical journal. Biometrische Zeitschrift·2026
Same journal

Ensuring Quality in Preclinical Research: The Importance of Being Human.

Biometrical journal. Biometrische Zeitschrift·2026
Same journal

Addressing Cluster-Level Treatment Effect Heterogeneity in Sample Size Determination for Hierarchical 2 × 2 Factorial Designs.

Biometrical journal. Biometrische Zeitschrift·2026
Same journal

A Multiple Imputation Approach to Distinguish Curative From Life-Prolonging Effects in the Presence of Missing Covariates.

Biometrical journal. Biometrische Zeitschrift·2026
Same journal

Tests for Categorical Data Beyond Pearson: A Distance Covariance and Energy Distance Approach.

Biometrical journal. Biometrische Zeitschrift·2026
See all related articles

Related Experiment Video

Updated: Sep 24, 2025

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations
08:03

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations

Published on: December 7, 2021

2.4K

Regression-based heterogeneity analysis to identify overlapping subgroup structure in high-dimensional data.

Ziye Luo1, Xinyue Yao2, Yifan Sun1,3

  • 1School of Statistics, Renmin University of China, Beijing, P. R. China.

Biometrical Journal. Biometrische Zeitschrift
|May 7, 2022
PubMed
Summary
This summary is machine-generated.

This study introduces a new method for analyzing complex diseases by allowing samples to belong to multiple subgroups. This approach improves understanding of disease biology and enhances prediction accuracy.

Keywords:
heterogeneity analysishigh-dimensional dataoverlapping subgroup structurepenalizationregression

More Related Videos

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations
12:27

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

7.1K
Heterogeneity Mapping of Protein Expression in Tumors using Quantitative Immunofluorescence
07:54

Heterogeneity Mapping of Protein Expression in Tumors using Quantitative Immunofluorescence

Published on: October 25, 2011

18.8K

Related Experiment Videos

Last Updated: Sep 24, 2025

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations
08:03

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations

Published on: December 7, 2021

2.4K
Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations
12:27

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

7.1K
Heterogeneity Mapping of Protein Expression in Tumors using Quantitative Immunofluorescence
07:54

Heterogeneity Mapping of Protein Expression in Tumors using Quantitative Immunofluorescence

Published on: October 25, 2011

18.8K

Area of Science:

  • Biostatistics
  • Computational Biology
  • Genomics

Background:

  • Complex diseases exhibit significant heterogeneity, making analysis challenging.
  • Existing regression-based heterogeneity analyses often assume disjoint subgroups, limiting their applicability.
  • Real-world biological data frequently involves overlapping structures, such as genes with multiple functions.

Purpose of the Study:

  • To develop a novel regression-based heterogeneity analysis method that accommodates overlapping subgroups.
  • To address high-dimensional data and small sample size limitations in heterogeneity analysis.
  • To improve the understanding of disease biology by identifying complex subgroup structures.

Main Methods:

  • Introduced a subgroup membership vector for each sample to allow for overlapping memberships.
  • Developed a novel loss function incorporating an L0 norm penalty for membership vectors to handle small sample sizes.
  • Applied sparse penalization for regularized estimation and feature selection in high-dimensional data.

Main Results:

  • The proposed method demonstrated superior performance compared to existing approaches in extensive simulations.
  • Analysis of Cancer Cell Line Encyclopedia and The Cancer Genome Atlas lung cancer data revealed overlapping subgroup structures.
  • The approach showed favorable performance in prediction and stability on real-world biological datasets.

Conclusions:

  • The novel approach effectively identifies overlapping subgroup structures in complex diseases.
  • This method provides a more realistic and powerful tool for heterogeneity analysis in biology and medicine.
  • The findings suggest improved disease subtyping and personalized treatment strategies.