Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Incidence and remission of endometriosis in Germany based on prevalence data from 35 million patients from the statutory health insurance.

BMC women's health·2026
Same author

DEVELOPMENT AND APPLICATION OF BRAIN TISSUE BASED MULTI-OMICS PROFILE SCORES FOR ALZHEIMER'S DISEASE.

Research square·2026
Same author

Autoantibodies as predictors for immune-related adverse events in checkpoint inhibition therapy of metastatic melanoma.

Journal for immunotherapy of cancer·2026
Same author

Socio-spatial characterization of sub-sewersheds for wastewater-based epidemiology (WBE): Developing and evaluating two estimators for population-related variables.

Spatial and spatio-temporal epidemiology·2025
Same author

Hierarchical modeling of risk factors with and without prior information-the process of regression model evaluation for an example of respiratory diseases in piglet production from daily practice data.

Frontiers in veterinary science·2025
Same author

Joint models in big data: simulation-based guidelines for required data quality in longitudinal electronic health records.

BioData mining·2025
Same journal

Comparison of Different Methods for the Meta-Analysis of Diagnostic Test Accuracy Studies-A Simulation Study.

Biometrical journal. Biometrische Zeitschrift·2026
Same journal

When to Adjust for Multiple Testing: A Unifying Guiding Principle.

Biometrical journal. Biometrische Zeitschrift·2026
Same journal

Ensuring Quality in Preclinical Research: The Importance of Being Human.

Biometrical journal. Biometrische Zeitschrift·2026
Same journal

Addressing Cluster-Level Treatment Effect Heterogeneity in Sample Size Determination for Hierarchical 2 × 2 Factorial Designs.

Biometrical journal. Biometrische Zeitschrift·2026
Same journal

A Multiple Imputation Approach to Distinguish Curative From Life-Prolonging Effects in the Presence of Missing Covariates.

Biometrical journal. Biometrische Zeitschrift·2026
Same journal

Tests for Categorical Data Beyond Pearson: A Distance Covariance and Energy Distance Approach.

Biometrical journal. Biometrische Zeitschrift·2026
See all related articles

Related Experiment Video

Updated: Feb 28, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

8.1K

Multivariate binary classification of imbalanced datasets-A case study based on high-dimensional multiplex autoimmune

Laura Schlieker1, Anna Telaar2, Angelika Lueking3

  • 1ClinStat GmbH, Max-Planck-Str. 22a, 50858 Cologne, formerly Protagen AG, Otto-Hahn-Str. 15, 44227, Dortmund, Germany.

Biometrical Journal. Biometrische Zeitschrift
|June 20, 2017
PubMed
Summary
This summary is machine-generated.

Class imbalance in medical datasets, common in rare diseases, poses challenges for accurate patient classification. This study suggests cost-sensitive learning may improve Random Forest performance for imbalanced data.

Keywords:
Cost-sensitive learningImbalanced dataPPLS-DARandom ForestsSampling

More Related Videos

Author Spotlight: Integrated Multi-Omics Analysis for Unveiling Multicellular Immune Signatures in Clinical Heart Attack Cohorts
08:51

Author Spotlight: Integrated Multi-Omics Analysis for Unveiling Multicellular Immune Signatures in Clinical Heart Attack Cohorts

Published on: September 20, 2024

2.2K
High-Throughput Automated Multiplex Immunofluorescence Assays for Translational Research
09:12

High-Throughput Automated Multiplex Immunofluorescence Assays for Translational Research

Published on: June 10, 2025

1.3K

Related Experiment Videos

Last Updated: Feb 28, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

8.1K
Author Spotlight: Integrated Multi-Omics Analysis for Unveiling Multicellular Immune Signatures in Clinical Heart Attack Cohorts
08:51

Author Spotlight: Integrated Multi-Omics Analysis for Unveiling Multicellular Immune Signatures in Clinical Heart Attack Cohorts

Published on: September 20, 2024

2.2K
High-Throughput Automated Multiplex Immunofluorescence Assays for Translational Research
09:12

High-Throughput Automated Multiplex Immunofluorescence Assays for Translational Research

Published on: June 10, 2025

1.3K

Area of Science:

  • Medical diagnostics and predictive medicine
  • Machine learning in healthcare
  • Biostatistics and data analysis

Background:

  • Accurate patient classification is crucial for diagnosis and treatment.
  • Class imbalance, where subgroups are small (e.g., rare diseases), complicates statistical classification.
  • High error rates in minority classes are a common issue in imbalanced datasets.

Purpose of the Study:

  • To investigate class imbalance effects on Random Forests and Powered Partial Least Squares Discriminant Analysis (PPLS-DA).
  • To evaluate classifiers combined with imbalance compensation methods (sampling, cost-sensitive learning).
  • To assess classifier performance using a scoring system on an autoimmune assay dataset.

Main Methods:

  • Case study using a high-dimensional multiplex autoimmune assay dataset (Rheumatoid Arthritis vs. Systemic Lupus Erythematodes).
  • Simulated class imbalance by reducing the majority class size.
  • Comparison of Random Forests and PPLS-DA with and without sampling and cost-sensitive learning.

Main Results:

  • Cost-sensitive learning approaches showed potential benefits for Random Forests in handling class imbalance.
  • Performance evaluation utilized a scoring system to analyze classification outcomes.
  • The study highlights the impact of imbalance on classifier error rates.

Conclusions:

  • Cost-sensitive learning may be a valuable strategy for improving Random Forest performance with imbalanced medical data.
  • Further research with diverse datasets and simulations is recommended.
  • This work emphasizes the importance of addressing class imbalance in medical data analysis.