Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Diabetes Mellitus: Overview and Type I Subtype01:22

Diabetes Mellitus: Overview and Type I Subtype

5.7K
Diabetes mellitus is a chronic metabolic disorder characterized by high blood glucose levels due to inadequate insulin production, insulin resistance, or both. The condition affects millions worldwide and can significantly impact their health and quality of life.
Type 1 diabetes is an autoimmune disease in which the immune system mistakenly attacks and destroys the insulin-producing beta cells in the pancreas. As a result, the body is unable to produce sufficient insulin, and individuals with...
5.7K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Comparing Two Categorical Gini Correlations with Applications to Classification Problems.

Statistical papers (Berlin, Germany)·2026
Same author

Confidence estimation based on data from independent studies.

Statistical methods in medical research·2023
Same author

Pre-vaccination RT-PCR negative contacts in workplace settings show high, SARS COV-2 neutralizing antibody levels.

BMC public health·2022
Same journal

Turbulent flow in a vortex separator with a directed pipe inlet.

Scientific reports·2026
Same journal

Systematic characteristic evaluation of clay-based cementitious material derived from calcium carbide residue and waste tile powder.

Scientific reports·2026
Same journal

Retraction Note: Improvement of a rapid diagnostic application of monoclonal antibodies against avian influenza H7 subtype virus using Europium nanoparticles.

Scientific reports·2026
Same journal

Applying large language models to spam detection in the Kazakh low-resource language setting.

Scientific reports·2026
Same journal

An open-source 3D printing system enabling in-situ freeze-thaw processing of hydrogels.

Scientific reports·2026
Same journal

An enhanced EfficientNet framework for automated waste classification using cosine annealing and label smoothing.

Scientific reports·2026
See all related articles

Related Experiment Video

Updated: Mar 7, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

8.1K

A copula based supervised filter for feature selection in machine learning driven diabetes risk prediction.

Agnideep Aich1, Md Monzur Murshed2, Sameera Hewage3

  • 1Department of Mathematics, University of Louisiana at Lafayette, Lafayette, LA, USA. agnideep.aich1@louisiana.edu.

Scientific Reports
|March 5, 2026
PubMed
Summary
This summary is machine-generated.

This study introduces a novel Gumbel copula feature selection method that identifies extreme risk factors in patient data. It efficiently ranks predictors by their upper-tail dependence, outperforming standard methods on diabetes datasets.

Keywords:
Copula-based feature selectionDiabetesGumbel copulaMachine learningPublic healthRisk predictionSupervised feature selectionTail dependence

More Related Videos

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

1.4K
Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model
07:15

Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model

Published on: August 16, 2020

7.6K

Related Experiment Videos

Last Updated: Mar 7, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

8.1K
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

1.4K
Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model
07:15

Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model

Published on: August 16, 2020

7.6K

Area of Science:

  • Machine Learning
  • Biostatistics
  • Medical Informatics

Background:

  • Effective feature selection is crucial for interpretable predictive models in medicine.
  • Traditional methods may miss predictors important in extreme patient groups.
  • Identifying risk factors in the tails of data distributions is essential for targeted interventions.

Purpose of the Study:

  • To introduce a novel, computationally efficient supervised filter for feature selection.
  • To rank features based on their tendency to be extreme with the positive class using Gumbel copula upper-tail concordance.
  • To evaluate the proposed method against standard baselines on diabetes datasets.

Main Methods:

  • Developed a supervised filter leveraging Gumbel copula implied upper-tail concordance score.
  • Evaluated the filter against Mutual Information, mRMR, ReliefF, and L1/Elastic-Net.
  • Tested on two diabetes datasets (CDC and PIMA) using four classifiers.
  • Conducted statistical tests, permutation importance, and robustness checks.

Main Results:

  • The Gumbel-based selector was the fastest on the CDC dataset, reducing features by ≈52% with minimal performance trade-off.
  • It significantly outperformed Mutual Information and mRMR, and was comparable to ReliefF.
  • On the PIMA dataset, the ranking yielded the highest ROC-AUC numerically.
  • The method consistently identified clinically relevant predictors across both datasets.

Conclusions:

  • Feature selection via upper-tail dependence is an efficient and interpretable screening approach.
  • This method can effectively complement standard feature selection techniques.
  • It is particularly valuable for public health and clinical risk prediction, focusing on extreme patient strata.