Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Survival Tree01:19

Survival Tree

80
Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
 Building a Survival Tree
Constructing a...
80
Outliers and Influential Points01:08

Outliers and Influential Points

4.0K
An outlier is an observation of data that does not fit the rest of the data. It is sometimes called an extreme value. When you graph an outlier, it will appear not to fit the pattern of the graph. Some outliers are due to mistakes (for example, writing down 50 instead of 500), while others may indicate that something unusual is happening. Outliers are present far from the least squares line in the vertical direction. They have large "errors," where the "error" or residual is the...
4.0K
Types of Selection01:46

Types of Selection

40.4K
Natural selection influences the frequencies of particular alleles and phenotypes within populations in several different ways. Primarily, natural selection can be directional, stabilizing, or disruptive. Directional selection favors one extreme trait and shifts the population towards that phenotype while selecting against individuals displaying alternate traits. Stabilizing selection favors an intermediate trait with a narrow range of variation. Deviation from the optimal phenotype towards an...
40.4K
Frequency-dependent Selection01:21

Frequency-dependent Selection

22.0K
When the fitness of a trait is influenced by how common it is (i.e., its frequency) relative to different traits within a population, this is referred to as frequency-dependent selection. Frequency-dependent selection may occur between species or within a single species. This type of selection can either be positive—with more common phenotypes having higher fitness—or negative, with rarer phenotypes conferring increased fitness.
22.0K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

AKI-twinX: explainable organ structured digital twin for sepsis AKI trajectory forecasting.

medRxiv : the preprint server for health sciences·2026
Same author

CauReL: Dynamic Counterfactual Learning for Precision Drug Repurposing in Alzheimer's Disease.

Research square·2026
Same author

SpaIM: single-cell spatial transcriptomics imputation via style transfer.

Nature communications·2025
Same author

SpaIM: Single-cell Spatial Transcriptomics Imputation via Style Transfer.

bioRxiv : the preprint server for biology·2025
Same author

PINet: Privileged Information Improve the Interpretablity and generalization of structural MRI in Alzheimer's Disease.

ACM-BCB ... ... : the ... ACM Conference on Bioinformatics, Computational Biology and Biomedicine. ACM Conference on Bioinformatics, Computational Biology and Biomedicine·2024
Same author

ARDaC Common Data Model Facilitates Data Dissemination and Enables Data Commons for Modern Clinical Studies.

Studies in health technology and informatics·2024
See all related articles

Related Experiment Video

Updated: Jun 27, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.5K

Feature Selection for Unsupervised Machine Learning.

Huyunting Huang1, Ziyang Tang1, Tonglin Zhang1

  • 1Purdue University West Lafayette, Indiana.

IEEE International Conference on Smart Cloud
|May 6, 2024
PubMed
Summary
This summary is machine-generated.

This study introduces a stepwise feature selection method for unsupervised machine learning (ML) clustering, improving accuracy and efficiency for Gaussian mixture models (GMM) and k-means compared to using all features.

Keywords:
Gaussian mixture modeladjusted rand indexk-meansstepwise

More Related Videos

Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model
07:15

Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model

Published on: August 16, 2020

6.8K
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

696

Related Experiment Videos

Last Updated: Jun 27, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.5K
Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model
07:15

Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model

Published on: August 16, 2020

6.8K
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

696

Area of Science:

  • Computer Science
  • Machine Learning
  • Data Mining

Background:

  • Feature selection is less developed for unsupervised machine learning (ML) compared to supervised ML.
  • Clustering algorithms like Gaussian Mixture Models (GMM) and k-means often utilize all available features, potentially impacting performance.
  • There is a need for improved feature selection techniques in unsupervised learning to enhance clustering accuracy and efficiency.

Purpose of the Study:

  • To propose a stepwise feature selection approach for unsupervised clustering methods.
  • To adapt and improve Gaussian Mixture Models (GMM) and k-means algorithms through feature subset selection.
  • To evaluate the performance of the proposed method against existing approaches using simulations and real-world data.

Main Methods:

  • A stepwise feature selection strategy was developed for clustering.
  • The proposed method selects an optimal subset of features for GMM and k-means.
  • Gaussian Mixture Models (GMM) and k-means were potentially modified with improved initializations.

Main Results:

  • The proposed feature selection method demonstrated superior accuracy and computational efficiency compared to using all features.
  • Experiments using Monte Carlo simulations validated the effectiveness of the approach.
  • A real-world dataset analysis confirmed the findings from the simulations, highlighting practical applicability.

Conclusions:

  • The developed stepwise feature selection approach enhances the performance of GMM and k-means clustering.
  • Selecting relevant features improves computational efficiency and accuracy in unsupervised ML.
  • The findings suggest a practical improvement for applying GMM and k-means to complex datasets.