Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

RNA-seq03:21

RNA-seq

9.9K
RNA sequencing, or RNA-Seq, is a high-throughput sequencing technology used to study the transcriptome of a cell. Transcriptomics helps to interpret the functional elements of a genome and identify the molecular constituents of an organism. Additionally, it also helps in understanding the development of an organism and the occurrence of diseases. 
Before the discovery of RNA-seq, microarray-based methods and Sanger sequencing were used for transcriptome analysis. However, while...
9.9K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Molecular cloning and antiviral function analysis of pigeon interferon lambda(piIFN-λ).

Poultry science·2026
Same author

Study on the Mechanism of Arsenic(V) Removal from Water Using Fe<sub>3</sub>O<sub>4</sub>-Sepiolite Composite Fibers.

Langmuir : the ACS journal of surfaces and colloids·2026
Same author

Ketogenic diet alleviates septic lung injury via microbial gut-lung axis.

Cell metabolism·2026
Same author

Novel duck reovirus: A comprehensive review of molecular pathogenesis and advances in diagnosis and vaccination.

Poultry science·2026
Same author

Population Genomics and Climate Change Vulnerability in Two Sympatric Desert Rodents.

Global change biology·2025
Same author

Effect of Oxidation on the Floatability and Surface Properties of Chalcopyrite.

ACS omega·2025
Same journal

Online label aggregation with incomplete crowd responses.

Journal of King Saud University. Computer and information sciences·2026
Same journal

Attention mechanism and mixup data augmentation for classification of COVID-19 Computed Tomography images.

Journal of King Saud University. Computer and information sciences·2024
Same journal

DeSa COVID-19: Deep salient COVID-19 image-based quality assessment.

Journal of King Saud University. Computer and information sciences·2024
Same journal

iVaccine-Deep: Prediction of COVID-19 mRNA vaccine degradation using deep learning.

Journal of King Saud University. Computer and information sciences·2024
Same journal

Lexical sorting centrality to distinguish spreading abilities of nodes in complex networks under the Susceptible-Infectious-Recovered (SIR) model.

Journal of King Saud University. Computer and information sciences·2024
Same journal

Pruning-based oversampling technique with smoothed bootstrap resampling for imbalanced clinical dataset of Covid-19.

Journal of King Saud University. Computer and information sciences·2024
See all related articles

Related Experiment Video

Updated: Jun 29, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.5K

CGUFS: A clustering-guided unsupervised feature selection algorithm for gene expression data.

Zhaozhao Xu1, Fangyuan Yang2, Hong Wang2

  • 1School of Computer Science and Technology, Henan Polytechnic University, Jiaozuo, Henan 454000, China.

Journal of King Saud University. Computer and Information Sciences
|April 3, 2024
PubMed
Summary
This summary is machine-generated.

This study introduces a novel clustering-guided unsupervised feature selection (CGUFS) algorithm for high-dimensional gene expression data. CGUFS effectively handles feature redundancy and determines optimal feature subsets, outperforming existing methods in classification accuracy.

Keywords:
Clustering-guidedGene expression dataSpectral clusteringUnsupervised feature selectionk-means

More Related Videos

ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data
05:12

ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data

Published on: January 16, 2019

11.4K
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

709

Related Experiment Videos

Last Updated: Jun 29, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.5K
ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data
05:12

ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data

Published on: January 16, 2019

11.4K
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

709

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Machine Learning

Background:

  • Gene expression data is high-dimensional with many irrelevant features, posing challenges for analysis.
  • Existing unsupervised feature selection methods often ignore feature redundancy and struggle with determining the optimal number of features.

Purpose of the Study:

  • To propose a clustering-guided unsupervised feature selection (CGUFS) algorithm for gene expression data.
  • To address limitations in existing algorithms regarding feature redundancy and optimal feature subset selection.

Main Methods:

  • Developed an adaptive k-value strategy for automatic cluster number determination.
  • Implemented a feature grouping strategy to manage highly redundant features.
  • Introduced an adaptive filtering strategy to select optimal feature combinations.

Main Results:

  • The CGUFS algorithm achieved average accuracy (ACC) of 74.37% and Matthews Correlation Coefficient (MCC) of 63.84% with the C4.5 classifier.
  • Significantly superior performance was observed with the Adaboost classifier compared to existing algorithms.
  • Statistical experiments confirmed significant differences in performance between CGUFS and existing methods.

Conclusions:

  • The proposed CGUFS algorithm effectively selects optimal features from high-dimensional gene expression data.
  • CGUFS demonstrates superior classification performance and addresses key limitations of current unsupervised feature selection techniques.