Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Quantifying and Rejecting Outliers: The Grubbs Test

Quantifying and Rejecting Outliers: The Grubbs Test

Sometimes, a data set can have a recorded numerical observation that greatly deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier. To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...

Weighted Mean

Weighted Mean

While taking the arithmetic, geometric, or harmonic mean of a sample data set, equal importance is assigned to all the data points. However, all the values may not always be equally important in some data sets. An intrinsic bias might make it more important to give more weightage to specific values over others.
For example, consider the number of goals scored in the matches of a tournament. While computing the average number of goals scored in the tournament, it may be more important to...

Expected Frequencies in Goodness-of-Fit Tests

Expected Frequencies in Goodness-of-Fit Tests

A goodness-of-fit test is conducted to determine whether the observed frequency values are statistically similar to the frequencies expected for the dataset. Suppose the expected frequencies for a dataset are equal such as when predicting the frequency of any number appearing when casting a die. In that case, the expected frequency is the ratio of the total number of observations (n) to the number of categories (k).

Classification of Signals

Classification of Signals

In signal processing, signals are classified based on various characteristics: continuous-time versus discrete-time, periodic versus aperiodic, analog versus digital, and causal versus noncausal. Each category highlights distinct properties crucial for understanding and manipulating signals.
A continuous-time signal holds a value at every instant in time, representing information seamlessly. In contrast, a discrete-time signal holds values only at specific moments, often denoted as x(n), where...

Sensitivity, Specificity, and Predicted Value

Sensitivity, Specificity, and Predicted Value

In healthcare diagnostics, laboratory tests play a crucial role in identifying and diagnosing a wide range of medical conditions. However, interpreting test results is not always straightforward. An abnormal test result does not always confirm the presence of a disease, just as a normal result does not guarantee its absence. To assess the reliability of these diagnostic tools, healthcare practitioners rely on two key statistical indicators: sensitivity and specificity.
Sensitivity is the...

Frequency-dependent Selection

Frequency-dependent Selection

When the fitness of a trait is influenced by how common it is (i.e., its frequency) relative to different traits within a population, this is referred to as frequency-dependent selection. Frequency-dependent selection may occur between species or within a single species. This type of selection can either be positive—with more common phenotypes having higher fitness—or negative, with rarer phenotypes conferring increased fitness.

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Employing an immunoinformatics approach revealed potent multi-epitope based subunit vaccine for lymphocytic choriomeningitis virus.

Journal of infection and public health·2023

Same author

Microbial-induced calcium carbonate precipitation: Influencing factors, nucleation pathways, and application in waste water remediation.

The Science of the total environment·2022

Same author

Synergistic removal of nitrate by a cellulose-degrading and denitrifying strain through iron loaded corn cobs filled biofilm reactor at low C/N ratio: Capability, enhancement and microbiome analysis.

Bioresource technology·2022

Same author

Ring Expansion of Isatins <i>via</i> 1,2-Phospha-Brook Rearrangement: A Route to the Synthesis of 2-Quinolinone-Derived <i>p</i>-Quinone Methides.

The Journal of organic chemistry·2022

Same author

Production of a recyclable nanobiocatalyst to synthesize quinazolinone derivatives.

RSC advances·2022

Same author

Comparative Pan-Genomic Analysis Revealed an Improved Multi-Locus Sequence Typing Scheme for <i>Staphylococcus aureus</i>.

Genes·2022

Same journal

Novel Parent Survey Measures Sensory Behaviors Incorporating Sensory Modality and Stimulus Intensity.

Heliyon·2026

Same journal

Expression of concern: "SQSTM1/p62 promotes the progression of gastric cancer through epithelial-mesenchymal transition" [Heliyon 10 (2024) e24409].

Heliyon·2026

Same journal

Expression of concern: "TL1A promotes metastasis and EMT process of colorectal cancer" [Heliyon 10 (2024) e24392].

Heliyon·2026

Same journal

Expression of concern: "Factors affecting timing of surgery following neoadjuvant chemoradiation for esophageal cancer" [Heliyon 9 (2023) e23212].

Heliyon·2026

Same journal

Expression of concern: "On stratified single-valued soft topogenous structures" [Heliyon 10 (2024) e27926].

Heliyon·2026

Same journal

Expression of concern: "Artifact removal and motor imagery classification in EEG using advanced algorithms and modified DNN" [Heliyon 10 (2024) e27198].

Heliyon·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jun 10, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Feature selection via robust weighted score for high dimensional binary class-imbalanced gene expression data.

Zardad Khan¹, Amjad Ali¹, Saeed Aldahmani¹

¹Department of Statistics and Business Analytics, United Arab Emirates University, Al Ain, United Arab Emirates.

|October 14, 2024

Summary

This summary is machine-generated.

A new feature selection method, Robust Weighted Score for Unbalanced Data (ROWSU), effectively identifies key genes in imbalanced gene expression data. ROWSU improves classification accuracy by selecting discriminative features even with skewed distributions.

Keywords:

Features selection Gene expression data Robust score Support vectors Unbalanced class distribution

More Related Videos

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

Assisted Selection of Biomarkers by Linear Discriminant Analysis Effect Size LEfSe in Microbiome Data

Assisted Selection of Biomarkers by Linear Discriminant Analysis Effect Size LEfSe in Microbiome Data

Published on: May 16, 2022

Related Experiment Videos

Last Updated: Jun 10, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

Assisted Selection of Biomarkers by Linear Discriminant Analysis Effect Size LEfSe in Microbiome Data

Assisted Selection of Biomarkers by Linear Discriminant Analysis Effect Size LEfSe in Microbiome Data

Published on: May 16, 2022

Area of Science:

Bioinformatics
Computational Biology
Machine Learning in Genomics

Background:

High-dimensional gene expression data often presents class-imbalance challenges.
Skewed class distributions negatively impact the performance of classification algorithms.
Effective feature selection is crucial for accurate binary classification in genomics.

Purpose of the Study:

To propose a robust weighted score for unbalanced data (ROWSU) for feature selection.
To address the challenge of class imbalance in high-dimensional gene expression datasets.
To improve the performance of classification algorithms on imbalanced genomic data.

Main Methods:

Data balancing through synthetic minority over-sampling technique.
Greedy search approach for initial minimum gene subset selection.
Novel weighted robust score using support vector weights for gene refinement.

Main Results:

The ROWSU method successfully selects discriminative genes from imbalanced datasets.
Evaluated on 7 gene expression datasets, ROWSU demonstrated superior performance.
Outperformed existing methods in classification accuracy, sensitivity, and F1-score using kNN and RF classifiers.

Conclusions:

The proposed ROWSU method is effective for feature selection in imbalanced gene expression data.
ROWSU enhances classifier performance by selecting the most discriminative genes.
This approach offers a robust solution for binary classification problems in genomics with skewed data.