Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Quantifying and Rejecting Outliers: The Grubbs Test

Quantifying and Rejecting Outliers: The Grubbs Test

Sometimes, a data set can have a recorded numerical observation that greatly deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier. To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This number is...

DNA Microarrays

DNA Microarrays

Microarrays are high-throughput and relatively inexpensive assays that can be automated to analyze large quantities of data at a time. They are used in genome-wide studies to compare gene or protein expression under two varied conditions, such as healthy and diseased states. Microarrays consist of glass or silica slides on which probe molecules are covalently attached through surface functionalization. Most commonly, the slides are prepared through the chemisorption of silanes to silica...

Classification of Systems-I

Classification of Systems-I

Linearity is a system property characterized by a direct input-output relationship, combining homogeneity and additivity.
Homogeneity dictates that if an input x(t) is multiplied by a constant c, the output y(t) is multiplied by the same constant. Mathematically, this is expressed as:

Classification of Systems-II

Classification of Systems-II

Continuous-time systems have continuous input and output signals, with time measured continuously. These systems are generally defined by differential or algebraic equations. For instance, in an RC circuit, the relationship between input and output voltage is expressed through a differential equation derived from Ohm's law and the capacitor relation,

Classification of Signals

Classification of Signals

In signal processing, signals are classified based on various characteristics: continuous-time versus discrete-time, periodic versus aperiodic, analog versus digital, and causal versus noncausal. Each category highlights distinct properties crucial for understanding and manipulating signals.
A continuous-time signal holds a value at every instant in time, representing information seamlessly. In contrast, a discrete-time signal holds values only at specific moments, often denoted as x(n), where...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

PAC-Bayes Guarantees for Data-Adaptive Pairwise Learning.

Entropy (Basel, Switzerland)·2025

Same author

Optimization and Learning With Randomly Compressed Gradient Updates.

Neural computation·2023

Same author

Fractional norm regularization: learning with very few relevant features.

IEEE transactions on neural networks and learning systems·2014

Same author

A fast algorithm for robust mixtures in the presence of measurement errors.

IEEE transactions on neural networks·2010

Same journal

MCFST: Spatial domain identification method based on multi-view graph convolutional network and graph fusion network.

Bioinformatics (Oxford, England)·2026

Same journal

SpaBiT: Enhancing Spatial Transcriptomics Resolution via Bidirectional Attention Transformers.

Bioinformatics (Oxford, England)·2026

Same journal

EDEL: Enhancing Dense Retrievers for Curation of Biomedical Knowledge Bases.

Bioinformatics (Oxford, England)·2026

Same journal

Informative Relational Learning for Adverse Reaction Prediction with Enhanced Generalization to Novel Drugs.

Bioinformatics (Oxford, England)·2026

Same journal

An interpretable deep learning framework uncovers features governing CRISPR-Cas9 genome-editing efficiency.

Bioinformatics (Oxford, England)·2026

Same journal

3DICE: Interpretable 3D Cross-Modal Learning for Drug-Target Interaction Prediction and Large-Scale Drug Discovery.

Bioinformatics (Oxford, England)·2026

See all related articles

Search research articles

Related Experiment Video

Updated: May 14, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Classification of mislabelled microarrays using robust sparse logistic regression.

Jakramate Bootkrajang¹, Ata Kabán

¹School of Computer Science, University of Birmingham, Edgbaston, Birmingham B15 2TT, UK. J.Bootkrajang@cs.bham.ac.uk

Bioinformatics (Oxford, England)

|February 19, 2013

Summary

This summary is machine-generated.

This study introduces a novel method for detecting mislabelled microarray data by integrating label noise detection into sparse logistic regression classification. The approach accurately identifies mislabelled arrays and improves predictive performance, even with noisy data.

Related Experiment Videos

Last Updated: May 14, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Area of Science:

Bioinformatics
Machine Learning
Genomics

Background:

Microarray datasets are susceptible to labelling errors, compromising classifier reliability.
Existing methods for handling label noise in bioinformatics are limited and often require parameter tuning.
Accurate data labelling is crucial for reliable biological data analysis and inference.

Purpose of the Study:

To develop a robust method for detecting mislabelled arrays concurrently with learning a sparse logistic regression classifier.
To address the limitations of existing data cleansing methods by integrating label noise handling into the classification process.
To provide a computationally efficient and parameter-free approach for handling label noise.

Main Methods:

A novel label-noise robust extension of Bayesian logistic regression is formulated.
A label-flipping process is incorporated into the classifier to account for potential mislabelling.
Bayesian regularization is employed for automatic setting of the regularization parameter, avoiding cross-validation.

Main Results:

The proposed method effectively detects mislabelled arrays with high accuracy.
It significantly improves predictive performance on datasets with labelling errors.
The approach is effective in identifying marker genes and demonstrates robustness against label noise.

Conclusions:

The developed method offers a powerful solution for handling label noise in microarray data analysis.
It enhances the reliability of classification and marker gene identification in the presence of labelling errors.
The automatic regularization setting and integrated approach offer computational advantages over existing methods.