Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Quantifying and Rejecting Outliers: The Grubbs Test01:02

Quantifying and Rejecting Outliers: The Grubbs Test

1.7K
Sometimes, a data set can have a recorded numerical observation that greatly  deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier.  To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...
1.7K
Correlation and Regression00:53

Correlation and Regression

1.3K
In statistics, correlation describes the degree of association between two variables. In the subfield of linear regression, correlation is mathematically expressed by the correlation coefficient, which describes the strength and direction of the relationship between two variables. The coefficient is symbolically represented by 'r' and ranges from -1 to +1. A positive value indicates a positive correlation where the two variables move in the same direction. A negative value suggests a...
1.3K
Correlation of Experimental Data01:23

Correlation of Experimental Data

258
Dimensional analysis simplifies complex physical problems and guides experimental investigations, but it does not provide complete solutions. It identifies the dimensionless groups that influence a phenomenon, but experimental data is needed to establish the specific relationships and validate theoretical predictions.
For example, a spherical particle moving through a viscous fluid experiences drag. Dimensional analysis shows that the drag force depends on the particle's diameter, velocity,...
258
Coefficient of Correlation01:12

Coefficient of Correlation

6.3K
The correlation coefficient, r, developed by Karl Pearson in the early 1900s, is numerical and provides a measure of strength and direction of the linear association between the independent variable x and the dependent variable y.
If you suspect a linear relationship between x and y, then r can measure how strong the linear relationship is.
What the VALUE of r tells us:
The value of r is always between –1 and +1: –1 ≤ r ≤ 1.
The size of the correlation r indicates the...
6.3K
DNA Microarrays02:34

DNA Microarrays

17.7K
Microarrays are high-throughput and relatively inexpensive assays that can be automated to analyze large quantities of data at a time. They are used in genome-wide studies to compare gene or protein expression under two varied conditions, such as healthy and diseased states. Microarrays consist of glass or silica slides on which probe molecules are covalently attached through surface functionalization. Most commonly, the slides are prepared through the chemisorption of silanes to silica...
17.7K
Correlation01:09

Correlation

12.0K
In statistics, two variables are said to be correlated if the values of one variable are associated with the other variable. Depending on the relationship between two variables, correlation can be of three types– positive correlation, negative correlation, and zero correlation.
Two variables, for example, a and b, are said to be positively correlated if both variables move in the same direction. In other words, a positive correlation exists between two variables, a and b, if:
12.0K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Precise staging of diabetic retinopathy through machine learning analysis of leakage source characteristics: A Radiomics-driven approach integrating OCT-A and FFA.

Photodiagnosis and photodynamic therapy·2026
Same author

Accurate and robust ambiguity detection in software requirements documents using a GloVe-BiLSTM deep learning framework with data augmentation.

Scientific reports·2026
Same author

Secure electronic health record access control via blockchain, dual-attribute encryption, and large language model-based attribute extraction.

Scientific reports·2026
Same author

Enhancing Robust and Stable Feature Selection Through the Integration of Ranking Methods and Wrapper Techniques in Genetic Data Classification.

Methods in molecular biology (Clifton, N.J.)·2025
Same author

Improved ADHD Diagnosis Using EEG Connectivity and Deep Learning through Combining Pearson Correlation Coefficient and Phase-Locking Value.

Neuroinformatics·2024
Same author

Blockchain technology and raft consensus for secure physician prescriptions and improved diagnoses in electronic healthcare systems.

Scientific reports·2024
Same journal

Corrigendum to "Integrating experimental biology, computational methods, and artificial Intelligence in anticancer drug discovery: Bridging the translational Gap" [Comput. Biol. Med. 213 (2026) 111832].

Computers in biology and medicine·2026
Same journal

Organ dose optimization for a point-of-care forearm X-ray photon-counting CT.

Computers in biology and medicine·2026
Same journal

Physics-guided transformation of breathomic feature spaces into disease-specific representations for respiratory disease classification.

Computers in biology and medicine·2026
Same journal

An AI-driven deep learning pipeline for taxonomic classification and biodiversity assessment of deep-sea environmental DNA.

Computers in biology and medicine·2026
Same journal

Rapid personalisation of cardiovascular models using invasively measured right ventricular pressure.

Computers in biology and medicine·2026
Same journal

Biologically inspired mechanisms for enhancing robustness in EEG signal modeling: Challenges, opportunities, and perspectives.

Computers in biology and medicine·2026
See all related articles

Related Experiment Video

Updated: Jul 29, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.6K

Robust microarray data feature selection using a correntropy based distance metric learning approach.

Venus Vahabzadeh1, Mohammad Hossein Moattar1

  • 1Department of Software Engineering, Mashhad Branch, Islamic Azad University, Mashhad, Iran.

Computers in Biology and Medicine
|May 26, 2023
PubMed
Summary
This summary is machine-generated.

This study introduces a robust feature selection method for high-dimensional microarray data, effectively handling outliers using correntropy-based metric learning. The approach enhances classification accuracy and robustness in genetic data processing.

Keywords:
CorrentropyDistance metric learningFeature selectionMicroarray data classificationsRobustness

More Related Videos

Author Spotlight: Emerging Technologies and Advanced Tools for Decoding Metabolomics Data Analysis
07:11

Author Spotlight: Emerging Technologies and Advanced Tools for Decoding Metabolomics Data Analysis

Published on: November 10, 2023

2.5K
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

813

Related Experiment Videos

Last Updated: Jul 29, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.6K
Author Spotlight: Emerging Technologies and Advanced Tools for Decoding Metabolomics Data Analysis
07:11

Author Spotlight: Emerging Technologies and Advanced Tools for Decoding Metabolomics Data Analysis

Published on: November 10, 2023

2.5K
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

813

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genetics

Background:

  • High-dimensional microarray data classification presents significant challenges.
  • Outliers in feature selection can compromise the performance of traditional distance metrics like Euclidean distance.
  • Robust feature selection is crucial for accurate genetic data analysis.

Purpose of the Study:

  • To propose a novel distance metric learning-based feature selection approach for high-dimensional data.
  • To address the challenge of outliers in feature selection using a robust discrimination metric.
  • To improve the accuracy and robustness of classification for microarray data.

Main Methods:

  • Developed a feature selection method leveraging distance metric learning.
  • Employed the correntropy function as the core discrimination metric.
  • Formulated and solved the metric learning problem as an optimization task using the Lagrange method.

Main Results:

  • The proposed method effectively identifies the most important and robust features.
  • Classification experiments using SVM, decision trees, and NN classifiers demonstrated superior performance.
  • Evaluated on 13 high-dimensional datasets, the method showed improved accuracy, precision, recall, and F-measure compared to existing models.

Conclusions:

  • The correntropy-based metric learning approach offers a robust solution for feature selection in high-dimensional microarray data.
  • This method effectively mitigates the impact of outliers, leading to enhanced classification performance.
  • The proposed technique represents a significant advancement in bioinformatics and genetic data processing.