Updated: May 13, 2025

Modified Robust Proportional Overlapping Score for feature selection in high-dimensional micro-array data.

Muhammad Hamraz¹, Tahir Abbas², Fawad Ali¹

  • ¹ Department of Statistics, Abdul Wali Khan University, Mardan, 23200, Pakistan.

Computers in Biology and Medicine | April 15, 2025
Summary
This summary is machine-generated.

The Modified Robust Proportional Overlapping Score (MRPOS) effectively selects discriminative genes from high-dimensional gene expression data. This novel feature selection method addresses the curse of dimensionality, improving classification accuracy in biological studies.

Keywords:
Classification error; Gene expression; Overlapping score; Random forest; Rousseeuw and Croux statistics; Support vector machine (SVM); k-Nearest Neighbor (k-NN)

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • High-dimensional microarray datasets suffer from the curse of dimensionality: the number of samples (n) is far smaller than the number of genes (p), written n << p.
  • Traditional feature selection methods struggle with the vast number of genes and limited samples in these datasets.
  • Effective gene selection is crucial for accurate biological data analysis and classification.

Purpose of the Study:

  • To introduce the Modified Robust Proportional Overlapping Score (MRPOS), a novel feature selection method.
  • To enhance gene selection for binary classification problems in high-dimensional gene expression data.
  • To robustly identify discriminative genes by minimizing inter-class similarity and maximizing class differentiation.

Main Methods:

  • MRPOS uses the robust dispersion statistics Sn and Qn (the Rousseeuw and Croux statistics) to evaluate each gene.
  • Gene expression overlap is assessed to identify genes that best distinguish between classes.
  • Four gene expression datasets were used, split into 70% training and 30% testing subsets.
  • Performance was evaluated against four existing feature selection techniques using Random Forest, k-NN, and SVM classifiers.
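
To make the robust dispersion statistics named above concrete, here is a minimal Python sketch of naive O(n²) versions of the Rousseeuw and Croux Sn and Qn estimators. It assumes the consistency constants usually quoted for normal data (1.1926 and 2.2219) and omits finite-sample correction factors. The `interval_overlap` helper is purely hypothetical: it illustrates one way class overlap for a single gene could be scored from a robust center and scale, and it is not the MRPOS definition from the paper, which this summary does not give.

```python
from itertools import combinations
from math import comb
from statistics import median, median_high, median_low

# Consistency factors for normally distributed data (assumed values;
# finite-sample corrections are deliberately left out of this sketch).
SN_CONST = 1.1926
QN_CONST = 2.2219


def sn_scale(x):
    """Naive Sn: low median over i of the high median over j of |x_i - x_j|."""
    inner = [median_high([abs(xi - xj) for xj in x]) for xi in x]
    return SN_CONST * median_low(inner)


def qn_scale(x):
    """Naive Qn: k-th smallest pairwise distance, k = C(h, 2), h = n // 2 + 1."""
    k = comb(len(x) // 2 + 1, 2)
    dists = sorted(abs(a - b) for a, b in combinations(x, 2))
    return QN_CONST * dists[k - 1]


def interval_overlap(class_a, class_b, scale=sn_scale):
    """Hypothetical overlap measure for one gene (not the published MRPOS).

    Each class is summarized by the interval [median - s, median + s], where
    s is a robust scale. The result is the shared length of the two
    intervals divided by their combined span: 0 means no overlap.
    """
    s_a, s_b = scale(class_a), scale(class_b)
    lo_a, hi_a = median(class_a) - s_a, median(class_a) + s_a
    lo_b, hi_b = median(class_b) - s_b, median(class_b) + s_b
    shared = max(0.0, min(hi_a, hi_b) - max(lo_a, lo_b))
    span = max(hi_a, hi_b) - min(lo_a, lo_b)
    return shared / span if span > 0 else 1.0


# One extreme value barely moves either estimator.
expr = [1, 2, 3, 4, 100]
print(sn_scale(expr), qn_scale(expr))
```

Under this illustrative scoring, a gene whose two class intervals do not intersect scores 0, so ranking genes by ascending overlap would put the most discriminative candidates first. The naive pairwise loops are fine for the small sample sizes typical of microarray studies; O(n log n) algorithms exist for larger n.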

Main Results:

  • MRPOS demonstrated effectiveness in identifying discriminative genes.
  • The method successfully reduced the impact of the curse of dimensionality.
  • Classification error rates were analyzed and visualized, showing the superiority of MRPOS.
  • Comparative analysis confirmed the distinctiveness and effectiveness of the proposed method over established techniques.
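
The classification error rates mentioned above come from a 70% training and 30% testing split. The sketch below shows one plausible way to build such a split and compute a test error rate in plain Python. The per-class (stratified) sampling, the fixed seed, and the function names are assumptions for illustration; the summary does not say how the authors drew their split.

```python
import random


def stratified_split(labels, train_frac=0.7, seed=0):
    """Return (train_idx, test_idx), taking train_frac of each class for training.

    Stratifying by class is an assumption of this sketch, not a detail
    stated in the source.
    """
    rng = random.Random(seed)
    train, test = [], []
    for cls in sorted(set(labels)):
        idx = [i for i, y in enumerate(labels) if y == cls]
        rng.shuffle(idx)
        cut = round(train_frac * len(idx))
        train += idx[:cut]
        test += idx[cut:]
    return sorted(train), sorted(test)


def error_rate(y_true, y_pred):
    """Fraction of samples whose predicted label differs from the true label."""
    wrong = sum(t != p for t, p in zip(y_true, y_pred))
    return wrong / len(y_true)


# Twenty samples, ten per class: 14 go to training and 6 to testing.
labels = [0] * 10 + [1] * 10
train_idx, test_idx = stratified_split(labels)
print(len(train_idx), len(test_idx))
```

In a full comparison, each feature selection method would pick its genes on the training indices only, a classifier such as Random Forest, k-NN, or SVM would be fit on those genes, and `error_rate` would be computed on the held-out test indices.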

Conclusions:

  • The Modified Robust Proportional Overlapping Score (MRPOS) is a highly effective feature selection method for high-dimensional gene expression data.
  • MRPOS offers a robust approach to overcome the curse of dimensionality in bioinformatics.
  • The proposed method shows significant potential for improving classification accuracy in biological studies.