Quantifying and Rejecting Outliers: The Grubbs Test
Cluster Sampling Method
Variability: Analysis
Expected Frequencies in Goodness-of-Fit Tests
Residuals and Least-Squares Property
Multiple Regression
You might also read
Articles linked to this work by shared authors, journal, and citation graph.
Updated: Jul 11, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
Published on: October 11, 2018
Kangming Li1, Daniel Persaud1, Kamal Choudhary2
1Department of Materials Science and Engineering, University of Toronto, 27 King's College Cir, Toronto, ON, Canada.
Redundant materials data, often comprising up to 95%, can be removed without harming machine learning predictions. Focusing on data richness, not volume, improves model performance and training efficiency.
Area of Science:
Background:
Purpose of the Study:
Main Methods:
Main Results:
Conclusions: