Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Factorial Design02:01

Factorial Design

16.0K
Factorial Analysis is an experimental design that applies Analysis of Variance (ANOVA) statistical procedures to examine a change in a dependent variable due to more than one independent variable, also known as factors. Changes in worker productivity can be reasoned, for example, to be influenced by salary and other conditions, such as skill level. One way to test this hypothesis is by categorizing salary into three levels (low, moderate, and high) and skills sets into two levels (entry level...
16.0K
Quantifying and Rejecting Outliers: The Grubbs Test01:02

Quantifying and Rejecting Outliers: The Grubbs Test

5.0K
Sometimes, a data set can have a recorded numerical observation that greatly  deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier.  To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...
5.0K
Expected Frequencies in Goodness-of-Fit Tests01:19

Expected Frequencies in Goodness-of-Fit Tests

8.9K
A goodness-of-fit test is conducted to determine whether the observed frequency values are statistically similar to the frequencies expected for the dataset. Suppose the expected frequencies for a dataset are equal such as when predicting the frequency of any number appearing when casting a die. In that case, the expected frequency is the ratio of the total number of observations (n) to the number of categories (k).
8.9K
Outliers and Influential Points01:08

Outliers and Influential Points

6.8K
An outlier is an observation of data that does not fit the rest of the data. It is sometimes called an extreme value. When you graph an outlier, it will appear not to fit the pattern of the graph. Some outliers are due to mistakes (for example, writing down 50 instead of 500), while others may indicate that something unusual is happening. Outliers are present far from the least squares line in the vertical direction. They have large "errors," where the "error" or residual is the...
6.8K
Methods of Medium Optimization01:28

Methods of Medium Optimization

63
Optimizing growth media enhances microbial proliferation and maximizes product yield. Statistical experimental design methodologies provide structured and reproducible approaches, offering progressively higher levels of robustness and efficiency.The One-Factor-at-a-Time (OFAT) MethodThe One-Factor-at-a-Time (OFAT) method involves adjusting a single variable while keeping all others constant. However, it cannot detect interactions between variables, often leading to suboptimal outcomes when...
63
Receiver Operating Characteristic Plot01:15

Receiver Operating Characteristic Plot

585
A ROC (Receiver Operating Characteristic) plot is a graphical tool used to assess the performance of a binary classification model by illustrating the trade-off between sensitivity (true positive rate) and specificity (false positive rate). By plotting sensitivity against 1 - specificity across various threshold settings, the ROC curve shows how well the model distinguishes between classes, with a curve closer to the top-left corner indicating a more accurate model. The area under the ROC curve...
585

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Correction: Regulation of focal adhesion dynamics and cell motility by the EB2 and Hax1 protein complex.

The Journal of biological chemistry·2019
Same author

Chemical Syntheses and Chemical Biology of Carboxyl Polyether Ionophores: Recent Highlights.

Angewandte Chemie (International ed. in English)·2019
Same author

Cholesterol content in cell membrane maintains surface levels of ErbB2 and confers a therapeutic vulnerability in ErbB2-positive breast cancer.

Cell communication and signaling : CCS·2019
Same author

Exercise interventions on patients with end-stage renal disease: a systematic review.

Clinical rehabilitation·2019
Same author

Long-term creep deformations in colloidal calcium-silicate-hydrate gels by accelerated aging simulations.

Journal of colloid and interface science·2019
Same author

Microconcave MAPbBr<sub>3</sub> Single Crystal for High-Performance Photodetector.

The journal of physical chemistry letters·2019
Same journal

Gradient Descent Provably Solves Nonlinear Tomographic Reconstruction.

IEEE transactions on information theory·2026
Same journal

Theoretical Guarantees for Sparse Principal Component Analysis based on the Elastic Net.

IEEE transactions on information theory·2025
Same journal

Uniform Convergence of Deep Neural Networks With Lipschitz Continuous Activation Functions and Variable Widths.

IEEE transactions on information theory·2025
Same journal

Kernel Stein Discrepancy on Lie Groups: Theory and Applications.

IEEE transactions on information theory·2024
Same journal

Matrix Reordering for Noisy Disordered Matrices: Optimality and Computationally Efficient Algorithms.

IEEE transactions on information theory·2024
Same journal

Non-Asymptotic Guarantees for Reliable Identification of Granger Causality via the LASSO.

IEEE transactions on information theory·2024
See all related articles

Related Experiment Video

Updated: Apr 18, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

8.1K

Optimal Feature Selection in High-Dimensional Discriminant Analysis.

Mladen Kolar1, Han Liu2

  • 1Machine Learning Department, Carnegie Mellon University, Pittsburgh, PA 15217, USA.

IEEE Transactions on Information Theory
|January 27, 2015
PubMed
Summary
This summary is machine-generated.

This study provides sharp conditions for consistent variable selection in high-dimensional discriminant analysis. Our findings offer faster convergence rates and optimal scaling for sample size, dimensionality, and sparsity.

Keywords:
discriminant analysishigh-dimensional statisticsoptimal rates of convergencevariable selection

More Related Videos

Assisted Selection of Biomarkers by Linear Discriminant Analysis Effect Size LEfSe in Microbiome Data
04:57

Assisted Selection of Biomarkers by Linear Discriminant Analysis Effect Size LEfSe in Microbiome Data

Published on: May 16, 2022

17.9K
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

1.5K

Related Experiment Videos

Last Updated: Apr 18, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

8.1K
Assisted Selection of Biomarkers by Linear Discriminant Analysis Effect Size LEfSe in Microbiome Data
04:57

Assisted Selection of Biomarkers by Linear Discriminant Analysis Effect Size LEfSe in Microbiome Data

Published on: May 16, 2022

17.9K
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

1.5K

Area of Science:

  • Statistics
  • Machine Learning
  • Data Science

Background:

  • High-dimensional discriminant analysis methods lack sharp theoretical analysis for variable selection performance.
  • Model interpretability is crucial in scientific data analysis, yet variable selection performance remains underexplored.

Purpose of the Study:

  • To bridge the gap in theoretical analysis by providing sharp sufficient conditions for consistent variable selection in high-dimensional discriminant analysis.
  • To establish optimal convergence rates for variable selection that depend on sample size, dimensionality, and sparsity.

Main Methods:

  • Developed sharp sufficient conditions for consistent variable selection in sparse discriminant analysis.
  • Established significantly faster rates of convergence compared to existing methods.
  • Analyzed an exhaustive search procedure for benchmark comparison.

Main Results:

  • Achieved optimal scaling of sample size (n), dimensionality (p), and sparsity (s) in the high-dimensional setting.
  • Established information-theoretic limits for variable selection consistency.
  • Demonstrated optimal results for ROAD and sparse optimal scaling estimators.
  • Showed that an exhaustive search procedure is variable selection consistent under weaker conditions.

Conclusions:

  • The proposed conditions for consistent variable selection significantly advance the theoretical understanding of high-dimensional discriminant analysis.
  • The findings provide a benchmark for evaluating variable selection performance in this domain.
  • The study offers a comprehensive theoretical framework with practical implications for scientific data analysis.