Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Quantifying and Rejecting Outliers: The Grubbs Test01:02

Quantifying and Rejecting Outliers: The Grubbs Test

4.5K
Sometimes, a data set can have a recorded numerical observation that greatly  deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier.  To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...
4.5K
Expected Frequencies in Goodness-of-Fit Tests01:19

Expected Frequencies in Goodness-of-Fit Tests

8.8K
A goodness-of-fit test is conducted to determine whether the observed frequency values are statistically similar to the frequencies expected for the dataset. Suppose the expected frequencies for a dataset are equal such as when predicting the frequency of any number appearing when casting a die. In that case, the expected frequency is the ratio of the total number of observations (n)  to the number of categories (k).
8.8K
Factorial Design02:01

Factorial Design

15.4K
Factorial Analysis is an experimental design that applies Analysis of Variance (ANOVA) statistical procedures to examine a change in a dependent variable due to more than one independent variable, also known as factors. Changes in worker productivity can be reasoned, for example, to be influenced by salary and other conditions, such as skill level. One way to test this hypothesis is by categorizing salary into three levels (low, moderate, and high) and skills sets into two levels (entry level...
15.4K
Wilcoxon Signed-Ranks Test for Matched Pairs01:09

Wilcoxon Signed-Ranks Test for Matched Pairs

585
The Wilcoxon signed-rank test for matched pairs evaluates the null hypothesis by combining the ranks of differences with their signs. It essentially tests whether the median of the differences in a population of matched pairs is zero. Since the test incorporates more information than the sign test, it generally yields more trustable conclusions. This test also does not require the data to follow a normal distribution, but two conditions must be met for it to be applicable: (1) the data must...
585
Sensitivity, Specificity, and Predicted Value01:13

Sensitivity, Specificity, and Predicted Value

1.7K
In healthcare diagnostics, laboratory tests play a crucial role in identifying and diagnosing a wide range of medical conditions. However, interpreting test results is not always straightforward. An abnormal test result does not always confirm the presence of a disease, just as a normal result does not guarantee its absence. To assess the reliability of these diagnostic tools, healthcare practitioners rely on two key statistical indicators: sensitivity and specificity.
Sensitivity is the...
1.7K
Goodness-of-Fit Test01:16

Goodness-of-Fit Test

9.4K
The goodness-of-fit test is a type of hypothesis test which determines whether the data "fits" a particular distribution. For example, one may suspect that some anonymous data may fit a binomial distribution. A chi-square test (meaning the distribution for the hypothesis test is chi-square) can be used to determine if there is a fit. The null and alternative hypotheses may be written in sentences or stated as equations or inequalities. The test statistic for a goodness-of-fit test is given as...
9.4K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

RSTG: Robust Generation of High Quality Spatial Transcriptomics Data using Beta Divergence Based AutoEncoder.

IEEE journal of biomedical and health informatics·2026
Same author

MutAIverse: an AI-powered, mechanism-backed platform for discovering novel DNA adducts and their precursor genotoxins.

Journal of cheminformatics·2026
Same author

Gene dependency-informed inference of response to targeted cancer therapies.

Nature communications·2026
Same author

Optimizing genomics-aware clinical agents in precision oncology.

NPJ systems biology and applications·2026
Same author

Altered chromatin accessibility and nucleosome positioning landscape upon HDAC and LSD1 inhibition in cancer cell.

bioRxiv : the preprint server for biology·2026
Same author

Soft Bayesian Additive Regression Trees (SBART) for correlated survey response with non-Gaussian error.

Journal of nonparametric statistics·2026
Same journal

CAFF-CIL: Causality-Aware Freedom Forgetting Approach for Class-Incremental Learning.

IEEE transactions on neural networks and learning systems·2026
Same journal

Harmonic Autoencoding Framework for Multiple Tasks in Magnetic Particle Imaging Reconstruction.

IEEE transactions on neural networks and learning systems·2026
Same journal

A Survey on Human-Centric Voice-Face Multimodal Learning.

IEEE transactions on neural networks and learning systems·2026
Same journal

Vision-Assisted Foundation Model for Solving Multitask Vehicle Routing Problems.

IEEE transactions on neural networks and learning systems·2026
Same journal

FP3O: Enabling Proximal Policy Optimization in Multiagent Cooperation With Parameter-Sharing Versatility.

IEEE transactions on neural networks and learning systems·2026
Same journal

Hierarchical Semantic Concept Modeling for Generalizable Myocardial Pathology Segmentation on Multisequence CMR Images.

IEEE transactions on neural networks and learning systems·2026
See all related articles

Related Experiment Video

Updated: Mar 26, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

8.1K

A Scoring Scheme for Online Feature Selection: Simulating Model Performance Without Retraining.

Debarka Sengupta, Sanghamitra Bandyopadhyay, Debajyoti Sinha

    IEEE Transactions on Neural Networks and Learning Systems
    |January 27, 2016
    PubMed
    Summary
    This summary is machine-generated.

    Feature selection is crucial for model performance and interpretability. This study introduces a novel score for efficient online feature selection, avoiding costly retraining and subjective relevance definitions.

    More Related Videos

    Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
    03:37

    Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

    Published on: March 1, 2024

    1.4K
    Constructing and Visualizing Models using Mime-based Machine-learning Framework
    06:19

    Constructing and Visualizing Models using Mime-based Machine-learning Framework

    Published on: July 22, 2025

    3.1K

    Related Experiment Videos

    Last Updated: Mar 26, 2026

    Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
    07:35

    Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

    Published on: October 11, 2018

    8.1K
    Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
    03:37

    Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

    Published on: March 1, 2024

    1.4K
    Constructing and Visualizing Models using Mime-based Machine-learning Framework
    06:19

    Constructing and Visualizing Models using Mime-based Machine-learning Framework

    Published on: July 22, 2025

    3.1K

    Area of Science:

    • Machine Learning
    • Data Science
    • Artificial Intelligence

    Background:

    • Increasing model complexity with irrelevant features can lead to overfitting and reduced interpretability.
    • Online learning and real-world scenarios necessitate periodic model retraining or performance testing with newly discovered features.
    • Current feature selection methods, supervised or unsupervised, present computational or subjective limitations.

    Purpose of the Study:

    • To introduce an accurate feature importance score for optimal feature selection.
    • To address the challenges of online feature selection in dynamic environments.
    • To enable performance evaluation of new features without model retraining.

    Main Methods:

    • Development of a novel scoring mechanism to determine feature importance.
    • Application of the score in online feature selection scenarios.
    • Evaluation of the score's ability to interpret performance improvements.

    Main Results:

    • The proposed score accurately determines feature importance.
    • The score offers low time complexity suitable for online scenarios.
    • Performance improvement can be interpreted without invoking model retraining.

    Conclusions:

    • The introduced score provides an efficient and effective solution for online feature selection.
    • This method enhances model interpretability and reduces computational overhead.
    • The score facilitates dynamic model adaptation in response to new data features.