Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Sensitivity, Specificity, and Predicted Value01:13

Sensitivity, Specificity, and Predicted Value

977
In healthcare diagnostics, laboratory tests play a crucial role in identifying and diagnosing a wide range of medical conditions. However, interpreting test results is not always straightforward. An abnormal test result does not always confirm the presence of a disease, just as a normal result does not guarantee its absence. To assess the reliability of these diagnostic tools, healthcare practitioners rely on two key statistical indicators: sensitivity and specificity.
Sensitivity is the...
977
Detection of Gross Error: The Q Test01:00

Detection of Gross Error: The Q Test

6.7K
When one or more data points appear far from the rest of the data, there is a need to determine whether they are outliers and whether they should be eliminated from the data set to ensure an accurate representation of the measured value. In many cases, outliers arise from gross errors (or human errors) and do not accurately reflect the underlying phenomenon. In some cases, however, these apparent outliers reflect true phenomenological differences. In these cases, we can use statistical methods...
6.7K
Steps in Outbreak Investigation01:18

Steps in Outbreak Investigation

364
In the ever-evolving field of public health, statistical analysis serves as a cornerstone for understanding and managing disease outbreaks. By leveraging various statistical tools, health professionals can predict potential outbreaks, analyze ongoing situations, and devise effective responses to mitigate impact. For that to happen, there are a few possible stages of the analysis:
364
Survival Tree01:19

Survival Tree

250
Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
 Building a Survival Tree
Constructing a...
250
Quantifying and Rejecting Outliers: The Grubbs Test01:02

Quantifying and Rejecting Outliers: The Grubbs Test

3.3K
Sometimes, a data set can have a recorded numerical observation that greatly  deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier.  To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...
3.3K
Receiver Operating Characteristic Plot01:15

Receiver Operating Characteristic Plot

397
A ROC (Receiver Operating Characteristic) plot is a graphical tool used to assess the performance of a binary classification model by illustrating the trade-off between sensitivity (true positive rate) and specificity (false positive rate). By plotting sensitivity against 1 - specificity across various threshold settings, the ROC curve shows how well the model distinguishes between classes, with a curve closer to the top-left corner indicating a more accurate model. The area under the ROC curve...
397

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Self-healing chitosan-catechol hydrogel with tailored self-assembled fibrillar or micellar structures for biomedical applications.

Carbohydrate polymers·2026
Same author

Building MCP-native hierarchical AI scientist ecosystems: a perspective on scaling multi-agent scientific discovery.

Frontiers in artificial intelligence·2026
Same author

Biomechanical effects of various occlusal conditions and orthodontic force levels on mandibular canine distalization: a finite element analysis.

Scientific reports·2026
Same author

Blue-Light-Free Orange-Red Organic Light-Emitting Diodes With Less Influence on Melatonin Secretion.

Small (Weinheim an der Bergstrasse, Germany)·2026
Same author

Fundamental safety-capability trade-offs in fine-tuning large language models.

PNAS nexus·2026
Same author

Effects of bright light and near-infrared light on insomnia symptoms in community-dwelling older adults: a randomized clinical trial.

Photodiagnosis and photodynamic therapy·2026
Same journal

A Multi-Head Attention Transformer Model for Wearable in Situ Fall Detection.

IEEE access : practical innovations, open solutions·2026
Same journal

Validating Single-Camera Pose Estimation Against Multi-Camera Motion Capture for Accessible Biomechanical Assessment.

IEEE access : practical innovations, open solutions·2026
Same journal

Learning to Diagnose Privately: DP-Powered LLMs for Radiology Report Classification.

IEEE access : practical innovations, open solutions·2026
Same journal

Radio-Frequency Toroid Susceptometry of Magnetic Nanoparticles: What Goes Around Comes Around.

IEEE access : practical innovations, open solutions·2026
Same journal

Cross-Architecture Knowledge Distillation for Histopathological Image Analysis.

IEEE access : practical innovations, open solutions·2026
Same journal

Mislabel Identification Using Transfer Learning-Based Ensemble Method.

IEEE access : practical innovations, open solutions·2026
See all related articles

Related Experiment Video

Updated: Nov 26, 2025

Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model
07:15

Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model

Published on: August 16, 2020

7.2K

Improving Prediction Efficacy through Abnormality Detection and Data Preprocessing.

Chun-Chen Tu1, Pin-Yu Chen2, Naisyin Wang1

  • 1Department of Statistics, University of Michigan, Ann Arbor, MI 48105, USA.

IEEE Access : Practical Innovations, Open Solutions
|December 14, 2020
PubMed
Summary
This summary is machine-generated.

This study introduces a novel preprocessing system to handle abnormal testing data, improving model performance. The system effectively detects and corrects aberrant data, enhancing prediction efficacy for various models.

Keywords:
Data preprocessingGaussian mixture modelimage reconstructionoutlier detectionprincipal component analysis

More Related Videos

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
12:18

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

7.8K

Related Experiment Videos

Last Updated: Nov 26, 2025

Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model
07:15

Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model

Published on: August 16, 2020

7.2K
A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
12:18

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

7.8K

Area of Science:

  • Data Science
  • Machine Learning
  • Computer Vision

Background:

  • Abnormal testing data significantly degrades predictive model performance.
  • Existing methods often address specific data anomalies in isolation.
  • A unified approach for handling diverse abnormal data is needed.

Purpose of the Study:

  • To propose a robust preprocessing system for handling various types of abnormal testing data.
  • To develop an aberrant data detector and corrector for improved data quality.
  • To demonstrate the system's generic applicability and performance enhancement across different predictive models.

Main Methods:

  • A two-component system comprising an aberrant data detector and an aberrant data corrector.
  • Aberrant data detector classifies incoming data types.
  • Aberrant data corrector applies type-specific corrections, e.g., Gaussian Locally Linear Mappings for corrupted images and nearest neighbors for adversarial samples.

Main Results:

  • The proposed aberrant data detector and corrector components demonstrate competitive performance against existing alternatives.
  • The integrated system effectively reconstructs corrupted data and improves predictions on adversarial samples.
  • The system successfully enhances prediction efficacy when applied to three different downstream predictive models.

Conclusions:

  • The proposed preprocessing system offers a generic and effective solution for managing abnormal testing data.
  • The system's modular design allows integration with various predictive models, improving their overall performance.
  • This work highlights the importance of dedicated preprocessing for robust machine learning applications.