A practical generalization metric for deep networks benchmarking.

Mengqing Huang¹, Hongchuan Yu², Jianjun Zhang¹

¹National Centre for Computer Animation, Bournemouth University, Poole, BH12 5BB, UK.

Scientific Reports

March 22, 2025

Summary

This summary is machine-generated.

This study introduces a practical metric for evaluating how well deep learning models generalize, and finds that generalization depends on both classification accuracy and the diversity of unseen data. Most existing theoretical estimates of generalization correlate poorly with the practical measurements.

Area of Science:

Artificial Intelligence
Machine Learning
Deep Learning

Background:

Estimating generalization error in deep learning models is crucial for both practical applications and theoretical validation.
Current research lacks standardized methods for benchmarking deep network generalization and verifying theoretical predictions.
Practical evaluation is essential to bridge the gap between theoretical estimations and real-world performance.
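
For orientation, the generalization error mentioned above is commonly written as the gap between expected and empirical risk. The block below is a standard textbook form, not a formula taken from this study; the symbols (a model f, a loss ℓ, a data distribution D, and n training samples) are assumptions introduced here for illustration.

```latex
\mathrm{gen}(f) \;=\;
\underbrace{\mathbb{E}_{(x,y)\sim\mathcal{D}}\bigl[\ell\bigl(f(x),y\bigr)\bigr]}_{\text{expected risk on unseen data}}
\;-\;
\underbrace{\frac{1}{n}\sum_{i=1}^{n}\ell\bigl(f(x_i),y_i\bigr)}_{\text{empirical risk on training data}}
```

The expected risk cannot be computed exactly, so in practice it is approximated on held-out data, which is where a practical benchmark comes in.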

Purpose of the Study:

To introduce a practical generalization metric for benchmarking diverse deep learning networks.
To propose a novel testbed for verifying theoretical generalization estimations.
To quantify the relationship between model accuracy, data diversity, and generalization capacity.

Main Methods:

Development of a novel practical generalization metric.
Creation of a benchmarking testbed for deep learning models.
Comparative analysis of the proposed metric against existing theoretical generalization estimations.

Main Results:

Deep network generalization in classification is influenced by both classification accuracy and the diversity of unseen data.
The proposed metric quantifies both model accuracy and data diversity, giving an intuitive way to evaluate the trade-off between the two.
Most existing theoretical generalization estimations showed poor correlation with practical measurements from the new testbed.
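
One way to check how well theoretical estimates track practical measurements is a rank correlation across a set of models. The sketch below uses Kendall's tau (the tau-a variant, which ignores ties); this is an assumption for illustration, since the summary does not say which correlation measure the study used. The function name `kendall_tau` and all numbers are hypothetical and are not results from the paper.

```python
from itertools import combinations


def kendall_tau(xs, ys):
    """Kendall tau-a rank correlation between two equal-length sequences.

    Returns a value in [-1, 1]: 1 means identical ordering, -1 means
    reversed ordering, and values near 0 mean little rank agreement.
    """
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    concordant = 0
    discordant = 0
    # Compare every pair of models: do both scores order them the same way?
    for (x1, y1), (x2, y2) in combinations(zip(xs, ys), 2):
        sign = (x1 - x2) * (y1 - y2)
        if sign > 0:
            concordant += 1
        elif sign < 0:
            discordant += 1
    n_pairs = len(xs) * (len(xs) - 1) // 2
    return (concordant - discordant) / n_pairs


# Hypothetical values for five models (illustrative only, not from the paper).
theoretical_estimate = [0.9, 0.4, 0.7, 0.2, 0.5]       # e.g. a bound-based score
measured_error = [0.10, 0.25, 0.30, 0.15, 0.20]        # e.g. error on unseen data

tau = kendall_tau(theoretical_estimate, measured_error)
print(f"Kendall tau between estimate and measurement: {tau:.2f}")
```

A tau close to zero, as with these made-up numbers, would indicate that the theoretical score says little about how the models rank in practice.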

Conclusions:

The proposed metric provides a quantitative evaluation of deep learning model generalization.
Significant discrepancies exist between current theoretical generalization estimations and practical performance.
This work highlights the limitations of existing theories and motivates further research into more accurate generalization assessment methods.