Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Generalization, Discrimination, and Extinction01:24

Generalization, Discrimination, and Extinction

375
Generalization, discrimination, and extinction are key concepts in operant conditioning that influence how behaviors are learned and maintained.
Generalization occurs when a behavior reinforced in one context is performed in similar situations. For instance, a student who studies diligently for calculus and receives excellent grades might apply the same study habits to psychology and history, expecting similar results. Generalization shows how learning in one setting can influence behavior in...
375
Improving Translational Accuracy02:07

Improving Translational Accuracy

8.5K
Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
8.5K
Mean Absolute Deviation01:13

Mean Absolute Deviation

2.5K
The mean absolute deviation is also a measure of the variability of data in a sample. It is the absolute value of the average difference between the data values and the mean.
Let us consider a dataset containing the number of unsold cupcakes in five shops: 10, 15, 8, 7, and 10. Initially, calculate the sample mean. Then calculate the deviation, or the difference, between each data value and the mean. Next, the absolute values of these deviations are added and divided by the sample size to...
2.5K
Multiple Comparison Tests01:13

Multiple Comparison Tests

3.8K
Multiple comparison test, abbreviated as MCT, is a post hoc analysis generally performed after comparing multiple samples with one or more tests. An MCT will help identify a significantly different sample among multiple samples or a factor among multiple factors.
It would be easy to compare two samples using a significance alpha level of 0.05. In other words, there is only one sample pair to be compared. However, it would be difficult to identify a significantly different sample if the number...
3.8K
Testing a Claim about Standard Deviation01:19

Testing a Claim about Standard Deviation

2.4K
A complete procedure to test a claim about population standard deviation or population variance is explained here.
The hypothesis testing for the claim of population standard deviation (or variance) requires the data and samples to be random and unbiased. The population distribution also must be normal. There is no specific requirement on the sample size as the estimation is based on the chi-square distribution.
As a first step, the hypothesis (null and alternative) concerning the claim about...
2.4K
Sensitivity, Specificity, and Predicted Value01:13

Sensitivity, Specificity, and Predicted Value

157
In healthcare diagnostics, laboratory tests play a crucial role in identifying and diagnosing a wide range of medical conditions. However, interpreting test results is not always straightforward. An abnormal test result does not always confirm the presence of a disease, just as a normal result does not guarantee its absence. To assess the reliability of these diagnostic tools, healthcare practitioners rely on two key statistical indicators: sensitivity and specificity.
Sensitivity is the...
157

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

GSDeformer: Direct, Real-Time and Extensible Cage-Based Deformation for 3D Gaussian Splatting.

IEEE transactions on visualization and computer graphics·2026
Same author

Computation-aided design of rod-shaped nanoparticles for tumoral targeting.

Journal of controlled release : official journal of the Controlled Release Society·2025
Same author

An end-to-end implicit neural representation architecture for medical volume data.

PloS one·2025
Same author

Author Correction: BigNeuron: a resource to benchmark and predict performance of algorithms for automated tracing of neurons in light microscopy datasets.

Nature methods·2024
Same author

Exploring therapeutic targets for molecular therapy of idiopathic pulmonary fibrosis.

Science progress·2024
Same author

Computation-aided Design of Rod-Shaped Janus Base Nanopieces for Improved Tissue Penetration and Therapeutics Delivery.

bioRxiv : the preprint server for biology·2024
Same journal

Turbulent flow in a vortex separator with a directed pipe inlet.

Scientific reports·2026
Same journal

Systematic characteristic evaluation of clay-based cementitious material derived from calcium carbide residue and waste tile powder.

Scientific reports·2026
Same journal

Retraction Note: Improvement of a rapid diagnostic application of monoclonal antibodies against avian influenza H7 subtype virus using Europium nanoparticles.

Scientific reports·2026
Same journal

Applying large language models to spam detection in the Kazakh low-resource language setting.

Scientific reports·2026
Same journal

An open-source 3D printing system enabling in-situ freeze-thaw processing of hydrogels.

Scientific reports·2026
Same journal

An enhanced EfficientNet framework for automated waste classification using cosine annealing and label smoothing.

Scientific reports·2026
See all related articles

Related Experiment Video

Updated: May 21, 2025

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

448

A practical generalization metric for deep networks benchmarking.

Mengqing Huang1, Hongchuan Yu2, Jianjun Zhang1

  • 1National Centre for Computer Animation, Bournemouth University, Poole, BH12 5BB, UK.

Scientific Reports
|March 22, 2025
PubMed
Summary
This summary is machine-generated.

This study introduces a practical metric to evaluate deep learning model generalization, finding it depends on accuracy and data diversity. Most existing theoretical estimations poorly correlate with practical measurements.

More Related Videos

Deep Neural Networks for Image-Based Dietary Assessment
13:19

Deep Neural Networks for Image-Based Dietary Assessment

Published on: March 13, 2021

8.9K
Modeling the Functional Network for Spatial Navigation in the Human Brain
05:55

Modeling the Functional Network for Spatial Navigation in the Human Brain

Published on: October 13, 2023

966

Related Experiment Videos

Last Updated: May 21, 2025

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

448
Deep Neural Networks for Image-Based Dietary Assessment
13:19

Deep Neural Networks for Image-Based Dietary Assessment

Published on: March 13, 2021

8.9K
Modeling the Functional Network for Spatial Navigation in the Human Brain
05:55

Modeling the Functional Network for Spatial Navigation in the Human Brain

Published on: October 13, 2023

966

Area of Science:

  • Artificial Intelligence
  • Machine Learning
  • Deep Learning

Background:

  • Estimating generalization error in deep learning models is crucial for both practical applications and theoretical validation.
  • Current research lacks standardized methods for benchmarking deep network generalization and verifying theoretical predictions.
  • Practical evaluation is essential to bridge the gap between theoretical estimations and real-world performance.

Purpose of the Study:

  • To introduce a practical generalization metric for benchmarking diverse deep learning networks.
  • To propose a novel testbed for verifying theoretical generalization estimations.
  • To quantify the relationship between model accuracy, data diversity, and generalization capacity.

Main Methods:

  • Development of a novel practical generalization metric.
  • Creation of a benchmarking testbed for deep learning models.
  • Comparative analysis of the proposed metric against existing theoretical generalization estimations.

Main Results:

  • Deep network generalization in classification is influenced by both classification accuracy and the diversity of unseen data.
  • The proposed metric quantifies model accuracy and data diversity, offering an intuitive trade-off evaluation.
  • Most existing theoretical generalization estimations showed poor correlation with practical measurements from the new testbed.

Conclusions:

  • The proposed metric provides a quantitative evaluation of deep learning model generalization.
  • Significant discrepancies exist between current theoretical generalization estimations and practical performance.
  • This work highlights the limitations of existing theories and motivates further research into more accurate generalization assessment methods.