Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Distillation: Vapor–Liquid Equilibria01:01

Distillation: Vapor–Liquid Equilibria

2.8K
Distillation is a separation technique that takes advantage of the boiling point properties of disparate elements in a mixture. To perform distillation, we begin by heating a miscible mixture of two liquids with a significant difference in boiling points (at least 20°C). As the solution heats up and reaches the bubble point of the more volatile component, some molecules of the more volatile component transition into the gas phase and travel upward into the condenser, which is a glass tube...
2.8K
Special Features of Adaptive Immunity01:20

Special Features of Adaptive Immunity

816
The adaptive immune system, a crucial component of the overall immune response, offers a highly specialized defense against pathogens. It involves specific cell types and features, enabling it to combat infections effectively and efficiently.
The primary cell types involved in adaptive immunity are T cells and B cells. Each type has a unique role in defending the body against pathogens. T cells are responsible for cell-mediated immunity. They identify and eliminate infected cells directly,...
816

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Exploring the Stochastic Regularisation in Normalisation Layers for Semi-Supervised Learning.

IEEE transactions on pattern analysis and machine intelligence·2026
Same author

Intelligent multimodal-energy-driven piezoelectric antibacterial platforms: From structural control to system-level diagnosis.

Materials today. Bio·2026
Same author

Background-enhanced NIR-II single-pixel imaging for fluorescent localization and dynamic tracking.

Optics letters·2026
Same author

SD<sup>2</sup>-SNN: Self-distillation and structural decomposition framework for SNNs in continual learning.

Neural networks : the official journal of the International Neural Network Society·2026
Same author

Enhancing the high-cycle fatigue performance of GH4169 alloy by eliminating structural stress concentration through high-energy impact composite modification.

Fundamental research·2026
Same author

Fluorescence localization and tracking imaging with a spectral-splitting perovskite single-pixel detector.

Nature communications·2026
Same journal

Dynamic analysis and reliable mechanical optimization application of ring HNN effected with a memristive neuron.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

DAFF-Net: A detection and search method for small-scale low surface brightness galaxies.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Quasi-synchronization for complex networks with hybrid pinning intermittent control.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Physics-encoded convolutional neural operators for parametric PDEs: A convergence-guaranteed framework via pre-computed kernel fields.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Exploiting audio-visual modalities in videos: Object detection via multi-stage bilateral coupling network.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Reliability-aware modality completion with cross-modal distillation for federated learning with missing modalities.

Neural networks : the official journal of the International Neural Network Society·2026
See all related articles

Related Experiment Video

Updated: Jul 1, 2025

Author Spotlight: UAV Remote Sensing for Efficient Invasive Plant Biomass Estimation
08:47

Author Spotlight: UAV Remote Sensing for Efficient Invasive Plant Biomass Estimation

Published on: February 9, 2024

1.4K

Tolerant Self-Distillation for image classification.

Mushui Liu1, Yunlong Yu1, Zhong Ji2

  • 1College of Information Science and Electronic Engineering, Zhejiang University, China.

Neural Networks : the Official Journal of the International Neural Network Society
|March 12, 2024
PubMed
Summary
This summary is machine-generated.

Deep neural networks can overfit with insufficient data. Tolerant Self-Distillation (TSD) uses intra-class distribution metrics and knowledge distillation to mitigate overfitting without a pre-trained teacher model.

Keywords:
Deep LearningOverfittingSelf-DistillationTolerant

More Related Videos

Evaporation-reducing Culture Condition Increases the Reproducibility of Multicellular Spheroid Formation in Microtiter Plates
11:24

Evaporation-reducing Culture Condition Increases the Reproducibility of Multicellular Spheroid Formation in Microtiter Plates

Published on: March 7, 2017

7.0K
Multimodal Optical Imaging Platform for Studying Cellular Metabolism
04:47

Multimodal Optical Imaging Platform for Studying Cellular Metabolism

Published on: June 6, 2025

260

Related Experiment Videos

Last Updated: Jul 1, 2025

Author Spotlight: UAV Remote Sensing for Efficient Invasive Plant Biomass Estimation
08:47

Author Spotlight: UAV Remote Sensing for Efficient Invasive Plant Biomass Estimation

Published on: February 9, 2024

1.4K
Evaporation-reducing Culture Condition Increases the Reproducibility of Multicellular Spheroid Formation in Microtiter Plates
11:24

Evaporation-reducing Culture Condition Increases the Reproducibility of Multicellular Spheroid Formation in Microtiter Plates

Published on: March 7, 2017

7.0K
Multimodal Optical Imaging Platform for Studying Cellular Metabolism
04:47

Multimodal Optical Imaging Platform for Studying Cellular Metabolism

Published on: June 6, 2025

260

Area of Science:

  • Computer Science
  • Artificial Intelligence
  • Machine Learning

Background:

  • Deep neural networks (DNNs) are prone to overfitting, especially with limited training data.
  • Overfitting occurs when models learn training data too well, including noise and outliers, leading to poor generalization.
  • Existing methods often require large datasets or complex pre-training strategies.

Purpose of the Study:

  • To introduce novel metrics from intra-class distribution for analyzing overfitting in DNNs.
  • To propose a new knowledge distillation method, Tolerant Self-Distillation (TSD), to alleviate overfitting.
  • To enable effective knowledge distillation without the need for a pre-trained teacher model.

Main Methods:

  • Developed two metrics based on the intra-class distribution of correctly and incorrectly predicted samples.
  • Proposed Tolerant Self-Distillation (TSD), an online knowledge distillation approach.
  • TSD utilizes an online updating memory to store past predictions, distilling knowledge across iterations by using incorrect predictions to supervise correct ones and vice versa.

Main Results:

  • The proposed TSD method effectively mitigates the overfitting issue in deep neural networks.
  • TSD alleviates premature convergence caused by over-confident predictions, leading to better local optima.
  • Experiments on various image classification benchmarks (small-scale, large-scale, fine-grained) demonstrate TSD's superiority.

Conclusions:

  • The novel intra-class distribution metrics offer a new perspective on understanding overfitting.
  • Tolerant Self-Distillation (TSD) provides an effective and efficient approach to combat overfitting in DNNs.
  • TSD shows significant improvements in model generalization across diverse datasets.