Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Survival Tree01:19

Survival Tree

294
Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
 Building a Survival Tree
Constructing a...
294

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Molecular insight into the interplay among heterogeneous plasmacytes and microenvironment cells and their clinical relevance in myeloma.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same author

Local hyperthermia for cutaneous sporotrichosis: A randomized clinical trial.

Journal of the American Academy of Dermatology·2026
Same author

Low-temperature and rapid determination of water content in solid samples using a MAPbBr<sub>3</sub>/SiO<sub>2</sub> paper-based sensor.

Mikrochimica acta·2026
Same author

Privacy-Preserving Average-Tracking Control for Multi-Agent Systems with Constant Reference Signals.

Entropy (Basel, Switzerland)·2026
Same author

A primal-dual approach to double-risk-constrained LQR for practical control under non-Gaussian noise.

ISA transactions·2025
Same author

From System 1 to System 2: A Survey of Reasoning Large Language Models.

IEEE transactions on pattern analysis and machine intelligence·2025
Same journal

Intervention Feasible Region and Driver Risk Capacity Aware Human-Machine Collaborative Safe Trajectory Planning.

IEEE transactions on neural networks and learning systems·2026
Same journal

A Unified Differential Denoising Learning Framework With a Pre-Trained Model and Fuzzy Graph Networks for Drug-Drug Interaction Prediction.

IEEE transactions on neural networks and learning systems·2026
Same journal

Self-Supervised Continuous Dynamic Graph Representation Learning via Hawkes Processes.

IEEE transactions on neural networks and learning systems·2026
Same journal

cPU: Consistent Risk Estimator for Positive-Unlabeled Learning.

IEEE transactions on neural networks and learning systems·2026
Same journal

Tuning-Free Latent Diffusion Models for Ultrahigh-Resolution Image Editing.

IEEE transactions on neural networks and learning systems·2026
Same journal

Hidden Data Recovery and Forecasting via Next-Generation Reservoir Computing With Multiscale Delay Selection.

IEEE transactions on neural networks and learning systems·2026
See all related articles

Related Experiment Video

Updated: Dec 5, 2025

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

860

Deep Neural Network Self-Distillation Exploiting Data Representation Invariance.

Ting-Bing Xu, Cheng-Lin Liu

    IEEE Transactions on Neural Networks and Learning Systems
    |October 19, 2020
    PubMed
    Summary
    This summary is machine-generated.

    This study introduces self-distillation (SD) for creating accurate small neural networks without needing a separate teacher model. This method enhances generalization and outperforms existing model distillation techniques.

    More Related Videos

    Deep Neural Networks for Image-Based Dietary Assessment
    13:19

    Deep Neural Networks for Image-Based Dietary Assessment

    Published on: March 13, 2021

    9.8K
    Author Spotlight: Advancing Alzheimer's Research &#8211; Exploring Early Detection and Multi-Omics Approaches
    09:47

    Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

    Published on: December 15, 2023

    1.5K

    Related Experiment Videos

    Last Updated: Dec 5, 2025

    Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
    03:31

    Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

    Published on: December 15, 2023

    860
    Deep Neural Networks for Image-Based Dietary Assessment
    13:19

    Deep Neural Networks for Image-Based Dietary Assessment

    Published on: March 13, 2021

    9.8K
    Author Spotlight: Advancing Alzheimer's Research &#8211; Exploring Early Detection and Multi-Omics Approaches
    09:47

    Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

    Published on: December 15, 2023

    1.5K

    Area of Science:

    • Computer Science
    • Artificial Intelligence
    • Machine Learning

    Background:

    • Existing methods for creating accurate small networks rely on model compression or knowledge distillation, often requiring complex assistive models.
    • These assistive models increase training complexity, memory usage, and time costs.

    Purpose of the Study:

    • To propose a self-distillation (SD) mechanism for directly obtaining high-accuracy models without an assistive model.
    • To leverage the concept of data representation invariance inspired by human vision.

    Main Methods:

    • Developed a self-distillation (SD) learning algorithm where a single network learns from its own distorted versions.
    • Utilized maximum mean discrepancy to ensure global feature consistency.
    • Employed Kullback-Leibler divergence to maintain posterior class probability consistency across distorted branches.

    Main Results:

    • The proposed SD method effectively reduces generalization error across various network architectures (AlexNet, VGGNet, ResNet, etc.).
    • Achieved superior performance compared to existing model distillation methods on benchmark datasets (MNIST, CIFAR-10/100, ImageNet).
    • Demonstrated minimal extra training effort required for the SD approach.

    Conclusions:

    • Self-distillation offers an efficient and effective approach to developing high-accuracy small networks.
    • The method successfully learns data representation invariance, leading to improved model generalization.
    • This technique provides a viable alternative to traditional knowledge distillation, reducing computational overhead.