Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Generalization, Discrimination, and Extinction

Generalization, Discrimination, and Extinction

Generalization, discrimination, and extinction are key concepts in operant conditioning that influence how behaviors are learned and maintained.
Generalization occurs when a behavior reinforced in one context is performed in similar situations. For instance, a student who studies diligently for calculus and receives excellent grades might apply the same study habits to psychology and history, expecting similar results. Generalization shows how learning in one setting can influence behavior in...

Graded Potential

Graded Potential

Graded potentials are localized fluctuations in the cell membrane's electrical charge, commonly found in the dendrites of neurons. The magnitude of these potential changes depends on the strength of the initiating stimulus. In a membrane at its resting potential, a graded potential signifies a voltage shift either above -70 mV or below -70 mV.
Graded potentials fall into two categories: depolarizing and hyperpolarizing. Depolarizing graded potentials typically occur when sodium (Na+) or...

What is an Electrochemical Gradient?

What is an Electrochemical Gradient?

Adenosine triphosphate, or ATP, is considered the primary energy source in cells. However, energy can also be stored in the electrochemical gradient of an ion across the plasma membrane, which is determined by two factors: its chemical and electrical gradients.
The chemical gradient relies on differences in the abundance of a substance on the outside versus the inside of a cell and flows from areas of high to low ion concentration. In contrast, the electrical gradient revolves around an...

Regression Toward the Mean

Regression Toward the Mean

Regression toward the mean (“RTM”) is a phenomenon in which extremely high or low values—for example, and individual’s blood pressure at a particular moment—appear closer to a group’s average upon remeasuring. Although this statistical peculiarity is the result of random error and chance, it has been problematic across various medical, scientific, financial and psychological applications. In particular, RTM, if not taken into account, can interfere when...

Poisson's And Laplace's Equation

Poisson's And Laplace's Equation

The electric potential of the system can be calculated by relating it to the electric charge densities that give rise to the electric potential. The differential form of Gauss's law expresses the electric field's divergence in terms of the electric charge density.

Improving Translational Accuracy

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

The purine-rich element-binding protein ChPur-α negatively regulates Hsc70 transcription in Crassostrea hongkongensis.

Cell stress & chaperones·2017

Same author

Improved antitumor effect of ionizing radiation in combination with rapamycin for treating nasopharyngeal carcinoma.

Oncology letters·2017

Same author

Roles of Cells from the Arterial Vessel Wall in Atherosclerosis.

Mediators of inflammation·2017

Same author

Metabolic and microbial signatures in rat hepatocellular carcinoma treated with caffeic acid and chlorogenic acid.

Scientific reports·2017

Same author

Arsenic removal in aqueous solution by a novel Fe-Mn modified biochar composite: Characterization and mechanism.

Ecotoxicology and environmental safety·2017

Same author

Antidiabetic activities of polysaccharides separated from Inonotus obliquus via the modulation of oxidative stress in mice with streptozotocin-induced diabetes.

PloS one·2017

Same journal

A Model-Free Reinforcement Learning Implementation of Decision Making Under Uncertainty by Sequential Sampling.

Neural computation·2026

Same journal

DROP: Distributional and Regular Optimism and Pessimism for Reinforcement Learning.

Neural computation·2026

Same journal

Hierarchical Active Inference Using Successor Representations.

Neural computation·2026

Same journal

W-Kernel and Its Principal Space for Frequentist Evaluation of Bayesian Estimators.

Neural computation·2026

Same journal

A Hidden Markov Model-Inspired Sequence Classification Method for Hyperdimensional Computing.

Neural computation·2026

Same journal

Sparse Graphical Modeling for Electrophysiological Phase-Based Connectivity Using Circular Statistics.

Neural computation·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jun 7, 2025

Deep Neural Networks for Image-Based Dietary Assessment

Deep Neural Networks for Image-Based Dietary Assessment

Published on: March 13, 2021

Generalization Guarantees of Gradient Descent for Shallow Neural Networks.

Puyu Wang¹, Yunwen Lei², Di Wang³

¹Hong Kong Baptist University, Hong Kong wangpuyu1026@gmail.com.

Neural Computation

|November 18, 2024

Summary

This summary is machine-generated.

This study analyzes the generalization of neural networks (NNs) using algorithmic stability, extending previous work to two- and three-layer networks. We show gradient descent (GD) can achieve O(1/n) risk rates, revealing conditions for effective training.

More Related Videos

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data

Published on: September 25, 2021

Related Experiment Videos

Last Updated: Jun 7, 2025

Deep Neural Networks for Image-Based Dietary Assessment

Deep Neural Networks for Image-Based Dietary Assessment

Published on: March 13, 2021

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data

A Virtual Machine Platform for Non-Computer Professionals for Using Deep Learning to Classify Biological Sequences of Metagenomic Data

Published on: September 25, 2021

Area of Science:

Machine Learning
Deep Learning Theory
Algorithmic Stability

Background:

Understanding neural network (NN) generalization is crucial for reliable AI.
Algorithmic stability provides a framework for analyzing generalization.
Previous studies primarily focused on single-hidden-layer networks, neglecting network scaling effects.

Purpose of the Study:

To extend algorithmic stability and generalization analysis to two- and three-layer neural networks trained by gradient descent (GD).
To investigate the impact of network scaling on generalization.
To derive conditions for achieving optimal risk rates in NNs.

Main Methods:

Comprehensive stability and generalization analysis of GD for two- and three-layer NNs.
Relaxing previous conditions for two-layer NNs under general network scaling.
Utilizing a novel induction strategy to demonstrate the nearly co-coercive property of three-layer NNs, considering overparameterization.

Main Results:

Derived an excess risk rate of O(1/n) for GD in both two- and three-layer NNs.
Identified sufficient and necessary conditions for under- and over-parameterized NNs to achieve the O(1/n) risk rate.
Demonstrated that increased scaling factors or decreased network complexity reduce the required overparameterization for optimal error rates.
Achieved a fast O(1/n) risk rate under low-noise conditions for both network types.

Conclusions:

The study provides a generalized understanding of GD generalization for deeper networks.
Network scaling and complexity are key factors influencing generalization performance.
The findings offer practical insights into training NNs for improved generalization and error rates.