Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Avoidance Learning and Learned Helplessness01:14

Avoidance Learning and Learned Helplessness

Avoidance learning and learned helplessness are critical concepts in understanding behavioral responses to negative stimuli.
Avoidance learning occurs when an organism learns that a specific behavior can prevent an unpleasant outcome. For example, a student who receives a bad grade may start studying harder to avoid future poor grades. This behavior persists even when the negative outcome is no longer present. Avoidance learning is powerful because it maintains behavior in the absence of the...
Observational Learning01:12

Observational Learning

Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning because...
Introduction to Learning01:18

Introduction to Learning

Learning is the process of acquiring knowledge or skills through practice or experience, leading to long-lasting behavioral changes. This acquisition occurs through interaction with the environment and requires practice or experience. For instance, mastering a skill such as surfing requires considerable practice and experience, highlighting the essential role of repeated interactions with the environment in learning.
In contrast to learned behaviors, unlearned behaviors such as crying, sexual...
Implicit Differentiation: Problem Solving01:29

Implicit Differentiation: Problem Solving

Curves defined implicitly, where variables cannot be separated algebraically, require specialized techniques for analysis. The conchoid of Nicomedes exemplifies such a case. Its equation links x and y in a way that prevents isolation of one variable, making implicit differentiation essential to determine the slope and behavior at any point on the curve.The implicit form of the conchoid can be expressed as:To differentiate this equation, y is treated as a function of x, and the chain rule is...
Derivatives: Problem Solving01:26

Derivatives: Problem Solving

Temperature-Dependent Growth of Brook TroutThe growth of brook trout is closely influenced by water temperature. Experimental data demonstrate how trout weight changes over a 24-day period in response to varying water temperatures. At lower temperatures, such as 15.5 degrees Celsius, brook trout show significant weight gain. However, as the temperature increases, the amount of weight gained steadily decreases. At the highest temperature measured, 24.4 degrees Celsius, trout experience a net...
Implicit Differentiation01:25

Implicit Differentiation

In classical mechanics, motion is often described through relationships between spatial coordinates and time. A car moving along a straight highway with constant acceleration serves as a simple case where velocity is an explicit function of time. This scenario results in a linear equation, enabling straightforward analysis using basic differentiation techniques.In contrast, a satellite in circular orbit follows a path defined by an implicit function. The position of the satellite is constrained...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Implications of hierarchical Markov models of behavior: on irreversibility, predictability, and dimensionality.

ArXiv·2026
Same author

Fast efficient coding and sensory adaptation in gain-adaptive recurrent networks.

Nature communications·2026
Same author

Human-level learning of complex novel tasks as theory-based modelling, exploration and planning.

Philosophical transactions. Series A, Mathematical, physical, and engineering sciences·2026
Same author

Measuring and Controlling Solution Degeneracy across Task-Trained Recurrent Neural Networks.

Advances in neural information processing systems·2026
Same author

Deep RL Needs Deep Behavior Analysis: Exploring Implicit Planning by Model-Free Agents in Open-Ended Environments.

Advances in neural information processing systems·2026
Same author

POCO: Scalable Neural Forecasting through Population Conditioning.

Advances in neural information processing systems·2026
Same journal

JADE: Joint Alignment and Deep Embedding for Multi-Slice Spatial Transcriptomics.

Advances in neural information processing systems·2026
Same journal

Learning to Route: Per-Sample Adaptive Routing for Multimodal Multitask Prediction.

Advances in neural information processing systems·2026
Same journal

Emergence and Evolution of Interpretable Concepts in Diffusion Models.

Advances in neural information processing systems·2026
Same journal

The Rich and the Simple: On the Implicit Bias of Adam and SGD.

Advances in neural information processing systems·2026
Same journal

Hyperphantasia: A Benchmark for Evaluating the Mental Visualization Capabilities of Multimodal LLMs.

Advances in neural information processing systems·2026
Same journal

Towards Straggler-Resilient Split Federated Learning: An Unbalanced Update Approach.

Advances in neural information processing systems·2026
See all related articles
  1. Home
  2. Gradient Descent As Loss Landscape Navigation: A Normative Framework For Deriving Learning Rules.
  1. Home
  2. Gradient Descent As Loss Landscape Navigation: A Normative Framework For Deriving Learning Rules.

Related Experiment Videos

Gradient Descent as Loss Landscape Navigation: a Normative Framework for Deriving Learning Rules.

John J Vastola1,2,3, Samuel J Gershman2,3, Kanaka Rajan1,3

  • 1Department of Neurobiology, Harvard Medical School.

Advances in Neural Information Processing Systems
|May 11, 2026

View abstract on PubMed

Summary
This summary is machine-generated.

This study frames learning rules as optimal control problems for navigating loss landscapes. It unifies gradient descent, momentum, and adaptive methods under a single theoretical framework for designing better machine learning algorithms.

Related Experiment Videos

Area of Science:

  • Machine Learning
  • Optimization Theory
  • Computational Neuroscience

Background:

  • Learning rules are fundamental to improving model performance but are often empirically chosen rather than theoretically derived.
  • Understanding the optimality conditions for various learning rules remains a significant challenge in artificial intelligence.

Purpose of the Study:

  • To develop a unified theoretical framework for understanding and deriving optimal learning rules.
  • To investigate the assumptions under which different learning rules are optimal.
  • To provide a principled foundation for designing novel adaptive algorithms.

Main Methods:

  • Casting learning rules as policies for navigating loss landscapes within an optimal control problem framework.
  • Analyzing the emergence of known learning rules (e.g., gradient descent, momentum, Adam) under specific assumptions.
  • Connecting continual learning strategies to task uncertainty within the proposed framework.
  • Main Results:

    • Demonstrated that gradient descent arises from short-horizon optimization.
    • Showed momentum emerges from longer-horizon planning.
    • Explained natural gradients via parameter space geometry and adaptive optimizers like Adam via Bayesian inference.
    • Characterized continual learning as optimal responses to task uncertainty.

    Conclusions:

    • The proposed optimal control framework unifies diverse learning rules under a single objective.
    • This provides a principled approach for designing more effective and adaptive machine learning algorithms.
    • The framework clarifies the computational structure underlying machine learning optimization.