Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Observational Learning

Observational Learning

Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...

Associative Learning

Associative Learning

Associative learning is a fundamental concept in behavioral psychology, wherein a connection is established between two stimuli or events, leading to a learned response. This process is critical in understanding how behaviors are acquired and modified. Conditioning, the mechanism through which associations are formed, can be divided into two main types: classical conditioning and operant conditioning, each elucidating different aspects of associative learning.
Classical conditioning, also known...

Purposive Learning

Purposive Learning

E. C. Tolman emphasized the purposiveness of behavior — the idea that much of our behavior is goal-directed. For instance, employees who aim for a promotion work diligently to meet their targets. Tolman argued that when classical conditioning and operant conditioning occur, the organism acquires certain expectations. In classical conditioning, a child might fear a dog because they expect it to bite. In operant conditioning, a person might consistently work overtime because they expect a...

Deductive Reasoning

Deductive Reasoning

Deductive reasoning, or deduction, is the type of logic used in hypothesis-based science. In deductive reasoning, the pattern of thinking moves in the opposite direction as compared to inductive reasoning, which means that it uses a general principle or law to predict specific results. From those general principles, a scientist can deduce and predict the specific results that would be valid as long as the general principles are valid.
For example, a researcher can deduce specific predictions...

Generalization, Discrimination, and Extinction

Generalization, Discrimination, and Extinction

Generalization, discrimination, and extinction are key concepts in operant conditioning that influence how behaviors are learned and maintained.
Generalization occurs when a behavior reinforced in one context is performed in similar situations. For instance, a student who studies diligently for calculus and receives excellent grades might apply the same study habits to psychology and history, expecting similar results. Generalization shows how learning in one setting can influence behavior in...

Inductive Reasoning

Inductive Reasoning

Inductive reasoning is a form of logical thinking that uses related observations to arrive at a general conclusion. It is uncertain and operates in degrees to which the conclusions are credible. As such, inductive arguments can be weak or strong, rather than valid or invalid, and conclusions can be used to formulate testable, falsifiable hypotheses.
Inductive reasoning is common in descriptive science. A life scientist makes observations and records them. This data can be qualitative or...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

State-switching navigation strategies in <i>Caenorhabditis elegans</i> are beneficial for chemotaxis.

Proceedings of the National Academy of Sciences of the United States of America·2026

Same author

Compact deep neural network models of the visual cortex.

Nature·2026

Same author

Identifying the factors governing internal state switches during nonstationary sensory decision-making.

Nature communications·2025

Same author

Fast Optimization of Robust Transcriptomics Embeddings using Probabilistic Inference Autoencoder Networks for multi-Omics.

bioRxiv : the preprint server for biology·2025

Same author

Improved inference of latent neural states from calcium imaging data.

bioRxiv : the preprint server for biology·2025

Same author

Bridging the gap between the connectome and whole-brain activity in <i>C. elegans</i>.

bioRxiv : the preprint server for biology·2025

Same journal

Layered social competition coordinates reproductive hierarchy formation in ants.

bioRxiv : the preprint server for biology·2026

Same journal

Combination epigenetic-targeted therapy increases the immunogenicity of poorly immunogenic sarcomas.

bioRxiv : the preprint server for biology·2026

Same journal

Loss of LanC-like proteins delays post-injury regeneration of aging skeletal muscles.

bioRxiv : the preprint server for biology·2026

Same journal

Integrative Transfer Network: Deep Transfer Learning Across Populations and Prediction Targets.

bioRxiv : the preprint server for biology·2026

Same journal

Confidence-supported label-free metabolic imaging with FPhaS phase autofluorescence microscopy.

bioRxiv : the preprint server for biology·2026

Same journal

Sequence-encoded autoinhibition couples mRNA decapping activity to phase separation.

bioRxiv : the preprint server for biology·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jan 11, 2026

Quantifying Learning in Young Infants: Tracking Leg Actions During a Discovery-learning Task

Quantifying Learning in Young Infants: Tracking Leg Actions During a Discovery-learning Task

Published on: June 1, 2015

Inferring learning rules during de novo task learning.

Victor Geadah¹, Jonathan W Pillow^1,2

¹Program in Applied and Computational Mathematics, Princeton University, NJ.

Biorxiv : the Preprint Server for Biology

|November 19, 2025

Summary

This summary is machine-generated.

Neuroscientists developed a new statistical framework to uncover how animals learn new tasks from scratch. This approach reveals policy-gradient-like learning rules, differing from standard reinforcement learning models.

More Related Videos

Measuring Statistical Learning Across Modalities and Domains in School-Aged Children Via an Online Platform and Neuroimaging Techniques

Measuring Statistical Learning Across Modalities and Domains in School-Aged Children Via an Online Platform and Neuroimaging Techniques

Published on: June 30, 2020

Exploring the Role of Deontic Reasoning and World Knowledge in Wason´s Selection Task

Exploring the Role of Deontic Reasoning and World Knowledge in Wason´s Selection Task

Published on: July 22, 2025

Related Experiment Videos

Last Updated: Jan 11, 2026

Quantifying Learning in Young Infants: Tracking Leg Actions During a Discovery-learning Task

Quantifying Learning in Young Infants: Tracking Leg Actions During a Discovery-learning Task

Published on: June 1, 2015

Measuring Statistical Learning Across Modalities and Domains in School-Aged Children Via an Online Platform and Neuroimaging Techniques

Measuring Statistical Learning Across Modalities and Domains in School-Aged Children Via an Online Platform and Neuroimaging Techniques

Published on: June 30, 2020

Exploring the Role of Deontic Reasoning and World Knowledge in Wason´s Selection Task

Exploring the Role of Deontic Reasoning and World Knowledge in Wason´s Selection Task

Published on: July 22, 2025

Area of Science:

Neuroscience
Computational Neuroscience
Animal Behavior

Background:

Identifying learning rules governing behavior is a key neuroscience challenge.
Reinforcement learning (RL) provides a framework, but studies often use non-stationary environments, not de novo learning.
Understanding how animals acquire entirely new tasks is crucial.

Purpose of the Study:

To introduce a statistical framework for inferring RL rules directly from single-animal behavior.
To compare policy-gradient-like rules with classical temporal-difference algorithms for de novo task learning.
To uncover systematic deviations from standard RL models in animal learning.

Main Methods:

Developed a statistical framework to infer RL rules from behavioral data.
Applied the framework to mice learning a perceptual decision-making task.
Fitted flexible parametric learning rules to behavioral data.

Main Results:

Policy-gradient-like rules better explain de novo task learning than temporal-difference algorithms.
Identified deviations from standard RL, including side-specific learning rates and negative reward baselines.
Discovered that animals adapt learning rates dynamically over training and across curricula.

Conclusions:

The framework provides a statistical account of how animals learn new tasks from scratch.
Animal learning exhibits key departures from classical reinforcement learning algorithms.
Findings offer insights into the neural mechanisms underlying adaptive learning.