Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Observational Learning01:12

Observational Learning

802
Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...
802
Associative Learning01:27

Associative Learning

1.2K
Associative learning is a fundamental concept in behavioral psychology, wherein a connection is established between two stimuli or events, leading to a learned response. This process is critical in understanding how behaviors are acquired and modified. Conditioning, the mechanism through which associations are formed, can be divided into two main types: classical conditioning and operant conditioning, each elucidating different aspects of associative learning.
Classical conditioning, also known...
1.2K
Purposive Learning01:22

Purposive Learning

426
E. C. Tolman emphasized the purposiveness of behavior — the idea that much of our behavior is goal-directed. For instance, employees who aim for a promotion work diligently to meet their targets. Tolman argued that when classical conditioning and operant conditioning occur, the organism acquires certain expectations. In classical conditioning, a child might fear a dog because they expect it to bite. In operant conditioning, a person might consistently work overtime because they expect a...
426
Deductive Reasoning01:16

Deductive Reasoning

63.7K
Deductive reasoning, or deduction, is the type of logic used in hypothesis-based science. In deductive reasoning, the pattern of thinking moves in the opposite direction as compared to inductive reasoning, which means that it uses a general principle or law to predict specific results. From those general principles, a scientist can deduce and predict the specific results that would be valid as long as the general principles are valid.
For example, a researcher can deduce specific predictions...
63.7K
Generalization, Discrimination, and Extinction01:24

Generalization, Discrimination, and Extinction

1.3K
Generalization, discrimination, and extinction are key concepts in operant conditioning that influence how behaviors are learned and maintained.
Generalization occurs when a behavior reinforced in one context is performed in similar situations. For instance, a student who studies diligently for calculus and receives excellent grades might apply the same study habits to psychology and history, expecting similar results. Generalization shows how learning in one setting can influence behavior in...
1.3K
Inductive Reasoning00:59

Inductive Reasoning

64.5K
Inductive reasoning is a form of logical thinking that uses related observations to arrive at a general conclusion. It is uncertain and operates in degrees to which the conclusions are credible. As such, inductive arguments can be weak or strong, rather than valid or invalid, and conclusions can be used to formulate testable, falsifiable hypotheses.
Inductive reasoning is common in descriptive science. A life scientist makes observations and records them. This data can be qualitative or...
64.5K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

State-switching navigation strategies in <i>Caenorhabditis elegans</i> are beneficial for chemotaxis.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same author

Compact deep neural network models of the visual cortex.

Nature·2026
Same author

Identifying the factors governing internal state switches during nonstationary sensory decision-making.

Nature communications·2025
Same author

Fast Optimization of Robust Transcriptomics Embeddings using Probabilistic Inference Autoencoder Networks for multi-Omics.

bioRxiv : the preprint server for biology·2025
Same author

Improved inference of latent neural states from calcium imaging data.

bioRxiv : the preprint server for biology·2025
Same author

Bridging the gap between the connectome and whole-brain activity in <i>C. elegans</i>.

bioRxiv : the preprint server for biology·2025
Same journal

Layered social competition coordinates reproductive hierarchy formation in ants.

bioRxiv : the preprint server for biology·2026
Same journal

Combination epigenetic-targeted therapy increases the immunogenicity of poorly immunogenic sarcomas.

bioRxiv : the preprint server for biology·2026
Same journal

Loss of LanC-like proteins delays post-injury regeneration of aging skeletal muscles.

bioRxiv : the preprint server for biology·2026
Same journal

Integrative Transfer Network: Deep Transfer Learning Across Populations and Prediction Targets.

bioRxiv : the preprint server for biology·2026
Same journal

Confidence-supported label-free metabolic imaging with FPhaS phase autofluorescence microscopy.

bioRxiv : the preprint server for biology·2026
Same journal

Sequence-encoded autoinhibition couples mRNA decapping activity to phase separation.

bioRxiv : the preprint server for biology·2026
See all related articles

Related Experiment Video

Updated: Jan 11, 2026

Quantifying Learning in Young Infants: Tracking Leg Actions During a Discovery-learning Task
11:18

Quantifying Learning in Young Infants: Tracking Leg Actions During a Discovery-learning Task

Published on: June 1, 2015

11.1K

Inferring learning rules during de novo task learning.

Victor Geadah1, Jonathan W Pillow1,2

  • 1Program in Applied and Computational Mathematics, Princeton University, NJ.

Biorxiv : the Preprint Server for Biology
|November 19, 2025
PubMed
Summary
This summary is machine-generated.

Neuroscientists developed a new statistical framework to uncover how animals learn new tasks from scratch. This approach reveals policy-gradient-like learning rules, differing from standard reinforcement learning models.

More Related Videos

Measuring Statistical Learning Across Modalities and Domains in School-Aged Children Via an Online Platform and Neuroimaging Techniques
08:05

Measuring Statistical Learning Across Modalities and Domains in School-Aged Children Via an Online Platform and Neuroimaging Techniques

Published on: June 30, 2020

8.0K
Exploring the Role of Deontic Reasoning and World Knowledge in Wason&#180;s Selection Task
06:08

Exploring the Role of Deontic Reasoning and World Knowledge in Wason´s Selection Task

Published on: July 22, 2025

791

Related Experiment Videos

Last Updated: Jan 11, 2026

Quantifying Learning in Young Infants: Tracking Leg Actions During a Discovery-learning Task
11:18

Quantifying Learning in Young Infants: Tracking Leg Actions During a Discovery-learning Task

Published on: June 1, 2015

11.1K
Measuring Statistical Learning Across Modalities and Domains in School-Aged Children Via an Online Platform and Neuroimaging Techniques
08:05

Measuring Statistical Learning Across Modalities and Domains in School-Aged Children Via an Online Platform and Neuroimaging Techniques

Published on: June 30, 2020

8.0K
Exploring the Role of Deontic Reasoning and World Knowledge in Wason&#180;s Selection Task
06:08

Exploring the Role of Deontic Reasoning and World Knowledge in Wason´s Selection Task

Published on: July 22, 2025

791

Area of Science:

  • Neuroscience
  • Computational Neuroscience
  • Animal Behavior

Background:

  • Identifying learning rules governing behavior is a key neuroscience challenge.
  • Reinforcement learning (RL) provides a framework, but studies often use non-stationary environments, not de novo learning.
  • Understanding how animals acquire entirely new tasks is crucial.

Purpose of the Study:

  • To introduce a statistical framework for inferring RL rules directly from single-animal behavior.
  • To compare policy-gradient-like rules with classical temporal-difference algorithms for de novo task learning.
  • To uncover systematic deviations from standard RL models in animal learning.

Main Methods:

  • Developed a statistical framework to infer RL rules from behavioral data.
  • Applied the framework to mice learning a perceptual decision-making task.
  • Fitted flexible parametric learning rules to behavioral data.

Main Results:

  • Policy-gradient-like rules better explain de novo task learning than temporal-difference algorithms.
  • Identified deviations from standard RL, including side-specific learning rates and negative reward baselines.
  • Discovered that animals adapt learning rates dynamically over training and across curricula.

Conclusions:

  • The framework provides a statistical account of how animals learn new tasks from scratch.
  • Animal learning exhibits key departures from classical reinforcement learning algorithms.
  • Findings offer insights into the neural mechanisms underlying adaptive learning.