Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Observational Learning

Observational Learning

Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...

Reinforcement

Reinforcement

Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:

Avoidance Learning and Learned Helplessness

Avoidance Learning and Learned Helplessness

Avoidance learning and learned helplessness are critical concepts in understanding behavioral responses to negative stimuli.
Avoidance learning occurs when an organism learns that a specific behavior can prevent an unpleasant outcome. For example, a student who receives a bad grade may start studying harder to avoid future poor grades. This behavior persists even when the negative outcome is no longer present. Avoidance learning is powerful because it maintains behavior in the absence of the...

Reinforcement Schedules

Reinforcement Schedules

Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...

Associative Learning

Associative Learning

Associative learning is a fundamental concept in behavioral psychology, wherein a connection is established between two stimuli or events, leading to a learned response. This process is critical in understanding how behaviors are acquired and modified. Conditioning, the mechanism through which associations are formed, can be divided into two main types: classical conditioning and operant conditioning, each elucidating different aspects of associative learning.
Classical conditioning, also known...

Introduction to Learning

Introduction to Learning

Learning is the process of acquiring knowledge or skills through practice or experience, leading to long-lasting behavioral changes. This acquisition occurs through interaction with the environment and requires practice or experience. For instance, mastering a skill such as surfing requires considerable practice and experience, highlighting the essential role of repeated interactions with the environment in learning.
In contrast to learned behaviors, unlearned behaviors such as crying, sexual...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Confinement controls bacterial spreading at all scales.

Journal of the Royal Society, Interface·2026

Same author

Zoology of collective patterns modulated by non-reciprocal, long-range interactions.

Soft matter·2026

Same author

Active stop and go motion: A strategy to improve spatial exploration and survival.

Physical review. E·2025

Same author

Spreading processes on heterogeneous active systems: Spreading threshold, immunization strategies, and vaccination noise.

Physical review. E·2025

Same author

Self-trapping of active particles with nonreciprocal interactions in disordered media.

Physical review. E·2025

Same author

Structural dynamics and optimal transport of an active polymer.

Soft matter·2024

Same journal

Erratum: Low-dimensional model for adaptive networks of spiking neurons [Phys. Rev. E 111, 014422 (2025)].

Physical review. E·2026

Same journal

Disentangling the effects of many-body forces on depletion interactions.

Physical review. E·2026

Same journal

Charge transport and mode transition in dual-energy electron beam diodes.

Physical review. E·2026

Same journal

Optimization of multisite reactions in complex compartmentalized media.

Physical review. E·2026

Same journal

Origin of geometric cohesion in nonconvex granular materials: Interplay between interdigitation and rotational constraints enhancing frictional stability.

Physical review. E·2026

Same journal

Interaction of walkers with a standing Faraday wave.

Physical review. E·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Dec 12, 2025

Recording Single Neurons' Action Potentials from Freely Moving Pigeons Across Three Stages of Learning

Recording Single Neurons' Action Potentials from Freely Moving Pigeons Across Three Stages of Learning

Published on: June 2, 2014

Learning to flock through reinforcement.

Mihir Durve¹, Fernando Peruani², Antonio Celani³

¹Department of Physics, Università degli studi di Trieste, Trieste 34127, Italy and Quantitative Life Sciences Unit, The Abdus Salam International Centre for Theoretical Physics (ICTP), Trieste 34151, Italy.

Physical Review. E

|August 16, 2020

Summary

This summary is machine-generated.

Multiagent reinforcement learning enables agents to learn flocking behavior. Agents learn to maintain group cohesion by aligning their velocity with neighbors, mimicking natural flocking strategies.

More Related Videos

A Method for Investigating Change Blindness in Pigeons Columba Livia

A Method for Investigating Change Blindness in Pigeons Columba Livia

Published on: September 7, 2018

Operant Learning of Drosophila at the Torque Meter

Operant Learning of Drosophila at the Torque Meter

Published on: June 16, 2008

Related Experiment Videos

Last Updated: Dec 12, 2025

Recording Single Neurons' Action Potentials from Freely Moving Pigeons Across Three Stages of Learning

Recording Single Neurons' Action Potentials from Freely Moving Pigeons Across Three Stages of Learning

Published on: June 2, 2014

A Method for Investigating Change Blindness in Pigeons Columba Livia

A Method for Investigating Change Blindness in Pigeons Columba Livia

Published on: September 7, 2018

Operant Learning of Drosophila at the Torque Meter

Operant Learning of Drosophila at the Torque Meter

Published on: June 16, 2008

Area of Science:

Collective behavior
Artificial intelligence
Computational neuroscience

Background:

Coordinated group motion, like bird flocking and fish schooling, emerges from individual actions.
Understanding the mechanisms behind collective behavior is crucial in various biological and artificial systems.

Purpose of the Study:

To investigate flocking behavior using multiagent reinforcement learning.
To determine if agents can learn flocking strategies through interaction and limited sensory input.
To analyze the emergent navigation strategies and their relation to known models.

Main Methods:

Utilized standard reinforcement learning algorithms.
Simulated agents with limited sensory input (neighboring velocities).
Trained agents against 'teacher' agents and in self-organized groups.

Main Results:

Learning agents successfully acquired flocking behavior, either by imitating teachers or through self-organization.
The emergent strategy matched the polar velocity alignment of the Vicsek model.
Velocity alignment was identified as an optimal strategy for maintaining group cohesion with limited sensory information.

Conclusions:

Velocity alignment may be an evolved adaptive behavior for minimizing neighbor loss.
This alignment strategy effectively promotes local polar order and group cohesion.
Reinforcement learning provides a framework for understanding the emergence of complex collective behaviors.