Evolutionary reinforcement learning of dynamical large deviations.

Stephen Whitelam¹, Daniel Jacobson², Isaac Tamblyn³

¹Molecular Foundry, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, California 94720, USA.

The Journal of Chemical Physics

August 6, 2020

Summary

This summary is machine-generated.

This study introduces evolutionary reinforcement learning to calculate dynamical large deviations. This method uses agents to model stochastic processes, enabling the computation of rate functions for complex physics problems.

Area of Science:

Computational Physics
Machine Learning
Statistical Mechanics

Background:

Dynamical large deviations are crucial for understanding rare events in stochastic systems.
Calculating these deviations often involves computationally intensive methods.
Existing frameworks may not fully capture the complexities of path-extensive quantities.
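For orientation, the standard large-deviation form that a rate function refers to can be written as follows. The symbols here are assumptions introduced for illustration and do not appear in the summary: A is a path-extensive quantity accumulated over a trajectory of length T, a = A/T is its time average, and J(a) is the rate function.

```latex
\rho_T(a) \approx e^{-T J(a)}, \qquad a = \frac{A}{T}, \qquad T \to \infty
```

In this notation J(a) is zero at the typical value of a and positive elsewhere, so a large J(a) marks an exponentially rare value of the time average.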

Purpose of the Study:

To develop a novel method for bounding and calculating the likelihood of dynamical large deviations.
To leverage evolutionary reinforcement learning for analyzing stochastic models.
To bridge the gap between physics problems and machine learning frameworks.

Main Methods:

An agent, representing a stochastic model, propagates continuous-time Monte Carlo trajectories.
Rewards are assigned based on the values of path-extensive quantities.
Evolutionary algorithms optimize agents to improve the calculation of large-deviation rate functions.
For large state spaces, neural networks parameterize the model's rates.
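The steps above can be sketched on a toy problem. The following is a minimal illustration under stated assumptions, not the authors' implementation: the original model is a hypothetical two-state process with unit jump rates, the path-extensive quantity is the number of jumps, the agent is a reference model whose two rates are mutated directly (no neural network, since the state space is tiny), and the reward is the tilted objective q + s*a, where q is the time-averaged log ratio of reference to original path weights and a is the number of jumps per unit time. All names (`simulate`, `mutate`, `evolve`, the parameter `s`) are invented for this sketch. Each evaluated reference model gives a point (a, q) in which q estimates an upper bound on the rate function at a.

```python
import math
import random

# Hypothetical original model: two states, one jump rate out of each state.
ORIGINAL_RATES = {(0, 1): 1.0, (1, 0): 1.0}


def simulate(ref_rates, total_time, rng):
    """Run one continuous-time Monte Carlo trajectory of the reference model.

    Returns (a, q), where a is the number of jumps per unit time and
    q = (1/T) * ln(P_ref[path] / P_orig[path]) for the generated path.
    """
    state, t, jumps, log_ratio = 0, 0.0, 0, 0.0
    while True:
        move = (state, 1 - state)
        w_ref, w_orig = ref_rates[move], ORIGINAL_RATES[move]
        dt = rng.expovariate(w_ref)  # waiting time in the current state
        if t + dt >= total_time:
            # Final sojourn: only the survival factors contribute.
            log_ratio -= (w_ref - w_orig) * (total_time - t)
            break
        # Jump factor plus survival factor for this sojourn.
        log_ratio += math.log(w_ref / w_orig) - (w_ref - w_orig) * dt
        t += dt
        jumps += 1
        state = 1 - state
    return jumps / total_time, log_ratio / total_time


def mutate(rates, sigma, rng):
    """Multiply each rate by a log-normal factor, which keeps rates positive."""
    return {m: w * math.exp(sigma * rng.gauss(0.0, 1.0)) for m, w in rates.items()}


def evolve(s, generations=40, population=10, total_time=500.0, sigma=0.2, seed=0):
    """Elitist mutation-and-selection loop that lowers the objective q + s*a.

    Scores come from single noisy trajectories, so the retained best score is
    biased toward lucky evaluations; a careful study would re-evaluate it.
    """
    rng = random.Random(seed)

    def score(rates):
        a, q = simulate(rates, total_time, rng)
        return q + s * a, a, q

    best = dict(ORIGINAL_RATES)
    best_score, best_a, best_q = score(best)
    for _ in range(generations):
        for _ in range(population):
            child = mutate(best, sigma, rng)
            child_score, a, q = score(child)
            if child_score < best_score:
                best, best_score, best_a, best_q = child, child_score, a, q
    return best, best_score, best_a, best_q


if __name__ == "__main__":
    rates, objective, a, q = evolve(s=1.0)
    print("evolved rates:", rates)
    print("a =", a, " bound on J(a): q =", q, " objective =", objective)
```

Repeating the run for several values of `s` would trace out a set of (a, q) points, each an estimated bound on the rate function. For a large state space, the dictionary of rates would be replaced by a parameterized function of the state, which is the role the text assigns to neural networks.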

Main Results:

Demonstrated the feasibility of using evolutionary reinforcement learning to bound and calculate dynamical large deviations.
Showcased the method's applicability to models with varying state space sizes.
Successfully linked path-extensive physics problems to a machine learning framework.

Conclusions:

Evolutionary reinforcement learning offers a powerful new approach for tackling complex problems in statistical mechanics and physics.
This framework facilitates the computation of large-deviation rate functions, previously a significant challenge.
The study highlights the potential of integrating advanced machine learning techniques into physical modeling.