Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Entropy Change in Reversible Processes01:10

Entropy Change in Reversible Processes

3.1K
In the Carnot engine, which achieves the maximum efficiency between two reservoirs of fixed temperatures, the total change in entropy is zero. The observation can be generalized by considering any reversible cyclic process consisting of many Carnot cycles. Thus, it can be stated that the total entropy change of any ideal reversible cycle is zero.
The statement can be further generalized to prove that entropy is a state function. Take a cyclic process between any two points on a p-V diagram.
3.1K
Observational Learning01:12

Observational Learning

713
Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...
713
Avoidance Learning and Learned Helplessness01:14

Avoidance Learning and Learned Helplessness

2.4K
Avoidance learning and learned helplessness are critical concepts in understanding behavioral responses to negative stimuli.
Avoidance learning occurs when an organism learns that a specific behavior can prevent an unpleasant outcome. For example, a student who receives a bad grade may start studying harder to avoid future poor grades. This behavior persists even when the negative outcome is no longer present. Avoidance learning is powerful because it maintains behavior in the absence of the...
2.4K
Generalization, Discrimination, and Extinction01:24

Generalization, Discrimination, and Extinction

1.2K
Generalization, discrimination, and extinction are key concepts in operant conditioning that influence how behaviors are learned and maintained.
Generalization occurs when a behavior reinforced in one context is performed in similar situations. For instance, a student who studies diligently for calculus and receives excellent grades might apply the same study habits to psychology and history, expecting similar results. Generalization shows how learning in one setting can influence behavior in...
1.2K
Reinforcement01:23

Reinforcement

693
Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:
693
Dynamic Equilibrium02:20

Dynamic Equilibrium

60.5K
A reversible chemical reaction represents a chemical process that proceeds in both forward (left to right) and reverse (right to left) directions. When the rates of the forward and reverse reactions are equal, the concentrations of the reactant and product species remain constant over time and the system is at equilibrium. A special double arrow is used to emphasize the reversible nature of the reaction. The relative concentrations of reactants and products in equilibrium systems vary greatly;...
60.5K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Constitutive and inducible oleoresin defenses share genetic architectures and mechanisms in Pinus taeda.

The New phytologist·2026
Same author

Transcriptomic and functional analyses uncover a conserved effector driving genotype-dependent virulence in the <i>Sphaerulina musiva-Populus trichocarpa</i> interaction.

mBio·2026
Same author

Integrated epidemiological and molecular data inform the relationship between precancer and cancer states of esophageal adenocarcinoma.

Nature medicine·2026
Same author

Spatio-temporal dynamics of Hendra virus in Australia reveal stable maintenance of diverse viral clades among Pteropus bats.

Nature microbiology·2026
Same author

A multi-layered approach to elucidate mechanisms of physical function in response to rehabilitation in heart failure with preserved ejection fraction.

medRxiv : the preprint server for health sciences·2026
Same author

Environmental and ecological signals predict food shortages for subtropical populations of Australian flying foxes, reservoirs of Hendra virus.

Biology letters·2026
Same journal

The influence of chirality on the macroscopic behavior of multiferroic smectic phases.

The Journal of chemical physics·2026
Same journal

Polaron transformed canonically consistent quantum master equation.

The Journal of chemical physics·2026
Same journal

The x-ray absorption spectrum of the propargyl radical C3H3●.

The Journal of chemical physics·2026
Same journal

Transient hydroperoxyalkyl intermediates (•QOOH) in isopentane oxidation. I. Conformer- and isomer-resolved infrared spectra.

The Journal of chemical physics·2026
Same journal

Transient hydroperoxyalkyl intermediates (•QOOH) in isopentane oxidation. II. Isomer-resolved unimolecular dynamics.

The Journal of chemical physics·2026
Same journal

Quantum state-to-state dynamics studies of the C(3P) + OH(X2Π) → CO(a3Π) + H(2S) reaction based on a new HCO(12A″) potential energy surface.

The Journal of chemical physics·2026
See all related articles

Related Experiment Video

Updated: Dec 13, 2025

Age-dependent Dynamics of Locomotion in Caenorhabditis elegans: A Lyapunov Exponent Analysis
06:44

Age-dependent Dynamics of Locomotion in Caenorhabditis elegans: A Lyapunov Exponent Analysis

Published on: September 23, 2025

366

Evolutionary reinforcement learning of dynamical large deviations.

Stephen Whitelam1, Daniel Jacobson2, Isaac Tamblyn3

  • 1Molecular Foundry, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, California 94720, USA.

The Journal of Chemical Physics
|August 6, 2020
PubMed
Summary
This summary is machine-generated.

This study introduces evolutionary reinforcement learning to calculate dynamical large deviations. This method uses agents to model stochastic processes, enabling the computation of rate functions for complex physics problems.

More Related Videos

Following the Dynamics of Structural Variants in Experimentally Evolved Populations
04:52

Following the Dynamics of Structural Variants in Experimentally Evolved Populations

Published on: February 3, 2023

1.2K
Author Spotlight: Advancing Protein Engineering &#8211; Harnessing Evolution Through PRANCE and Lab Automation
05:08

Author Spotlight: Advancing Protein Engineering – Harnessing Evolution Through PRANCE and Lab Automation

Published on: January 12, 2024

2.1K

Related Experiment Videos

Last Updated: Dec 13, 2025

Age-dependent Dynamics of Locomotion in Caenorhabditis elegans: A Lyapunov Exponent Analysis
06:44

Age-dependent Dynamics of Locomotion in Caenorhabditis elegans: A Lyapunov Exponent Analysis

Published on: September 23, 2025

366
Following the Dynamics of Structural Variants in Experimentally Evolved Populations
04:52

Following the Dynamics of Structural Variants in Experimentally Evolved Populations

Published on: February 3, 2023

1.2K
Author Spotlight: Advancing Protein Engineering &#8211; Harnessing Evolution Through PRANCE and Lab Automation
05:08

Author Spotlight: Advancing Protein Engineering – Harnessing Evolution Through PRANCE and Lab Automation

Published on: January 12, 2024

2.1K

Area of Science:

  • Computational Physics
  • Machine Learning
  • Statistical Mechanics

Background:

  • Dynamical large deviations are crucial for understanding rare events in stochastic systems.
  • Calculating these deviations often involves computationally intensive methods.
  • Existing frameworks may not fully capture the complexities of path-extensive quantities.

Purpose of the Study:

  • To develop a novel method for bounding and calculating the likelihood of dynamical large deviations.
  • To leverage evolutionary reinforcement learning for analyzing stochastic models.
  • To bridge the gap between physics problems and machine learning frameworks.

Main Methods:

  • An agent, representing a stochastic model, propagates continuous-time Monte Carlo trajectories.
  • Rewards are assigned based on the values of path-extensive quantities.
  • Evolutionary algorithms optimize agents to improve the calculation of large-deviation rate functions.
  • For large state spaces, neural networks parameterize the model's rates.

Main Results:

  • Demonstrated the feasibility of using evolutionary reinforcement learning to bound and calculate dynamical large deviations.
  • Showcased the method's applicability to models with varying state space sizes.
  • Successfully linked path-extensive physics problems to a machine learning framework.

Conclusions:

  • Evolutionary reinforcement learning offers a powerful new approach for tackling complex problems in statistical mechanics and physics.
  • This framework facilitates the computation of large-deviation rate functions, previously a significant challenge.
  • The study highlights the potential of integrating advanced machine learning techniques into physical modeling.