Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Reinforcement01:23

Reinforcement

Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:
Decision Making: P-value Method01:09

Decision Making: P-value Method

The process of hypothesis testing based on the P-value method includes calculating the P- value using the sample data and interpreting it.
First, a specific claim about the population parameter is proposed. The claim is based on the research question and is stated in a simple form. Further, an opposing statement to the claim  is also stated. These statements can act as null and alternative hypotheses:  a null hypothesis would be a neutral statement while the alternative hypothesis can have a...
Reinforcement Schedules01:24

Reinforcement Schedules

Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...
Observational Learning01:12

Observational Learning

Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning because...
Randomized Experiments01:13

Randomized Experiments

The randomization process involves assigning study participants randomly to experimental or control groups based on their probability of being equally assigned. Randomization is meant to eliminate selection bias and balance known and unknown confounding factors so that the control group is similar to the treatment group as much as possible. A computer program and a random number generator can be used to assign participants to groups in a way that minimizes bias.
Simple randomization
Simple...
Rolling Resistance: Problem Solving01:17

Rolling Resistance: Problem Solving

Rolling resistance, also known as rolling friction, is the force that resists the motion of a rolling object, such as a wheel, tire, or ball, when it moves over a surface. It is caused by the deformation of the object and the surface in contact with each other, as well as other factors like internal friction, hysteresis, and energy losses within the materials. Rolling resistance opposes the object's motion, requiring additional energy to overcome it and maintain movement. In practical...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Hybrid atomistic-parametric decoherence model for molecular spin qubits.

The Journal of chemical physics·2026
Same author

Excess entropy scaling explains the enhanced dynamics of the ionic liquid 1-ethyl-3-methylimidazolium chloride in external electric fields.

Physical chemistry chemical physics : PCCP·2025
Same author

Accelerating phase diagram construction through activity coefficient prediction.

The Journal of chemical physics·2025
Same author

Automated High-Content, High-Throughput Spatial Analysis Pipeline for Drug Screening in 3D Tumor Spheroid Inverted Colloidal Crystal Arrays.

ACS applied materials & interfaces·2025
Same author

Structure-Property Relationship for the Electrostrictive Properties of Ionic Liquids.

The journal of physical chemistry. B·2025
Same author

Expanding Cluster, Enhancing Adsorption: Investigating the Role of Electrostatic Configurations on Water Vapor Adsorption.

Langmuir : the ACS journal of surfaces and colloids·2025
Same journal

Synergistic Interactions During Co-Hydrothermal Liquefaction of Food Waste and Biomass Model Compounds for Increased Sustainable Aviation Fuel Production.

ACS engineering Au·2026
Same journal

Flow Modeling, Reactor Engineering, and Process Intensification: From Insight to Performance.

ACS engineering Au·2026
Same journal

Structural Modulation of TpPa‑1 Covalent Organic Framework in Flow Synthesis Guided by Green Chemistry Principles.

ACS engineering Au·2026
Same journal

Mechanistic Insights for Plasma-Catalytic CO<sub>2</sub> Reduction over TiO<sub>2</sub> in a Dielectric Barrier Discharge Reactor.

ACS engineering Au·2026
Same journal

Optical Coherence Tomography Velocimetry for In-Line Processing of Biologics: Concentrated and Gelling Monoclonal Antibody Solutions.

ACS engineering Au·2026
Same journal

Graphene Oxide Membranes for Sustainable Recycling: Poly(styrene) Fractionation by Organic Solvent Nanofiltration.

ACS engineering Au·2026
See all related articles

Related Experiment Videos

PPO-GPR: A Custom Proximal Policy Optimization Tool for Active Reinforcement Learning.

Etinosa Osaro1, Yamil J Colón1

  • 1Department of Chemical and Biomolecular Engineering, University of Notre Dame, Notre Dame, Indiana 46556, United States.

ACS Engineering Au
|June 22, 2026
PubMed
Summary
This summary is machine-generated.

This study introduces a new active learning framework using Proximal Policy Optimization (PPO) and Gaussian Process Regression (GPR) for efficient data selection in material science. The method significantly reduces data acquisition needs for predicting gas selectivity in metal-organic frameworks (MOFs).

Keywords:
Gaussian process regressionactive learninggas mixture separationsmetal−organic frameworksproximal policy optimizationreinforcement learning

Related Experiment Videos

Area of Science:

  • Computational Material Science
  • Machine Learning in Chemistry
  • Reinforcement Learning for Scientific Discovery

Background:

  • Expensive and time-consuming data acquisition hinders progress in material science.
  • Predictive modeling requires strategic data selection to maximize efficiency.
  • Active learning offers a promising approach to optimize data acquisition.

Purpose of the Study:

  • To develop a novel active learning framework integrating Proximal Policy Optimization (PPO) with Gaussian Process Regression (GPR).
  • To strategically select informative data points for enhanced predictive modeling in material science.
  • To accelerate the discovery of new materials and optimize gas separation processes.

Main Methods:

  • Integration of PPO with GPR for guided data acquisition.
  • Development of a custom Gymnasium environment for PPO agent training.
  • Utilizing R-squared score for GPR performance evaluation and action masking to prevent data redundancy.

Main Results:

  • Achieved 77-86% data savings compared to full GCMC grids for predicting methane selectivity in CuBTC and IRMOF-1.
  • Successfully queried only ~14-23% of the candidate pool while maintaining high predictive accuracy (R-squared, MAE, RMSE).
  • Demonstrated stable convergence of the clipped-update PPO policy by focusing on critical pressure-temperature-composition regions.

Conclusions:

  • The PPO-GPR framework enables highly efficient data selection for predictive modeling in material science.
  • This approach significantly accelerates material discovery and optimizes gas separation processes.
  • Combines reinforcement learning and regression models for effective scientific acceleration.