Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Entropy Change in Reversible Processes

Entropy Change in Reversible Processes

In the Carnot engine, which achieves the maximum efficiency between two reservoirs of fixed temperatures, the total change in entropy is zero. The observation can be generalized by considering any reversible cyclic process consisting of many Carnot cycles. Thus, it can be stated that the total entropy change of any ideal reversible cycle is zero.
The statement can be further generalized to prove that entropy is a state function. Take a cyclic process between any two points on a p-V diagram.

Entropy

Entropy

Salt particles that have dissolved in water never spontaneously come back together in solution to reform solid particles. Moreover, a gas that has expanded in a vacuum remains dispersed and never spontaneously reassembles. The unidirectional nature of these phenomena is the result of a thermodynamic state function called entropy (S). Entropy is the measure of the extent to which the energy is dispersed throughout a system, or in other words, it is proportional to the degree of disorder of a...

Entropy

Entropy

The first law of thermodynamics is quantitatively formulated via an equation relating the internal energy of a system, the heat exchanged by it, and the work done on it. A quantitative formulation of the second law of thermodynamics leads to defining a state function, the entropy.
When an ideal gas expands isothermally, the disorder in the gas increases. From the molecular perspective, the gas molecules have more volume to move around in.
Consider an infinitesimal step in the expansion, which...

Standard Entropy Change for a Reaction

Standard Entropy Change for a Reaction

Entropy is a state function, so the standard entropy change for a chemical reaction (ΔS°rxn) can be calculated from the difference in standard entropy between the products and the reactants.

Stability of Equilibrium Configuration: Problem Solving

Stability of Equilibrium Configuration: Problem Solving

The stability of equilibrium configurations is an important concept in physics, engineering, and other related fields. In simple terms, it refers to the tendency of an object or system to return to its equilibrium position after being disturbed. The stability of an equilibrium configuration can be analyzed by considering the potential energy function of the system and examining its behavior near the equilibrium point.
Problem-solving in the context of the stability of equilibrium configuration...

Stability of Equilibrium Configuration

Stability of Equilibrium Configuration

Understanding the stability of equilibrium configurations is a fundamental part of mechanical engineering. In any system, there are three distinct types of equilibrium: stable, neutral, and unstable.
A stable equilibrium occurs when a system tends to return to its original position when given a small displacement, and the potential energy is at its minimum. An example of a stable equilibrium is when a cantilever beam is fixed at one end and a weight is attached to the other end. If the weight...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

RNA regulation in plants.

Science China. Life sciences·2026

Same author

RBM10 Deficiency Promotes Anti-PD-1 Resistance in LUAD via STING Alternative Splicing-Driven CCL7 Signaling and Macrophage Polarization.

Advanced science (Weinheim, Baden-Wurttemberg, Germany)·2026

Same author

Development and translation of biodegradable metal stents: from heart to brain.

Regenerative biomaterials·2026

Same author

Vertical Interaction between Thiourea and Perovskite Surface Results in Obviously Enhanced Performance with PCE Surpassing 24% Efficiency.

ACS applied materials & interfaces·2026

Same author

The mutated CYTOKININ OXIDASE/DEHYDROGENASE 7 promotes cell division in pith and plays a critical role in the development of stem lettuce.

The Plant journal : for cell and molecular biology·2026

Same author

Analysis of the effect and correlation of the co-care model on the diagnosis and treatment of type 2 diabetes patients.

Open medicine (Warsaw, Poland)·2026

Same journal

Granular Ball-Based Noise-Resistant Fuzzy Multineighborhood Feature Selection via Label Enhancement and Feature Graph.

IEEE transactions on neural networks and learning systems·2026

Same journal

Fighting Evolving Spam With ARTMAP Models: A Noise-Resilient Online Detection Framework.

IEEE transactions on neural networks and learning systems·2026

Same journal

HyperSAT: Unsupervised Hypergraph Neural Networks for Weighted MaxSAT Problems.

IEEE transactions on neural networks and learning systems·2026

Same journal

Negation of Basic Belief Assignment in Multisource Information Fusion on Dempster-Shafer Theory With Applications in Pattern Classification.

IEEE transactions on neural networks and learning systems·2026

Same journal

Intervention Feasible Region and Driver Risk Capacity Aware Human-Machine Collaborative Safe Trajectory Planning.

IEEE transactions on neural networks and learning systems·2026

Same journal

A Unified Differential Denoising Learning Framework With a Pre-Trained Model and Fuzzy Graph Networks for Drug-Drug Interaction Prediction.

IEEE transactions on neural networks and learning systems·2026

See all related articles

Search research articles

Related Experiment Videos

Universal Stabilization for Maximum Entropy Optimization in Reinforcement Learning.

Xing Chen, Yewen Li, Xiaofeng Cao

IEEE Transactions on Neural Networks and Learning Systems

|November 20, 2025

Summary

This summary is machine-generated.

Maximum entropy reinforcement learning (RL) methods face stability issues. A new beta-symmetric KL divergence objective stabilizes policies and Q-functions, improving RL performance.

Related Experiment Videos

Area of Science:

Reinforcement Learning
Machine Learning Theory
Decision Making

Background:

Maximum entropy reinforcement learning (RL) methods enhance robustness but suffer from convergence difficulties.
Issues include suboptimal policy stabilization and unstable Q-value updates, termed 'tremulous policy' and 'spiky Q-function'.

Purpose of the Study:

To address the stability and convergence challenges in maximum entropy RL.
To introduce a novel objective function that mitigates tremulous policies and spiky Q-functions.

Main Methods:

Introduced a beta-symmetric Kullback-Leibler (KL) divergence objective within the maximum entropy framework.
Developed a method called max-entropy stable optimization (MeSO) involving iterative Q-value and policy updates.
Annealed entropy in the target Q-value to avoid spiky Q-functions.

Main Results:

The beta-symmetric KL divergence objective controls policy tremulousness with a large beta value.
Minimizing the new objective function theoretically improves the Q-value.
MeSO demonstrated stability, flexibility, and enhanced overall performance in experiments.

Conclusions:

The proposed beta-symmetric KL divergence objective effectively stabilizes maximum entropy RL.
MeSO offers a robust and performant alternative for real-world decision-making tasks.
The method improves upon existing maximum entropy RL approaches.