Updated: Jan 7, 2026

Information Bottleneck-Enhanced Reinforcement Learning for Solving Operation Research Problems.

Ruozhang Xi¹, Yao Ni², Wangyu Wu³

¹Krieger School of Arts and Sciences, Johns Hopkins University, Washington, DC 20001, USA.

Sensors (Basel, Switzerland)

December 31, 2025

Summary

This summary is machine-generated.

Information Bottleneck-Enhanced Reinforcement Learning (IBE) is a framework that improves reinforcement learning (RL) on complex combinatorial optimization tasks. It strengthens representation learning and exploration, and it outperforms established RL baselines on routing and scheduling problems in logistics and manufacturing.

Keywords:

information bottleneck; operation research problems; reinforcement learning

Area of Science:

Artificial Intelligence
Operations Research
Machine Learning

Background:

Reinforcement learning (RL) struggles with high-dimensional state spaces and unstable training when applied to combinatorial optimization problems.
Applications in operations research (OR) and smart manufacturing require robust decision-making frameworks.

Purpose of the Study:

To introduce Information Bottleneck-Enhanced Reinforcement Learning (IBE), a novel framework designed to improve RL performance in structured combinatorial optimization.
To enhance representation learning and exploration efficiency in RL for complex industrial decision-making.

Main Methods:

IBE integrates information-theoretic regularization into attention-based RL architectures.
It employs two bottleneck objectives: a state representation bottleneck that yields a compact state representation, and a policy bottleneck that supplies an exploration bonus.
The framework utilizes mutual information between states and actions for policy regularization.
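
The two quantities named above can be illustrated with a small, self-contained sketch. This summary does not give the paper's estimators, so everything here is an assumption for illustration: the state bottleneck is shown as the standard variational surrogate (KL divergence from a diagonal Gaussian embedding to a standard normal prior), and the state-action mutual information is computed exactly for a small tabular policy. The function names `gaussian_kl_to_standard_normal`, `marginal_policy`, and `state_action_mutual_information` are hypothetical, and an attention-based model would need a sampled or learned estimate rather than this exact sum.

```python
import math


def gaussian_kl_to_standard_normal(mean, log_var):
    """KL( N(mean, diag(exp(log_var))) || N(0, I) ) in nats.

    Assumed stand-in for a state representation bottleneck: penalizing
    this term pushes a stochastic state embedding toward a compact code.
    """
    return 0.5 * sum(
        m * m + math.exp(lv) - lv - 1.0 for m, lv in zip(mean, log_var)
    )


def marginal_policy(state_probs, policy):
    """Action marginal pi_bar(a) = sum_s p(s) * pi(a | s)."""
    n_actions = len(policy[0])
    return [
        sum(p_s * pi_s[a] for p_s, pi_s in zip(state_probs, policy))
        for a in range(n_actions)
    ]


def state_action_mutual_information(state_probs, policy):
    """Exact I(S; A) in nats for a tabular policy.

    state_probs[s] is p(s); policy[s][a] is pi(a | s).
    I(S; A) = sum_s p(s) * sum_a pi(a|s) * log(pi(a|s) / pi_bar(a)).
    """
    marginal = marginal_policy(state_probs, policy)
    mi = 0.0
    for p_s, pi_s in zip(state_probs, policy):
        if p_s == 0.0:
            continue  # unreachable states contribute nothing
        for p_a, m_a in zip(pi_s, marginal):
            if p_a > 0.0:  # 0 * log(0) is treated as 0
                mi += p_s * p_a * math.log(p_a / m_a)
    return mi


if __name__ == "__main__":
    # A policy that ignores the state carries no information about it.
    uniform = [[0.5, 0.5], [0.5, 0.5]]
    # A policy that picks a different action in each state is maximally informative.
    decisive = [[1.0, 0.0], [0.0, 1.0]]
    p_s = [0.5, 0.5]
    print(state_action_mutual_information(p_s, uniform))
    print(state_action_mutual_information(p_s, decisive))
    print(gaussian_kl_to_standard_normal([1.0, 0.0], [0.0, 0.0]))
```

How these terms are weighted and whether the mutual information acts as a penalty or as a bonus in the training objective are choices this summary does not specify, so the sketch leaves them out.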

Main Results:

IBE demonstrated superior performance and stability compared to established RL baselines (PPO, REINFORCE, AM, NeuOpt).
On routing and scheduling problems in logistics and manufacturing, IBE consistently outperformed these baselines.
Ablation studies validated the synergistic effect of the two bottleneck components.

Conclusions:

IBE offers a principled and generalizable approach to enhance RL for combinatorial optimization and Industry 4.0 environments.
The framework effectively addresses challenges in representation learning and exploration for complex decision spaces.
IBE provides a robust solution for real-world industrial decision-making applications.