Multi-input and Multi-variable systems
Reinforcement
Reinforcement Schedules
Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving
Observational Learning
Collisions in Multiple Dimensions: Problem Solving
You might also read
Articles linked to this work by shared authors, journal, and citation graph.
Updated: Apr 7, 2026

Virtual Agent for Real-Time Motivational Interviewing by Integrating Adaptive Nonverbal Behavior and Language Models
Published on: December 23, 2025
Xiaotian Liu1, Ming Hu2, Yijie Peng3
1Guanghua School of Management, Peking University, Beijing, China.
Heterogeneous-agent proximal policy optimization (HAPPO) significantly reduces supply chain costs and the bullwhip effect. This multi-agent deep reinforcement learning approach outperforms single-agent methods by balancing individual and system-wide cost objectives.
Area of Science:
Background:
Purpose of the Study:
Main Methods:
Main Results:
Conclusions: