A multi-agent reinforcement learning based approach for automatic filter pruning.

Zhemin Li¹, Xiaojing Zuo¹, Yiping Song¹

¹College of Sciences, National University of Defense Technology, 410073, Changsha, China.

Scientific Reports

December 28, 2024

Summary

This summary is machine-generated.

This study introduces QMIX_FP, a multi-agent reinforcement learning method for automatic filter pruning in Deep Convolutional Neural Networks (DCNNs). It reduces model size and computational cost so that networks can be deployed on resource-constrained devices, while preserving accuracy.

Keywords:

Filter pruning; Knowledge distillation; QMIX algorithm

Area of Science:

Computer Science
Artificial Intelligence
Machine Learning

Background:

Deep Convolutional Neural Networks (DCNNs) face deployment challenges on resource-constrained devices due to high computational and memory demands.
Network pruning is a key technique for compressing DCNNs, and reinforcement learning (RL) offers more adaptive pruning strategies than rule-based methods.
Existing RL pruning methods often use a single agent, neglecting inter-layer dependencies and varying sensitivities within DCNNs.
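
For context, a rule-based baseline of the kind mentioned above can be sketched in a few lines. The example assumes L1-norm ranking with a fixed per-layer pruning ratio, which is a common rule-based criterion. The summary does not say which criterion QMIX_FP uses, and the names `l1_norm` and `prune_layer` are hypothetical. In an RL-based method, the ratio for each layer would be chosen by an agent rather than fixed by hand.

```python
from typing import List

# Assumption for illustration: a filter is a flat list of weights,
# and a layer is a list of such filters.
Filter = List[float]


def l1_norm(filt: Filter) -> float:
    """Sum of absolute weights, used here as the importance score."""
    return sum(abs(w) for w in filt)


def prune_layer(filters: List[Filter], prune_ratio: float) -> List[int]:
    """Return the indices of the filters to keep, in original order.

    The filters with the smallest L1 norm are dropped first.
    """
    if not 0.0 <= prune_ratio < 1.0:
        raise ValueError("prune_ratio must be in [0, 1)")
    n_prune = int(len(filters) * prune_ratio)
    # Rank filter indices by ascending importance.
    ranked = sorted(range(len(filters)), key=lambda i: l1_norm(filters[i]))
    pruned = set(ranked[:n_prune])
    return [i for i in range(len(filters)) if i not in pruned]


if __name__ == "__main__":
    layer = [[0.1, -0.1], [1.0, 2.0], [0.0, 0.05], [-3.0, 0.5]]
    print(prune_layer(layer, 0.5))
```

A fixed ratio treats every layer alike, which is the limitation that motivates learning layer-specific pruning decisions.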

Purpose of the Study:

To propose an automatic filter pruning method, QMIX_FP, utilizing a multi-agent reinforcement learning algorithm (QMIX).
To model Deep Convolutional Neural Networks (DCNNs) as a multi-agent system, accounting for layer-specific sensitivities and interactions.
To enhance model compression and enable efficient deployment of DCNNs on resource-constrained hardware.
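
The central idea of QMIX is that per-agent action values are combined into a joint value through a mixing function that is monotonic in each agent's value. The sketch below is a deliberately simplified, single-layer version: it assumes one agent per network layer and takes the mixing weights as plain inputs. In the QMIX algorithm itself, the weights come from hypernetworks conditioned on the global state and the mixer has a nonlinear hidden layer. The function name `mix_q_values` and the sample numbers are hypothetical.

```python
from typing import Sequence


def mix_q_values(agent_qs: Sequence[float],
                 raw_weights: Sequence[float],
                 bias: float) -> float:
    """Combine per-agent Q-values into a joint value Q_tot.

    Taking abs() of each weight keeps dQ_tot/dQ_i >= 0, so raising any
    single agent's Q-value can never lower the joint value. This is the
    monotonicity constraint that QMIX relies on.
    """
    if len(agent_qs) != len(raw_weights):
        raise ValueError("one weight is required per agent")
    return sum(abs(w) * q for w, q in zip(raw_weights, agent_qs)) + bias


if __name__ == "__main__":
    # Hypothetical values: three layer-agents, each reporting the
    # Q-value of its chosen pruning action.
    q_tot = mix_q_values([1.0, 2.0, 3.0], [0.5, -1.0, 2.0], bias=0.1)
    print(round(q_tot, 6))
```

Because of the monotonicity constraint, each agent can pick its own best action greedily and the combination is still the greedy choice for the joint value.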

Main Methods:

Developed QMIX_FP, a novel automatic filter pruning approach based on the QMIX multi-agent reinforcement learning algorithm.
Modeled the multi-layer structure of DCNNs as a multi-agent system to capture layer interactions and sensitivities.
Incorporated knowledge distillation for fine-tuning pruned networks to accelerate performance recovery.
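
The knowledge-distillation step can be illustrated with the widely used temperature-softened formulation, in which the pruned network (student) is trained to match the unpruned network (teacher). This is a minimal sketch under that assumption. The summary does not specify the exact loss used in the paper, and `temperature`, `alpha`, and the function names are hypothetical choices.

```python
import math
from typing import List, Sequence


def log_softmax(logits: Sequence[float], temperature: float = 1.0) -> List[float]:
    """Numerically stable log-softmax of logits divided by temperature."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)
    log_sum = m + math.log(sum(math.exp(z - m) for z in scaled))
    return [z - log_sum for z in scaled]


def distillation_loss(student_logits: Sequence[float],
                      teacher_logits: Sequence[float],
                      label: int,
                      temperature: float = 4.0,
                      alpha: float = 0.5) -> float:
    """Weighted sum of a soft-target KL term and a hard-label cross-entropy."""
    log_t = log_softmax(teacher_logits, temperature)
    log_s = log_softmax(student_logits, temperature)
    # KL(teacher || student) on the temperature-softened distributions.
    kl = sum(math.exp(lt) * (lt - ls) for lt, ls in zip(log_t, log_s))
    # Ordinary cross-entropy against the ground-truth label (temperature 1).
    ce = -log_softmax(student_logits)[label]
    # The T^2 factor keeps the soft-target gradient scale comparable.
    return alpha * temperature ** 2 * kl + (1.0 - alpha) * ce


if __name__ == "__main__":
    print(distillation_loss([1.0, 0.2, -0.5], [2.0, 0.1, -1.0], label=0))
```

When student and teacher logits agree, the KL term vanishes and only the hard-label term remains.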

Main Results:

Demonstrated the effectiveness of QMIX_FP on benchmark DCNNs (VGG-16, AlexNet) using CIFAR-10 and CIFAR-100 datasets.
Achieved significant reductions in computational and memory requirements for the pruned networks.
Maintained network accuracy post-pruning, validating the method's efficacy.
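
Reductions in memory and computation are typically reported as the fractional drop in parameter count and in multiply-accumulate operations. The arithmetic for a single convolutional layer can be sketched as follows. The layer shape and pruning ratio are hypothetical and are not taken from the paper's results, and bias terms are ignored.

```python
from typing import Tuple


def conv_cost(c_in: int, c_out: int, kernel: int,
              h_out: int, w_out: int) -> Tuple[int, int]:
    """Return (parameters, multiply-accumulates) for one conv layer."""
    params = c_in * c_out * kernel * kernel
    macs = params * h_out * w_out
    return params, macs


def reduction(before: int, after: int) -> float:
    """Fraction of the original cost that pruning removed."""
    return 1.0 - after / before


if __name__ == "__main__":
    # Hypothetical layer: 64 -> 128 channels, 3x3 kernel, 32x32 output map.
    params_before, macs_before = conv_cost(64, 128, 3, 32, 32)
    # Pruning half of the output filters leaves 64 of them.
    params_after, macs_after = conv_cost(64, 64, 3, 32, 32)
    print(reduction(params_before, params_after))
    print(reduction(macs_before, macs_after))
```

Removing output filters from one layer also shrinks the input channels of the next layer, so the network-wide saving is usually larger than the per-layer figure.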

Conclusions:

QMIX_FP offers an advanced solution for Deep Convolutional Neural Network (DCNN) model compression.
The multi-agent approach effectively addresses layer interactions, leading to optimized filter pruning strategies.
This method facilitates the efficient deployment of DCNNs on devices with limited resources without compromising performance.