Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Reinforcement01:23

Reinforcement

178
Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:
178
Reinforcement Schedules01:24

Reinforcement Schedules

130
Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...
130
Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving01:29

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving

40
Mechanistic models play a crucial role in algorithms for numerical problem-solving, particularly in nonlinear mixed effects modeling (NMEM). These models aim to minimize specific objective functions by evaluating various parameter estimates, leading to the development of systematic algorithms. In some cases, linearization techniques approximate the model using linear equations.
In individual population analyses, different algorithms are employed, such as Cauchy's method, which uses a...
40
Generalization, Discrimination, and Extinction01:24

Generalization, Discrimination, and Extinction

432
Generalization, discrimination, and extinction are key concepts in operant conditioning that influence how behaviors are learned and maintained.
Generalization occurs when a behavior reinforced in one context is performed in similar situations. For instance, a student who studies diligently for calculus and receives excellent grades might apply the same study habits to psychology and history, expecting similar results. Generalization shows how learning in one setting can influence behavior in...
432
Operant Conditioning Intervention01:24

Operant Conditioning Intervention

43
Operant conditioning serves as a foundational principle in therapeutic interventions aimed at modifying maladaptive behaviors. Central to this approach is the notion that behaviors, both adaptive and maladaptive, are learned through reinforcement. By analyzing the environmental factors that reinforce problematic behaviors, clinicians can design interventions to weaken these reinforcements and replace maladaptive behaviors with healthier alternatives.
In operant conditioning, behaviors that are...
43

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Enantioselective Radical Ring-Opening Cyanation of Oxime Esters by Dual Photoredox and Copper Catalysis.

Organic letters·2019
Same author

ACCELERATING MAGNETIC RESONANCE IMAGING VIA DEEP LEARNING.

Proceedings. IEEE International Symposium on Biomedical Imaging·2019
Same author

Technical note: Development and application of KASP assays for rapid screening of 8 genetic defects in Holstein cattle.

Journal of dairy science·2019
Same author

Sesquiterpenes and diterpenes from Euphorbia thymifolia.

Fitoterapia·2019
Same author

Glechomanamides A-C, Germacrane Sesquiterpenoids with an Unusual Δ<sup>8</sup>-7,12-Lactam Moiety from <i>Salvia scapiformis</i> and Their Antiangiogenic Activity.

Journal of natural products·2019
Same author

Parameter optimization framework on wave gradients of Wave-CAIPI imaging.

Magnetic resonance in medicine·2019
Same journal

Therapeutic potential of crude protein extracts from two Egyptian freshwater snails Lanistes carinatus and Bellamya unicolor.

Scientific reports·2026
Same journal

Microbial contamination of donor corneas and post-keratoplasty endophthalmitis: a comparison between Japanese and U.S. eye banks using cold storage.

Scientific reports·2026
Same journal

Prevalence and contributing factors of virological non-suppression among adult patients on first-line antiretroviral therapy in tertiary hospitals in Ethiopia.

Scientific reports·2026
Same journal

An in vitro comparison of color stability between alkasite and different restorative materials in various staining solutions.

Scientific reports·2026
Same journal

Toward accessible mRNA LNP formulation: systematic evaluation of mixing strategies and key parameters.

Scientific reports·2026
Same journal

A network analysis of personality traits, mentalizing, and psychological health in Chinese college students.

Scientific reports·2026
See all related articles

Related Experiment Video

Updated: Jun 4, 2025

Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface
11:54

Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface

Published on: May 8, 2021

4.3K

A multi-agent reinforcement learning based approach for automatic filter pruning.

Zhemin Li1, Xiaojing Zuo1, Yiping Song1

  • 1College of Sciences, National University of Defense Technology, 410073, Changsha, China.

Scientific Reports
|December 28, 2024
PubMed
Summary
This summary is machine-generated.

This study introduces QMIX_FP, a multi-agent reinforcement learning method for automatic filter pruning in Deep Convolutional Neural Networks (DCNNs). It efficiently reduces model size and computational needs for deployment on resource-constrained devices while preserving accuracy.

Keywords:
Filter pruningKnowledge distillationQMIX algorithm

More Related Videos

Investigating Motor Skill Learning Processes with a Robotic Manipulandum
07:52

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

8.7K
Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.4K

Related Experiment Videos

Last Updated: Jun 4, 2025

Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface
11:54

Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface

Published on: May 8, 2021

4.3K
Investigating Motor Skill Learning Processes with a Robotic Manipulandum
07:52

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

8.7K
Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.4K

Area of Science:

  • Computer Science
  • Artificial Intelligence
  • Machine Learning

Background:

  • Deep Convolutional Neural Networks (DCNNs) face deployment challenges on resource-constrained devices due to high computational and memory demands.
  • Network pruning is a key technique for compressing DCNNs, with reinforcement learning (RL) offering adaptive strategies over rule-based methods.
  • Existing RL pruning methods often use a single agent, neglecting inter-layer dependencies and varying sensitivities within DCNNs.

Purpose of the Study:

  • To propose an automatic filter pruning method, QMIX_FP, utilizing a multi-agent reinforcement learning algorithm (QMIX).
  • To model Deep Convolutional Neural Networks (DCNNs) as a multi-agent system, accounting for layer-specific sensitivities and interactions.
  • To enhance model compression and enable efficient deployment of DCNNs on resource-constrained hardware.

Main Methods:

  • Developed QMIX_FP, a novel automatic filter pruning approach based on the QMIX multi-agent reinforcement learning algorithm.
  • Modeled the multi-layer structure of DCNNs as a multi-agent system to capture layer interactions and sensitivities.
  • Incorporated knowledge distillation for fine-tuning pruned networks to accelerate performance recovery.

Main Results:

  • Demonstrated the effectiveness of QMIX_FP on benchmark DCNNs (VGG-16, AlexNet) using CIFAR-10 and CIFAR-100 datasets.
  • Achieved significant reductions in computational and memory requirements for the pruned networks.
  • Maintained network accuracy post-pruning, validating the method's efficacy.

Conclusions:

  • QMIX_FP offers an advanced solution for Deep Convolutional Neural Network (DCNN) model compression.
  • The multi-agent approach effectively addresses layer interactions, leading to optimized filter pruning strategies.
  • This method facilitates the efficient deployment of DCNNs on devices with limited resources without compromising performance.