Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Interpreting ¹H NMR Signal Splitting: The (n + 1) Rule01:10

Interpreting ¹H NMR Signal Splitting: The (n + 1) Rule

1.2K
In the AX proton spin system, proton A can sense the two spin states of a coupled proton X, resulting in a doublet NMR signal with two peaks of equal (1:1) intensity. When proton A is coupled to two equivalent protons (AX2 spin system), the spin states of each X can be aligned with or against the external field, creating three possible scenarios. This results in a 1:2:1  triplet signal, where the central peak corresponds to the chemical shift of A and is twice as large or intense as the...
1.2K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

MixTrain: accelerating DNN training via input mixing.

Frontiers in artificial intelligence·2024
Same author

Compute in-Memory with Non-Volatile Elements for Neural Networks: A Review from a Co-Design Perspective.

Advanced materials (Deerfield Beach, Fla.)·2022
Same author

Neural Network Training With Asymmetric Crosspoint Elements.

Frontiers in artificial intelligence·2022
Same author

Accelerating DNN Training Through Selective Localized Learning.

Frontiers in neuroscience·2022
Same author

Probabilistic Spike Propagation for Efficient Hardware Implementation of Spiking Neural Networks.

Frontiers in neuroscience·2021
Same author

Algorithm for Training Neural Networks on Resistive Device Arrays.

Frontiers in neuroscience·2020
Same journal

Cross-linguistic patterns of cognitive biases in large language models: a comparative study in English, Hebrew, and Russian.

Frontiers in artificial intelligence·2026
Same journal

From human-like AI to user adoption: the role of trust, attitude, and social influence in shaping behavioral intention.

Frontiers in artificial intelligence·2026
Same journal

Building large-scale English-Romanian literary translation resources with open models.

Frontiers in artificial intelligence·2026
Same journal

Logic, inference, understanding: cross-domain generalization for generative language models.

Frontiers in artificial intelligence·2026
Same journal

Label tree semantic losses for rich multi-class medical image segmentation.

Frontiers in artificial intelligence·2026
Same journal

Score-based generative diffusion models to synthesize full-dose FDG brain PET from MRI in epilepsy patients.

Frontiers in artificial intelligence·2026
See all related articles

Related Experiment Video

Updated: Jun 10, 2025

Scalable Fluidic Injector Arrays for Viral Targeting of Intact 3-D Brain Circuits
13:36

Scalable Fluidic Injector Arrays for Viral Targeting of Intact 3-D Brain Circuits

Published on: January 21, 2010

14.3K

LRMP: Layer Replication with Mixed Precision for spatial in-memory DNN accelerators.

Abinand Nallathambi1, Christin David Bose1, Wilfried Haensch2

  • 1Elmore Family School of Electrical and Computer Engineering, Purdue University, West Lafayette, IN, United States.

Frontiers in Artificial Intelligence
|October 21, 2024
PubMed
Summary
This summary is machine-generated.

We introduce LRMP, a method combining layer replication and mixed precision quantization to boost Deep Neural Network (DNN) performance on in-memory computing (IMC) accelerators. This approach significantly reduces latency and increases throughput for DNNs with minimal accuracy loss.

Keywords:
analog acceleratorin-memory computingmixed integer linear programmingquantizationreinforcement learning

More Related Videos

Quantifying Intermembrane Distances with Serial Image Dilations
07:45

Quantifying Intermembrane Distances with Serial Image Dilations

Published on: September 28, 2018

6.4K
Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

475

Related Experiment Videos

Last Updated: Jun 10, 2025

Scalable Fluidic Injector Arrays for Viral Targeting of Intact 3-D Brain Circuits
13:36

Scalable Fluidic Injector Arrays for Viral Targeting of Intact 3-D Brain Circuits

Published on: January 21, 2010

14.3K
Quantifying Intermembrane Distances with Serial Image Dilations
07:45

Quantifying Intermembrane Distances with Serial Image Dilations

Published on: September 28, 2018

6.4K
Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

475

Area of Science:

  • Computer Engineering
  • Artificial Intelligence
  • Hardware Acceleration

Background:

  • Deep Neural Networks (DNNs) face increasing computational demands, driving research into efficient hardware solutions.
  • In-memory computing (IMC) utilizing non-volatile memories (NVMs) offers a promising avenue for accelerating DNNs through spatial parallelism.
  • Existing NVM-based IMC accelerators struggle with non-uniform layer processing times and area constraints, limiting DNN performance.

Purpose of the Study:

  • To develop a novel method, LRMP, for enhancing DNN performance on area-constrained NVM-based IMC accelerators.
  • To address the challenges of non-uniform layer processing times and high area requirements in IMC architectures.
  • To optimize DNN mapping by jointly considering layer replication and mixed-precision quantization.

Main Methods:

  • LRMP employs a hybrid approach combining reinforcement learning and mixed integer linear programming.
  • The method intelligently searches the design space of layer replication and mixed-precision quantization.
  • A hardware-aware model guides the search, closely reflecting the target IMC accelerator architecture.

Main Results:

  • LRMP demonstrates significant performance gains across five DNN benchmarks.
  • Achieved 2.6-9.3x reduction in latency and 8-18x improvement in throughput.
  • Maintained high accuracy with minimal degradation (<1%).

Conclusions:

  • LRMP effectively optimizes DNN deployment on area-constrained NVM-based IMC accelerators.
  • The joint application of layer replication and mixed precision quantization is crucial for performance enhancement.
  • This method offers a practical solution for accelerating DNNs in resource-limited hardware environments.