LRMP: Layer Replication with Mixed Precision for spatial in-memory DNN accelerators.

Abinand Nallathambi¹, Christin David Bose¹, Wilfried Haensch²

¹Elmore Family School of Electrical and Computer Engineering, Purdue University, West Lafayette, IN, United States.

Frontiers in Artificial Intelligence

October 21, 2024

Summary

This summary is machine-generated.

We introduce LRMP, a method combining layer replication and mixed precision quantization to boost Deep Neural Network (DNN) performance on in-memory computing (IMC) accelerators. This approach significantly reduces latency and increases throughput for DNNs with minimal accuracy loss.

Keywords:

analog accelerator, in-memory computing, mixed integer linear programming, quantization, reinforcement learning

Area of Science:

Computer Engineering
Artificial Intelligence
Hardware Acceleration

Background:

Deep Neural Networks (DNNs) face increasing computational demands, driving research into efficient hardware solutions.
In-memory computing (IMC) utilizing non-volatile memories (NVMs) offers a promising avenue for accelerating DNNs through spatial parallelism.
Existing NVM-based IMC accelerators struggle with non-uniform layer processing times and area constraints, limiting DNN performance.
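A toy model can make the cost of non-uniform layer processing times concrete. The sketch below assumes each layer occupies its own crossbar tiles and that layers run as pipeline stages, so single-input latency is the sum of the per-layer times while steady-state throughput is set by the slowest layer. The layer names and cycle counts are invented for illustration and are not taken from the paper.

```python
def pipeline_metrics(layer_cycles):
    """Return (latency, bottleneck layer, throughput) for a layer pipeline.

    layer_cycles maps a layer name to the cycles it needs per input.
    Assumption: one pipeline stage per layer, no replication.
    """
    latency = sum(layer_cycles.values())                  # one input, end to end
    bottleneck = max(layer_cycles, key=layer_cycles.get)  # slowest stage
    throughput = 1.0 / layer_cycles[bottleneck]           # inputs per cycle
    return latency, bottleneck, throughput


# Hypothetical network: early convolution layers reuse the same stationary
# weights for many output positions, so they need far more cycles per input.
layer_cycles = {"conv1": 12544, "conv2": 3136, "conv3": 784, "fc": 1}

latency, bottleneck, throughput = pipeline_metrics(layer_cycles)
print(f"latency={latency} cycles, bottleneck={bottleneck}, "
      f"throughput={throughput:.2e} inputs/cycle")
```

In this toy setting one layer dominates both metrics while the tiles of the fast layers sit mostly idle, which is the imbalance that replicating slow layers is meant to reduce.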

Purpose of the Study:

To develop a novel method, LRMP, for enhancing DNN performance on area-constrained NVM-based IMC accelerators.
To address the challenges of non-uniform layer processing times and high area requirements in IMC architectures.
To optimize DNN mapping by jointly considering layer replication and mixed-precision quantization.

Main Methods:

LRMP employs a hybrid approach combining reinforcement learning and mixed integer linear programming.
The method searches the joint design space of layer replication and mixed-precision quantization.
A hardware-aware model guides the search, closely reflecting the target IMC accelerator architecture.
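The interaction between the two knobs can be sketched with a small stand-in model. The code below is not LRMP's reinforcement learning and mixed integer linear programming search; it swaps in a plain greedy heuristic, and every name and number in it is hypothetical. It assumes that a layer's tile count scales linearly with its weight bit-width, that r replicas divide a layer's cycles by r, and that area is a simple tile budget. Accuracy effects of lower precision are not modeled.

```python
import math


def tiles_needed(base_tiles, bits):
    # Assumption: tile count scales linearly with weight bits (8-bit baseline).
    return math.ceil(base_tiles * bits / 8)


def bottleneck_cycles(layers, reps):
    # Cycles of the slowest layer when layer n is split over reps[n] replicas.
    return max(math.ceil(layers[n][0] / reps[n]) for n in layers)


def greedy_replicate(layers, bits, budget):
    """Greedy stand-in for the replication search (not the paper's method).

    layers: name -> (cycles per input, tiles at 8-bit precision)
    bits:   name -> weight bit-width chosen for that layer
    budget: total tiles available on the accelerator
    """
    cost = {n: tiles_needed(layers[n][1], bits[n]) for n in layers}
    reps = {n: 1 for n in layers}
    used = sum(cost.values())
    if used > budget:
        raise ValueError("network does not fit even without replication")
    while True:
        times = {n: math.ceil(layers[n][0] / reps[n]) for n in layers}
        slow = max(times, key=times.get)   # current bottleneck layer
        if times[slow] <= 1 or used + cost[slow] > budget:
            break                          # nothing to gain or no area left
        reps[slow] += 1                    # replicate the bottleneck once more
        used += cost[slow]
    return reps, used


# Hypothetical three-layer network and a 24-tile budget.
layers = {"conv1": (12544, 2), "conv2": (3136, 4), "fc": (1, 8)}
budget = 24

bits8 = {n: 8 for n in layers}
bits4 = {n: 4 for n in layers}
reps8, used8 = greedy_replicate(layers, bits8, budget)
reps4, used4 = greedy_replicate(layers, bits4, budget)

print("8-bit:", reps8, "tiles used:", used8,
      "bottleneck cycles:", bottleneck_cycles(layers, reps8))
print("4-bit:", reps4, "tiles used:", used4,
      "bottleneck cycles:", bottleneck_cycles(layers, reps4))
```

Under these assumptions, lowering precision frees tiles that the greedy loop then spends on extra replicas of the slowest layers, which is why the two choices are worth considering jointly rather than one after the other.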

Main Results:

LRMP demonstrates significant performance gains across five DNN benchmarks.
It achieves a 2.6-9.3x reduction in latency and an 8-18x improvement in throughput.
Accuracy degradation stays below 1%.

Conclusions:

LRMP effectively optimizes DNN deployment on area-constrained NVM-based IMC accelerators.
The joint application of layer replication and mixed precision quantization is crucial for performance enhancement.
This method offers a practical solution for accelerating DNNs in resource-limited hardware environments.