Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Transformers with Off-Nominal Turns Ratios01:25

Transformers with Off-Nominal Turns Ratios

246
In scenarios involving parallel transformers with disparate ratings, developing per-unit models requires accommodating off-nominal turns ratios. This situation arises when the selected base voltages are not proportional to the transformer’s voltage ratings. Consider a transformer where the rated voltages are related by the term a. If the chosen voltage bases satisfy a relationship involving term b, term c is defined as the ratio of these bases. This ratio is then substituted into the...
246
Equivalent Circuits for Practical Transformers01:28

Equivalent Circuits for Practical Transformers

878
The practical equivalent circuits of single-phase two-winding transformers exhibit significant deviations from their idealized versions due to the inherent properties of winding resistance and finite core permeability. These properties result in real and reactive power losses, affecting the transformer's performance. Understanding these deviations is crucial for designing more efficient transformers.
In a practical transformer, each winding exhibits resistance and leakage reactance. The...
878
Improving Translational Accuracy02:07

Improving Translational Accuracy

12.0K
Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
12.0K
Improving Translational Accuracy02:07

Improving Translational Accuracy

3.1K
3.1K
Accuracy, limits, and approximation01:28

Accuracy, limits, and approximation

897
Accuracy, limits, and approximations are common in many fields, especially in engineering calculations. These concepts are imperative for ensuring that a given value is as close as possible to its true value.
Accuracy is defined as the closeness of the measured value to the true or actual value. In engineering mechanics, repeated measurements are taken during theoretical or experimental analyses to ensure that the result is precise and accurate.
The accuracy of any solution is based on the...
897
Transformers01:26

Transformers

1.4K
A device that transforms voltages from one value to another using induction is called a transformer. A transformer consists of two separate coils, or windings, wrapped around the same soft iron core. However, they are electrically insulated from each other.
The iron core has a substantial relative permeability. Therefore, the magnetic field lines generated due to the current in one winding are almost entirely confined within the core, such that the same magnetic flux permeates each turn of both...
1.4K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

The Wealth Effect on Resilience Building: Exploring the Interactions Between Disaster Loss and Adaptive Capacity in China.

Risk analysis : an official publication of the Society for Risk Analysis·2026
Same author

Circumventing Concentration Limitations in Electrocatalytic Hydrogenation of 5-Hydroxymethylfurfural through Alkali Metal Ion Mediated Supramolecular Control.

Angewandte Chemie (International ed. in English)·2026
Same author

Crystal engineering for poorly water-soluble drugs: From design to applications.

Acta pharmaceutica Sinica. B·2026
Same author

DECR1 degradation by ursolic acid alleviates vascular calcification through inhibition of NF-κB/NLRP3 signaling pathway.

Free radical biology & medicine·2026
Same author

Relaxation Suppressed Exchange Tuning MRI Integrated with Manganese-Based Nanozyme Probes for Ferroptosis Induction and GPX4 Monitoring.

ACS applied bio materials·2026
Same author

A cocrystal-based long-acting injectable suspension platform enables tunable drug release.

Journal of controlled release : official journal of the Controlled Release Society·2026
Same journal

AI-driven neuroanalytic modeling for mental health: multichannel CNN-based autism spectrum disorder detection via facial pattern analysis.

Frontiers in computational neuroscience·2026
Same journal

Modeling multiscale neural dynamics for EEG-based emotion recognition using an attentive wavelet-transformer framework.

Frontiers in computational neuroscience·2026
Same journal

New directions for complex systems in contemporary neuroscience: a morphodynamic and emergent function approach.

Frontiers in computational neuroscience·2026
Same journal

NMDA receptor kinetics drive distinct routes to chaotic firing in pyramidal neurons.

Frontiers in computational neuroscience·2026
Same journal

Schumann-anchored golden ratio organization of human neural oscillations.

Frontiers in computational neuroscience·2026
Same journal

Toward model-guided electrophysiology-Encoding of chirps in the electrosensory periphery of <i>Apteronotus leptorhynchus</i>.

Frontiers in computational neuroscience·2026
See all related articles

Related Experiment Video

Updated: Oct 27, 2025

Closed-loop Neuro-robotic Experiments to Test Computational Properties of Neuronal Networks
11:18

Closed-loop Neuro-robotic Experiments to Test Computational Properties of Neuronal Networks

Published on: March 2, 2015

10.5K

Toward Software-Equivalent Accuracy on Transformer-Based Deep Neural Networks With Analog Memory Devices.

Katie Spoon1, Hsinyu Tsai1, An Chen1

  • 1IBM Research-Almaden, San Jose, CA, United States.

Frontiers in Computational Neuroscience
|July 22, 2021
PubMed
Summary
This summary is machine-generated.

This article explores how specialized hardware using analog memory can run large language models efficiently. By using specific training techniques and adjusting digital calculations, the authors show that these systems can match the performance of traditional software.

Keywords:
BERTDNNPCMRRAMTransformeranalog acceleratorsin-memory computingAnalog AI acceleratorsBERT model inferenceEnergy-efficient computingNeural network hardware

Frequently Asked Questions

More Related Videos

Deep Neural Networks for Image-Based Dietary Assessment
13:19

Deep Neural Networks for Image-Based Dietary Assessment

Published on: March 13, 2021

9.5K

Related Experiment Videos

Last Updated: Oct 27, 2025

Closed-loop Neuro-robotic Experiments to Test Computational Properties of Neuronal Networks
11:18

Closed-loop Neuro-robotic Experiments to Test Computational Properties of Neuronal Networks

Published on: March 2, 2015

10.5K
Deep Neural Networks for Image-Based Dietary Assessment
13:19

Deep Neural Networks for Image-Based Dietary Assessment

Published on: March 13, 2021

9.5K

Area of Science:

  • Computer engineering and hardware architecture
  • Artificial intelligence research focusing on Phase Change Memory integration

Background:

Current deep learning models require massive computational power that exceeds the capabilities of standard hardware. Researchers have struggled to maintain high accuracy when transitioning these complex networks to energy-efficient analog systems. That uncertainty drove the investigation into specialized hardware accelerators. Prior research has shown that analog devices often suffer from inherent noise and physical instability. No prior work had fully resolved the challenge of achieving software-equivalent performance on large-scale language tasks. This gap motivated the exploration of non-volatile memory architectures. Scientists have long sought ways to bridge the performance divide between digital software and analog hardware. This study addresses the limitations of existing memory-based systems for modern transformer architectures.

Purpose Of The Study:

The aim of this study is to evaluate the potential of analog artificial intelligence accelerators for performing accurate inference in language processing applications. Researchers seek to overcome the energy inefficiencies associated with massive model sizes in current deep learning. This project addresses the challenge of maintaining high accuracy when using physical memory devices that are prone to noise. The authors investigate whether non-volatile memory can support the complex requirements of transformer-based architectures. They specifically examine if noise-aware training can mitigate the physical instability inherent in these hardware systems. The study explores the feasibility of hybrid computation by combining analog memory with digital attention blocks. This work intends to establish a clear path toward hardware that is both fast and energy-efficient. The team focuses on validating these methods using standard industry benchmarks for language understanding.

Main Methods:

Review approach involves evaluating the performance of transformer architectures on specialized analog hardware. The team utilizes the General Language Understanding Evaluation benchmark to assess inference accuracy. They implement noise-aware training protocols to stabilize the physical characteristics of the memory devices. The design incorporates a hybrid architecture that splits tasks between analog memory and digital computation blocks. Researchers systematically reduce the precision of digital attention components to INT6 to improve efficiency. This methodology allows for a direct comparison between analog-based inference and standard software results. The study focuses on the Bidirectional Encoder Representations from Transformers model as the primary test case. Data collection centers on measuring the impact of device-level noise on overall model output.

Main Results:

Key findings from the literature show that analog accelerators can reach software-equivalent accuracy for the General Language Understanding Evaluation benchmark. The researchers successfully deployed the Bidirectional Encoder Representations from Transformers model on these systems. Their approach effectively combats inherent device drift through specialized training methods. The team achieved successful inference by lowering digital attention-block computation to INT6 precision. These results confirm that physical hardware noise does not prevent high-level model performance. The study provides quantitative evidence that analog systems handle large-scale language tasks reliably. The findings highlight a significant reduction in energy requirements compared to traditional digital processors. This performance parity represents a major step forward for energy-efficient artificial intelligence hardware.

Conclusions:

The authors demonstrate that analog accelerators can achieve performance parity with digital software for language tasks. Synthesis and implications suggest that noise-aware training effectively mitigates physical device instability. The researchers propose that combining this training with low-precision digital blocks optimizes energy efficiency. Their findings indicate that BERT models remain highly accurate even when hardware precision is reduced. This work provides a viable roadmap for deploying large models on specialized analog chips. The evidence supports the feasibility of using phase change memory for complex inference applications. Future hardware designs may benefit from the integration of these specific training and computation strategies. These results confirm that analog systems can support the demands of modern transformer-based deep learning.

The researchers propose a dual-pronged strategy: implementing noise-aware training to counteract physical device drift and utilizing reduced-precision digital computation at the INT6 level for attention blocks. This combination allows analog systems to match the accuracy of traditional software-based inference.

The study utilizes Phase Change Memory, a type of non-volatile memory, to store and process model parameters. This hardware is specifically chosen for its potential to provide fast and energy-efficient inference compared to conventional digital processors.

The authors indicate that reduced-precision digital attention-block computation is necessary to maintain efficiency. By lowering precision to INT6, the system balances the computational load while ensuring that the overall model accuracy remains comparable to standard software implementations.

Digital attention blocks play a critical role by handling the most complex parts of the transformer architecture. By performing these specific calculations digitally while keeping other layers in the analog domain, the system optimizes both speed and energy consumption.

The researchers measure performance using the General Language Understanding Evaluation benchmark. This standard test evaluates how well the BERT model performs on various natural language processing tasks when deployed on the analog hardware.

The authors propose that their findings offer a clear path toward deploying large-scale models on energy-efficient analog hardware. This implication suggests that future artificial intelligence systems could significantly reduce power consumption without sacrificing the accuracy required for complex language understanding.