Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Observational Learning

Observational Learning

Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...

Reinforcement Schedules

Reinforcement Schedules

Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...

Current Growth And Decay In RL Circuits

Current Growth And Decay In RL Circuits

The current growth and decay in RL circuits can be understood by considering a series RL circuit consisting of a resistor, an inductor, a constant source of emf, and two switches. When the first switch is closed, the circuit is equivalent to a single-loop circuit consisting of a resistor and an inductor connected to a source of emf. In this case, the source of emf produces a current in the circuit. If there were no self-inductance in the circuit, the current would rise immediately to a steady...

Multi-input and Multi-variable systems

Multi-input and Multi-variable systems

Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence of...

Associative Learning

Associative Learning

Associative learning is a fundamental concept in behavioral psychology, wherein a connection is established between two stimuli or events, leading to a learned response. This process is critical in understanding how behaviors are acquired and modified. Conditioning, the mechanism through which associations are formed, can be divided into two main types: classical conditioning and operant conditioning, each elucidating different aspects of associative learning.
Classical conditioning, also known...

Woodward–Hoffmann Selection Rules and Microscopic Reversibility

Woodward–Hoffmann Selection Rules and Microscopic Reversibility

Electrocyclic reactions, cycloadditions, and sigmatropic rearrangements are concerted pericyclic reactions that proceed via a cyclic transition state. These reactions are stereospecific and regioselective. The stereochemistry of the products depends on the symmetry characteristics of the interacting orbitals and the reaction conditions. Accordingly, pericyclic reactions are classified as either symmetry-allowed or symmetry-forbidden. Woodward and Hoffmann presented the selection criteria for...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Asciminib Provides Better Efficacy and Favorable Safety and Tolerability Against Investigator-Selected Tyrosine Kinase Inhibitors in East Asian Patients With Newly Diagnosed Chronic Myeloid Leukemia: Results From a Subgroup Analysis of the Pivotal ASC4FIRST Study.

Cancer medicine·2026

Same author

Strategies to Enhance the Utilization of National Health Surveys.

Public health weekly report·2026

Same author

Thromboembolic Events after Surgery for Localized Prostate Cancer: Incidence, Risk Factors, and Survival Outcomes.

Clinical and applied thrombosis/hemostasis : official journal of the International Academy of Clinical and Applied Thrombosis/Hemostasis·2026

Same author

Real-world outcomes of ponatinib in heavily pretreated patients with chronic myeloid leukemia and Philadelphia chromosome-positive acute lymphoblastic leukemia.

Annals of hematology·2026

Same author

Energy-efficient neural stimulation system design for implantable medical devices.

Biomedical engineering letters·2026

Same author

Cellular Senescence of Patient-derived Fibroblasts Reveals the Mid-old Stage as a Critical Window for Transcriptomic Signatures Linked to Alzheimer's Disease Biomarkers and Classification.

Clinical psychopharmacology and neuroscience : the official scientific journal of the Korean College of Neuropsychopharmacology·2026

Same journal

Deep and repetitive transcranial magnetic stimulation improves motor dysfunction after basal ganglia infarction: preliminary findings on efficacy and electrophysiological mechanisms.

Frontiers in neuroscience·2026

Same journal

Without getting under your skin: non-invasive stimulation activates the vagus nerve.

Frontiers in neuroscience·2026

Same journal

Acupuncture modulates the microbiota-gut-brain axis to treat irritable bowel syndrome: a mechanistic exploration.

Frontiers in neuroscience·2026

Same journal

Do stretch sensors expressed by aortic baroreceptors interact with circulating estradiol to mediate baroreflex sensitivity in hypertension?

Frontiers in neuroscience·2026

Same journal

A high-dimensional atlas of parvalbumin interneuron soma morphology in mouse visual and somatosensory cortex.

Frontiers in neuroscience·2026

Same journal

Predicting early neurological deterioration in acute branch atheromatous disease without reperfusion therapy: a machine learning model.

Frontiers in neuroscience·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Mar 10, 2026

Closed-loop Neuro-robotic Experiments to Test Computational Properties of Neuronal Networks

Closed-loop Neuro-robotic Experiments to Test Computational Properties of Neuronal Networks

Published on: March 2, 2015

Spike-based Q-learning in a non-von Neumann architecture.

Donghyuk Shin^1,2, Hyeongcheol Jo^1,2, Hyeseung Jang^1,2

¹Korea University, Seoul, Republic of Korea.

Frontiers in Neuroscience

|March 9, 2026

Summary

This summary is machine-generated.

This study introduces a novel non-von Neumann architecture using spiking neural networks (SNNs) for efficient reinforcement learning (RL). The hardware-feasible design accelerates Q-learning by integrating memory and computation, reducing power consumption.

Keywords:

Q-learning SNN cart-pole neuromorphic architecture non-von Neumann architecture reinforcement learning

More Related Videos

A Simple Stimulatory Device for Evoking Point-like Tactile Stimuli: A Searchlight for LFP to Spike Transitions

A Simple Stimulatory Device for Evoking Point-like Tactile Stimuli: A Searchlight for LFP to Spike Transitions

Published on: March 25, 2014

Large Scale Energy Efficient Sensor Network Routing Using a Quantum Processor Unit

Large Scale Energy Efficient Sensor Network Routing Using a Quantum Processor Unit

Published on: September 8, 2023

Related Experiment Videos

Last Updated: Mar 10, 2026

Closed-loop Neuro-robotic Experiments to Test Computational Properties of Neuronal Networks

Closed-loop Neuro-robotic Experiments to Test Computational Properties of Neuronal Networks

Published on: March 2, 2015

A Simple Stimulatory Device for Evoking Point-like Tactile Stimuli: A Searchlight for LFP to Spike Transitions

A Simple Stimulatory Device for Evoking Point-like Tactile Stimuli: A Searchlight for LFP to Spike Transitions

Published on: March 25, 2014

Large Scale Energy Efficient Sensor Network Routing Using a Quantum Processor Unit

Large Scale Energy Efficient Sensor Network Routing Using a Quantum Processor Unit

Published on: September 8, 2023

Area of Science:

Computer Science
Artificial Intelligence
Neuroscience

Background:

Von Neumann architectures face data-transfer bottlenecks and high power consumption due to memory-compute separation.
Reinforcement learning (RL) workloads, especially Q-learning, require frequent updates across large state-action spaces, exacerbating these issues.
Spiking neural networks (SNNs) offer computational efficiency through event-driven, sparse processing.

Purpose of the Study:

To propose a hardware-feasible, non-von Neumann architecture based on SNNs for efficient Q-learning.
To overcome the limitations of traditional architectures for RL tasks.
To leverage SNNs' sparse processing for enhanced computational efficiency.

Main Methods:

Developed a non-von Neumann architecture mapping states/actions to neurons and Q-values to synapses.
Implemented a lateral inhibition structure for identifying maximum Q-values for updates.
Incorporated a delay circuit for temporal consistency and local learning signals for targeted synapse updates.

Main Results:

Simulations on the Cart-pole benchmark demonstrated stable learning performance.
The architecture achieved comparable accuracy to software-based Q-learning with sufficient bit precision.
Effective learning was observed even under low-bit precision conditions.

Conclusions:

The proposed SNN-based non-von Neumann architecture effectively performs Q-learning.
This approach significantly reduces data-transfer bottlenecks and power consumption for RL workloads.
The architecture shows promise for efficient, hardware-accelerated reinforcement learning applications.