Arithmetic with language models: From memorization to computation.

Davide Maltoni¹, Matteo Ferrara¹

¹Department of Computer Science and Engineering, University of Bologna, Italy.

Neural Networks : the Official Journal of the International Neural Network Society

July 28, 2024

Summary

This summary is machine-generated.

A language model trained on next-token prediction can learn binary addition and multiplication and generalize these computations beyond its training data. For these tasks, the model appears to work as an Encoding-Regression-Decoding machine.

Keywords:

AI explainability, Arithmetic, Interpretability, Language models, Probing

Area of Science:

Artificial Intelligence
Computational Linguistics
Machine Learning

Background:

Recent large language models (LLMs) demonstrate emergent computational abilities.
Understanding these capabilities is crucial for improving LLM performance and applications.

Purpose of the Study:

Investigate how LLMs trained on next-token prediction perform arithmetic computations.
Analyze the generalization capabilities of LLMs beyond their training data for mathematical tasks.

Main Methods:

Trained a lightweight language model on binary addition and multiplication tasks.
Conducted experiments to assess extrapolation capabilities and internal processing.
Used binary arithmetic as a testbed because it requires only a very small vocabulary and exhibits input/output discontinuities that make smooth interpolation ineffective on novel data.
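The setup described above can be illustrated with a minimal sketch of how a binary arithmetic dataset might be built and split so that test operand pairs never appear in training. The prompt format ("a+b=" followed by the result), the 4-bit operand width, the 75/25 split, and all function names are assumptions made for illustration; the page does not state the authors' actual format or sizes.

```python
import random


def to_bits(value: int, width: int) -> str:
    """Render a non-negative integer as a fixed-width binary string."""
    return format(value, f"0{width}b")


def make_example(a: int, b: int, op: str, width: int) -> tuple[str, str]:
    """Build one (prompt, target) pair, e.g. ("0011+0101=", "01000")."""
    if op == "+":
        result, out_width = a + b, width + 1   # a sum needs one extra bit
    elif op == "*":
        result, out_width = a * b, 2 * width   # a product needs twice the bits
    else:
        raise ValueError(f"unsupported operation: {op}")
    prompt = f"{to_bits(a, width)}{op}{to_bits(b, width)}="
    return prompt, to_bits(result, out_width)


def build_split(op: str, width: int = 4, train_fraction: float = 0.75, seed: int = 0):
    """Enumerate all operand pairs and split them into disjoint train/test sets."""
    pairs = [(a, b) for a in range(2 ** width) for b in range(2 ** width)]
    random.Random(seed).shuffle(pairs)          # seeded shuffle for repeatability
    cut = int(len(pairs) * train_fraction)
    train = [make_example(a, b, op, width) for a, b in pairs[:cut]]
    test = [make_example(a, b, op, width) for a, b in pairs[cut:]]
    return train, test


def vocabulary(examples) -> list[str]:
    """Collect the distinct characters (tokens, in this sketch) used by the data."""
    return sorted({ch for prompt, target in examples for ch in prompt + target})
```

With these assumptions, the addition task uses only four character-level tokens ("0", "1", "+", "="), which is the sense in which the vocabulary is small. Because the test pairs are held out entirely, a model can only answer them correctly by computing rather than recalling.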

Main Results:

Successfully trained a language model to perform binary addition and multiplication.
Demonstrated that the language model can generalize arithmetic computations to novel data.
Evidence suggests a computational process involving encoding, regression, and decoding within the model.

Conclusions:

Language models can be trained to perform arithmetic computations with generalization.
The model appears to operate as an Encoding-Regression-Decoding system for these tasks.
Computation occurs in a value space after mapping input tokens to internal representations.
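The Encoding-Regression-Decoding view can be made concrete with a hand-written analogue: map input tokens to values, compute on the values, then map the result back to tokens. This is only a conceptual sketch; it is not the trained model or its learned internals, and the function names and prompt format are hypothetical.

```python
def encode(bits: str) -> int:
    """Encoding: map a token sequence (binary digits) to a numeric value."""
    return int(bits, 2)


def regress(a: int, b: int, op: str) -> int:
    """Regression: compute the result in value space, not token space."""
    if op == "+":
        return a + b
    if op == "*":
        return a * b
    raise ValueError(f"unsupported operation: {op}")


def decode(value: int, width: int) -> str:
    """Decoding: map the computed value back to a fixed-width token sequence."""
    return format(value, f"0{width}b")


def run_pipeline(prompt: str) -> str:
    """Apply encode -> regress -> decode to a prompt such as "0011+0101="."""
    body = prompt.rstrip("=")
    op = "+" if "+" in body else "*"
    left, right = body.split(op)
    width = len(left)
    value = regress(encode(left), encode(right), op)
    out_width = width + 1 if op == "+" else 2 * width
    return decode(value, out_width)
```

For example, run_pipeline("0011+0101=") encodes the operands as 3 and 5, computes 8 in value space, and decodes it to "01000". The study's claim is that the trained network appears to organize its computation in a comparable three-stage way, not that it contains explicit functions like these.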