Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Long-term Potentiation01:35

Long-term Potentiation

55.1K
Long-term potentiation, or LTP, is one of the ways by which synaptic plasticity—changes in the strength of chemical synapses—can occur in the brain. LTP is the process of synaptic strengthening that occurs over time between pre- and postsynaptic neuronal connections. The synaptic strengthening of LTP works in opposition to the synaptic weakening of long-term depression (LTD) and together are the main mechanisms that underlie learning and memory.
55.1K
Language and Cognition01:27

Language and Cognition

340
Language serves as a bridge between ideas and communication, influencing how individuals perceive and interact with the world. Psychologists have long debated whether language shapes thought or vice versa. This discussion gained grip with Edward Sapir and Benjamin Lee Whorf in the 1940s, who proposed that language determines thought, a concept known as linguistic determinism. They suggested that the vocabulary and structure of a language influence how its speakers think and perceive reality.
340
Forgetting01:21

Forgetting

67
Forgetting is an intrinsic aspect of human memory, characterized by the gradual loss or inaccessibility of information over time. Hermann Ebbinghaus, a pioneering psychologist, extensively studied this phenomenon and formulated the forgetting curve. This curve illustrates that memory loss occurs rapidly immediately after learning and then decelerates over time. Several mechanisms contribute to forgetting, including encoding failure, storage decay, retrieval failure, and interference.
Encoding...
67
Role of Cerebellum and Prefrontal Cortex in Memory01:14

Role of Cerebellum and Prefrontal Cortex in Memory

413
The cerebellum, while traditionally associated with motor control, also plays a crucial role in memory, particularly in procedural memory, which involves learning motor tasks that become automatic through repetition. For example, studies have shown that when the cerebellum is damaged, individuals or animals lose the ability to learn conditioned motor responses, such as the conditioned eye-blink response in classical conditioning experiments with rabbits. This study demonstrates the...
413
Interference and Decay01:16

Interference and Decay

127
Forgetting is a complex cognitive phenomenon influenced by several factors, among which interference and decay are particularly prominent. These processes explain why individuals often struggle to retrieve specific information from memory, leading to lapses in recall that can be observed in everyday situations.
Interference occurs when competing memories hinder the retrieval of particular information. It can be classified into two types: proactive and retroactive interference. Proactive...
127
Implicit Memories01:24

Implicit Memories

119
Implicit memories, also known as non-declarative memories, are long-term memories that function outside of conscious awareness. These memories influence behavior and skills without explicit knowledge. This type of memory is evident in tasks like playing tennis, snowboarding, and texting. Implicit memory has three subsystems: procedural memory, conditioning, and priming. This type of memory is essential in various activities, from everyday tasks to specialized skills.
One key aspect of implicit...
119

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

From pixels to perception: A benchmark for human-like symmetry detection.

Vision research·2026
Same author

Tricyclic Pyrrole-Based Compounds as Zika Virus Inhibitors.

International journal of molecular sciences·2026
Same author

Novel 2-Aryl-1H-Benzimidazole Derivatives and Their Aza-Analogues as Promising Anti-Poxvirus Agents.

Viruses·2026
Same author

Early Recurrence of Esophagogastric junction adenoCarcinoma after Surgery: a multicentre analysis of risk factors (ERECS Trial).

Journal of gastrointestinal surgery : official journal of the Society for Surgery of the Alimentary Tract·2026
Same author

Staging Laparoscopy in High-Risk Gastric Cancer: A Decade of Real-World Evidence and Therapeutic Impact from a Tertiary Referral Center.

Cancers·2026
Same author

Current Challenges and Future Directions in the Multimodal Management of Gastric Cancer with Peritoneal Metastases.

Cancers·2026
Same journal

Exploiting audio-visual modalities in videos: Object detection via multi-stage bilateral coupling network.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Reliability-aware modality completion with cross-modal distillation for federated learning with missing modalities.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

IGFD-Net: Illumination-guided frequency decoupling for polarization image fusion.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Multiple-Strategies dung beetle optimizer and its applications in engineering optimization and bankruptcy prediction.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Aggregating global-scale pixel-wise forgery cues within a graph.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Finite-Time intermittent control for secure synchronization of Neutral-Type stochastic delayed neural networks under aperiodic DoS attacks.

Neural networks : the official journal of the International Neural Network Society·2026
See all related articles

Related Experiment Video

Updated: Jun 21, 2025

Development of a Gaze-Contingent Display Framework Designed for Perceptual and Oculomotor Research with Simulated Central Vision Loss
07:12

Development of a Gaze-Contingent Display Framework Designed for Perceptual and Oculomotor Research with Simulated Central Vision Loss

Published on: April 11, 2025

322

Continual pre-training mitigates forgetting in language and vision.

Andrea Cossu1, Antonio Carta1, Lucia Passaro1

  • 1University of Pisa, Largo B. Pontecorvo, 3, Pisa, 56127, Italy.

Neural Networks : the Official Journal of the International Neural Network Society
|July 10, 2024
PubMed
Summary
This summary is machine-generated.

Continual pre-training, where models learn from streaming data before task fine-tuning, can mitigate forgetting. Self-supervised pre-training in natural language processing (NLP) and vision proves effective without extra continual learning strategies.

Keywords:
Continual-learningForgettingLifelong-learningPre-trainingSelf-supervised

More Related Videos

Vision Training Methods for Sports Concussion Mitigation and Management
12:54

Vision Training Methods for Sports Concussion Mitigation and Management

Published on: May 5, 2015

17.4K
Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception
05:48

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Published on: August 9, 2024

1.5K

Related Experiment Videos

Last Updated: Jun 21, 2025

Development of a Gaze-Contingent Display Framework Designed for Perceptual and Oculomotor Research with Simulated Central Vision Loss
07:12

Development of a Gaze-Contingent Display Framework Designed for Perceptual and Oculomotor Research with Simulated Central Vision Loss

Published on: April 11, 2025

322
Vision Training Methods for Sports Concussion Mitigation and Management
12:54

Vision Training Methods for Sports Concussion Mitigation and Management

Published on: May 5, 2015

17.4K
Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception
05:48

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Published on: August 9, 2024

1.5K

Area of Science:

  • Artificial Intelligence
  • Machine Learning
  • Deep Learning

Background:

  • Pre-trained models commonly initialize Continual Learning (CL) but are seldom pre-trained *during* CL.
  • Investigating Continual Pre-Training (CPT) where models adapt to data streams before downstream task fine-tuning is crucial.

Purpose of the Study:

  • To evaluate the Continual Pre-Training scenario and its impact on model forgetting.
  • To analyze factors influencing forgetting in CPT, including input modality, architecture, and pre-training protocol.
  • To introduce a Sample-Efficient Pre-training (SEP) method to accelerate the pre-training phase.

Main Methods:

  • Developed an evaluation protocol for CPT using a Forgetting Control dataset.
  • Disentangled the effects of input modality (NLP, Vision), architecture (Transformer, ResNet), and pre-training (supervised, self-supervised) on forgetting.
  • Proposed and evaluated the Sample-Efficient Pre-training (SEP) method.

Main Results:

  • The pre-training protocol significantly impacts forgetting, more so than other factors.
  • Self-supervised CPT in both NLP and Vision effectively mitigates forgetting without specialized CL techniques.
  • Factors like model depth, input modality, and architecture type were less critical for reducing forgetting.

Conclusions:

  • Self-supervised continual pre-training is a powerful, surprisingly simple strategy to prevent catastrophic forgetting.
  • CPT, particularly with self-supervision, offers a robust approach to building adaptable models for non-stationary data streams.