Emphasizing Domain Differences Through Interactive-Augmented Prompts in Continual Audio-Visual Speech Recognition.

Dongjie Fu, Xize Cheng, Jingyuan Chen

    IEEE Transactions on Image Processing : a Publication of the IEEE Signal Processing Society
    |April 14, 2026
    PubMed
    Summary
    This summary is machine-generated.

    This study introduces Continual Audio-Visual Speech Recognition (CL-AVSR) to improve speech recognition in real-world conditions. The proposed Interaction-enhanced Multimodal Prompt learning (IMP) framework significantly enhances model adaptability and performance on diverse audio-visual data.

    Area of Science:

    • Artificial Intelligence
    • Machine Learning
    • Speech Processing

    Background:

    • Audio-Visual Speech Recognition (AVSR) benefits from complementary acoustic and visual signals but degrades under real-world domain shifts.
    • When a model is trained sequentially on heterogeneous data distributions, it suffers catastrophic forgetting of earlier domains and generalizes poorly.
    • Existing AVSR models often fail in dynamic, non-stationary environments.

    Purpose of the Study:

    • Introduce the Continual Audio-Visual Speech Recognition (CL-AVSR) problem formulation.
    • Establish a benchmark for CL-AVSR with scenarios reflecting real-world challenges (noise, video degradation, speaker variation).
    • Develop a novel framework to enhance AVSR adaptability and mitigate catastrophic forgetting.
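
    A continual-learning benchmark of this kind is commonly scored by tracking the error on every scenario after each training stage. The summary does not state which metrics the authors report, so the following is only a minimal sketch under assumptions: it uses word error rate (WER), and the function names and all numbers are hypothetical, chosen purely for illustration.

```python
from typing import List


def average_wer(wer: List[List[float]]) -> float:
    """Mean WER over all tasks, measured after training on the final task.

    wer[i][j] is the WER on task j after the model has been trained on task i.
    """
    final_row = wer[-1]
    return sum(final_row) / len(final_row)


def average_forgetting(wer: List[List[float]]) -> float:
    """Mean rise in WER on earlier tasks relative to the best value seen before.

    For each task j except the last, compare its final WER with the lowest WER
    it reached at any earlier stage i (j <= i < last). Larger means more forgetting.
    """
    n = len(wer)
    if n < 2:
        return 0.0
    gaps = []
    for j in range(n - 1):
        best_earlier = min(wer[i][j] for i in range(j, n - 1))
        gaps.append(wer[n - 1][j] - best_earlier)
    return sum(gaps) / len(gaps)


# Hypothetical WER matrix (percent) for three sequential scenarios,
# e.g. noise, video degradation, speaker variation. Not from the paper.
example_wer = [
    [10.0, 50.0, 60.0],  # after training on scenario 0
    [14.0, 12.0, 55.0],  # after training on scenario 1
    [18.0, 15.0, 11.0],  # after training on scenario 2
]

forgetting = average_forgetting(example_wer)
final_wer = average_wer(example_wer)
```

    In this made-up example, the error on the first scenario rises from 10.0 to 18.0 and on the second from 12.0 to 15.0 once later scenarios are learned. That growth is what a method addressing catastrophic forgetting tries to keep small.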

    Main Methods:

    • Proposed the Interaction-enhanced Multimodal Prompt learning (IMP) framework using a pre-trained AV-HuBERT backbone.
    • Integrated task-relevant soft prompts with cross-modal and cross-task interactions for knowledge transfer.
    • Employed contrastive regularization and a multi-modal prompt selection strategy for dynamic adaptation.
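
    The summary does not describe how the multi-modal prompt selection strategy works. One common way to choose prompts in prompt-based continual learning is to match a query feature against a learned key for each prompt and keep the closest ones. The sketch below illustrates that generic key-query idea only. It is not the IMP implementation, and the names `fuse`, `select_prompts`, the prompt labels, and the feature values are all assumptions made for illustration.

```python
import math
from typing import Dict, List, Sequence


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def fuse(audio: Sequence[float], video: Sequence[float]) -> List[float]:
    """Build a multi-modal query by concatenating audio and video features.

    Concatenation is an assumption; other fusion schemes are possible.
    """
    return list(audio) + list(video)


def select_prompts(
    query: Sequence[float],
    prompt_keys: Dict[str, Sequence[float]],
    top_k: int = 1,
) -> List[str]:
    """Return the names of the top_k prompts whose keys best match the query."""
    ranked = sorted(
        prompt_keys,
        key=lambda name: cosine(query, prompt_keys[name]),
        reverse=True,
    )
    return ranked[:top_k]


# Hypothetical prompt pool: one key per previously seen condition.
prompt_keys = {
    "clean": [1.0, 0.0, 0.0, 0.0],
    "noisy_audio": [0.0, 1.0, 0.0, 0.0],
    "blurred_video": [0.0, 0.0, 0.0, 1.0],
}

# Hypothetical features for an incoming utterance (2-d audio, 2-d video).
query = fuse(audio=[0.1, 0.9], video=[0.0, 0.2])
chosen = select_prompts(query, prompt_keys, top_k=2)
```

    In a real system the query would come from a frozen encoder such as the pre-trained backbone, and the selected soft prompts would be prepended to its input. Here the vectors are hand-written so the selection logic can be followed by eye.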

    Main Results:

    • IMP demonstrated substantial improvements over strong baselines on the LRS2 dataset.
    • Achieved new state-of-the-art performance across all CL-AVSR experimental scenarios.
    • Validated the framework's effectiveness in handling domain shifts and diverse data streams.

    Conclusions:

    • The IMP framework significantly enhances continual learning capabilities for AVSR systems.
    • IMP offers a robust solution for adaptable multi-modal speech recognition in real-world applications.
    • This work paves the way for more resilient AVSR systems facing dynamic environmental and data variations.