Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Multi-input and Multi-variable systems01:22

Multi-input and Multi-variable systems

329
Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence of...
329
Chunking and Rehearsal in Sensory Memory01:22

Chunking and Rehearsal in Sensory Memory

496
Improving short-term memory can be achieved through techniques like chunking and rehearsal. Chunking involves organizing information into larger, more manageable units. This technique is particularly useful for information that exceeds the typical memory span of between five and nine items. For instance, logging into an online account with a password like "ta89vq0179gz" involves grouping letters and numbers into three chunks—ta89, vq01, and 79gz. It makes large amounts of...
496
Neuronal Communication01:28

Neuronal Communication

2.7K
Neurons, the fundamental units of the brain and nervous system, communicate through complex electrochemical signals that underpin all cognitive and bodily functions. This communication is primarily facilitated by a process involving the generation and propagation of an action potential along the axon of the neuron. When the internal electrical charge of a neuron surpasses a certain threshold, an action potential is triggered. This rapid change in voltage travels swiftly along the axon to the...
2.7K
Elaborative Rehearsals01:07

Elaborative Rehearsals

261
Elaborative rehearsal is a crucial cognitive strategy that strengthens information encoding in long-term memory by making meaningful connections between new data and pre-existing knowledge. This approach contrasts with maintenance rehearsal, which involves simple repetition without delving into the significance of the information. While maintenance rehearsal might temporarily keep information active in short-term memory, it is less effective for long-term retention.
The effectiveness of...
261

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Leveraging machine learning to predict de novo skin malignancy following lung transplantation.

Lung cancer management·2026
Same author

<i>GeSTICS</i>: A Multimodal Corpus for Studying Gesture Synthesis in Two-party Interactions with Contextualized Speech.

Proceedings of the ... ACM International Conference on Intelligent Virtual Agents. IVA (Conference)·2025
Same author

Representation Learning for Interpersonal and Multimodal Behavior Dynamics: A Multiview Extension of Latent Change Score Models.

Proceedings of the ... ACM International Conference on Multimodal Interaction. ICMI (Conference)·2025
Same author

Beyond Additive Fusion: Learning Non-Additive Multimodal Interactions.

Findings of ACL. EMNLP. Conference on Empirical Methods in Natural Language Processing·2025
Same author

Dynamic and dyadic relationships between facial behavior, working alliance, and treatment outcomes during depression therapy.

Journal of consulting and clinical psychology·2025
Same author

Neural encoding of real world face perception.

ArXiv·2025
Same journal

Interpretable Failure Detection with Human-Level Concepts.

Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence·2026
Same journal

ChatCLIDS: Simulating Persuasive AI Dialogues to Promote Closed-Loop Insulin Adoption in Type 1 Diabetes Care.

Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence·2026
Same journal

Beyond Accuracy: On the Effects of Fine-tuning Towards Vision-Language Model's Prediction Rationality.

Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence·2026
Same journal

<i>OrgaCast</i>: A Trustworthy Spatiotemporal Diffusion Model for Fluorescence Organoid Forecasting.

Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence·2026
Same journal

Apo2Mol: 3D Molecule Generation via Dynamic Pocket-Aware Diffusion Models.

Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence·2026
Same journal

iDT-diet: Toward Personalized Health Forecasting-An Intelligent Digital Twin Model for Diet-Influenced Biomarker Trajectories (Student Abstract).

Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence·2026
See all related articles

Related Experiment Video

Updated: Dec 24, 2025

Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention
06:37

Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention

Published on: December 15, 2023

5.1K

Multi-attention Recurrent Network for Human Communication Comprehension.

Amir Zadeh1, Paul Pu Liang1, Soujanya Poria2

  • 1Carnegie Mellon University, USA.

Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence
|April 8, 2020
PubMed
Summary
This summary is machine-generated.

This study introduces the Multi-attention Recurrent Network (MARN), a novel AI architecture for understanding human communication. MARN effectively processes multimodal signals like language, vision, and acoustics, achieving state-of-the-art results across various recognition tasks.

More Related Videos

A Novel Experimental and Analytical Approach to the Multimodal Neural Decoding of Intent During Social Interaction in Freely-behaving Human Infants
11:14

A Novel Experimental and Analytical Approach to the Multimodal Neural Decoding of Intent During Social Interaction in Freely-behaving Human Infants

Published on: October 4, 2015

11.3K
Eye Tracking During Visually Situated Language Comprehension: Flexibility and Limitations in Uncovering Visual Context Effects
07:36

Eye Tracking During Visually Situated Language Comprehension: Flexibility and Limitations in Uncovering Visual Context Effects

Published on: November 30, 2018

16.2K

Related Experiment Videos

Last Updated: Dec 24, 2025

Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention
06:37

Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention

Published on: December 15, 2023

5.1K
A Novel Experimental and Analytical Approach to the Multimodal Neural Decoding of Intent During Social Interaction in Freely-behaving Human Infants
11:14

A Novel Experimental and Analytical Approach to the Multimodal Neural Decoding of Intent During Social Interaction in Freely-behaving Human Infants

Published on: October 4, 2015

11.3K
Eye Tracking During Visually Situated Language Comprehension: Flexibility and Limitations in Uncovering Visual Context Effects
07:36

Eye Tracking During Visually Situated Language Comprehension: Flexibility and Limitations in Uncovering Visual Context Effects

Published on: November 30, 2018

16.2K

Area of Science:

  • Artificial Intelligence
  • Human-Computer Interaction
  • Computational Linguistics

Background:

  • Human communication is inherently multimodal, integrating language, gestures, and vocal tone.
  • Current Artificial Intelligence (AI) systems struggle to interpret the complex interplay of these signals.
  • Understanding multimodal communication is crucial for advancing AI's ability to interact naturally with humans.

Purpose of the Study:

  • To develop a novel neural network architecture capable of understanding complex human multimodal communication.
  • To address the challenge of AI comprehending the interactions between different communication modalities.
  • To achieve state-of-the-art performance in tasks requiring multimodal understanding.

Main Methods:

  • Introduction of the Multi-attention Recurrent Network (MARN), a novel neural architecture.
  • Utilizing a Multi-attention Block (MAB) to discover cross-modal interactions over time.
  • Employing a Long-short Term Hybrid Memory (LSTHM) for storing discovered interactions.
  • Extensive evaluation on six public datasets for multimodal sentiment, speaker trait, and emotion recognition.

Main Results:

  • MARN demonstrated superior performance across all tested datasets.
  • The model successfully captured and utilized interactions between language, vision, and acoustic modalities.
  • State-of-the-art results were achieved in multimodal sentiment analysis, speaker trait recognition, and emotion recognition tasks.

Conclusions:

  • The proposed MARN architecture offers a significant advancement in AI's ability to understand human communication.
  • The MAB and LSTHM components are effective in processing and integrating multimodal information.
  • MARN provides a robust framework for future research in multimodal AI and human-computer interaction.