Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

<i>GeSTICS</i>: A Multimodal Corpus for Studying Gesture Synthesis in Two-party Interactions with Contextualized Speech.

Proceedings of the ... ACM International Conference on Intelligent Virtual Agents. IVA (Conference)·2025

Same author

Representation Learning for Interpersonal and Multimodal Behavior Dynamics: A Multiview Extension of Latent Change Score Models.

Proceedings of the ... ACM International Conference on Multimodal Interaction. ICMI (Conference)·2025

Same author

Beyond Additive Fusion: Learning Non-Additive Multimodal Interactions.

Findings of ACL. EMNLP. Conference on Empirical Methods in Natural Language Processing·2025

Same author

Dynamic and dyadic relationships between facial behavior, working alliance, and treatment outcomes during depression therapy.

Journal of consulting and clinical psychology·2025

Same author

Neural encoding of real world face perception.

ArXiv·2025

Same author

Ground-truthed and high-resolution drone images of the leafy spurge weed plant (Euphorbia esula).

Scientific data·2025

Same journal

VideoPASTA: 7K Preference Pairs That Matter for Video-LLM Alignment.

Proceedings of the Conference on Empirical Methods in Natural Language Processing. Conference on Empirical Methods in Natural Language Processing·2026

Same journal

Synth-SBDH: A Synthetic Dataset of Social and Behavioral Determinants of Health for Clinical Text.

Proceedings of the Conference on Empirical Methods in Natural Language Processing. Conference on Empirical Methods in Natural Language Processing·2026

Same journal

X-CoT: Explainable Text-to-Video Retrieval via LLM-based Chain-of-Thought Reasoning.

Proceedings of the Conference on Empirical Methods in Natural Language Processing. Conference on Empirical Methods in Natural Language Processing·2026

Same journal

DischargeSim: A Simulation Benchmark for Educational Doctor-Patient Communication at Discharge.

Proceedings of the Conference on Empirical Methods in Natural Language Processing. Conference on Empirical Methods in Natural Language Processing·2026

Same journal

From Scores to Steps: Diagnosing and Improving LLM Performance in Evidence-Based Medical Calculations.

Proceedings of the Conference on Empirical Methods in Natural Language Processing. Conference on Empirical Methods in Natural Language Processing·2026

Same journal

BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers.

Proceedings of the Conference on Empirical Methods in Natural Language Processing. Conference on Empirical Methods in Natural Language Processing·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Nov 6, 2025

Multimodal Protocol for Assessing Metacognition and Self-Regulation in Adults with Learning Difficulties

Multimodal Protocol for Assessing Metacognition and Self-Regulation in Adults with Learning Difficulties

Published on: September 27, 2020

Multimodal Routing: Improving Local and Global Interpretability of Multimodal Language Analysis.

Yao-Hung Hubert Tsai¹, Martin Q Ma¹, Muqiao Yang¹

¹Carnegie Mellon University, Pittsburgh, PA, USA.

Proceedings of the Conference on Empirical Methods in Natural Language Processing. Conference on Empirical Methods in Natural Language Processing

|May 10, 2021

Summary

This summary is machine-generated.

This study introduces Multimodal Routing, a new method for interpreting human language across different information sources. It enhances understanding of how various communication styles influence predictions, offering both global and local insights.

More Related Videos

Author Spotlight: Advancing Large-Scale Neural Dynamics Through HD-MEA Technology

Author Spotlight: Advancing Large-Scale Neural Dynamics Through HD-MEA Technology

Published on: March 8, 2024

Cross-Modal Multivariate Pattern Analysis

Cross-Modal Multivariate Pattern Analysis

Published on: November 9, 2011

Related Experiment Videos

Last Updated: Nov 6, 2025

Multimodal Protocol for Assessing Metacognition and Self-Regulation in Adults with Learning Difficulties

Multimodal Protocol for Assessing Metacognition and Self-Regulation in Adults with Learning Difficulties

Published on: September 27, 2020

Author Spotlight: Advancing Large-Scale Neural Dynamics Through HD-MEA Technology

Author Spotlight: Advancing Large-Scale Neural Dynamics Through HD-MEA Technology

Published on: March 8, 2024

Cross-Modal Multivariate Pattern Analysis

Cross-Modal Multivariate Pattern Analysis

Published on: November 9, 2011

Area of Science:

Artificial Intelligence
Natural Language Processing
Machine Learning

Background:

Human language relies on multiple information sources (modalities) like tone, facial expressions, and speech.
Current multimodal learning models excel in tasks like sentiment analysis but often lack interpretability, acting as black boxes.

Purpose of the Study:

To develop a novel method, Multimodal Routing, for enhancing the interpretability of multimodal learning systems.
To dynamically adjust the influence of different input modalities based on individual data samples.

Main Methods:

Propose Multimodal Routing, a technique that assigns dynamic weights to input modalities and output representations.
The routing mechanism identifies the importance of individual modalities and cross-modality interactions.

Main Results:

Multimodal Routing provides interpretable insights into modality-prediction relationships.
Interpretations are available globally (dataset-wide trends) and locally (per-sample analysis).
The method achieves performance competitive with state-of-the-art approaches.

Conclusions:

Multimodal Routing offers a significant advancement in understanding how different communication modalities contribute to predictions.
This approach enhances transparency in complex AI systems, enabling more reliable human-centric task performance.