Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

<i>GeSTICS</i>: A Multimodal Corpus for Studying Gesture Synthesis in Two-party Interactions with Contextualized Speech.

Proceedings of the ... ACM International Conference on Intelligent Virtual Agents. IVA (Conference)·2025
Same author

Representation Learning for Interpersonal and Multimodal Behavior Dynamics: A Multiview Extension of Latent Change Score Models.

Proceedings of the ... ACM International Conference on Multimodal Interaction. ICMI (Conference)·2025
Same author

Beyond Additive Fusion: Learning Non-Additive Multimodal Interactions.

Findings of ACL. EMNLP. Conference on Empirical Methods in Natural Language Processing·2025
Same author

Dynamic and dyadic relationships between facial behavior, working alliance, and treatment outcomes during depression therapy.

Journal of consulting and clinical psychology·2025
Same author

Neural encoding of real world face perception.

ArXiv·2025
Same author

Ground-truthed and high-resolution drone images of the leafy spurge weed plant (Euphorbia esula).

Scientific data·2025
Same journal

VideoPASTA: 7K Preference Pairs That Matter for Video-LLM Alignment.

Proceedings of the Conference on Empirical Methods in Natural Language Processing. Conference on Empirical Methods in Natural Language Processing·2026
Same journal

Synth-SBDH: A Synthetic Dataset of Social and Behavioral Determinants of Health for Clinical Text.

Proceedings of the Conference on Empirical Methods in Natural Language Processing. Conference on Empirical Methods in Natural Language Processing·2026
Same journal

X-CoT: Explainable Text-to-Video Retrieval via LLM-based Chain-of-Thought Reasoning.

Proceedings of the Conference on Empirical Methods in Natural Language Processing. Conference on Empirical Methods in Natural Language Processing·2026
Same journal

DischargeSim: A Simulation Benchmark for Educational Doctor-Patient Communication at Discharge.

Proceedings of the Conference on Empirical Methods in Natural Language Processing. Conference on Empirical Methods in Natural Language Processing·2026
Same journal

From Scores to Steps: Diagnosing and Improving LLM Performance in Evidence-Based Medical Calculations.

Proceedings of the Conference on Empirical Methods in Natural Language Processing. Conference on Empirical Methods in Natural Language Processing·2026
Same journal

BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers.

Proceedings of the Conference on Empirical Methods in Natural Language Processing. Conference on Empirical Methods in Natural Language Processing·2026
See all related articles

Related Experiment Video

Updated: Nov 6, 2025

Multimodal Protocol for Assessing Metacognition and Self-Regulation in Adults with Learning Difficulties
12:55

Multimodal Protocol for Assessing Metacognition and Self-Regulation in Adults with Learning Difficulties

Published on: September 27, 2020

8.7K

Multimodal Routing: Improving Local and Global Interpretability of Multimodal Language Analysis.

Yao-Hung Hubert Tsai1, Martin Q Ma1, Muqiao Yang1

  • 1Carnegie Mellon University, Pittsburgh, PA, USA.

Proceedings of the Conference on Empirical Methods in Natural Language Processing. Conference on Empirical Methods in Natural Language Processing
|May 10, 2021
PubMed
Summary
This summary is machine-generated.

This study introduces Multimodal Routing, a new method for interpreting human language across different information sources. It enhances understanding of how various communication styles influence predictions, offering both global and local insights.

More Related Videos

Author Spotlight: Advancing Large-Scale Neural Dynamics Through HD-MEA Technology
09:44

Author Spotlight: Advancing Large-Scale Neural Dynamics Through HD-MEA Technology

Published on: March 8, 2024

5.3K
Cross-Modal Multivariate Pattern Analysis
13:51

Cross-Modal Multivariate Pattern Analysis

Published on: November 9, 2011

20.2K

Related Experiment Videos

Last Updated: Nov 6, 2025

Multimodal Protocol for Assessing Metacognition and Self-Regulation in Adults with Learning Difficulties
12:55

Multimodal Protocol for Assessing Metacognition and Self-Regulation in Adults with Learning Difficulties

Published on: September 27, 2020

8.7K
Author Spotlight: Advancing Large-Scale Neural Dynamics Through HD-MEA Technology
09:44

Author Spotlight: Advancing Large-Scale Neural Dynamics Through HD-MEA Technology

Published on: March 8, 2024

5.3K
Cross-Modal Multivariate Pattern Analysis
13:51

Cross-Modal Multivariate Pattern Analysis

Published on: November 9, 2011

20.2K

Area of Science:

  • Artificial Intelligence
  • Natural Language Processing
  • Machine Learning

Background:

  • Human language relies on multiple information sources (modalities) like tone, facial expressions, and speech.
  • Current multimodal learning models excel in tasks like sentiment analysis but often lack interpretability, acting as black boxes.

Purpose of the Study:

  • To develop a novel method, Multimodal Routing, for enhancing the interpretability of multimodal learning systems.
  • To dynamically adjust the influence of different input modalities based on individual data samples.

Main Methods:

  • Propose Multimodal Routing, a technique that assigns dynamic weights to input modalities and output representations.
  • The routing mechanism identifies the importance of individual modalities and cross-modality interactions.

Main Results:

  • Multimodal Routing provides interpretable insights into modality-prediction relationships.
  • Interpretations are available globally (dataset-wide trends) and locally (per-sample analysis).
  • The method achieves performance competitive with state-of-the-art approaches.

Conclusions:

  • Multimodal Routing offers a significant advancement in understanding how different communication modalities contribute to predictions.
  • This approach enhances transparency in complex AI systems, enabling more reliable human-centric task performance.