Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Force Classification

Force Classification

Forces play a crucial role in the study of physics and engineering. They are essential in describing the motion, behavior, and equilibrium of objects in the physical world. Forces can be classified based on their origin, type, and direction of action.
Contact and non-contact forces are two of the most widely used categories of forces. As the name suggests, contact forces require physical contact between two objects to act upon each other. Examples of contact forces include frictional,...

Aggregates Classification

Aggregates Classification

Aggregate classification is generally based on its size, petrographic characteristics, weight, and source. Size classification ranges from coarse to fine aggregates, defined by the size of the particles. Coarse aggregates are particles that do not pass through ASTM sieve No. 4, and aggregates that pass through the sieve are fine aggregates.
Petrographic classification groups aggregates based on common mineralogical characteristics. Some of the common mineral groups found in aggregates are...

Classification of Signals

Classification of Signals

In signal processing, signals are classified based on various characteristics: continuous-time versus discrete-time, periodic versus aperiodic, analog versus digital, and causal versus noncausal. Each category highlights distinct properties crucial for understanding and manipulating signals.
A continuous-time signal holds a value at every instant in time, representing information seamlessly. In contrast, a discrete-time signal holds values only at specific moments, often denoted as x(n), where...

Tagging and Fusion Proteins

Tagging and Fusion Proteins

Proteins are involved in several cellular processes and biochemical reactions. Analyzing a specific protein of interest requires it to be isolated from the other proteins in the cell. This is achieved by overexpressing the specific gene in a suitable host to produce large quantities of the target protein. A tag or label is recombined with the gene to produce a fusion protein containing the target protein and the tag. The tags on these fusion proteins can then be used for easy detection and...

Classification of Systems-II

Classification of Systems-II

Continuous-time systems have continuous input and output signals, with time measured continuously. These systems are generally defined by differential or algebraic equations. For instance, in an RC circuit, the relationship between input and output voltage is expressed through a differential equation derived from Ohm's law and the capacitor relation,

Classification of Systems-I

Classification of Systems-I

Linearity is a system property characterized by a direct input-output relationship, combining homogeneity and additivity.
Homogeneity dictates that if an input x(t) is multiplied by a constant c, the output y(t) is multiplied by the same constant. Mathematically, this is expressed as:

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

In situ sol-gel-sol transformation formed by sodium alginate realizes folic acid-modified chitosan nanoparticles to deliver hops β-acids for colorectal cancer therapy.

Journal of nanobiotechnology·2026

Same author

Corrigendum to "Characteristics of the long non-coding RNA-mRNA regulatory network in American eel (Anguilla rostrata) during Vibrio harveyi infection" [Comparative Biochemistry and Physiology - Part D: Genomics and Proteomics 59 (2026) 101871].

Comparative biochemistry and physiology. Part D, Genomics & proteomics·2026

Same author

Experience in the diagnosis and treatment of venous adventitial cystic disease: a case series and literature review.

Frontiers in surgery·2026

Same author

Characteristics of the long non-coding RNA-mRNA regulatory network in American eel (Anguilla rostrata) during Vibrio harveyi infection.

Comparative biochemistry and physiology. Part D, Genomics & proteomics·2026

Same author

The ferric uptake regulator (Fur) positively regulates the virulence of Vibrio harveyi in shrimp.

International journal of biological macromolecules·2026

Same author

Comparative Transcriptomic Analysis Underlies the Differential Virulence of <i>Vibrio harveyi</i> and <i>Vibrio vulnificus</i> in American Eels (<i>Anguilla rostrata</i>).

International journal of molecular sciences·2025

Same journal

RETRACTED: Zhang et al. A Novel Framework for Reconstruction and Imaging of Target Scattering Centers via Wide-Angle Incidence in Radar Networks. <i>Sensors</i> 2025, <i>25</i>, 6802.

Sensors (Basel, Switzerland)·2026

Same journal

Enhancing Unsupervised Multi-Source Domain Adaptation for Person Re-Identification via Mixture of Experts and Graph-Based Relation.

Sensors (Basel, Switzerland)·2026

Same journal

Development of an Instrumented Glove for Palmar Pressure Assessment in Kayakers.

Sensors (Basel, Switzerland)·2026

Same journal

Development and Experimental Validation of an Autonomous IoT-Based Monitoring System for Real-Time Water Quality Assessment in the Amazon River.

Sensors (Basel, Switzerland)·2026

Same journal

Semi-Supervised Adversarial Learning Framework for Controller Area Network Bus Intrusion Detection.

Sensors (Basel, Switzerland)·2026

Same journal

Smart Optimization Method for Safety Signs in Innovative Manufacturing Environments Integrating Industrial Field IoT Sensors and Knowledge Graphs.

Sensors (Basel, Switzerland)·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jul 12, 2025

Cross-Modal Multivariate Pattern Analysis

Cross-Modal Multivariate Pattern Analysis

Published on: November 9, 2011

A Short Video Classification Framework Based on Cross-Modal Fusion.

Nuo Pang¹, Songlin Guo², Ming Yan²

¹School of Design, Dalian University of Science and Technology, Dalian 116052, China.

Sensors (Basel, Switzerland)

|October 28, 2023

Summary

This summary is machine-generated.

This study introduces a novel short video categorization architecture using cross-modal fusion of visual and text features. This approach enhances video classification accuracy in sensor systems by combining visual data with subtitle information.

Keywords:

Timesformer cross-modal fusion text features video classification video features

More Related Videos

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

Related Experiment Videos

Last Updated: Jul 12, 2025

Cross-Modal Multivariate Pattern Analysis

Cross-Modal Multivariate Pattern Analysis

Published on: November 9, 2011

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

Area of Science:

Computer Science
Artificial Intelligence
Machine Learning

Background:

Online short videos present significant challenges for content classification and management.
Traditional video classification methods relying solely on visual features are computationally intensive and may lack accuracy.
Existing single-modality approaches struggle to meet specific scenario accuracy requirements.

Purpose of the Study:

To develop an efficient short video categorization architecture for visual sensor systems.
To improve the accuracy of short video classification by integrating multiple data modalities.
To reduce the computational load associated with frame-by-frame video processing.

Main Methods:

Utilized a self-attention mechanism to extend image frames into a 3D space-time representation.
Extracted video features using the Timesformer network by mapping image patches to an embedding layer.
Extracted text features from subtitles using the Bidirectional Encoder Representation from Transformers (BERT) model.
Implemented a cross-modal fusion strategy to combine video and text features for classification.

Main Results:

The proposed cross-modal fusion framework significantly outperformed baseline video classification methods.
The architecture effectively classifies short videos by jointly analyzing visual and textual information.
Achieved improved accuracy in short video classification tasks compared to single-modality approaches.

Conclusions:

The developed framework offers a superior approach to short video classification in sensor systems.
Cross-modal fusion of visual and text features is a promising strategy for enhancing video analysis.
The method provides an efficient and accurate solution for managing and classifying large volumes of short video content.