Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Force Classification01:22

Force Classification

1.2K
Forces play a crucial role in the study of physics and engineering. They are essential in describing the motion, behavior, and equilibrium of objects in the physical world. Forces can be classified based on their origin, type, and direction of action.
Contact and non-contact forces are two of the most widely used categories of forces. As the name suggests, contact forces require physical contact between two objects to act upon each other. Examples of contact forces include frictional,...
1.2K
Aggregates Classification01:29

Aggregates Classification

328
Aggregate classification is generally based on its size, petrographic characteristics, weight, and source. Size classification ranges from coarse to fine aggregates, defined by the size of the particles. Coarse aggregates are particles that do not pass through ASTM sieve No. 4, and aggregates that pass through the sieve are fine aggregates.
Petrographic classification groups aggregates based on common mineralogical characteristics. Some of the common mineral groups found in aggregates are...
328
Classification of Signals01:30

Classification of Signals

484
In signal processing, signals are classified based on various characteristics: continuous-time versus discrete-time, periodic versus aperiodic, analog versus digital, and causal versus noncausal. Each category highlights distinct properties crucial for understanding and manipulating signals.
A continuous-time signal holds a value at every instant in time, representing information seamlessly. In contrast, a discrete-time signal holds values only at specific moments, often denoted as x(n), where...
484
Tagging and Fusion Proteins01:24

Tagging and Fusion Proteins

6.7K
Proteins are involved in several cellular processes and biochemical reactions. Analyzing a specific protein of interest requires it to be isolated from the other proteins in the cell. This is achieved by overexpressing the specific gene in a suitable host to produce large quantities of the target protein. A tag or label is recombined with the gene to produce a fusion protein containing the target protein and the tag. The tags on these fusion proteins can then be used for easy detection and...
6.7K
Classification of Systems-II01:31

Classification of Systems-II

150
Continuous-time systems have continuous input and output signals, with time measured continuously. These systems are generally defined by differential or algebraic equations. For instance, in an RC circuit, the relationship between input and output voltage is expressed through a differential equation derived from Ohm's law and the capacitor relation,
150
Classification of Systems-I01:26

Classification of Systems-I

191
Linearity is a system property characterized by a direct input-output relationship, combining homogeneity and additivity.
Homogeneity dictates that if an input x(t) is multiplied by a constant c, the output y(t) is multiplied by the same constant. Mathematically, this is expressed as:
191

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

In situ sol-gel-sol transformation formed by sodium alginate realizes folic acid-modified chitosan nanoparticles to deliver hops β-acids for colorectal cancer therapy.

Journal of nanobiotechnology·2026
Same author

Corrigendum to "Characteristics of the long non-coding RNA-mRNA regulatory network in American eel (Anguilla rostrata) during Vibrio harveyi infection" [Comparative Biochemistry and Physiology - Part D: Genomics and Proteomics 59 (2026) 101871].

Comparative biochemistry and physiology. Part D, Genomics & proteomics·2026
Same author

Experience in the diagnosis and treatment of venous adventitial cystic disease: a case series and literature review.

Frontiers in surgery·2026
Same author

Characteristics of the long non-coding RNA-mRNA regulatory network in American eel (Anguilla rostrata) during Vibrio harveyi infection.

Comparative biochemistry and physiology. Part D, Genomics & proteomics·2026
Same author

The ferric uptake regulator (Fur) positively regulates the virulence of Vibrio harveyi in shrimp.

International journal of biological macromolecules·2026
Same author

Comparative Transcriptomic Analysis Underlies the Differential Virulence of <i>Vibrio harveyi</i> and <i>Vibrio vulnificus</i> in American Eels (<i>Anguilla rostrata</i>).

International journal of molecular sciences·2025
Same journal

RETRACTED: Zhang et al. A Novel Framework for Reconstruction and Imaging of Target Scattering Centers via Wide-Angle Incidence in Radar Networks. <i>Sensors</i> 2025, <i>25</i>, 6802.

Sensors (Basel, Switzerland)·2026
Same journal

Enhancing Unsupervised Multi-Source Domain Adaptation for Person Re-Identification via Mixture of Experts and Graph-Based Relation.

Sensors (Basel, Switzerland)·2026
Same journal

Development of an Instrumented Glove for Palmar Pressure Assessment in Kayakers.

Sensors (Basel, Switzerland)·2026
Same journal

Development and Experimental Validation of an Autonomous IoT-Based Monitoring System for Real-Time Water Quality Assessment in the Amazon River.

Sensors (Basel, Switzerland)·2026
Same journal

Semi-Supervised Adversarial Learning Framework for Controller Area Network Bus Intrusion Detection.

Sensors (Basel, Switzerland)·2026
Same journal

Smart Optimization Method for Safety Signs in Innovative Manufacturing Environments Integrating Industrial Field IoT Sensors and Knowledge Graphs.

Sensors (Basel, Switzerland)·2026
See all related articles

Related Experiment Video

Updated: Jul 12, 2025

Cross-Modal Multivariate Pattern Analysis
13:51

Cross-Modal Multivariate Pattern Analysis

Published on: November 9, 2011

20.0K

A Short Video Classification Framework Based on Cross-Modal Fusion.

Nuo Pang1, Songlin Guo2, Ming Yan2

  • 1School of Design, Dalian University of Science and Technology, Dalian 116052, China.

Sensors (Basel, Switzerland)
|October 28, 2023
PubMed
Summary
This summary is machine-generated.

This study introduces a novel short video categorization architecture using cross-modal fusion of visual and text features. This approach enhances video classification accuracy in sensor systems by combining visual data with subtitle information.

Keywords:
Timesformercross-modal fusiontext featuresvideo classificationvideo features

More Related Videos

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment
08:25

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

9.0K
Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

565

Related Experiment Videos

Last Updated: Jul 12, 2025

Cross-Modal Multivariate Pattern Analysis
13:51

Cross-Modal Multivariate Pattern Analysis

Published on: November 9, 2011

20.0K
Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment
08:25

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

9.0K
Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

565

Area of Science:

  • Computer Science
  • Artificial Intelligence
  • Machine Learning

Background:

  • Online short videos present significant challenges for content classification and management.
  • Traditional video classification methods relying solely on visual features are computationally intensive and may lack accuracy.
  • Existing single-modality approaches struggle to meet specific scenario accuracy requirements.

Purpose of the Study:

  • To develop an efficient short video categorization architecture for visual sensor systems.
  • To improve the accuracy of short video classification by integrating multiple data modalities.
  • To reduce the computational load associated with frame-by-frame video processing.

Main Methods:

  • Utilized a self-attention mechanism to extend image frames into a 3D space-time representation.
  • Extracted video features using the Timesformer network by mapping image patches to an embedding layer.
  • Extracted text features from subtitles using the Bidirectional Encoder Representation from Transformers (BERT) model.
  • Implemented a cross-modal fusion strategy to combine video and text features for classification.

Main Results:

  • The proposed cross-modal fusion framework significantly outperformed baseline video classification methods.
  • The architecture effectively classifies short videos by jointly analyzing visual and textual information.
  • Achieved improved accuracy in short video classification tasks compared to single-modality approaches.

Conclusions:

  • The developed framework offers a superior approach to short video classification in sensor systems.
  • Cross-modal fusion of visual and text features is a promising strategy for enhancing video analysis.
  • The method provides an efficient and accurate solution for managing and classifying large volumes of short video content.