Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

The Anchoring-and-Adjustment Heuristic01:25

The Anchoring-and-Adjustment Heuristic

6.7K
In order to make good decisions, we use our knowledge and our reasoning. Often, this knowledge and reasoning is sound and solid. However, sometimes, we are swayed by biases or by others manipulating a situation. For example, let’s say you and three friends wanted to rent a house and had a combined target budget of $1,600. The realtor shows you only very run-down houses for $1,600 and then shows you a very nice house for $2,000. Might you ask each person to pay more in rent to get the...
6.7K
Concepts and Prototypes01:24

Concepts and Prototypes

677
The human nervous system handles vast amounts of information by translating sensory stimuli into neural impulses, which the brain processes, creating thoughts expressed through language or stored as memories. The brain also synthesizes information from emotions and memories, which significantly influence thoughts and behaviors. This intricate process creates a comprehensive mental picture.
The brain organizes this information using concepts, which are mental categories grouping linguistic data,...
677
Relative Motion Analysis using Rotating Axes01:25

Relative Motion Analysis using Rotating Axes

1.0K
Consider a component AB undergoing a linear motion. Along with a linear motion, point B also rotates around point A. To comprehend this complex movement, position vectors for both points A and B are established using a stationary reference frame.
However, to express the relative position of point B relative to point A, an additional frame of reference, denoted as x'y', is necessary. This additional frame not only translates but also rotates relative to the fixed frame, making it...
1.0K
Taping Over Different Ground Profiles01:12

Taping Over Different Ground Profiles

497
Taping over varying ground profiles requires careful adaptation to achieve accurate measurements. On smooth, level ground with minimal vegetation, the tape can rest directly on the ground. Here, the taping team, typically consisting of a head and a rear tapeman, coordinates their positions with clear communication. The rear tapeman holds the tape at the starting point and guides the head tapeman toward a range pole placed beyond the endpoint, using hand or voice signals to ensure alignment.On...
497
Relative Motion Analysis using Rotating Axes-Problem Solving01:29

Relative Motion Analysis using Rotating Axes-Problem Solving

839
Consider a crane whose telescopic boom rotates with an angular velocity of 0.04 rad/s and angular acceleration of 0.02 rad/s2. Along with the rotation, the boom also extends linearly with a uniform speed of 5 m/s. The extension of the boom is measured at point D, which is measured with respect to the fixed point C on the other end of the boom. For the given instant, the distance between points C and D is 60 meters.
Here, in order to determine the magnitude of velocity and acceleration for point...
839

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Nitrate-Sialin2 axis couples ER-mitochondrial calcium signaling with fatty acid metabolism to drive white adipose browning.

Nature communications·2026
Same author

A study of the limitations of musical experience in Ancient Chinese Poetry - The case of the creation concert of Wei's Music Score.

PloS one·2026
Same author

Achieving high-performance manganese oxide-based aqueous zinc-ion batteries via a heterostructure strategy.

Journal of colloid and interface science·2026
Same author

Stimuli-Responsive Intelligent Coatings With Nano/Microcarriers for Early Corrosion Sensing: Advances and Challenges.

Small (Weinheim an der Bergstrasse, Germany)·2026
Same author

Restorative campus landscapes and student engagement: Dual affective pathways linking environmental perception to learning and social interaction.

Acta psychologica·2026
Same author

Mastication Regulates Nitrate Homeostasis Through Lipid Raft-Mediated Sialin Translocation.

Oral diseases·2026
Same journal

Research on a Regional Availability Evaluation Model for Road-Area High-Entropy Energy Based on Synergy Factors.

Entropy (Basel, Switzerland)·2026
Same journal

Atmospheric Turbulence Channel Modeling and Performance Analysis of a CO-ZP-OFDM Coherent Optical Communication System for UAV Air-to-Ground Scenarios.

Entropy (Basel, Switzerland)·2026
Same journal

Information Geometry and Asymptotic Theory for SMML Estimators.

Entropy (Basel, Switzerland)·2026
Same journal

Correlation Entropy and Power-Law Kinetics.

Entropy (Basel, Switzerland)·2026
Same journal

Research on the Contagion of Systemic Financial Risk Under the Impact of Climate Risks-From the Perspective of Complex Networks and Machine Learning.

Entropy (Basel, Switzerland)·2026
Same journal

The Statistical-Mechanical Meaning of the Wave Function of Quantum Mechanics.

Entropy (Basel, Switzerland)·2026
See all related articles

Related Experiment Video

Updated: May 5, 2026

A Methodology for Capturing Joint Visual Attention Using Mobile Eye-Trackers
12:39

A Methodology for Capturing Joint Visual Attention Using Mobile Eye-Trackers

Published on: January 18, 2020

7.0K

Hierarchical Prototype Alignment for Video Temporal Grounding.

Yun Tian1, Xiaobo Guo1, Jinsong Wang1

  • 1School of Optoelectronic Engineering, Changchun University of Science and Technology, Changchun 130022, China.

Entropy (Basel, Switzerland)
|May 4, 2026
PubMed
Summary
This summary is machine-generated.

This study introduces a hierarchical prototype alignment method for video temporal grounding, improving cross-modal understanding by linking specific visual regions to text. The approach enhances accuracy in complex scenarios by modeling fine-grained spatial and temporal event structures.

Keywords:
cross-modal alignmentprototype learningvideo temporal grounding

Related Experiment Videos

Last Updated: May 5, 2026

A Methodology for Capturing Joint Visual Attention Using Mobile Eye-Trackers
12:39

A Methodology for Capturing Joint Visual Attention Using Mobile Eye-Trackers

Published on: January 18, 2020

7.0K

Area of Science:

  • Computer Science
  • Artificial Intelligence
  • Machine Learning

Background:

  • Video temporal grounding performance has improved with vision-language learning.
  • Existing methods often fail to align global video features with localized textual semantics.
  • This limitation causes unstable alignment in complex videos with intertwined events.

Purpose of the Study:

  • To propose a hierarchical prototype alignment approach for enhanced video temporal grounding.
  • To jointly model fine-grained spatial semantics and temporal event structures for accurate cross-modal alignment.
  • To improve the quality of cross-modal alignment and the accuracy of temporal grounding.

Main Methods:

  • Developed a hierarchical prototype alignment method using structured intermediate prototypes.
  • Decomposed alignment into object-phrase and event-sentence stages for fine-grained correspondence.
  • Integrated cross-modal alignment information into candidate moment aggregation to focus on relevant temporal regions.

Main Results:

  • The proposed method significantly outperforms existing approaches on benchmark datasets (Charades-STA, ActivityNet Captions, TACoS).
  • Demonstrated improved cross-modal alignment quality.
  • Achieved higher temporal grounding accuracy, especially in complex scenarios.

Conclusions:

  • Hierarchical prototype alignment effectively models cross-modal correspondence between video and text.
  • The method successfully addresses limitations of global feature association in video temporal grounding.
  • The approach offers a promising direction for advancing vision-language understanding in video analysis.