Dynamic scale position embedding for cross-modal representation learning.

Jungkyoo Shin1, Sungmin Kang1, Yoonsik Cho1

  • 1 Department of Artificial Intelligence, Chung-Ang University, 84 Heukseok-ro, Dongjak-gu, Seoul 06974, Republic of Korea.

Neural Networks: the Official Journal of the International Neural Network Society | September 13, 2025
Summary
This summary is machine-generated.

This study introduces Dynamic Scale Position Embedding (DSPE) for advanced cross-modal learning in videos. DSPE effectively captures temporal information across multiple scales, improving video understanding and retrieval.

Keywords:

  • Multi-modal learning
  • Position embedding
  • Representation learning

Area of Science:

  • Computer Science
  • Artificial Intelligence
  • Machine Learning

Background:

  • Existing cross-modal learning methods struggle to capture diverse temporal information in videos.
  • Fine- and coarse-grained contrastive learning may miss inherent semantic details due to varied video durations.

Purpose of the Study:

  • To propose a novel approach for capturing multi-scale temporal information in videos for cross-modal learning.
  • To enhance semantic comprehension and integrity in video analysis.

Main Methods:

  • Introducing Dynamic Scale Position Embedding (DSPE) to enable a single transformer to interpret videos at various temporal scales.
  • Developing an efficient multi-scale temporal encoder that dynamically adjusts temporal position embeddings.
  • Preserving distinct features of video clips instead of aggregating them, maintaining semantic integrity.
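
The summary does not give the DSPE formulation, so the following is only a minimal sketch of one way a temporal position embedding could be adjusted per scale. It assumes that, at a scale of `s` frames per clip, every frame in the same clip shares one position index (`t // s`), and that positions are encoded with standard sinusoidal embeddings. The function names, the integer-division rule, and the sinusoidal encoding are all illustrative assumptions, not the authors' method.

```python
import math


def sinusoidal_embedding(position, dim):
    """Standard sinusoidal embedding for one integer position (assumed encoding)."""
    vec = []
    for i in range(0, dim, 2):
        angle = position / (10000 ** (i / dim))
        vec.append(math.sin(angle))
        if i + 1 < dim:
            vec.append(math.cos(angle))
    return vec


def scaled_positions(num_frames, scale):
    """Map each frame index to the index of the clip of `scale` frames containing it.

    Hypothetical rule: scale=1 gives the usual frame-level positions,
    larger scales give coarser, clip-level positions.
    """
    return [t // scale for t in range(num_frames)]


def multi_scale_position_embeddings(num_frames, dim, scales):
    """Build one list of per-frame position embeddings for every temporal scale."""
    return {
        s: [sinusoidal_embedding(p, dim) for p in scaled_positions(num_frames, s)]
        for s in scales
    }


# Eight frames viewed at three temporal scales (fine to coarse).
embeddings = multi_scale_position_embeddings(num_frames=8, dim=4, scales=[1, 2, 4])
```

Under these assumptions the same transformer weights could be reused across scales, because only the position embeddings added to the frame features change between passes.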

Main Results:

  • Demonstrated consistent performance improvements across four benchmark datasets (MSR-VTT, LSMDC, MSVD, ActivityNet-Captions).
  • Achieved significant enhancements in both text-video retrieval and video-captioning tasks.
  • Validated the effectiveness of the multi-scale approach in capturing fine- to coarse-grained temporal information.
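
To make the text-video retrieval task and the idea of keeping clip features separate more concrete, here is a toy sketch. It assumes a video is scored by the cosine similarity of its best-matching clip to the text query, and compares that with averaging the clips first. The scoring rule, the function names, and the two-dimensional vectors are hypothetical illustrations and are not taken from the paper.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def video_score(text_vec, clip_vecs):
    """Score a video by its best-matching clip (assumed rule; clips stay separate)."""
    return max(cosine(text_vec, clip) for clip in clip_vecs)


def mean_pool(clip_vecs):
    """Aggregate clip features into one vector by averaging each dimension."""
    return [sum(column) / len(clip_vecs) for column in zip(*clip_vecs)]


def rank_videos(text_vec, videos, score_fn):
    """Return video ids sorted from best to worst match under `score_fn`."""
    return sorted(videos, key=lambda vid: score_fn(text_vec, videos[vid]), reverse=True)


# Toy clip embeddings: video_a contains one clip that matches the query exactly.
videos = {
    "video_a": [[1.0, 0.0], [0.0, 1.0]],
    "video_b": [[0.7, 0.7], [0.6, 0.8]],
}
query = [0.0, 1.0]

per_clip_ranking = rank_videos(query, videos, video_score)
pooled_ranking = rank_videos(
    query, videos, lambda text, clips: cosine(text, mean_pool(clips))
)
```

In this toy data, scoring per clip ranks `video_a` first, while averaging the clips beforehand ranks `video_b` first, which illustrates how aggregation can dilute a clip that matches the query well.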

Conclusions:

  • The proposed DSPE approach effectively addresses limitations in capturing multi-scale temporal dynamics in videos.
  • This method offers a significant advancement for cross-modal learning, improving video understanding and related applications.
  • The multi-scale temporal encoder provides a robust solution for comprehensive video semantic analysis.