Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Types Of Transformers01:16

Types Of Transformers

1.1K
Transformers can provide desired voltages to a circuit by modifying the number of turns in the secondary windings.
If the ratio of the number of turns in the secondary winding to that of the primary winding is greater than one, then the transformer is said to be a step-up transformer. In a step-up transformer, the voltage at the secondary winding is greater than the voltage applied at the primary winding.
However, if this ratio is less than one, the transformer is said to be a step-down...
1.1K
The Ideal Transformer01:26

The Ideal Transformer

938
In single-phase two-winding transformers, two windings are coiled around a magnetic core characterized by cross-sectional area A and magnetic permeability μ. A phasor current i1 enters the left winding while i2 exits the right winding, establishing the fundamental working of the transformer through electromagnetic principles.
Ampere's Law forms the basis of understanding the magnetic field within the transformer. It states that the integral of the magnetic field intensity's...
938
Convolution Properties I01:20

Convolution Properties I

250
Convolution computations can be simplified by utilizing their inherent properties.
The commutative property reveals that the input and the impulse response of an LTI (Linear Time-Invariant) system can be interchanged without affecting the output:
250
Convolution Properties II01:17

Convolution Properties II

297
The important convolution properties include width, area, differentiation, and integration properties.
The width property indicates that if the durations of input signals are T1 and T2, then the width of the output response equals the sum of both durations, irrespective of the shapes of the two functions. For instance, convolving two rectangular pulses with durations of 2 seconds and 1 second results in a function with a width of 3 seconds.
The area property asserts that the area under the...
297
Equivalent Circuits for Practical Transformers01:28

Equivalent Circuits for Practical Transformers

819
The practical equivalent circuits of single-phase two-winding transformers exhibit significant deviations from their idealized versions due to the inherent properties of winding resistance and finite core permeability. These properties result in real and reactive power losses, affecting the transformer's performance. Understanding these deviations is crucial for designing more efficient transformers.
In a practical transformer, each winding exhibits resistance and leakage reactance. The...
819
Transformers with Off-Nominal Turns Ratios01:25

Transformers with Off-Nominal Turns Ratios

220
In scenarios involving parallel transformers with disparate ratings, developing per-unit models requires accommodating off-nominal turns ratios. This situation arises when the selected base voltages are not proportional to the transformer’s voltage ratings. Consider a transformer where the rated voltages are related by the term a. If the chosen voltage bases satisfy a relationship involving term b, term c is defined as the ratio of these bases. This ratio is then substituted into the...
220

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

A novel water-soluble near-infrared fluorescent probe for monitoring viscosity fluctuations in plants and zebrafish under abiotic stresses.

Smart molecules : open access·2026
Same author

Chemodivergent aminocarbonylation enabled by oxygen vacancy-engineered Pd-doped In<sub>2</sub>O<sub>3</sub> nanocatalysts.

Science advances·2026
Same author

Human iPSC-NSC-Derived Extracellular Vesicles Can Alleviate Alzheimer's Disease-Linked Impairments in Mitochondria, mTOR Signaling, Autophagy, and Hippocampal Neurogenesis.

Aging cell·2026
Same author

Study on tensile mechanical response and microstructure of polypropylene fiber reinforced loess under freezing.

PloS one·2026
Same author

Global plastics treaty must be built on a foundation of monitoring.

Nature·2026
Same author

Pituitary adenomas associated with hydrocephalus: clinical characteristics, risk stratification, and clinical management.

Journal of neuro-oncology·2026
Same journal

Unsupervised Identification of Protein Compositions and Conformations via Implicit Content-Transformation Disentanglement.

Proceedings. IEEE International Conference on Computer Vision·2026
Same journal

Recover Biological Structure from Sparse-View Diffraction Images with Neural Volumetric Prior.

Proceedings. IEEE International Conference on Computer Vision·2026
Same journal

Scaling 3D Compositional Models for Robust Classification and Pose Estimation.

Proceedings. IEEE International Conference on Computer Vision·2026
Same journal

UniEgoMotion: A Unified Model for Egocentric Motion Reconstruction, Forecasting, and Generation.

Proceedings. IEEE International Conference on Computer Vision·2026
Same journal

SimpleClick: Interactive Image Segmentation with Simple Vision Transformers.

Proceedings. IEEE International Conference on Computer Vision·2024
Same journal

Multi-Class Cell Detection Using Spatial Context Representation.

Proceedings. IEEE International Conference on Computer Vision·2024
See all related articles

Related Experiment Video

Updated: Sep 23, 2025

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images
04:23

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

Published on: April 21, 2023

2.0K

VidTr: Video Transformer Without Convolutions.

Yanyi Zhang1,2, Xinyu Li1, Chunhui Liu1

  • 1Amazon Web Service.

Proceedings. IEEE International Conference on Computer Vision
|May 13, 2022
PubMed
Summary
This summary is machine-generated.

We developed the Video Transformer (VidTr), an efficient model for video classification. VidTr uses separable-attention to achieve state-of-the-art results with lower computational costs and excels at recognizing actions needing long-term temporal reasoning.

More Related Videos

A Human Cerebral Organoid Model of Neural Cell Transplantation
08:58

A Human Cerebral Organoid Model of Neural Cell Transplantation

Published on: July 21, 2023

1.4K
Two-photon Calcium Imaging in Mice Navigating a Virtual Reality Environment
08:12

Two-photon Calcium Imaging in Mice Navigating a Virtual Reality Environment

Published on: February 20, 2014

31.5K

Related Experiment Videos

Last Updated: Sep 23, 2025

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images
04:23

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

Published on: April 21, 2023

2.0K
A Human Cerebral Organoid Model of Neural Cell Transplantation
08:58

A Human Cerebral Organoid Model of Neural Cell Transplantation

Published on: July 21, 2023

1.4K
Two-photon Calcium Imaging in Mice Navigating a Virtual Reality Environment
08:12

Two-photon Calcium Imaging in Mice Navigating a Virtual Reality Environment

Published on: February 20, 2014

31.5K

Area of Science:

  • Computer Vision
  • Artificial Intelligence
  • Machine Learning

Background:

  • Video classification is crucial for understanding video content.
  • Existing 3D convolutional networks struggle with spatio-temporal modeling efficiency.
  • Transformer models show promise but often incur high memory usage.

Purpose of the Study:

  • To introduce an efficient and effective video classification model.
  • To address the high computational and memory costs of vanilla transformers for video.
  • To improve performance on tasks requiring long-term temporal reasoning.

Main Methods:

  • Developed Video Transformer (VidTr) utilizing separable-attention mechanisms.
  • Implemented stacked attention layers for spatio-temporal information aggregation.
  • Introduced standard deviation based topK pooling (pool_topK_std) to reduce temporal computation.

Main Results:

  • VidTr achieved state-of-the-art performance on five benchmark datasets.
  • Reduced memory cost by 3.3x compared to vanilla transformers with maintained performance.
  • Demonstrated lower computational requirements than existing 3D networks.
  • Showcased superior ability in predicting actions requiring long-term temporal reasoning.

Conclusions:

  • VidTr offers a highly efficient and effective solution for video classification.
  • The separable-attention and pooling strategies significantly optimize transformer performance for video.
  • VidTr's architecture is particularly well-suited for complex action recognition tasks involving extended temporal dependencies.