VidTr: Video Transformer Without Convolutions.

Yanyi Zhang¹,², Xinyu Li¹, Chunhui Liu¹

¹Amazon Web Service.

Proceedings. IEEE International Conference on Computer Vision

May 13, 2022

Summary

This summary is machine-generated.

We developed the Video Transformer (VidTr), an efficient model for video classification. VidTr uses separable attention to achieve state-of-the-art results at lower computational cost, and it excels at recognizing actions that require long-term temporal reasoning.

Area of Science:

Computer Vision
Artificial Intelligence
Machine Learning

Background:

Video classification is crucial for understanding video content.
Existing 3D convolutional networks struggle with spatio-temporal modeling efficiency.
Transformer models show promise but often incur high memory usage.

Purpose of the Study:

To introduce an efficient and effective video classification model.
To address the high computational and memory costs of vanilla transformers for video.
To improve performance on tasks requiring long-term temporal reasoning.

Main Methods:

Developed Video Transformer (VidTr) utilizing separable-attention mechanisms.
Implemented stacked attention layers for spatio-temporal information aggregation.
Introduced standard-deviation-based top-K pooling (pool_topK_std) to reduce computation along the temporal dimension.
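
The pooling step listed above can be illustrated with a minimal sketch. It assumes that the pooling scores each temporal position by the standard deviation of its row in a temporal attention matrix and keeps the K highest-scoring positions in their original order; the summary does not spell out these details. The function name `topk_std_pool`, the toy attention matrix, and the string "features" are hypothetical and chosen only for illustration.

```python
from statistics import pstdev


def topk_std_pool(attn, features, k):
    """Keep the k temporal positions whose attention rows vary the most.

    attn:     list of rows, one per temporal position (assumed layout).
    features: one feature per temporal position, aligned with attn.
    """
    if not 0 < k <= len(attn):
        raise ValueError("k must be between 1 and the number of positions")
    # Score each temporal position by the standard deviation of its row.
    # A near-uniform row (low deviation) attends to everything equally.
    scores = [pstdev(row) for row in attn]
    # Pick the k highest-scoring positions, then restore temporal order.
    top = sorted(range(len(attn)), key=lambda i: scores[i], reverse=True)[:k]
    keep = sorted(top)
    return keep, [features[i] for i in keep]


# Toy example: 4 temporal positions, two of them with uniform attention.
attn = [
    [0.25, 0.25, 0.25, 0.25],
    [0.70, 0.10, 0.10, 0.10],
    [0.25, 0.25, 0.25, 0.25],
    [0.10, 0.10, 0.40, 0.40],
]
features = ["f0", "f1", "f2", "f3"]
kept_idx, kept_feats = topk_std_pool(attn, features, k=2)
print(kept_idx, kept_feats)
```

In this toy input the two uniform rows score zero and are dropped, so later layers would process half as many temporal positions.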

Main Results:

VidTr achieved state-of-the-art performance on five benchmark datasets.
Reduced memory cost by 3.3x compared with vanilla transformers while maintaining performance.
Demonstrated lower computational requirements than existing 3D networks.
Showcased superior ability in predicting actions requiring long-term temporal reasoning.
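
To give a rough sense of why separating attention can lower memory cost, the sketch below counts attention-map entries for joint attention over all space-time tokens and for attention applied along time and space separately. It is back-of-the-envelope arithmetic under stated assumptions, not the authors' measurement: the clip size (8 frames, 196 patches per frame) and both function names are hypothetical, and the count ignores heads, class tokens, and feature storage, so it is not expected to reproduce the reported 3.3x figure.

```python
def joint_attention_entries(t, s):
    """Entries in one attention map over all t * s space-time tokens."""
    n = t * s
    return n * n


def separable_attention_entries(t, s):
    """Entries when attention runs along time, then along space.

    Temporal step: one t x t map for each of the s spatial locations.
    Spatial step:  one s x s map for each of the t frames.
    """
    return s * t * t + t * s * s


# Hypothetical clip: 8 frames, 196 patches per frame.
t, s = 8, 196
joint = joint_attention_entries(t, s)
separable = separable_attention_entries(t, s)
ratio = joint / separable
print(joint, separable, round(ratio, 2))
```

Under these assumptions the ratio simplifies to t * s / (t + s), so the saving grows as clips get longer or frames are split into more patches.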

Conclusions:

VidTr offers a highly efficient and effective solution for video classification.
The separable-attention and pooling strategies substantially reduce the memory and computation that transformers require for video.
VidTr's architecture is particularly well-suited for complex action recognition tasks involving extended temporal dependencies.