Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Convolution Properties II01:17

Convolution Properties II

184
The important convolution properties include width, area, differentiation, and integration properties.
The width property indicates that if the durations of input signals are T1 and T2, then the width of the output response equals the sum of both durations, irrespective of the shapes of the two functions. For instance, convolving two rectangular pulses with durations of 2 seconds and 1 second results in a function with a width of 3 seconds.
The area property asserts that the area under the...
184
Association Areas of the Cortex01:21

Association Areas of the Cortex

5.3K
Association areas are regions of the cerebral cortex that do not have a specific sensory or motor function. Instead, they integrate and interpret information from various sources to enable higher cognitive processes such as memory, learning, and decision-making. Some key association areas include the following:
Prefrontal Association Area: This area is located in the frontal lobe and is involved in planning, decision-making, and moderating social behavior. It connects with primary motor areas,...
5.3K
Deconvolution01:20

Deconvolution

156
Deconvolution, also known as inverse filtering, is the process of extracting the impulse response from known input and output signals. This technique is vital in scenarios where the system's characteristics are unknown, and they must be inferred from the observable signals.
Deconvolution involves several mathematical techniques to derive the impulse response. One common approach is polynomial division. In this method, the input and output sequences are treated as coefficients of...
156
Force Classification01:22

Force Classification

1.2K
Forces play a crucial role in the study of physics and engineering. They are essential in describing the motion, behavior, and equilibrium of objects in the physical world. Forces can be classified based on their origin, type, and direction of action.
Contact and non-contact forces are two of the most widely used categories of forces. As the name suggests, contact forces require physical contact between two objects to act upon each other. Examples of contact forces include frictional,...
1.2K
Convolution Properties I01:20

Convolution Properties I

147
Convolution computations can be simplified by utilizing their inherent properties.
The commutative property reveals that the input and the impulse response of an LTI (Linear Time-Invariant) system can be interchanged without affecting the output:
147
Convolution: Math, Graphics, and Discrete Signals01:24

Convolution: Math, Graphics, and Discrete Signals

248
In any LTI (Linear Time-Invariant) system, the convolution of two signals is denoted using a convolution operator, assuming all initial conditions are zero. The convolution integral can be divided into two parts: the zero-input or natural response and the zero-state or forced response, with t0 indicating the initial time.
To simplify the convolution integral, it is assumed that both the input signal and impulse response are zero for negative time values. The graphical convolution process...
248

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

RPCANet$^{++}$: Deep Interpretable Robust PCA for Sparse Object Segmentation.

IEEE transactions on pattern analysis and machine intelligence·2026
Same author

MRCNet: Motion Reasoning Chain for Cross Modal Video Camouflaged Object Detection.

IEEE transactions on pattern analysis and machine intelligence·2026
Same author

SRFormerV2: Taking a Closer Look at Permuted Self-Attention for Image Super-Resolution.

IEEE transactions on pattern analysis and machine intelligence·2026
Same author

Determining chronicity and frequency of histologic lung lesions in feedyard cattle mortalities.

Journal of veterinary diagnostic investigation : official publication of the American Association of Veterinary Laboratory Diagnosticians, Inc·2026
Same author

DFormer++: Improving RGBD Representation Learning for Semantic Segmentation.

IEEE transactions on pattern analysis and machine intelligence·2026
Same author

Step-Wise Distribution-Aligned Style Prompt Tuning for Source-Free Cross-Domain Few-Shot Learning.

IEEE transactions on pattern analysis and machine intelligence·2025
Same journal

Relation DETR+: Exploring Explicit Position Relation Prior for Dense Prediction.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

RBF++: Quantifying and Optimizing Reasoning Boundaries across Measurable and Unmeasurable Capabilities for Chain-of-Thought Reasoning.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

CAFE: Cross-View Adaptive Fusion and Cluster Center Enhancement for Robust Multi-View Clustering.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

DIVER: Reinforced Diffusion Breaks Imitation Bottlenecks in End-to-End Autonomous Driving.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Ethics-Aware Safe Reinforcement Learning for Rare-Event Risk Control in Interactive Urban Driving.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Learning Shape Anchors for Holistic Indoor Scene Understanding.

IEEE transactions on pattern analysis and machine intelligence·2026
See all related articles

Related Experiment Video

Updated: Jun 26, 2025

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

522

Conv2Former: A Simple Transformer-Style ConvNet for Visual Recognition.

Qibin Hou, Cheng-Ze Lu, Ming-Ming Cheng

    IEEE Transactions on Pattern Analysis and Machine Intelligence
    |May 15, 2024
    PubMed
    Summary
    This summary is machine-generated.

    This study introduces Conv2Former, a novel convolutional neural network (ConvNet) that simplifies Vision Transformer self-attention. Conv2Former achieves superior performance in image classification and object detection tasks.

    More Related Videos

    A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images
    04:23

    A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

    Published on: April 21, 2023

    1.8K
    Author Spotlight: Assessment of Visual Acuity in Central Vision Loss Through Motion-Based Peripheral Vision Testing
    06:25

    Author Spotlight: Assessment of Visual Acuity in Central Vision Loss Through Motion-Based Peripheral Vision Testing

    Published on: February 23, 2024

    588

    Related Experiment Videos

    Last Updated: Jun 26, 2025

    Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
    03:31

    Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

    Published on: December 15, 2023

    522
    A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images
    04:23

    A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

    Published on: April 21, 2023

    1.8K
    Author Spotlight: Assessment of Visual Acuity in Central Vision Loss Through Motion-Based Peripheral Vision Testing
    06:25

    Author Spotlight: Assessment of Visual Acuity in Central Vision Loss Through Motion-Based Peripheral Vision Testing

    Published on: February 23, 2024

    588

    Area of Science:

    • Computer Vision
    • Deep Learning
    • Artificial Intelligence

    Background:

    • Vision Transformers (ViTs) excel at encoding global information but suffer from high computational costs with high-resolution images.
    • Existing Convolutional Neural Networks (ConvNets) have limitations in capturing global context compared to ViTs.

    Purpose of the Study:

    • To develop an efficient and high-performing network architecture by integrating the strengths of ConvNets and ViTs.
    • To address the computational limitations of ViTs in visual recognition tasks.

    Main Methods:

    • Proposed a Transformer-style ConvNet by simplifying self-attention using a convolutional modulation operation.
    • Investigated the impact of large kernel sizes (≥ 7x7) within convolutional layers.
    • Developed a family of hierarchical ConvNets named Conv2Former.

    Main Results:

    • Conv2Former demonstrates consistent performance improvements with increasing kernel sizes (5x5 to 21x21).
    • The proposed convolutional modulation effectively leverages large kernels for enhanced feature extraction.
    • Conv2Former surpasses popular architectures like Swin Transformer and ConvNeXt across multiple benchmarks.

    Conclusions:

    • Conv2Former offers a simple yet effective approach to visual recognition, outperforming existing state-of-the-art models.
    • The architecture provides a computationally efficient alternative to ViTs for high-resolution image processing.
    • Conv2Former shows strong generalization capabilities in image classification, object detection, and semantic segmentation.