Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Determining chronicity and frequency of histologic lung lesions in feedyard cattle mortalities.

Journal of veterinary diagnostic investigation : official publication of the American Association of Veterinary Laboratory Diagnosticians, Inc·2026
Same author

Calcinosis Circumscripta with Iron Mineralization in an African Green Monkey (Chlorocebus aethiops sabaeus).

Journal of the American Association for Laboratory Animal Science : JAALAS·2025
Same author

Ocular and perineal squamous cell carcinomas in a Holstein Friesian cow.

Open veterinary journal·2024
Same author

Impact of psychiatric disorders on the risk of glioma: Mendelian randomization and biological annotation.

Journal of affective disorders·2024
Same author

Conv2Former: A Simple Transformer-Style ConvNet for Visual Recognition.

IEEE transactions on pattern analysis and machine intelligence·2024
Same author

Factorization Vision Transformer: Modeling Long-Range Dependency With Local Window Cost.

IEEE transactions on neural networks and learning systems·2023
Same journal

Relation DETR+: Exploring Explicit Position Relation Prior for Dense Prediction.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

RBF++: Quantifying and Optimizing Reasoning Boundaries across Measurable and Unmeasurable Capabilities for Chain-of-Thought Reasoning.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

CAFE: Cross-View Adaptive Fusion and Cluster Center Enhancement for Robust Multi-View Clustering.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

DIVER: Reinforced Diffusion Breaks Imitation Bottlenecks in End-to-End Autonomous Driving.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Ethics-Aware Safe Reinforcement Learning for Rare-Event Risk Control in Interactive Urban Driving.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Learning Shape Anchors for Holistic Indoor Scene Understanding.

IEEE transactions on pattern analysis and machine intelligence·2026
See all related articles

Related Experiment Video

Updated: Jun 12, 2025

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique
04:48

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Published on: July 5, 2024

376

Low-Resolution Self-Attention for Semantic Segmentation.

Yu-Huan Wu, Shi-Chen Zhang, Yun Liu

    IEEE Transactions on Pattern Analysis and Machine Intelligence
    |June 10, 2025
    PubMed
    Summary
    This summary is machine-generated.

    This study introduces a novel Low-Resolution Self-Attention (LRSA) mechanism for efficient semantic segmentation. The LRFormer model significantly reduces computational cost while achieving state-of-the-art performance on benchmark datasets.

    More Related Videos

    Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
    03:31

    Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

    Published on: December 15, 2023

    483
    Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
    04:48

    Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

    Published on: November 30, 2022

    2.7K

    Related Experiment Videos

    Last Updated: Jun 12, 2025

    Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique
    04:48

    Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

    Published on: July 5, 2024

    376
    Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
    03:31

    Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

    Published on: December 15, 2023

    483
    Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
    04:48

    Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

    Published on: November 30, 2022

    2.7K

    Area of Science:

    • Computer Vision
    • Artificial Intelligence
    • Machine Learning

    Background:

    • Semantic segmentation requires high-resolution details for pixel accuracy and global context for class prediction.
    • Existing vision transformers face computational bottlenecks due to high-resolution context modeling.

    Purpose of the Study:

    • To introduce a computationally efficient mechanism for capturing global context in semantic segmentation.
    • To develop a vision transformer that addresses the computational limitations of current models.

    Main Methods:

    • Developed the Low-Resolution Self-Attention (LRSA) mechanism for global context modeling.
    • Computed self-attention in a fixed low-resolution space, augmented with depth-wise convolutions for high-resolution details.
    • Built the LRFormer, an encoder-decoder vision transformer incorporating the LRSA mechanism.

    Main Results:

    • The LRFormer model demonstrated superior performance compared to state-of-the-art methods.
    • Achieved significant reductions in computational cost (FLOPs) through the LRSA mechanism.
    • Validated effectiveness across diverse datasets: ADE20K, COCO-Stuff, and Cityscapes.

    Conclusions:

    • The LRSA mechanism offers an effective and efficient approach to semantic segmentation.
    • LRFormer presents a promising alternative to existing vision transformers for high-performance, low-computation tasks.
    • The proposed method advances the field of efficient deep learning for computer vision.