Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Video

Updated: Jun 25, 2025

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images
04:23

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

Published on: April 21, 2023

1.8K

Adaptive sparse attention-based compact transformer for object tracking.

Fei Pan1, Lianyu Zhao1, Chenglin Wang2

  • 1School of Computer Science and Engineering, Tianjin University of Technology, Liqizhaung street, Tianjin, 300384, China.

Scientific Reports
|May 28, 2024
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Antibacterial, In Vitro Anti-Inflammatory and Anti-Acne Activities of <i>Patchouli</i> Essential Oil.

Pharmaceuticals (Basel, Switzerland)·2026
Same author

Single-cell analyses identify the ginseng embryonic protoderm as a native compartment for high-efficiency ginsenoside production.

Nature communications·2026
Same author

[Analysis of complicating factors in endodontic microsurgery].

Hua xi kou qiang yi xue za zhi = Huaxi kouqiang yixue zazhi = West China journal of stomatology·2026
Same author

Impact of aerosol acidity on the heterogeneous chemistry of halogenated phenols.

Environmental pollution (Barking, Essex : 1987)·2026
Same author

Network toxicology and molecular docking identify BRCA1 as a functional target of the dietary carcinogen PhIP in colorectal cancer.

Discover oncology·2026
Same author

Lysosome-related biomarkers in peripheral blood immune cells discriminate sepsis from SIRS.

Scientific reports·2026
Same journal

Integrated multi-assessment and structural performance index framework for stacking-sequence optimisation of natural fibre reinforced laminates.

Scientific reports·2026
Same journal

SuperiorGAT: graph attention networks for sparse LiDAR point cloud reconstruction in autonomous systems.

Scientific reports·2026
Same journal

The effect of stretching the pectoralis major, sternocleidomastoid, and iliopsoas muscles on 800 m swimming performance in master swimmers.

Scientific reports·2026
Same journal

ISNR-PQC: isometry noise resilience post quantum cryptography primitive.

Scientific reports·2026
Same journal

Identification of high-yielding and stable genotypes of barley in the cold climate of Iran using AMMI and GGE biplot models.

Scientific reports·2026
Same journal

Bayesian negative binomial modelling of spatial and temporal patterns of road traffic deaths in Ghana.

Scientific reports·2026
See all related articles

This study introduces ASACTT, a novel object tracking method that enhances global information capture and target-specific attention. ASACTT achieves state-of-the-art performance with improved efficiency and robustness in object tracking.

Area of Science:

  • Computer Vision
  • Artificial Intelligence
  • Machine Learning

Background:

  • Transformer-based Siamese networks are effective for object tracking but struggle with global information capture and feature representation due to ResNet backbones.
  • Existing methods face challenges in attending to target-relevant information using multi-head self-attention (MSA) and exhibit robustness issues and high model complexity during online tracking.

Purpose of the Study:

  • To address the limitations of current object tracking methods by proposing a novel tracker, ASACTT.
  • To enhance global information extraction, improve target-specific attention, and ensure robust tracking with adaptive appearance variations.

Main Methods:

  • Improved Swin-Transformer-Tiny backbone for enhanced global information extraction.
  • Adaptive Sparse Attention (ASA) mechanism to focus on target-specific details within the search region.
Keywords:
Adaptive sparse attentionObject trackingSiamese networkTransformer

More Related Videos

A Methodology for Capturing Joint Visual Attention Using Mobile Eye-Trackers
12:39

A Methodology for Capturing Joint Visual Attention Using Mobile Eye-Trackers

Published on: January 18, 2020

7.6K
Methods to Test Visual Attention Online
09:44

Methods to Test Visual Attention Online

Published on: February 19, 2015

11.8K

Related Experiment Videos

Last Updated: Jun 25, 2025

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images
04:23

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

Published on: April 21, 2023

1.8K
A Methodology for Capturing Joint Visual Attention Using Mobile Eye-Trackers
12:39

A Methodology for Capturing Joint Visual Attention Using Mobile Eye-Trackers

Published on: January 18, 2020

7.6K
Methods to Test Visual Attention Online
09:44

Methods to Test Visual Attention Online

Published on: February 19, 2015

11.8K
  • Dynamic Template Updater (DTU) using position encoding and historical data for adaptive appearance tracking and reduced complexity.
  • Main Results:

    • ASACTT demonstrates high comparability to state-of-the-art methods across five benchmark datasets.
    • Achieved an outstanding success score of 75.3% at 36 FPS on the GOT-10K dataset.
    • Significantly surpassed other trackers with comparable model parameters, indicating improved accuracy and efficiency.

    Conclusions:

    • The proposed ASACTT tracker effectively overcomes limitations in global information capture and target attention in Transformer-based Siamese networks.
    • ASACTT offers a robust and efficient solution for object tracking, balancing accuracy with reduced model complexity.
    • The adaptive sparse attention and dynamic template updater contribute to superior performance and adaptability in diverse tracking scenarios.