Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Video

Updated: Sep 2, 2025

Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention
06:37

Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention

Published on: December 15, 2023

4.0K

AAF-Net: Scene text detection based on attention aggregation features.

Mengmeng Chen1,2, Mayire Ibrayim1,2, Askar Hamdulla1,2

  • 1College of Information Science and Engineering, Xinjiang University, Urumqi, China.

Plos One
|August 5, 2022
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

Association Areas of the Cortex01:21

Association Areas of the Cortex

6.1K
Association areas are regions of the cerebral cortex that do not have a specific sensory or motor function. Instead, they integrate and interpret information from various sources to enable higher cognitive processes such as memory, learning, and decision-making. Some key association areas include the following:
Prefrontal Association Area: This area is located in the frontal lobe and is involved in planning, decision-making, and moderating social behavior. It connects with primary motor areas,...
6.1K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

DGCFNet: Dual Global Context Fusion Network for remote sensing image semantic segmentation.

PeerJ. Computer science·2025
Same author

Multimodal false information detection method based on Text-CNN and SE module.

PloS one·2022
Same author

Scene Text Detection Based on Two-Branch Feature Extraction.

Sensors (Basel, Switzerland)·2022
Same author

ARDformer: Agroforestry Road Detection for Autonomous Driving Using Hierarchical Transformer.

Sensors (Basel, Switzerland)·2022
Same author

Detection of Pine Wilt Nematode from Drone Images Using UAV.

Sensors (Basel, Switzerland)·2022
Same author

Infrared Small Target Detection Using Regional Feature Difference of Patch Image.

Sensors (Basel, Switzerland)·2022
Same journal

Thymidylate synthase inhibitory drugs induce p53-dependent pathways differently.

PloS one·2026
Same journal

Top-down and bottom-up attention for joint pattern classification and reconstruction.

PloS one·2026
Same journal

Short- and long-term scaling behavior of blood pressure and pulse arrival time during sleep in healthy controls and patients with obstructive sleep apnea.

PloS one·2026
Same journal

Double DQN-based secrecy energy efficiency and fairness performance in IRS-assisted NOMA systems with friendly jamming.

PloS one·2026
Same journal

10 recommendations for strengthening citizen science for improved societal and ecological outcomes: A co-produced analysis of challenges and opportunities in the 21st century.

PloS one·2026
Same journal

Paying in public: Peer effects, impression management, and willingness to pay on digital payment platforms.

PloS one·2026
See all related articles

This study introduces a novel Cross-Scale Attention Aggregation Feature Pyramid Network (CSAA-FPN) for improved scene text detection. The CSAA-FPN enhances feature representation, accurately detecting small and adjacent text instances, outperforming existing methods.

Area of Science:

  • Computer Vision
  • Artificial Intelligence
  • Machine Learning

Background:

  • Scene text detection faces challenges with small, multi-directional, and adjacent text instances due to neural network receptive field limitations.
  • Existing methods often struggle with low detection rates and false positives for complex text arrangements.

Purpose of the Study:

  • To propose a new feature pyramid network, the Cross-Scale Attention Aggregation Feature Pyramid Network (CSAA-FPN), for enhanced scene text detection.
  • To address limitations in detecting small, arbitrarily oriented, and closely packed text instances.

Main Methods:

  • Developed a novel Cross-Scale Attention Aggregation Feature Pyramid Network (CSAA-FPN).
  • Incorporated an Attention Aggregation Feature Module (AAFM) to enhance features and handle multi-scale information.

More Related Videos

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique
04:48

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Published on: July 5, 2024

488
Methods to Test Visual Attention Online
09:44

Methods to Test Visual Attention Online

Published on: February 19, 2015

12.0K

Related Experiment Videos

Last Updated: Sep 2, 2025

Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention
06:37

Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention

Published on: December 15, 2023

4.0K
Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique
04:48

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Published on: July 5, 2024

488
Methods to Test Visual Attention Online
09:44

Methods to Test Visual Attention Online

Published on: February 19, 2015

12.0K
  • Utilized CBAM attention module for focused feature extraction and an Adaptive Fusion Module (AFM) for feature refinement.
  • Main Results:

    • The proposed CSAA-FPN effectively enhances features, improving the detection of small and adjacent text instances.
    • Experiments on CTW1500, Total-Text, ICDAR2015, and MSRA-TD500 datasets demonstrate the model's superior performance.

    Conclusions:

    • The CSAA-FPN model offers a significant advancement in scene text detection accuracy.
    • The integration of attention mechanisms and adaptive fusion effectively tackles challenges posed by complex text scenes.