Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Super-resolution Fluorescence Microscopy

Super-resolution Fluorescence Microscopy

Super-resolution fluorescence microscopy (SRFM) provides a better resolution than conventional fluorescence microscopy by reducing the point spread function (PSF). PSF is the light intensity distribution from a point that causes it to appear blurred. Due to PSF, each fluorescing point appears bigger than its actual size, and it is the PSF interference of nearby fluorophores that causes the blurred image. Various approaches to achieving higher resolution through SRFM have recently been...

Upsampling

Upsampling

Managing signal sampling rates is essential in digital signal processing to maintain signal integrity. A decimated signal, characterized by a reduced frequency range due to its lower sampling rate, can be upsampled by inserting zeros between each sample. This upsampling process expands the original spectrum and introduces repeated spectral replicas at intervals dictated by the new Nyquist frequency. To refine this zero-inserted sequence, it is passed through a lowpass filter with a cutoff...

Downsampling

Downsampling

When considering a sampled sequence with zero values between sampling instants, one can replace it by taking every N-th value of the sequence. At these integer multiples of N, the original and sampled sequences coincide. This process, known as decimation, involves extracting every N-th sample from a sequence, thereby creating a more efficient sequence.
The Fourier transform of the decimated sequence reveals a combination of scaled and shifted versions of the original spectrum. This...

Superposition Theorem

Superposition Theorem

The superposition principle is a fundamental concept stating that in a linear circuit, the voltage across (or current through) an element can be determined by summing the individual contributions of each independent source acting in isolation. When dealing with linear circuits containing multiple independent sources, this principle serves as a valuable tool for analysis. To apply the superposition principle effectively, one should focus on a single independent source at a time while...

Cross Product

Cross Product

The cross product is a fundamental concept in vector algebra that is a vector operation on two different vectors to obtain a third vector. Unlike the scalar product, the cross product results in a vector quantity perpendicular to both the original vectors.
The magnitude of the cross product is obtained by multiplying the magnitude of both the vectors and the sine of the angle between them. This means that a larger angle between the vectors will lead to a greater magnitude of the cross product.

Deconvolution

Deconvolution

Deconvolution, also known as inverse filtering, is the process of extracting the impulse response from known input and output signals. This technique is vital in scenarios where the system's characteristics are unknown, and they must be inferred from the observable signals.
Deconvolution involves several mathematical techniques to derive the impulse response. One common approach is polynomial division. In this method, the input and output sequences are treated as coefficients of...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

FreqPose: Frequency-Aware Diffusion with Fractional Gabor Filters and Global Pose-Semantic Alignment.

Sensors (Basel, Switzerland)·2026

Same author

CLIP-RL: Closed-Loop Video Inpainting with Detection-Guided Reinforcement Learning.

Sensors (Basel, Switzerland)·2026

Same author

Micro-Expression Recognition via LoRA-Enhanced DinoV2 and Interactive Spatio-Temporal Modeling.

Sensors (Basel, Switzerland)·2026

Same author

A Semi-Supervised Object Detector Based on Adaptive Weighted Active Learning and Orthogonal Data Augmentation.

Sensors (Basel, Switzerland)·2025

Same author

SP-IGAN: An Improved GAN Framework for Effective Utilization of Semantic Priors in Real-World Image Super-Resolution.

Entropy (Basel, Switzerland)·2025

Same author

SCFusion: Infrared and Visible Fusion Based on Salient Compensation.

Entropy (Basel, Switzerland)·2023

Same journal

RETRACTED: Zhang et al. A Novel Framework for Reconstruction and Imaging of Target Scattering Centers via Wide-Angle Incidence in Radar Networks. <i>Sensors</i> 2025, <i>25</i>, 6802.

Sensors (Basel, Switzerland)·2026

Same journal

Enhancing Unsupervised Multi-Source Domain Adaptation for Person Re-Identification via Mixture of Experts and Graph-Based Relation.

Sensors (Basel, Switzerland)·2026

Same journal

Development of an Instrumented Glove for Palmar Pressure Assessment in Kayakers.

Sensors (Basel, Switzerland)·2026

Same journal

Development and Experimental Validation of an Autonomous IoT-Based Monitoring System for Real-Time Water Quality Assessment in the Amazon River.

Sensors (Basel, Switzerland)·2026

Same journal

Semi-Supervised Adversarial Learning Framework for Controller Area Network Bus Intrusion Detection.

Sensors (Basel, Switzerland)·2026

Same journal

Smart Optimization Method for Safety Signs in Innovative Manufacturing Environments Integrating Industrial Field IoT Sensors and Knowledge Graphs.

Sensors (Basel, Switzerland)·2026

See all related articles

Search research articles

Related Experiment Video

Updated: May 14, 2025

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Published on: July 5, 2024

Single-Character-Based Embedding Feature Aggregation Using Cross-Attention for Scene Text Super-Resolution.

Meng Wang¹, Qianqian Li¹, Haipeng Liu¹

¹School of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, China.

Sensors (Basel, Switzerland)

|April 12, 2025

Summary

This summary is machine-generated.

This study introduces a novel method for scene text super-resolution (STSR) using single-character embeddings and cross-attention. The approach enhances text readability in complex backgrounds, improving recognition accuracy on benchmarks.

Keywords:

cross-attention cross-fertilization scene text image super-resolution text recognition

More Related Videos

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

Related Experiment Videos

Last Updated: May 14, 2025

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Published on: July 5, 2024

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

Area of Science:

Computer Vision
Artificial Intelligence
Image Processing

Background:

Scene text super-resolution (STSR) aims to improve text quality for better readability and downstream tasks.
Challenges in STSR include character ambiguity and interference from complex backgrounds, especially with tightly connected characters.

Purpose of the Study:

To propose a single-character-based embedding feature aggregation method using cross-attention for scene text super-resolution (SCE-STISR).
To address the challenges of character ambiguity and background interference in complex scene text images.

Main Methods:

Employs a dynamic feature extraction mechanism with adaptive multi-scale feature weights.
Introduces a dual-level cross-attention mechanism for aggregating single-character features with textual priors and aligning visual-semantic information.
Applies adaptive normalized color correction to reduce background-induced color distortion.

Main Results:

Achieved improved text recognition accuracies of 0.9-1.4% over the baseline TATT on the TextZoom benchmark.
Obtained an optimal SSIM value of 0.7951 and a PSNR of 21.84 on TextZoom.
Demonstrated accuracy improvements of 0.2-2.2% over existing baselines on five text recognition datasets.

Conclusions:

The proposed SCE-STISR method effectively enhances scene text super-resolution by addressing character ambiguity and background interference.
The approach validates the effectiveness of single-character embedding aggregation and cross-attention for improving text recognition accuracy in challenging scenarios.