Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Super-resolution Fluorescence Microscopy01:37

Super-resolution Fluorescence Microscopy

6.8K
Super-resolution fluorescence microscopy (SRFM) provides a better resolution than conventional fluorescence microscopy by reducing the point spread function (PSF). PSF is the light intensity distribution from a point that causes it to appear blurred. Due to PSF, each fluorescing point appears bigger than its actual size, and it is the PSF interference of nearby fluorophores that causes the blurred image. Various approaches to achieving higher resolution through SRFM have recently been...
6.8K
Upsampling01:22

Upsampling

171
Managing signal sampling rates is essential in digital signal processing to maintain signal integrity. A decimated signal, characterized by a reduced frequency range due to its lower sampling rate, can be upsampled by inserting zeros between each sample. This upsampling process expands the original spectrum and introduces repeated spectral replicas at intervals dictated by the new Nyquist frequency. To refine this zero-inserted sequence, it is passed through a lowpass filter with a cutoff...
171
Downsampling01:20

Downsampling

112
When considering a sampled sequence with zero values between sampling instants, one can replace it by taking every N-th value of the sequence. At these integer multiples of N, the original and sampled sequences coincide. This process, known as decimation, involves extracting every N-th sample from a sequence, thereby creating a more efficient sequence.
The Fourier transform of the decimated sequence reveals a combination of scaled and shifted versions of the original spectrum. This...
112
Superposition Theorem01:18

Superposition Theorem

489
The superposition principle is a fundamental concept stating that in a linear circuit, the voltage across (or current through) an element can be determined by summing the individual contributions of each independent source acting in isolation. When dealing with linear circuits containing multiple independent sources, this principle serves as a valuable tool for analysis. To apply the superposition principle effectively, one should focus on a single independent source at a time while...
489
Cross Product01:25

Cross Product

190
The cross product is a fundamental concept in vector algebra that is a vector operation on two different vectors to obtain a third vector. Unlike the scalar product, the cross product results in a vector quantity perpendicular to both the original vectors.
The magnitude of the cross product is obtained by multiplying the magnitude of both the vectors and the sine of the angle between them. This means that a larger angle between the vectors will lead to a greater magnitude of the cross product.
190
Deconvolution01:20

Deconvolution

116
Deconvolution, also known as inverse filtering, is the process of extracting the impulse response from known input and output signals. This technique is vital in scenarios where the system's characteristics are unknown, and they must be inferred from the observable signals.
Deconvolution involves several mathematical techniques to derive the impulse response. One common approach is polynomial division. In this method, the input and output sequences are treated as coefficients of...
116

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

FreqPose: Frequency-Aware Diffusion with Fractional Gabor Filters and Global Pose-Semantic Alignment.

Sensors (Basel, Switzerland)·2026
Same author

CLIP-RL: Closed-Loop Video Inpainting with Detection-Guided Reinforcement Learning.

Sensors (Basel, Switzerland)·2026
Same author

Micro-Expression Recognition via LoRA-Enhanced DinoV2 and Interactive Spatio-Temporal Modeling.

Sensors (Basel, Switzerland)·2026
Same author

A Semi-Supervised Object Detector Based on Adaptive Weighted Active Learning and Orthogonal Data Augmentation.

Sensors (Basel, Switzerland)·2025
Same author

SP-IGAN: An Improved GAN Framework for Effective Utilization of Semantic Priors in Real-World Image Super-Resolution.

Entropy (Basel, Switzerland)·2025
Same author

SCFusion: Infrared and Visible Fusion Based on Salient Compensation.

Entropy (Basel, Switzerland)·2023
Same journal

RETRACTED: Zhang et al. A Novel Framework for Reconstruction and Imaging of Target Scattering Centers via Wide-Angle Incidence in Radar Networks. <i>Sensors</i> 2025, <i>25</i>, 6802.

Sensors (Basel, Switzerland)·2026
Same journal

Enhancing Unsupervised Multi-Source Domain Adaptation for Person Re-Identification via Mixture of Experts and Graph-Based Relation.

Sensors (Basel, Switzerland)·2026
Same journal

Development of an Instrumented Glove for Palmar Pressure Assessment in Kayakers.

Sensors (Basel, Switzerland)·2026
Same journal

Development and Experimental Validation of an Autonomous IoT-Based Monitoring System for Real-Time Water Quality Assessment in the Amazon River.

Sensors (Basel, Switzerland)·2026
Same journal

Semi-Supervised Adversarial Learning Framework for Controller Area Network Bus Intrusion Detection.

Sensors (Basel, Switzerland)·2026
Same journal

Smart Optimization Method for Safety Signs in Innovative Manufacturing Environments Integrating Industrial Field IoT Sensors and Knowledge Graphs.

Sensors (Basel, Switzerland)·2026
See all related articles

Related Experiment Video

Updated: May 14, 2025

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique
04:48

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Published on: July 5, 2024

322

Single-Character-Based Embedding Feature Aggregation Using Cross-Attention for Scene Text Super-Resolution.

Meng Wang1, Qianqian Li1, Haipeng Liu1

  • 1School of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, China.

Sensors (Basel, Switzerland)
|April 12, 2025
PubMed
Summary
This summary is machine-generated.

This study introduces a novel method for scene text super-resolution (STSR) using single-character embeddings and cross-attention. The approach enhances text readability in complex backgrounds, improving recognition accuracy on benchmarks.

Keywords:
cross-attentioncross-fertilizationscene text image super-resolutiontext recognition

More Related Videos

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

436
Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

465

Related Experiment Videos

Last Updated: May 14, 2025

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique
04:48

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Published on: July 5, 2024

322
Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

436
Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

465

Area of Science:

  • Computer Vision
  • Artificial Intelligence
  • Image Processing

Background:

  • Scene text super-resolution (STSR) aims to improve text quality for better readability and downstream tasks.
  • Challenges in STSR include character ambiguity and interference from complex backgrounds, especially with tightly connected characters.

Purpose of the Study:

  • To propose a single-character-based embedding feature aggregation method using cross-attention for scene text super-resolution (SCE-STISR).
  • To address the challenges of character ambiguity and background interference in complex scene text images.

Main Methods:

  • Employs a dynamic feature extraction mechanism with adaptive multi-scale feature weights.
  • Introduces a dual-level cross-attention mechanism for aggregating single-character features with textual priors and aligning visual-semantic information.
  • Applies adaptive normalized color correction to reduce background-induced color distortion.

Main Results:

  • Achieved improved text recognition accuracies of 0.9-1.4% over the baseline TATT on the TextZoom benchmark.
  • Obtained an optimal SSIM value of 0.7951 and a PSNR of 21.84 on TextZoom.
  • Demonstrated accuracy improvements of 0.2-2.2% over existing baselines on five text recognition datasets.

Conclusions:

  • The proposed SCE-STISR method effectively enhances scene text super-resolution by addressing character ambiguity and background interference.
  • The approach validates the effectiveness of single-character embedding aggregation and cross-attention for improving text recognition accuracy in challenging scenarios.