Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Improving Translational Accuracy

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...

Improving Translational Accuracy

Improving Translational Accuracy

Force Classification

Force Classification

Forces play a crucial role in the study of physics and engineering. They are essential in describing the motion, behavior, and equilibrium of objects in the physical world. Forces can be classified based on their origin, type, and direction of action.
Contact and non-contact forces are two of the most widely used categories of forces. As the name suggests, contact forces require physical contact between two objects to act upon each other. Examples of contact forces include frictional,...

Encoding

Encoding

Information enters the brain through encoding, which is the input of information into the memory system. Once sensory information is received from the environment, the brain labels or codes it. The information is then organized with similar information and connected to existing concepts. Encoding occurs through automatic processing and effortful processing.
Automatic processing involves the encoding of details like time, space, frequency, and the meaning of words, usually done without conscious...

Extraction: Advanced Methods

Extraction: Advanced Methods

Metal ions can be separated from one another by complexation with organic ligands–the chelating agent– to form uncharged chelates. Here, the chelating agent must contain hydrophobic groups and behave as a weak acid, losing a proton to bind with the metal. Since most organic ligands used in this process are insoluble or undergo oxidation in the aqueous phase, the chelating agent is initially added to the organic phase and extracted into the aqueous phase. The metal-ligand complex is...

Deconvolution

Deconvolution

Deconvolution, also known as inverse filtering, is the process of extracting the impulse response from known input and output signals. This technique is vital in scenarios where the system's characteristics are unknown, and they must be inferred from the observable signals.
Deconvolution involves several mathematical techniques to derive the impulse response. One common approach is polynomial division. In this method, the input and output sequences are treated as coefficients of...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Correction to: SIRT3 activation protects from nabumetone-induced mitochondrial toxicity in adult human cardiomyocytes.

Cellular and molecular life sciences : CMLS·2026

Same author

Enantioselective Assembly of Planar-Chiral and Axial-Planar-Chiral Macrocycles <i>via De Novo</i> Isoquinolinone Formation.

Organic letters·2026

Same author

The cGAS-STING pathway contributes to cisplatin-induced skeletal muscle atrophy through altered proteostasis and myogenic signaling.

Cell communication and signaling : CCS·2026

Same author

Flat-top vortex pumping enables high-contrast amplification of femtosecond vortex pulse.

Optics express·2026

Same author

CsWRKY6-2/CsWRKY12 module regulates gallated catechin biosynthesis in tea plants.

The Plant journal : for cell and molecular biology·2026

Same author

De Novo Design of Near-Infrared Fluorescence-Activating Proteins.

Journal of the American Chemical Society·2026

Same journal

RETRACTED: Zhang et al. A Novel Framework for Reconstruction and Imaging of Target Scattering Centers via Wide-Angle Incidence in Radar Networks. <i>Sensors</i> 2025, <i>25</i>, 6802.

Sensors (Basel, Switzerland)·2026

Same journal

Enhancing Unsupervised Multi-Source Domain Adaptation for Person Re-Identification via Mixture of Experts and Graph-Based Relation.

Sensors (Basel, Switzerland)·2026

Same journal

Development of an Instrumented Glove for Palmar Pressure Assessment in Kayakers.

Sensors (Basel, Switzerland)·2026

Same journal

Development and Experimental Validation of an Autonomous IoT-Based Monitoring System for Real-Time Water Quality Assessment in the Amazon River.

Sensors (Basel, Switzerland)·2026

Same journal

Semi-Supervised Adversarial Learning Framework for Controller Area Network Bus Intrusion Detection.

Sensors (Basel, Switzerland)·2026

Same journal

Smart Optimization Method for Safety Signs in Innovative Manufacturing Environments Integrating Industrial Field IoT Sensors and Knowledge Graphs.

Sensors (Basel, Switzerland)·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jan 7, 2026

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

Enhancing Scene Text Recognition with Encoder-Decoder Interactive Model.

Yongbin Mu^1,2,3,4, Mieradilijiang Maimaiti^1,2,3,4, Miaomiao Xu^1,2,3,4

¹School of Computer Science and Technology, Xinjiang University, Urumqi 830017, China.

Sensors (Basel, Switzerland)

|December 31, 2025

Summary

This summary is machine-generated.

This study introduces the Encoder-Decoder Interactive Model (EDIM) for scene text recognition. EDIM significantly improves accuracy, especially for distorted text, by enhancing feature extraction and semantic understanding.

Keywords:

encoder-decoder interactive model multi-scale dilated fusion attention scene text recognition sequential encoder-decoder context fusion

More Related Videos

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

Related Experiment Videos

Last Updated: Jan 7, 2026

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

Area of Science:

Computer Vision
Artificial Intelligence
Machine Learning

Background:

Scene text recognition is crucial for applications like autonomous driving and smart retail.
Existing models (CRNN, ViT, PARSeq) face challenges with multi-scale variations, distortions, and complex backgrounds, limiting feature extraction and semantic modeling.
There is a need for improved scene text recognition methods that can handle these complexities effectively.

Purpose of the Study:

To propose a novel scene text recognition model, the Encoder-Decoder Interactive Model (EDIM).
To enhance feature extraction and semantic modeling capabilities in scene text recognition.
To achieve state-of-the-art performance, particularly on irregular and distorted text recognition tasks.

Main Methods:

The proposed Encoder-Decoder Interactive Model (EDIM) utilizes an encoder-decoder framework.
Incorporates a Multi-scale Dilated Fusion Attention (MSFA) module in the encoder for superior multi-scale feature representation.
Features a Sequential Encoder-Decoder Context Fusion (SeqEDCF) mechanism in the decoder for efficient semantic interaction.

Main Results:

EDIM was validated on six regular and irregular benchmark datasets and subsets of Union14M-L.
The model demonstrated superior performance compared to state-of-the-art methods across multiple metrics.
Significant performance gains were observed, particularly in recognizing irregular and distorted scene text.

Conclusions:

The Encoder-Decoder Interactive Model (EDIM) effectively addresses limitations in current scene text recognition methods.
EDIM's novel modules (MSFA and SeqEDCF) enhance feature representation and semantic interaction.
The proposed model represents a significant advancement in scene text recognition accuracy and robustness.