Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Improving Translational Accuracy02:07

Improving Translational Accuracy

14.0K
Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
14.0K
Improving Translational Accuracy02:07

Improving Translational Accuracy

3.5K
3.5K
Force Classification01:22

Force Classification

2.2K
Forces play a crucial role in the study of physics and engineering. They are essential in describing the motion, behavior, and equilibrium of objects in the physical world. Forces can be classified based on their origin, type, and direction of action.
Contact and non-contact forces are two of the most widely used categories of forces. As the name suggests, contact forces require physical contact between two objects to act upon each other. Examples of contact forces include frictional,...
2.2K
Encoding01:19

Encoding

713
Information enters the brain through encoding, which is the input of information into the memory system. Once sensory information is received from the environment, the brain labels or codes it. The information is then organized with similar information and connected to existing concepts. Encoding occurs through automatic processing and effortful processing.
Automatic processing involves the encoding of details like time, space, frequency, and the meaning of words, usually done without conscious...
713
Extraction: Advanced Methods00:56

Extraction: Advanced Methods

1.0K
Metal ions can be separated from one another by complexation with organic ligands–the chelating agent– to form uncharged chelates. Here, the chelating agent must contain hydrophobic groups and behave as a weak acid, losing a proton to bind with the metal. Since most organic ligands used in this process are insoluble or undergo oxidation in the aqueous phase, the chelating agent is initially added to the organic phase and extracted into the aqueous phase. The metal-ligand complex is...
1.0K
Deconvolution01:20

Deconvolution

520
Deconvolution, also known as inverse filtering, is the process of extracting the impulse response from known input and output signals. This technique is vital in scenarios where the system's characteristics are unknown, and they must be inferred from the observable signals.
Deconvolution involves several mathematical techniques to derive the impulse response. One common approach is polynomial division. In this method, the input and output sequences are treated as coefficients of...
520

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Correction to: SIRT3 activation protects from nabumetone-induced mitochondrial toxicity in adult human cardiomyocytes.

Cellular and molecular life sciences : CMLS·2026
Same author

Enantioselective Assembly of Planar-Chiral and Axial-Planar-Chiral Macrocycles <i>via De Novo</i> Isoquinolinone Formation.

Organic letters·2026
Same author

The cGAS-STING pathway contributes to cisplatin-induced skeletal muscle atrophy through altered proteostasis and myogenic signaling.

Cell communication and signaling : CCS·2026
Same author

Flat-top vortex pumping enables high-contrast amplification of femtosecond vortex pulse.

Optics express·2026
Same author

CsWRKY6-2/CsWRKY12 module regulates gallated catechin biosynthesis in tea plants.

The Plant journal : for cell and molecular biology·2026
Same author

De Novo Design of Near-Infrared Fluorescence-Activating Proteins.

Journal of the American Chemical Society·2026
Same journal

RETRACTED: Zhang et al. A Novel Framework for Reconstruction and Imaging of Target Scattering Centers via Wide-Angle Incidence in Radar Networks. <i>Sensors</i> 2025, <i>25</i>, 6802.

Sensors (Basel, Switzerland)·2026
Same journal

Enhancing Unsupervised Multi-Source Domain Adaptation for Person Re-Identification via Mixture of Experts and Graph-Based Relation.

Sensors (Basel, Switzerland)·2026
Same journal

Development of an Instrumented Glove for Palmar Pressure Assessment in Kayakers.

Sensors (Basel, Switzerland)·2026
Same journal

Development and Experimental Validation of an Autonomous IoT-Based Monitoring System for Real-Time Water Quality Assessment in the Amazon River.

Sensors (Basel, Switzerland)·2026
Same journal

Semi-Supervised Adversarial Learning Framework for Controller Area Network Bus Intrusion Detection.

Sensors (Basel, Switzerland)·2026
Same journal

Smart Optimization Method for Safety Signs in Innovative Manufacturing Environments Integrating Industrial Field IoT Sensors and Knowledge Graphs.

Sensors (Basel, Switzerland)·2026
See all related articles

Related Experiment Video

Updated: Jan 7, 2026

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

990

Enhancing Scene Text Recognition with Encoder-Decoder Interactive Model.

Yongbin Mu1,2,3,4, Mieradilijiang Maimaiti1,2,3,4, Miaomiao Xu1,2,3,4

  • 1School of Computer Science and Technology, Xinjiang University, Urumqi 830017, China.

Sensors (Basel, Switzerland)
|December 31, 2025
PubMed
Summary
This summary is machine-generated.

This study introduces the Encoder-Decoder Interactive Model (EDIM) for scene text recognition. EDIM significantly improves accuracy, especially for distorted text, by enhancing feature extraction and semantic understanding.

Keywords:
encoder-decoder interactive modelmulti-scale dilated fusion attentionscene text recognitionsequential encoder-decoder context fusion

More Related Videos

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

983

Related Experiment Videos

Last Updated: Jan 7, 2026

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

990
Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

983

Area of Science:

  • Computer Vision
  • Artificial Intelligence
  • Machine Learning

Background:

  • Scene text recognition is crucial for applications like autonomous driving and smart retail.
  • Existing models (CRNN, ViT, PARSeq) face challenges with multi-scale variations, distortions, and complex backgrounds, limiting feature extraction and semantic modeling.
  • There is a need for improved scene text recognition methods that can handle these complexities effectively.

Purpose of the Study:

  • To propose a novel scene text recognition model, the Encoder-Decoder Interactive Model (EDIM).
  • To enhance feature extraction and semantic modeling capabilities in scene text recognition.
  • To achieve state-of-the-art performance, particularly on irregular and distorted text recognition tasks.

Main Methods:

  • The proposed Encoder-Decoder Interactive Model (EDIM) utilizes an encoder-decoder framework.
  • Incorporates a Multi-scale Dilated Fusion Attention (MSFA) module in the encoder for superior multi-scale feature representation.
  • Features a Sequential Encoder-Decoder Context Fusion (SeqEDCF) mechanism in the decoder for efficient semantic interaction.

Main Results:

  • EDIM was validated on six regular and irregular benchmark datasets and subsets of Union14M-L.
  • The model demonstrated superior performance compared to state-of-the-art methods across multiple metrics.
  • Significant performance gains were observed, particularly in recognizing irregular and distorted scene text.

Conclusions:

  • The Encoder-Decoder Interactive Model (EDIM) effectively addresses limitations in current scene text recognition methods.
  • EDIM's novel modules (MSFA and SeqEDCF) enhance feature representation and semantic interaction.
  • The proposed model represents a significant advancement in scene text recognition accuracy and robustness.