Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

AnyDesign: Versatile area fashion editing via mask-free diffusion.

Neural networks : the official journal of the International Neural Network Society·2026
Same author

Continual Instruction Tuning for Large Multimodal Models.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same author

Adversarial discriminant attack on text-to-image diffusion models.

Neural networks : the official journal of the International Neural Network Society·2026
Same author

Generation of surgical reports for lymph node dissection during laparoscopic gastric cancer surgery based on artificial intelligence.

International journal of computer assisted radiology and surgery·2025
Same author

Hierarchical Contrastive Learning for Semantic Segmentation.

IEEE transactions on neural networks and learning systems·2025
Same author

MoS<sub>2</sub> quantum dot-decorated CNT networks as a sulfur host for enhanced electrochemical kinetics in advanced lithium-sulfur batteries.

Nanoscale advances·2024
Same journal

Change-Prior-Guided Unsupervised Change Detection of Heterogeneous Remote Sensing Images.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

AgonicDreamer: Enhancing Multi-View Consistency in Text-to-3D Generation via Rectified Score Distillation.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

BiCM-Prompt: Bidirectional Cross-Modal Prompt Tuning for Class-Incremental Learning on Multisource Remote Sensing Images.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

GoP-based Quality Enhancement on Video Compression.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

Align then Tensorize: Multi-Level Consistent Anchor Graph Learning for Scalable Multi-View Clustering.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

Beyond Fidelity: Diverse Image Synthesis via Retrieval-Augmented Diffusion.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
See all related articles

Related Experiment Video

Updated: Nov 20, 2025

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

807

Semi-Supervised Scene Text Recognition.

Yunze Gao, Yingying Chen, Jinqiao Wang

    IEEE Transactions on Image Processing : a Publication of the IEEE Signal Processing Society
    |January 20, 2021
    PubMed
    Summary
    This summary is machine-generated.

    This study introduces a new semi-supervised method for scene text recognition, reducing the need for extensive labeled data. The approach uses novel rewards to improve accuracy with unlabeled images, making it efficient for low-resource languages.

    More Related Videos

    Objectification of Tongue Diagnosis in Traditional Medicine, Data Analysis, and Study Application
    05:56

    Objectification of Tongue Diagnosis in Traditional Medicine, Data Analysis, and Study Application

    Published on: April 14, 2023

    2.9K
    Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment
    08:25

    Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

    Published on: May 7, 2019

    9.3K

    Related Experiment Videos

    Last Updated: Nov 20, 2025

    Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
    03:31

    Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

    Published on: December 15, 2023

    807
    Objectification of Tongue Diagnosis in Traditional Medicine, Data Analysis, and Study Application
    05:56

    Objectification of Tongue Diagnosis in Traditional Medicine, Data Analysis, and Study Application

    Published on: April 14, 2023

    2.9K
    Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment
    08:25

    Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

    Published on: May 7, 2019

    9.3K

    Area of Science:

    • Computer Vision
    • Machine Learning
    • Natural Language Processing

    Background:

    • Supervised methods dominate scene text recognition but require costly labeled data.
    • Character-level or pixel-wise annotations are often needed, increasing data preparation effort.
    • Unlabeled data is abundant, especially for low-resource languages, presenting an opportunity for improvement.

    Purpose of the Study:

    • To develop a novel semi-supervised method for scene text recognition.
    • To reduce reliance on expensive labeled data by leveraging unlabeled data.
    • To improve recognition performance and reduce annotation effort.

    Main Methods:

    • Proposed a semi-supervised approach for scene text recognition.
    • Introduced two global metrics: edit reward and embedding reward.
    • Utilized reinforcement learning to optimize these rewards for generated text quality.

    Main Results:

    • The edit reward measures the discrepancy between ground truth and generated text.
    • The embedding reward assesses similarity between image and text features in a common space.
    • Experimental evaluations on challenging benchmarks demonstrated significant effectiveness.

    Conclusions:

    • The proposed semi-supervised method effectively utilizes unlabeled data for scene text recognition.
    • The approach significantly reduces annotation effort while maintaining competitive performance.
    • This method offers a viable solution for low-resource languages and reduces data labeling costs.