Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Parallel Processing01:20

Parallel Processing

143
The brain processes sensory information rapidly due to parallel processing, which involves sending data across multiple neural pathways at the same time. This method allows the brain to manage various sensory qualities, such as shapes, colors, movements, and locations, all concurrently. For instance, when observing a forest landscape, the brain simultaneously processes the movement of leaves, the shapes of trees, the depth between them, and the various shades of green. This enables a quick and...
143
Visual System01:26

Visual System

475
Light enters the eye through the cornea, a transparent, dome-shaped surface covering the surface of the eyeball that helps to direct and focus incoming light. This light is then channeled toward the pupil, an adjustable opening whose size is controlled by the iris. The iris, a pigmented muscle, regulates the amount of light entering the eye by contracting or dilating the pupil, thereby ensuring optimal light levels for clear vision.
Once through the pupil, the light passes through the lens, a...
475
Depth Perception and Spatial Vision01:15

Depth Perception and Spatial Vision

508
Depth perception is the ability to perceive objects three-dimensionally. It relies on two types of cues: binocular and monocular. Binocular cues depend on the combination of images from both eyes and how the eyes work together. Since the eyes are in slightly different positions, each eye captures a slightly different image. This disparity between images, known as binocular disparity, helps the brain interpret depth. When the brain compares these images, it determines the distance to an object.
508
Vision01:24

Vision

52.9K
Vision is the result of light being detected and transduced into neural signals by the retina of the eye. This information is then further analyzed and interpreted by the brain. First, light enters the front of the eye and is focused by the cornea and lens onto the retina—a thin sheet of neural tissue lining the back of the eye. Because of refraction through the convex lens of the eye, images are projected onto the retina upside-down and reversed.
52.9K
Perception01:28

Perception

429
Perception is a fundamental psychological process that enables individuals to organize, interpret, and consciously experience sensory information. This process is crucial for understanding and interacting with the world around us. It includes both bottom-up and top-down processing, each playing a distinct role in how we perceive our environment.
Bottom-up processing begins at the sensory level, where receptors detect external environmental stimuli. These could include the tactile sensation of...
429
Gestalt Principles of Perception01:21

Gestalt Principles of Perception

269
Gestalt principles provide a framework for understanding how humans perceive objects as unified wholes within their context. These principles are essential in explaining the cognitive processes that make sense of complex visual stimuli by organizing them into coherent groups. One fundamental principle is proximity, which posits that objects located close to each other are perceived as a collective group. For instance, when dots are positioned near one another, the visual system interprets them...
269

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same authorSame journal

DIVER: Reinforced Diffusion Breaks Imitation Bottlenecks in End-to-End Autonomous Driving.

IEEE transactions on pattern analysis and machine intelligence·2026
Same author

Event-Aware Instructed Assistant for Referring Video Segmentation.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same author

Network Pharmacology and Experimental Verification to Explore Cinnamomi Cortex against Steroid-induced Osteonecrosis of the Femoral Head.

Journal of visualized experiments : JoVE·2026
Same author

A modified Rim APHE criterion to improve LI-RADS diagnostic performance for primary liver malignancies.

Abdominal radiology (New York)·2026
Same author

Three-dimensional image hierarchical encryption method based on structured light holography and chained iris keys.

Applied optics·2026
Same author

Optical image encryption using multimodal biometric keys under the framework of phase-shifting digital holography.

Applied optics·2026
Same journal

Relation DETR+: Exploring Explicit Position Relation Prior for Dense Prediction.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

RBF++: Quantifying and Optimizing Reasoning Boundaries across Measurable and Unmeasurable Capabilities for Chain-of-Thought Reasoning.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

CAFE: Cross-View Adaptive Fusion and Cluster Center Enhancement for Robust Multi-View Clustering.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Ethics-Aware Safe Reinforcement Learning for Rare-Event Risk Control in Interactive Urban Driving.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Learning Shape Anchors for Holistic Indoor Scene Understanding.

IEEE transactions on pattern analysis and machine intelligence·2026
See all related articles

Related Experiment Video

Updated: May 24, 2025

Eye Tracking During Visually Situated Language Comprehension: Flexibility and Limitations in Uncovering Visual Context Effects
07:36

Eye Tracking During Visually Situated Language Comprehension: Flexibility and Limitations in Uncovering Visual Context Effects

Published on: November 30, 2018

15.6K

Context Perception Parallel Decoder for Scene Text Recognition.

Yongkun Du, Zhineng Chen, Caiyan Jia

    IEEE Transactions on Pattern Analysis and Machine Intelligence
    |March 3, 2025
    PubMed
    Summary
    This summary is machine-generated.

    This study introduces a Context Perception Parallel Decoder (CPPD) for scene text recognition (STR). CPPD achieves high accuracy and fast inference speeds, outperforming existing methods in both English and Chinese text recognition tasks.

    More Related Videos

    Using the Visual World Paradigm to Study Sentence Comprehension in Mandarin-Speaking Children with Autism
    06:15

    Using the Visual World Paradigm to Study Sentence Comprehension in Mandarin-Speaking Children with Autism

    Published on: October 3, 2018

    7.6K
    Using Eye Movements Recorded in the Visual World Paradigm to Explore the Online Processing of Spoken Language
    09:27

    Using Eye Movements Recorded in the Visual World Paradigm to Explore the Online Processing of Spoken Language

    Published on: October 13, 2018

    9.9K

    Related Experiment Videos

    Last Updated: May 24, 2025

    Eye Tracking During Visually Situated Language Comprehension: Flexibility and Limitations in Uncovering Visual Context Effects
    07:36

    Eye Tracking During Visually Situated Language Comprehension: Flexibility and Limitations in Uncovering Visual Context Effects

    Published on: November 30, 2018

    15.6K
    Using the Visual World Paradigm to Study Sentence Comprehension in Mandarin-Speaking Children with Autism
    06:15

    Using the Visual World Paradigm to Study Sentence Comprehension in Mandarin-Speaking Children with Autism

    Published on: October 3, 2018

    7.6K
    Using Eye Movements Recorded in the Visual World Paradigm to Explore the Online Processing of Spoken Language
    09:27

    Using Eye Movements Recorded in the Visual World Paradigm to Explore the Online Processing of Spoken Language

    Published on: October 13, 2018

    9.9K

    Area of Science:

    • Computer Vision
    • Artificial Intelligence
    • Machine Learning

    Background:

    • Scene Text Recognition (STR) methods face challenges balancing accuracy and inference speed.
    • Auto-Regressive (AR) models offer high accuracy but slow processing.
    • Parallel Decoding (PD) models are fast but less accurate.

    Purpose of the Study:

    • To develop a novel STR method achieving AR-level accuracy and PD-level speed.
    • To introduce the Context Perception Parallel Decoder (CPPD) for enhanced STR.

    Main Methods:

    • CPPD integrates character counting and ordering modules for context perception.
    • These modules predict character occurrences, reading order, and positions.
    • An attention mechanism leverages this context for accurate character prediction.

    Main Results:

    • CPPD models demonstrate competitive accuracy with significantly faster inference than leading models.
    • Integrating CPPD modules into existing STR decoders improved their accuracy.
    • Experiments were conducted on both English and Chinese text recognition benchmarks.

    Conclusions:

    • The proposed CPPD effectively addresses the accuracy-speed trade-off in STR.
    • CPPD offers a promising approach for efficient and accurate scene text recognition.
    • The modular design allows for easy integration and improvement of existing STR systems.