Search research articles

关于 JoVE

概览领导团队博客 JoVE 帮助中心

作者

出版流程编辑委员会范围与政策同行评审常见问题投稿

图书馆员

用户评价订阅访问资源图书馆顾问委员会常见问题

研究

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments 存档

教育

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual 教师资源中心教师网站

使用条款与条件

相关概念视频

Parallel Processing

Parallel Processing

The brain processes sensory information rapidly due to parallel processing, which involves sending data across multiple neural pathways at the same time. This method allows the brain to manage various sensory qualities, such as shapes, colors, movements, and locations, all concurrently. For instance, when observing a forest landscape, the brain simultaneously processes the movement of leaves, the shapes of trees, the depth between them, and the various shades of green. This enables a quick and...

Visual System

Visual System

Light enters the eye through the cornea, a transparent, dome-shaped surface covering the surface of the eyeball that helps to direct and focus incoming light. This light is then channeled toward the pupil, an adjustable opening whose size is controlled by the iris. The iris, a pigmented muscle, regulates the amount of light entering the eye by contracting or dilating the pupil, thereby ensuring optimal light levels for clear vision.
Once through the pupil, the light passes through the lens, a...

Depth Perception and Spatial Vision

Depth Perception and Spatial Vision

Depth perception is the ability to perceive objects three-dimensionally. It relies on two types of cues: binocular and monocular. Binocular cues depend on the combination of images from both eyes and how the eyes work together. Since the eyes are in slightly different positions, each eye captures a slightly different image. This disparity between images, known as binocular disparity, helps the brain interpret depth. When the brain compares these images, it determines the distance to an object.

Vision

Vision

Vision is the result of light being detected and transduced into neural signals by the retina of the eye. This information is then further analyzed and interpreted by the brain. First, light enters the front of the eye and is focused by the cornea and lens onto the retina—a thin sheet of neural tissue lining the back of the eye. Because of refraction through the convex lens of the eye, images are projected onto the retina upside-down and reversed.

Perception

Perception

Perception is a fundamental psychological process that enables individuals to organize, interpret, and consciously experience sensory information. This process is crucial for understanding and interacting with the world around us. It includes both bottom-up and top-down processing, each playing a distinct role in how we perceive our environment.
Bottom-up processing begins at the sensory level, where receptors detect external environmental stimuli. These could include the tactile sensation of...

Gestalt Principles of Perception

Gestalt Principles of Perception

Gestalt principles provide a framework for understanding how humans perceive objects as unified wholes within their context. These principles are essential in explaining the cognitive processes that make sense of complex visual stimuli by organizing them into coherent groups. One fundamental principle is proximity, which posits that objects located close to each other are perceived as a collective group. For instance, when dots are positioned near one another, the visual system interprets them...

您也可能阅读

相关文章

通过共同作者、期刊和引用图与本文相关的文章。

排序

Same authorSame journal

DIVER: Reinforced Diffusion Breaks Imitation Bottlenecks in End-to-End Autonomous Driving.

IEEE transactions on pattern analysis and machine intelligence·2026

Same author

Event-Aware Instructed Assistant for Referring Video Segmentation.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same author

Network Pharmacology and Experimental Verification to Explore Cinnamomi Cortex against Steroid-induced Osteonecrosis of the Femoral Head.

Journal of visualized experiments : JoVE·2026

Same author

A modified Rim APHE criterion to improve LI-RADS diagnostic performance for primary liver malignancies.

Abdominal radiology (New York)·2026

Same author

Three-dimensional image hierarchical encryption method based on structured light holography and chained iris keys.

Applied optics·2026

Same author

Optical image encryption using multimodal biometric keys under the framework of phase-shifting digital holography.

Applied optics·2026

Same journal

Relation DETR+: Exploring Explicit Position Relation Prior for Dense Prediction.

IEEE transactions on pattern analysis and machine intelligence·2026

Same journal

RBF++: Quantifying and Optimizing Reasoning Boundaries across Measurable and Unmeasurable Capabilities for Chain-of-Thought Reasoning.

IEEE transactions on pattern analysis and machine intelligence·2026

Same journal

CAFE: Cross-View Adaptive Fusion and Cluster Center Enhancement for Robust Multi-View Clustering.

IEEE transactions on pattern analysis and machine intelligence·2026

Same journal

Ethics-Aware Safe Reinforcement Learning for Rare-Event Risk Control in Interactive Urban Driving.

IEEE transactions on pattern analysis and machine intelligence·2026

Same journal

Learning Shape Anchors for Holistic Indoor Scene Understanding.

IEEE transactions on pattern analysis and machine intelligence·2026

查看所有相关文章

Search research articles

相关实验视频

Updated: May 24, 2025

Eye Tracking During Visually Situated Language Comprehension: Flexibility and Limitations in Uncovering Visual Context Effects

Eye Tracking During Visually Situated Language Comprehension: Flexibility and Limitations in Uncovering Visual Context Effects

Published on: November 30, 2018

语境感知并行解码器用于场景文本识别.

Yongkun Du, Zhineng Chen, Caiyan Jia

IEEE transactions on pattern analysis and machine intelligence

|March 3, 2025

概括

此摘要是机器生成的。

本研究介绍了用于场景文本识别 (STR) 的上下文感知并行解码器 (CPPD). CPPD实现了高精度和快速推断速度,在英语和中文文本识别任务中表现优于现有方法.

更多相关视频

Using the Visual World Paradigm to Study Sentence Comprehension in Mandarin-Speaking Children with Autism

Using the Visual World Paradigm to Study Sentence Comprehension in Mandarin-Speaking Children with Autism

Published on: October 3, 2018

Using Eye Movements Recorded in the Visual World Paradigm to Explore the Online Processing of Spoken Language

Using Eye Movements Recorded in the Visual World Paradigm to Explore the Online Processing of Spoken Language

Published on: October 13, 2018

相关实验视频

Last Updated: May 24, 2025

Eye Tracking During Visually Situated Language Comprehension: Flexibility and Limitations in Uncovering Visual Context Effects

Eye Tracking During Visually Situated Language Comprehension: Flexibility and Limitations in Uncovering Visual Context Effects

Published on: November 30, 2018

Using the Visual World Paradigm to Study Sentence Comprehension in Mandarin-Speaking Children with Autism

Using the Visual World Paradigm to Study Sentence Comprehension in Mandarin-Speaking Children with Autism

Published on: October 3, 2018

Using Eye Movements Recorded in the Visual World Paradigm to Explore the Online Processing of Spoken Language

Using Eye Movements Recorded in the Visual World Paradigm to Explore the Online Processing of Spoken Language

Published on: October 13, 2018

科学领域:

计算机视觉计算机视觉
人工智能的人工智能
机器学习机器学习

背景情况:

场景文本识别 (STR) 方法面临着平衡准确性和推断速度的挑战.
自动回归 (AR) 模型提供高精度但处理速度缓慢.
平行解码 (PD) 模型是快速的,但不那么准确.

研究的目的:

开发一种新的STR方法,实现AR级准确度和PD级速度.
引入语境感知并行解码器 (CPPD) 进行增强的STR.

主要方法:

CPPD集成了字符计数和排序模块,用于上下文感知.
这些模块可以预测字符的出现,阅读顺序和位置.
一个注意力机制利用这种上下文来准确预测角色.

主要成果:

CPPD模型表现出具有竞争力的准确性,与领先模型相比,推断速度明显更快.
将CPPD模块集成到现有的STR解码器中,提高了它们的准确性.
对英语和中文文本识别基准进行了实验.

结论:

拟议的CPPD有效地解决了STR.中的准确性-速度权衡问题.
CPPD为高效准确的场景文本识别提供了一个有前途的方法.
模块化设计允许轻松集成和改进现有的STR系统.