Search research articles

关于 JoVE

概览领导团队博客 JoVE 帮助中心

作者

出版流程编辑委员会范围与政策同行评审常见问题投稿

图书馆员

用户评价订阅访问资源图书馆顾问委员会常见问题

研究

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments 存档

教育

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual 教师资源中心教师网站

使用条款与条件

相关概念视频

Visual System

Visual System

Light enters the eye through the cornea, a transparent, dome-shaped surface covering the surface of the eyeball that helps to direct and focus incoming light. This light is then channeled toward the pupil, an adjustable opening whose size is controlled by the iris. The iris, a pigmented muscle, regulates the amount of light entering the eye by contracting or dilating the pupil, thereby ensuring optimal light levels for clear vision.
Once through the pupil, the light passes through the lens, a...

Vision

Vision

Vision is the result of light being detected and transduced into neural signals by the retina of the eye. This information is then further analyzed and interpreted by the brain. First, light enters the front of the eye and is focused by the cornea and lens onto the retina—a thin sheet of neural tissue lining the back of the eye. Because of refraction through the convex lens of the eye, images are projected onto the retina upside-down and reversed.

Color Vision

Color Vision

Color perception begins in the retina, the light-sensitive layer at the back of the eye. Two main theories explain how colors are seen: the trichromatic theory and the opponent-process theory. The trichromatic theory, proposed by Thomas Young in 1802 and extended by Hermann von Helmholtz in 1852, suggests that color vision is based on three types of cone receptors in the retina. These cones are sensitive to different but overlapping ranges of wavelengths corresponding to red, blue, and green.

Anatomy of the Eyeball

Anatomy of the Eyeball

The eye is a spherical, hollow structure composed of three tissue layers. The outer layer — the fibrous tunic, comprises the sclera — a white structure — and the cornea, which is transparent. The sclera encompasses some of the ocular surface, most of which is not visible. However, the 'white of the eye' is distinctively visible in humans compared to other species. The cornea, a clear covering at the front of the eye, enables light penetration. The eye's middle...

Parallel Processing

Parallel Processing

The brain processes sensory information rapidly due to parallel processing, which involves sending data across multiple neural pathways at the same time. This method allows the brain to manage various sensory qualities, such as shapes, colors, movements, and locations, all concurrently. For instance, when observing a forest landscape, the brain simultaneously processes the movement of leaves, the shapes of trees, the depth between them, and the various shades of green. This enables a quick and...

The Retina

The Retina

The retina is a layer of nervous tissue at the back of the eye that transduces light into neural signals. This process, called phototransduction, is carried out by rod and cone photoreceptor cells in the back of the retina.

您也可能阅读

相关文章

通过共同作者、期刊和引用图与本文相关的文章。

排序

Same author

Sensing the Action: Rethinking Sensor Modalities and Multi-Modal Fusion in Vision-Language-Action Models for Robotic Manipulation.

Sensors (Basel, Switzerland)·2026

Same author

Baseline Clinical and Neuropsychological Characteristics of Amyloid PET-Confirmed Alzheimer's Disease Treated With Lecanemab: Early Experience at a Tertiary Hospital in Korea.

Dementia and neurocognitive disorders·2026

Same author

Demographic, Social, and Clinical Profiles of Patients Initiating Lecanemab in Clinical Practice: A Single-Center Experience in Korea.

Dementia and neurocognitive disorders·2026

Same author

Cell flocculation and phase-separation support macro-scale tissue slab construction in a scaffold-free manner.

Materials today. Bio·2026

Same author

Keyword-Conditioned Image Segmentation via the Cross-Attentive Alignment of Language and Vision Sensor Data.

Sensors (Basel, Switzerland)·2025

Same author

Scene Graph and Natural Language-Based Semantic Image Retrieval Using Vision Sensor Data.

Sensors (Basel, Switzerland)·2025

Same journal

RETRACTED: Zhang et al. A Novel Framework for Reconstruction and Imaging of Target Scattering Centers via Wide-Angle Incidence in Radar Networks. <i>Sensors</i> 2025, <i>25</i>, 6802.

Sensors (Basel, Switzerland)·2026

Same journal

Enhancing Unsupervised Multi-Source Domain Adaptation for Person Re-Identification via Mixture of Experts and Graph-Based Relation.

Sensors (Basel, Switzerland)·2026

Same journal

Development of an Instrumented Glove for Palmar Pressure Assessment in Kayakers.

Sensors (Basel, Switzerland)·2026

Same journal

Development and Experimental Validation of an Autonomous IoT-Based Monitoring System for Real-Time Water Quality Assessment in the Amazon River.

Sensors (Basel, Switzerland)·2026

Same journal

Semi-Supervised Adversarial Learning Framework for Controller Area Network Bus Intrusion Detection.

Sensors (Basel, Switzerland)·2026

Same journal

Smart Optimization Method for Safety Signs in Innovative Manufacturing Environments Integrating Industrial Field IoT Sensors and Knowledge Graphs.

Sensors (Basel, Switzerland)·2026

查看所有相关文章

Search research articles

相关实验视频

Updated: Jul 2, 2025

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

Published on: April 21, 2023

重新思考视觉转换器中的注意力机制与图形结构.

Hyeongjin Kim¹, Byoung Chul Ko¹

¹Department of Computer Engineering, Keimyung University, Daegu 42601, Republic of Korea.

Sensors (Basel, Switzerland)

|February 24, 2024

概括

此摘要是机器生成的。

本研究介绍了图形头部注意力视觉转换器 (GHA-ViT),通过维护本地和全球补丁信息来改进图像分析. 与标准视觉变压器相比,GHA-ViT提高了性能并降低了参数.

关键词:

图表注意力网络图表注意力网络图表头部的注意力注意力.轻量级的模型轻量级的模型.多头注意力多头注意力视觉变压器视觉变压器

更多相关视频

How to Build a Dichoptic Presentation System That Includes an Eye Tracker

How to Build a Dichoptic Presentation System That Includes an Eye Tracker

Published on: September 6, 2017

Visualizing Visual Adaptation

Visualizing Visual Adaptation

Published on: April 24, 2017

相关实验视频

Last Updated: Jul 2, 2025

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

Published on: April 21, 2023

How to Build a Dichoptic Presentation System That Includes an Eye Tracker

How to Build a Dichoptic Presentation System That Includes an Eye Tracker

Published on: September 6, 2017

Visualizing Visual Adaptation

Visualizing Visual Adaptation

Published on: April 24, 2017

科学领域:

计算机视觉计算机视觉
机器学习机器学习
人工智能的人工智能

背景情况:

标准视觉变压器 (ViT) 使用多头注意力 (MHA),这是参数密集型的,可以损害图像局部性.
需要更高效和有效的ViT架构来保护空间信息.

研究的目的:

提出一种新的视觉变压器架构,GHA-ViT,结合图形头部注意力 (GHA).
提高ViT的性能,同时降低计算复杂性和参数数量.

主要方法:

在标准ViT中取代了多头注意力 (MHA) 机制,采用了新的图形头注意力 (GHA).
将图形结构应用于变压器的注意力头,以更好地捕捉图像补丁中的关系.
在各种数据集上评估了GHA-ViT,包括CIFAR-10/100,MNIST,MNIST-F和ImageNet-1K.

主要成果:

在多个数据集中,GHA-ViT表现出了比纯ViT模型更高的性能.
在ImageNet-1K上使用GHA-B模型 (约1.7%) 实现了81.7%的Top-1精度. 29M参数). 这就是为什么.
与现有的ViT相比,在CIFAR-10/100上显著减少参数 (17倍) 和提高性能 (0.4%/4.3%).

结论:

拟议的GHA-ViT有效地保持了图像补丁的本地性和全球性,确保了注意力的多样性.
GHA-ViT为当前最先进的ViT模型提供了一个有希望的轻量级替代方案,平衡精度,参数数量和计算操作.