Jove
Visualize
联系我们
JoVE
x logofacebook logolinkedin logoyoutube logo
关于 JoVE
概览领导团队博客JoVE 帮助中心
作者
出版流程编辑委员会范围与政策同行评审常见问题投稿
图书馆员
用户评价订阅访问资源图书馆顾问委员会常见问题
研究
JoVE JournalMethods CollectionsJoVE Encyclopedia of Experiments存档
教育
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab Manual教师资源中心教师网站
使用条款与条件
隐私政策
政策

相关概念视频

Depth Perception and Spatial Vision01:15

Depth Perception and Spatial Vision

678
Depth perception is the ability to perceive objects three-dimensionally. It relies on two types of cues: binocular and monocular. Binocular cues depend on the combination of images from both eyes and how the eyes work together. Since the eyes are in slightly different positions, each eye captures a slightly different image. This disparity between images, known as binocular disparity, helps the brain interpret depth. When the brain compares these images, it determines the distance to an object.
678
Vision01:24

Vision

53.5K
Vision is the result of light being detected and transduced into neural signals by the retina of the eye. This information is then further analyzed and interpreted by the brain. First, light enters the front of the eye and is focused by the cornea and lens onto the retina—a thin sheet of neural tissue lining the back of the eye. Because of refraction through the convex lens of the eye, images are projected onto the retina upside-down and reversed.
53.5K
Parallel Processing01:20

Parallel Processing

157
The brain processes sensory information rapidly due to parallel processing, which involves sending data across multiple neural pathways at the same time. This method allows the brain to manage various sensory qualities, such as shapes, colors, movements, and locations, all concurrently. For instance, when observing a forest landscape, the brain simultaneously processes the movement of leaves, the shapes of trees, the depth between them, and the various shades of green. This enables a quick and...
157

您也可能阅读

相关文章

通过共同作者、期刊和引用图与本文相关的文章。

排序
Same author

Effects of dietary octapeptin supplementation on growth performance, intestinal morphology, immune function, and serum metabolism of weaned piglets.

Journal of animal science·2026
Same author

Synergistic anion-cation descriptor for bidirectional electrocatalyst in Li-CO<sub>2</sub> battery.

Science advances·2026
Same author

Correction: Hu et al. Dietary Net Energy Concentration Affects Growth Performance, Carcass Traits, Intramuscular Fatty Acid Profile, and Cecal Microbiota of Pigs with Restricted Feed Allowance. <i>Animals</i> 2025, <i>15</i>, 3514.

Animals : an open access journal from MDPI·2026
Same author

AniFeats: Animate 3D Feature Meshes for Character Video Generation.

IEEE transactions on visualization and computer graphics·2026
Same author

[Construction of a ubiquitination-related gene prognostic model for breast cancer based on taxane treatment response].

Zhong nan da xue xue bao. Yi xue ban = Journal of Central South University. Medical sciences·2026
Same author

Light-Induced Charge Order Mode in a Metastable Cuprate Ladder.

Physical review letters·2026
Same journal

Relation DETR+: Exploring Explicit Position Relation Prior for Dense Prediction.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

RBF++: Quantifying and Optimizing Reasoning Boundaries across Measurable and Unmeasurable Capabilities for Chain-of-Thought Reasoning.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

CAFE: Cross-View Adaptive Fusion and Cluster Center Enhancement for Robust Multi-View Clustering.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

DIVER: Reinforced Diffusion Breaks Imitation Bottlenecks in End-to-End Autonomous Driving.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Ethics-Aware Safe Reinforcement Learning for Rare-Event Risk Control in Interactive Urban Driving.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Learning Shape Anchors for Holistic Indoor Scene Understanding.

IEEE transactions on pattern analysis and machine intelligence·2026
查看所有相关文章

相关实验视频

Updated: Jul 12, 2025

Development of a Gaze-Contingent Display Framework Designed for Perceptual and Oculomotor Research with Simulated Central Vision Loss
07:12

Development of a Gaze-Contingent Display Framework Designed for Perceptual and Oculomotor Research with Simulated Central Vision Loss

Published on: April 11, 2025

402

视觉视觉的MetaFormer基线

Weihao Yu, Chenyang Si, Pan Zhou

    IEEE transactions on pattern analysis and machine intelligence
    |November 1, 2023
    PubMed
    概括
    此摘要是机器生成的。

    MetaFormer 架构通过基本的代币混合器实现了强的性能,证明了其坚实的基础. 即使是像身份映射或随机矩阵这样的简单混合器也会产生高精度,而像CAFormer这样的高级模型则会创下新的记录.

    更多相关视频

    Assessing Binocular Central Visual Field and Binocular Eye Movements in a Dichoptic Viewing Condition
    07:45

    Assessing Binocular Central Visual Field and Binocular Eye Movements in a Dichoptic Viewing Condition

    Published on: July 21, 2020

    4.5K
    Quantification of Oculomotor Responses and Accommodation Through Instrumentation and Analysis Toolboxes
    08:27

    Quantification of Oculomotor Responses and Accommodation Through Instrumentation and Analysis Toolboxes

    Published on: March 3, 2023

    982

    相关实验视频

    Last Updated: Jul 12, 2025

    Development of a Gaze-Contingent Display Framework Designed for Perceptual and Oculomotor Research with Simulated Central Vision Loss
    07:12

    Development of a Gaze-Contingent Display Framework Designed for Perceptual and Oculomotor Research with Simulated Central Vision Loss

    Published on: April 11, 2025

    402
    Assessing Binocular Central Visual Field and Binocular Eye Movements in a Dichoptic Viewing Condition
    07:45

    Assessing Binocular Central Visual Field and Binocular Eye Movements in a Dichoptic Viewing Condition

    Published on: July 21, 2020

    4.5K
    Quantification of Oculomotor Responses and Accommodation Through Instrumentation and Analysis Toolboxes
    08:27

    Quantification of Oculomotor Responses and Accommodation Through Instrumentation and Analysis Toolboxes

    Published on: March 3, 2023

    982

    科学领域:

    • 计算机视觉 计算机视觉
    • 深度学习架构 深度学习架构
    • 神经网络设计 神经网络设计

    背景情况:

    • 该MetaFormer架构,一个抽象的变压器,已经显示在实现竞争性表现的承诺.
    • 以前的研究集中在这些架构中的代币混合器设计上.
    • 这项研究将重点转移到MetaFormer框架本身的内在能力.

    研究的目的:

    • 探索MetaFormer架构的性能潜力,独立于复杂的令牌混合器设计.
    • 用基本和传统的令牌混合器验证MetaFormer的有效性.
    • 为了介绍和评估一个新的激活功能,StarReLU.

    主要方法:

    • 使用简单的令牌混合器开发了基线MetaFormer模型:身份映射 (IdentityFormer) 和随机矩阵 (RandFormer).
    • 实现了使用深度可分离卷积的ConvFormer,与ConvNeXt.进行比较.
    • 通过将深度可分离的卷积和自我注意力结合起来,创建了CAFormer.
    • 作为激活功能,引入并测试了Squared ReLU的变体StarReLU.

    主要成果:

    • IdentityFormer在ImageNet-1K上实现了超过80%的准确性,建立了一个坚实的性能基线.
    • RandFormer以超过81%的准确度超过了IdentityFormer,展示了MetaFormer与任意混合器的兼容性.
    • 在ConvNet-1K上,ConvFormer表现优于ConvNeXt,CAFormer在ImageNet-1K上实现了85.5%准确度的新纪录.
    • 与GELU相比,StarReLU减少了71%的激活FLOP,同时提高了性能.

    结论:

    • MetaFormer提供了一个强大的框架,可以确保高性能下界,即使使用最小的令牌混合器.
    • 该架构展示了多功能性,与多种甚至随机的代币混合器表现良好.
    • 当MetaFormer与StarReLU等传统或新型组件相结合时,可以有效地实现最先进的结果.