Jove
Visualize
联系我们
JoVE
x logofacebook logolinkedin logoyoutube logo
关于 JoVE
概览领导团队博客JoVE 帮助中心
作者
出版流程编辑委员会范围与政策同行评审常见问题投稿
图书馆员
用户评价订阅访问资源图书馆顾问委员会常见问题
研究
JoVE JournalMethods CollectionsJoVE Encyclopedia of Experiments存档
教育
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab Manual教师资源中心教师网站
使用条款与条件
隐私政策
政策

相关概念视频

Depth Perception and Spatial Vision01:15

Depth Perception and Spatial Vision

631
Depth perception is the ability to perceive objects three-dimensionally. It relies on two types of cues: binocular and monocular. Binocular cues depend on the combination of images from both eyes and how the eyes work together. Since the eyes are in slightly different positions, each eye captures a slightly different image. This disparity between images, known as binocular disparity, helps the brain interpret depth. When the brain compares these images, it determines the distance to an object.
631
Vision01:24

Vision

53.1K
Vision is the result of light being detected and transduced into neural signals by the retina of the eye. This information is then further analyzed and interpreted by the brain. First, light enters the front of the eye and is focused by the cornea and lens onto the retina—a thin sheet of neural tissue lining the back of the eye. Because of refraction through the convex lens of the eye, images are projected onto the retina upside-down and reversed.
53.1K

您也可能阅读

相关文章

通过共同作者、期刊和引用图与本文相关的文章。

排序
Same author

Nested Grover's Algorithm for Tree Search.

Entropy (Basel, Switzerland)·2026
Same author

Multi-Season Genome-Wide Association Study Reveals Loci and Candidate Genes for Fruit Quality and Maturity Traits in Peach.

Plants (Basel, Switzerland)·2026
Same author

Multilevel Data Representation for Training Deep Helmholtz Machines.

Neural computation·2025
Same author

Quantum Machine Learning-Quo Vadis?

Entropy (Basel, Switzerland)·2024
Same author

Can a Hebbian-like learning rule be avoiding the curse of dimensionality in sparse distributed data?

Biological cybernetics·2024
Same author

Competitive learning to generate sparse representations for associative memory.

Neural networks : the official journal of the International Neural Network Society·2023
Same journal

A Model-Free Reinforcement Learning Implementation of Decision Making Under Uncertainty by Sequential Sampling.

Neural computation·2026
Same journal

DROP: Distributional and Regular Optimism and Pessimism for Reinforcement Learning.

Neural computation·2026
Same journal

Hierarchical Active Inference Using Successor Representations.

Neural computation·2026
Same journal

W-Kernel and Its Principal Space for Frequentist Evaluation of Bayesian Estimators.

Neural computation·2026
Same journal

A Hidden Markov Model-Inspired Sequence Classification Method for Hyperdimensional Computing.

Neural computation·2026
Same journal

Sparse Graphical Modeling for Electrophysiological Phase-Based Connectivity Using Circular Statistics.

Neural computation·2026
查看所有相关文章

相关实验视频

Updated: Jun 25, 2025

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment
08:25

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

9.0K

通过重新思考计算机视觉基准数据集,促进从像素级相关性向对象语义学习的转变.

Maria Osório1, Andreas Wichert2

  • 1Department of Computer Science and Engineering, INESC-ID and Instituto Superior Técnico, University of Lisbon, 2744-016 Porto Salvo, Portugal maria.osorio@tecnico.ulisboa.pt.

Neural computation
|May 22, 2024
PubMed
概括
此摘要是机器生成的。

卷积神经网络 (CNN) 在模式识别方面表现出色,但与人类视觉不同. 我们的研究表明,CNN优先考虑像素数据,而不是颜色,纹理和形状等核心特征,因此需要更有语义意识的模型.

更多相关视频

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

520
Author Spotlight: Insights into Visual Cortex Research Through Wide-View fMRI Mapping
07:11

Author Spotlight: Insights into Visual Cortex Research Through Wide-View fMRI Mapping

Published on: December 8, 2023

1.5K

相关实验视频

Last Updated: Jun 25, 2025

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment
08:25

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

9.0K
Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

520
Author Spotlight: Insights into Visual Cortex Research Through Wide-View fMRI Mapping
07:11

Author Spotlight: Insights into Visual Cortex Research Through Wide-View fMRI Mapping

Published on: December 8, 2023

1.5K

科学领域:

  • 计算机视觉 计算机视觉
  • 人工智能的人工智能
  • 机器学习 机器学习

背景情况:

  • 卷积神经网络 (CNN) 通过从原始像素数据中学习来实现图像识别的高精度.
  • CNNs的模式识别不同于人类的视觉感知,重点是统计相关性,而不是对象语义.
  • 了解这些差异对于开发更强大的和类似人类的人工智能视觉系统至关重要.

研究的目的:

  • 调查CNN视觉特征提取和人类感知之间的差异.
  • 为了确定CNN是否优先考虑像素级的相关性,而不是基本的视觉特征 (颜色,纹理,形状).
  • 强调需要人工智能模型来学习对象语义,以改善视觉理解.

主要方法:

  • 将核心视觉特征 (颜色,纹理,形状) 分离并单独输入到神经网络中.
  • 在不同的基准数据集上进行实验:水果360,CIFAR-10和时尚MNIST.
  • 评估CNN在不同分布的数据集上的表现 (CIFAR-10与斯坦福犬),以评估概括性.

主要成果:

  • 分类准确性因数据集而异,表明CNN学习数据集特定的像素相关性.
  • 当训练和测试数据分布相似时,CNN表现良好,但在分布转移方面遇到了困难.
  • 与CIFAR-10相比,尽管视觉内容相似,但CNN在斯坦福犬的图像上表现不佳,这凸显了语义理解的缺乏.

结论:

  • 深度学习模型,特别是CNN,倾向于学习表面的像素级模式,而不是对人类认知至关重要的基本视觉特征.
  • 美国有线电视新闻网 (CNN) 的表现与人类视觉感知之间的差异凸显了当前人工智能模型掌握对象语义能力的局限性.
  • 专注于语义理解的专用数据集和模型的开发对于推进计算机视觉研究向人类类认知发展至关重要.