Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Visual System01:26

Visual System

795
Light enters the eye through the cornea, a transparent, dome-shaped surface covering the surface of the eyeball that helps to direct and focus incoming light. This light is then channeled toward the pupil, an adjustable opening whose size is controlled by the iris. The iris, a pigmented muscle, regulates the amount of light entering the eye by contracting or dilating the pupil, thereby ensuring optimal light levels for clear vision.
Once through the pupil, the light passes through the lens, a...
795
Vision01:24

Vision

55.8K
Vision is the result of light being detected and transduced into neural signals by the retina of the eye. This information is then further analyzed and interpreted by the brain. First, light enters the front of the eye and is focused by the cornea and lens onto the retina—a thin sheet of neural tissue lining the back of the eye. Because of refraction through the convex lens of the eye, images are projected onto the retina upside-down and reversed.
55.8K
Parallel Processing01:20

Parallel Processing

331
The brain processes sensory information rapidly due to parallel processing, which involves sending data across multiple neural pathways at the same time. This method allows the brain to manage various sensory qualities, such as shapes, colors, movements, and locations, all concurrently. For instance, when observing a forest landscape, the brain simultaneously processes the movement of leaves, the shapes of trees, the depth between them, and the various shades of green. This enables a quick and...
331
Visual Agnosia01:12

Visual Agnosia

425
Visual agnosia is a condition characterized by the inability to recognize visually presented objects despite having normal vision. For instance, a person with visual agnosia can describe the shape and color of an object but cannot identify or name it. This impairment does not affect their visual field, acuity, color vision, brightness discrimination, language, or memory. An example of this condition in a social setting is someone at a dinner party asking for "that silver thing with a round...
425
Motor and Sensory Areas of the Cortex01:14

Motor and Sensory Areas of the Cortex

5.0K
The cerebral cortex, the brain's outermost layer, is pivotal in processing complex cognitive tasks, emotions, and various sensory inputs and executing voluntary motor activities. This intricate structure is divided into three primary functional areas: the motor areas, sensory areas, and association areas.
Motor Areas
The motor areas located in the frontal lobe are central to controlling voluntary movements. This region is further subdivided into the primary motor cortex and the premotor cortex....
5.0K
Cognitivism01:17

Cognitivism

2.3K
Cognitive psychology emerged as a significant field in the mid-20th century. It focused on understanding humans' internal mental processes. This approach emphasizes how people perceive, remember, think, and solve problems—elements critical to human cognition.
Previously dominated by behaviorism, which prioritized observable behaviors and largely ignored mental processes, psychology transformed in the 1950s. Cognitive psychologists argue that understanding how we think and process...
2.3K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Depression as a potential early indicator of pancreatic cancer: a systematic review of prevalence, mechanisms, and temporal associations.

Journal of psychosocial oncology·2026
Same author

A novel CYP2E1 inhibitor, 4-Methyl-5-Acetylthiazole, mitigates LPS-induced acute lung injury by suppressing macrophage-mediated inflammation and preserving mitochondria.

Free radical biology & medicine·2026
Same author

The Inter-Eye Differences of the Circumpapillary Microvasculature in Primary Open-Angle Glaucoma.

Translational vision science & technology·2026
Same author

Retinal nerve fibre layer thickness reflects characteristics of brain grey and white matter.

Imaging neuroscience (Cambridge, Mass.)·2026
Same author

Reconstructing the ischemic osteogenic microenvironment through hierarchical scaffolds orchestrating Mg<sup>2+</sup> signaling and neuropilin-1-mediated angiogenesis.

Bioactive materials·2026
Same author

Hospital Fall Prevention Practices and Implementation Strategies: A Multisite Observational Study.

Journal of patient safety·2026
See all related articles

Related Experiment Video

Updated: Oct 5, 2025

Methods to Test Visual Attention Online
09:44

Methods to Test Visual Attention Online

Published on: February 19, 2015

12.0K

Interpretable Visual Understanding with Cognitive Attention Network.

Xuejiao Tang1, Wenbin Zhang2, Yi Yu3

  • 1Leibniz University of Hannover, Germany.

Artificial Neural Networks, ICANN : International Conference ... Proceedings. International Conference on Artificial Neural Networks (European Neural Network Society)
|January 24, 2022
PubMed
Summary
This summary is machine-generated.

This study introduces a Cognitive Attention Network (CAN) for visual commonsense reasoning, enhancing image understanding beyond recognition. The novel approach effectively fuses image and text data for improved cognition-level insights.

More Related Videos

Measuring Attention and Visual Processing Speed by Model-based Analysis of Temporal-order Judgments
13:00

Measuring Attention and Visual Processing Speed by Model-based Analysis of Temporal-order Judgments

Published on: January 23, 2017

10.0K
A Methodology for Capturing Joint Visual Attention Using Mobile Eye-Trackers
12:39

A Methodology for Capturing Joint Visual Attention Using Mobile Eye-Trackers

Published on: January 18, 2020

7.8K

Related Experiment Videos

Last Updated: Oct 5, 2025

Methods to Test Visual Attention Online
09:44

Methods to Test Visual Attention Online

Published on: February 19, 2015

12.0K
Measuring Attention and Visual Processing Speed by Model-based Analysis of Temporal-order Judgments
13:00

Measuring Attention and Visual Processing Speed by Model-based Analysis of Temporal-order Judgments

Published on: January 23, 2017

10.0K
A Methodology for Capturing Joint Visual Attention Using Mobile Eye-Trackers
12:39

A Methodology for Capturing Joint Visual Attention Using Mobile Eye-Trackers

Published on: January 18, 2020

7.8K

Area of Science:

  • Artificial Intelligence
  • Computer Vision
  • Natural Language Processing

Background:

  • Current image understanding excels at recognition but lacks deeper cognitive abilities.
  • Reliable visual scene understanding necessitates both recognition and cognition levels.
  • Integrating multi-source information and commonsense knowledge is crucial for advanced visual understanding.

Purpose of the Study:

  • To propose a novel Cognitive Attention Network (CAN) for interpretable visual understanding.
  • To enhance visual commonsense reasoning capabilities.
  • To effectively fuse multi-modal information for comprehensive scene comprehension.

Main Methods:

  • Developed a Cognitive Attention Network (CAN) architecture.
  • Introduced an image-text fusion module for integrated information processing.
  • Designed a novel inference module to encode commonsense reasoning across image, query, and response.

Main Results:

  • Demonstrated the effectiveness of the CAN approach on the Visual Commonsense Reasoning (VCR) benchmark dataset.
  • Achieved significant improvements in visual commonsense reasoning tasks.
  • The proposed method provides interpretable visual understanding.

Conclusions:

  • The Cognitive Attention Network (CAN) advances visual commonsense reasoning.
  • Effective fusion of image and text data is key to cognitive-level visual understanding.
  • The approach offers a promising direction for interpretable AI in complex visual tasks.