Learning From Human Attention for Attribute-Assisted Visual Recognition.

Xiao Bai, Pengcheng Zhang, Xiaohan Yu

IEEE Transactions on Pattern Analysis and Machine Intelligence

September 11, 2024

Summary

This summary is machine-generated.

This study introduces an Attribute Attention Network (A²Net) that learns from human gaze data to improve zero-shot learning (ZSL) and fine-grained visual classification (FGVC). By aligning AI attention with human attention, the model enhances object recognition accuracy.

Area of Science:

Computer Vision
Machine Learning
Cognitive Science

Background:

Human object recognition relies on local attributes, which are also crucial for zero-shot learning (ZSL) and fine-grained visual classification (FGVC).
Attention mechanisms in neural networks learn discriminative attributes but often neglect localization and alignment with human attention.
Existing methods focus on region embeddings, overlooking the importance of precise attribute localization.

Purpose of the Study:

To develop a novel approach for visual recognition by integrating real human gaze data into neural networks.
To propose a unified Attribute Attention Network (A²Net) for both ZSL and FGVC tasks that learns from human attention.
To investigate whether learned attention in AI models truly mimics human visual attention.

Main Methods:

Designed a unified Attribute Attention Network (A²Net) with an attribute attention branch and a baseline classification network.
Utilized attribute prototypes to generate attribute attention maps and features, aligning them with human gaze data.
Collected real human gaze data using an eye-tracker on a bird classification game with the CUB dataset.
Aligned extracted attribute features with attribute-defined class embeddings for enhanced learning.
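
The prototype-based attention and gaze alignment described above can be illustrated with a minimal sketch. This is not the authors' implementation: the summary does not say how attention maps are computed or how they are aligned with gaze, so the dot-product scoring, the spatial softmax, and the KL-divergence alignment loss below are all assumptions chosen for illustration. The function names (attribute_attention, attended_feature, gaze_alignment_loss) and the toy values are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attribute_attention(features, prototype):
    """Attention over spatial locations for one attribute.

    features: list of N location vectors, each of length D
    prototype: one attribute prototype vector of length D
    Assumption: similarity is a plain dot product, normalized by softmax.
    """
    scores = [sum(f * p for f, p in zip(loc, prototype)) for loc in features]
    return softmax(scores)


def attended_feature(features, attention):
    """Attention-weighted average of the location vectors."""
    dim = len(features[0])
    return [
        sum(a * loc[d] for a, loc in zip(attention, features))
        for d in range(dim)
    ]


def gaze_alignment_loss(attention, gaze, eps=1e-8):
    """KL(gaze || attention) between a gaze heatmap and an attention map.

    Assumption: KL divergence is used here only as one plausible
    alignment objective; the source does not name the loss.
    """
    total = sum(gaze)
    gaze_dist = [g / total for g in gaze]
    return sum(
        g * math.log((g + eps) / (a + eps))
        for g, a in zip(gaze_dist, attention)
        if g > 0
    )


# Toy 2x2 feature map, flattened to 4 locations with 2 channels each.
features = [[2.0, 0.0], [0.0, 1.0], [0.0, 0.0], [0.0, 2.0]]
prototype = [1.0, 0.0]  # hypothetical prototype for one attribute

attention = attribute_attention(features, prototype)
pooled = attended_feature(features, attention)

# Hypothetical gaze fixation counts per location.
gaze_on_attribute = [3.0, 1.0, 0.0, 0.0]
gaze_elsewhere = [0.0, 0.0, 0.0, 4.0]

loss_aligned = gaze_alignment_loss(attention, gaze_on_attribute)
loss_misaligned = gaze_alignment_loss(attention, gaze_elsewhere)
print(attention, pooled, loss_aligned, loss_misaligned)
```

In this toy setup the loss is lower when the gaze heatmap concentrates on the location the prototype responds to. Minimizing such a term alongside a classification loss is one way a network could be pushed toward human-like attention.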

Main Results:

The A²Net model demonstrated improved accuracy in ZSL and FGVC tasks when trained with human gaze data.
Experiments validated the effectiveness of learning from human attention for visual recognition.
The study confirmed the benefits of collecting human gaze datasets for AI model development.

Conclusions:

Integrating real human gaze data significantly enhances the performance of visual recognition models like A²Net.
The proposed A²Net effectively learns from human attention, improving attribute localization and recognition.
This research highlights the value of human gaze data and gaze estimation algorithms for advancing high-level computer vision tasks.