Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Association Areas of the Cortex01:21

Association Areas of the Cortex

5.2K
Association areas are regions of the cerebral cortex that do not have a specific sensory or motor function. Instead, they integrate and interpret information from various sources to enable higher cognitive processes such as memory, learning, and decision-making. Some key association areas include the following:
Prefrontal Association Area: This area is located in the frontal lobe and is involved in planning, decision-making, and moderating social behavior. It connects with primary motor areas,...
5.2K
Visual System01:26

Visual System

557
Light enters the eye through the cornea, a transparent, dome-shaped surface covering the surface of the eyeball that helps to direct and focus incoming light. This light is then channeled toward the pupil, an adjustable opening whose size is controlled by the iris. The iris, a pigmented muscle, regulates the amount of light entering the eye by contracting or dilating the pupil, thereby ensuring optimal light levels for clear vision.
Once through the pupil, the light passes through the lens, a...
557
Attribution Theory00:56

Attribution Theory

12.9K
Behavior is a product of both the situation (e.g., cultural influences, social roles, and the presence of bystanders) and of the person (e.g., personality characteristics). Subfields of psychology tend to focus on one influence or behavior over others. Situationism is the view that our behavior and actions are determined by our immediate environment and surroundings. In contrast, dispositionism holds that our behavior is determined by internal factors (Heider, 1958).
12.9K
Vision01:24

Vision

53.1K
Vision is the result of light being detected and transduced into neural signals by the retina of the eye. This information is then further analyzed and interpreted by the brain. First, light enters the front of the eye and is focused by the cornea and lens onto the retina—a thin sheet of neural tissue lining the back of the eye. Because of refraction through the convex lens of the eye, images are projected onto the retina upside-down and reversed.
53.1K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

A Quantum Self-Attention Neural Network Model on Quantum Circuits.

IEEE transactions on cybernetics·2026
Same author

Construction of phosphorus and cobalt co-modified tubular carbon nitride with dual reaction sites for boosted imidacloprid degradation.

Journal of colloid and interface science·2026
Same author

Latent profile analysis of digital health literacy among community-dwelling older adults and its influencing factors.

Digital health·2026
Same author

Four-dimensional left ventricular motion clustering reveals cardiovascular phenotypes at population scale.

Scientific reports·2026
Same author

Clip Combined With Rubber Band vs Clip-Assisted Endoscopic Retrograde Cholangiopancreatography for Difficult Biliary Cannulation in Periampullary Diverticulum: A Propensity Score-Matched Analysis.

Clinical and translational gastroenterology·2026
Same author

Novel Traction-Assisted Endoscopic Resection for Superficial Non-ampullary Duodenal Epithelial Tumors: 15-Year Experience from a Large Tertiary Center.

Digestive diseases and sciences·2026
Same journal

Relation DETR+: Exploring Explicit Position Relation Prior for Dense Prediction.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

RBF++: Quantifying and Optimizing Reasoning Boundaries across Measurable and Unmeasurable Capabilities for Chain-of-Thought Reasoning.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

CAFE: Cross-View Adaptive Fusion and Cluster Center Enhancement for Robust Multi-View Clustering.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

DIVER: Reinforced Diffusion Breaks Imitation Bottlenecks in End-to-End Autonomous Driving.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Ethics-Aware Safe Reinforcement Learning for Rare-Event Risk Control in Interactive Urban Driving.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Learning Shape Anchors for Holistic Indoor Scene Understanding.

IEEE transactions on pattern analysis and machine intelligence·2026
See all related articles

Related Experiment Video

Updated: Jun 13, 2025

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment
08:25

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

8.9K

Learning From Human Attention for Attribute-Assisted Visual Recognition.

Xiao Bai, Pengcheng Zhang, Xiaohan Yu

    IEEE Transactions on Pattern Analysis and Machine Intelligence
    |September 11, 2024
    PubMed
    Summary
    This summary is machine-generated.

    This study introduces an Attribute Attention Network (A²Net) that learns from human gaze data to improve zero-shot learning (ZSL) and fine-grained visual classification (FGVC). By aligning AI attention with human attention, the model enhances object recognition accuracy.

    More Related Videos

    Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention
    06:37

    Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention

    Published on: December 15, 2023

    2.6K
    A Methodology for Capturing Joint Visual Attention Using Mobile Eye-Trackers
    12:39

    A Methodology for Capturing Joint Visual Attention Using Mobile Eye-Trackers

    Published on: January 18, 2020

    7.6K

    Related Experiment Videos

    Last Updated: Jun 13, 2025

    Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment
    08:25

    Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

    Published on: May 7, 2019

    8.9K
    Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention
    06:37

    Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention

    Published on: December 15, 2023

    2.6K
    A Methodology for Capturing Joint Visual Attention Using Mobile Eye-Trackers
    12:39

    A Methodology for Capturing Joint Visual Attention Using Mobile Eye-Trackers

    Published on: January 18, 2020

    7.6K

    Area of Science:

    • Computer Vision
    • Machine Learning
    • Cognitive Science

    Background:

    • Human object recognition relies on local attributes, crucial for zero-shot learning (ZSL) and fine-grained visual classification (FGVC).
    • Attention mechanisms in neural networks learn discriminative attributes but often neglect localization and alignment with human attention.
    • Existing methods focus on region embeddings, overlooking the importance of precise attribute localization.

    Purpose of the Study:

    • To develop a novel approach for visual recognition by integrating real human gaze data into neural networks.
    • To propose a unified Attribute Attention Network (A²Net) for both ZSL and FGVC tasks that learns from human attention.
    • To investigate whether learned attention in AI models truly mimics human visual attention.

    Main Methods:

    • Designed a unified Attribute Attention Network (A²Net) with an attribute attention branch and a baseline classification network.
    • Utilized attribute prototypes to generate attribute attention maps and features, aligning them with human gaze data.
    • Collected real human gaze data using an eye-tracker on a bird classification game with the CUB dataset.
    • Aligned extracted attribute features with attribute-defined class embeddings for enhanced learning.

    Main Results:

    • The A²Net model demonstrated improved accuracy in ZSL and FGVC tasks when trained with human gaze data.
    • Experiments validated the effectiveness of learning from human attention for visual recognition.
    • The study confirmed the benefits of collecting human gaze datasets for AI model development.

    Conclusions:

    • Integrating real human gaze data significantly enhances the performance of visual recognition models like A²Net.
    • The proposed A²Net effectively learns from human attention, improving attribute localization and recognition.
    • This research highlights the value of human gaze data and gaze estimation algorithms for advancing high-level computer vision tasks.