Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Perceptual Constancy01:12

Perceptual Constancy

683
Perceptual constancy is the ability to recognize that objects remain consistent and unchanged even when their appearance varies due to changes in sensory input. There are four main types of perceptual constancy: size constancy, shape constancy, color constancy, and brightness constancy.
Size constancy is the recognition that an object remains the same size, even when its image on the retina changes. For instance, a bus is perceived to be large enough to carry people, even if it looks tiny from...
683
Associative Learning01:27

Associative Learning

650
Associative learning is a fundamental concept in behavioral psychology, wherein a connection is established between two stimuli or events, leading to a learned response. This process is critical in understanding how behaviors are acquired and modified. Conditioning, the mechanism through which associations are formed, can be divided into two main types: classical conditioning and operant conditioning, each elucidating different aspects of associative learning.
Classical conditioning, also known...
650
Depth Perception and Spatial Vision01:15

Depth Perception and Spatial Vision

1.1K
Depth perception is the ability to perceive objects three-dimensionally. It relies on two types of cues: binocular and monocular. Binocular cues depend on the combination of images from both eyes and how the eyes work together. Since the eyes are in slightly different positions, each eye captures a slightly different image. This disparity between images, known as binocular disparity, helps the brain interpret depth. When the brain compares these images, it determines the distance to an object.
1.1K
Multi-input and Multi-variable systems01:22

Multi-input and Multi-variable systems

195
Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence...
195

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Environment and reproductive health in China: challenges and opportunities.

Environmental health perspectives·2012
Same author

Posttransplant mortality risk assessment for adult-to-adult right-lobe living donor liver recipients with benign end-stage liver disease.

Scandinavian journal of gastroenterology·2012
Same author

Sodium nitrite protects against kidney injury induced by brain death and improves post-transplant function.

Kidney international·2012
Same author

OIC-A006-loaded true bone ceramic heals rabbit critical-sized segmental radial defect.

Die Pharmazie·2012
Same author

Liquid chromatography-mass spectrometric multiple reaction monitoring-based strategies for expanding targeted profiling towards quantitative metabolomics.

Current drug metabolism·2012
Same author

Structural and functional characterization of mature forms of metalloprotease E495 from Arctic sea-ice bacterium Pseudoalteromonas sp. SM495.

PloS one·2012
Same journal

Hidden Data Recovery and Forecasting via Next-Generation Reservoir Computing With Multiscale Delay Selection.

IEEE transactions on neural networks and learning systems·2026
Same journal

CAFF-CIL: Causality-Aware Freedom Forgetting Approach for Class-Incremental Learning.

IEEE transactions on neural networks and learning systems·2026
Same journal

Harmonic Autoencoding Framework for Multiple Tasks in Magnetic Particle Imaging Reconstruction.

IEEE transactions on neural networks and learning systems·2026
Same journal

A Survey on Human-Centric Voice-Face Multimodal Learning.

IEEE transactions on neural networks and learning systems·2026
Same journal

Vision-Assisted Foundation Model for Solving Multitask Vehicle Routing Problems.

IEEE transactions on neural networks and learning systems·2026
Same journal

FP3O: Enabling Proximal Policy Optimization in Multiagent Cooperation With Parameter-Sharing Versatility.

IEEE transactions on neural networks and learning systems·2026
See all related articles

Related Experiment Video

Updated: Oct 9, 2025

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

677

Multiscale Visual-Attribute Co-Attention for Zero-Shot Image Recognition.

Hao Zhang, Long Tian, Zhengjue Wang

    IEEE Transactions on Neural Networks and Learning Systems
    |December 17, 2021
    PubMed
    Summary
    This summary is machine-generated.

    This study introduces a multi-scale visual-attribute co-attention (mVACA) model for zero-shot image recognition. mVACA improves classification of unseen classes by analyzing features at multiple scales, outperforming existing methods.

    More Related Videos

    Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique
    04:48

    Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

    Published on: July 5, 2024

    548
    A Methodology for Capturing Joint Visual Attention Using Mobile Eye-Trackers
    12:39

    A Methodology for Capturing Joint Visual Attention Using Mobile Eye-Trackers

    Published on: January 18, 2020

    7.8K

    Related Experiment Videos

    Last Updated: Oct 9, 2025

    Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
    03:31

    Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

    Published on: December 15, 2023

    677
    Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique
    04:48

    Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

    Published on: July 5, 2024

    548
    A Methodology for Capturing Joint Visual Attention Using Mobile Eye-Trackers
    12:39

    A Methodology for Capturing Joint Visual Attention Using Mobile Eye-Trackers

    Published on: January 18, 2020

    7.8K

    Area of Science:

    • Computer Science
    • Artificial Intelligence
    • Machine Learning

    Background:

    • Zero-shot image recognition (ZSR) aims to classify data from unseen classes by linking visual features with semantic representations.
    • Current ZSR methods often use a single-scale embedding space, overlooking the varying semantics present in different visual feature scales.

    Purpose of the Study:

    • To propose a novel multi-scale visual-attribute co-attention (mVACA) model for enhanced zero-shot image recognition.
    • To address limitations of single-scale approaches by incorporating multi-scale visual semantics and improving visual discrimination.

    Main Methods:

    • The mVACA model employs a hybrid visual attention mechanism at each scale, combining attribute-related attention and visual self-attention.
    • Attribute-related attention is guided by pseudo-attribute vectors derived from mutual information regularization (MIR).
    • Visual self-attention refines attribute attention, emphasizing visually relevant attributes and leveraging multi-scale visual discrimination.

    Main Results:

    • The mVACA model demonstrates state-of-the-art or competitive performance on standard benchmarks for both zero-shot learning (ZSL) and generalized ZSL (GZSL) tasks.
    • The framework effectively unifies standard ZSL and GZSL by utilizing multi-scale visual discrimination.

    Conclusions:

    • The proposed mVACA model offers a significant advancement in zero-shot image recognition by effectively integrating multi-scale visual and semantic information.
    • The model's ability to handle both ZSL and GZSL tasks, coupled with visualized analysis of image-attribute interactions, provides valuable insights into ZSR mechanisms.