Updated: Oct 9, 2025

Multiscale Visual-Attribute Co-Attention for Zero-Shot Image Recognition.

Hao Zhang, Long Tian, Zhengjue Wang

IEEE Transactions on Neural Networks and Learning Systems

December 17, 2021

Summary

This summary is machine-generated.

This study introduces a multi-scale visual-attribute co-attention (mVACA) model for zero-shot image recognition. By analyzing visual features at multiple scales, mVACA improves classification of unseen classes and achieves state-of-the-art or competitive results on standard benchmarks.

Area of Science:

Computer Science
Artificial Intelligence
Machine Learning

Background:

Zero-shot image recognition (ZSR) aims to classify data from unseen classes by linking visual features with semantic representations.
Current ZSR methods often use a single-scale embedding space, overlooking the varying semantics present in different visual feature scales.
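
The general idea of linking visual features with semantic representations can be illustrated with a minimal single-scale sketch. It is not the method of this paper: the identity `projection`, the three toy attributes, and the class names in the comments are all hypothetical, chosen only to show how an image could be assigned to an unseen class by comparing its embedding with per-class attribute vectors.

```python
import numpy as np

def predict_class(visual_feature, projection, class_attributes):
    """Pick the class whose attribute vector is most similar to the image embedding."""
    # Map the visual feature into the attribute (semantic) space.
    embedded = projection @ visual_feature
    embedded = embedded / np.linalg.norm(embedded)
    # Normalize each class attribute vector so the dot product is a cosine similarity.
    attrs = class_attributes / np.linalg.norm(class_attributes, axis=1, keepdims=True)
    scores = attrs @ embedded
    return int(np.argmax(scores)), scores

# Hypothetical toy setup: 3 attributes (striped, spotted, four-legged).
projection = np.eye(3)                          # assumption: a learned mapping would go here
class_attributes = np.array([[1.0, 0.0, 1.0],   # unseen class 0, e.g. "zebra"
                             [0.0, 1.0, 1.0]])  # unseen class 1, e.g. "leopard"
visual_feature = np.array([0.9, 0.1, 0.8])      # assumption: features of a striped animal

label, scores = predict_class(visual_feature, projection, class_attributes)
print(label, scores)
```

In this sketch a single embedding space is used, which is exactly the single-scale setting the second background point describes as a limitation.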

Purpose of the Study:

To propose a novel multi-scale visual-attribute co-attention (mVACA) model for enhanced zero-shot image recognition.
To address limitations of single-scale approaches by incorporating multi-scale visual semantics and improving visual discrimination.

Main Methods:

The mVACA model employs a hybrid visual attention mechanism at each scale, combining attribute-related attention and visual self-attention.
Attribute-related attention is guided by pseudo-attribute vectors derived from mutual information regularization (MIR).
Visual self-attention refines attribute attention, emphasizing visually relevant attributes and leveraging multi-scale visual discrimination.
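
A rough sketch of how such a hybrid attention could be wired together at each scale is given below. It is a simplified illustration under several assumptions and not the authors' formulation: the pseudo-attribute vectors are random placeholders (MIR is not implemented), the refinement step is modeled as a matrix product of the two attention maps, and the scales are fused by simple averaging. The names `hybrid_attention`, `pseudo_attributes`, and `scales` are hypothetical.

```python
import numpy as np

def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def hybrid_attention(regions, pseudo_attributes):
    """Combine attribute-related attention with visual self-attention at one scale.

    regions:           (R, d) region features at this scale
    pseudo_attributes: (K, d) placeholder pseudo-attribute vectors
    """
    d = regions.shape[1]
    # Attribute-related attention: each pseudo-attribute attends over regions.
    attr_attn = softmax(pseudo_attributes @ regions.T / np.sqrt(d))  # (K, R)
    # Visual self-attention: each region attends over all regions.
    self_attn = softmax(regions @ regions.T / np.sqrt(d))            # (R, R)
    # Assumed refinement: propagate attribute attention through region similarity.
    refined = attr_attn @ self_attn                                  # (K, R), rows sum to 1
    return refined @ regions, refined                                # (K, d) attended features

rng = np.random.default_rng(0)
d, k = 8, 4
pseudo_attributes = rng.normal(size=(k, d))      # assumption: stand-in for MIR-derived vectors
scales = {"coarse": rng.normal(size=(4, d)),     # e.g. a 2x2 grid of regions
          "fine": rng.normal(size=(16, d))}      # e.g. a 4x4 grid of regions

per_scale = {name: hybrid_attention(x, pseudo_attributes) for name, x in scales.items()}
# Assumed fusion: average the attribute-attended features across scales.
fused = np.mean([feats for feats, _ in per_scale.values()], axis=0)
print(fused.shape)
```

Because both attention maps are row-normalized, their product is also row-normalized, so each pseudo-attribute still distributes a total weight of one over the regions after refinement.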

Main Results:

The mVACA model demonstrates state-of-the-art or competitive performance on standard benchmarks for both zero-shot learning (ZSL) and generalized ZSL (GZSL) tasks.
The framework effectively unifies standard ZSL and GZSL by utilizing multi-scale visual discrimination.

Conclusions:

The proposed mVACA model advances zero-shot image recognition by integrating multi-scale visual and semantic information.
The model's ability to handle both ZSL and GZSL tasks, coupled with visualized analysis of image-attribute interactions, provides valuable insights into ZSR mechanisms.