Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

A functionally guided fusion Vision Transformer for predicting IDH status in gliomas: a multicenter study with external validation and incomplete multimodal evaluation.

Radiologie (Heidelberg, Germany)·2026
Same author

Booster vaccine reduces BCG-primed mice's protection against primary Mycobacterium tuberculosis infection by raising IL-10 levels.

Vaccine·2026
Same author

The IL-36 Cytokine Rheostat: Hierarchical Regulation of Epithelial-Immune Crosstalk and Precision Therapy in Psoriatic and Related Dermatoses.

Clinical, cosmetic and investigational dermatology·2026
Same author

Cold plasma-modified goat milk casein/Chinese yam polysaccharide/chitosan composite films: Structural, functional, and preservative properties for fresh pork.

Food chemistry·2026
Same author

Autoantibodies Targeting Complement Regulating Factors Induced C3 Glomerulopathy Resambling C4 Dense Deposit Disease: A Case Report.

Nephrology (Carlton, Vic.)·2026
Same author

Patient safety competency among nurses and nursing students: A meta-analysis.

Nurse education in practice·2026
Same journal

Style-Aware Contrastive Test-Time Adaptation: A Dual-Cache Model for Robust Vision-Language Alignment.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

Semantic Frame Interpolation.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

Physics-Guided Cross-Modal Decoupling with Test-Time Adaptation for Hyperspectral Image Restoration.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

Change-Prior-Guided Unsupervised Change Detection of Heterogeneous Remote Sensing Images.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

AgonicDreamer: Enhancing Multi-View Consistency in Text-to-3D Generation via Rectified Score Distillation.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

BiCM-Prompt: Bidirectional Cross-Modal Prompt Tuning for Class-Incremental Learning on Multisource Remote Sensing Images.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
See all related articles

Related Experiment Video

Updated: Jul 10, 2025

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment
08:25

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

9.0K

Point-Based Learnable Query Generator for Human-Object Interaction Detection.

Wang-Kai Lin, Hong-Bo Zhang, Zongwen Fan

    IEEE Transactions on Image Processing : a Publication of the IEEE Signal Processing Society
    |November 23, 2023
    PubMed
    Summary
    This summary is machine-generated.

    This study introduces a new Transformer-based framework to improve human-object interaction (HOI) detection. The novel approach enhances feature correlation for more accurate detection of interactions between humans and objects.

    More Related Videos

    Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention
    06:37

    Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention

    Published on: December 15, 2023

    3.8K
    A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis
    05:41

    A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis

    Published on: February 6, 2020

    9.4K

    Related Experiment Videos

    Last Updated: Jul 10, 2025

    Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment
    08:25

    Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

    Published on: May 7, 2019

    9.0K
    Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention
    06:37

    Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention

    Published on: December 15, 2023

    3.8K
    A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis
    05:41

    A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis

    Published on: February 6, 2020

    9.4K

    Area of Science:

    • Computer Vision
    • Artificial Intelligence
    • Machine Learning

    Background:

    • Transformer-based and interaction point-based methods show promise in human-object interaction (HOI) detection.
    • Directly integrating these distinct model types is challenging due to structural and property differences.
    • Current Transformer HOI methods use separate decoders for instance detection and interaction recognition, limiting feature correlation.

    Purpose of the Study:

    • To propose a novel Transformer-based HOI detection framework that enhances the intrinsic correlation between instance and action features.
    • To improve the accuracy of HOI detection by developing a more effective query generation mechanism.
    • To advance the state-of-the-art in human-object interaction detection.

    Main Methods:

    • A novel Transformer-based HOI detection framework is proposed, featuring a decoder with three components: a learnable query generator, an instance decoder, and an interaction classifier.
    • The learnable query generator is designed to create effective queries, guiding the instance decoder and interaction classifier to learn accurate instance and interaction features.
    • The query generator incorporates prior bounding boxes, keypoint detection, and spatial relation features, inspired by interaction point-based methods.

    Main Results:

    • The proposed framework demonstrates improved performance in human-object interaction detection.
    • Experimental validation on the HICO-DET and V-COCO datasets shows superior results compared to existing state-of-the-art methods.
    • The novel learnable query generator effectively enhances the learning of instance and interaction features.

    Conclusions:

    • The proposed Transformer-based HOI detection framework successfully increases the intrinsic correlation between instance and action features.
    • The method achieves better performance on benchmark datasets, indicating its effectiveness and potential for real-world applications.
    • The integration of prior bounding boxes, keypoint detection, and spatial relation features in the query generator is a key contribution.