Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Video

Updated: Aug 30, 2025

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images
04:23

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

Published on: April 21, 2023

1.9K

PointTransformer: Encoding Human Local Features for Small Target Detection.

Yudi Tang1, Bing Wang1, Wangli He1

  • 1Key Laboratory of Smart Manufacturing in Energy Chemical Process, Ministry of Education, East China University of Science and Technology, No. 130 Meilong Road, Shanghai 200237, China.

Computational Intelligence and Neuroscience
|September 1, 2022
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

[Platelet parameters and platelet Toll-like receptor 4 (TLR4) expression in patients with sepsis, and the effect of a joint treatment-plan integrating traditional Chinese and western medicine: a clinical study].

Zhongguo wei zhong bing ji jiu yi xue = Chinese critical care medicine = Zhongguo weizhongbing jijiuyixue·2011
Same author

A novel kernel Fisher discriminant analysis: constructing informative kernel by decision tree ensemble for metabolomics data analysis.

Analytica chimica acta·2011
Same author

Anterior debridement and reconstruction via thoracoscopy-assisted mini-open approach for the treatment of thoracic spinal tuberculosis: minimum 5-year follow-up.

European spine journal : official publication of the European Spine Society, the European Spinal Deformity Society, and the European Section of the Cervical Spine Research Society·2011
Same author

[A family-based association study of FXYD6 gene polymorphisms and schizophrenia].

Zhonghua yi xue yi chuan xue za zhi = Zhonghua yixue yichuanxue zazhi = Chinese journal of medical genetics·2011
Same author

Prenatal diagnosis of penoscrotal transposition with 2- and 3-dimensional ultrasonography.

Journal of ultrasound in medicine : official journal of the American Institute of Ultrasound in Medicine·2011
Same author

Differentiation of α- or β-aspartic isomers in the heptapeptides by the fragments of [M + Na]+ using ion trap tandem mass spectrometry.

Journal of the American Society for Mass Spectrometry·2011
Same journal

RETRACTION: Real-Time Modulation of Physical Training Intensity Based on Wavelet Recursive Fuzzy Neural Networks.

Computational intelligence and neuroscience·2026
Same journal

RETRACTION: Multidimensional Heterogeneous Network Link Adaptation Based on Mobile Environment.

Computational intelligence and neuroscience·2026
Same journal

RETRACTION: Framework to Segment and Evaluate Multiple Sclerosis Lesion in MRI Slices Using VGG-UNet.

Computational intelligence and neuroscience·2026
Same journal

RETRACTION: Facial Emotion Recognition Using a Novel Fusion of Convolutional Neural Network and Local Binary Pattern in Crime Investigation.

Computational intelligence and neuroscience·2026
Same journal

RETRACTION: Automatic Intelligent System Using Medical of Things for Multiple Sclerosis Detection.

Computational intelligence and neuroscience·2026
Same journal

RETRACTION: Intangible Cultural Heritage Reproduction and Revitalization: Value Feedback, Practice, and Exploration Based on the IPA Model.

Computational intelligence and neuroscience·2026
See all related articles

This study introduces a novel Point Transformer model to improve small object detection in chemical plants, outperforming existing methods by leveraging human skeletal key points for enhanced feature extraction and accuracy.

Area of Science:

  • Computer Vision
  • Artificial Intelligence
  • Industrial Safety

Background:

  • Small object detection and occlusion handling are critical challenges in industrial settings, particularly in chemical plants.
  • Existing object detection methods often fail due to worker occlusion and long-distance surveillance, leading to missed detections.
  • Current multi-feature fusion strategies inadequately utilize local features, hindering small target detection performance.

Purpose of the Study:

  • To develop an advanced object detection framework capable of accurately identifying small targets in occluded environments within chemical plants.
  • To address the limitations of existing methods in handling missed detections caused by worker occlusion and long-distance surveillance.
  • To improve the performance of small target detection by effectively utilizing local features and positional information.

More Related Videos

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

616
Medical-grade Sterilizable Target for Fluid-immersed Fetoscope Optical Distortion Calibration
07:03

Medical-grade Sterilizable Target for Fluid-immersed Fetoscope Optical Distortion Calibration

Published on: February 23, 2017

7.8K

Related Experiment Videos

Last Updated: Aug 30, 2025

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images
04:23

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

Published on: April 21, 2023

1.9K
Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

616
Medical-grade Sterilizable Target for Fluid-immersed Fetoscope Optical Distortion Calibration
07:03

Medical-grade Sterilizable Target for Fluid-immersed Fetoscope Optical Distortion Calibration

Published on: February 23, 2017

7.8K

Main Methods:

  • Introduction of Point Transformer, a transformer encoder, as the core backbone for object detection.
  • Utilizing a priori information of human skeletal points to extract local features.
  • Employing self-attention and cross-attention mechanisms to reconstruct local features for each key point.
  • Proposing a learnable positional encoding method to enhance the model's sensitivity to skeletal point positions.

Main Results:

  • The proposed Point Transformer model significantly outperforms classical object detection algorithms on a chemical plant field operation dataset.
  • The model achieves a 12% improvement in mean average precision (mAP) for small target detection compared to state-of-the-art methods.
  • Demonstrated superior performance in handling occlusion and detecting small targets in challenging industrial environments.

Conclusions:

  • The Point Transformer framework effectively addresses the limitations of traditional object detection methods for small targets in occluded industrial settings.
  • The integration of skeletal point information and learnable positional encoding significantly boosts detection accuracy.
  • The developed model offers a promising solution for enhancing safety and operational efficiency in chemical plant field operations through improved surveillance.