Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Experimental assessment of rheological behavior and environmental impact of low-carbon cementitious paste with fly ash and GGBS.

Journal of the Air & Waste Management Association (1995)·2026
Same author

Real-world road damage dataset with potholes, cracks, and maintenance holes.

Scientific reports·2026
Same author

Data-related Ablation for Reinforcing Deep Learning in Explaining Complex Phenomena.

International journal of neural systems·2026
Same author

Correction: A Benchmark Dataset for Radio Signal Image-based Person Re-Identification.

Scientific data·2025
Same author

A Benchmark Dataset for Radio Signal Image-based Person Re-Identification.

Scientific data·2025
Same author

Evaluating the Efficacy of Bioceramic versus Resin-Based Sealers in Endodontic Treatments: A Comparative Analysis.

Journal of pharmacy & bioallied sciences·2025
Same journal

Latent Space Projections and Atlases, a Cautionary Tale in Deep Neuroimaging using Autoencoders.

International journal of neural systems·2026
Same journal

Transformer-Based Anomaly Detection for Neurodegenerative Screening in MRI Images.

International journal of neural systems·2026
Same journal

Discrete Wavelet Convolution for Learnable Time-Frequency Representation with Application to Seizure Prediction.

International journal of neural systems·2026
Same journal

Automatic Seizure Detection using Hierarchical Spectral-Temporal Feature Learning with an Imbalance-Aware Transformer.

International journal of neural systems·2026
Same journal

Pyramid Vision Transformer-Enhanced Conformer Network for Epileptic Seizure Recognition Using MultiChannel EEG Signals.

International journal of neural systems·2026
Same journal

A Time-Frequency Decoupled Contrastive Learning Framework for Electroencephalography-Based Parkinson's Disease Diagnosis.

International journal of neural systems·2026
See all related articles

Related Experiment Video

Updated: Sep 7, 2025

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images
04:23

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

Published on: April 21, 2023

1.9K

Masked Transformer for Image Anomaly Localization.

Axel De Nardin1, Pankaj Mishra1, Gian Luca Foresti1

  • 1Department of Mathematics, Computer Science and Physics, Università Degli Studi di Udine, via Delle, Scienze 206, 33100 Udine, Italy.

International Journal of Neural Systems
|June 22, 2022
PubMed
Summary
This summary is machine-generated.

This study introduces a novel Vision Transformer model for image anomaly detection. By masking image patches and reconstructing them from surrounding data, the model effectively identifies visual anomalies in datasets.

Keywords:
Anomaly detectionimage inpaintingself-supervised learningvision transformer

More Related Videos

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique
04:48

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Published on: July 5, 2024

490
Perceptual and Category Processing of the Uncanny Valley Hypothesis' Dimension of Human Likeness: Some Methodological Issues
07:34

Perceptual and Category Processing of the Uncanny Valley Hypothesis' Dimension of Human Likeness: Some Methodological Issues

Published on: June 3, 2013

17.5K

Related Experiment Videos

Last Updated: Sep 7, 2025

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images
04:23

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

Published on: April 21, 2023

1.9K
Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique
04:48

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Published on: July 5, 2024

490
Perceptual and Category Processing of the Uncanny Valley Hypothesis' Dimension of Human Likeness: Some Methodological Issues
07:34

Perceptual and Category Processing of the Uncanny Valley Hypothesis' Dimension of Human Likeness: Some Methodological Issues

Published on: June 3, 2013

17.5K

Area of Science:

  • Computer Vision
  • Machine Learning
  • Artificial Intelligence

Background:

  • Image anomaly detection is crucial for applications like industrial inspection and medical imaging.
  • Current deep learning methods often use image reconstruction, which can fail when anomalies are similar to normal data.
  • Limitations exist in reconstruction-based anomaly detection when anomalies share characteristics with normal data.

Purpose of the Study:

  • To develop a more robust image anomaly detection model.
  • To overcome the limitations of traditional reconstruction-based deep learning approaches.
  • To improve the accuracy and reliability of identifying visual anomalies.

Main Methods:

  • A novel Vision Transformer architecture incorporating patch masking was developed.
  • Input images are divided into patches, with each patch reconstructed solely from surrounding contextual information.
  • Multi-resolution patches and their combined embeddings were utilized to enhance performance.

Main Results:

  • The proposed patch masking approach demonstrated superior performance compared to traditional reconstruction methods.
  • Utilizing multi-resolution patches significantly improved anomaly detection accuracy.
  • The model achieved competitive results on benchmark datasets like MVTec and head CT.

Conclusions:

  • The Vision Transformer with patch masking offers a promising alternative for image anomaly detection.
  • Reconstructing patches from surrounding context effectively handles anomalous regions.
  • Multi-resolution patch embeddings are key to enhancing the model's detection capabilities.