Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Experimental assessment of rheological behavior and environmental impact of low-carbon cementitious paste with fly ash and GGBS.

Journal of the Air & Waste Management Association (1995)·2026

Same author

Real-world road damage dataset with potholes, cracks, and maintenance holes.

Scientific reports·2026

Same author

Data-related Ablation for Reinforcing Deep Learning in Explaining Complex Phenomena.

International journal of neural systems·2026

Same author

Correction: A Benchmark Dataset for Radio Signal Image-based Person Re-Identification.

Scientific data·2025

Same author

A Benchmark Dataset for Radio Signal Image-based Person Re-Identification.

Scientific data·2025

Same author

Evaluating the Efficacy of Bioceramic versus Resin-Based Sealers in Endodontic Treatments: A Comparative Analysis.

Journal of pharmacy & bioallied sciences·2025

Same journal

Latent Space Projections and Atlases, a Cautionary Tale in Deep Neuroimaging using Autoencoders.

International journal of neural systems·2026

Same journal

Transformer-Based Anomaly Detection for Neurodegenerative Screening in MRI Images.

International journal of neural systems·2026

Same journal

Discrete Wavelet Convolution for Learnable Time-Frequency Representation with Application to Seizure Prediction.

International journal of neural systems·2026

Same journal

Automatic Seizure Detection using Hierarchical Spectral-Temporal Feature Learning with an Imbalance-Aware Transformer.

International journal of neural systems·2026

Same journal

Pyramid Vision Transformer-Enhanced Conformer Network for Epileptic Seizure Recognition Using MultiChannel EEG Signals.

International journal of neural systems·2026

Same journal

A Time-Frequency Decoupled Contrastive Learning Framework for Electroencephalography-Based Parkinson's Disease Diagnosis.

International journal of neural systems·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Sep 7, 2025

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

Published on: April 21, 2023

Masked Transformer for Image Anomaly Localization.

Axel De Nardin¹, Pankaj Mishra¹, Gian Luca Foresti¹

¹Department of Mathematics, Computer Science and Physics, Università Degli Studi di Udine, via Delle, Scienze 206, 33100 Udine, Italy.

International Journal of Neural Systems

|June 22, 2022

Summary

This summary is machine-generated.

This study introduces a novel Vision Transformer model for image anomaly detection. By masking image patches and reconstructing them from surrounding data, the model effectively identifies visual anomalies in datasets.

Keywords:

Anomaly detection image inpainting self-supervised learning vision transformer

More Related Videos

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Published on: July 5, 2024

Perceptual and Category Processing of the Uncanny Valley Hypothesis' Dimension of Human Likeness: Some Methodological Issues

Perceptual and Category Processing of the Uncanny Valley Hypothesis' Dimension of Human Likeness: Some Methodological Issues

Published on: June 3, 2013

Related Experiment Videos

Last Updated: Sep 7, 2025

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

Published on: April 21, 2023

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Published on: July 5, 2024

Perceptual and Category Processing of the Uncanny Valley Hypothesis' Dimension of Human Likeness: Some Methodological Issues

Perceptual and Category Processing of the Uncanny Valley Hypothesis' Dimension of Human Likeness: Some Methodological Issues

Published on: June 3, 2013

Area of Science:

Computer Vision
Machine Learning
Artificial Intelligence

Background:

Image anomaly detection is crucial for applications like industrial inspection and medical imaging.
Current deep learning methods often use image reconstruction, which can fail when anomalies are similar to normal data.
Limitations exist in reconstruction-based anomaly detection when anomalies share characteristics with normal data.

Purpose of the Study:

To develop a more robust image anomaly detection model.
To overcome the limitations of traditional reconstruction-based deep learning approaches.
To improve the accuracy and reliability of identifying visual anomalies.

Main Methods:

A novel Vision Transformer architecture incorporating patch masking was developed.
Input images are divided into patches, with each patch reconstructed solely from surrounding contextual information.
Multi-resolution patches and their combined embeddings were utilized to enhance performance.

Main Results:

The proposed patch masking approach demonstrated superior performance compared to traditional reconstruction methods.
Utilizing multi-resolution patches significantly improved anomaly detection accuracy.
The model achieved competitive results on benchmark datasets like MVTec and head CT.

Conclusions:

The Vision Transformer with patch masking offers a promising alternative for image anomaly detection.
Reconstructing patches from surrounding context effectively handles anomalous regions.
Multi-resolution patch embeddings are key to enhancing the model's detection capabilities.