Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Video

Updated: May 5, 2026

Advanced Diffusion Imaging in The Hippocampus of Rats with Mild Traumatic Brain Injury
10:33

Advanced Diffusion Imaging in The Hippocampus of Rats with Mild Traumatic Brain Injury

Published on: August 14, 2019

8.2K

Adversarial discriminant attack on text-to-image diffusion models.

Hanxiao Wu1, Shengwu Xiong2, Dong Yi3

  • 1School of Computer Science and Artificial Intelligence, Wuhan University of Technology, Wuhan, 430070, Hubei, China; Institute of Automation, Chinese Academy of Sciences, Beijing, 100190, China; School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing, 101408, China; Wuhan AI Research, Wuhan, 430000, Hubei, China.

Neural Networks : the Official Journal of the International Neural Network Society
|February 18, 2026
PubMed
Summary

Related Concept Videos

Difference from Background: Limit of Detection01:05

Difference from Background: Limit of Detection

8.6K
The limit of detection (LOD) is the smallest amount of analyte that can be distinguished from the background noise. The LOD value corresponds to the concentration at which the analyte signal is three times larger than the standard deviation of the blank signal. Below this value, the analyte signal cannot be differentiated from the background noise. It is calculated by dividing the calibration slope by 3 times the standard deviation of the blank signals.
The LOD indicates the presence or absence...
8.6K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Visible-Light-Driven Photoredox-Catalyzed [4 + 2] Annulation of Unsaturated α-Bromocarbonyls with α-Substituted Styrenes.

The Journal of organic chemistry·2026
Same author

A Biological Alignment Framework for Interpreting Anti-Inflammatory Cardiovascular Trials in Atherosclerosis.

Aging and disease·2026
Same author

Discovery of Novel Plant O-Methyltransferases for Directed Methylation of Flavonoids and Stilbenoids.

Chembiochem : a European journal of chemical biology·2026
Same author

Ethical considerations and management strategies for fertility preservation in women of reproductive age with malignant tumors: Chinese practices and perspectives.

Frontiers in endocrinology·2026
Same author

Photoredox-Catalyzed Transamidation of Amides with Weakly Nucleophilic Aromatic Amines.

The Journal of organic chemistry·2026
Same author

AnyDesign: Versatile area fashion editing via mask-free diffusion.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Exploiting audio-visual modalities in videos: Object detection via multi-stage bilateral coupling network.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Reliability-aware modality completion with cross-modal distillation for federated learning with missing modalities.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

IGFD-Net: Illumination-guided frequency decoupling for polarization image fusion.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Multiple-Strategies dung beetle optimizer and its applications in engineering optimization and bankruptcy prediction.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Aggregating global-scale pixel-wise forgery cues within a graph.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Finite-Time intermittent control for secure synchronization of Neutral-Type stochastic delayed neural networks under aperiodic DoS attacks.

Neural networks : the official journal of the International Neural Network Society·2026
See all related articles
This summary is machine-generated.

A new attack method, Adversarial Discriminant Attack (ADAtk), effectively bypasses safety mechanisms in concept-erased diffusion models. ADAtk generates images classified as Not-Safe-For-Work (NSFW) with over 90% success, revealing vulnerabilities in current AI safety techniques.

Area of Science:

  • Artificial Intelligence
  • Computer Vision
  • Generative Models

Background:

  • Concept-erased diffusion models face challenges in preventing Not-Safe-For-Work (NSFW) content generation.
  • Existing attack methods focus on image similarity, which does not guarantee successful NSFW reconstruction.

Purpose of the Study:

  • To propose a novel attack method, Adversarial Discriminant Attack (ADAtk), to expose vulnerabilities in concept-erased diffusion models.
  • To address the limitations of existing generation-focused attacks by adopting a discriminative approach.

Main Methods:

  • ADAtk optimizes the likelihood of generating NSFW content by creating adversarial perturbations in the model's latent space.
  • The method guides image reconstruction towards a target discriminant class, aiming for classification as inappropriate.
Keywords:
AI securityAdversarial discriminant attackConcept-erased diffusion modelText-to-image generation

Related Experiment Videos

Last Updated: May 5, 2026

Advanced Diffusion Imaging in The Hippocampus of Rats with Mild Traumatic Brain Injury
10:33

Advanced Diffusion Imaging in The Hippocampus of Rats with Mild Traumatic Brain Injury

Published on: August 14, 2019

8.2K

Main Results:

  • ADAtk achieved over a 90% success rate in bypassing current internal security mechanisms.
  • The attack successfully exposed critical limitations in existing concept-erasure techniques for diffusion models.

Conclusions:

  • ADAtk provides crucial insights into improving the safety and reliability of text-to-image generation systems.
  • The findings pave the way for developing more secure generative AI models and robust safety protocols.