Defense Mechanism Against Infection
Stereotype Content Model
Protecting Self-Esteem
Defenses Against Pathogens and Herbivores
Defense Against Bacterial Pathogens
Understanding Deception
You might also read
Articles linked to this work by shared authors, journal, and citation graph.
Yi Zhong1, Zhenzhu Chen1, Rui Zhang1
1School of Computing and Artificial Intelligence, Southwestern University of Finance and Economics, Chengdu, 611130, China.
SEEK is a new defense against model hijacking attacks in NLP. It removes malicious prompt words and restores meaning, reducing attack success by 75% while preserving model performance.
Area of Science:
Background:
Purpose of the Study:
Main Methods:
Main Results:
Conclusions: