TY - UNPB
T1 - Adversarial Attacks on Image Generation With Made-Up Words
AU - Millière, Raphaël
PY - 2022/8/1
Y1 - 2022/8/1
N2 - Text-guided image generation models can be prompted to generate images using nonce words adversarially designed to robustly evoke specific visual concepts. Two approaches for such generation are introduced: macaronic prompting, which involves designing cryptic hybrid words by concatenating subword units from different languages; and evocative prompting, which involves designing nonce words whose broad morphological features are similar enough to that of existing words to trigger robust visual associations. The two methods can also be combined to generate images associated with more specific visual concepts. The implications of these techniques for the circumvention of existing approaches to content moderation, and particularly the generation of offensive or harmful images, are discussed.
AB - Text-guided image generation models can be prompted to generate images using nonce words adversarially designed to robustly evoke specific visual concepts. Two approaches for such generation are introduced: macaronic prompting, which involves designing cryptic hybrid words by concatenating subword units from different languages; and evocative prompting, which involves designing nonce words whose broad morphological features are similar enough to that of existing words to trigger robust visual associations. The two methods can also be combined to generate images associated with more specific visual concepts. The implications of these techniques for the circumvention of existing approaches to content moderation, and particularly the generation of offensive or harmful images, are discussed.
KW - Computer Science - Computation and Language
KW - Computer Science - Computer Vision and Pattern Recognition
KW - Computer Science - Cryptography and Security
KW - Computer Science - Machine Learning
U2 - 10.48550/arXiv.2208.04135
DO - 10.48550/arXiv.2208.04135
M3 - Preprint
T3 - arXiv
BT - Adversarial Attacks on Image Generation With Made-Up Words
ER -