Black-Box Targeted Adversarial Attack on Segment Anything (SAM)

Sheng Zheng; Chaoning Zhang; Xinhong Hao

doi:10.1109/TMM.2024.3521769

Black-Box Targeted Adversarial Attack on Segment Anything (SAM)

Sheng Zheng, Chaoning Zhang^*, Xinhong Hao

^*此作品的通讯作者

科研成果: 期刊稿件 › 文章 › 同行评审

3 引用（Scopus）

摘要

Deep recognition models are widely vulnerable to adversarial examples, which change the model output by adding quasi-imperceptible perturbation to the image input. Recently, Segment Anything Model (SAM) has emerged to become a popular foundation model in computer vision due to its impressive generalization to unseen data and tasks. Realizing flexible attacks on SAM is beneficial for understanding the robustness of SAM in the adversarial context. To this end, this work aims to achieve a targeted adversarial attack (TAA) on SAM. Specifically, under a specific prompt, the goal is to make the predicted mask of an adversarial example resemble that of a given target image. The task of TAA on SAM has been realized in the white-box setup by assuming access to prompt and model, which is thus less practical. To address the issue of prompt dependence, we propose a simple yet effective approach by only attacking the image encoder. Moreover, we propose a novel regularization loss to enhance the cross-model transferability by increasing the feature dominance of adversarial images over random natural images. Extensive experiments verify the effectiveness of our proposed method to conduct a successful black-box TAA on SAM.

源语言	英语
页（从-至）	1901-1913
页数	13
期刊	IEEE Transactions on Multimedia
卷	27
DOI	http://doi.org/10.1109/TMM.2024.3521769
出版状态	已出版 - 2025
已对外发布	是

访问文件

10.1109/TMM.2024.3521769

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{a4d0e858783946528dd8227f4f9a8164,

title = "Black-Box Targeted Adversarial Attack on Segment Anything (SAM)",

abstract = "Deep recognition models are widely vulnerable to adversarial examples, which change the model output by adding quasi-imperceptible perturbation to the image input. Recently, Segment Anything Model (SAM) has emerged to become a popular foundation model in computer vision due to its impressive generalization to unseen data and tasks. Realizing flexible attacks on SAM is beneficial for understanding the robustness of SAM in the adversarial context. To this end, this work aims to achieve a targeted adversarial attack (TAA) on SAM. Specifically, under a specific prompt, the goal is to make the predicted mask of an adversarial example resemble that of a given target image. The task of TAA on SAM has been realized in the white-box setup by assuming access to prompt and model, which is thus less practical. To address the issue of prompt dependence, we propose a simple yet effective approach by only attacking the image encoder. Moreover, we propose a novel regularization loss to enhance the cross-model transferability by increasing the feature dominance of adversarial images over random natural images. Extensive experiments verify the effectiveness of our proposed method to conduct a successful black-box TAA on SAM.",

keywords = "Targeted adversarial attack, black-box, practical, robustness, segment anything model (SAM)",

author = "Sheng Zheng and Chaoning Zhang and Xinhong Hao",

note = "Publisher Copyright: {\textcopyright} 1999-2012 IEEE.",

year = "2025",

doi = "10.1109/TMM.2024.3521769",

language = "English",

volume = "27",

pages = "1901--1913",

journal = "IEEE Transactions on Multimedia",

issn = "1520-9210",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - JOUR

T1 - Black-Box Targeted Adversarial Attack on Segment Anything (SAM)

AU - Zheng, Sheng

AU - Zhang, Chaoning

AU - Hao, Xinhong

PY - 2025

Y1 - 2025

N2 - Deep recognition models are widely vulnerable to adversarial examples, which change the model output by adding quasi-imperceptible perturbation to the image input. Recently, Segment Anything Model (SAM) has emerged to become a popular foundation model in computer vision due to its impressive generalization to unseen data and tasks. Realizing flexible attacks on SAM is beneficial for understanding the robustness of SAM in the adversarial context. To this end, this work aims to achieve a targeted adversarial attack (TAA) on SAM. Specifically, under a specific prompt, the goal is to make the predicted mask of an adversarial example resemble that of a given target image. The task of TAA on SAM has been realized in the white-box setup by assuming access to prompt and model, which is thus less practical. To address the issue of prompt dependence, we propose a simple yet effective approach by only attacking the image encoder. Moreover, we propose a novel regularization loss to enhance the cross-model transferability by increasing the feature dominance of adversarial images over random natural images. Extensive experiments verify the effectiveness of our proposed method to conduct a successful black-box TAA on SAM.

AB - Deep recognition models are widely vulnerable to adversarial examples, which change the model output by adding quasi-imperceptible perturbation to the image input. Recently, Segment Anything Model (SAM) has emerged to become a popular foundation model in computer vision due to its impressive generalization to unseen data and tasks. Realizing flexible attacks on SAM is beneficial for understanding the robustness of SAM in the adversarial context. To this end, this work aims to achieve a targeted adversarial attack (TAA) on SAM. Specifically, under a specific prompt, the goal is to make the predicted mask of an adversarial example resemble that of a given target image. The task of TAA on SAM has been realized in the white-box setup by assuming access to prompt and model, which is thus less practical. To address the issue of prompt dependence, we propose a simple yet effective approach by only attacking the image encoder. Moreover, we propose a novel regularization loss to enhance the cross-model transferability by increasing the feature dominance of adversarial images over random natural images. Extensive experiments verify the effectiveness of our proposed method to conduct a successful black-box TAA on SAM.

KW - Targeted adversarial attack

KW - black-box

KW - practical

KW - robustness

KW - segment anything model (SAM)

UR - http://www.scopus.com/pages/publications/105002264877

U2 - 10.1109/TMM.2024.3521769

DO - 10.1109/TMM.2024.3521769

M3 - Article

AN - SCOPUS:105002264877

SN - 1520-9210

VL - 27

SP - 1901

EP - 1913

JO - IEEE Transactions on Multimedia

JF - IEEE Transactions on Multimedia

ER -

Black-Box Targeted Adversarial Attack on Segment Anything (SAM)

摘要

访问文件

其它文件与链接

指纹

引用此