0% Complete
English
صفحه اصلی
/
شانزدهمین کنفرانس بین المللی فناوری اطلاعات و دانش
Detection of Backdoor Attacks in Neural Networks Using Input Optimization
نویسندگان :
Parsa Hashemi Khorsand
1
Ahmad Nickabadi
2
1- Amirkabir University of Technology (Tehran Polytechnic)
2- Amirkabir University of Technology (Tehran Polytechnic)
کلمات کلیدی :
backdoor attacks،adversarial robustness،backdoor detection،model contamination detection،input optimization،regularization
چکیده :
This paper presents a clean-data-free framework for detecting backdoor attacks in neural networks via input optimization. We introduce two complementary strategies. First, joint input optimization with a cleanliness detector: for each label, we optimize an input that simultaneously (i) maximizes the target-label logit on the suspected model and (ii) maintains in-domain naturalness according to an auxiliary diagnostic model; the resulting patterns are then inspected for trigger-like artifacts. Second, input optimization with the largest feasible regularization coefficient: for each label, we find the largest feasible regularization coefficient that still attains a preset confidence threshold, forming a per-class signature vector; Median Absolute Deviation (MAD) is then used to flag outlier labels as compromised. On MNIST, our framework achieves 89.5 percent detection accuracy on backdoored models with 100 percent recall in poisoned-label flagging, while requiring no access to clean training data. We further compare our methods with Neural Cleanse and the Certified Backdoor Detector (CBD).
لیست مقالات
لیست مقالات بایگانی شده
Multi-Modal Longitudinal Tooth Labeling with Temporal Graph–Transformer Integration
Maral Mirza mohammadi - Mahdi Tarom
Design and Simulation of a New Multiplexer with Energy Analysis in Quantum Cellular Automata Technology
- - -
بررسی تأثیر استقرار استاندارد COBIT در افزایش بهره وری سازمانها (مطالعه موردی: شعب نمایندگیهای همراه اول، ایرانسل، رایتل)
دکتر محمد ابراهیم سمیع - ساره رحمانیان محمد ابراهیم سمیع - ساره رحمانیان -
Designing an AI-assisted toolbox for fitness activity recognition based on deep CNN
Ali Bidaran - Dr Saeed Sharifian
ParsEL 1.0: Unsupervised Entity Linking in Persian Social Media Texts
Majid Asgari-bidhendi - Farzane Fakhrian - Dr Behrouz Minaei-bidgoli
آسیب شناسی استقرار بلاکچین در صنعت بانکی کشور ایران
نیلوفر مرادحاصل
Vi-Net: A Deep Violent Flow Network for Violence Detection in Video Sequences
Tahereh Zarrat Ehsan - Seyed Mehdi Mohtavipour
Two Novel Designs of Efficient Single-Bit Comparators in QCA Technology with Ultra-Low Energy Dissipation
Shobeir Fayazi - Hatam Abdoli
Embedding-Consistent Contrastive Learning: A Robust Approach for Imbalanced Classification
Sobhan Siamak - Eghbal Mansoori
Improved Weighting in the Automated Texts Classification using Fuzzy Method
Hamidreza Sadrarhami - S. Mohammadali Zanjani - Ghazanfar Shahgholian
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 43.8.0