The code in this repository demonstrates that the defense proposed in "Deflecting Adversarial Attacks with Pixel Deflection" (Prakash et al., 2018) is ineffective in the white-box threat model.
With an L-infinity perturbation of 4/255, we generate targeted adversarial examples with a 97% success rate and reduce classifier accuracy to 0%.
See our note for more context and details.
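For concreteness, below is a minimal sketch of the style of white-box attack that defeats this kind of defense: targeted L-infinity PGD using BPDA (treating the non-differentiable pixel-deflection transform as the identity on the backward pass) combined with EOT (averaging gradients over the defense's randomness). This is illustrative only, not the attack code in this repository: `model`, `pixel_deflection`, and all hyperparameters are placeholders.

```python
# Illustrative sketch only. `model` (a differentiable classifier returning
# logits) and `pixel_deflection` (the randomized, non-differentiable defense)
# are placeholders, not the code in this repository.
import torch
import torch.nn.functional as F

def bpda_eot_pgd(model, pixel_deflection, x, target,
                 eps=4 / 255, step=1 / 255, iters=100, eot_samples=8):
    """Targeted L-infinity PGD through a non-differentiable defense.

    BPDA: the defense runs on the forward pass, but gradients flow through
    it as if it were the identity. EOT: gradients are averaged over several
    random draws of the defense before each step.
    """
    x_adv = x.clone().detach()
    for _ in range(iters):
        x_adv.requires_grad_(True)
        grad = torch.zeros_like(x)
        for _ in range(eot_samples):
            with torch.no_grad():
                deflected = pixel_deflection(x_adv)  # non-differentiable step
            # Straight-through estimator: forward pass uses the deflected
            # image, backward pass treats the defense as the identity.
            deflected = x_adv + (deflected - x_adv).detach()
            loss = F.cross_entropy(model(deflected), target)
            grad += torch.autograd.grad(loss, x_adv)[0]
        with torch.no_grad():
            x_adv = x_adv - step * grad.sign()        # descend toward the target class
            x_adv = x + (x_adv - x).clamp(-eps, eps)  # project into the eps-ball
            x_adv = x_adv.clamp(0.0, 1.0)             # keep a valid image
    return x_adv.detach()
```

Here `x` is a batch of images scaled to [0, 1] and `target` is a tensor of target class indices; see the attack code in this repository for the exact procedure and hyperparameters we used.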
Obligatory picture of a sample of adversarial examples generated against this defense.
To cite this work:

```bibtex
@unpublished{cvpr2018breaks,
    author = {},
    title = {},
    year = {2018},
    url = {https://arxiv.org/abs/TODO},
}
```