abstract: |
  Modern deep neural networks can achieve high accuracy when the training distribution and test distribution are identically distributed, but this assumption is frequently violated in practice. When the train and test distributions are mismatched, accuracy can plummet. Currently there are few techniques that improve robustness to unforeseen data shifts encountered during deployment. In this work, we propose a technique to improve the robustness and uncertainty estimates of image classifiers. We propose [AugMix]{.smallcaps}, a data processing technique that is simple to implement, adds limited computational overhead, and helps models withstand unforeseen corruptions. [AugMix]{.smallcaps} significantly improves robustness and uncertainty measures on challenging image classification benchmarks, closing the gap between previous methods and the best possible performance in some cases by more than half.
author:
- |
  Dan Hendrycks[^1]\
  UC Berkeley\
  [hendrycks@berkeley.edu](mailto:hendrycks@berkeley.edu)
- |
  Norman Mu[^1]\
  Google\
  [normanmu@google.com](mailto:normanmu@google.com)
- |
  Ekin D. Cubuk\
  Google\
  [cubuk@google.com](mailto:cubuk@google.com)
- |
  Barret Zoph\
  Google\
  [barretzoph@google.com](mailto:barretzoph@google.com)
- |
  Justin Gilmer\
  Google\
  [gilmer@google.com](mailto:gilmer@google.com)
- |
  Balaji Lakshminarayanan[^2]\
  DeepMind\
  [balajiln@google.com](mailto:balajiln@google.com)
bibliography:
- main.bib
title: "AugMix: A Simple Data Processing Method to Improve Robustness and Uncertainty"
Introduction
Current machine learning models depend on the ability of training data to faithfully represent the data encountered during deployment. In practice, data distributions evolve [@Lipton2018DetectingAC], models encounter new scenarios [@hendrycks17baseline], and data curation procedures may capture only a narrow slice of the underlying data distribution [@Torralba2011UnbiasedLA]. Mismatches between the train and test data are commonplace, yet the study of this problem is not. As it stands, models do not robustly generalize across shifts in the data distribution. If models could identify when they are likely to be mistaken, or estimate uncertainty accurately, then the impact of such fragility might be ameliorated. Unfortunately, modern models already produce overconfident predictions when the training examples are independent and identically distributed to the test distribution. This overconfidence and miscalibration is greatly exacerbated by mismatched training and testing distributions.
Small corruptions to the data distribution are enough to subvert existing classifiers, and techniques to improve corruption robustness remain few in number. @hendrycks2019robustness show that classification error of modern models rises from 22% on the usual ImageNet test set to 64% on ImageNet-C, a test set consisting of various corruptions applied to ImageNet test images. Even methods which aim to explicitly quantify uncertainty, such as probabilistic and Bayesian neural networks, struggle under data shift, as recently demonstrated by @ovadia2019can. Improving performance in this setting has been difficult. One reason is that training against corruptions only encourages networks to memorize the specific corruptions seen during training and leaves models unable to generalize to new corruptions [@igor; @geirhos]. Further, networks trained on translation augmentations remain highly sensitive to images shifted by a single pixel [@gu2019using; @hendrycks2019robustness]. Others have proposed aggressive data augmentation schemes [@Cubuk2018AutoAugmentLA], though at the cost of increased computation. @empiricalpaper demonstrate that many techniques may improve clean accuracy at the cost of robustness, while many techniques which improve robustness harm uncertainty, and contrariwise. In all, existing techniques have considerable trade-offs.
In this work, we propose a technique to improve both the robustness and uncertainty estimates of classifiers under data shift. We propose [AugMix]{.smallcaps}, a method which simultaneously achieves new state-of-the-art results for robustness and uncertainty estimation while maintaining or improving accuracy on standard benchmark datasets. [AugMix]{.smallcaps} utilizes stochasticity and diverse augmentations, a Jensen-Shannon Divergence consistency loss, and a formulation to mix multiple augmented images to achieve state-of-the-art performance. On CIFAR-10 and CIFAR-100, our method roughly halves the corruption robustness error of standard training procedures from 28.4% to 12.4% and 54.3% to 37.8% error, respectively. On ImageNet, [AugMix]{.smallcaps} also achieves state-of-the-art corruption robustness and decreases perturbation instability from 57.2% to 37.4%. Code is available at https://github.com/google-research/augmix.
Related Work
Robustness under Data Shift. [@geirhos] show that training against distortions can often fail to generalize to unseen distortions, as networks have a tendency to memorize properties of the specific training distortion. [@igor] show training with various blur augmentations can fail to generalize to unseen blurs or blurs with different parameter settings. [@hendrycks2019robustness] propose measuring generalization to unseen corruptions and provide benchmarks for doing so. [@uar] construct an adversarial version of the aforementioned benchmark. [@Gilmer2018MotivatingTR; @gilmer2019discussion] argue that robustness to data shift is a pressing problem which greatly affects the reliability of real-world machine learning systems.
Calibration under Data Shift. [@kilian; @oconnor] propose metrics for determining the calibration of machine learning models. [@lakshminarayanan2017simple] find that simply ensembling classifier predictions improves prediction calibration. [@hendrycks2019pretrain] show that pre-training can also improve calibration. [@ovadia2019can] demonstrate that model calibration substantially deteriorates under data shift.
Data Augmentation. Data augmentation can greatly improve generalization performance. For image data, random left-right flipping and cropping are commonly used [@resnet]. Random occlusion techniques such as Cutout can also improve accuracy on clean data [@Devries2017ImprovedRO; @Zhong2017RandomED]. Rather than occluding a portion of an image, CutMix replaces a portion of an image with a portion of a different image [@Yun2019CutMixRS; @Takahashi2019DataAU]. Mixup also uses information from two images. Rather than implanting one portion of an image inside another, Mixup produces an elementwise convex combination of two images [@Zhang2017mixupBE; @Tokozume2017BetweenClassLF]. [@GuoMixup] show that Mixup can be improved with an adaptive mixing policy, so as to prevent manifold intrusion. Separate from these approaches are learned augmentation methods such as AutoAugment [@Cubuk2018AutoAugmentLA], where a group of augmentations is tuned to optimize performance on a downstream task. Patch Gaussian augments data with Gaussian noise applied to a randomly chosen portion of an image [@Lopes2019ImprovingRW]. A popular way to make networks robust to $\ell_p$ adversarial examples is with adversarial training [@madry], which we use in this paper. However, this tends to increase training time by an order of magnitude and substantially degrades accuracy on non-adversarial images [@Raghunathan2019AdversarialTC].
AugMix
[AugMix]{.smallcaps} is a data augmentation technique which improves model robustness and uncertainty estimates, and slots easily into existing training pipelines. At a high level, [AugMix]{.smallcaps} is characterized by its utilization of simple augmentation operations in concert with a consistency loss. These augmentation operations are sampled stochastically and layered to produce a high diversity of augmented images. We then enforce a consistent embedding by the classifier across diverse augmentations of the same input image through the use of the Jensen-Shannon divergence as a consistency loss.
Mixing augmentations allows us to generate diverse transformations, which are important for inducing robustness, as a common failure mode of deep models in the arena of corruption robustness is the memorization of fixed augmentations [@igor; @geirhos]. Previous methods have attempted to increase diversity by directly composing augmentation primitives in a chain, but this can cause the image to quickly degrade and drift off the data manifold, as depicted in the figure below. Such image degradation can be mitigated and the augmentation diversity can be maintained by mixing together the results of several augmentation chains in convex combinations. A concrete account of the algorithm is given in the pseudocode below.
{#fig:composition width="\textwidth"}
Input: Model $\hat{p}$, Classification Loss $\mathcal{L}$, Image $x_\text{orig}$, Operations $\mathcal{O} = \{\text{rotate}, \ldots, \text{posterize}\}$

function AugmentAndMix($x_\text{orig}$; $k = 3$, $\alpha = 1$):

1. Fill $x_\text{aug}$ with zeros
2. Sample mixing weights $(w_1, w_2, \ldots, w_k) \sim \text{Dirichlet}(\alpha, \alpha, \ldots, \alpha)$
3. For $i = 1, \ldots, k$:
   1. Sample operations $\text{op}_1, \text{op}_2, \text{op}_3 \sim \mathcal{O}$
   2. Compose operations with varying depth: $\text{op}_{12} = \text{op}_2 \circ \text{op}_1$ and $\text{op}_{123} = \text{op}_3 \circ \text{op}_2 \circ \text{op}_1$
   3. Sample uniformly from one of these operations: $\text{chain} \sim \{\text{op}_1, \text{op}_{12}, \text{op}_{123}\}$
   4. $x_\text{aug} \mathrel{+}= w_i \cdot \text{chain}(x_\text{orig})$ $\quad\triangleright$ Addition is elementwise
4. Sample weight $m \sim \text{Beta}(\alpha, \alpha)$
5. Interpolate with rule $x_\text{augmix} = m\, x_\text{orig} + (1 - m)\, x_\text{aug}$
6. Return $x_\text{augmix}$

$x_\text{augmix1} = \text{AugmentAndMix}(x_\text{orig})$ $\quad\triangleright$ $x_\text{augmix1}$ is stochastically generated
$x_\text{augmix2} = \text{AugmentAndMix}(x_\text{orig})$ $\quad\triangleright$ so $x_\text{augmix1} \ne x_\text{augmix2}$

Loss Output: $\mathcal{L}(\hat{p}(y \mid x_\text{orig}), y) + \lambda\, \text{Jensen-Shannon}(\hat{p}(y \mid x_\text{orig}); \hat{p}(y \mid x_\text{augmix1}); \hat{p}(y \mid x_\text{augmix2}))$
Augmentations. Our method consists of mixing the results from augmentation chains or compositions of augmentation operations. We use operations from AutoAugment; each operation is visualized in the Augmentation Operations appendix. Crucially, we exclude operations which overlap with ImageNet-C corruptions. In particular, we remove the contrast, color, brightness, sharpness, and Cutout operations so that our set of operations and the ImageNet-C corruptions are disjoint. Likewise, we do not use any image noising or image blurring operations, so that ImageNet-C corruptions are encountered only at test time. Operations such as rotate can be realized with varying severities, like $2^\circ$ or $-15^\circ$; for operations with varying severities, we uniformly sample the severity upon each application. Next, we randomly sample $k$ augmentation chains, where $k=3$ by default. Each augmentation chain is constructed by composing from one to three randomly selected augmentation operations.
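To make the operation interface concrete, two such operations might be sketched as follows; the severity ranges shown are illustrative assumptions, not the exact values in our released code.

```python
import numpy as np
from PIL import ImageOps

def rotate(pil_img, rng=np.random):
    # Severity is resampled on each application; the +/-30 degree
    # range is an assumed example range for illustration.
    return pil_img.rotate(rng.uniform(-30, 30))

def posterize(pil_img, rng=np.random):
    # Keep between 4 and 7 bits per channel (assumed range).
    return ImageOps.posterize(pil_img, int(rng.randint(4, 8)))
```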
Mixing. The resulting images from these augmentation chains are combined by mixing. While we considered mixing by alpha compositing, we chose to use elementwise convex combinations for simplicity. The $k$-dimensional vector of convex coefficients is randomly sampled from a $\text{Dirichlet}(\alpha,\ldots,\alpha)$ distribution. Once these images are mixed, we use a "skip connection" to combine the result of the augmentation chain and the original image through a second random convex combination sampled from a $\text{Beta}(\alpha,\alpha)$ distribution. The final image incorporates several sources of randomness from the choice of operations, the severity of these operations, the lengths of the augmentation chains, and the mixing weights.
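Putting these pieces together, a minimal NumPy sketch of the augment-and-mix procedure is below, assuming `operations` is a list of image-to-image functions such as those sketched above; see our released code for the exact implementation.

```python
import numpy as np

def augment_and_mix(pil_img, operations, k=3, alpha=1.0, rng=np.random):
    """Sketch of AugmentAndMix: mix k augmentation chains, then
    interpolate with the original image via a skip connection."""
    ws = rng.dirichlet([alpha] * k)  # convex coefficients over the k chains
    m = rng.beta(alpha, alpha)       # skip-connection weight

    image = np.asarray(pil_img, dtype=np.float32) / 255.0
    mix = np.zeros_like(image)
    for i in range(k):
        img_aug = pil_img.copy()
        # Each chain composes one to three randomly chosen operations.
        for _ in range(rng.randint(1, 4)):
            img_aug = rng.choice(operations)(img_aug)
        mix += ws[i] * (np.asarray(img_aug, dtype=np.float32) / 255.0)

    return m * image + (1 - m) * mix
```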
{#fig:augmix:illustration width="98%"}
Jensen-Shannon Divergence Consistency Loss. We couple with this augmentation scheme a loss that enforces smoother neural network responses. Since the semantic content of an image is approximately preserved with [AugMix]{.smallcaps}, we should like the model to embed $x_\text{orig}$, $x_\text{augmix1}$, $x_\text{augmix2}$ similarly. Toward this end, we minimize the Jensen-Shannon divergence among the posterior distributions of the original sample $x_\text{orig}$ and its augmented variants. That is, for $p_\text{orig} = \hat{p}(y\mid x_\text{orig})$, $p_\text{augmix1} = \hat{p}(y \mid x_\text{augmix1})$, $p_\text{augmix2} = \hat{p}(y \mid x_\text{augmix2})$, we replace the original loss $\mathcal{L}$ with the loss $$\begin{aligned} \mathcal{L}(p_\text{orig}, y) + \lambda\, \text{JS}(p_\text{orig}; p_\text{augmix1}; p_\text{augmix2}).\end{aligned}$$
To interpret this loss, imagine a sample from one of the three distributions $p_\text{orig},p_\text{augmix1},p_\text{augmix2}$. The Jensen-Shannon divergence can be understood to measure the average information that the sample reveals about the identity of the distribution from which it was sampled.
This loss can be computed by first obtaining $M = (p_\text{orig} + p_\text{augmix1} + p_\text{augmix2})/3$ and then computing $$\begin{aligned}
\text{JS}(p_\text{orig}; p_\text{augmix1}; p_\text{augmix2}) = \frac{1}{3}\Bigl(\text{KL}[p_\text{orig} \,\|\, M] + \text{KL}[p_\text{augmix1} \,\|\, M] + \text{KL}[p_\text{augmix2} \,\|\, M]\Bigr).
\end{aligned}$$ Unlike an arbitrary KL divergence between $p_\text{orig}$ and $p_\text{augmix}$, the Jensen-Shannon divergence is upper bounded, in this case by the logarithm of the number of classes. Note that we could instead compute $\text{JS}(p_\text{orig}; p_\text{augmix1})$, though this does not perform as well. The gain of training with $\text{JS}(p_\text{orig}; p_\text{augmix1}; p_\text{augmix2}; p_\text{augmix3})$ is marginal. The Jensen-Shannon Consistency Loss impels the model to be stable, consistent, and insensitive across a diverse range of inputs [@Bachman; @Zheng_2016; @alp]. Ablations are in the Ablations section and the Hyperparameter Ablations appendix.
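For concreteness, a PyTorch sketch of this consistency term is below; the clamp constant guards against $\log 0$ and is an implementation detail rather than part of the method.

```python
import torch
import torch.nn.functional as F

def jensen_shannon_consistency(logits_orig, logits_aug1, logits_aug2):
    """JS divergence among the posteriors of an image and two AugMix variants."""
    p_orig = F.softmax(logits_orig, dim=1)
    p_aug1 = F.softmax(logits_aug1, dim=1)
    p_aug2 = F.softmax(logits_aug2, dim=1)

    # Mixture distribution M; clamp before log to avoid log(0).
    log_m = torch.clamp((p_orig + p_aug1 + p_aug2) / 3.0, 1e-7, 1.0).log()

    # F.kl_div(log_m, p) computes KL(p || M), so this averages the three KLs.
    return (F.kl_div(log_m, p_orig, reduction='batchmean')
            + F.kl_div(log_m, p_aug1, reduction='batchmean')
            + F.kl_div(log_m, p_aug2, reduction='batchmean')) / 3.0
```

The total objective then adds $\lambda$ times this term to the usual cross-entropy loss on $x_\text{orig}$.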
Experiments
Datasets. The two CIFAR [@cifar_datasets] datasets contain small $32\times32\times3$ color natural images, both with 50,000 training images and 10,000 testing images. CIFAR-10 has 10 categories, and CIFAR-100 has 100. The ImageNet [@imagenet] dataset contains 1,000 classes of approximately 1.2 million large-scale color images.
In order to measure a model's resilience to data shift, we evaluate on the CIFAR-10-C, CIFAR-100-C, and ImageNet-C datasets [@hendrycks2019robustness]. These datasets are constructed by corrupting the original CIFAR and ImageNet test sets. For each dataset, there are a total of 15 noise, blur, weather, and digital corruption types, each appearing at 5 severity levels or intensities. Since these datasets are used to measure network behavior under data shift, we take care not to introduce these 15 corruptions into the training procedure.
The CIFAR-10-P, CIFAR-100-P, and ImageNet-P datasets also modify the original CIFAR and ImageNet datasets. These datasets contain smaller perturbations than CIFAR-C and are used to measure the classifier's prediction stability. Each example in these datasets is a video. For instance, a video with the brightness perturbation shows an image getting progressively brighter over time. We should like the network not to give inconsistent or volatile predictions between frames of the video as the brightness increases. Thus these datasets enable the measurement of the "jaggedness" [@transforms] of a network's prediction stream.
Metrics. The Clean Error is the usual classification error on the clean or uncorrupted test data. In our experiments, corrupted test data appears at five different intensities or severity levels $1 \le s \le 5$. For a given corruption $c$, the error rate at corruption severity $s$ is $E_{c,s}$. We can compute the average error across these severities to create the unnormalized corruption error $\text{uCE}_c = \sum_{s=1}^5 E_{c,s}$. On CIFAR-10-C and CIFAR-100-C we average these values over all 15 corruptions. Meanwhile, on ImageNet we follow the convention of normalizing the corruption error by the corruption error of AlexNet [@AlexNet]. We compute $\text{CE}_c = \sum_{s=1}^5 E_{c,s} \big/ \sum_{s=1}^5 E_{c,s}^\text{AlexNet}$. The average of the 15 corruption errors $\text{CE}_\text{Gaussian Noise}, \text{CE}_\text{Shot Noise}, \ldots, \text{CE}_\text{Pixelate}, \text{CE}_\text{JPEG}$ gives us the Mean Corruption Error (mCE).
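As a minimal sketch, the CE and mCE computations reduce to a few lines; `errors` below is a hypothetical mapping from corruption name to the five per-severity error rates.

```python
import numpy as np

def mean_corruption_error(errors, alexnet_errors=None):
    """errors: dict of corruption -> list of 5 per-severity error rates.
    If alexnet_errors is given (the ImageNet-C convention), each corruption
    error is normalized by AlexNet's summed error on the same corruption."""
    ces = []
    for corruption, per_severity in errors.items():
        ce = np.sum(per_severity)
        if alexnet_errors is not None:
            ce /= np.sum(alexnet_errors[corruption])
        ces.append(ce)
    return np.mean(ces)
```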
Perturbation robustness is not measured by accuracy but by whether video frame predictions match. Consequently, we compute what is called the flip probability. Concretely, for videos such as those with steadily increasing brightness, we determine the probability that two adjacent frames, or two frames with slightly different brightness levels, have "flipped" or mismatched predictions. There are 10 different perturbation types, and the mean across these is the mean Flip Probability (mFP). As with ImageNet-C, we can normalize by AlexNet's flip probabilities and obtain the mean Flip Rate (mFR).
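As a sketch, the flip probability for one perturbation type reduces to counting adjacent-frame prediction changes; `preds` below is a hypothetical array of per-frame predicted labels.

```python
import numpy as np

def flip_probability(preds):
    """preds: integer array of shape (num_videos, num_frames) holding
    the predicted class for each frame of each perturbation sequence."""
    flips = preds[:, 1:] != preds[:, :-1]  # adjacent-frame disagreements
    return flips.mean()
```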
In order to assess a model's uncertainty estimates, we measure its miscalibration. Classifiers capable of reliably forecasting their accuracy are considered "calibrated." For instance, a calibrated classifier should be correct 70% of the time on examples to which it assigns 70% confidence. Let the classifier's confidence that its prediction $\hat{Y}$ is correct be written $C$. Then the idealized RMS Calibration Error is $\sqrt{\mathbb{E}_C[(\mathbb{P}(Y=\hat{Y} \mid C = c) - c)^2]}$, which is the root of the expected squared difference between the accuracy at a given confidence level and that confidence level. In the Calibration Metrics appendix, we show how to empirically estimate this quantity and calculate the Brier Score.
CIFAR-10 and CIFAR-100 {#sec:cifar}
Training Setup. In the following experiments we show that [AugMix]{.smallcaps} endows robustness to various architectures including an All Convolutional Network [@allconv; @weightnorm], a DenseNet-BC ($k = 12$, $d = 100$) [@densenet], a 40-2 Wide ResNet [@wideresnet], and a ResNeXt-29 ($32\times4$) [@resnext]. All networks use an initial learning rate of $0.1$ which decays following a cosine learning rate schedule [@sgdr]. All input images are pre-processed with standard random left-right flipping and cropping prior to any augmentations. For consistency, we do not change [AugMix]{.smallcaps} parameters across the CIFAR-10 and CIFAR-100 experiments. The All Convolutional Network and Wide ResNet train for 100 epochs, and the DenseNet and ResNeXt require 200 epochs for convergence. We optimize with stochastic gradient descent using Nesterov momentum. Following [@Zhang2017mixupBE; @GuoMixup], we use a weight decay of $0.0001$ for Mixup and $0.0005$ otherwise.
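A schematic of this optimization setup is below, with a stand-in model; the momentum value shown is an assumed standard choice, and the full training loop lives in our released code.

```python
import torch
import torch.nn as nn

model = nn.Linear(3 * 32 * 32, 10)  # stand-in for the CIFAR architectures
epochs = 100
optimizer = torch.optim.SGD(model.parameters(), lr=0.1, momentum=0.9,
                            weight_decay=5e-4, nesterov=True)
# Cosine decay of the 0.1 initial learning rate over training.
scheduler = torch.optim.lr_scheduler.CosineAnnealingLR(optimizer, T_max=epochs)

for epoch in range(epochs):
    # ... one epoch of training with the AugMix Jensen-Shannon loss ...
    scheduler.step()
```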
{#fig:c10-bars width="\textwidth"}
::: {#tab:cifar}

| CIFAR-10-C  | Standard | Cutout | Mixup | CutMix | AutoAugment* | Adv Training | [AugMix]{.smallcaps} |
|-------------|----------|--------|-------|--------|--------------|--------------|----------------------|
| AllConvNet  | 30.8     | 32.9   | 24.6  | 31.3   | 29.2         | 28.1         | **15.0**             |
| DenseNet    | 30.7     | 32.1   | 24.6  | 33.5   | 26.6         | 27.6         | **12.7**             |
| WideResNet  | 26.9     | 26.8   | 22.3  | 27.1   | 23.9         | 26.2         | **11.2**             |
| ResNeXt     | 27.5     | 28.9   | 22.6  | 29.5   | 24.2         | 27.0         | **10.9**             |
| Mean        | 29.0     | 30.2   | 23.5  | 30.3   | 26.0         | 27.2         | **12.5**             |

| CIFAR-100-C | Standard | Cutout | Mixup | CutMix | AutoAugment* | Adv Training | [AugMix]{.smallcaps} |
|-------------|----------|--------|-------|--------|--------------|--------------|----------------------|
| AllConvNet  | 56.4     | 56.8   | 53.4  | 56.0   | 55.1         | 56.0         | **42.7**             |
| DenseNet    | 59.3     | 59.6   | 55.4  | 59.2   | 53.9         | 55.2         | **39.6**             |
| WideResNet  | 53.3     | 53.5   | 50.4  | 52.9   | 49.6         | 55.1         | **35.9**             |
| ResNeXt     | 53.4     | 54.6   | 51.4  | 54.1   | 51.3         | 54.4         | **34.9**             |
| Mean        | 55.6     | 56.1   | 52.6  | 55.5   | 52.5         | 55.2         | **38.3**             |
: Average classification error as percentages. Across several architectures, [AugMix]{.smallcaps} obtains CIFAR-10-C and CIFAR-100-C corruption robustness that exceeds the previous state of the art. :::
Results. Simply mixing random augmentations and using the Jensen-Shannon loss substantially improves robustness and uncertainty estimates. Compared to the "Standard" data augmentation baseline ResNeXt on CIFAR-10-C, [AugMix]{.smallcaps} achieves 16.6% lower absolute corruption error. In addition to surpassing numerous other data augmentation techniques, the table above demonstrates that these gains directly transfer across architectures and to CIFAR-100-C with zero additional tuning. Crucially, the robustness gains do not only exist when measured in aggregate: [AugMix]{.smallcaps} improves corruption robustness across every individual corruption and severity level, as shown in the per-corruption results in the appendix. Our method additionally achieves the lowest mFP on CIFAR-10-P across three different models, all while maintaining accuracy on clean CIFAR-10. Finally, we demonstrate that [AugMix]{.smallcaps} improves the RMS calibration error on CIFAR-10 and CIFAR-10-C. Expanded CIFAR-10-P and calibration results are in the Additional Results appendix, and a Fourier sensitivity analysis is in the Fourier Analysis appendix.
{#fig:pertandcalibrationcifar width="93.5%"}
{#fig:pertandcalibrationcifar width="98%"}
ImageNet
Baselines. To demonstrate the utility of [AugMix]{.smallcaps} on ImageNet, we compare to many techniques designed for large-scale images. While techniques such as Cutout [@Devries2017ImprovedRO] have not been demonstrated to help at the ImageNet scale, and while few have had success training adversarially robust models on ImageNet [@alpbroken], other techniques such as Stylized ImageNet have been demonstrated to help on ImageNet-C. Patch Uniform [@Lopes2019ImprovingRW] is similar to Cutout except that randomly chosen regions of the image are injected with uniform noise; the original paper uses Gaussian noise, but that appears in the ImageNet-C test set, so we use uniform noise. We tune Patch Uniform over 30 hyperparameter settings. Next, AutoAugment [@Cubuk2018AutoAugmentLA] searches over data augmentation policies to find a high-performing data augmentation policy. We denote AutoAugment results with AutoAugment* since we remove augmentation operations that overlap with ImageNet-C corruptions, as with [AugMix]{.smallcaps}. We also test with Random AutoAugment*, an augmentation scheme where each image has a randomly sampled augmentation policy using AutoAugment* operations. In contrast to AutoAugment, Random AutoAugment* and [AugMix]{.smallcaps} require far less computation and provide more augmentation variety, which can offset their lack of optimization. Note that Random AutoAugment* is different from RandAugment, introduced recently by @cubuk2019randaugment: RandAugment uses AutoAugment operations and optimizes a single distortion magnitude hyperparameter for all operations, while Random AutoAugment* randomly samples magnitudes for each operation and uses the same operations as [AugMix]{.smallcaps}. MaxBlur Pooling [@zhang2019shiftinvar] is a recently proposed architectural modification which smooths the results of pooling. Now, Stylized ImageNet (SIN) is a technique where models are trained with the original ImageNet images and also ImageNet images with style transfer applied. Whereas the original Stylized ImageNet technique pretrains on ImageNet-C and performs style transfer with a content loss coefficient of $0$ and a style loss coefficient of $1$, we find that using $0.5$ content and style loss coefficients decreases the mCE by 0.6%. Later, we show that SIN and [AugMix]{.smallcaps} can be combined. All models are trained from scratch, except for MaxBlur Pooling, for which pretrained models are available.
Training Setup. Methods are trained with a ResNet-50, and we follow the standard training scheme of [@Goyal2017AccurateLM], in which we linearly scale the learning rate with the batch size and use a learning rate warm-up for the first 5 epochs; AutoAugment and [AugMix]{.smallcaps} train for 180 epochs. All input images are first pre-processed with standard random cropping and horizontal mirroring.
The table below reports Clean Error and Corruption Errors (CE) on ImageNet-C, with corruptions grouped as noise (Gaussian, shot, impulse), blur (defocus, glass, motion, zoom), weather (snow, frost, fog, brightness), and digital (contrast, elastic, pixelate, JPEG).

| Network | Clean Error | Gauss. | Shot | Impulse | Defocus | Glass | Motion | Zoom | Snow | Frost | Fog | Bright | Contrast | Elastic | Pixel | JPEG | mCE |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Standard | 23.9 | 79 | 80 | 82 | 82 | 90 | 84 | 80 | 86 | 81 | 75 | 65 | 79 | 91 | 77 | 80 | 80.6 |
| Patch Uniform | 24.5 | 67 | 68 | 70 | 74 | 83 | 81 | 77 | 80 | 74 | 75 | 62 | 77 | 84 | 71 | 71 | 74.3 |
| AutoAugment* (AA) | 22.8 | 69 | 68 | 72 | 77 | 83 | 80 | 81 | 79 | 75 | 64 | 56 | 70 | 88 | 57 | 71 | 72.7 |
| Random AA* | 23.6 | 70 | 71 | 72 | 80 | 86 | 82 | 81 | 81 | 77 | 72 | 61 | 75 | 88 | 73 | 72 | 76.1 |
| MaxBlur pool | 23.0 | 73 | 74 | 76 | 74 | 86 | 78 | 77 | 77 | 72 | 63 | 56 | 68 | 86 | 71 | 71 | 73.4 |
| SIN | 27.2 | 69 | 70 | 70 | 77 | 84 | 76 | 82 | 74 | 75 | 69 | 65 | 69 | 80 | 64 | 77 | 73.3 |
| [AugMix]{.smallcaps} | 22.4 | 65 | 66 | 67 | 70 | 80 | 66 | 66 | 75 | 72 | 67 | 58 | 58 | 79 | 69 | 69 | 68.4 |
| [AugMix]{.smallcaps}+SIN | 25.2 | 61 | 62 | 61 | 69 | 77 | 63 | 72 | 66 | 68 | 63 | 59 | 52 | 74 | 60 | 67 | **64.9** |
Results. Our method achieves 68.4% mCE, as shown in the table above, down from the baseline 80.6% mCE. Additionally, we note that [AugMix]{.smallcaps} allows straightforward stacking with other methods such as SIN to achieve an even lower corruption error of 64.9% mCE. Other techniques such as AutoAugment* require much tuning, while ours does not. Across increasing severities of corruption, our method also produces much more calibrated predictions, as measured by both the Brier Score and RMS Calibration Error. As shown in the table below, [AugMix]{.smallcaps} also achieves a state-of-the-art result on ImageNet-P with an mFR of 37.4%, down from 57.2%. We demonstrate that scaling up [AugMix]{.smallcaps} from CIFAR to ImageNet also leads to state-of-the-art results in robustness and uncertainty estimation.
The table below reports Clean Error, Flip Rates, and mFR on ImageNet-P, with perturbations grouped as noise (Gaussian, shot), blur (motion, zoom), weather (snow, brightness), and digital (translate, rotate, tilt, scale).

| Network | Clean Error | Gaussian | Shot | Motion | Zoom | Snow | Bright | Translate | Rotate | Tilt | Scale | mFR |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Standard | 23.9 | 57 | 55 | 62 | 65 | 66 | 65 | 43 | 53 | 57 | 49 | 57.2 |
| Patch Uniform | 24.5 | 32 | 25 | 50 | 52 | 54 | 57 | 40 | 48 | 49 | 46 | 45.3 |
| AutoAugment* (AA) | 22.8 | 50 | 45 | 57 | 68 | 63 | 53 | 40 | 44 | 50 | 46 | 51.7 |
| Random AA* | 23.6 | 53 | 46 | 53 | 63 | 59 | 57 | 42 | 48 | 54 | 47 | 52.2 |
| SIN | 27.2 | 53 | 50 | 57 | 72 | 51 | 62 | 43 | 53 | 57 | 53 | 55.0 |
| MaxBlur pool | 23.0 | 52 | 51 | 59 | 63 | 57 | 64 | 34 | 43 | 49 | 40 | 51.2 |
| [AugMix]{.smallcaps} | 22.4 | 46 | 41 | 30 | 47 | 38 | 46 | 25 | 32 | 35 | 33 | **37.4** |
| [AugMix]{.smallcaps}+SIN | 25.2 | 45 | 40 | 30 | 54 | 32 | 48 | 27 | 35 | 38 | 39 | 38.9 |
{#fig:across-severities:imagenet width="98%"}
Ablations {#sec:ablations}
We locate the utility of [AugMix]{.smallcaps} in three factors: training set diversity, our Jensen-Shannon divergence consistency loss, and mixing. Improving training set diversity via an increased variety of augmentations can greatly improve robustness. For instance, augmenting each example with a randomly sampled augmentation chain decreases the error rate of the Wide ResNet on CIFAR-10-C from 26.9% to 17.0%. Adding in the Jensen-Shannon divergence consistency loss drops the error rate further to 14.7%. Mixing random augmentations without the Jensen-Shannon divergence loss gives us an error rate of 13.1%. Finally, re-introducing the Jensen-Shannon divergence gives us [AugMix]{.smallcaps} with an error rate of 11.2%. Note that adding even more mixing is not necessarily beneficial. For instance, applying [AugMix]{.smallcaps} on top of Mixup increases the error rate to 13.3%, possibly due to an increased chance of manifold intrusion [@GuoMixup]. Hence [AugMix]{.smallcaps}'s careful combination of variety, consistency loss, and mixing explains its performance.
::: {#tab:ablations}

| Method                                   | CIFAR-10-C Error Rate | CIFAR-100-C Error Rate |
|------------------------------------------|-----------------------|------------------------|
| Standard                                 | 26.9                  | 53.3                   |
| AutoAugment*                             | 23.9                  | 49.6                   |
| Random AutoAugment*                      | 17.0                  | 43.6                   |
| Random AutoAugment* + JSD Loss           | 14.7                  | 40.8                   |
| AugmentAndMix (No JSD Loss)              | 13.1                  | 39.8                   |
| [AugMix]{.smallcaps} (Mixing + JSD Loss) | **11.2**              | **35.9**               |
: Ablating components of [AugMix]{.smallcaps} on CIFAR-10-C and CIFAR-100-C. Variety through randomness, the Jensen-Shannon divergence (JSD) loss, and augmentation mixing confer robustness. :::
Conclusion
[AugMix]{.smallcaps} is a data processing technique which mixes randomly generated augmentations and uses a Jensen-Shannon loss to enforce consistency. Our simple-to-implement technique obtains state-of-the-art performance on CIFAR-10/100-C, ImageNet-C, CIFAR-10/100-P, and ImageNet-P. [AugMix]{.smallcaps} models achieve state-of-the-art calibration and can maintain calibration even as the distribution shifts. We hope that [AugMix]{.smallcaps} will enable more reliable models, a necessity for models deployed in safety-critical environments.
Hyperparameter Ablations {#app:moreablations}
In this section we demonstrate that [AugMix]{.smallcaps}'s hyperparameters are not highly sensitive, so that [AugMix]{.smallcaps} performs reliably without careful tuning. For this set of experiments, the baseline [AugMix]{.smallcaps} model trains for 90 epochs, has a mixing coefficient of $\alpha = 0.5$, has 3 examples per Jensen-Shannon Divergence (1 clean image, 2 augmented images), has a chain depth stochastically varying from 1 to 3, and has $k=3$ augmentation chains. The figure below shows the performance of various [AugMix]{.smallcaps} models with different hyperparameters. Under these hyperparameter changes, the mCE does not change substantially.
{#fig:moreablations width="\textwidth"}
{#fig:fourier width="\textwidth"}
Fourier Analysis {#app:fourier}
A commonly mentioned hypothesis [@gilmer2019discussion] for the lack of robustness of deep neural networks is that they readily latch onto spurious high-frequency correlations that exist in the data. In order to better understand the reliance of models on such correlations, we measure model sensitivity to additive noise at differing frequencies. We create a $32\times32$ sensitivity heatmap. That is, we add a total of $32\times32$ Fourier basis vectors to the CIFAR-10 test set, one at a time, and record the resulting error rate after adding each Fourier basis vector. Each point in the heatmap shows the error rate on the CIFAR-10 test set after it has been perturbed by a single Fourier basis vector. Points corresponding to low-frequency vectors are shown in the center of the heatmap, whereas high-frequency vectors are farther from the center. For further details on Fourier sensitivity analysis, we refer the reader to Section 2 of [@yin2019fourier]. In the resulting heatmaps, we observe that the baseline model is robust to low-frequency perturbations but severely lacks robustness to high-frequency perturbations, where error rates exceed 80%. The model trained with Cutout shows a similar lack of robustness. In contrast, the model trained with [AugMix]{.smallcaps} maintains robustness to low-frequency perturbations, and on the mid and high frequencies [AugMix]{.smallcaps} is conspicuously more robust.
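A simplified sketch of generating one such perturbation is below; taking the real part of an inverse FFT of a single-frequency spike yields a plane wave, and the fixed $\ell_2$ norm is an illustrative choice rather than the exact value used in the analysis.

```python
import numpy as np

def fourier_basis_perturbation(i, j, size=32, l2_norm=4.0):
    """Plane-wave image with spectral support at frequency (i, j),
    scaled to a fixed l2 norm, to be added to every test image."""
    spectrum = np.zeros((size, size), dtype=complex)
    spectrum[i, j] = 1.0
    basis = np.real(np.fft.ifft2(spectrum))
    return basis * (l2_norm / np.linalg.norm(basis))
```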
Augmentation Operations {#app:augops}
The augmentation operations we use for [AugMix]{.smallcaps} are shown in the figure below.
{#fig:augops width="50%"}
We do not use augmentations such as contrast, color, brightness, sharpness, and Cutout, as they may overlap with ImageNet-C test set corruptions. We should note that augmentation choice requires additional care. [@GuoMixup] show that blithely applying augmentations can potentially cause augmented images to take different classes. The figure below shows how a histogram color swapping augmentation may change a bird's class, leading to manifold intrusion.
{#fig:intrusion width="50%"}
Additional results {#app:additional}
We include various additional results for CIFAR-10, CIFAR-10-C, and CIFAR-10-P below. The figure below reports accuracy for each corruption, the first table reports calibration results for various architectures, and the second table reports clean error and mean flip probability. We refer to Section 4.1{reference-type="ref" reference="sec:cifar"} for details about the architecture and training setup.
{#fig:per-corr width="\textwidth"}
::: {#tab:cifar:calibration}

| CIFAR-10    | Standard | Cutout | Mixup | CutMix | AutoAugment* | Adv Training | [AugMix]{.smallcaps} |
|-------------|----------|--------|-------|--------|--------------|--------------|----------------------|
| AllConvNet  | 5.4      | 4.0    | 12.6  | 3.1    | 4.2          | 11.1         | 2.2                  |
| DenseNet    | 7.5      | 6.4    | 15.6  | 5.4    | 6.0          | 16.2         | 5.0                  |
| WideResNet  | 6.8      | 3.8    | 14.0  | 5.0    | 4.7          | 10.7         | 4.2                  |
| ResNeXt     | 3.0      | 4.4    | 13.5  | 3.5    | 3.3          | 5.8          | 3.0                  |
| Mean        | 5.7      | 4.7    | 13.9  | 4.2    | 4.6          | 11.0         | 3.6                  |

| CIFAR-10-C  | Standard | Cutout | Mixup | CutMix | AutoAugment* | Adv Training | [AugMix]{.smallcaps} |
|-------------|----------|--------|-------|--------|--------------|--------------|----------------------|
| AllConvNet  | 21.2     | 21.3   | 9.7   | 15.4   | 16.2         | 10.4         | 5.2                  |
| DenseNet    | 26.7     | 27.8   | 12.9  | 25.6   | 21.1         | 15.0         | 11.7                 |
| WideResNet  | 27.6     | 19.6   | 11.1  | 17.8   | 17.1         | 10.6         | 8.7                  |
| ResNeXt     | 16.4     | 21.4   | 11.7  | 19.6   | 15.1         | 11.6         | 8.3                  |
| Mean        | 23.0     | 22.5   | 11.4  | 19.6   | 17.4         | 11.9         | 8.5                  |
: RMS Calibration Error of various models and data augmentation methods across CIFAR-10 and CIFAR-10-C. All values are reported as percentages. :::
::: {#tab:cifar10-p}

| CIFAR-10 Clean Error | Standard | Cutout | Mixup | CutMix | AutoAugment* | Adv Training | [AugMix]{.smallcaps} |
|----------------------|----------|--------|-------|--------|--------------|--------------|----------------------|
| AllConvNet           | 6.1      | 6.1    | 6.3   | 6.4    | 6.6          | 18.9         | 6.5                  |
| DenseNet             | 5.8      | 4.8    | 5.5   | 5.3    | 4.8          | 17.9         | 4.9                  |
| WideResNet           | 5.2      | 4.4    | 4.9   | 4.6    | 4.8          | 17.1         | 4.9                  |
| ResNeXt              | 4.3      | 4.4    | 4.2   | 3.9    | 3.8          | 15.4         | 4.2                  |
| Mean                 | 5.4      | 4.9    | 5.2   | 5.0    | 5.0          | 17.3         | 5.1                  |

| CIFAR-10-P mFP       | Standard | Cutout | Mixup | CutMix | AutoAugment* | Adv Training | [AugMix]{.smallcaps} |
|----------------------|----------|--------|-------|--------|--------------|--------------|----------------------|
| AllConvNet           | 4.2      | 5.0    | 3.9   | 4.5    | 4.0          | 2.0          | 1.5                  |
| DenseNet             | 5.0      | 5.7    | 3.9   | 6.3    | 4.8          | 2.1          | 1.8                  |
| WideResNet           | 4.2      | 4.3    | 3.4   | 4.6    | 4.2          | 2.2          | 1.6                  |
| ResNeXt              | 4.0      | 4.5    | 3.2   | 5.2    | 4.2          | 2.5          | 1.5                  |
| Mean                 | 4.3      | 4.9    | 3.6   | 5.2    | 4.3          | 2.2          | 1.6                  |
: CIFAR-10 Clean Error and CIFAR-10-P mean Flip Probability. All values are percentages. While adversarial training performs well on CIFAR-10-P, it induces a substantial drop in accuracy (increase in error) on clean CIFAR-10 where [AugMix]{.smallcaps} does not. :::
Calibration Metrics {#app:calibration}
Due to the finite size of empirical test sets, the RMS Calibration Error must be estimated by partitioning all $n$ test set examples into $b$ contiguous bins $\{B_1, B_2, \ldots, B_b\}$ ordered by prediction confidence. In this work we use bins which contain $100$ predictions, so that we adaptively partition confidence scores on the interval $[0,1]$ [@oconnor; @hendrycks2019oe]. Other works partition the interval $[0,1]$ with 15 bins of uniform length [@kilian]. With these $b$ bins, we estimate the RMS Calibration Error empirically with the formula $$\begin{aligned} \sqrt{\sum_{i=1}^b \frac{|B_i|}{n} \bigg( \frac{1}{|B_i|}\sum_{k\in B_i}\mathds{1}(y_k = \hat{y}_k) - \frac{1}{|B_i|}\sum_{k\in B_i} c_k \bigg)^2}.\end{aligned}$$ This is separate from classification error because a random classifier with an approximately uniform posterior distribution is approximately calibrated. Also note that adding the "refinement" $\mathbb{E}_C[\mathbb{P}(Y=\hat{Y} \mid C = c)(1 - \mathbb{P}(Y=\hat{Y} \mid C = c))]$ to the square of the RMS Calibration Error gives us the Brier Score [@oconnor].
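A minimal sketch of this estimator with adaptive 100-prediction bins:

```python
import numpy as np

def rms_calibration_error(confidences, correct, bin_size=100):
    """confidences: predicted confidence per example; correct: 0/1 array
    indicating whether each prediction was right."""
    order = np.argsort(confidences)
    confs, accs = confidences[order], correct[order]
    n = len(confs)
    total = 0.0
    for start in range(0, n, bin_size):
        c = confs[start:start + bin_size]
        a = accs[start:start + bin_size]
        # |B_i|/n * (mean accuracy - mean confidence)^2 for this bin
        total += (len(c) / n) * (a.mean() - c.mean()) ** 2
    return np.sqrt(total)
```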
[^1]: Equal Contribution.
[^2]: Corresponding author.