Classification and Comparison of Data Augmentation Techniques
DOI:
https://doi.org/10.62051/7e91md96Keywords:
Data augmentation; Image classification; Object detection; Speech recognition.Abstract
Data augmentation is a technical method that generates additional data samples by applying different transformations and processes to the original training data. It aims to increase the size and diversity of the dataset, thus improving the models' generalization ability. This paper provides a detailed introduction to geometric transformations, data mixing, automated data augmentation methods, and generative adversarial methods. Geometric transformations manipulate the spatial configuration of data samples to create new variants that retain the original content. Data mixing techniques blend multiple data samples to produce novel training examples, promoting robustness and reducing overfitting. Automated data augmentation methods leverage search algorithms to discover optimal augmentation policies that improve model performance without manual intervention. Generative adversarial methods employ generative adversarial networks to synthesize realistic data samples that enrich the training dataset. These methods find broad application in domains including object detection, image classification, natural language processing, and speech recognition, significantly improving model performance. The paper explores the advantages and limitations of data augmentation techniques in practical applications and identifies future research directions and challenges. By systematically classifying and selecting appropriate data augmentation methods for specific application scenarios, this review provides a theoretical foundation and practical guidance for real-world applications.
Downloads
References
[1] Mumuni A, Mumuni F. Data augmentation: A comprehensive survey of modern approaches. Array, 2022, 16: 100258.
[2] Shorten C, Khoshgoftaar T M. A survey on image data augmentation for deep learning. Journal of big data, 2019, 6(1): 1-48.
[3] Chen P, Liu S, Zhao H, et al. Gridmask data augmentation. arXiv preprint arXiv:2001.04086, 2020.
[4] Perez L, Wang J. The effectiveness of data augmentation in image classification using deep learning. arXiv preprint arXiv:1712.04621, 2017.
[5] Yun S, Han D, Oh S J, et al. Cutmix: Regularization strategy to train strong classifiers with localizable features. Proceedings of the IEEE/CVF international conference on computer vision. 2019: 6023-6032.
[6] Goodfellow I J, Shlens J, Szegedy C. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572, 2014.
[7] DeVries T, Taylor G W. Improved regularization of convolutional neural networks with cutout. arXiv preprint arXiv:1708.04552, 2017.
[8] Zhang H, Cisse M, Dauphin Y N, et al. mixup: Beyond empirical risk minimization. arXiv preprint arXiv:1710.09412, 2017.
[9] Yun S, Han D, Oh S J, et al. Cutmix: Regularization strategy to train strong classifiers with localizable features. Proceedings of the IEEE/CVF international conference on computer vision. 2019: 6023-6032.
[10] Cubuk E D, Zoph B, Shlens J, et al. Randaugment: Practical automated data augmentation with a reduced search space. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition workshops. 2020: 702-703.
[11] Fawzi A, Samulowitz H, Turaga D, et al. Adaptive data augmentation for image classification. 2016 IEEE international conference on image processing (ICIP). Ieee, 2016: 3688-3692.
[12] Ho D, Liang E, Chen X, et al. Population based augmentation: Efficient learning of augmentation policy schedules. International conference on machine learning. PMLR, 2019: 2731-2741.
[13] Yang Z, Sinnott R O, Bailey J, et al. A survey of automated data augmentation algorithms for deep learning-based image classification tasks. Knowledge and Information Systems, 2023, 65(7): 2805-2861.
[14] Su Z, Yao K, Yang X, et al. Rethinking data augmentation for single-source domain generalization in medical image segmentation. Proceedings of the AAAI Conference on Artificial Intelligence. 2023, 37(2): 2366-2374.
Downloads
Published
Conference Proceedings Volume
Section
License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.







