Research on Image Recognition Problems Based on Deep Learning Models

Authors

  • Ao Liu

DOI:

https://doi.org/10.62051/r8dtd190

Keywords:

Alexnet, Flower, Image Recognition, Image Augmentation.

Abstract

Deep learning has achieved impressive results in computer vision but typically requires extensive datasets, which are often costly and difficult to obtain, particularly in specialized domains like medical imaging and autonomous driving. To address this challenge, various image augmentation algorithms have been introduced, aimed at enhancing the size and diversity of training datasets. While model based techniques make use of picture production models, model-free techniques employ conventional image processing techniques. The goal of optimizing policy-based techniques is to enhance model performance by determining the optimal set of augmentation procedures. The seminal AlexNet model, introduced in 2012, marked a significant breakthrough in image classification, demonstrating the power of deep learning. Through advanced techniques such as multi-scale convolution, cross-connections, and global average pooling, these approaches have substantially improved image classification accuracy. Despite ongoing challenges, these advancements underscore the critical role of data augmentation and optimized neural network architectures in enhancing performance when data is limited.

Downloads

Download data is not yet available.

References

[1] Zixian W, Shuping H. Improvement of the AlexNet Networks for Large-Scale Recognition Applications Received: 6 May 2019/Accepted: 26 0ctober 2020/Published online:7 November 2020 Shiraz University 2020.

[2] Anwar S, Hwang K, Sung W. Fixed point optimization of deep convolutional neural networks for object recognition.In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022, 1131-1135.

[3] Ioffe S, Szegedy C. Batch normalization: accelerating deep network training by reducing internal covariate shift. Proceedings of the International Conference on Machine Learning, 2015, 448-456.

[4] Connor S, Taghi M. A survey on lmage data augmentation for deep learning. Journal of Big Data, 2019, 6: 60.

[5] Yang X, Shang ZH. Expression recognition based on improved. AlexNet Laser&Optoelectronics Progress, 2020, 57(14).

[6] Zhou S, Liang YY, Wan J, et al. Facial expression recognition based on multi-scale CNN. Biometric Recognition. 2016, 503-510.

[7] Krizhevsky A, Sutskever I, Hinton G E. ImageNet classification with deep convolutional neural networks. Advances in Neural Information Processing Systems, 2012, 25(2): 1097-1105.

[8] Gao R, Wang R, Feng L, et al. Dual-branch, efficient, channel attention-based crop disease identification. Comput. Electron. Agric. 2021, 190, 106410.

[9] Zhou S, Liang YY, Wan J, et al. Facial expression recognition based on multi-scale CNNS. Biometric Recognition. Cham: Springer International Publishing, 2016, 503-510.

[10] Szegedy C, Liu W, Jia Y Q, et al. Going deeper with convolutions. 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015, 1-9.

Downloads

Published

25-11-2024

How to Cite

Liu, A. (2024) “Research on Image Recognition Problems Based on Deep Learning Models”, Transactions on Computer Science and Intelligent Systems Research, 7, pp. 224–228. doi:10.62051/r8dtd190.