Object Detection Based on Deep Learning
DOI:
https://doi.org/10.62051/bcefz924Keywords:
Object detection; deep learning; small object detection; YOLO; Fast R-CNN.Abstract
Object detection technology is a crucial research direction in the field of computer vision. Its primary task is to identify and locate objects within images. This technology finds extensive applications across various domains such as autonomous driving, medical diagnostics, and security monitoring. With the development of machine learning, especially the progress of deep learning in related fields of image processing, the accuracy rate of object detection has achieved better results. However, different algorithms have different characteristics, which increases the difficulty of algorithm selection in actual use. This paper starts with the development of object detection, and summarizes three aspects: data set, metrics and object detection algorithm. This paper mainly introduces the commonly used data sets of object detection (such as COCO, VOC), traditional object detection algorithms and deep learning-based object detection algorithms. In addition, the current challenges encountered in object detection, especially in the detection of small objects, and future research directions are discussed.
Downloads
References
Cao Y, Lu X, Zhu Y, et al. Context-based fine hierarchical object detection with deep reinforcement learning. 2020 7th International Conference on Information Science and Control Engineering (ICISCE), 2020. DOI: https://doi.org/10.1109/ICISCE50968.2020.00091
Pang Y, Cao J. Deep learning in object detection. Deep Learning in Object Detection and Recognition, 2019, 26(7): 19-57. DOI: https://doi.org/10.1007/978-981-10-5152-4_2
Song D, Liu B, Li Y. Based on end-to-end object detection algorithm with Transformers for detecting wafer maps. 2022 International Conference on Computer Network, Electronic and Automation (ICCNEA), 2022. DOI: https://doi.org/10.1109/ICCNEA57056.2022.00072
Felzenszwalb P, McAllester D, Ramanan D. A discriminatively trained, multiscale, deformable part model. 2008 IEEE Conference on Computer Vision and Pattern Recognition, 2008. DOI: https://doi.org/10.1109/CVPR.2008.4587597
Dalal N, Triggs B. Histograms of oriented gradients for human detection. 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2005.
Russakovsky O, Deng J, Su H, et al. Imagenet large scale visual recognition challenge. International Journal of Computer Vision, 2015, 115(3): 211-252. DOI: https://doi.org/10.1007/s11263-015-0816-y
Han G, Zhang X, Li C. Revisiting faster R-CNN: A deeper look at region proposal network. Neural Information Processing, 2017: 14-24. DOI: https://doi.org/10.1007/978-3-319-70090-8_2
Redmon J, Divvala S, Girshick R, et al. You only look once: Unified, real-time object detection. 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016. DOI: https://doi.org/10.1109/CVPR.2016.91
Williams K, Winn J, Zisserman A, et al. The pascal visual object classes challenge: A retrospective. International Journal of Computer Vision, 2014, 111(1): 98-136. DOI: https://doi.org/10.1007/s11263-014-0733-5
Lin T Y, Maire M, Belongie S, et al. Microsoft coco: Common objects in context. Computer Vision – ECCV 2014, 2014: 740-755. DOI: https://doi.org/10.1007/978-3-319-10602-1_48
Zhao Z Q, Zheng P, Xu S T, et al. Object detection with deep learning: A review. arXiv e-prints, 2018, 30(11): 3212-3232. DOI: https://doi.org/10.1109/TNNLS.2018.2876865
Lu S Y, Wang B Z, Wang H J, et al. A real-time object detection algorithm for video.Computers & Electrical Engineering, 2019, 77(1): 398-408. DOI: https://doi.org/10.1016/j.compeleceng.2019.05.009
Sharma V, Mir R N. A comprehensive and systematic look up into deep learning based object detection techniques: A review. Computer Science Review, 2020, 38(11): 1-29. DOI: https://doi.org/10.1016/j.cosrev.2020.100301
Aziz L, Salam M S B H, Sheikh U, et al. Exploring deep learning-based architecture, strategies, applications and current trends in generic object detection: A comprehensive review. IEEE Access, 2020, 8(1): 170461-170495. DOI: https://doi.org/10.1109/ACCESS.2020.3021508
Shin H C, Lee K I, Lee C E. Data augmentation method of object detection for deep learning in maritime image. 2020 IEEE International Conference on Big Data and Smart Computing (BigComp). IEEE, 2020. DOI: https://doi.org/10.1109/BigComp48618.2020.00-25
Downloads
Published
Conference Proceedings Volume
Section
License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.







