Exploring Object Detection: Datasets, Metrics and Algorithms

Authors

  • Yifei Zhao

DOI:

https://doi.org/10.62051/zzeqve39

Keywords:

Object Detection; Datasets; CNN; Deep Learning; Computer Vision.

Abstract

Object detection, as one of the basic issues in the computer vision (CV) field, has huge application value in realm such as autonomous driving and face recognition. Since the object differences in images in different scenes, angles, and lighting environments are too great, the number and size of objects are also different, those greatly increase the difficulty of detection. Therefore, how to detect objects quickly and accurately has attracted widespread attention from researchers. This article provides a review from the perspectives of data and algorithms in accordance with the order of technological development. It mainly introduces commonly used data sets, evaluation indicators, traditional object detection algorithms, and object detection algorithms based on deep learning. Finally, this paper will discuss future research directions, explore what difficulties still exist in the field of target detection, and how researchers should further improve the performance and generalization capabilities of target detection to cope with more complex and changeable scenarios and needs.

Downloads

Download data is not yet available.

References

Padilla R, Netto S L, Da Silva E A B. A survey on performance metrics for object-detection algorithms, 2020 international conference on systems, signals and image processing (IWSSIP). IEEE, 2020: 237-242.

Mark Everingham, Luc Van Gool, Christopher K. I. Williams, John Winn, Andrew Zisserman. The PASCAL Visual Object Classes (VOC) Challenge. International Journal of Computer Vision, 2010, 88: 303–338.

Gui-Song Xia, Xiang Bai, Jian Ding, Zhen Zhu, Serge Belongie, Jiebo Luo, Mihai Datcu, Marcello Pelillo, Liangpei Zhang. DOTA: ALarge-scale Dataset for Object Detection in Aerial Images. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018: 3974-3983.

ZhengxiaZou, KeyanChen, ZhenweiShi, YuhongGuo, and JiepingYe. Object Detection in 20 Years: A Survey. Proceedings of the IEEE, 2023, 111(3): 257-276.

Wang X, Han T X, Yan S. An HOG-LBP human detector with partial occlusion handling, 2009 IEEE 12th international conference on computer vision. 2009: 32-39.

Girshick R, Donahue J, Darrell T, et al. Rich feature hierarchies for accurate object detection and semantic segmentation, Proceedings of the IEEE conference on computer vision and pattern recognition. 2014: 580-587.

Dalal N, Triggs B. Histograms of oriented gradients for human detection. 2005 IEEE computer society conference on computer vision and pattern recognition (CVPR'05). 2005, 1: 886-893.

Ross Girshick. Fast R-CNN. Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2015: 1440-1448.

S. Ren, K. He, R. Girshick, and J. Sun, Faster r-cnn: Towards real-time object detection with region proposal networks. In Advances in neural information processing systems, 2015: 91–99.

Kaiming He, Georgia Gkioxari, Piotr Doll´ar, Ross Girshick. Mask R-CNN. Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2017: 2961-2969.

Yi-Qing Wang. An Analysis of the Viola-Jones Face Detection Algorithm. Image Processing On Line, 2014, 4: 128-148.

Downloads

Published

12-08-2024

How to Cite

Zhao , Y. (2024) “Exploring Object Detection: Datasets, Metrics and Algorithms”, Transactions on Computer Science and Intelligent Systems Research, 5, pp. 556–561. doi:10.62051/zzeqve39.