Overview of Object Detection Methods
DOI:
https://doi.org/10.62051/ijcsit.v8n1.11Keywords:
Object detection, Deep learning, Convolutional neural networks, YOLO, Faster R-CNN, Evaluation Criteria1Abstract
Object detection is an important task in computer vision, aimed at detecting and recognizing the position and category of target objects from images or videos. With the rise of deep learning, the accuracy and efficiency of object detection have significantly improved, especially the application of convolutional neural networks (CNN) in this field, which has made significant breakthroughs in object detection methods. This article provides an overview of the development history of object detection, with a focus on classic object detection algorithms, deep learning methods, and their evolution. It explores the evaluation criteria and challenges faced by object detection, and looks forward to future development trends.
Downloads
References
[1] Luis R. Mercado Diaz, Derek Aguiar & Hugo F. Posada Quintero. (2025). Graph-based multi-modal MRI analysis with probabilistic attention for stroke lesion detection. Neurocomputing, 657, 131620-131620. https://doi.org/10.1016/J.NEUCOM.2025.131620
[2] Mingjian Yuan, Gongcheng Peng, Ruru Xiao, Bing Zeng, Wenxuan Xu, Guangyao Li... & Hao Jiang. (2025). Prediction model for 1 mm thickness A1060 aluminum and T2 copper magnetic pulse welding joints based on CNN-LSTM-AM deep learning algorithm and DeepSHAP interpretability analysis. Welding in the World, (prepublish), 1-19. https://doi.org/10.1007/S40194-025-02195-Z
[3] Hamed Sabahno & Davood Khodadad. (2025). A convolutional neural network-based joint detection and localization spatiotemporal scheme for process control through speckle pattern imaging. Computers & Industrial Engineering, 210, 111538-111538. https://doi.org/10.1016/J.CIE.2025.111538
[4] N. Regnier, V. Mungkung & L. Mezeix. (2025). A CNN-based statistical method for land cover classification to assess urban vulnerability to explosions: Case study of Paris, France. International Journal of Applied Earth Observation and Geoinformation, 144, 104878-104878. https://doi.org/10.1016/J.JAG.2025.104878
[5] Yichi Zhang, Yuxin Ma, Hui Fang & Hongqing Wang. (2025). Investigation on forecast of offshore wind power generation hybrid attention mechanism and bi-directional long short-term memory based on deep learning. Ocean and Coastal Management, 270, 107884-107884. https://doi.org/10.1016/J.OCECOAMAN.2025.107884
[6] Fujiao Ju, Shuhan Zhao & Shaotao Zhu. (2025). Enhanced pneumonia lesion segmentation using a hybrid CNN-BiFormer network with residual haar wavelet downsampling and shared attention. Multimedia Systems, 31 (6), 409-409. https://doi.org/10.1007/S00530-025-01990-4
[7] Jinshuai Xu & Juan Zhang. (2025). Infrared and visible image fusion based on Haar wavelet downsampling and multi-scale feature aggregation. Signal, Image and Video Processing, 19 (14), 1200-1200. https://doi.org/10.1007/S11760-025-04746-9
[8] Haixiang Yao & Chunzhuo Wan. (2025). Multi-factor portfolio optimization: A combined random Forest–AdaBoost model with cost-sensitive learning. Pacific-Basin Finance Journal, 94, 102946-102946. https://doi.org/10.1016/J.PACFIN.2025.102946
[9] Jidong Li, Jian Cui & Qian Su. (2025). Boosting fuzzy classification rules under footprint of uncertainty of type-2 fuzzy sets. Applied Soft Computing, 185 (PA), 113876-113876. https://doi.org/10.1016/J.ASOC.2025.113876
[10] Kumaresh Pal, Kumari Namrata, Ashok Kumar Akella, Akshit Samadhiya, Ahmad Taher Azar, Mohamed Tounsi... & Walid El Shafai. (2025). Intelligent islanding detection framework for smart grids using wavelet scalograms and HOG feature fusion. Scientific Reports, 15(1), 30351-30351. https://doi.org/10.1038/S41598-025-08391-7
[11] Peiran Zhang, Fuqiang Zhou, Xinghan Wang, Wentao Guo & Zhipeng Song. (2025). Monocular vision measurement for kinematic parameters of separated load in drop tower test. Optics and Lasers in Engineering, 193, 109074-109074. https://doi.org/10.1016/J.OPTLASENG.2025.109074
[12] H.P.D. Shiran Madhuranga, Wong Jee Keen Raymond, Hazlee Azil Illias & Nurulafiqah Nadzirah Binti Mansor. (2025). Robust open-set partial discharge diagnosis based on hybrid supervised contrastive learning and SVM framework. Ain Shams Engineering Journal, 16(12), 103762-103762. https://doi.org/10.1016/J.ASEJ.2025.103762
[13] Zhenman Gao, Jianyong Zhuang & Xiaoyong He. (2025). FALCON-Net: A hybrid fuzzy-attentive LSTM-convolutional neural network architecture for high-precision classification of steel alloys via femtosecond laser-ablation spark-induced breakdown spectroscopy. Analytica Chimica Acta, 1378, 344701-344701. https://doi.org/10.1016/J.ACA.2025.344701
[14] Tao Shen, Bo Li, Facai Ren, Enxiang Fan & Jianrui Zhang. (2025). A monitoring framework for predicting laser directed energy deposition property. International Journal of Mechanical Sciences, 307, 110861-110861. https://doi.org/10.1016/J.IJMECSCI.2025.110861
[15] Sirui Guo, Jinchang Li, Wei Jiang, Jun Yang, Yingying Du, Tao Luo... & Limei Qi. (2025). Chirality recognition of amino acid by combining machine learning method and sliding window technique. Optics and Laser Technology, 192 (PE), 113937-113937. https://doi.org/10.1016/J.OPTLASTEC.2025.113937
[16] Mbietieu Amos Mbietieu, Tapamo Kenfack Hippolyte Michel, Kouamou Georges Edouard & Norbert Tsopze. (2025). ShipFPN: New Feature Pyramid Network architecture for object detection in high-resolution satellite images: Application to ship detection. Science of Remote Sensing, 12, 100270-100270. https://doi.org/10.1016/J.SRS.2025.100270
[17] Hritu Raj & Gargi Srivastava. (2026). A novel data augmentation strategy for gas leak detection and segmentation using Mask R-CNN and bit plane slicing in chemical process environments. Computers and Chemical Engineering, 204, 109407-109407. https://doi.org/10.1016/J.COMPCHEMENG.2025.109407
[18] Seunghyeon Wang. (2025). Effectiveness of traditional augmentation methods for rebar counting using UAV imagery with Faster R-CNN and YOLOv10-based transformer architectures. Scientific Reports, 15 (1), 33702-33702. https://doi.org/10.1038/S41598-025-18964-1
[19] Seunghyeon Wang. (2025). Effectiveness of traditional augmentation methods for rebar counting using UAV imagery with Faster R-CNN and YOLOv10-based transformer architectures. Scientific Reports, 15(1), 33702-33702. https://doi.org/10.1038/S41598-025-18964-1
[20] Mehmet Özgür Özemre, Jale Bektaş, Hüseyin Yanik, Lütfiye Baysal & Hazal Karslioğlu. (2025). Enhanced diagnostic pipeline for maxillary sinus-maxillary molars relationships: a novel implementation of Detectron2 with faster R-CNN R50 FPN 3x on CBCT images. BMC Oral Health, 25 (1), 1473-1473. https://doi.org/10.1186/S12903-025-06337-Z
[21] Seunghyeon Wang. (2025). Effectiveness of traditional augmentation methods for rebar counting using UAV imagery with Faster R-CNN and YOLOv10-based transformer architectures. Scientific Reports, 15(1), 33702-33702. https://doi.org/10.1038/S41598-025-18964-1
[22] Mehmet Özgür Özemre, Jale Bektaş, Hüseyin Yanik, Lütfiye Baysal & Hazal Karslioğlu. (2025). Enhanced diagnostic pipeline for maxillary sinus-maxillary molars relationships: a novel implementation of Detectron2 with faster R-CNN R50 FPN 3x on CBCT images. BMC Oral Health, 25(1), 1473-1473. https://doi.org/10.1186/S12903-025-06337-Z
[23] Meiyun Chen, Jiacheng Tian, Xiuhua Cao, Zhenxiao Fu & Dawei Zhang. (2025). DM-YOLO for MLCCs’ automatic defect detection. Optics and Laser Technology, 192 (PE), 113977-113977. https://doi.org/10.1016/J.OPTLASTEC.2025.113977
[24] Yingying Zhou, Chao Liu, Hao Tian, Xin Zhang & Nan Li. (2025). CSST Slitless Spectra: Target Detection and Classification with Yolo. The Astronomical Journal, 170(5), 256-256. https://doi.org/10.3847/1538-3881/ADEE1C
[25] Fan Zhang, Chi Fai Cheung, Yanbin Zhang & Chunjin Wang. (2025). Modelling and experimental analysis of subsurface damage in low-temperature nano-lubrication grinding. International Journal of Mechanical Sciences, 306, 110729-110729. https://doi.org/10.1016/J.IJMECSCI.2025.110729
[26] Priyanka P. Chavan, Rahul S. Redekar, Umesh D. Babar, Ashok D. Chougale, N.L. Tarwal & Pradip D. Kamble. (2025). Reaction time-driven structural and electrochemical analysis of MnFe2O4 for solid-state supercapacitor application. Journal of Power Sources, 659, 238459-238459. https://doi.org/10.1016/J.JPOWSOUR.2025.238459
[27] Shwetha V., Barnini Banerjee, Vijaya Laxmi & Priya Kamath. (2025). Optimizing TB Bacteria Detection Efficiency: Utilizing RetinaNet‐Based Preprocessing Techniques for Small Image Patch Classification. International Journal of Biomedical Imaging, 2025(1), 3559598-3559598. https://doi.org/10.1155/IJBI/3559598
[28] S. Priya & K. Amshakala. (2025). An adaptive fall detection system based on ensemble learning using variants of YOLO V8 retinanet and DETR. Scientific Reports, 15(1), 33161-33161. https://doi.org/10.1038/S41598-025-97634-8
Downloads
Published
Issue
Section
License
Copyright (c) 2026 International Journal of Computer Science and Information Technology

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.







