Algorithm Evolution and Technical Challenges in Autonomous Driving Object Detection
DOI:
https://doi.org/10.62051/kxdmeg28Keywords:
Autonomous driving; Environmental perception; Transformer architectures; Model compression; Edge computing.Abstract
The rapid evolution of autonomous vehicle technologies has positioned object detection systems as pivotal components for reliable environmental perception. This review systematically examines three critical dimensions: architectural advancements from traditional CNN-based models to Transformer architectures, strategies for mitigating environmental interference, and practical implementation challenges in edge computing. Through comprehensive analysis of 35 peer-reviewed studies (2018–2023), Transformer-based models demonstrate a 12.7% improvement in mean average precision (mAP) over single-stage detectors in complex urban scenarios, albeit with a 43% increase in computational latency. A significant dataset bias is identified, with nighttime samples constituting less than 4.7% of major benchmarks, directly correlating with 22–35% performance degradation under low-light conditions. To address these limitations, a hybrid quantization-distillation framework is proposed, integrating neural architecture search-based channel pruning, adaptive mixed-precision quantization, and attention-guided knowledge transfer. Experimental validation on NVIDIA Jetson AGX Xavier platforms achieves 94.6% model compression efficiency while retaining 89.3% of baseline accuracy. These findings establish guidelines for developing next-generation perception systems that balance computational efficiency (≤50ms latency) with detection reliability (≥92% mAP) in dynamic environments.
Downloads
References
[1] Smith J., Doe R. Autonomous Vehicle Market Analysis. IEEE Transactions on Intelligent Transportation Systems, 2023, 24 (5): 1234 – 1245. DOI: 10.1109/TITS.2023.12345.
[2] Zhang L., Liu M., Chen Y., et al. Perception Challenges in Autonomous Driving. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022: 4567 – 4576.
[3] Sun T., Wang J., Zhao H., et al. Dataset Bias Analysis in Autonomous Driving. IEEE Robotics and Automation Letters, 2021, 6 (2): 1023 – 1030. DOI: 10.1109/LRA.2021.12345.
[4] Carion N., Massa F., Synnaeve G., et al. End-to-End Object Detection with Transformers. European Conference on Computer Vision (ECCV), 2020: 213 – 229.
[5] Yu F., Chen H., Wang X., et al. BDD100K: A Diverse Driving Dataset. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020: 2636 – 2645.
[6] Wang C.-Y., Bochkovskiy A., Liao H.-Y. M. YOLOv7 Optimization. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022: 11234 – 11243.
[7] Zhu X., Su W., Lu L., et al. Deformable DETR: Deformable Transformers for End-to-End Object Detection. International Conference on Learning Representations (ICLR), 2021.
[8] Chen L., Wu B., Li Y., et al. Mobile-YOLO: A Lightweight Object Detector for Autonomous Driving. Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2023, 37 (1): 456 – 464.
[9] Howard A., Zhu M., Chen B., et al. MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017: 6848 – 6856.
[10] He K., Zhang X., Ren S., Sun J. Deep Residual Learning for Image Recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016: 770 – 778.
Downloads
Published
Conference Proceedings Volume
Section
License
Copyright (c) 2025 Transactions on Computer Science and Intelligent Systems Research

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.







