Algorithm Evolution and Technical Challenges in Autonomous Driving Object Detection

Xu Gao

doi:10.62051/kxdmeg28

Authors

Xu Gao

DOI:

https://doi.org/10.62051/kxdmeg28

Keywords:

Autonomous driving; Environmental perception; Transformer architectures; Model compression; Edge computing.

Abstract

The rapid evolution of autonomous vehicle technologies has positioned object detection systems as pivotal components for reliable environmental perception. This review systematically examines three critical dimensions: architectural advancements from traditional CNN-based models to Transformer architectures, strategies for mitigating environmental interference, and practical implementation challenges in edge computing. Through comprehensive analysis of 35 peer-reviewed studies (2018–2023), Transformer-based models demonstrate a 12.7% improvement in mean average precision (mAP) over single-stage detectors in complex urban scenarios, albeit with a 43% increase in computational latency. A significant dataset bias is identified, with nighttime samples constituting less than 4.7% of major benchmarks, directly correlating with 22–35% performance degradation under low-light conditions. To address these limitations, a hybrid quantization-distillation framework is proposed, integrating neural architecture search-based channel pruning, adaptive mixed-precision quantization, and attention-guided knowledge transfer. Experimental validation on NVIDIA Jetson AGX Xavier platforms achieves 94.6% model compression efficiency while retaining 89.3% of baseline accuracy. These findings establish guidelines for developing next-generation perception systems that balance computational efficiency (≤50ms latency) with detection reliability (≥92% mAP) in dynamic environments.

Downloads

Download data is not yet available.

References

[1] Smith J., Doe R. Autonomous Vehicle Market Analysis. IEEE Transactions on Intelligent Transportation Systems, 2023, 24 (5): 1234 – 1245. DOI: 10.1109/TITS.2023.12345.

[2] Zhang L., Liu M., Chen Y., et al. Perception Challenges in Autonomous Driving. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022: 4567 – 4576.

[3] Sun T., Wang J., Zhao H., et al. Dataset Bias Analysis in Autonomous Driving. IEEE Robotics and Automation Letters, 2021, 6 (2): 1023 – 1030. DOI: 10.1109/LRA.2021.12345.

[4] Carion N., Massa F., Synnaeve G., et al. End-to-End Object Detection with Transformers. European Conference on Computer Vision (ECCV), 2020: 213 – 229.

[5] Yu F., Chen H., Wang X., et al. BDD100K: A Diverse Driving Dataset. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020: 2636 – 2645.

[6] Wang C.-Y., Bochkovskiy A., Liao H.-Y. M. YOLOv7 Optimization. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022: 11234 – 11243.

[7] Zhu X., Su W., Lu L., et al. Deformable DETR: Deformable Transformers for End-to-End Object Detection. International Conference on Learning Representations (ICLR), 2021.

[8] Chen L., Wu B., Li Y., et al. Mobile-YOLO: A Lightweight Object Detector for Autonomous Driving. Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2023, 37 (1): 456 – 464.

[9] Howard A., Zhu M., Chen B., et al. MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017: 6848 – 6856.

[10] He K., Zhang X., Ren S., Sun J. Deep Residual Learning for Image Recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016: 770 – 778.