A SiamFC Target-Tracking Algorithm Based on an Improved Spatiotemporal Attention Mechanism
DOI:
https://doi.org/10.62051/ijcsit.v2n2.14
Keywords:
Target Tracking; Siamese Network; Space-Time Attention
Abstract
Moving-target tracking algorithms have developed rapidly. However, the SiamFC (Fully-Convolutional Siamese Networks) algorithm relies mainly on the first frame of the video as its template and lacks an effective update mechanism. Building on SiamFC, this paper introduces an improved spatiotemporal attention mechanism into the backbone network so that the model pays more attention to key historical frames and target regions in the video sequence. In addition, the pixels of the response map are classified as foreground or background by a pixel-by-pixel classification and regression method. A centrality branch is combined with this step to limit the generation of low-quality prediction boxes, which increases the accuracy of target prediction and reduces its complexity. The resulting algorithm improves tracking accuracy and success rate and tracks targets effectively in complex scenarios while remaining stable.
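The abstract does not give a formula for the centrality branch. As a rough illustration only, the sketch below assumes it behaves like the center-ness score used in anchor-free detectors such as FCOS: a location near the center of the target box scores close to 1, a location near the edge scores close to 0, and the classification score is multiplied by this value so that off-center predictions are down-weighted. The function names `centerness` and `rescore` and the box layout `(x0, y0, x1, y1)` are hypothetical and are not taken from the paper.

```python
import math


def centerness(x, y, box):
    """FCOS-style center-ness of point (x, y) inside box (x0, y0, x1, y1).

    Assumption: the paper's "centrality branch" regresses a score of this
    kind. Points on or outside the box boundary get 0.
    """
    x0, y0, x1, y1 = box
    left, right = x - x0, x1 - x
    top, bottom = y - y0, y1 - y
    if min(left, right, top, bottom) <= 0:
        return 0.0
    ratio_h = min(left, right) / max(left, right)
    ratio_v = min(top, bottom) / max(top, bottom)
    return math.sqrt(ratio_h * ratio_v)


def rescore(cls_score, x, y, box):
    """Down-weight a foreground score by how far the pixel is from the box center."""
    return cls_score * centerness(x, y, box)


if __name__ == "__main__":
    target_box = (0.0, 0.0, 10.0, 10.0)
    print(rescore(0.9, 5.0, 5.0, target_box))  # pixel at the box center
    print(rescore(0.9, 1.0, 5.0, target_box))  # pixel near the left edge
```

Under this assumption, two pixels with the same foreground score are ranked differently once their position relative to the target is taken into account, which is one way a centrality branch could limit low-quality prediction boxes.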
License
Copyright (c) 2024 Xu Zhang, Jun Lu, Lin Shi, Yuan Cao

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.







