A Siamfc Target-Tracking Algorithm Based on an Improved Spatiotemporal Attention Mechanism

Authors

  • Xu Zhang
  • Jun Lu
  • Lin Shi
  • Yuan Cao

DOI:

https://doi.org/10.62051/ijcsit.v2n2.14

Keywords:

Target Tracking; Siamese Network; Space-Time Attention

Abstract

The motion target tracking algorithm has developed rapidly. In this paper, the SiamFC (Fully Convolutional Siamese Networks) algorithm mainly relies on the first frame of the video as a template, and lacks an effective update mechanism. Based on the SiamFC algorithm, This paperĀ  introduced an improved spatiotemporal attention mechanism, and the model pays more attention to key historical frames and target regions in the video sequence by introducing an improved spatiotemporal attention mechanism in the backbone network. Moreover, the pixels in the response map are divided between background and foreground by a pixel-by-pixel classification regression method. By combining the centrality branch to limit the generation of lower quality prediction box, increase the accuracy of target prediction and reduce the complexity of prediction, the algorithm improves the accuracy and success rate, effectively realizing the target tracking in complex scenarios, while maintaining the accuracy and stability of tracking.

Downloads

Download data is not yet available.

References

Zhao Z Q, Zheng P, Xu S, et al. Object detection with deep learning: A review[J]. IEEE transactions on neural networks and learning systems, 2019, 30(11): 3212-3232.

Yilmaz A, Javed O, Shah M. Object tracking: A survey[J]. Acm computing surveys (CSUR), 2006, 38(4): 13-es.

Han S, Huang P, Wang H, et al. Mat: Motion-aware multi-object tracking[J]. Neurocomputing, 2022, 476: 75-86.

Petsiuk A, Pearce J M. Towards smart monitored AM: Open source in-situ layer-wise 3D printing image anomaly detection using histograms of oriented gradients and a physics-based rendering engine[J]. Additive Manufacturing, 2022, 52: 102690.

Alvarado-Robles G, Osornio-Rios R A, Solis-Munoz F J, et al. An approach for shadow detection in aerial images based on multi-channel statistics[J]. IEEE Access, 2021, 9: 34240-34250.

Ali R, Chuah J H, Talip M S A, et al. Structural crack detection using deep convolutional neural networks[J]. Automation in Construction, 2022, 133: 103989.

Vijayan T, Sangeetha M, Kumaravel A, et al. Feature selection for simple color histogram filter based on retinal fundus images for diabetic retinopathy recognition[J]. IETE Journal of Research, 2023, 69(2): 987-994.

Cooper L A. Spatial information processing: Strategies for research[M]//Aptitude, learning, and instruction. Routledge, 2021: 149-176.

Szeliski R. Computer vision: algorithms and applications[M]. Springer Nature, 2022.

Niu Z, Zhong G, Yu H. A review on the attention mechanism of deep learning[J]. Neurocomputing, 2021, 452: 48-62.

Wang X, Girshick R, Gupta A, et al. Non-local neural networks[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2018: 7794-7803.

Downloads

Published

26-04-2024

Issue

Section

Articles

How to Cite

Zhang, X., Lu, J., Shi, L., & Cao, Y. (2024). A Siamfc Target-Tracking Algorithm Based on an Improved Spatiotemporal Attention Mechanism. International Journal of Computer Science and Information Technology, 2(2), 129-138. https://doi.org/10.62051/ijcsit.v2n2.14