Hyperspectral Image Classification Based on the Improved Spectral Former

Tianyu Geng; Hanwen Fan

doi:10.62051/ijcsit.v5n2.04

Keywords:

Hyperspectral Image, Transformer, Atrous Spatial Pyramid Pooling, Deep Learning

Abstract

Hyperspectral imaging has become a powerful tool in remote sensing: it enables fine-grained material identification and reveals the chemical and physical properties of materials. Its applications span urban land-use mapping, object recognition, crop classification, and agricultural yield prediction. A typical hyperspectral image classification workflow comprises image loading, correction, noise reduction, feature extraction, classifier selection, training, classification, and result output. Feature extraction plays a critical role, but traditional methods such as SIFT, PCA, and LDA are limited in efficiency and accuracy, especially on large-scale datasets. Deep learning, particularly Convolutional Neural Networks (CNNs), has significantly improved classification performance by extracting hierarchical features from raw data. However, capturing spectral and spatial information effectively at the same time remains a challenge. Transformer models such as SpectralFormer have been proposed to address this by using attention mechanisms to capture long-range dependencies, yet they struggle to preserve the spatial structure of hyperspectral images. Integrating Atrous Spatial Pyramid Pooling (ASPP) into SpectralFormer offers a promising solution: it enhances spatial feature extraction and improves overall classification performance. This paper discusses these advancements and highlights the potential of combining deep learning with spatial feature extraction techniques to address the unique challenges of hyperspectral image classification.
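
To make the ASPP idea in the abstract concrete, the following is a minimal, dependency-free sketch of parallel atrous (dilated) convolutions applied to one spatial patch. It is not the authors' implementation: the abstract does not specify the architecture, so the function names, the dilation rates (1, 2, 3), the all-ones kernel, and the 7x7 toy patch are all assumptions chosen for illustration. A trained model would use learned kernels and fuse the branches with further learned layers.

```python
from typing import List, Sequence

Grid = List[List[float]]


def atrous_conv2d(image: Grid, kernel: Grid, rate: int) -> Grid:
    """Same-size 2D atrous convolution with zero padding.

    Kernel taps are spaced `rate` pixels apart, so a 3x3 kernel covers a
    (2*rate + 1) x (2*rate + 1) window without adding parameters.
    As in most deep learning code, this is technically cross-correlation.
    """
    height, width = len(image), len(image[0])
    k = len(kernel)  # assumes a square, odd-sized kernel
    half = k // 2
    out = [[0.0] * width for _ in range(height)]
    for r in range(height):
        for c in range(width):
            acc = 0.0
            for i in range(k):
                for j in range(k):
                    rr = r + (i - half) * rate
                    cc = c + (j - half) * rate
                    # Taps that fall outside the patch read as zero.
                    if 0 <= rr < height and 0 <= cc < width:
                        acc += kernel[i][j] * image[rr][cc]
            out[r][c] = acc
    return out


def aspp(image: Grid, kernel: Grid, rates: Sequence[int] = (1, 2, 3)) -> List[Grid]:
    """Run one atrous branch per dilation rate and stack the results.

    The stacked branches act as multi-scale feature channels; fusing them
    (for example with a 1x1 convolution) is left out of this sketch.
    """
    return [atrous_conv2d(image, kernel, rate) for rate in rates]


# Hypothetical toy data: a 7x7 patch with a single bright center pixel.
patch = [[0.0] * 7 for _ in range(7)]
patch[3][3] = 1.0
box = [[1.0] * 3 for _ in range(3)]  # illustrative 3x3 all-ones kernel

branches = aspp(patch, box, rates=(1, 2, 3))
```

With this toy input, each branch responds to the center pixel at a different distance: the rate-1 branch only at immediate neighbors, the rate-3 branch as far as the patch corners. This is the sense in which larger dilation rates widen the spatial context seen by a fixed-size kernel.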
