Unsupervised Image Classifier based on Manifold Learning
DOI:
https://doi.org/10.62051/31s5nw90Keywords:
Manifold Learning; Agglomerative Clustering; Neural Networks; Unsupervised Learning; Image Classification.Abstract
Currently most of image classification tasks are achieved by supervised learning. High-quality datasets naturally bring difficulties in annotation, and the datasets in real-world applications present a nonlinear structure, and the annotation cost grows exponentially with the number of targets and the difficulty of recognisability. In this context, research about unsupervised image classification is the way to go. Traditional unsupervised learning for classification is mostly based on the Euclidean distance and various paradigms, which is unable to extract the nonlinear structure of the dataset. This shortcoming makes the accuracy of traditional unsupervised image classification drop drastically. In this paper, we propose to first extract the nonlinear structure of the original dataset using the manifold learning method, and then produce pseudo-labels through the agglomerative clustering algorithm. The pseudo-labels obtained in this way can effectively retain the special mathematical structure of the original data with high accuracy. The neural network is trained with these pseudo labels to obtain an unsupervised usable image classifier. The classifier can be trained on small-scale data and then applied to large-scale data sets, thus saving the cost of manual labelling. The experiments are carried out by setting up a control group and two manifold learning groups for the extraction of non-linear structures using LLE and Isomap algorithms respectively. After that, the production of pseudo-labels and the training of neural networks are completed, and the accuracy of the three groups is compared. Finally, it is concluded that the correct rate of the two groups that have gone through the manifold learning algorithm to extract the nonlinear structure is much higher than that of the other one, and the image classifier based on the Isomap algorithm achieves an accuracy of 85% in the test set, which is highly practical.
Downloads
References
Yin Ruigang, Wei Shuai, Li Han et al. A review of unsupervised learning methods in deep learning. Computer System Applications,2016,25(08):1-7. DOI:10.15888/j.cnki.csa.005283.
Zhou Y, Xia H, Liu HY et al. DPC clustering algorithm based on K mutual nearest neighbours and kernel density estimation[J/OL]. Journal of Beijing University of Aeronautics and Astronautics:1-16[2023-10-21]. https: //doi.org/ 10. 13700/j.bh.1001-5965.2023.0342.
MAC Q J. Some methods for classification and analysis of multivariate observations. Berkeley: Berkeley Symposium on Mathematical Statistics and Probability, 1967.
Chuang, Chuan-Chi. Research on multivariate discriminative clustering algorithm. Nanjing University of Aeronautics and Astronautics,2011.
Yuhao Cai, Yongquan Liang, Jiancong Fan et al. K-means algorithm for weighted local variance optimization of initial cluster centres. Computer Science and Exploration,2016,10(05):732-741.
Jia Ruiyu, Li Yugong. K-means algorithm for self-determination of the number of class clusters and initial centroids [J]. Computer Engineering and Applications,2018,54(07):152-158.
Jianren Wang, Xin Ma, Ganglong Duan. Improved k-means clustering k-value selection algorithm. Computer Engineering and Applications,2019,55(08):27-33.
Liu Quanhong, Tang Fuxing. Optimisation of site selection for faulty shared bicycle recycling centre based on K-means clustering algorithm and centre of gravity method. Operations Research and Management,2023,32(07):85-91.
Johnson SC. Hierarchical clustering schemes. Psychometrika, 1967, 32(2): 241-254.
Lv L, YU YQ, REN M et al. Cohesive hierarchical clustering based on ant colony optimization algorithm. Computer Application Research,2017,34(01):114-117.
Kaufman L, Rousseeuw PJ. Finding Groups in Data: an Introduction to Cluster Analysis. New York: John Wiley & Sons, 1990.
Guha S, Rastogi R, Shim K. CURE: an efficient clustering algorithm for clustering large databases // ACM SIGMOD International Conference on Management of Data, 1998: 73-84
Chu Kexin, Xun Yaling. A hierarchical cluster analysis algorithm for categorical data based on similarity mean. Computer Technology and Development,2022,32(11):154-163.
Liu Yi-Wei. Application research on person name disambiguation based on improved cohesive hierarchical clustering. Shandong University of Science and Technology, 2021. DOI: 10.27275/d.cnki.gsdku.2020.001929.
Zhang R. Research on bearing fault diagnosis method based on improved LLE algorithm. Northeast Petroleum University, 2023. DOI: 10.26995/d.cnki.gdqsc.2023.000376.
TENENBAUM J B, DESILVA V, LANGFORD J C. A global geometric framework for nonlinear dimension reduction. Sci- ence, 2000, 290(5500):2319-2323.
Li Gapeng. Research on point cloud denoising algorithm based on non-local self-similarity and stream learning. Southwest University, 2023. DOI: 10.27684/d.cnki.gxndx.2023.001242.
L.G. Wang, F. Wu. Colour visualization of hyperspectral images based on KL-ISOMAP. Journal of Nanjing University of Information Engineering (Natural Science Edition),2018,10(01): 63-71. DOI:10. 13878/j. cnki. jnuist. 2018. 01.006.
Jiefei Liu. Research on multi-stream shape learning algorithm for high-dimensional data. Shanxi University,2018.
Roweis S T, Saul L K. Nonlinear Dimensionality Reduction by Locally Linear Embedding. Science, 2000, 290 (5500): 232
Yu WB. Research on Feature Extraction from Hyperspectral Data Based on Stream Embedding and Deep Feature Analysis. Harbin Institute of Technology, 2021. DOI: 10.27061/d.cnki.ghgdu.2021.005341.
Qiuying Yang, Xiaoqing Weng. Time series clustering based on LLE and Gaussian mixture model. Computer Technology and Development,2022,32(08):33-41.
Lv Bingqian, Fan Linyuan. Improvement of LLE algorithm based on commuting time distance and Rank-Order distance. Information Systems Engineering,2021(07):141-144.
Jiang Sanshan. Research on gas outflow prediction based on LLE-WPA-BP algorithm. Liaoning University of Engineering and Technology, 2023. DOI: 10.27210/d.cnki.glnju.2022.000747.
LIU Tao, WANG Shen, WU Jiajing et al. Research and application of intelligent power technology based on BP neural network metro shield. China Water Transport (the second half of the month),2023,23(11):37-39.
Gao Jia-Nan, Ma Le-Tian, Bai Jin-Yang et al. A wind temperature prediction model for drenching wellbore based on GA-BP neural network. China Mining:1-6[2023-10-22].http://kns.cnki. net/kcms/ detail/11. 3033.TD. 2023 1017. 1044. 004.html.
Zhang Liang, He Shan, Ai Chunyu. Load prediction of wind turbine leaf root based on Sine-SSA-BP neural network model. Renewable Energy,2023,41(10): 1322-1328. DOI: 10.13941/j.cnki.21-1469/tk.2023.10.006.
J. Chen. Research on image style migration based on convolutional neural network. Journal of Hubei Institute of Technology,2023,39(05):35-38.
LECUN Y, BOTTOU L, BENGIO Y, et al. Gradient, based learning applied to document recognition. Pro- ceedings of the IEEE, 1998, 86(11): 2278-2324.
SIMONYAN K, ZISSERMAN A. Very deep convolu tional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556, 2014.
SZEGEDY C, LIU W, JIA Y, et al. Going deeper with convolutions, Proceedings of the 2015 IEEE Confer- ence on Computer Vision and Pattern Recognition, 2015:1-9.
Downloads
Published
Conference Proceedings Volume
Section
License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.







