Leiden Clustering Based on Single-cell Sequencing Data of Human Bone Marrow

Authors

  • Jianzhang Li
  • Zixuan Zhao

DOI:

https://doi.org/10.62051/7vqdpy80

Keywords:

Single-cell sequencing, Principal Component Analysis, Leiden Clustering.

Abstract

As single-cell sequencing technology has gradually become a popular method for obtaining effective data in biology, the experimental steps corresponding to different experimental goals are also very different, which often causes researchers to make mistakes in choosing data processing methods. In response to this situation, this paper selects a part of the NeurIPS 2021 benchmark dataset of openproblem because the data processing difficulty is moderate, the data noise is low, and the dataset is bone marrow mononuclear cell data of healthy human donors, which is representative in cell processing, and conducts an in-depth compilation and analysis of the dataset, summarizing a set of more universal single-cell sequencing experimental steps. This paper first filters cells and genes by setting different thresholds of corresponding indicators to achieve the purpose of data preprocessing; in terms of data dimensionality reduction, this paper uses principal component analysis (PCA) to reduce the dimensionality of the data and mark more than 2,000 highly variable genes. Then clustering the cells, and this paper divides the cell clusters into 27 categories through the Leiden clustering method. Meanwhile, this paper also identifies and analyzes the marker genes based on the cell clusters obtained by clustering. Through the research of this article, it is found that compared with traditional sequencing methods (such as K-means, hierarchical clustering, Louvain clustering, etc.), Leiden clustering is the most locally distributed for all subsets of all communities. Salient features, and considering that Louvain clustering’s lack of community connectivity is difficult to solve, make Leiden clustering more superior to Louvain clustering in processing single-cell data. At the same time, the research of this paper not only innovates in the method of single-cell sequencing, but also achieves a driving significance in the derivation of its biological functions.

Downloads

Download data is not yet available.

References

[1] Wang Quan, Wang Zhu, Zhang Zhen, Li Chen, Zhang Mengmeng, Ye Yingjiang, Wang Shan, Jiang Kewei. Overview of the Technology of Single Cell Sequencing[J]. CHINESE JOURNAL OF MEDICINAL GUIDE, 2020, 22(7): 433-439.

[2] See P, Lum J, Chen J, et al. A Single-cell sequencing guide for immunologists[J]. Frontiers in Immunology, 2018, 9 :2425.

[3] Zhang Miao, Sun Xiang-rui, Xu Chun-ming. Research Progress of Approaches in Single Cell RNA Sequencing Data Analysis[J]. Biotechnology Bulletin, 2021, 37(1): 52-59.

[4] Traag, Vincent A., Ludo Waltman, and Nees Jan Van Eck. From Louvain to Leiden: guaranteeing well-connected communities[J]. Scientific reports, 2019, 9(1): 1-12.

[5] Newman, Mark EJ, and Michelle Girvan. Finding and evaluating community structure in networks[J]. Physical review E, 2004, 69(2): 026113.

[6] Li, Wenjun, et al. Parameterized algorithms of fundamental NP-hard problems: a survey[J]. Human-centric Computing and Information Sciences, 2020(10): 1-24.

[7] Zhang, Jicun, et al. An improved Louvain algorithm for community detection[J]. Mathematical Problems in Engineering, 2021,2021(1): 1485592.

[8] Christensen, Alexander P., et al. Comparing community detection algorithms in psychological data: A Monte Carlo simulation[M]. 2020.

[9] Smith, Natalie R., et al. A guide for choosing community detection algorithms in social network studies: The question alignment approach[J]. American journal of preventive medicine, 2020, 59(4): 597-605.

[10] Rozikov U A. Gibbs measures in biology and physics: The Potts model[M]. 2022.

Downloads

Published

24-12-2024

How to Cite

Li, J., & Zhao, Z. (2024). Leiden Clustering Based on Single-cell Sequencing Data of Human Bone Marrow. Transactions on Materials, Biotechnology and Life Sciences, 7, 759-766. https://doi.org/10.62051/7vqdpy80