Navigating the CRISPR-Cas9 Frontier: AI-Enabled off-target prediction and sgRNA Design for Unprecedented Precision

Authors

  • Haoyu Sun

DOI:

https://doi.org/10.62051/h0n3wd61

Keywords:

CRISPR-Cas9; Machine learning; sgRNA design; efficient sgRNA.

Abstract

CRISPR-Cas9, composed of Clustered Regularly Interspaced Short Palindromic Repeats and CRISPR-associated protein 9, is a pivotal tool for precise genetic manipulation with diverse biomedical applications. In this system, the scientists use single guide RNA (sgRNA) synthesized from tracrRNA and crRNA to lead the Cas9 protein to target specific gene locations, thereby achieving gene editing. Nonetheless, drawbacks like single guide RNA's limited efficiency led to frequent base mismatch and off-target effects, which hampers CRISPR-Cas9's potential. Under these circumstances, entering machine learning, adept at adapting to variations and handling intricate datasets, is a viable avenue for optimizing CRISPR-Cas9's guide RNA by rectifying these limitations. Nevertheless, machine learning is not exempt from limitations. Within this framework, this paper presents a succinct overview of the challenges linked to sgRNA's efficiency issues. It then outlines existing mechanisms in machine learning and assesses the efficacy of machine learning in enhancing sgRNA design to improve CRISPR/Cas9 sgRNA specificity. Additionally, this paper scrutinizes notable restrictions and suggestions of machine learning in the quest for superior sgRNAs to guide future research.

Downloads

Download data is not yet available.

References

Jinek, M., Chylinski, K., Fonfara, I., Hauer, M., Doudna, J. A., & Charpentier, E. (2012). A programmable dual-rna–guided DNA endonuclease in adaptive bacterial immunity. Science, 337 (6096), 816 - 821.

Konstantakos, V., Nentidis, A., Krithara, A., & Paliouras, G. (2022). CRISPR–cas9 gRNA efficiency prediction: An overview of predictive tools and the role of deep learning. Nucleic Acids Research, 50 (7), 3616 - 3637.

Foschi, N., Athanasakis, E., Gasparini, P., Stazio, M. D., & D’Adamo, A. P. (2020). Systematic analysis of factors that improve HDR efficiency in CRISPR / Cas9 technique.

Lisa Li, H., Nakano, T., & Hotta, A. (2013). Genetic correction using engineered nucleases for gene therapy applications. Development, Growth & Differentiation, 56 (1), 63 - 77.

Chen, L., Wang, S., Zhang, Y., Li, J., Xing, Z., Yang, J., Huang, T., & Cai, Y. (2017). Identify key sequence features to improve CRISPR sgRNA efficacy. IEEE Access, 5, 26582 - 26590.

Louie, W., Shen, M. W., Tahiry, Z., Zhang, S., Worstell, D., Cassa, C. A., Sherwood, R. I., & Gifford, D. K. (2021). Machine learning based CRISPR gRNA design for therapeutic exon skipping. PLOS Computational Biology, 17 (1), e1008605.

Kim, H. K., Min, S., Song, M., Jung, S., Choi, J. W., Kim, Y., Lee, S., Yoon, S., & Kim, H. (. (2018). Deep learning improves prediction of CRISPR–cpf1 guide RNA activity. Nature Biotechnology, 36 (3), 239 - 241.

Sherkatghanad, Z., Abdar, M., Charlier, J., & Makarenkov, V. (2023). Using traditional machine learning and deep learning methods for on- and off-target prediction in CRISPR/Cas9: A review. Briefings in Bioinformatics, 24 (3).

Abadi, S., Yan, W. X., Amar, D., & Mayrose, I. (2017). A machine learning approach for predicting CRISPR-cas9 cleavage efficiencies and patterns underlying its mechanism of action. PLOS Computational Biology, 13 (10), e1005807.

Doench, J. G., Fusi, N., Sullender, M., Hegde, M., Vaimberg, E. W., Donovan, K. F., Smith, I., Tothova, Z., Wilen, C., Orchard, R., Virgin, H. W., Listgarten, J., & Root, D. E. (2016). Optimized sgRNA design to maximize activity and minimize off-target effects of CRISPR-cas9. Nature Biotechnology, 34 (2), 184 - 191.

Koch, B., Nijmeijer, B., Kueblbeck, M., Cai, Y., Walther, N., & Ellenberg, J. (2018). Generation and validation of homozygous fluorescent knock-in cells using CRISPR–cas9 genome editing. Nature Protocols, 13 (6), 1465 - 1487.

Liu, X., Yang, Y., Qiu, Y., Reyad-ul-ferdous, M., Ding, Q., & Wang, Y. (2020). SeqCor: Correct the effect of guide RNA sequences in clustered regularly interspaced short palindromic repeats/Cas9 screening by machine learning algorithm. Journal of Genetics and Genomics, 47 (11), 672 - 680.

Charlier, J., Nadon, R., & Makarenkov, V. (2021). Accurate deep learning off-target prediction with novel sgRNA-DNA sequence encoding in CRISPR-cas9 gene editing. Bioinformatics, 37 (16), 2299 - 2307.

Chuai, G., Ma, H., Yan, J., Chen, M., Hong, N., Xue, D., Zhou, C., Zhu, C., Chen, K., Duan, B., Gu, F., Qu, S., Huang, D., Wei, J., & Liu, Q. (2018). DeepCRISPR: Optimized CRISPR guide RNA design by deep learning. Genome Biology, 19 (1).

Hatture, S. M., & Kadakol, N. (2021). Clinical diagnostic systems based on machine learning and deep learning. Demystifying Big Data, Machine Learning, and Deep Learning for Healthcare Analytics, 159 - 183.

Dhanjal, J. K., Dammalapati, S., Pal, S., & Sundar, D. (2020). Evaluation of off-targets predicted by sgRNA design tools. Genomics, 112 (5), 3609 - 3614.

High-density guide RNA tiling and machine learning for designing CRISPR interference in Synechococcus Sp. PCC 7002. (n.d.).

Haeussler, M., Schönig, K., Eckert, H., Eschstruth, A., Mianné, J., Renaud, J., Schneider-Maunoury, S., Shkumatava, A., Teboul, L., Kent, J., Joly, J., & Concordet, J. (2016). Evaluation of off-target and on-target scoring algorithms and integration into the guide RNA selection tool CRISPOR. Genome Biology, 17 (1).

Fanzor: First CRISPR-like system found in eukaryotes. (2023). GEN Biotechnology, 2 (4), 276 - 277.

Downloads

Published

24-03-2024

How to Cite

Sun, H. (2024). Navigating the CRISPR-Cas9 Frontier: AI-Enabled off-target prediction and sgRNA Design for Unprecedented Precision. Transactions on Materials, Biotechnology and Life Sciences, 3, 522-531. https://doi.org/10.62051/h0n3wd61