Text Processing Method based on Natural Language Processing
-- Taking Two Model as Examples
DOI:
https://doi.org/10.62051/qqbb7096Keywords:
Chinese Natural Language Processing Tasks; Sequence Labeling Model; BERT Pre-training Model; Text Processing.Abstract
Today, natural language processing is booming and is widely used in different fields, especially for text processing. In many Chinese natural language processing tasks such as information retrieval, named entity recognition, syntactic analysis, etc., word segmentation and part-of-speech tagging are often the first steps. This paper mainly selects sequence labeling model and BERT pre-training model, specifically studies how to use natural language processing technology for text processing, and analyzes and summarizes the advantages and disadvantages of the two models.
Downloads
References
REN Zhihui, XU Haoyu, FENG Songlin, et al. Sequence labeling Chinese word segmentation method based on LSTM networks[J]. Application Research of Computers, 2017, 34(5): 1321-1324.
LIN Jiarui, CHENG Zhigang, HAN Yu, et al. Classification method of disaster tweet based on BERT pre-training model [J]. Journal of Graphics,2022,43(03):530-536.
Hovy, D., & Spruit, S. L. (2016). The social impact of natural language processing. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers) (pp. 591-598).
Ribeiro, M. T., Singh, S., & Guestrin, C. (2016). Why should i trust you?: Explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining (pp. 1135-1144).
WEI Xinyang, TANG Xianghong. Research on Extractive Judgment Document Abstract Generation Method based on BERT[J]. SOFTWARE ENGINEERING,2022:1-4.
LIN Jia-rui, CHENG Zhi-gang, HAN Yu, YIN Yun-peng. Disaster tweets classification method based on pretrained BERT model[J]. JOURNAL OF GRAPHICS,2022,43(3):530-536.
Downloads
Published
Conference Proceedings Volume
Section
License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.







