Feature Engineering and Data Visualization Analysis in Artificial Intelligence in Big Data Era

Authors

  • Zongze Li

DOI:

https://doi.org/10.62051/ijcsit.v3n3.41

Keywords:

Big data, Feature construction, Feature extraction, Parallel coordinate graph, Data visualization

Abstract

In the environment of massive data, the selection and construction of feature engineering plays a crucial role in the performance and accuracy of sgon models. It is true that the classic hand-driven feature building method can incorporate insights from the professional field, but this method is potentially accompanied by the hidden trouble of information omission, and does not necessarily touch the boundary of the optimal solution. In order to solve these problems, this paper proposes two strategies of feature extraction: ensemble learning and deep learning. Ensemble learning enhances generalization by combining the opinions of multiple models, while deep learning allows models to automatically learn features, reducing the need for human intervention. Both of these methods can overcome the limitations of manual feature design to varying degrees. In addition, the paper also introduces the application of parallel coordinate graph in feature selection. By using the parallel axis system to implement projection transformation of high-dimensional data, scholars can intuitively analyze the data structure, so as to promote the process of feature selection and optimization. This method not only gives insight into the subtle relationship between the data, but also cleverly stimulates the potential of human pattern recognition and further improves the comprehensive performance of the model.

Downloads

Download data is not yet available.

References

[1] Michaels G S, Carr D B, Askenazi M, et al. Cluster analysis and data visualization of large-scale gene expression data[C]//Pacific symposium on biocomputing. 1997, 98: 42-53.

[2] Azzam T, Evergreen S, Germuth A A, et al. Data visualization and evaluation [J]. New Directions for Evaluation, 2013, 2013(139): 7-32.

[3] Wang W, Lu C. Visualization analysis of big data research based on Citespace [J]. Soft Computing, 2020, 24(11): 8173-8186.

[4] Dzemyda G, Kurasova O, Zilinskas J. Multidimensional data visualization [J]. Methods and applications series: Springer optimization and its applications, 2013, 75(122): 10-5555.

[5] Huang B, Jiang B, Li H. An integration of GIS, virtual reality and the Internet for visualization, analysis and exploration of spatial data [J]. International Journal of Geographical Information Science, 2001, 15(5): 439-456.

Downloads

Published

12-08-2024

Issue

Section

Articles

How to Cite

Li, Z. (2024). Feature Engineering and Data Visualization Analysis in Artificial Intelligence in Big Data Era. International Journal of Computer Science and Information Technology, 3(3), 390-395. https://doi.org/10.62051/ijcsit.v3n3.41