Research on Product Demand Forecasting Based on Random Forest and ARIMA Time Series: Precision Forecasting Method for Data-Scarce Environments
DOI:
https://doi.org/10.62051/g9r9ca46Keywords:
Random Forest; ARIMA Time Series Forecasting; Euclidean Distance; K-Means Cluster Analysis; Product Demand Prediction.Abstract
In the field of e-commerce, accurately predicting the future demand for specific products is crucial for efficient inventory management and supply chain optimization. Although traditional time series forecasting methods have shown reliability in handling rich data, they are significantly affected by seasonal fluctuations and sudden market events. This is especially true for newly launched products or long-tail products with insufficient sales data, which often results in poor performance, limiting the accuracy and feasibility of traditional models in practical applications. To address this issue, this paper proposes an innovative forecasting framework based on extensive historical sales data from e-commerce platforms. This framework employs random forests to deeply mine and learn from historical data, and uses the K-Means clustering algorithm and Euclidean distance to identify the most matching sales feature trends for target products with limited data. It further provides accurate predictions for these products through the ARIMA time series model, offering robust support for further optimization of the forecasting model. Ultimately, by comparing with traditional models, the innovative forecasting framework presented in this paper demonstrates a significant improvement in accuracy when predicting sales of products with limited data, overcoming the challenges of forecasting under data constraints, and fully proving its effectiveness and innovativeness.
Downloads
References
[1] WangXuan, ZhouYun. Research on the digital application of manufacturing enterprise procurement supply chain platform [J]. China Shipping Weekly, 2024 (13): 72 - 74.
[2] Yi Keke. Research on Q Company’s FMCG demand forecast based on machine learning [D]. Shanghai: Donghua University, 2023.
[3] Xu Dongdong. Research on data-driven retail network inventory replenishment[D]. Shanghai: Donghua University, 2022.
[4] Fang Kuangnan, Wu Jianbin, Zhu Jianping, et al. A review of research on random forest methods [J]. Statistics and Information Forum, 2011, 26 (3): 32 - 38.
[5] Wei Kaicong. Research on taxi demand forecasting based on machine learning methods [D]. Guangzhou University, 2020.
[6] Zhang Jingjing. PM2.5 combination forecast based on ARIMA-BP neural network [D]. Lanzhou University, 2023.
[7] Wen Jiaxuan, Wang Miao, Liu Ji. A Time Series Multi step Prediction Algorithm Based on Time Series Decomposition and Random Forest [J]. Journal of East China University of Science and Technology (Natural Science Edition), 2023, 49 (06): 873 - 881.
[8] Zhao Juanhe. Research on short-term demand forecasting method for e-commerce commodities based on multi-layer hybrid deep neural network [D]. Chang'an University, 2020.
[9] Zhang Yifan, Hu Jiahao, Li Yiqiao. Research on commodity clustering based on kmeans algorithm [J]. Digital Technology and Applications, 2020, 38 (04): 108 - 110.
[10] Li Kewen, Lin Yalin, Yang Yaozhong. An improved SDRSMOTE algorithm based on Euclidean distance [J]. Computer Engineering and Science, 2019, 41 (11): 2063 - 2070.
[11] Nie Yuxuan. Research on automatic pricing and replenishment strategy of fresh commodities based on ARIMA prediction optimization model - taking vegetable commodities as an example [J]. Trade Exhibition Economics, 2024 (05): 19 - 22.
Downloads
Published
Conference Proceedings Volume
Section
License
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.