Impact of Key Drivers on New York Traffic Flow: A Comprehensive Study

Authors

  • Zhaoxi Chen

DOI:

https://doi.org/10.62051/ghjj7s10

Keywords:

Traffic volume prediction; Urban traffic management; Random Forest Regressor; XGBoost; LightGBM.

Abstract

This study aims to predict traffic volumes in New York City using integrated datasets from transportation, weather, and collision records. The author employed multiple machine learning models, including Random Forest Regressor, XGBoost, LightGBM, and an Ensemble Model combining Random Forest and LightGBM, to utilize their strengths. The data preprocessing involved cleaning, merging, and encoding categorical features, resulting in a comprehensive dataset that integrated traffic volume, collision, and weather data. The findings indicated that the LightGBM model achieved the highest accuracy with the lowest error rates (MSE: 1336.12, RMSE: 36.55, MAE: 26.35, R²: 0.9788). While comparing with other studies, the models, particularly LightGBM, provided superior predictive performance. Despite the promising results, the limited dataset size and computational complexity posed challenges. Therefore, in the future, the research should focus on expanding the dataset, exploring advanced ensemble techniques, and evaluating the applicability of the model across different subpopulations to enhance the robustness and universality of traffic volume predictions.

Downloads

Download data is not yet available.

References

[1] Baghestani A, Tayarani M, Allahviranloo M, Gao H O. Evaluating the traffic and emissions impacts of congestion pricing in New York City. Sustainability, 2020, 12(9): 3655.

[2] Cheng W, Li J, Xiao H, Ji L. Combination Predicting Model of Traffic Congestion Index in Weekdays Based on LightGBM-GRU. Scientific Reports, 2022, 12(1).

[3] Laifa H, Khcherif R, Ghezalaa H H B. Train delay prediction in Tunisian railway through LightGBM model. Procedia Computer Science, 2021, 192: 981–990.

[4] Navarro-Espinoza A, López-Bonilla O R, García-Guerrero E E, Tlelo-Cuautle E, López-Mancilla D, Hernández-Mejía C, Inzunza-González E. Traffic flow prediction for smart traffic lights using machine learning algorithms. Technologies, 2022, 10(1): 5.

[5] Shang Q, Feng L, Gao S. A hybrid method for traffic incident detection using random Forest-Recursive feature elimination and long Short-Term memory network with Bayesian optimization algorithm. IEEE Access, 2021, 9: 1219–1232.

Downloads

Published

25-11-2024

How to Cite

Chen, Z. (2024) “Impact of Key Drivers on New York Traffic Flow: A Comprehensive Study”, Transactions on Computer Science and Intelligent Systems Research, 7, pp. 195–199. doi:10.62051/ghjj7s10.