Comparing Machine Learning Models for Predicting YouTube Video Like-to-View Ratios

Xinming Zhao

doi:10.62051/ybmr8j39

Keywords:

YouTube video prediction; machine learning; ensemble learning.

Abstract

With YouTube's dominance as a video-sharing platform, the like-to-view ratio (LVR) has emerged as a critical metric for quantifying audience engagement beyond raw view counts. Existing approaches, however, lack systematic model comparisons and actionable optimization strategies. This study addresses these gaps in three ways. First, it evaluates tree-based ensemble models (Random Forest, XGBoost, LightGBM) on raw video metadata and finds that XGBoost performs best (R² = 0.8703), outperforming both the other individual models and a Voting Regressor (R² = 0.8654). Second, it develops an operational framework that combines XGBoost predictions with LLM-generated suggestions (e.g., optimal publishing times). Third, it deploys this system as a web tool for real-time decision support. The results identify temporal features and hyperparameter tuning as the key drivers of accuracy. Although visual and audio elements were excluded because of data limitations, the XGBoost-LLM hybrid system connects predictive analytics with creator guidance. Future work should explore multimodal data integration and cross-platform trends to improve prediction robustness. The research contributes to video analytics by showing how machine learning can support data-driven content optimization, offering both theoretical insight into model selection and practical tools for creators.
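To make the evaluation terms in the abstract concrete, the following minimal sketch shows how an LVR target, an R² score, and an unweighted voting average could be computed. It is an illustration under stated assumptions, not the study's pipeline: the function names, the sample videos, and the two sets of model predictions are all hypothetical, and the simple mean used here is only one way a voting regressor can combine its base models.

```python
from statistics import fmean


def like_to_view_ratio(likes, views):
    """LVR = likes / views; returns 0.0 for a video with no views."""
    return likes / views if views > 0 else 0.0


def r2_score(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_true = fmean(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def voting_average(*model_predictions):
    """Unweighted voting: average each sample's predictions across models."""
    return [fmean(preds) for preds in zip(*model_predictions)]


# Hypothetical (likes, views) pairs; not data from the study.
videos = [(50, 1000), (30, 1500), (120, 2000), (8, 400)]
y_true = [like_to_view_ratio(likes, views) for likes, views in videos]

# Hypothetical predictions standing in for two fitted regressors.
model_a = [0.048, 0.025, 0.055, 0.022]
model_b = [0.054, 0.019, 0.063, 0.016]
ensemble = voting_average(model_a, model_b)

for name, preds in [("model A", model_a), ("model B", model_b), ("voting", ensemble)]:
    print(f"{name}: R2 = {r2_score(y_true, preds):.4f}")
```

An averaged ensemble does not necessarily beat its best member, which is consistent with the abstract's report that the Voting Regressor scored slightly below XGBoost alone.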
