Analisis dan Prediksi Customer Churn pada Platform Streaming Berbasis Langganan Menggunakan Metode Random Forest
DOI:
https://doi.org/10.61132/jubid.v3i1.1226Keywords:
Customer Churn, Customer Retention, Machine Learning, Random Forest, Streaming PlatformAbstract
Customer churn is a strategic challenge for digital streaming platforms because it directly Impacts revenue and business sustainability. This study aims to analyze the factors influencing customer Churn and develop a churn prediction model using the Random Forest algorithm. The study uses a Quantitative approach with an explanatory design and utilizes secondary data from the Netflix Customer Churn and Engagement Dataset available on Kaggle. The dataset consists of 1,000 customer data with 16 Variables covering demographic characteristics, service usage behavior, financial condition, and customer Satisfaction level. The data was processed through preprocessing, one-hot encoding, and a 70:30 split Between training and test data. Model performance was evaluated using accuracy, precision, recall, F1 Score, and ROC-AUC metrics. The results show that the Random Forest model produces an accuracy of 53.7%, precision of 56.3%, recall of 63.6%, F1-score of 59.7%, and ROC-AUC of 0.534, indicating Moderate predictive ability and only slightly better than random classification. Feature importanceAn.evealed that user engagement levels, such as viewing duration and frequency of interactions, Were the most dominant factors influencing churn, followed by economic factors and customer satisfaction. The results of this study are expected to provide a basis for streaming platforms to design more effective Customer retention strategies.
Downloads
References
Ahmad, A., Jafar, A., & Aljoumaa, K. (2019). Customer churn prediction in telecom using machine learning in big data platform. Journal of Big Data, 6(1). https://doi.org/10.1186/s40537-019-0191-6
Anderson, S. (2019). Customer churn prediction using random forests: Analysis of machine learning techniques for churn prediction in the telecom sector. IEEE Access, 7, 60134-60149. https://doi.org/10.1109/access.2019.2914999
Beeharry, Y., & Fokone, R. (2021). Hybrid approach using machine learning algorithms for customers' churn prediction in the telecommunications industry. Concurrency and Computation Practice and Experience, 34(4). https://doi.org/10.1002/cpe.6627
Çallı, L., & Kasim, S. (2022). Using machine learning algorithms to analyze customer churn in the Software as a Service (SaaS) industry. Academic Platform Journal of Engineering and Smart Systems, 10(3), 115-123. https://doi.org/10.21541/apjess.1139862
Chang, V., Hall, K., Xu, Q., Amao, F., Ganatra, M., & Benson, V. (2024). Prediction of customer churn behavior in the telecommunication industry using machine learning models. Algorithms, 17(6), 231. https://doi.org/10.3390/a17060231
Edwine, N., Wang, W., Song, W., & Ssebuggwawo, D. (2022). Detecting the risk of customer churn in telecom sector: A comparative study. Mathematical Problems in Engineering, 2022, 1-16. https://doi.org/10.1155/2022/8534739
Hussain, F., Neelakandan, S., Geetha, B., Selvalakshmi, V., Umadevi, A., & Martinson, E. (2022). Artificial intelligence-based customer churn prediction model for business markets. Computational Intelligence and Neuroscience, 2022, 1-14. https://doi.org/10.1155/2022/1703696
Keramati, A., Ghaneei, H., & Mirmohammadi, S. (2016). Developing a prediction model for customer churn from electronic banking services using data mining. Financial Innovation, 2(1). https://doi.org/10.1186/s40854-016-0029-6
Muneer, A., Ali, R., Alghamdi, A., Taib, S., Almaghthawi, A., & Ghaleb, E. (2022). Predicting customers churning in banking industry: A machine learning approach. Indonesian Journal of Electrical Engineering and Computer Science, 26(1), 539. https://doi.org/10.11591/ijeecs.v26.i1.pp539-549
Thakkar, H., Desai, A., Ghosh, S., Singh, P., & Sharma, G. (2022). Clairvoyant: AdaBoost with cost-enabled cost-sensitive classifier for customer churn prediction. Computational Intelligence and Neuroscience, 2022, 1-11. https://doi.org/10.1155/2022/9028580
Vo, N., Liu, S., Li, X., & Xu, G. (2021). Leveraging unstructured call log data for customer churn prediction. Knowledge-Based Systems, 212, 106586. https://doi.org/10.1016/j.knosys.2020.106586
Xie, Y., Li, X., Ngai, E., & Ying, W. (2009). Customer churn prediction using improved balanced forests. Expert Systems With Applications, 36(3), 5445-5449. https://doi.org/10.1016/j.eswa.2008.06.121
Xu, T., Ma, Y., & Kim, K. (2021). Telecom churn prediction system based on ensemble learning using feature grouping. Applied Sciences, 11(11), 4742. https://doi.org/10.3390/app11114742
Zhou, Y., Chen, W., Sun, X., & Yang, D. (2023). Early warning of telecom enterprise customer based on ensemble learning. PLoS ONE, 18(10), e0292466. https://doi.org/10.1371/journal.pone.0292466
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Jurnal Bisnis Inovatif dan Digital

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.


