Analisis dan Prediksi Customer Churn pada Platform Streaming Berbasis Langganan Menggunakan Metode Random Forest

Authors

  • Imakulata Kresnawati M Bili Institut Bisnis dan Teknologi Indonesia
  • I Wayan Sudiarta Institut Bisnis dan Teknologi Indonesia
  • Maria Yuditia Wungabelen Institut Bisnis dan Teknologi Indonesia
  • Ni Kadek Alika Rosdiana Institut Bisnis dan Teknologi Indonesia
  • Putri Rafiana Institut Bisnis dan Teknologi Indonesia

DOI:

https://doi.org/10.61132/jubid.v3i1.1226

Keywords:

Customer Churn, Customer Retention, Machine Learning, Random Forest, Streaming Platform

Abstract

Customer churn is a strategic challenge for digital streaming platforms because it directly Impacts revenue and business sustainability. This study aims to analyze the factors influencing customer Churn and develop a churn prediction model using the Random Forest algorithm. The study uses a Quantitative approach with an explanatory design and utilizes secondary data from the Netflix Customer Churn and Engagement Dataset available on Kaggle. The dataset consists of 1,000 customer data with 16 Variables covering demographic characteristics, service usage behavior, financial condition, and customer Satisfaction level. The data was processed through preprocessing, one-hot encoding, and a 70:30 split Between training and test data. Model performance was evaluated using accuracy, precision, recall, F1 Score, and ROC-AUC metrics. The results show that the Random Forest model produces an accuracy of 53.7%, precision of 56.3%, recall of 63.6%, F1-score of 59.7%, and ROC-AUC of 0.534, indicating Moderate predictive ability and only slightly better than random classification. Feature importanceAn.evealed that user engagement levels, such as viewing duration and frequency of interactions, Were the most dominant factors influencing churn, followed by economic factors and customer satisfaction. The results of this study are expected to provide a basis for streaming platforms to design more effective Customer retention strategies.

Downloads

Download data is not yet available.

References

Ahmad, A., Jafar, A., & Aljoumaa, K. (2019). Customer churn prediction in telecom using machine learning in big data platform. Journal of Big Data, 6(1). https://doi.org/10.1186/s40537-019-0191-6

Anderson, S. (2019). Customer churn prediction using random forests: Analysis of machine learning techniques for churn prediction in the telecom sector. IEEE Access, 7, 60134-60149. https://doi.org/10.1109/access.2019.2914999

Beeharry, Y., & Fokone, R. (2021). Hybrid approach using machine learning algorithms for customers' churn prediction in the telecommunications industry. Concurrency and Computation Practice and Experience, 34(4). https://doi.org/10.1002/cpe.6627

Çallı, L., & Kasim, S. (2022). Using machine learning algorithms to analyze customer churn in the Software as a Service (SaaS) industry. Academic Platform Journal of Engineering and Smart Systems, 10(3), 115-123. https://doi.org/10.21541/apjess.1139862

Chang, V., Hall, K., Xu, Q., Amao, F., Ganatra, M., & Benson, V. (2024). Prediction of customer churn behavior in the telecommunication industry using machine learning models. Algorithms, 17(6), 231. https://doi.org/10.3390/a17060231

Edwine, N., Wang, W., Song, W., & Ssebuggwawo, D. (2022). Detecting the risk of customer churn in telecom sector: A comparative study. Mathematical Problems in Engineering, 2022, 1-16. https://doi.org/10.1155/2022/8534739

Hussain, F., Neelakandan, S., Geetha, B., Selvalakshmi, V., Umadevi, A., & Martinson, E. (2022). Artificial intelligence-based customer churn prediction model for business markets. Computational Intelligence and Neuroscience, 2022, 1-14. https://doi.org/10.1155/2022/1703696

Keramati, A., Ghaneei, H., & Mirmohammadi, S. (2016). Developing a prediction model for customer churn from electronic banking services using data mining. Financial Innovation, 2(1). https://doi.org/10.1186/s40854-016-0029-6

Muneer, A., Ali, R., Alghamdi, A., Taib, S., Almaghthawi, A., & Ghaleb, E. (2022). Predicting customers churning in banking industry: A machine learning approach. Indonesian Journal of Electrical Engineering and Computer Science, 26(1), 539. https://doi.org/10.11591/ijeecs.v26.i1.pp539-549

Thakkar, H., Desai, A., Ghosh, S., Singh, P., & Sharma, G. (2022). Clairvoyant: AdaBoost with cost-enabled cost-sensitive classifier for customer churn prediction. Computational Intelligence and Neuroscience, 2022, 1-11. https://doi.org/10.1155/2022/9028580

Vo, N., Liu, S., Li, X., & Xu, G. (2021). Leveraging unstructured call log data for customer churn prediction. Knowledge-Based Systems, 212, 106586. https://doi.org/10.1016/j.knosys.2020.106586

Xie, Y., Li, X., Ngai, E., & Ying, W. (2009). Customer churn prediction using improved balanced forests. Expert Systems With Applications, 36(3), 5445-5449. https://doi.org/10.1016/j.eswa.2008.06.121

Xu, T., Ma, Y., & Kim, K. (2021). Telecom churn prediction system based on ensemble learning using feature grouping. Applied Sciences, 11(11), 4742. https://doi.org/10.3390/app11114742

Zhou, Y., Chen, W., Sun, X., & Yang, D. (2023). Early warning of telecom enterprise customer based on ensemble learning. PLoS ONE, 18(10), e0292466. https://doi.org/10.1371/journal.pone.0292466

Downloads

Published

2026-02-13

How to Cite

Imakulata Kresnawati M Bili, I Wayan Sudiarta, Maria Yuditia Wungabelen, Ni Kadek Alika Rosdiana, & Putri Rafiana. (2026). Analisis dan Prediksi Customer Churn pada Platform Streaming Berbasis Langganan Menggunakan Metode Random Forest. Jurnal Bisnis Inovatif Dan Digital, 3(1), 01–15. https://doi.org/10.61132/jubid.v3i1.1226