Customer churn prediction in the software as a service industry

Zaranka, Eimantas; Zhyhun, Bohdan; Songailaitė, Milita; Juozaitienė, Rūta; Krilavičius, Tomas

Use this url to cite publication: https://hdl.handle.net/20.500.12259/259201

Customer churn prediction in the software as a service industry

Type of publication

Tezės kitame recenzuojamame leidinyje / Theses in other peer-reviewed publication (T1e)

Author(s)

Author	Affiliation
Zaranka, Eimantas	Gamtos ir tech.mokslų tyr.institut / GTMTI Gamtos ir tech.mokslų tyr.institut		Centre for Applied Research and Development
Zhyhun, Bohdan	Taikomosios informatikos katedra / Department of Applied Informatics	LT	Centre for Applied Research and Development
Songailaitė, Milita	Gamtos ir tech.mokslų tyr.institut / GTMTI Gamtos ir tech.mokslų tyr.institut		Centre for Applied Research and Development
Juozaitienė, Rūta	Matematikos ir statistikos katedra / Department of Mathematics and Statistics		Centre for Applied Research and Development
Krilavičius, Tomas	Taikomosios informatikos katedra / Department of Applied Informatics		Centre for Applied Research and Development

Title

Customer churn prediction in the software as a service industry

[en]

Is part of

DAMSS-2023 : Data analysis methods for software systems : 14th conference, Druskininkai, Lithuania, November 30 – December 2, 2023 : [book of abstracts]

Date Issued

Date	Start Page	End Page
2023	99	99

Publisher

Vilnius : Vilnius University

URI

URI
https://www.journals.vu.lt/plugins/generic/pdfJsViewer/pdf.js/web/viewer.html?file=https%3A%2F%2Fwww.journals.vu.lt%2Fproceedings%2Farticle%2Fdownload%2F33673%2F32252%2F84674#%5B%7B%22num%22%3A63%2C%22gen%22%3A0%7D%2C%7B%22name%22%3A%22XYZ%22%7D%2C-246%2C978%2C0%5D
https://hdl.handle.net/20.500.12259/259201

Field of Science

Informatika / Informa...

OECD Classification

Natural sciences::Com...

Abstract (en)

In the modern commercial environment characterised by a plethora of alternatives available to consumers for identical products, customer retention plays a pivotal role in sustainable business success. This research investigates customer churn prediction through the application of a diverse array of machine learning algorithms, including logistic regression, support vector machines, decision trees, random forests, and gradientboosted trees. We use real-world data obtained from a company specialising in offering subscription-based services designed to enhance individuals’ personal development. The dataset included business-related customer data such as money spent, the last payment date, total orders completed, and customer platform usage data, including the number of activities completed and the timeframe since account creation, etc. Several experiments were conducted, involving the exploration of various feature subsets obtained via “Boruta”, “Boruta Shap”, decision tree feature importance, and correlation coefficient techniques to identify the most promising feature set within different prediction time horizon windows. The trained models underwent evaluation based on multiple performance metrics, including accuracy, precision, recall, and F1 score. This investigation concluded that the gradient-boosted trees algorithm emerged as the most promising model for predicting customer churn, delivering an impressive overall accuracy of 95.5%.

Type of document

text::conference output::conference proceedings::conference paper

Language

Anglų / English (en)

Coverage Spatial

Lietuva / Lithuania (LT)

ISBN (of the container)

9786090709856

Matematikos ir statistikos katedra / Department of Mathematics and Statistics

Taikomosios informatikos katedra / Department of Applied Informatics

Vytauto Didžiojo universitetas / Vytautas Magnus University

Informatikos fakultetas / Faculty of Informatics

Gamtos ir tech.mokslų tyr.institut / Research Institute of Natural and Technological Sciences