New Churn Prediction Strategies in the Telecom Industry

New Churn Prediction Strategies in the Telecom Industry

Dymitr Ruta (BT Group, Research and Venturing, UK), Christoph Adl (BT Group, Research and Venturing, UK) and Detlef Nauck (BT Group, Research and Venturing, UK)
DOI: 10.4018/978-1-59904-982-3.ch013
OnDemand PDF Download:
$37.50

Abstract

In the telecom industry, high installation and marketing costs make it six to 10 times more expensive to acquire a new customer than it is to retain an existing one. Prediction and prevention of customer churn is therefore a key priority for industrial research. While all the motives of customer decision to churn are highly uncertain there is a lot of related temporal data generated as a result of customer interaction with the service provider. The major problem with this data is its time discontinuity resulting from the transactional character of events they describe. Moreover, such irregular temporal data sequences are typically a chaotic mixture of different data types, which further hinders its exploitation for any predictive task. Existing churn prediction methods like decision trees typically classify customers into churners and non-churners based on the static data collected in a snapshot of time while completely ignoring the timing of churn and hence the circumstances of this event. In this work, we propose new churn prediction strategies that are suitable for application at different levels of the information content available in customers’ data. Gradually enriching the data information content from the prior churn rate and lifetime expectancy then typical static events data up to decay-weighted data sequences, we propose a set of new churn prediction tools based on: customer lifetime modelling, hidden markov model (HMM) of customer events, and the most powerful k nearest sequence (kNS) algorithm that deliver robust churn predictions at different levels of data availability. Focussing further on kNS we demonstrate how the sequential techprocessing of appropriately pre-processed data streams lead to better performance of customer churn prediction. Given histories of other customers and the current customer data, the presented kNS uses an original combination of sequential nearest neighbour algorithm and original sequence aggregation technique to predict the whole remaining customer data sequence path up to the churn event. On the course of experimental trials, it is demonstrated that the new kNS model better exploits time-ordered customer data sequences and surpasses existing churn prediction methods in terms of performance and capabilities offered.
Chapter Preview
Top

Introduction

Today’s global telecommunication market environment can be characterised by the strong competition among different telcos and a decline in growth rate due to maturity of the market. Furthermore, there is a huge pressure on those companies to make healthy profits and increase their market shares. Most telecom companies are in fact customer-centric service providers and offer to their customers a variety of subscription services. One of the major issues in such environment is customer churn known as a process by which a company loses a customer to a competitor. Recent estimates suggest that churn rates in the telecom industry could be anywhere from25 percent to 50 percent (Furnas, 2003). Moreover on average it costs around $400 to acquire a new customer, which takes years to recoup (Furnas, 2003). These huge acquisition costs are estimated to be five to eight times higher than it is to retain the existing customer by offering him some incentives (Yan, Miller, Mozer, & Wolniewicz, 2001). In this competitive and volatile environment, it makes every economic sense to have a strategy to retain customers, which is only possible if the customer intention to churn is detected early enough.

There are many different reasons for customers to churn, some of them, like moving home, unstoppable, others like sudden death, undetectable. The churn prediction systems therefore should focus on detecting those churners that are deliberately moving to a competitor as these customers are most likely to leave data traces of their intent prior to churn and can be potentially persuaded to stay. This work is not concerned with the effectiveness of actual actions preventing customer churn or rescuing customers who cancelled their contract. The only concern here is the prediction of customer churn in order to provide the information about which customers are most likely to leave the service in the near future.

Churn prediction attracts recently a lot of both scientific and business attention. In the presence of large data warehouses as well as terabytes of data from Web resources, data mining techniques are increasingly being appreciated and adopted to business applications (Lemmen, 2000; Morgan, 2003), in an attempt to explain drivers of customer actions, in particular sudden falls in customer satisfaction and value ultimately leading to churn. There is a number of churn prediction models used commercially at present, however churn is only being modelled statically by analysing event-driven customer data and running regression or predictive classification models at a particular time (Duda, Hart, & Stork, 2001) over aggregated customer data. Some improvement is obtained after segmenting customers into specific groups and dealing with different groups separately yet this segmentation only supports company’s customer relationship management (CRM) and on its own does not improve weak performance in churn prediction. In practise the most common churn management systems are even simpler as they try to device a churn risk based on regression against available data variables. On the research arena the focus is shifted towards more complex classification and non-linear regression techniques like neural networks (Mozer, Wolniewicz, Grimes, Johnson, & Kaushansky, 2000),decision trees (Blundon, 2003) or support vector machines (Morik & Kopcke, 2004) yet applied in the same static context to customer data and hence not generating any promising results.

Complete Chapter List

Search this Book:
Reset
Editorial Advisory Board
Table of Contents
Preface
Hsiao-Fan Wang
Acknowledgment
Hsiao-Fan Wang
Chapter 1
Martin Spott, Detlef Nauck
This chapter introduces a new way of using soft constraints for selecting data analysis methods that match certain user requirements. It presents a... Sample PDF
Automatic Intelligent Data Analysis
$37.50
Chapter 2
Hung T. Nguyen, Vladik Kreinovich, Gang Xiang
It is well known that in decision making under uncertainty, while we are guided by a general (and abstract) theory of probability and of statistical... Sample PDF
Random Fuzzy Sets: Theory & Applications
$37.50
Chapter 3
Gráinne Kerr, Heather Ruskin, Martin Crane
Microarray technology1 provides an opportunity to monitor mRNA levels of expression of thousands of genes simultaneously in a single experiment. The... Sample PDF
Pattern Discovery in Gene Expression Data
$37.50
Chapter 4
Erica Craig, Falk Huettmann
The use of machine-learning algorithms capable of rapidly completing intensive computations may be an answer to processing the sheer volumes of... Sample PDF
Using "Blackbox" Algorithms Such AS TreeNET and Random Forests for Data-Ming and for Finding Meaningful Patterns, Relationships and Outliers in Complex Ecological Data: An Overview, an Example Using G
$37.50
Chapter 5
Eulalia Szmidt, Marta Kukier
We present a new method of classification of imbalanced classes. The crucial point of the method lies in applying Atanassov’s intuitionistic fuzzy... Sample PDF
A New Approach to Classification of Imbalanced Classes via Atanassov's Intuitionistic Fuzzy Sets
$37.50
Chapter 6
Arun Kulkarni, Sara McCaslin
This chapter introduces fuzzy neural network models as means for knowledge discovery from databases. It describes architectures and learning... Sample PDF
Fuzzy Neural Network Models for Knowledge Discovery
$37.50
Chapter 7
Ivan Bruha
This chapter discusses the incorporation of genetic algorithms into machine learning. It does not present the principles of genetic algorithms... Sample PDF
Genetic Learning: Initialization and Representation Issues
$37.50
Chapter 8
Evolutionary Computing  (pages 131-142)
Thomas E. Potok, Xiaohui Cui, Yu Jiao
The rate at which information overwhelms humans is significantly more than the rate at which humans have learned to process, analyze, and leverage... Sample PDF
Evolutionary Computing
$37.50
Chapter 9
M. C. Bartholomew-Biggs, Z. Ulanowski, S. Zakovic
We discuss some experience of solving an inverse light scattering problem for single, spherical, homogeneous particles using least squares global... Sample PDF
Particle Identification Using Light Scattering: A Global Optimization Problem
$37.50
Chapter 10
Dominic Savio Lee
This chapter describes algorithms that use Markov chains for generating exact sample values from complex distributions, and discusses their use in... Sample PDF
Exact Markov Chain Monte Carlo Algorithms and Their Applications in Probabilistic Data Analysis and Inference
$37.50
Chapter 11
J. P. Ganjigatti, Dilip Kumar Pratihar
In this chapter, an attempt has been made to design suitable knowledge bases (KBs) for carrying out forward and reverse mappings of a Tungsten inert... Sample PDF
Design and Development of Knowledge Bases for Forward and Reverse Mappings of TIG Welding Process
$37.50
Chapter 12
Malcolm J. Beynon
This chapter considers the role of fuzzy decision trees as a tool for intelligent data analysis in domestic travel research. It demonstrates the... Sample PDF
A Fuzzy Decision Tree Analysis of Traffic Fatalities in the US
$37.50
Chapter 13
Dymitr Ruta, Christoph Adl, Detlef Nauck
In the telecom industry, high installation and marketing costs make it six to 10 times more expensive to acquire a new customer than it is to retain... Sample PDF
New Churn Prediction Strategies in the Telecom Industry
$37.50
Chapter 14
Malcolm J. Beynon
This chapter demonstrates intelligent data analysis, within the environment of uncertain reasoning, using the recently introduced CaRBS technique... Sample PDF
Intelligent Classification and Ranking Analyses Using CARBS: Bank Rating Applications
$37.50
Chapter 15
Fei-Chen Hsu, Hsiao-Fan Wang
In this chapter, we used Cumulative Prospect Theory to propose an individual risk management process (IRM) including a risk analysis stage and a... Sample PDF
Analysis of Individual Risk Attitude for Risk Management Based on Cumulative Prospect Theory
$37.50
Chapter 16
Francesco Giordano, Michele La Rocca, Cira Perna
This chapter introduces the use of the bootstrap in a nonlinear, nonparametric regression framework with dependent errors. The aim is to construct... Sample PDF
Neural Networks and Bootstrap Methods for Regression Models with Dependent Errors
$37.50
Chapter 17
Lean Yu, Shouyang Wang, Kin Keung Lai
Financial crisis is a kind of typical rare event, but it is harmful to economic sustainable development if occurs. In this chapter, a... Sample PDF
Financial Crisis Modeling and Prediction with a Hilbert-EMD-Based SVM Approachs
$37.50
Chapter 18
Chun-Jung Huang, Hsiao-Fan Wang, Shouyang Wang
One of the key problems in supervised learning is due to the insufficient size of the training data set. The natural way for an intelligent learning... Sample PDF
Virtual Sampling with Data Construction Analysis
$37.50
About the Contributors