Demand Forecasting in Supply Chain Management Using Different Deep Learning Methods

Demand Forecasting in Supply Chain Management Using Different Deep Learning Methods

Asma Husna, Saman Hassanzadeh Amin, Bharat Shah
DOI: 10.4018/978-1-7998-3805-0.ch005
(Individual Chapters)
No Current Special Offers


Supply chain management (SCM) is a fast growing and largely studied field of research. Forecasting of the required materials and parts is an important task in companies and can have a significant impact on the total cost. To have a reliable forecast, some advanced methods such as deep learning techniques are helpful. The main goal of this chapter is to forecast the unit sales of thousands of items sold at different chain stores located in Ecuador with holistic techniques. Three deep learning approaches including artificial neural network (ANN), convolutional neural network (CNN), and long short-term memory (LSTM) are adopted here for predictions from the Corporación Favorita grocery sales forecasting dataset collected from Kaggle website. Finally, the performances of the applied models are evaluated and compared. The results show that LSTM network tends to outperform the other two approaches in terms of performance. All experiments are conducted using Python's deep learning library and Keras and Tensorflow packages.
Chapter Preview

1. Introduction

Supply Chain Management (SCM) is a very fast-growing and largely studied field of research that is gaining popularity and importance (Meherishi et al., 2019). According to Mentzer et al., (2001), a supply chain is a collection of some elements that are connected by flows of products, information, and/or services. Most organizations focus on cost optimization and maintaining ideal inventory levels to keep consumer’s satisfaction particularly in SCM of fresh products. Accurate demand forecasts enable industries to predict demand and maintain the right amount of inventory.

Machine Learning (ML) is a subset of Artificial Intelligence (AI). It enables machines for learning from the past data, experiences, and patterns to have correct forecast. Generally, ML means extracting knowledge about future behaviour from the older data. ML approaches mostly fall into three broad categories depending on the nature of the learning system including Supervised, Unsupervised, and Reinforcement Learning (RL). During a supervised learning, a large amount of labelled input data and desired output are provided for learning in the algorithms. In contrast, an unsupervised learning system uses only “unlabelled” input data for learning. Generally, unsupervised algorithms work with raw data for finding hidden patterns and achieve the best result. Reinforcement Learning (RL) is another subcategory of machine learning. RL interacts with a dynamic environment and utilizes trial and error technique to obtain a human-level performance. Besides of the three-fold categorisation, there is another classification which is called semi-supervised learning. In these algorithms usually small amounts of labelled data and large unlabelled data are utilized together.

Deep Learning is a subfield of ML where algorithms are inspired by the human brain to solve complex problems, learn from large amounts of very diverse, unstructured and inter-connected data sets. These algorithmic approaches have various layers (deep) to enable learning. Deep architectures can be supervised or unsupervised. This biologically-inspired programming paradigm currently provides the best solutions to many real-life problems such as image and video processing, speech recognition, text analysis, natural language processing, and different types of classifiers. Deep learning techniques are novel and useful methods for obtaining accurate forecasts in SCM. However, diverse deep learning techniques perform differently on different types of problems, and some techniques perform better than the others.

In this study, the main aim is to predict the unit sales of thousands of items sold at different chain stores in Ecuador to avoid overstocking, minimize understocking, reduce waste and loss, and increase customer’s satisfaction. In this case, good predictions are highly desirable to increase efficiency and determine the prices of products for customers. In this investigation, Corporación Favorita Grocery Sales Forecasting dataset is collected from Kaggle website for forecasting the product sales accurately. Three diverse deep learning methods including Artificial Neural Network (ANN), Convolutional Neural Network (CNN), and Long Short Term Memory (LSTM) are used to build and train the predictive models. In these models, different parameters and weights are used to forecast the unit sales. In this work, some open-source data science tools and Python and packages are used. Furthermore, Mean Squared Error (MSE) and Root Mean Squared Error (RMSE) are adopted as the two indicators for evaluating and comparing the models. The results show that LSTM performs better than ANN and CNN for forecasting the sales units in this case.

This book chapter is structured as follows. Section 2 includes a literature review about some approaches related to this field. Section 3 describes the exploratory analysis of the data, and Section 4 focuses on the systems and the experiment. The obtained results from the experiment are included in Section 5. Finally, conclusions and future research are provided in Section 6. Figure 1(a) represents artificial intelligence, machine learning, and deep learning together. In addition, Figure 1(b) shows the categories of machine learning.

Figure 1.

(a) Artificial Intelligence versus Machine Learning versus Deep Learning, (b) the categories of Machine Learning


Complete Chapter List

Search this Book: