Prediction of Bike Share Demand by Machine Learning: Role of Vehicle Accident as the New Feature

Prediction of Bike Share Demand by Machine Learning: Role of Vehicle Accident as the New Feature

Tae You Kim, Min Jae Park, Jiho Shin, Sungwon Oh
Copyright: © 2022 |Pages: 16
DOI: 10.4018/IJBAN.288513
Article PDF Download
Open access articles are freely available for download

Abstract

In the fourth industrial revolution period, multinational companies and start-ups have applied a sharing economy concept to their business and have attempted to better serve customer demand by integrating demand prediction results into their business operations. For survival amongst today’s fierce competition, companies need to upgrade their prediction model to better predict customer demand in a more accurate manner. This study explores a new feature for bike share demand prediction models that resulted in an improved RMSLE score. By applying this new feature, the number of daily vehicle accidents reported in the Washington, D.C. area, to the Random Forest, XGBoost, and LightGBM models, the RMSLE score results improved. Many previous studies have primarily focused on feature engineering and regression techniques within given dataset. However, this study is meaningful because it focuses more on finding a new feature from an external data source.
Article Preview
Top

1. Introduction

Predicting customer demand accurately in the data-driven business environment of the mobility industry is a key factor of success (Sohrabi et al., 2020; Wessel, 2020). In the data technology period, it is very easy to find multinational companies that provide their various services to customers based on demand prediction results. In the short term, companies have achieved success because they have sufficiently predicted demand based on internal and external features. Digital transformation is occurring in most industries around the world based on machine learning and deep learning algorithms (Veres & Moussa, 2019; Wang et al., 2019). In particular, rapidly growing start-ups are often making their important business decisions based on data and algorithms, not on management experience or intuition.

In a modern society where environmental problems are increasingly serious, the bike is an eco-friendly means of transportation that benefits from individuals maximizing its exercise effects during the COVID-19 epidemic. The bike has attracted worldwide attention since a few years ago as it has been transformed into a “shared” good. It is not just for leisure purposes, but for the transportation itself (Böcker et al., 2020; Gu et al., 2019). Shared bike programs in many cities around the world are considered a means of responding to climate change and energy crises when implementing sustainable transportation systems. A shared bike system is an important and necessary means to promote the use of bikes and further implement an efficient urban transportation system. However, despite its many advantages, the number of cities and businesses that are reluctant to introduce a shared bike program is increasing due to the disadvantages of high fixed costs such as installation cost or operating cost. Thus, an accurate demand prediction is required for the continuous operation of a bike sharing program. One of the earliest shared economy models in the mobility industry was the bike sharing system. In particular, Capital Bikeshare, which launched the bike rental business in the Washington, D.C. area, hosted a data science competition in the Kaggle Competition platform to predict customer bike share demand (Fanaee-T & Gama, 2014). In response, a number of efforts have been made by data scientists and data analysts around the world to predict demand by using various data mining techniques.

Shared bikes are emerging as an alternative means of transportation that can comprehensively improve environmental problems, traffic jams, poor quality of life caused by car-oriented transportation systems, and can guarantee practical effects (de Chardon, 2019; Gu et al., 2019). In addition, shared bikes have several advantages when establishing an integrated public transportation system in connection with public buses and urban rail systems. First, the average distance of a bike trip is around 3-5 km (last mile), but it is possible to travel longer distances in connection with public transportation. Secondly, the catchment area of public transportation can be extended from an existing pedestrian area to an area coverable by bike. Third, during non-adjacent hours (except rush hour when traffic demand is concentrated during the day), highly demanded shared bikes can accommodate leisure traffic on existing transit, which can increase the overall utilization rate of public transportation. Bikes are the best means to realize an integrated public transportation system by supporting traffic in combination with walking or public transportation (Böcker et al., 2020; Gu et al., 2019). An integrated public transportation system can increase the availability of public transportation by both merging public transportation services, such as buses or metros, into high capacity corridors where economies of scale are built and strengthening cooperation with corridors or feeders using bikes (Gu et al., 2019). Therefore, it is imperative to predict the demand for shared bikes to efficiently implement an integrated public transportation system that includes shared bikes.

Complete Article List

Search this Journal:
Reset
Volume 11: 1 Issue (2024)
Volume 10: 1 Issue (2023)
Volume 9: 6 Issues (2022): 4 Released, 2 Forthcoming
Volume 8: 4 Issues (2021)
Volume 7: 4 Issues (2020)
Volume 6: 4 Issues (2019)
Volume 5: 4 Issues (2018)
Volume 4: 4 Issues (2017)
Volume 3: 4 Issues (2016)
Volume 2: 4 Issues (2015)
Volume 1: 4 Issues (2014)
View Complete Journal Contents Listing