A Study on Prediction Performance Measurement of Automated Machine Learning: Focusing on WiseProphet, a Korean Auto ML Service

A Study on Prediction Performance Measurement of Automated Machine Learning: Focusing on WiseProphet, a Korean Auto ML Service

Euntack Im, Jina Lee, Sungbyeong An, Gwangyong Gim
Copyright: © 2023 |Pages: 11
DOI: 10.4018/IJSI.315656
Article PDF Download
Open access articles are freely available for download

Abstract

In digital economics, where value creation using big data becomes important, the ability to analyze data using machine learning and deep learning technology is a key activity in corporate activities. Nevertheless, companies consider it difficult to introduce machine learning and artificial intelligence technologies because they need an understanding of the business as well as data and analysis algorithms. Accordingly, services such as automated machine learning have emerged for easy use of machine learning. In this study, the authors explored the automated machine learning service and compared the random forest and extreme gradient boosting analysis results using WiseProphet and Python. WiseProphet is used as a representative of automated machine learning solutions because it is a cloud-based service that anyone can easily access and can be used in various ways. It is contrasted with the model implemented by Python, which writes code with No coding. As a result of comparing the prediction performance, WiseProphet automatically outperformed the analysis result by parameter optimization.
Article Preview
Top

Introduction

The Fourth Industrial Revolution, which refers to digital transformation, created a digital economic system in which economic activities were carried out as a major factor in digital input such as digital technology, infrastructure, and data. As the COVID-19 pandemic promoted a non-face-to-face society centered on digital technology, the digital era came as digital technology was brought into a mainstream way of life, not an instrumental dimension for efficiency. Changes in the current economic environment require the use of strategic digital technology by companies (Jeong, 2019).

Through this, the corporate mindset is changing from product-oriented thinking to solving problems and inconveniences experienced by customers. Key accelerating factors include Connectivity and data collection (Lee et al, 2021). D.N.A (Data, Network, AI) is a key technology in the digital era and is rapidly growing around new businesses of companies using D.N.A. The existing resource and capital-oriented business model is not only changing to a data and artificial intelligence based business model, but also developing differentiated services using machine learning in existing industries such as finance and energy. In addition, D.N.A. is used in corporate activities in various ways such as reducing repetitive tasks or reducing existing costs by using it as a reference for important decisions(Jeong & Kim, 2021).

However, the Korea Institute for Information and Communication Policy suffers from a lack of quality data, difficulty in specifying tasks, understanding AI technology, and lack of internal talent in preparation, model development, and service operation after introduction. In a survey of 152 AI demand and supply companies in Korea, 34% of respondents chose a shortage of internal manpower as an obstacle to the introduction of AI technology (total %: 200%)(Lee&Kim, 2021). Also, Glue Coding, a simple task of putting multiple cords together, accounts for 90% of the total work in the stage of model development using machine learning. The version management of machine learning, the heterogeneity of the development environment, and the actual environment cause inefficiency and causes machine learning project failure(Valohai et al, 2021).

Automated Machine Learning means automating the steps from data preprocessing to model learning to build a model using machine learning. In automatic data processing, feature selection, machine learning algorithm selection, and parameter optimization, unnecessary repetitive tasks in place of human settings and coding can be reduced to enable efficient machine learning utilization projects(Simkek, 2019; Dawid, 2021).

Therefore, the introduction of Automated Machine Learning can contribute to the development of new business models through the universal data utilization of companies, as companies can reduce the probability of failure by reducing inefficiency in machine learning projects.

Thus, this study used python to evaluate the performance of automated machine learning that is being introduced. Random Forest and Extreme Gradient Boosting models were used to predict prediction accuracy. For comparison data, 'the German Credit Prediction' dataset(UCI machine learning, 2016), which is released on the big data platform Kaggle, was used.

Complete Article List

Search this Journal:
Reset
Volume 12: 1 Issue (2024)
Volume 11: 1 Issue (2023)
Volume 10: 4 Issues (2022): 2 Released, 2 Forthcoming
Volume 9: 4 Issues (2021)
Volume 8: 4 Issues (2020)
Volume 7: 4 Issues (2019)
Volume 6: 4 Issues (2018)
Volume 5: 4 Issues (2017)
Volume 4: 4 Issues (2016)
Volume 3: 4 Issues (2015)
Volume 2: 4 Issues (2014)
Volume 1: 4 Issues (2013)
View Complete Journal Contents Listing