1. Introduction
The correct selection of performance metrics is one of the key issues in evaluating classifier performance. A number of performance metrics have been proposed for different application scenarios. For example, accuracy, which measures the percentage of correctly classified test instances, remains the primary metric for assessing classifier performance (Ben et al. 2007; Huang et al. 2005); precision and recall are widely applied in information retrieval (Baeza-Yates 1999); and the medical decision-making community prefers the area under the receiver operating characteristic (ROC) curve (i.e., AUC) (Lasko et al. 2005). It is common for a classifier to perform well on one metric but poorly on others. For example, boosted trees and SVM classifiers achieve good classification accuracy while yielding poor root mean squared error (Caruana et al. 2004).
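To make the distinctions among these metrics concrete, the following sketch (illustrative only, not code from this study) derives accuracy, precision, recall, and F-measure from the counts of a binary confusion matrix; all function names are our own.

```python
def confusion_counts(y_true, y_pred):
    """Count TP, FP, TN, FN for binary labels (1 = positive, 0 = negative)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp, fp, tn, fn

def accuracy(tp, fp, tn, fn):
    # Fraction of all test instances classified correctly.
    return (tp + tn) / (tp + fp + tn + fn)

def precision(tp, fp):
    # Fraction of predicted positives that are truly positive.
    return tp / (tp + fp) if (tp + fp) else 0.0

def recall(tp, fn):
    # Fraction of true positives that are recovered.
    return tp / (tp + fn) if (tp + fn) else 0.0

def f_measure(prec, rec):
    # Harmonic mean of precision and recall.
    return 2 * prec * rec / (prec + rec) if (prec + rec) else 0.0
```

For instance, with true labels `[1, 1, 0, 0, 1]` and predictions `[1, 0, 0, 1, 1]`, the counts are (TP, FP, TN, FN) = (2, 1, 1, 1), giving accuracy 0.6 and precision, recall, and F-measure each 2/3; the metrics need not agree in general.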
In general, the widely accepted consensus is to choose performance metrics according to the practical requirements of the specific application. For example, neural networks typically optimize squared error, so root mean squared error reflects the actual performance of such a classifier better than other metrics. However, in some cases specific criteria are unknown in advance, and practitioners tend to select several measures from widely adopted ones, such as classification accuracy, the kappa statistic, F-measure, and AUC, when evaluating a new classifier (Sokolova et al. 2006; Sokolova et al. 2009). Additionally, most metrics are derived from the classifier's confusion matrix, so it is reasonable to expect that some of them are closely related, which may introduce redundancy in measuring classifier performance. On the other hand, it is difficult for practitioners to reach a concrete conclusion when two metrics give conflicting results.
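As a further illustration of how a metric follows from the confusion matrix, the sketch below computes Cohen's kappa statistic for the binary case, correcting observed agreement for agreement expected by chance (again an illustrative implementation of the standard formula, not code from this study):

```python
def cohen_kappa(tp, fp, tn, fn):
    """Cohen's kappa for a binary confusion matrix:
    kappa = (p_o - p_e) / (1 - p_e), where p_o is observed agreement
    and p_e is the agreement expected under chance."""
    n = tp + fp + tn + fn
    p_o = (tp + tn) / n                        # observed agreement (= accuracy)
    p_yes = ((tp + fp) / n) * ((tp + fn) / n)  # chance agreement on the positive class
    p_no = ((fn + tn) / n) * ((fp + tn) / n)   # chance agreement on the negative class
    p_e = p_yes + p_no
    return (p_o - p_e) / (1 - p_e) if p_e != 1 else 1.0
```

Because kappa, accuracy, and F-measure all arise from the same four counts, high correlation among them on many datasets is plausible, which is precisely the redundancy question this study investigates.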
This study provides a strategy for selecting appropriate performance metrics for classifiers by using Pearson linear correlation and Spearman rank correlation to analyze the relationships among seven widely used performance metrics: accuracy, F-measure, kappa statistic, root mean squared error (RMSE), mean absolute error (MAE), AUC, and the area under the precision-recall (PR) curve (AUPRC). We first briefly describe these metrics. Based on their definitions in terms of the confusion matrix, we sketch their characteristic features and preliminarily divide them into three groups: threshold metrics, rank metrics, and probability metrics. We then use correlation analysis to measure the correlations among these metrics. The experimental results show that metrics from the same group are closely correlated but less correlated with metrics from other groups. Additionally, we compare the correlation changes caused by the size and class distribution of the datasets, which are the main factors affecting the measured values.
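The two correlation measures used in the analysis can be sketched as follows; this is a self-contained illustrative implementation of the standard Pearson and Spearman formulas (with average ranks for ties), not code from this study. The inputs would be two vectors of metric scores obtained from the same set of classifiers or datasets.

```python
import math

def pearson(x, y):
    """Pearson linear correlation coefficient between two score vectors."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def ranks(x):
    """Assign ranks (1-based), averaging the ranks of tied values."""
    order = sorted(range(len(x)), key=lambda i: x[i])
    r = [0.0] * len(x)
    i = 0
    while i < len(x):
        j = i
        while j + 1 < len(x) and x[order[j + 1]] == x[order[i]]:
            j += 1
        avg_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            r[order[k]] = avg_rank
        i = j + 1
    return r

def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the rank vectors."""
    return pearson(ranks(x), ranks(y))
```

Pearson captures linear agreement between raw metric values, while Spearman captures agreement in the orderings the metrics induce; using both guards against conclusions that depend on the (often nonlinear) scale of a particular metric.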
The main contributions of this work are summarized as follows. First, we divide the seven performance metrics into three groups by analyzing their definitions. Experimental results confirm that metrics within the same group are highly correlated, while metrics from different groups are weakly correlated. Second, based on the experimental results, we provide practitioners with the following strategies for selecting performance metrics to evaluate a classifier. For balanced training datasets, one should select multiple metrics, with at least one from each group. For imbalanced training datasets, a classifier need not achieve optimal performance on all groups of metrics; as long as it meets the performance requirement of the application as measured by the relevant group(s) of metrics, we recommend adopting it despite less satisfactory performance on other groups.