A Comparison of Revision Schemes for Cleaning Labeling Noise

A Comparison of Revision Schemes for Cleaning Labeling Noise

Chuck P. Lam, David G. Stork
Copyright: © 2008 |Pages: 13
ISBN13: 9781599045283|ISBN10: 1599045281|ISBN13 Softcover: 9781616926564|EISBN13: 9781599045306
DOI: 10.4018/978-1-59904-528-3.ch013
Cite Chapter Cite Chapter

MLA

Lam, Chuck P., and David G. Stork. "A Comparison of Revision Schemes for Cleaning Labeling Noise." Mathematical Methods for Knowledge Discovery and Data Mining, edited by Giovanni Felici and Carlo Vercellis, IGI Global, 2008, pp. 220-232. https://doi.org/10.4018/978-1-59904-528-3.ch013

APA

Lam, C. P. & Stork, D. G. (2008). A Comparison of Revision Schemes for Cleaning Labeling Noise. In G. Felici & C. Vercellis (Eds.), Mathematical Methods for Knowledge Discovery and Data Mining (pp. 220-232). IGI Global. https://doi.org/10.4018/978-1-59904-528-3.ch013

Chicago

Lam, Chuck P., and David G. Stork. "A Comparison of Revision Schemes for Cleaning Labeling Noise." In Mathematical Methods for Knowledge Discovery and Data Mining, edited by Giovanni Felici and Carlo Vercellis, 220-232. Hershey, PA: IGI Global, 2008. https://doi.org/10.4018/978-1-59904-528-3.ch013

Export Reference

Mendeley
Favorite

Abstract

Data quality is an important factor in building effective classifiers. One way to improve data quality is by cleaning labeling noise. Label cleaning can be divided into two stages. The first stage identifies samples with suspicious labels. The second stage processes the suspicious samples using some revision scheme. This chapter examines three such revision schemes: (1) removal of the suspicious samples, (2) automatic replacement of the suspicious labels to what the machine believes to be correct, and (3) escalation of the suspicious samples to a human supervisor for relabeling. Experimental and theoretical analyses show that only escalation is effective when the original labeling noise is very large or very small. Furthermore, for a wide range of situations, removal is better than automatic replacement.

Request Access

You do not own this content. Please login to recommend this title to your institution's librarian or purchase it from the IGI Global bookstore.