Improved Hybrid Sampling Strategy for Software Defect Prediction of Imbalanced Data Distribution

K. Nitalaksheswara Rao
ISBN13: 9781668442258 | ISBN10: 1668442256 | EISBN13: 9781668442272
DOI: 10.4018/978-1-6684-4225-8.ch013
MLA

Rao, K. Nitalaksheswara. "Improved Hybrid Sampling Strategy for Software Defect Prediction of Imbalanced Data Distribution." Futuristic Trends for Sustainable Development and Sustainable Ecosystems, edited by Fernando Ortiz-Rodriguez, et al., IGI Global, 2022, pp. 215-236. https://doi.org/10.4018/978-1-6684-4225-8.ch013

APA

Rao, K. N. (2022). Improved Hybrid Sampling Strategy for Software Defect Prediction of Imbalanced Data Distribution. In F. Ortiz-Rodriguez, S. Tiwari, S. Iyer, & J. Medina-Quintero (Eds.), Futuristic Trends for Sustainable Development and Sustainable Ecosystems (pp. 215-236). IGI Global. https://doi.org/10.4018/978-1-6684-4225-8.ch013

Chicago

Rao, K. Nitalaksheswara. "Improved Hybrid Sampling Strategy for Software Defect Prediction of Imbalanced Data Distribution." In Futuristic Trends for Sustainable Development and Sustainable Ecosystems, edited by Fernando Ortiz-Rodriguez, et al., 215-236. Hershey, PA: IGI Global, 2022. https://doi.org/10.4018/978-1-6684-4225-8.ch013

Abstract

Software defect prediction using data mining techniques is one of the best practices for finding defective modules. Existing classification techniques support efficient knowledge discovery on normal datasets, that is, datasets whose classes are roughly balanced. Most real-world data sources, however, are biased towards one of the classes; such sources are known as class-imbalanced or skewed data sources. The defect prediction rate on class-imbalanced datasets falls as the degree of imbalance increases. Handling such datasets well requires an approach designed specifically for them. In this chapter, the authors propose an algorithm known as the improved integrated sampling strategy (IISS), which uses a noise-removal strategy to improve software defect prediction performance. Experimental analysis on skewed software defect prediction datasets shows that the IISS algorithm performs well compared with the C4.5, C4.5+Balance, RF, and RF+Balance algorithms across various class imbalance evaluation measures.
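
The abstract does not give the steps of IISS, so the following is only a generic illustration of what a hybrid sampling pipeline with noise removal can look like, not the chapter's algorithm. It is a minimal sketch under stated assumptions: noise is removed with a nearest-neighbour rule applied to the majority class, and the minority class is then rebalanced by random duplication. All function names, the parameter `k`, and the choice of both techniques are hypothetical and chosen for illustration.

```python
import random
from collections import Counter


def sq_dist(a, b):
    """Squared Euclidean distance between two feature vectors."""
    return sum((p - q) ** 2 for p, q in zip(a, b))


def remove_noisy_majority(X, y, majority, k=3):
    """Drop majority-class samples whose k nearest neighbours mostly
    carry a different label (an assumed nearest-neighbour noise rule,
    not the rule used in the chapter)."""
    keep = []
    for i, (xi, yi) in enumerate(zip(X, y)):
        if yi != majority:
            keep.append(i)  # minority samples are never removed here
            continue
        others = [j for j in range(len(X)) if j != i]
        nearest = sorted(others, key=lambda j: sq_dist(xi, X[j]))[:k]
        agree = sum(1 for j in nearest if y[j] == yi)
        if 2 * agree >= len(nearest):
            keep.append(i)  # at least half of the neighbours agree
    return [X[i] for i in keep], [y[i] for i in keep]


def oversample_minority(X, y, minority, seed=0):
    """Duplicate randomly chosen minority samples until the minority
    class is as large as the largest class (assumed balancing step)."""
    rng = random.Random(seed)
    counts = Counter(y)
    target = max(counts.values())
    pool = [x for x, label in zip(X, y) if label == minority]
    X_out, y_out = list(X), list(y)
    for _ in range(target - counts[minority]):
        X_out.append(rng.choice(pool))
        y_out.append(minority)
    return X_out, y_out


def hybrid_sample(X, y, majority=0, minority=1, k=3, seed=0):
    """Clean the majority class first, then rebalance the minority class."""
    X_clean, y_clean = remove_noisy_majority(X, y, majority, k)
    return oversample_minority(X_clean, y_clean, minority, seed)
```

In a defect prediction setting, `X` would hold module-level metrics and `y` the defective/non-defective label, with the resampled data passed to a classifier such as C4.5 or a random forest. The sketch cleans before oversampling so that mislabelled or borderline majority samples are not left sitting among the duplicated minority samples; whether IISS orders its steps this way is not stated in the abstract.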