Improved Hybrid Sampling Strategy for Software Defect Prediction of Imbalanced Data Distribution

K. Nitalaksheswara Rao

Source Title: Futuristic Trends for Sustainable Development and Sustainable Ecosystems

DOI: 10.4018/978-1-6684-4225-8.ch013


Abstract

Software defect prediction using data mining techniques is one of the best practices for finding defective modules. Existing classification techniques support efficient knowledge discovery on normal datasets. Most real-world data sources, however, are biased towards one of the classes; such sources are known as class-imbalanced or skewed data sources. The defect prediction rate on class-imbalanced datasets falls as the degree of imbalance increases. Handling such datasets requires a specifically designed approach for improved performance. In this chapter, the authors propose an algorithm known as the improved integrated sampling strategy (IISS), which uses a noise removal strategy to improve software defect prediction. Experimental analysis on skewed software defect prediction datasets shows that the IISS algorithm performs well compared with the C4.5, C4.5+Balance, RF, and RF+Balance algorithms across various class imbalance evaluation measures.

Introduction

Software engineering is the process of building software with the properties its users require. The process consists of several phases, such as requirement analysis, design, coding, and testing. Complete or exhaustive testing to find all the errors in the software modules is a tedious job.

This research uses a unique strategy that replicates and generates instances in the minority subset while at the same time reducing the instances in the majority subset. The proposed technique is known as the improved integrated sampling strategy (IISS) because it integrates both sampling strategies in a single method. The rationale behind combining the two strategies is to address the issues of both the majority and minority subsets. Combining these strategies in a single method is challenging, because their counter effects must be properly taken into account in the learning process for the class imbalance problem of software defect prediction.
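The general idea of sampling both subsets at once can be sketched as follows. This is a minimal illustration of generic hybrid sampling, not the chapter's IISS algorithm: it uses plain random undersampling and random replication, and the function name `hybrid_resample`, the `minority_label` parameter, and the "meet at the mean class size" target are all assumptions made for the example.

```python
import numpy as np

def hybrid_resample(X, y, minority_label=1, seed=0):
    """Randomly undersample the majority class and replicate minority rows.

    Generic hybrid-sampling illustration (hypothetical helper), not IISS.
    Both classes are moved toward the mean of the two class sizes.
    """
    rng = np.random.default_rng(seed)
    X = np.asarray(X)
    y = np.asarray(y)

    min_idx = np.flatnonzero(y == minority_label)
    maj_idx = np.flatnonzero(y != minority_label)

    # Assumed target: the average of the two class sizes.
    target = (len(min_idx) + len(maj_idx)) // 2

    # Undersampling: keep a random subset of majority rows, without replacement.
    keep_maj = rng.choice(maj_idx, size=min(target, len(maj_idx)), replace=False)

    # Oversampling: replicate random minority rows until the target is reached.
    n_extra = max(target - len(min_idx), 0)
    extra_min = rng.choice(min_idx, size=n_extra, replace=True)

    idx = np.concatenate([keep_maj, min_idx, extra_min])
    rng.shuffle(idx)
    return X[idx], y[idx]
```

With 90 non-defective and 10 defective modules, this sketch returns 50 rows of each class. Random replication adds no new information, which is one reason methods in the literature generate or select instances more carefully.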

Figure 1.

A common software defect prediction process

A software defect prediction method for class-imbalanced data needs to be accurate and precise despite the very small number of defective-module instances. In practice, models developed this way are often ineffective because of the very high imbalance ratio. In this study, we propose to use correlation-based oversampling, an instance-range-specific undersampling strategy, and improved integrated sampling techniques to help improve both the majority and minority subsets. The main rationale behind the approach is to use the feature-to-feature correlation index and the feature-to-class correlation index in the improved correlation-based oversampling algorithm to learn the range of instances. The proposals are supported by a sound experimental setup for evaluating class-imbalanced software defect datasets, and they significantly improve classification over a decision tree baseline.
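The preview does not define the two correlation indices, so the sketch below assumes they are plain Pearson correlations: between every pair of features, and between each feature and a numerically coded class label. The function name `correlation_indices` is hypothetical, and the chapter's actual indices may be computed differently.

```python
import numpy as np

def correlation_indices(X, y):
    """Return (feature-to-feature, feature-to-class) Pearson correlations.

    Assumption: both indices are ordinary Pearson coefficients.
    Constant columns have zero variance and yield NaN entries.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)

    # Append the class label as a last column so one call covers both indices.
    full = np.corrcoef(np.column_stack([X, y]), rowvar=False)

    feat_feat = full[:-1, :-1]   # d x d matrix over the d features
    feat_class = full[:-1, -1]   # length-d vector, one value per feature
    return feat_feat, feat_class
```

A feature with a high feature-to-class value and low correlation with the other features carries class information that the others do not, which is the usual motivation for looking at both quantities together.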

Recent research on software defect prediction has placed little emphasis on making prediction effective in all scenarios (Zi, Li, 2018). Software defect prediction is also usually treated in a class-balanced framework, where all classes are regarded as equal. The main focus of our research is to overcome the issues that a high imbalance ratio raises in the knowledge discovery process of software defect prediction. The proposed Improved Integrated Sampling Strategy (IISS) can effectively handle knowledge discovery from skewed software defect prediction datasets.
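The imbalance ratio mentioned above is not defined in the preview. A common convention, assumed here, is the size of the largest class divided by the size of the smallest; the helper name `imbalance_ratio` is hypothetical.

```python
from collections import Counter

def imbalance_ratio(labels):
    """Largest class count divided by smallest class count (assumed definition)."""
    counts = Counter(labels)
    if len(counts) < 2:
        raise ValueError("need at least two classes to compute a ratio")
    return max(counts.values()) / min(counts.values())
```

Under this definition, a dataset with 90 non-defective and 10 defective modules has an imbalance ratio of 9, and a perfectly balanced dataset has a ratio of 1.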

The remainder of the chapter is organized as follows. Recent literature on class imbalance learning in connection with software defect prediction is presented in section 2. The detailed problem statement with objectives is presented in section 3. The proposed approach of Integrated Sampling Strategy is introduced in section 4 and presented in detail in section 5. The dataset and evaluation criteria are presented in section 6. The proposed algorithm is compared with benchmark algorithms in section 7. Section 8 presents concluding remarks and the scope for future work.

Related Work

Execution traces from special-purpose databases have been analyzed to improve quality in the software development process (Florian, 2019). Different inherent properties, strategic learning tools, and performance evaluation models have been used to review software quality analysis (Ayse Tousan, 2009). A better paradigm of software testing strategy has been presented for consumer support and maintenance in the field of durable electronic systems (Hasan, 2012).
