Special Offers
- IGI Global’s New Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 17 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no additional cost for multi-user licensing, no embargo of content, full-text PDF & HTML format, and more.
  Learn More
- Open Access Book (Free Access) - Encyclopedia of Information Science and Technology, Sixth Edition (ISBN: 9781668473665)
  The Encyclopedia of Information Science and Technology, Sixth Edition) continues the legacy set forth by the first five editions by providing comprehensive coverage and up-to-date definitions of the most important issues, concepts, and trends pertaining to technological advancements and information management within a variety of settings and industries. The entire book is being published under open access.
  Read Now
- Open Access Book (Free Access) - Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries (ISBN: 9781668456293)
  Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries provides information on the recent technology, mitigation, and environmental protection that must be applied for food sustainability in developing countries. This book is being published under Platinum Open Access through funding from Diponegoro University, Indonesia.
  Read Now
- Open Access Book (Free Access) - New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY (ISBN: 9781668438091)
  The Walmart Corporation and the Lumina Foundation have provided funding to make New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY fully open access, completely removing any paywall between scholars in education and the latest research on new models for the future of higher education.
  Read Now
- Open Access Book (Free Access) - Handbook of Research on the Global View of Open Access and Scholarly Communications (ISBN: 9781799898054)
  Through a collaboration between IGI Global and the University of North Texas, the Handbook of Research on the Global View of Open Access and Scholarly Communications has been published as fully open access, completely removing any paywall between researchers of any field, and the latest research on the equitable and inclusive nature of Open Access and all of its complications.
  Read Now
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Journals by Field
e-Collections
Open Access
- View All Open Access Opportunities
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through IGI Global’s Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open accesss endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global to publish your work under open access? Review IGI Global’s open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us
Newsroom

A Local Approach and Comparison with Other Data Mining Approaches in Software Application

QingE Wu, Weidong Yang

Source Title: Examining Information Retrieval and Image Processing Paradigms in Multidisciplinary Contexts

DOI: 10.4018/978-1-5225-1884-6.ch001

OnDemand:

(Individual Chapters)

Available

$37.50

Current Special Offers

No Current Special Offers

Abstract

In order to complete an online, real-time and effective aging detection to software, this paper studies a local approach that is also called a fuzzy incomplete and a statistical data mining approaches, and gives their algorithm implementation in the software system fault diagnosis. The application comparison of the two data mining approaches with four classical data mining approaches in software system fault diagnosis is discussed. The performance of each approach is evaluated from the sensitivity, specificity, accuracy rate, error classified rate, missed classified rate, and run-time. An optimum approach is chosen from several approaches to do comparative study. On the data of 1020 samples, the operating results show that the fuzzy incomplete approach has the highest sensitivity, the forecast accuracy that are 96.13% and 94.71%, respectively, which is higher than those of other approaches. It has also the relatively less error classified rate is or so 4.12%, the least missed classified rate is or so 1.18%, and the least runtime is 0.35s, which all are less than those of the other approaches. After the performance, indices are all evaluated and synthesized, the results indicate the performance of the fuzzy incomplete approach is best. Moreover, from the test analysis known, the fuzzy incomplete approach has also some advantages, such as it has the faster detection speed, the lower storage capacity, and does not need any prior information in addition to data processing. These results indicate that the mining approach is more effective and feasible than the old data mining approaches in software aging detection.

Chapter Preview

Top

Introduction

Because of the rapid increase of measurement data in engineering application and the participation of human, the uncertainty of information in data is more prominent, and the relationship among data is more complex. How to mine some potential and useful information from plentiful, fuzzy, disorderly and unsystematic, strong interferential data, so as to perform real-time and effective engineering applications, this is a problem needs to be urgently further study.

Data mining is a process of selection, exploration and modeling to a mass of data for discovering beforehand unknown rules and relations, whose purpose is to get some clear and useful results for the owner of the database (Giudici et al., 2004).

The spread speed of data mining was very fast, and its application scope was widespread day by day (Giudici et al., 2004, Liang 2006, Zhang et al., 2008, Hu et al., 2008, Liao and Yang, 2009, Chen et al., 2008). The literatures provided several data mining algorithms and some applications in engineering, and introduced three data mining algorithms in medicine applications. However, the data mining industry was still in the initial stage of development in China, the domestic industries basically didn’t have their own data mining systems.

In 1989 (Arai, 1989), at the 11^th International symposium on Artificial Intelligence, scholars first proposed the conception of knowledge discovery in database (KDD). At the United States’ annual meeting on Computer in 1995, some scholars began to regard data mining as a fundamental step in knowledge discovery in databases, or discussed the two as synonyms.

Now, some algorithms on data mining have been relatively mature (Arai, 1989), (Farzanyar, Kangavari et al., 2012), (Qiu and Tamhane, 2007), (Wolff, Bhaduri et al., 2009), (Balzano and Del Sorbo, 2007), (Alp, Büyükbebeci et al., 2011). The decision Tree algorithm based on CHAID, some rules generated by Scenario could be applied to the unclassified data set to predict which records would have promising results. Scenario’s decision tree algorithm is very flexible, which gives the user the choice to split any variable, or the choice of splitting with statistical significance. He carried out the graphical analysis to the crude data by using the fold line chart, histogram and scatter plot. Liang Xun listed several main software developers on data mining (Liang, 2006).

This paper introduces two new approaches on data mining, uses them and other four classical supervised learning data mining technologies to learn and classify 1020 data, validates the feasibility and effectiveness for the new data mining approaches, and compares the performance of each approach with each other, so as to hope that can select an optimum mining approach for fault diagnosis in software system. The neural network (NN), support vector machine (SVM), decision tree and logistic regression are the best approaches to depict the nonlinearity of data in the data mining, moreover, the fuzzy incomplete and statistical approaches can also depict the nonlinearity of data, so they are very suitable for the characteristic of data of fault diagnosis in software system. This paper evaluates the performance of each approach from sensitivity, specificity, accuracy, error classified rate, missed classified rate, respectively, and also records the running time on the Pentium 4, 2.66GHz, 1GB memory machine, uses the 6 indexes as standards to evaluate the advantages and disadvantages of each approach, and selects an approach with optimal performance from these approaches as the approach of fault diagnosis in software system.

Complete Chapter List

Search this Book:

Reset

MLA

APA

Chicago

Export Reference

A Local Approach and Comparison with Other Data Mining Approaches in Software Application

Abstract

Introduction

Complete Chapter List