Special Offers
- IGI Global’s New Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 17 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no additional cost for multi-user licensing, no embargo of content, full-text PDF & HTML format, and more.
  Learn More
- Open Access Book (Free Access) - Encyclopedia of Information Science and Technology, Sixth Edition (ISBN: 9781668473665)
  The Encyclopedia of Information Science and Technology, Sixth Edition) continues the legacy set forth by the first five editions by providing comprehensive coverage and up-to-date definitions of the most important issues, concepts, and trends pertaining to technological advancements and information management within a variety of settings and industries. The entire book is being published under open access.
  Read Now
- Open Access Book (Free Access) - Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries (ISBN: 9781668456293)
  Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries provides information on the recent technology, mitigation, and environmental protection that must be applied for food sustainability in developing countries. This book is being published under Platinum Open Access through funding from Diponegoro University, Indonesia.
  Read Now
- Open Access Book (Free Access) - New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY (ISBN: 9781668438091)
  The Walmart Corporation and the Lumina Foundation have provided funding to make New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY fully open access, completely removing any paywall between scholars in education and the latest research on new models for the future of higher education.
  Read Now
- Open Access Book (Free Access) - Handbook of Research on the Global View of Open Access and Scholarly Communications (ISBN: 9781799898054)
  Through a collaboration between IGI Global and the University of North Texas, the Handbook of Research on the Global View of Open Access and Scholarly Communications has been published as fully open access, completely removing any paywall between researchers of any field, and the latest research on the equitable and inclusive nature of Open Access and all of its complications.
  Read Now
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Journals by Field
e-Collections
OnDemand
Open Access
- View All Open Access Opportunities
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through IGI Global’s Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open accesss endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global to publish your work under open access? Review IGI Global’s open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us

A Perturbation Method Based on Singular Value Decomposition and Feature Selection for Privacy Preserving Data Mining

Mohammad Reza Keyvanpour, Somayyeh Seifi Moradi

Source Title: Business Intelligence: Concepts, Methodologies, Tools, and Applications

DOI: 10.4018/978-1-4666-9562-7.ch015

OnDemand:

(Individual Chapters)

Available

$37.50

Current Special Offers

No Current Special Offers

Abstract

In this study, a new model is provided for customized privacy in privacy preserving data mining in which the data owners define different levels for privacy for different features. Additionally, in order to improve perturbation methods, a method combined of singular value decomposition (SVD) and feature selection methods is defined so as to benefit from the advantages of both domains. Also, to assess the amount of distortion created by the proposed perturbation method, new distortion criteria are defined in which the amount of created distortion in the process of feature selection is considered based on the value of privacy in each feature. Different tests and results analysis show that offered method based on this model compared to previous approaches, caused the improved privacy, accuracy of mining results and efficiency of privacy preserving data mining systems.

Chapter Preview

Top

Introduction

Data mining or knowledge discovery is a process that analyzes voluminous digital data in order to discover hidden but effective patterns from digital data (Ashrafi, Taniar, & Smith, 2005). In other words, this is a powerful tool for data analysis, with the goal of accurate and efficient identification of hidden and valuable patterns in the data, can facilitate the process of decision making, improve the allocation of resources, reduce costs and the exploitation of opportunities. Data mining is tip-top described as the union of historical and recent developments in statistics, artificial intelligence, and machine learning. These methods are then used together to study information and find previously hidden trends or patterns within (Daly, & Taniar, 2004). Data mining applications have extremely altered the strategic decision-making procedures of organizations (Tjioe & Taniar, 2005). Hence, the various applications of this scope are used by various governmental, industrial, commercial, medical, financial, and scientific due to several advantages. In fact, wide range of data mining applications has made it an important field of research (Keyvanpour, Javadieh, & Ebrahimi, 2011).

As privacy is an issue of individual perception, an infallible and general solution to this dichotomy is infeasible. However, there are measures that can be undertaken to raise privacy protection (Wahlstrom, Roddick, Sarre, Estivill-Castro, & de Vries, 2009). Accordingly in recent years due to increasing concerns related to privacy, data mining methods are faced with a serious challenge which is to preserve the privacy of sensitive data. This method is under attack from privacy advocates because of a misunderstanding about what it really is and a credible concern about how it’s generally done (Vaidya & Clifton, 2004). The organizations from one side should publish their customized information so as to access the benefits of data mining and on the other hand, are not unwilling to share their data due to preserving the privacy. The occurrence of such problems in data collection can be undesirable for data mining methods success as to achieve its goals (Seifi & Keyvanpour, 2012).

Hence, a new aspect of in the development of data mining is the approaches which are related to the concerns about privacy, in particular, in regard to this issue that data mining methods can produce accurate models without access to precise information of given records and to access valid results of the data mining (Clifton, Kantarcioglu, & Vaidya, 2002). In response to such anxieties, the data mining researches started to work on methods which preserved privacy along with data mining. As a result of this research, various approaches of privacy preserving data mining (PPDM) approaches are defined.

Data modification is one of the most popular approaches of privacy preserving data mining, especially for applications that require data owners to publish their personal and sensitive data. In this way, the data prior to publication are changed through certain methods so as to hide sensitive information (Keyvanpour & Seifi, 2010).

Approaches based on the data modification usually have good efficiency in terms of calculation but possess a few guarantees in preserving privacy and create balance with difficulty between ensuring privacy and data utility (important information and patterns existing in the data which should be preserved during data modification so that the accuracy of the data mining results in one level should be acceptable). As a result, the main challenge of the data modification based methods is to create a good and fair balance between privacy and data utility (Liu, Giannella, & Kargupta, 2006).

Recently, one of the most effective approaches to meet the challenges in privacy preserving data mining is the use of methods based on dimension reduction. The above methods operate based on this idea that they first identify worthless information in the dataset and then eliminate these worthless data so as to be perturbed. On the other side, since in the data mining applications, the eliminated parts are considered as noise, in many cases, the use of these methods can produce better results in terms of accuracy compared to mining on the original dataset (Xu, Zhang, Han, & Wang, 2006). One of the dimension reduction based methods which is used in PPDM is a Singular Value Decomposition (SVD) method (Keyvanpour & Seifi, 2010).

Complete Chapter List

Search this Book:

Reset

MLA

APA

Chicago

Export Reference

A Perturbation Method Based on Singular Value Decomposition and Feature Selection for Privacy Preserving Data Mining

Abstract

Introduction

Complete Chapter List