Privacy Preserving Classification of Biomedical Data With Secure Removing of Duplicate Records

Privacy Preserving Classification of Biomedical Data With Secure Removing of Duplicate Records

Boudheb Tarik, Elberrichi Zakaria
DOI: 10.4018/IJOCI.2018070104
OnDemand:
(Individual Articles)
Available
$37.50
No Current Special Offers
TOTAL SAVINGS: $37.50

Abstract

Classifying data is to automatically assign predefined classes to data. It is one of the main applications of data mining. Having complete access to all data is critical for building accurate models. Data can be highly sensitive, such as biomedical data, which cannot be disclosed or shared with third party, because it can harm individuals and organizations. The challenge is how to preserve privacy and usefulness of data. Privacy preserving classification addresses this problem. Collaborative models are constructed over networks without violating the data owners' privacy. In this article, the authors address two problems: privacy records deduplication of the same records and privacy-preserving classification. They propose a randomized hash technic for deduplication and an enhanced privacy preserving classification of biomedical data over horizontally distributed data based on two homomorphic encryptions. No private, intermediate or final results are disclosed. Experimentations show that their solution is efficient and secure without loss of accuracy.
Article Preview
Top

2. State Of The Art

Data preprocessing is a vital task for data mining. It is mainly important for making data appropriate, e.g., avoiding duplicate records, estimating missing data, selecting best attributes, etc. Deduplication is a field of the preprocessing step. It can be local (centralized database), or over distributed data (multiple databases). Removal and prevention of duplication is an essential part of the security (Chang and Ramachandran, 2016). According to the authors (Yigzaw et al., 2017), duplicate records may lead to incorrect statistical results. Therefore, to increase the accuracy of analysis, deduplication is important.

Complete Article List

Search this Journal:
Reset
Volume 14: 1 Issue (2024): Forthcoming, Available for Pre-Order
Volume 13: 1 Issue (2023)
Volume 12: 4 Issues (2022)
Volume 11: 4 Issues (2021)
Volume 10: 4 Issues (2020)
Volume 9: 4 Issues (2019)
Volume 8: 4 Issues (2018)
Volume 7: 4 Issues (2017)
Volume 6: 4 Issues (2016)
Volume 5: 4 Issues (2015)
Volume 4: 4 Issues (2014)
Volume 3: 4 Issues (2012)
Volume 2: 4 Issues (2011)
Volume 1: 4 Issues (2010)
View Complete Journal Contents Listing