Sanitization and Anonymization of Document Repositories

Sanitization and Anonymization of Document Repositories

Yücel Saygin, Dilek Hakkani-Tür, Gökhan Tür
DOI: 10.4018/978-1-60566-058-5.ch129
(Individual Chapters)
No Current Special Offers


Information security and privacy in the context of the World Wide Web (WWW) are important issues that are still being investigated. However, most of the present research is dealing with access control and authentication-based trust. Especially with the popularity of WWW as one of the largest information sources, privacy of individuals is now as important as the security of information. In this chapter, our focus is text, which is probably the most frequently seen data type in the WWW. Our aim is to highlight the possible threats to privacy that exist due to the availability of document repositories and sophisticated tools to browse and analyze these documents. We first identify possible threats to privacy in document repositories. We then discuss a measure for privacy in documents with some possible solutions to avoid or, at least, alleviate these threats.

Complete Chapter List

Search this Book: