Recent Developments on Security and Reliability in Large-Scale Data Processing with MapReduce

Christian Esposito, Massimo Ficco
Copyright: © 2016 | Volume: 12 | Issue: 1 | Pages: 20
ISSN: 1548-3924 | EISSN: 1548-3932 | EISBN13: 9781466689244 | DOI: 10.4018/IJDWM.2016010104
Cite Article

MLA

Esposito, Christian, and Massimo Ficco. "Recent Developments on Security and Reliability in Large-Scale Data Processing with MapReduce." IJDWM vol.12, no.1 2016: pp.49-68. http://doi.org/10.4018/IJDWM.2016010104

APA

Esposito, C. & Ficco, M. (2016). Recent Developments on Security and Reliability in Large-Scale Data Processing with MapReduce. International Journal of Data Warehousing and Mining (IJDWM), 12(1), 49-68. http://doi.org/10.4018/IJDWM.2016010104

Chicago

Esposito, Christian, and Massimo Ficco. "Recent Developments on Security and Reliability in Large-Scale Data Processing with MapReduce," International Journal of Data Warehousing and Mining (IJDWM) 12, no.1 (2016): 49-68. http://doi.org/10.4018/IJDWM.2016010104

Abstract

The demand for access to large volumes of data, distributed across hundreds or thousands of machines, has opened new opportunities in commerce, science, and computing applications. MapReduce is a paradigm that offers a programming model and an associated implementation for processing massive datasets in parallel on non-dedicated distributed computing hardware. It has been successfully adopted in several academic and industrial projects for Big Data analytics. However, since such analytics are increasingly demanded within mission-critical applications, security and reliability are strongly required in MapReduce frameworks in order to manage sensitive information and to obtain the right answer at the right time. In this paper, the authors present the main implementation of the MapReduce programming paradigm, provided by Apache under the name Hadoop. They illustrate the security and reliability concerns that arise in a large-scale data processing infrastructure. They review the available solutions for supporting security and reliability in MapReduce frameworks, together with their limitations. The authors conclude by describing the ongoing evolution of such solutions and the open issues for improvement, which could represent challenging research opportunities for academic researchers.
