A distributed, scalable storage system in the Hadoop framework that stores large quantities of data across clusters of commodity hardware.
Published in Chapter:
Hadoop Framework for Handling Big Data Needs
Rupali Ahuja (University of Delhi, India)
Copyright: © 2018
Pages: 22
DOI: 10.4018/978-1-5225-3142-5.ch004
Abstract
The data generated today has outgrown the storage and computing capabilities of traditional software frameworks. Large volumes of data, if aggregated and analyzed properly, may provide useful insights to predict human behavior, increase revenues, gain or retain customers, improve operations, combat crime, cure diseases, etc. The results of effective Big Data analysis can thus provide actionable intelligence for humans as well as for machine consumption. New tools, techniques, technologies and methods are being developed to store, retrieve, manage, aggregate, correlate and analyze Big Data. Hadoop is a popular software framework for handling Big Data needs; it provides a distributed framework for the storage and processing of large datasets. This chapter discusses in detail the Hadoop framework, its features, applications and popular distributions, and its storage and visualization tools.