A command line interface application that facilitates to import the data directly from the RDBMS systems into any platform of the Hadoop file system.
Published in Chapter:
Driving Big Data with Hadoop Technologies
Siddesh G. M. (M. S. Ramaiah Institute of Technology, India), Srinidhi Hiriyannaiah (M. S. Ramaiah Institute of Technology, India), and K. G. Srinivasa (M. S. Ramaiah Institute of Technology, India)
Copyright: © 2014
|Pages: 31
DOI: 10.4018/978-1-4666-5864-6.ch010
Abstract
The world of Internet has driven the computing world from a few gigabytes of information to terabytes, petabytes of information turning into a huge volume of information. These volumes of information come from a variety of sources that span over from structured to unstructured data formats. The information needs to update in a quick span of time and be available on demand with the cheaper infrastructures. The information or the data that spans over three Vs, namely Volume, Variety, and Velocity, is called Big Data. The challenge is to store and process this Big Data, running analytics on the stored Big Data, making critical decisions on the results of processing, and obtaining the best outcomes. In this chapter, the authors discuss the capabilities of Big Data, its uses, and processing of Big Data using Hadoop technologies and tools by Apache foundation.