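ETL stands for Extract, Transform, and Load. It is a process used in data warehousing to populate one database from another: data is extracted from a source database, transformed into the form the target expects, and loaded into the target database.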
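The three stages can be illustrated with a minimal sketch. It assumes two in-memory SQLite databases standing in for the source and the warehouse. The table names (`raw_sales`, `sales`), the column names, the sample rows, and the function `run_etl` are all hypothetical and chosen only for illustration; they do not come from the chapter.

```python
import sqlite3


def run_etl(source, target):
    """Copy cleaned rows from a hypothetical raw_sales table into a sales table."""
    # Extract: read every row from the source database.
    rows = source.execute("SELECT id, name, amount FROM raw_sales").fetchall()

    # Transform: drop rows with a missing amount, tidy the customer name,
    # and convert the amount to integer cents.
    cleaned = [
        (row_id, name.strip().title(), round(amount * 100))
        for row_id, name, amount in rows
        if amount is not None
    ]

    # Load: write the transformed rows into the target database.
    target.execute(
        "CREATE TABLE IF NOT EXISTS sales "
        "(id INTEGER PRIMARY KEY, customer TEXT, amount_cents INTEGER)"
    )
    target.executemany("INSERT INTO sales VALUES (?, ?, ?)", cleaned)
    target.commit()
    return len(cleaned)


# Illustrative source data (made up for this sketch).
source = sqlite3.connect(":memory:")
source.execute("CREATE TABLE raw_sales (id INTEGER, name TEXT, amount REAL)")
source.executemany(
    "INSERT INTO raw_sales VALUES (?, ?, ?)",
    [(1, "  alice ", 12.5), (2, "BOB", None), (3, "carol", 7.0)],
)

target = sqlite3.connect(":memory:")
loaded = run_etl(source, target)
print(loaded, "rows loaded")
```

Production ETL pipelines typically add incremental extraction, schema validation, and error handling, which this sketch leaves out.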
Published in Chapter:
Programming and Pre-Processing Systems for Big Data Storage and Visualization
Hidayat Ur Rahman (University of Swat, Pakistan), Rehan Ullah Khan (Al Qassim University, Saudi Arabia), and Amjad Ali (University of Swat, Pakistan)
Copyright: © 2018
Pages: 26
DOI: 10.4018/978-1-5225-3142-5.ch009
Abstract
This chapter provides a detailed overview of the major concepts used in Big Data. To process a huge volume of data, the first step is pre-processing, which is required to handle anomalies such as missing values by applying various transformations. The chapter gives a detailed overview of pre-processing tools used for Big Data, such as R, Yahoo! Pipes, Mechanical Turk, and Elasticsearch. Besides pre-processing tools, it covers storage tools, programming tools, data visualization, log processing tools, and caching tools used for Big Data analytics. In other words, this chapter is the core of the book and provides an overview of the major technologies discussed later in the book.