Search the World's Largest Database of Information Science & Technology Terms & Definitions
InfInfoScipedia LogoScipedia
A Free Service of IGI Global Publishing House
Below please find a list of definitions for the term that
you selected from multiple scholarly research resources.

What is MapReduce

Encyclopedia of Information Science and Technology, Third Edition
A programming model that uses a divide and conquer method to speed-up processing large datasets, with a special focus on semi-structured data.
Published in Chapter:
Data Science and Distributed Intelligence
Alfredo Cuzzocrea (ICAR-CNR and University of Calabria, Italy) and Mohamed Medhat Gaber (Robert Gordon University, Aberdeen, UK)
DOI: 10.4018/978-1-4666-5888-2.ch166
Full Text Chapter Download: US $37.50 Add to Cart
More Results
Database Systems for Big Data Storage and Retrieval
Is a computational paradigm for processing massive datasets in parallel if the computation fits a three-step pattern: map, shard and reduce. The map process is a parallel one. Each process executes on a different part of data and produces (key, value) pairs. The shard process collects the generated pairs, sorts and partitions them. Each partition is assigned to a different reduce process which produces a single result.
Full Text Chapter Download: US $37.50 Add to Cart
Data Analytics in Electric Vehicles
The MapReduce programming model can be used on clusters to process large datasets using concurrent distributed algorithms.
Full Text Chapter Download: US $37.50 Add to Cart
Big Data Analysis and Mining
A high-level programming model, which uses the “map” and “reduce” functions, for processing high volumes of data.
Full Text Chapter Download: US $37.50 Add to Cart
Big Data Analytics in Action: Examples
Is a programming model or algorithm for the processing of data using a parallel programming implementation and was originally used for academic purposes associated with parallel programming techniques.
Full Text Chapter Download: US $37.50 Add to Cart
Query Languages in NoSQL Databases
An engine which idea is divide a work in many tasks. With base in a table distribution, the algorithm divide a ad-hoc query in different sub-queries in the same time, with replicas, where one sub-query is mapped in k+1 sub-queries. Contains the map and reduce function to execute these tasks.
Full Text Chapter Download: US $37.50 Add to Cart
Overview of Big Data and Its Visualization
A programming algorithm that divides and maps the elements of datasets; then shuffles and distributes to cluster computing powers for big data processing.
Full Text Chapter Download: US $37.50 Add to Cart
Speedy Management of Data Using MapReduce Approach
It is a programming approach used to manage large set of data.
Full Text Chapter Download: US $37.50 Add to Cart
Oracle 19c's Multitenant Container Architecture and Big Data
An engine that provides the platform for massive parallel execution of algorithms written in Java.
Full Text Chapter Download: US $37.50 Add to Cart
Harnessing the Power of Big Data Analytics
Algorithm that is used to split massive data sets among many commodity hardware pieces in an effort to reduce computing time.
Full Text Chapter Download: US $37.50 Add to Cart
Rough Set Based Green Cloud Computing in Emerging Markets
A concept which is an abstraction of the primitives ‘map’ and ‘reduce’. Most of the computations are carried by applying a ‘map’ operation to each global record in order to generate key/value pairs and then apply the reduce operation in order to combine the derived data appropriately.
Full Text Chapter Download: US $37.50 Add to Cart
Big Data and Analytics: Application to Healthcare Industry
MapReduce is a programming model and an associated implementation for processing and generating big data sets with a parallel, distributed algorithm on a cluster.
Full Text Chapter Download: US $37.50 Add to Cart
The Clustering of Large Scale E-Learning Resources
A software framework introduced by Google to support distributed computing on large data sets on clusters of computers. The framework is inspired by map and reduce functions commonly used in functional programming, MapReduce libraries have been written in C++, Java, Python and other programming languages.
Full Text Chapter Download: US $37.50 Add to Cart
Overview of Big-Data-Intensive Storage and Its Technologies
Programming model and an associated implementation for processing and generating large data sets with a parallel, distributed algorithm on a cluster. (Chen et al., 2014).
Full Text Chapter Download: US $37.50 Add to Cart
Hadoop Framework for Handling Big Data Needs
A data processing framework of Hadoop which provides data intensive computation of large data sets by dividing tasks across several machines and finally combining the result.
Full Text Chapter Download: US $37.50 Add to Cart
Big Data Problem, Technologies and Solutions
Is a programming model for processing large data sets.
Full Text Chapter Download: US $37.50 Add to Cart
Data Streams Processing Techniques Data Streams Processing Techniques
A programming model which process massive amounts of unstructured data in parallel and distributed cluster of processors.
Full Text Chapter Download: US $37.50 Add to Cart
Storage and Query Processing Architectures for RDF Data
MapReduce is a programming model used to process large datasets using a parallel and distributed algorithm in a reliable manner.
Full Text Chapter Download: US $37.50 Add to Cart
Big Data Analysis: Basic Review on Techniques
MapReduce is a programming model and an associated implementation for processing and generating big data sets with a parallel, distributed algorithm on a cluster.
Full Text Chapter Download: US $37.50 Add to Cart
NoSQL Databases
It is quite easy to use programming model that supports parallel design since it is very scalable and works in a distributed way. It is also helpful for huge data processing, large scale searching and data analysis within the cloud. It provides related abstraction by a process of “mapper” and “reducer”. The “mapper” is applicable to each input key-value pair trying to come up with an associated absolute range of intermediate key-value pairs. Map: produce a list of ( key , value ) pairs from the input structured as a key( k ) value( v ) pair of a different type i.e. (k1, v1) ? list (k2, v2) The “reducer” is applicable to some or all values related to identifying the intermediate key to come up with output key-value pairs. Reduce: produce a list of values from an input that consists of a key and a list of values associated with that key i.e. (k2, list (v2)) ? list (v2) MapReduce is having adequate capability to support many real and global algorithms and tasks. It can divide the input data, schedule the execution of programs over a set of machines and handle machine failures. MapReduce can also handle the inter-machine communication. Map/Reduce is: 1) a Programming model from Lisp and other functional languages; 2) Many problems can be phrased this way; 3) Easy to distribute across nodes; and 4) Nice retry/failure semantics. MapReduce provides: 1) Automatic parallelization and distribution; 2) Fault tolerance; 3) I/O scheduling; and 4) Monitoring & status updates. The limitations of MapReduce are: 1) Extremely rigid data flow; 2) Constantly hacked in Join, Union, Split; 3) Common operations must be coded by user; and 4) Semantics hidden inside map-reduce functions, Difficult to maintain, extend, and optimize.
Full Text Chapter Download: US $37.50 Add to Cart
Big Data Analytics and Mining for Knowledge Discovery
A high-level programming model, which uses the “map” and “reduce” functions, for processing huge volumes of data.
Full Text Chapter Download: US $37.50 Add to Cart
Big Data Mining and Analytics
Is a high-level programming model, which uses the “map” and “reduce” functions, for processing high volumes of data.
Full Text Chapter Download: US $37.50 Add to Cart
The Potential and Capabilities of NoSQL Databases for ERP Systems
A programming model that process large amounts of data stored in commodity machines for processing massive datasets in parallel.
Full Text Chapter Download: US $37.50 Add to Cart
Modelling and Assessing Spatial Big Data: Use Cases of the OpenStreetMap Full-History Dump
Is a programming model for processing and generating large datasets in a parallel and distributed manner on a cluster.
Full Text Chapter Download: US $37.50 Add to Cart
Big Data Mining and Analytics With MapReduce
A high-level programming model, which uses the “map” and “reduce” functions, for processing huge volumes of data.
Full Text Chapter Download: US $37.50 Add to Cart
Understanding Data Analytics Is Good but Knowing How to Use It Is Better!
Is a programming model or algorithm for the processing of data using a parallel programming implementation and was originally used for academic purposes associated with parallel programming techniques.
Full Text Chapter Download: US $37.50 Add to Cart
Overview of Big Data in Healthcare
A program which is used for processing Big Data by utilizing a distributed model on a cluster.
Full Text Chapter Download: US $37.50 Add to Cart
Open Source Software (OSS) for Big Data
A programming algorithm that divide and map the elements of dataset, then shuffle and distribute to cluster of computing powers for Big Data processing.
Full Text Chapter Download: US $37.50 Add to Cart
Importance of Applying Big Data Concept in Marketing Decision Making
Is a specific programming model, which as such represents a new approach to solving the problem of processing large amounts of differently structured data. It consists of two functions - Map (sorting and filtering data) and Reduce (summarizing intermediate results), and it is executed in parallel and distributed.
Full Text Chapter Download: US $37.50 Add to Cart
Big Data Analytics Demystified
MapReduce is a software framework for processing vast amounts of data by using divide and conquare method.
Full Text Chapter Download: US $37.50 Add to Cart
Adapting Big Data Ecosystem for Landscape of Real World Applications
MapReduce is a parallel programming model proposed by Google and is used to distribute computing on clusters of computers for processing large data sets.
Full Text Chapter Download: US $37.50 Add to Cart
Big Data Analytics Tools and Platform in Big Data Landscape
Algorithm that is used to split massive data sets among many commodity hardware pieces in an effort to reduce computing time.
Full Text Chapter Download: US $37.50 Add to Cart
Driving Big Data with Hadoop Technologies
MapReduce is a framework or a programming model that allows carrying out tasks in parallel across a large cluster of computers.
Full Text Chapter Download: US $37.50 Add to Cart
Emergence of NoSQL Platforms for Big Data Needs
MapReduce is a parallel programming model proposed by Google and is used to distribute computing on clusters of computers for processing large data sets.
Full Text Chapter Download: US $37.50 Add to Cart
Accessing Big Data in the Cloud Using Mobile Devices
A programming model consisting of two logical steps—Map and Reduce—for processing massively parallelizable problems across extremely large datasets using a large cluster of commodity computers.
Full Text Chapter Download: US $37.50 Add to Cart
eContent Pro Discount Banner
InfoSci OnDemandECP Editorial ServicesAGOSR