Reference Hub1
A Survey of Scheduling and Management Techniques for Data-Intensive Application Workflows

A Survey of Scheduling and Management Techniques for Data-Intensive Application Workflows

Suraj Pandey, Rajkumar Buyya
ISBN13: 9781615209712|ISBN10: 1615209719|EISBN13: 9781615209729
DOI: 10.4018/978-1-61520-971-2.ch007
Cite Chapter Cite Chapter

MLA

Pandey, Suraj, and Rajkumar Buyya. "A Survey of Scheduling and Management Techniques for Data-Intensive Application Workflows." Data Intensive Distributed Computing: Challenges and Solutions for Large-scale Information Management, edited by Tevfik Kosar, IGI Global, 2012, pp. 156-176. https://doi.org/10.4018/978-1-61520-971-2.ch007

APA

Pandey, S. & Buyya, R. (2012). A Survey of Scheduling and Management Techniques for Data-Intensive Application Workflows. In T. Kosar (Ed.), Data Intensive Distributed Computing: Challenges and Solutions for Large-scale Information Management (pp. 156-176). IGI Global. https://doi.org/10.4018/978-1-61520-971-2.ch007

Chicago

Pandey, Suraj, and Rajkumar Buyya. "A Survey of Scheduling and Management Techniques for Data-Intensive Application Workflows." In Data Intensive Distributed Computing: Challenges and Solutions for Large-scale Information Management, edited by Tevfik Kosar, 156-176. Hershey, PA: IGI Global, 2012. https://doi.org/10.4018/978-1-61520-971-2.ch007

Export Reference

Mendeley
Favorite

Abstract

This chapter presents a comprehensive survey of algorithms, techniques, and frameworks used for scheduling and management of data-intensive application workflows. Many complex scientific experiments are expressed in the form of workflows for structured, repeatable, controlled, scalable, and automated executions. This chapter focuses on the type of workflows that have tasks processing huge amount of data, usually in the range from hundreds of mega-bytes to petabytes. Scientists are already using Grid systems that schedule these workflows onto globally distributed resources for optimizing various objectives: minimize total makespan of the workflow, minimize cost and usage of network bandwidth, minimize cost of computation and storage, meet the deadline of the application, and so forth. This chapter lists and describes techniques used in each of these systems for processing huge amount of data. A survey of workflow management techniques is useful for understanding the working of the Grid systems providing insights on performance optimization of scientific applications dealing with data-intensive workloads.

Request Access

You do not own this content. Please login to recommend this title to your institution's librarian or purchase it from the IGI Global bookstore.