Reliability Based Scheduling Model (RSM) for Computational Grids

Zahid Raza, Deo P. Vidyarthi

Source Title: International Journal of Distributed Systems and Technologies (IJDST) 2(2)

DOI: 10.4018/jdst.2011040102

Abstract

The computational grid, characterized by distributed load sharing, has evolved into a platform for large-scale problem solving. A grid is a collection of heterogeneous resources offering services of varying natures, in which jobs may be submitted to any of the participating nodes. Scheduling these jobs in such a complex and dynamic environment poses many challenges. Reliability analysis of the grid is of paramount importance because a grid involves a large number of resources that may fail at any time, making it unreliable. These failures waste both computational power and the money spent on scarce grid resources. It is normally desired that a job be scheduled in an environment that ensures maximum reliability for its execution. This work presents a reliability-based scheduling model for jobs on the computational grid. The model considers the failure rates of both the software and hardware constituents of the grid: the application demanding execution, the nodes executing the job, and the network links supporting data exchange between the nodes. Job allocation under the proposed scheme can be trusted because the job is scheduled on the basis of an a priori reliability computation.

Introduction

The scientific community always thirsts for more powerful computational tools and methods. This has resulted in enormous developments in the computing world with regard to processor speed, fast and large memory, and efficient network devices for fast and reliable data transmission, along with advances in software technology. The thirst for computational power led to newer tools, which in turn fed back to improve scientific research. This self-feeding cycle resulted in the aggregation of heterogeneous resources known as the Grid, which enables collaborative engineering (Foster & Kesselman, 1998; Foster, 2002; Tarricone & Esposito, 2005; Taylor & Harrison, 2009).

A grid can be considered as consisting of a number of clusters, with each cluster comprising computing resources of nearly the same nature; across clusters, however, the nature of the nodes may differ. Participants inside a cluster agree to cooperate in problem solving, thus forming a virtual organization (VO). At any moment there may be many virtual organizations inside the grid, each with a dynamic constitution. Jobs may enter the grid through any of the participating nodes. To harness the advantages of the grid, these jobs should be scheduled over the grid so as to exploit their parallel and concurrent nature. Scheduling is the problem of mapping jobs onto the grid resources, and it is said to be efficient if this mapping takes the job requirements into account, e.g., the nature of the job, its inherent parallelism, and proper load balancing. Since scheduling is an NP-hard problem, many scheduling models have been proposed in the literature, each optimizing one parameter or another.

Whenever a job enters the grid for execution, the causes of its possible failure range from application failure to resource failure (node failure etc.). Failure can result from many things, viz. specification mistakes (incorrect algorithms, architectures etc.), hardware failures (host crash, network partition etc.), software failures (numerical exception, failed application etc.), implementation mistakes, component defects, external disturbances (radiation, electromagnetic waves, interference etc.), performance failures (application not completing within a specified time etc.), or other failures (machine rebooted by the owner, excessive CPU load, decreased priority given by the local resource to the current task etc.) (Huda, Schmidt, & Peake, 2005). A fault-tolerant system is one that continues to perform even in the presence of hardware and software failures. A fault is a physical defect, imperfection, or flaw that occurs within some hardware or software component, whereas an error is the manifestation of a fault and is a deviation from accuracy or correctness. Specifically, faults cause errors, and errors cause failures. Depending on its type, a grid may be susceptible to any or all of these types of faults.

Reliability is the ability of a system to perform and maintain its functions in routine circumstances as well as in hostile or unexpected circumstances. The more fault tolerant a system is, the more reliable it is. Reliability adds quality to the system and is an often-desired parameter for schedulers, owing to the large size of the grid and its composition of scarce resources. Failures can result in a huge loss, both in terms of money and in wasted computational power. Thus, a grid scheduler is always expected to ensure a reliable environment for job execution. Whenever a grid is designed, the hardware components come with a failure rate specified by the manufacturer and supplied as part of the hardware specifications. Software components also have failure rates, specified during software design using software engineering paradigms. These failure rates reflect the reliability of the system, which is desired to be high. For the scheduling decision, reliability should be computed beforehand, taking into account the contribution of both the hardware and the software, so that the probability of successful job execution increases. In this work, we propose a Reliability Based Scheduling Model (RSM), which allocates a modular job to the cluster of the grid that matches the job's requirements and offers the most reliable environment for job execution.
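
The idea of computing reliability a priori from the failure rates and then choosing the most reliable cluster can be illustrated with a small sketch. It is not the RSM formulation, which this preview does not give. It assumes constant failure rates and independent failures, so that the probability of surviving an exposure time t under failure rate λ is exp(-λt), the usual exponential reliability model. The names Cluster, job_reliability, and most_reliable_cluster, as well as all numeric values and units, are hypothetical.

```python
import math
from dataclasses import dataclass


@dataclass
class Cluster:
    """Hypothetical description of one grid cluster (illustrative only)."""
    name: str
    node_failure_rate: float  # assumed constant, failures per hour
    link_failure_rate: float  # assumed constant, failures per hour
    speed: float              # work units processed per hour


def job_reliability(cluster, work, data_hours, app_failure_rate):
    """Probability that the job finishes with no failure on this cluster.

    Assumes independent failures with constant rates, so the survival
    probabilities exp(-rate * time) multiply, i.e. the exponents add up.
    """
    exec_hours = work / cluster.speed
    exposure = (
        # The application and the node are exposed for the execution time.
        (app_failure_rate + cluster.node_failure_rate) * exec_hours
        # The network link is exposed for the data exchange time.
        + cluster.link_failure_rate * data_hours
    )
    return math.exp(-exposure)


def most_reliable_cluster(clusters, work, data_hours, app_failure_rate):
    """Return the cluster with the highest a priori job reliability."""
    return max(
        clusters,
        key=lambda c: job_reliability(c, work, data_hours, app_failure_rate),
    )


# Hypothetical grid with two clusters and one job of 100 work units
# that needs 2 hours of data exchange.
clusters = [
    Cluster("A", node_failure_rate=1e-3, link_failure_rate=5e-4, speed=10.0),
    Cluster("B", node_failure_rate=2e-4, link_failure_rate=1e-3, speed=5.0),
]
best = most_reliable_cluster(clusters, work=100, data_hours=2, app_failure_rate=1e-4)
print(best.name)
```

In this example cluster A is twice as fast, yet cluster B is selected: its total exposure (0.008) is lower than that of A (0.012), so its reliability exp(-0.008) exceeds exp(-0.012). A scheduler that also had to match job requirements, as the text describes, would first filter the clusters and only then rank the remaining ones by reliability.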
