Metadata Management in PetaShare Distributed Storage Network

Metadata Management in PetaShare Distributed Storage Network

Ismail Akturk, Xinqi Wang, Tevfik Kosar
ISBN13: 9781615209712|ISBN10: 1615209719|EISBN13: 9781615209729
DOI: 10.4018/978-1-61520-971-2.ch005
Cite Chapter Cite Chapter

MLA

Akturk, Ismail, et al. "Metadata Management in PetaShare Distributed Storage Network." Data Intensive Distributed Computing: Challenges and Solutions for Large-scale Information Management, edited by Tevfik Kosar, IGI Global, 2012, pp. 118-139. https://doi.org/10.4018/978-1-61520-971-2.ch005

APA

Akturk, I., Wang, X., & Kosar, T. (2012). Metadata Management in PetaShare Distributed Storage Network. In T. Kosar (Ed.), Data Intensive Distributed Computing: Challenges and Solutions for Large-scale Information Management (pp. 118-139). IGI Global. https://doi.org/10.4018/978-1-61520-971-2.ch005

Chicago

Akturk, Ismail, Xinqi Wang, and Tevfik Kosar. "Metadata Management in PetaShare Distributed Storage Network." In Data Intensive Distributed Computing: Challenges and Solutions for Large-scale Information Management, edited by Tevfik Kosar, 118-139. Hershey, PA: IGI Global, 2012. https://doi.org/10.4018/978-1-61520-971-2.ch005

Export Reference

Mendeley
Favorite

Abstract

The unbounded increase in the size of data generated by scientific applications necessitates collaboration and sharing among the nation’s education and research institutions. Simply purchasing high-capacity, high-performance storage systems and adding them to the existing infrastructure of the collaborating institutions does not solve the underlying and highly challenging data handling problem. Scientists are compelled to spend a great deal of time and energy on solving basic data-handling issues, such as the physical location of data, how to access it, and/or how to move it to visualization and/or compute resources for further analysis. This chapter presents the design and implementation of a reliable and efficient distributed data storage system, PetaShare, which spans multiple institutions across the state of Louisiana. At the back-end, PetaShare provides a unified name space and efficient data movement across geographically distributed storage sites. At the front-end, it provides light-weight clients the enable easy, transparent, and scalable access. In PetaShare, the authors have designed and implemented an asynchronously replicated multi-master metadata system for enhanced reliability and availability. The authors also present a high level cross-domain metadata schema to provide a structured systematic view of multiple science domains supported by PetaShare.

Request Access

You do not own this content. Please login to recommend this title to your institution's librarian or purchase it from the IGI Global bookstore.