Special Offers
- IGI Global’s New Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 17 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no additional cost for multi-user licensing, no embargo of content, full-text PDF & HTML format, and more.
  Learn More
- Open Access Book (Free Access) - Encyclopedia of Information Science and Technology, Sixth Edition (ISBN: 9781668473665)
  The Encyclopedia of Information Science and Technology, Sixth Edition) continues the legacy set forth by the first five editions by providing comprehensive coverage and up-to-date definitions of the most important issues, concepts, and trends pertaining to technological advancements and information management within a variety of settings and industries. The entire book is being published under open access.
  Read Now
- Open Access Book (Free Access) - Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries (ISBN: 9781668456293)
  Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries provides information on the recent technology, mitigation, and environmental protection that must be applied for food sustainability in developing countries. This book is being published under Platinum Open Access through funding from Diponegoro University, Indonesia.
  Read Now
- Open Access Book (Free Access) - New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY (ISBN: 9781668438091)
  The Walmart Corporation and the Lumina Foundation have provided funding to make New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY fully open access, completely removing any paywall between scholars in education and the latest research on new models for the future of higher education.
  Read Now
- Open Access Book (Free Access) - Handbook of Research on the Global View of Open Access and Scholarly Communications (ISBN: 9781799898054)
  Through a collaboration between IGI Global and the University of North Texas, the Handbook of Research on the Global View of Open Access and Scholarly Communications has been published as fully open access, completely removing any paywall between researchers of any field, and the latest research on the equitable and inclusive nature of Open Access and all of its complications.
  Read Now
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Journals by Field
e-Collections
OnDemand
Open Access
- View All Open Access Opportunities
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through IGI Global’s Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open accesss endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global to publish your work under open access? Review IGI Global’s open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us

Stochastically Balancing Trees for File and Database Systems

Aziz Barbar, Anis Ismail

Source Title: International Journal of Green Computing (IJGC) 4(1)

DOI: 10.4018/jgc.2013010104

OnDemand:

(Individual Articles)

Available

$37.50

Current Special Offers

No Current Special Offers

Abstract

With the constant improvement in data storage technologies, a new generation of indexing mechanisms is to be created to exploit the improvements in disk access speeds that were previously impractical. The self-balancing tree B-Tree, has long been the indexing structure of choice for reducing the amount of disk access at the expense of size of data block to be read or written. A new technique based on a dynamically growing multilevel list structure, which is stochastically balanced rather than self balanced, is discussed and compared to the B-Tree. An analogy between the technique and the structures is established to better compare the computational complexities.

Article Preview

Top

Introduction

As far as regularly available storage devices are concerned (Sugaya, 2006; Kawamoto, 2008) the read and write time can be described as:Tread=T_A +S*T_RTwrite=T_A +S*T_R(1) Where, T_A is the access time for the disk, S represents how much data needs to be read/written and T_R/T_W represents how much time is needed to actually read/write data.

T_A is an overhead which is present in hard drives due to the mechanical nature of the access that is vastly slower than the electronic operation.

Historically indexing techniques were designed to reduce the total amount of disk operations to minimize the effect of T_A on the overall performance of the technique. However, these techniques might be inefficient in the case of newer storage devices such as flash memory and other forms of random access memory where T_A is due to an electronic process and therefore becomes negligible with respect to T_R/T_W.

In case of random access memory, the dominant factor in the read operation becomes S and T_R/T_W under which circumstances we should optimize the indexing technique to reduce the product. For a fixed T_R/T_W, the only variable that can be reduced is S, that is the total amount of data that is read/written at once.

There are two main data structures that are of interest in this document. They are the Self Balancing B-Tree, and the List structure.

Self Balancing B-trees (Bayer, 1971; Bayer et al., 2002) are most commonly found in databases and file systems. The idea behind B-trees is that internal nodes can have a variable number of child nodes within some predefined range called the order of the B-Tree. As data are inserted or removed from the data structure, the number of child nodes varies within a node and so internal nodes are coalesced or split so as to maintain the designed range.

For a 3-4 B-tree (shown in Figure 1), each internal node may have only 3 or 4 child nodes. A node is considered to be in an illegal state if it has an invalid number of child nodes; it must be split. Accessing a key in the tree on average takes logn(N)operations where N is the amount of keys in the tree and n is the order of the tree.

Figure 1.

3-4 B-tree

Many types of variants to the B-Tree have been developed with very subtle differences to the B-Tree. Such variants include the B*-Tree (Berliner, 1978) and the B+-Tree (Taniar et al., 2003).

B-Trees are not the only types of indexing structures; other indexing structures, which are specialized in certain types of indexing, have been developed. These include structures dedicated to Video Indexing (Chen et al., 2002), Image Indexing (Ljosa et al., 2006), String Indexing (Kahveci et al., 2001), Regular Expression Indexing (Chan et al., 2003), and Indexing techniques for Data Warehouses (Ester et al., 2000).

One of the most active areas of research is the use of indexing in applications like geographic information systems where we refer to it as Spatial Indexing (Guttman, 1984). There are many approaches to such Spatial Indexing, especially high dimensional spatial indexing (Sakuraim et al., 2000; Chakrabarti et al., 1999; Berchtold et al., 1996; Katayama et al., 1997).

General purpose indexing techniques include the graph index approach (Yan et al., 2004) and hashing (Ramabhadran et al., 2004). Charguéraud (Charguéraud, 2010) verified many functional tree algorithms in Okasaki’s book (Okasaki, 2010) with a new method of transforming a program into a proposition transformer. However, neither Charguéraud’s verification nor the book contains WBT algorithms. Fundamental modules Data.Set and Data.Map in Haskell (Marlow, 2010) and the wttree.scm library in MIT/GNU Scheme and slib are based on a variant of the WBT algorithm.

Complete Article List

Search this Journal:

Reset

Open Access Articles: Forthcoming

Volume 10: 1 Issue (2019)

Volume 9: 2 Issues (2018)

Volume 8: 2 Issues (2017)

Volume 7: 1 Issue (2016)

Volume 6: 2 Issues (2015)

Volume 5: 2 Issues (2014)

Volume 4: 2 Issues (2013)

Volume 3: 2 Issues (2012)

Volume 2: 2 Issues (2011)

Volume 1: 2 Issues (2010)

View Complete Journal Contents Listing

MLA

APA

Chicago

Export Reference

Stochastically Balancing Trees for File and Database Systems

Abstract

Introduction

Complete Article List