XML Stream Query Processing: Current Technologies and Open Challenges

Mingzhu Wei; Ming Li; Elke A. Rundensteiner; Murali Mani; Hong Su

doi:10.4018/978-1-60566-308-1.ch005

Special Offers
- IGI Global’s New Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 17 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no additional cost for multi-user licensing, no embargo of content, full-text PDF & HTML format, and more.
  Learn More
- Open Access Book (Free Access) - Encyclopedia of Information Science and Technology, Sixth Edition (ISBN: 9781668473665)
  The Encyclopedia of Information Science and Technology, Sixth Edition) continues the legacy set forth by the first five editions by providing comprehensive coverage and up-to-date definitions of the most important issues, concepts, and trends pertaining to technological advancements and information management within a variety of settings and industries. The entire book is being published under open access.
  Read Now
- Open Access Book (Free Access) - Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries (ISBN: 9781668456293)
  Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries provides information on the recent technology, mitigation, and environmental protection that must be applied for food sustainability in developing countries. This book is being published under Platinum Open Access through funding from Diponegoro University, Indonesia.
  Read Now
- Open Access Book (Free Access) - New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY (ISBN: 9781668438091)
  The Walmart Corporation and the Lumina Foundation have provided funding to make New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY fully open access, completely removing any paywall between scholars in education and the latest research on new models for the future of higher education.
  Read Now
- Open Access Book (Free Access) - Handbook of Research on the Global View of Open Access and Scholarly Communications (ISBN: 9781799898054)
  Through a collaboration between IGI Global and the University of North Texas, the Handbook of Research on the Global View of Open Access and Scholarly Communications has been published as fully open access, completely removing any paywall between researchers of any field, and the latest research on the equitable and inclusive nature of Open Access and all of its complications.
  Read Now
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Journals by Field
e-Collections
Open Access
- View All Open Access Opportunities
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through IGI Global’s Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open accesss endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global to publish your work under open access? Review IGI Global’s open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us
Newsroom

XML Stream Query Processing: Current Technologies and Open Challenges

Mingzhu Wei, Ming Li, Elke A. Rundensteiner, Murali Mani, Hong Su

Source Title: Open and Novel Issues in XML Database Applications: Future Directions and Advanced Technologies

DOI: 10.4018/978-1-60566-308-1.ch005

OnDemand:

(Individual Chapters)

Available

$37.50

Current Special Offers

No Current Special Offers

Abstract

Stream applications bring the challenge of efficiently processing queries on sequentially accessible XML data streams. In this chapter, the authors study the current techniques and open challenges of XML stream processing. Firstly, they examine the input data semantics in XML streams and introduce the state-of-the-art of XML stream processing. Secondly, they compare and contrast the automatonbased and algebra-based techniques used in XML stream query execution. Thirdly, they study different optimization strategies that have been investigated for XML stream processing – in particular, they discuss cost-based optimization as well as schema-based optimization strategies. Lastly but not least, the authors list several key open challenges in XML stream processing.

Chapter Preview

Top

Introduction

In our increasingly fast-paced digital world all activities of humans and surrounding environments are being tagged and thus digitally accessible in real time. This opens up the novel opportunity to develop a variety of applications that monitor and make use of such data streams, typically stock, traffic and network activities (Babcock et al., 2002). Many projects, both in industry and academia, have recently sprung up to tackle newly emerging challenges related to stream processing. On the academic side, projects include Aurora (Abadi et al., 2003), Borealis (Abadi et al., 2005), STREAM (Babu & Widom, 2001), Niagara (Chen et al., 2002), TelegraphCQ (Chandrasekaran et al., 2003), and CAPE (Rundensteiner et al., 2004). On the industrial side, existing major players in database industry such as Oracle (Witkowski et al., 2007) and IBM (Amini et al., 2006) have embarked on stream projects and new startup companies have also emerged (Streambase, 2008; Coral8, 2008).

While most of these activities initially focused on simple relational data, it is apparent that XML is an established format and has been widely accepted as the standard data representation for exchanging information on the internet. Due to the proliferation of XML data in web services (Carey et al., 2002), there is also a surge in XML stream applications (Koch et al., 2004; Florescu et al., 2003; Diao & Franklin, 2003; Bose et al., 2003; Russell et al., 2003; Ludascher et al., 2002; Peng & Chawathe, 2003). For instance, a message broker routes the XML messages to interested parties (Gupta & Suciu, 2003). In addition, message brokers can also perform message restructuring or backups. For example, in an on-line order handling system (Carey et al., 2002), suppliers can register their available products with the broker. The broker will then match each incoming purchase order with the subscription and forward it to the corresponding suppliers, possibly in a restructured format at the request of the suppliers. Other typical applications include XML packet routing (Snoeren & Conkey, 2001), selective dissemination of information (Altinel & Franklin, 2000), and notification systems (Nguyen et al., 2001).

XML streams are often handled as a sequence of primitive tokens, such as a start tag, an end tag or a PCDATA item. To perform query evaluation over such on-the-fly XML token streams, most systems (Diao et al., 2003; Gupta & Suciu, 2003; Ludascher et al., 2002; Peng & Chawathe, 2003) propose to use automata to retrieve patterns from XML token streams. However, although automata is a suitable technique for matching expressions, how to improve and extend automata functionality in order to efficiently answer queries over XML streams has been a topic of active debate by the XML community. Further, one distinguishing feature of pattern retrieval on XML streams is that it relies solely on the token-by-token sequential traversal. It is not possible to jump to a certain portion of the stream (analogous to sequential access on magnetic tapes). Thus, the traditional index-based technologies cannot be applied for effective query optimization. In static XML processing, cost-based and schema-based optimization techniques are widely used. How to perform such optimization and other optimization techniques in the streaming XML context is a major challenge, and is thus one of the topics of this chapter.

Complete Chapter List

Search this Book:

Reset

MLA

APA

Chicago

Export Reference

XML Stream Query Processing: Current Technologies and Open Challenges

Abstract

Introduction

Complete Chapter List