Recent Advances and Challenges in XML Document Routing

Recent Advances and Challenges in XML Document Routing

Mirella M. Moro (Universidade Federal de Minas Gerais (UFMG)-Belo Horizonte, Brazil), Zografoula Vagena (Microsoft Research, UK) and Vassilis J. Tsotras (University of California, Riverside, USA)
DOI: 10.4018/978-1-60566-308-1.ch007
OnDemand PDF Download:


Content-based routing is a form of data delivery whereby the flow of messages is driven by their content rather than the IP address of their destination. With the recognition of XML as the standard for data exchange, specialized XML routing services become necessary. In this chapter, the authors first demonstrate the relevance of such systems by presenting different world application scenarios where XML routing systems are needed and/or employed. Then, they present a survey of the current state of the art. Lastly, they attempt to identify issues and problems that have yet to be investigated. Their discussion will help identify open problems and issues and suggest directions for further research in the context of such systems.
Chapter Preview


Content-based routing is a form of data delivery whereby the flow of messages is driven by their content rather than the IP address of their destination. Specifically, in XML Routing, there is a continuous stream of XML messages (usually, one message has one XML document) from data producers to consumers, without any of the parties having knowledge of the other (Snoeren, 2001). Message transmission is performed by a sophisticated overlay network of application-level, content-based routers (called message brokers or XML routers) that match data messages against registered client subscriptions, and forward those messages (based on such matching) to output links, i.e. other routers or clients. The task of matching incoming messages to the set of client subscriptions is called message filtering.

This form of communication is widely employed by content-based information dissemination services, which are usually instantiated as publish/subscribe systems (pub/sub for short). For example, pub/sub systems have created opportunities for new applications such as a plethora of alert and notification services that notify interested users of new products in the market, stock price changes, currency variation, better offer deals and so on. Furthermore, with the expansion of Web services, new pub/sub systems are released every week. For instance, online travel agencies such as and inform their clients of price changes and hot deals that take into consideration the subscriber’s interests. Likewise, sends to its users email alerts about upcoming events and pre-sale information according to the user’s signed up artists and locations.

With the recognition of XML as the standard for data exchange, specialized XML-aware information dissemination services become necessary (Diao, 2004). These services can be implemented as publish/subscribe systems in which the information to be routed is encoded using XML, and the user subscriptions (or profiles) are expressed using XML query languages. Figure 1 illustrates the general architecture of an XML routing system.

Figure 1.

General architecture of XML routing system

Recent research on XML-aware information dissemination has investigated issues related to different parts of the routing system architecture. The most relevant aspects include: the discovery of Semantic communities of users with similar interests (Chand, 2007), the construction of the overlay dissemination network structure (Fenner, 2005; Diao, 2004; Snoeren, 2001), the indexing and aggregation of the profiles within a message broker (Chan, 2002; Diao, 2003; Gong, 2005; Kwon, 2005; Li, 2007; Moro, 2007a; Raj, 2007), the distribution of consumer profiles (Diao, 2004; Li, 2007; Papaemmanouil, 2005; Yoo, 2006), the encoding of the routed messages (Vagena, 2007a; Vagena, 2007b), the message filtering task (Altinel, 2000; Chan, 2002; Diao, 2003; Gong, 2005; He, 2006; Li, 2007; Kwon, 2005; Moro, 2007a; Raj, 2007; Tian, 2004; Vagena, 2007a; Vagena, 2007b), in-situ transformation of the original information (Diao, 2004), and computation sharing among message brokers (Chan, 2007).

Complete Chapter List

Search this Book:
Table of Contents
Ernesto Damiani
Eric Pardede
Eric Pardede
Chapter 1
Mary Ann Malloy, Irena Mlynkova
As XML technologies have become a standard for data representation, it is inevitable to propose and implement efficient techniques for managing XML... Sample PDF
Closing the Gap Between XML and Relational Database Technologies: State-of-the-Practice, State-of-the-Art and Future Directions
Chapter 2
Mirella M. Moro, Lipyeow Lim, Yuan-Chi Chang
It is well known that XML has been widely adopted for its flexible and self-describing nature. However, relational data will continue to co-exist... Sample PDF
Challenges on Modeling Hybrid XML-Relational Databases
Chapter 3
Vassiliki Koutsonikola, Athena Vakali
Nowadays, XML has become the standard for representing and exchanging data over the Web and several approaches have been proposed for efficiently... Sample PDF
XML and LDAP Integration: Issues and Trends
Chapter 4
Giovanna Guerrini, Marco Mesiti
The large dynamicity of XML documents on the Web has created the need to adequately support structural changes and to account for the possibility of... Sample PDF
XML Schema Evolution and Versioning: Current Approaches and Future Trends
Chapter 5
Mingzhu Wei, Ming Li, Elke A. Rundensteiner, Murali Mani, Hong Su
Stream applications bring the challenge of efficiently processing queries on sequentially accessible XML data streams. In this chapter, the authors... Sample PDF
XML Stream Query Processing: Current Technologies and Open Challenges
Chapter 6
Sven Groppe, Jinghua Groppe, Christoph Reinke, Nils Hoeller, Volker Linnemann
The widespread usage of XML in the last few years has resulted in the development of a number of XML query languages like XSLT or the later... Sample PDF
XSLT: Common Issues with XQuery and Special Issues of XSLT
Chapter 7
Mirella M. Moro, Zografoula Vagena, Vassilis J. Tsotras
Content-based routing is a form of data delivery whereby the flow of messages is driven by their content rather than the IP address of their... Sample PDF
Recent Advances and Challenges in XML Document Routing
Chapter 8
Philippe Poulard
XML engines are usually designed to solve a single class of problems: transformations of XML structures, validations of XML instances, Web... Sample PDF
Native XML Programming: Make Your Tags Active
Chapter 9
Stéphane Bressan, Wee Hyong Tok, Xue Zhao
Since XML technologies have become a standard for data representation, a great amount of discussion has been generated by the persisting open issues... Sample PDF
Continuous and Progressive XML Query Processing and its Applications
Chapter 10
Fabio Grandi, Federica Mandreoli, Riccardo Martoglia
In several application fields including legal and medical domains, XML documents are “versioned” along different dimensions of interest, whose... Sample PDF
Issues in Personalized Access to Multi-Version XML Documents
Chapter 11
Tran Khanh Dang
In an outsourced XML database service model, organizations rely upon the premises of external service providers for the storage and retrieval... Sample PDF
Security Issues in Outsourced XML Databases
Chapter 12
Marco Mesiti, Ernesto Jiménez Ruiz, Ismael Sanz, Rafael Berlanga Llavori, Giorgio Valentini, Paolo Perlasca, David Manset
There is a proliferation of research and industrial organizations that produce sources of huge amounts of biological data issuing from... Sample PDF
Data Integration Issues and Opportunities in Biological XML Data Management
Chapter 13
Doulkifli Boukraa, Riadh Ben Messaoud, Omar Boussaid
Current data warehouses deal for the most part with numerical data. However, decision makers need to analyze data presented in all formats which one... Sample PDF
Modeling XML Warehouses for Complex Data: The New Issues
Chapter 14
Irena Mlynkova
Since XML technologies have become a standard for data representation, numerous methods for processing XML data emerge every day. Consequently, it... Sample PDF
XML Benchmarking: The State of the Art and Possible Enhancements
About the Contributors