Using Semantic Web Concepts to Retrieve Specific Domain Information from the Web
Rafael Cunha Cardoso (Federal University of Pernambuco, Brazil), Fernando da Fonseca de Souza (Federal University of Pernambuco, Brazil) and Ana Carolina Salgado (Federal University of Pernambuco, Brazil)
Copyright: © 2008
Currently, systems dedicated to information retrieval/extraction perform an important role on fetching relevant and qualified information from the World Wide Web (WWW). The Semantic Web can be described as the Web’s future once it introduces a set of new concepts and tools. For instance, ontology is used to insert knowledge into contents of the current WWW to give meaning to such contents. This allows software agents to better understand the Web’s content meaning so that such agents can execute more complex and useful tasks to users. This work introduces an architecture that uses some Semantic Web concepts allied to Regular Expressions (REGEX) in order to develop a system that retrieves/extracts specific domain information from the Web. A prototype, based on such architecture, was developed to find information about offers announced on supermarkets Web sites.