Special Offers
- IGI Global’s New Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 17 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no additional cost for multi-user licensing, no embargo of content, full-text PDF & HTML format, and more.
  Learn More
- Open Access Book (Free Access) - Encyclopedia of Information Science and Technology, Sixth Edition (ISBN: 9781668473665)
  The Encyclopedia of Information Science and Technology, Sixth Edition) continues the legacy set forth by the first five editions by providing comprehensive coverage and up-to-date definitions of the most important issues, concepts, and trends pertaining to technological advancements and information management within a variety of settings and industries. The entire book is being published under open access.
  Read Now
- Open Access Book (Free Access) - Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries (ISBN: 9781668456293)
  Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries provides information on the recent technology, mitigation, and environmental protection that must be applied for food sustainability in developing countries. This book is being published under Platinum Open Access through funding from Diponegoro University, Indonesia.
  Read Now
- Open Access Book (Free Access) - New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY (ISBN: 9781668438091)
  The Walmart Corporation and the Lumina Foundation have provided funding to make New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY fully open access, completely removing any paywall between scholars in education and the latest research on new models for the future of higher education.
  Read Now
- Open Access Book (Free Access) - Handbook of Research on the Global View of Open Access and Scholarly Communications (ISBN: 9781799898054)
  Through a collaboration between IGI Global and the University of North Texas, the Handbook of Research on the Global View of Open Access and Scholarly Communications has been published as fully open access, completely removing any paywall between researchers of any field, and the latest research on the equitable and inclusive nature of Open Access and all of its complications.
  Read Now
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Journals by Field
e-Collections
OnDemand
Open Access
- View All Open Access Opportunities
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through IGI Global’s Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open accesss endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global to publish your work under open access? Review IGI Global’s open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us

A Web Knowledge Discovery Engine Based on Concept Algebra

Kai Hu, Yingxu Wang, Yousheng Tian

Source Title: Developments in Natural Intelligence Research and Knowledge Engineering: Advancing Applications

DOI: 10.4018/978-1-4666-1743-8.ch010

OnDemand:

(Individual Chapters)

Available

$37.50

Current Special Offers

No Current Special Offers

Abstract

Autonomous on-line knowledge discovery and acquisition play an important role in cognitive informatics, cognitive computing, knowledge engineering, and computational intelligence. On the basis of the latest advances in cognitive informatics and denotational mathematics, this paper develops a web knowledge discovery engine for web document restructuring and comprehension, which decodes on-line knowledge represented in informal documents into cognitive knowledge represented by concept algebra and concept networks. A visualized concept network explorer and a semantic analyzer are implemented to capture and refine queries based on concept algebra. A graphical interface is built using concept and semantic models to refine users’ queries. To enable autonomous information restructuring by machines, a two-level knowledge base that mimics human lexical/syntactical and semantic cognition is introduced. The information restructuring model provides a foundation for automatic concept indexing and knowledge extraction from web documents. The web knowledge discovery engine extends machine learning capability from imperative and adaptive information processing to autonomous and cognitive knowledge processing with unstructured documents in natural languages.

Chapter Preview

Top

Introduction

A central problem in web knowledge discovery, retrieval, and acquisition is how to formulate structured and effective queries on-line with a concept-oriented knowledge discovery tool. In the Internet environment, users often only submit short and incomplete queries that do not clearly express their actual needs (Spink et al., 2002). Therefore, an important issue in web knowledge mining is to improve search results by assisting users to express their information needs accurately and completely.

In order to achieve the above objectives, the following important issues must be dealt with for web-based knowledge searching engines: a) Query Formulation: An on-line search is preprocessed by a cognitive process to represent and formulate a query. In most information retrieval systems, this process is supposed to be an external activity and is not supported by the system. b) Query Refinement: When a primary query is formed with clearly identified domain, type, and attributes in an existing knowledge network, an accurate query refining process is needed to help users to efficiently formulate the query. c) Query Expression: There are a great variety of expression structures between the query initiator and the on-line information systems. Therefore, query expression is an important process in knowledge retrieval systems to transfer information between two heterogeneous information forms: the concept networks in the brain and the indexed databases in the web. It is the key for query expressing to effectively reduce the information leak in the transformation process from internal cognitive expressions to external formulated expressions.

A wide variety of techniques have been proposed to assist users to express a search request. Among them, an important method is query expansion, which adds relevant query terms to an initial query in order to improve retrieval results (Shaoira & Meirav, 2005; Na et al., 2005). Query limitation is another query-improvement strategy (Na et al., 2005) opposite to query expansion, where users are provided with options to limit their search in order to receive more focused results. These methods have not got satisfactory effectiveness due to uncompleted consideration of all crucial features in query formulations.

This paper presents a web knowledge discovery and acquisition engine on the basis of a denotational mathematics known as concept algebra (Wang, 2006b, 2008a, 2008c). A formal concept-driven methodology is adopted in information restructuring for web documents. Knowledge organizations and representations are modeled by concept algebra, which represents a two-level normalized semantic space that simulates the cognitive knowledge representation inside the brain. At the lower level, concepts are formalized by a 5-tuple in concept algebra with a set of algebraic concept manipulation rules. At the higher level, knowledge is formally modeled by concept networks with nine concept associations. The web knowledge discovery engine encompasses four coherent components known as the concept network explorer, the semantic analyzer, the conceptual query editor, and the XML query generator. The concept network explorer provides a visual thinking navigator for assisting users to locate, capture, and refine a query efficiently. A graphical interface of the knowledge query engine is developed to facilitate direct expression and refinement of queries. The computer-aided knowledge retrieval system generates refined queries that best fit not only users’ requirements, but also rational knowledge structures of existing information systems based on concept algebra. An information restructuring model is designed to decode and map informal texts in web documents into structured concept network represented by a concept graph. Applying WorldNet, ConceptNet, and other domain ontology, a concept-based clustering method that considers semantic relations and dependencies are proposed to index the restructured information of on-line documents.

Complete Chapter List

Search this Book:

Reset

MLA

APA

Chicago

Export Reference

A Web Knowledge Discovery Engine Based on Concept Algebra

Abstract

Introduction

Complete Chapter List