Text Semantic Mining Model Based on the Algebra of Human Concept Learning

Jun Zhang, Xiangfeng Luo, Xiang He, Chuanliang Cai
DOI: 10.4018/978-1-4666-2476-4.ch014

Abstract

Dealing with large-scale text knowledge on the Web has become increasingly important as the Web develops, yet it faces several challenges, one of which is to extract as much semantics as possible to represent text knowledge. Since text semantic mining is also the process of representing text knowledge, this paper proposes a text knowledge representation model called the text semantic mining model (TSMM), based on the algebra of human concept learning, which both carries rich semantics and can be constructed automatically with low complexity. Herein, the algebra of human concept learning is introduced, which enables TSMM to carry rich semantics. Then the formalization and the construction process of TSMM are discussed, and three types of reasoning rules based on TSMM are proposed. Finally, experiments and comparisons with current text representation models show that the proposed model outperforms the others.

Introduction

With the rapid growth of the Web, how to represent and organize large-scale texts has drawn much attention. One of the most important tasks in text knowledge representation is extracting the semantics in texts. Many scholars have focused on models that represent text knowledge through various text analysis methods. Such models are expected to contain rich semantics, to support robust reasoning, and to be constructed automatically.

Currently, models for representing text knowledge can be divided into four main types.

1. Statistical models, which are generated by statistical methods. Typical examples include the vector space model (VSM) (Salton & Wong, 1975) and latent semantic analysis (LSA) (Landauer & Foltz, 1998). VSM represents a text's semantics by the words extracted from it and their weights, but it does not take the relations between words into account; thus VSM expresses only a little of the semantics in a text, while much more is lost. In contrast, LSA carries more semantics than VSM, but it is expensive to construct because it relies on singular value decomposition, whose computational cost is high (see the sketch after this list).

2. Cognition-based models, whose basic idea is inspired by cognitive theories. The element fuzzy cognitive map (EFCM) (Luo & Xu, 2008) is a typical example. It captures more semantics than VSM at a lower computational cost than LSA, and it can be applied to large-scale text collections since it is constructed automatically.

3. Probabilistic topic models, such as the author-topic model (ATM) (Michal & Thomas, 2004), the author-recipient-topic model (ART) (McCallum & Corrada-Emmanuel, 2004), and correlated topic models (CTM) (Blei & Lafferty, 2006). These models require extensive and complex computation, which makes them unsuitable for large-scale text collections.

4. Ontology-based models, which are built on ontology languages and are mostly constructed semi-automatically. The ontology inference layer (OIL) (Horrocks & Fensel, 2000), the web ontology language (OWL) (McGuinness & Harmelen, 2004), and simple HTML ontology extensions (SHOE) (Heflin & Hendler, 1999) are typical examples. Because they carry rich semantics, ontology-based models have attracted plenty of research. However, they can only be applied to specialized domains that contain abundant human experiential knowledge, since their construction requires a mass of manual work. Thus, to date, ontology-based models still cannot be applied to automatically process large-scale text collections.
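To make the contrast between the first two model types concrete, below is a minimal Python sketch (not from the chapter; the toy corpus and all identifiers are illustrative) of a VSM built from TF-IDF weights and an LSA representation obtained by truncated SVD of the same term-document matrix. It shows why VSM ignores word relations (each term is an independent dimension) and where LSA's cost comes from (the SVD step).

    # Minimal VSM vs. LSA sketch (illustrative only; not the chapter's TSMM).
    # Assumes scikit-learn is installed: pip install scikit-learn
    from sklearn.feature_extraction.text import TfidfVectorizer
    from sklearn.decomposition import TruncatedSVD
    from sklearn.metrics.pairwise import cosine_similarity

    # A toy corpus; a real collection would be far larger.
    docs = [
        "the web contains large scale text knowledge",
        "semantic mining extracts knowledge from text",
        "ontology languages represent domain knowledge",
    ]

    # VSM: each document becomes a vector of term weights (TF-IDF here).
    # Terms are independent dimensions, so relations between words are lost.
    vectorizer = TfidfVectorizer()
    vsm = vectorizer.fit_transform(docs)   # shape: (n_docs, n_terms)

    # LSA: truncated SVD of the term-document matrix folds co-occurring
    # terms into shared latent dimensions, recovering some word relations.
    # The SVD is the costly step that limits LSA on large collections.
    svd = TruncatedSVD(n_components=2)
    lsa = svd.fit_transform(vsm)           # shape: (n_docs, 2)

    print("VSM similarities:\n", cosine_similarity(vsm))
    print("LSA similarities:\n", cosine_similarity(lsa))

In the VSM output, two documents are similar only to the extent that they share surface terms, whereas the LSA projection can relate documents through shared latent dimensions; this is the extra semantics, at extra cost, that the survey above attributes to LSA.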

Consequently, from the discussion above, we can see that some models carry abundant semantics but cannot be constructed automatically (e.g., OWL); some can be constructed automatically and carry rich semantics, but still cannot be applied to large-scale collections because of their high complexity (e.g., CTM and ATM); and some can be built automatically with low complexity but carry little semantics (e.g., VSM). As a result, through the analysis of these models, we consider that a good text knowledge representation model should satisfy the two conditions listed below.

1. Contain rich text semantics;
2. Be constructed automatically with low complexity.
