Special Offers
- IGI Global’s New Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 17 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no additional cost for multi-user licensing, no embargo of content, full-text PDF & HTML format, and more.
  Learn More
- Open Access Book (Free Access) - Encyclopedia of Information Science and Technology, Sixth Edition (ISBN: 9781668473665)
  The Encyclopedia of Information Science and Technology, Sixth Edition) continues the legacy set forth by the first five editions by providing comprehensive coverage and up-to-date definitions of the most important issues, concepts, and trends pertaining to technological advancements and information management within a variety of settings and industries. The entire book is being published under open access.
  Read Now
- Open Access Book (Free Access) - Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries (ISBN: 9781668456293)
  Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries provides information on the recent technology, mitigation, and environmental protection that must be applied for food sustainability in developing countries. This book is being published under Platinum Open Access through funding from Diponegoro University, Indonesia.
  Read Now
- Open Access Book (Free Access) - New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY (ISBN: 9781668438091)
  The Walmart Corporation and the Lumina Foundation have provided funding to make New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY fully open access, completely removing any paywall between scholars in education and the latest research on new models for the future of higher education.
  Read Now
- Open Access Book (Free Access) - Handbook of Research on the Global View of Open Access and Scholarly Communications (ISBN: 9781799898054)
  Through a collaboration between IGI Global and the University of North Texas, the Handbook of Research on the Global View of Open Access and Scholarly Communications has been published as fully open access, completely removing any paywall between researchers of any field, and the latest research on the equitable and inclusive nature of Open Access and all of its complications.
  Read Now
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Journals by Field
e-Collections
OnDemand
Open Access
- View All Open Access Opportunities
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through IGI Global’s Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open accesss endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global to publish your work under open access? Review IGI Global’s open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us

Classification Method for Learning Morpheme Analysis

László Kovács

Source Title: Journal of Information Technology Research (JITR) 5(4)

DOI: 10.4018/jitr.2012100106

OnDemand:

(Individual Articles)

Available

$37.50

Current Special Offers

No Current Special Offers

Abstract

The morpheme analysis module is an important component in natural language processing engines. The parser modules are usually based on rule systems created by human experts. In the paper, a novel approach is tested for implementation of the morpheme analyzer module. The proposed structure is based on the theory of formal concept analysis. The word inflection can be considered as a classification problem, where the class label denotes the corresponding transformation rule. The main benefit of the proposed method is the efficient generalization feature. The proposed morpheme analyzer module was implemented in a prototype question generation application.

Article Preview

Top

Introduction

Natural Language Processing (NLP) is an active area within human-machine interface development. The processing of input sentences given in human language or generating sentences of human language is still a challenging task in IT world. There are many problem areas in NLP where no standard solutions are available for every related task. The input sentences are processed in many different phases, where the usual process includes tokenization, cleaning, morpheme analysis, sentence analysis, semantic graph construction and sentence interpretation. The goal of the morpheme analysis module is to determine the stem of the word and to determine the grammatical role of the word within the sentence. The stem can be used to determine the concept related to the given word. Using some external ontology, the domain specific and universal knowledge elements can be extracted from the related external knowledge base. The ontology databases usually contain information on the specific relationships of the concepts like specialization, generalization, synonyms and specific application. The grammatical role of the words can be encoded on many ways. In some languages, the position of the word conveys the grammatical role. In some other languages, there is no dominant word order, thus other formal elements, like suffixes or prefixes are used to describe the role of the word. As a word may have several grammatical and semantic roles at the same time, several suffix or prefix parts can be attached to the stem word. The main goal of the morpheme analyzer module is to determine both the different suffix and prefix layers and the stem word.

In the literature there are some standard methods for morpheme analysis which use some rule based systems. These rules are usually created by human experts, thus the generation of the rule set is always a very costly operation. The main goal of our investigation was to investigate the possibility of a learning system which can inference the morpheme structure of the target words. This task has a high complexity as it has a lot of unknown parameters like the set of suffix and prefix elements and the agglutination rules of the morpheme elements. In this paper, the first phase of the research is summarized which aims at the generation and testing a concept lattice based morpheme analyzer. The proposed system uses a supervised learning mechanism. The training data should contain valid inflection examples: a transaction unit includes the base word, the inflected word and the corresponding morpheme structure. Thus the set of suffixes and prefixes are given as an input parameter. The goal of the concept lattice based classifier is to learn the relationship between the stem form and the corresponding transformation rule. In the proposed system for every possible grammatical roles (for example accusative), a separate concept lattice classifier is generated. Thus the resulting structure is the cluster of classifiers. The possible ordering of the different morpheme units is encoded with a probabilistic finite state automaton. The edges, the transition edges of the automaton are set during the training process. The classification is executed with the application of a concept lattice. The concept lattice is a very flexible structure to determine the most important clusters of the attributes and determine the generalization relationship among them. Using a special, class label attribute in the intent part, the lattice can be used as a classification tool. The main benefit of the concept lattice based classification is that it uses a human-like generalization mechanism. The performed test focused on this property of the classification. The tests were executed with smaller training sets in order to investigate the generalization accuracy of the different morpheme classifiers.

The paper first gives a survey on the internal structure of the NLP engines and it presents the key modules of the engine. The next section presents an overview of the different important stemmer and morpheme methods. Then the formal definition of the concept lattice structure is given and the proposed architecture for concept lattice based classification is introduced. The last section presents a prototype system for automated question generation task. The question generation application uses the proposed morpheme analysis module to determine the stems in the source sentences.

Complete Article List

Search this Journal:

Reset

Volume 16: 1 Issue (2024): Forthcoming, Available for Pre-Order

Volume 15: 6 Issues (2022): 1 Released, 5 Forthcoming

Volume 14: 4 Issues (2021)

Volume 13: 4 Issues (2020)

Volume 12: 4 Issues (2019)

Volume 11: 4 Issues (2018)

Volume 10: 4 Issues (2017)

Volume 9: 4 Issues (2016)

Volume 8: 4 Issues (2015)

Volume 7: 4 Issues (2014)

Volume 6: 4 Issues (2013)

Volume 5: 4 Issues (2012)

Volume 4: 4 Issues (2011)

Volume 3: 4 Issues (2010)

Volume 2: 4 Issues (2009)

Volume 1: 4 Issues (2008)

View Complete Journal Contents Listing

MLA

APA

Chicago

Export Reference

Classification Method for Learning Morpheme Analysis

Abstract

Introduction

Complete Article List