Linguistic Analyzers of the Arabic Language: Linguistic Engineering Basis

Linguistic Analyzers of the Arabic Language: Linguistic Engineering Basis

Copyright: © 2024 |Pages: 26
DOI: 10.4018/979-8-3693-0728-1.ch003
OnDemand:
(Individual Chapters)
Available
$37.50
No Current Special Offers
TOTAL SAVINGS: $37.50

Abstract

This scientific initiative aims to formulate a number of hypotheses in order to prepare linguistic analyzers for the Arabic language based on the theoretical foundations and methodological frameworks of platform linguistics, which are the appropriate framework for building linguistic resources that can be invested in the field of automatic processing of natural languages. In this context, the theory of the syntactic lexicon was adopted, as it is the solid nucleus of dependency grammar that has proven to be effective in the field of describing natural languages, and thus being able to simulate and computerize them and crystallize linguistic algorithms that can be automated by using the open source unitex platform, which is based mainly on final state automata, final graphs, and transducers.
Chapter Preview
Top

1. Introduction

This research falls within the framework of Dependency Grammar Systems such as the theory of the syntactic lexicon, which is considered as Theoretical and methodological fulcrum of platform linguistics, or as it is termed fourth-generation linguistics. It proved useful in developing research on the linguistic architecture of various natural languages, through its reliance on the techniques of electronic dictionaries and the local grammar based on several automates and transducers. This system enabled the achievement of an accumulation of knowledge whose level differed from one language to another, despite the early scholars’ efforts in this regard, the Arabic language still occupied a weak place. This fact is due to the absence of institutional structures incubating full-fledged scientific projects, as most of the attempts remained Scattered, individual initiatives governed by academic goals in general. All these initiatives did not go beyond the morphological aspects, and thus the focus on building a synthetic analyzer for the Arabic language, along with the rest of the other linguistic analyzers, becomes an urgent need in order to complete the construction process of comprehensive linguistic resources for the Arabic language.

With this in mind, this scientific participation seeks to achieve the following objectives:

  • Platform linguistic and knowledge society

  • Linguistic analyzers and Arabic language engineering

  • Determining methods for building a syntax analyzer for the Arabic language through the technique of syntactic lexicon tables.

  • Demonstrating techniques for converting synthetic tables into parametric graphs.

  • Defining the technical procedures for the elaboration of patron graphs through applications in the Arabic language.

  • Processing technologies through the “unitex”

  • Summaries, conclusions, and prospects.

Top

2. Platform Linguistics And Knowledge Society

Contemporary linguistics has developed from viewing language as finite structural frameworks upon which linguistic codes are built, to perceiving language as an infinite and algorithmic formal system inherent to human competencies, namely, natural language. Despite the temporal distance and epistemological divergence between these two perspectives, they share a commonality in their utilization of descriptive language rooted in the humanities, such as logic and psychology, including cognitive sciences, all falling within the same domain albeit with variations in semantic content. Each perspective has formulated a lexicon of linguistic concepts that defines the language it investigates, all emerging from the core of its theoretical concerns and requirements.

After presenting the theoretical frameworks of the linguistic journey in its two main traditional directions, in light of the technological advancements witnessed in natural language processing research, it becomes apparent that the adoption of either perspective, or even both together, is insufficient for the development of sound computer applications. Thus, we have constructed a new theory that integrates elements from both directions, which we refer to as “Lexicon-Grammar.”

In this context, Arabic linguistics has gone through four successive chronological stages. We can explain this transition from one generation to another through the evolution of scientific discoveries. Human knowledge evolves based on what it discovers within the realm of word’s phenomena. It's not just because scientists desire cognitive development; rather, each scientific era has its own scientific tools for describing natural phenomena. As humans realize that the tools, they have developed for scientific research are no longer capable of achieving their cognitive aspirations, they create new tools that enable them to explore the world and describe its phenomena.

Complete Chapter List

Search this Book:
Reset