Semantic Analysis Based Approach for Relevant Text Extraction Using Ontology

Semantic Analysis Based Approach for Relevant Text Extraction Using Ontology

Poonam Chahal (Manav Rachna International University, Faridabad, India), Manjeet Singh (YMCA University of Science and Technology, Faridabad, India) and Suresh Kumar (Manav Rachna International University, Faridabad, India)
Copyright: © 2017 |Pages: 18
DOI: 10.4018/IJIRR.2017100102
OnDemand PDF Download:
No Current Special Offers


Semantic analysis computation is done by extracting the interrelated concepts used by an author in the text/content of document. The concepts and linking i.e. relationships that are available among the concepts are most relevant as they provide the maximum information related to the event or activity as described by an author in the document. The retrieved relevant information from the text helps in the construction of the summary of a large text present in the document. This summary can further be represented in form of ontology and utilized in various application areas of information retrieval process like crawling, indexing, ranking, etc. The constructed ontologies can be compared with each other for calculation of similarity index based on semantic analysis between any texts. This paper gives a novel technique for retrieving the relevant semantic information represented in the form of ontology for true semantic analysis of given text.
Article Preview

(Pal & Saha, 2014) have given an algorithm named simplified Lesk which is based on an unsupervised approach. This algorithm helps in extracting the summary from a large content of a document. The construction of summary is done for obtaining the important sentences with the help of conventional online dictionary. These significant sentences are assigned weights according to the importance of the content written by an author. Further these assigned weights are evaluated using the steps of the proposed simplified Lesk algorithm. This simplified Lesk algorithm only considered the keywords by using the WordNet which is not enough to extract the meaningful information from a document. So, the simplified Lesk algorithm is modified which considers the semantics extracted from conventional online dictionary like WordNet.

(Kulkarni & Apte, 2014) has given lexical chain computing algorithm in which the division of original documents is done into sentences using segmentation. Then a map is constructed for each sentence using four types of relations i.e. synonym, hypernym, hyponym, meronym. The distance of each word with another related word is calculated and a lexical chain is constructed which is assigned weight accordingly. The chain having highest weight is selected as the longest chain and the relevant text extraction is done depending on the relevant lexical chains obtained.

(Patil et al., 2004) used the statistical approach for extracting the relevant information from a document. The given approach can be used in application for retrieving the semantic association from a text depending on the query from the user in the form of keyword. In this paper, the process of extracting relevant information in the form of summary is done in various phases like extraction of data items, generation of query, documents gathering, creation of summary, and arrangement of summary to any user of a query.

Complete Article List

Search this Journal:
Open Access Articles: Forthcoming
Volume 11: 4 Issues (2021): 2 Released, 2 Forthcoming
Volume 10: 4 Issues (2020)
Volume 9: 4 Issues (2019)
Volume 8: 4 Issues (2018)
Volume 7: 4 Issues (2017)
Volume 6: 4 Issues (2016)
Volume 5: 4 Issues (2015)
Volume 4: 4 Issues (2014)
Volume 3: 4 Issues (2013)
Volume 2: 4 Issues (2012)
Volume 1: 4 Issues (2011)
View Complete Journal Contents Listing