Word Sense Based Hindi-Tamil Statistical Machine Translation

Vimal Kumar K., Divakar Yadav
Copyright: © 2018 | Volume: 14 | Issue: 1 | Pages: 11
ISSN: 1548-3657 | EISSN: 1548-3665 | EISBN13: 9781522542780 | DOI: 10.4018/IJIIT.2018010102
Cite Article

MLA

Kumar K., Vimal, and Divakar Yadav. "Word Sense Based Hindi-Tamil Statistical Machine Translation." IJIIT vol.14, no.1 2018: pp.17-27. http://doi.org/10.4018/IJIIT.2018010102

APA

Kumar K., V. & Yadav, D. (2018). Word Sense Based Hindi-Tamil Statistical Machine Translation. International Journal of Intelligent Information Technologies (IJIIT), 14(1), 17-27. http://doi.org/10.4018/IJIIT.2018010102

Chicago

Kumar K., Vimal, and Divakar Yadav. "Word Sense Based Hindi-Tamil Statistical Machine Translation," International Journal of Intelligent Information Technologies (IJIIT) 14, no.1: 17-27. http://doi.org/10.4018/IJIIT.2018010102

Abstract

Corpus-based natural language processing has had great success in recent years. It is used not only for languages such as English, French, Spanish, and Hindi but also widely for languages such as Tamil and Telugu. This paper aims to increase the accuracy of machine translation from Hindi to Tamil by considering each word's sense as well as its part of speech. The system translates word by word from Hindi to Tamil, using additional information: the preceding words, the current word's part of speech, and the word's sense. Such a translation system requires the frequency of words occurring in the corpus, the part-of-speech tags of the input words, and the probability of the word preceding each tagged word. WordNet is used to identify the synonyms of each source-language word. Among these synonyms, the one most relevant to the source-language word is selected for translation into the target language. Introducing the part-of-speech tag, preceding-word information, and semantic analysis has greatly improved the accuracy of the system.
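
The selection step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the romanized word forms, the counts, the add-one smoothing, and the names `LEXICAL_COUNTS`, `BIGRAM_COUNTS`, and `choose_translation` are all assumptions made for illustration. The sense step is reduced to a fixed candidate list per (word, part-of-speech) pair, standing in for the synonyms that WordNet would supply.

```python
# Toy, hand-made counts (assumptions for illustration, not data from the paper).
# Key: (hindi_word, pos_tag) -> {tamil_candidate: co-occurrence count}
LEXICAL_COUNTS = {
    ("sona", "NOUN"): {"thangam": 8, "thoongu": 1},  # "gold" reading dominates
    ("sona", "VERB"): {"thoongu": 9, "thangam": 1},  # "sleep" reading dominates
}

# Key: (previous_tamil_word, tamil_candidate) -> bigram count
BIGRAM_COUNTS = {
    ("<s>", "thangam"): 3,
    ("<s>", "thoongu"): 2,
}


def choose_translation(hindi_word, pos_tag, prev_tamil="<s>"):
    """Pick the candidate that maximizes lexical frequency times context probability."""
    candidates = LEXICAL_COUNTS.get((hindi_word, pos_tag), {})
    if not candidates:
        return None  # no entry for this word/tag pair in the toy table

    lexical_total = sum(candidates.values())
    # Counts of each candidate following the previously emitted Tamil word.
    prev_counts = {t: c for (p, t), c in BIGRAM_COUNTS.items() if p == prev_tamil}
    context_total = sum(prev_counts.values())

    def score(tamil_word):
        lexical = candidates[tamil_word] / lexical_total
        # Add-one smoothing so an unseen bigram does not zero out the score.
        context = (prev_counts.get(tamil_word, 0) + 1) / (context_total + len(candidates))
        return lexical * context

    return max(candidates, key=score)


if __name__ == "__main__":
    print(choose_translation("sona", "NOUN"))
    print(choose_translation("sona", "VERB"))
```

In this sketch the part-of-speech tag changes which candidate wins for the same source word, while the bigram term lets the preceding output word shift the choice when the lexical counts are close.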
