Word Sense Based Hindi-Tamil Statistical Machine Translation

Vimal Kumar K., Divakar Yadav

Source Title: International Journal of Intelligent Information Technologies (IJIIT)14(1)

ISSN: 1548-3657|EISSN: 1548-3665|EISBN13: 9781522542780|DOI: 10.4018/IJIIT.2018010102

MLA

Kumar K., Vimal, and Divakar Yadav. "Word Sense Based Hindi-Tamil Statistical Machine Translation." IJIIT vol.14, no.1 2018: pp.17-27. http://doi.org/10.4018/IJIIT.2018010102

APA

Kumar K., V. & Yadav, D. (2018). Word Sense Based Hindi-Tamil Statistical Machine Translation. International Journal of Intelligent Information Technologies (IJIIT), 14(1), 17-27. http://doi.org/10.4018/IJIIT.2018010102

Chicago

Kumar K., Vimal, and Divakar Yadav. "Word Sense Based Hindi-Tamil Statistical Machine Translation," International Journal of Intelligent Information Technologies (IJIIT) 14, no.1: 17-27. http://doi.org/10.4018/IJIIT.2018010102

Export Reference

Favorite Full-Issue Download

View Full Text HTML

View Full Text PDF

Abstract

Corpus based natural language processing has emerged with great success in recent years. It is not only used for languages like English, French, Spanish, and Hindi but also is widely used for languages like Tamil, Telugu etc. This paper focuses to increase the accuracy of machine translation from Hindi to Tamil by considering the word's sense as well as its part-of-speech. This system works on word by word translation from Hindi to Tamil language which makes use of additional information such as the preceding words, the current word's part of speech and the word's sense itself. For such a translation system, the frequency of words occurring in the corpus, the tagging of the input words and the probability of the preceding word of the tagged words are required. Wordnet is used to identify various synonym for the words specified in the source language. Among these words, the one which is more relevant to the word specified in source language is considered for the translation to target language. The introduction of the additional information such as part-of-speech tag, preceding word information and semantic analysis has greatly improved the accuracy of the system.

You do not own this content. Please login to recommend this title to your institution's librarian or purchase it from the IGI Global bookstore.

Username or email: *

Password: *

Forgot individual login password?

Create individual account

Word Sense Based Hindi-Tamil Statistical Machine Translation

MLA

APA

Chicago

Export Reference

Abstract

Request Access