Dealing with Relevance Ranking in Cross-Lingual Cross-Script Text Reuse

Aarti Kumar, Sujoy Das
Copyright: © 2016 | Volume: 6 | Issue: 1 | Pages: 20
ISSN: 2155-6377 | EISSN: 2155-6385 | EISBN13: 9781466692480 | DOI: 10.4018/IJIRR.2016010102
MLA

Kumar, Aarti, and Sujoy Das. "Dealing with Relevance Ranking in Cross-Lingual Cross-Script Text Reuse." IJIRR vol.6, no.1 2016: pp.16-35. http://doi.org/10.4018/IJIRR.2016010102

APA

Kumar, A. & Das, S. (2016). Dealing with Relevance Ranking in Cross-Lingual Cross-Script Text Reuse. International Journal of Information Retrieval Research (IJIRR), 6(1), 16-35. http://doi.org/10.4018/IJIRR.2016010102

Chicago

Kumar, Aarti, and Sujoy Das. "Dealing with Relevance Ranking in Cross-Lingual Cross-Script Text Reuse," International Journal of Information Retrieval Research (IJIRR) 6, no.1: 16-35. http://doi.org/10.4018/IJIRR.2016010102

Abstract

The proliferation of multilingual content on the web has paved the way for text reuse to become cross-lingual and also cross-script. Identifying cross-language text reuse becomes harder when the languages involved are cross-script and less resourced. This paper focuses on identifying text reuse between English and Hindi news articles and improving their relevance ranking in two phases: (i) a heuristic retrieval phase that reduces the search space and (ii) a post-processing phase that improves the relevance ranking. A dictionary-based Cross-Language Information Retrieval strategy is used for heuristic retrieval, and a Parse Feature Vector Model (PFVS) is proposed for post-processing to improve the relevance ranking. The model has been successful in tackling the obfuscation problems of synonymy, hyponymy, hypernymy, antonymy, sentence addition/deletion, and word inflection. Instead of traditional approaches, Parse Feature Vectors are explored to detect reused documents, and to the best of the authors' knowledge this is a novel contribution for this language pair.
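The two-phase idea described in the abstract (dictionary-based retrieval to shrink the search space, then a re-ranking pass) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the toy dictionary `TOY_DICT`, the function names, the sample documents, and the choice of a Hindi query against English documents. In particular, the re-ranking step uses plain bag-of-words cosine similarity as a stand-in; it is not the Parse Feature Vector model, whose features the abstract does not specify.

```python
from collections import Counter
from math import sqrt

# Hypothetical toy Hindi -> English dictionary (illustration only).
TOY_DICT = {
    "सरकार": ["government"],
    "चुनाव": ["election"],
    "किसान": ["farmer"],
}


def translate_query(tokens, dictionary):
    """Map each source token to its dictionary translations; skip unknown tokens."""
    translated = []
    for token in tokens:
        translated.extend(dictionary.get(token, []))
    return translated


def heuristic_retrieve(query_terms, docs, k):
    """Phase 1: keep up to k documents that share the most terms with the query."""
    query_set = set(query_terms)
    scored = []
    for doc_id, text in docs.items():
        overlap = len(query_set & set(text.split()))
        if overlap > 0:
            scored.append((overlap, doc_id))
    # Highest overlap first; ties broken by document id for determinism.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [doc_id for _, doc_id in scored[:k]]


def cosine(a, b):
    """Cosine similarity between two term-count vectors."""
    dot = sum(count * b[term] for term, count in a.items() if term in b)
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def rerank(query_terms, candidates, docs):
    """Phase 2 (stand-in): order candidates by cosine similarity to the query."""
    query_vec = Counter(query_terms)
    return sorted(
        candidates,
        key=lambda d: (-cosine(query_vec, Counter(docs[d].split())), d),
    )


# Hypothetical English documents and a Hindi query.
DOCS = {
    "d1": "the government announced the election schedule",
    "d2": "farmer protests continue as government responds to farmer demands",
    "d3": "cricket match ends in a draw",
}

query = translate_query(["किसान", "सरकार"], TOY_DICT)
candidates = heuristic_retrieve(query, DOCS, k=2)
ranking = rerank(query, candidates, DOCS)
```

Splitting the work this way means the more expensive comparison in the second phase only runs over the few candidates that survive the cheap dictionary lookup, which is the motivation the abstract gives for the heuristic retrieval phase.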
