Article Preview
Top1. Introduction And Problematic
We are living in the age of information technologies that rendered digital libraries a concrete possibility. The easy access to the information via electronic resources, such as the Web has made billions of web pages easily accessible to anyone providing plenty of potential sources for the plagiarists and rendere the digital documents more vulnerable to be copied by turning cheating extremely easy. Lancaster and Culwin in [13] state that “plagiarism is theft of intellectual property, it is not only dishonest, but also an offense that may result in sanctions”. It is considered as one of the biggest problems in publishing, science, and education.
Recently the Plagiarism phenomenon has spread, where it has even touched the most popular politicians in the world designed by the Germans ministers like the Minister of defense, the atypical KARL-THEODOR ZU GUTTENBERG that was resigned after accusations of plagiarism concerning the writing of his doctoral thesis in law at the university of Bayreuth (Bavière). In 2014 also the minister of defense URSULA VON DER LEYEN who was considered as a possible heiress of Angela Mmarkel, was suspected by a site specialized in the analysis of theses written by politicians to have plagiarized a number of passages in his medical thesis. Without forgetting the minister of education and research SCHAVAN ANNETTE resigned in 2014 even due to plagiarism (Bouarara1, 2015).An example of plagiarised text is presented in the next figure:
Figure 1.
(a) Plagiarised Text VS (b) Source Text (Potthast, 2010)
In order to give you a global view about our work the plagiarism is mentioned as a lack of moral, civil or commercial, which can be subject to criminal penalty. It can be defined in three points:
- •
Appropriating the creative work of someone else and present it as his own;
- •
To grab snippets of text, images, data, etc. from external sources and integrate them into his own work without citing the source;
- •
Summarize even the original idea of an author by expressing it in his own words, but omitting to mention the source.
Depending on the behaviour of plagiarist, we can distinguish several plagiarism types such as the plagiarism verbatim, The paraphrase and the cases of plagiarism the most difficult to detect are plagiarism with translation and plagiarism of ideas.
Plagiarism detection, the automatic identification of plagiarism and the retrieval of the original sources, is developed and investigated as a possible countermeasure. Although humans can identify cases of plagiarism in their areas of expertise quite easily, it requires much effort to be aware of all potential sources on a given topic and to provide strong evidence against an offender. The manual analysis of text with respect to plagiarism becomes infeasible on a large scale, so that automatic plagiarism detection attracts considerable attention