Studying and Analysis of a Vertical Web Page Classifier Based on Continuous Learning Naïve Bayes (CLNB) Algorithm

Studying and Analysis of a Vertical Web Page Classifier Based on Continuous Learning Naïve Bayes (CLNB) Algorithm

H. A. Ali (Mansoura University, Egypt), Ali I.El Desouky (Mansoura University, Egypt) and Ahmed I. Saleh (Mansoura University, Egypt)
DOI: 10.4018/jitwe.2007040101
OnDemand PDF Download:
$37.50

Abstract

Recently it will be more valued to build vertical classifiers to classify pages related to a specific domain and compensate those classifiers with novel learning techniques to achieve better performance. The contribution of this paper is three edged; firstly, a novel continuous learning technique is introduced. Secondly, the paper presents a new trend for Web page classification by presenting the domain-oriented classifiers. A new way of applying Bayes and K-Nearest Neighbor algorithms is introduced in order to build Domain Oriented (DONB) and (DOKNN) classifiers. The third contribution is combining both disciplines by introducing a novel classification strategy. Such strategy adds the continuous learning ability to Bayes theorem to build a (CLNB) classifier. It allows the classifier to adapt itself continuously for achieving better performance, and overcome the problem of overfitting. Experimental results have shown that CLNB demonstrates significant performance improvement over both DONB and DOKNN where its accuracy goes beyond 94.1% after testing 1000 pages.

Complete Article List

Search this Journal:
Reset
Open Access Articles: Forthcoming
Volume 12: 4 Issues (2017): 1 Released, 3 Forthcoming
Volume 11: 4 Issues (2016)
Volume 10: 4 Issues (2015)
Volume 9: 4 Issues (2014)
Volume 8: 4 Issues (2013)
Volume 7: 4 Issues (2012)
Volume 6: 4 Issues (2011)
Volume 5: 4 Issues (2010)
Volume 4: 4 Issues (2009)
Volume 3: 4 Issues (2008)
Volume 2: 4 Issues (2007)
Volume 1: 4 Issues (2006)
View Complete Journal Contents Listing