Studying and Analysis of a Vertical Web Page Classifier Based on Continuous Learning Naïve Bayes (CLNB) Algorithm

Studying and Analysis of a Vertical Web Page Classifier Based on Continuous Learning Naïve Bayes (CLNB) Algorithm

H. A. Ali, Ali I.El Desouky, Ahmed I. Saleh
Copyright: © 2007 |Volume: 2 |Issue: 2 |Pages: 44
ISSN: 1554-1045|EISSN: 1554-1053|ISSN: 1554-1045|EISBN13: 9781615203536|EISSN: 1554-1053|DOI: 10.4018/jitwe.2007040101
Cite Article Cite Article

MLA

Ali, H. A., et al. "Studying and Analysis of a Vertical Web Page Classifier Based on Continuous Learning Naïve Bayes (CLNB) Algorithm." IJITWE vol.2, no.2 2007: pp.1-44. http://doi.org/10.4018/jitwe.2007040101

APA

Ali, H. A., Desouky, A. I., & Saleh, A. I. (2007). Studying and Analysis of a Vertical Web Page Classifier Based on Continuous Learning Naïve Bayes (CLNB) Algorithm. International Journal of Information Technology and Web Engineering (IJITWE), 2(2), 1-44. http://doi.org/10.4018/jitwe.2007040101

Chicago

Ali, H. A., Ali I.El Desouky, and Ahmed I. Saleh. "Studying and Analysis of a Vertical Web Page Classifier Based on Continuous Learning Naïve Bayes (CLNB) Algorithm," International Journal of Information Technology and Web Engineering (IJITWE) 2, no.2: 1-44. http://doi.org/10.4018/jitwe.2007040101

Export Reference

Mendeley
Favorite Full-Issue Download

Abstract

Recently it will be more valued to build vertical classifiers to classify pages related to a specific domain and compensate those classifiers with novel learning techniques to achieve better performance. The contribution of this paper is three edged; firstly, a novel continuous learning technique is introduced. Secondly, the paper presents a new trend for Web page classification by presenting the domain-oriented classifiers. A new way of applying Bayes and K-Nearest Neighbor algorithms is introduced in order to build Domain Oriented (DONB) and (DOKNN) classifiers. The third contribution is combining both disciplines by introducing a novel classification strategy. Such strategy adds the continuous learning ability to Bayes theorem to build a (CLNB) classifier. It allows the classifier to adapt itself continuously for achieving better performance, and overcome the problem of overfitting. Experimental results have shown that CLNB demonstrates significant performance improvement over both DONB and DOKNN where its accuracy goes beyond 94.1% after testing 1000 pages.

Request Access

You do not own this content. Please login to recommend this title to your institution's librarian or purchase it from the IGI Global bookstore.