Studying and Analysis of a Vertical Web Page Classifier Based on Continuous Learning Naïve Bayes (CLNB) Algorithm

doi:10.4018/jitwe.2007040101

Studying and Analysis of a Vertical Web Page Classifier Based on Continuous Learning Naïve Bayes (CLNB) Algorithm

H. A. Ali, Ali I.El Desouky, Ahmed I. Saleh

Source Title: International Journal of Information Technology and Web Engineering (IJITWE)2(2)

Cite Article Cite Article

MLA

Ali, H. A., et al. "Studying and Analysis of a Vertical Web Page Classifier Based on Continuous Learning Naïve Bayes (CLNB) Algorithm." IJITWE vol.2, no.2 2007: pp.1-44. http://doi.org/10.4018/jitwe.2007040101

APA

Ali, H. A., Desouky, A. I., & Saleh, A. I. (2007). Studying and Analysis of a Vertical Web Page Classifier Based on Continuous Learning Naïve Bayes (CLNB) Algorithm. International Journal of Information Technology and Web Engineering (IJITWE), 2(2), 1-44. http://doi.org/10.4018/jitwe.2007040101

Chicago

Ali, H. A., Ali I.El Desouky, and Ahmed I. Saleh. "Studying and Analysis of a Vertical Web Page Classifier Based on Continuous Learning Naïve Bayes (CLNB) Algorithm," International Journal of Information Technology and Web Engineering (IJITWE) 2, no.2: 1-44. http://doi.org/10.4018/jitwe.2007040101

Export Reference

Favorite Full-Issue Download

View Full Text PDF

Abstract

Recently it will be more valued to build vertical classifiers to classify pages related to a specific domain and compensate those classifiers with novel learning techniques to achieve better performance. The contribution of this paper is three edged; firstly, a novel continuous learning technique is introduced. Secondly, the paper presents a new trend for Web page classification by presenting the domain-oriented classifiers. A new way of applying Bayes and K-Nearest Neighbor algorithms is introduced in order to build Domain Oriented (DONB) and (DOKNN) classifiers. The third contribution is combining both disciplines by introducing a novel classification strategy. Such strategy adds the continuous learning ability to Bayes theorem to build a (CLNB) classifier. It allows the classifier to adapt itself continuously for achieving better performance, and overcome the problem of overfitting. Experimental results have shown that CLNB demonstrates significant performance improvement over both DONB and DOKNN where its accuracy goes beyond 94.1% after testing 1000 pages.

You do not own this content. Please login to recommend this title to your institution's librarian or purchase it from the IGI Global bookstore.

Username or email: *

Password: *

Forgot individual login password?

Create individual account

Studying and Analysis of a Vertical Web Page Classifier Based on Continuous Learning Naïve Bayes (CLNB) Algorithm

MLA

APA

Chicago

Export Reference

Abstract

Request Access