Process Model for Content Extraction from Weblogs

Process Model for Content Extraction from Weblogs

Andreas Schieber, Andreas Hilbert
Copyright: © 2014 |Volume: 10 |Issue: 2 |Pages: 17
ISSN: 1548-3657|EISSN: 1548-3665|EISBN13: 9781466654808|DOI: 10.4018/ijiit.2014040102
Cite Article Cite Article

MLA

Schieber, Andreas, and Andreas Hilbert. "Process Model for Content Extraction from Weblogs." IJIIT vol.10, no.2 2014: pp.20-36. http://doi.org/10.4018/ijiit.2014040102

APA

Schieber, A. & Hilbert, A. (2014). Process Model for Content Extraction from Weblogs. International Journal of Intelligent Information Technologies (IJIIT), 10(2), 20-36. http://doi.org/10.4018/ijiit.2014040102

Chicago

Schieber, Andreas, and Andreas Hilbert. "Process Model for Content Extraction from Weblogs," International Journal of Intelligent Information Technologies (IJIIT) 10, no.2: 20-36. http://doi.org/10.4018/ijiit.2014040102

Export Reference

Mendeley
Favorite Full-Issue Download

Abstract

This paper develops and evaluates a BPMN-based process model which identifies and extracts blog content from the web and stores its textual data in a data warehouse for further analyses. Depending on the characteristics of the technologies used to create the weblogs, the process has to perform specific tasks in order to extract blog content correctly. The paper describes three phases: extraction, transformation and loading of data in a repository specifically adapted for blog content extraction. It highlights the objectives in these phases which must be achieved to ensure the correct extraction. The authors integrate the described process in a previously developed framework for blog mining. The authors' process model closes the conceptual gap in this framework as well as the gap in current research of blog mining process models. Furthermore, it can easily be adapted for other web extraction proposals.

Request Access

You do not own this content. Please login to recommend this title to your institution's librarian or purchase it from the IGI Global bookstore.