Web Mining: Creating Structure out of Chaos

Web Mining: Creating Structure out of Chaos

Roderick L. Lee (Pennsylvania State University at Harrisburg, USA)
DOI: 10.4018/978-1-59140-057-8.ch013
OnDemand PDF Download:


This chapter presents an overview of web mining. The three areas of web mining—Web content mining, Web usage mining, and Web structure mining—are identified. In this chapter specific attention is paid to Web structure mining, which is the study of the link topology. The link topology of the Web is analyzed in the context of a cyber-community in order to explore the connection between the link topology and conferral of authority. Millions, soon to be billions, of people are annotating Web documents, which results in an abundance of information. Herein lies the problem: topic distillation—searching through the sea of documents for relevant information. To address the problem of overabundance and relevancy, models are needed that can assist in creating order at the local level. The hub and spoke model identified in this chapter takes a proactive approach to creating an online community in a centralized or planned fashion and provides control over the architecture of the Web graph. In the end users can be assured with a certain level of confidence that the Web content contained in a hyperlinked community is both accurate and relevant.

Complete Chapter List

Search this Book:
Table of Contents
Parag C. Pendharkar
Parag C. Pendharkar
Chapter 1
Witold Abramowicz, Marek Nowak, Joanna Sztykiel
The main purpose of this article is to discuss applicability of Bayesian belief networks (BBN) within the procedures of working-capital credit... Sample PDF
Bayesian Networks as a Decision Support Tool in Credit Scoring Domain
Chapter 2
Marvin D. Troutt, Michael Hu, Murali Shanker, William Acar
Frontier Regression Models seek to explain boundary, frontier or optimal behavior rather than average behavior as in ordinary regression models.... Sample PDF
Frontier Versus Ordinary Regression Models for Data Mining
Chapter 3
Parag C. Pendharkar, Sudhir Nanda, James A. Rodger, Rahul Bhaskar
This chapter illustrates how a misclassification cost matrix can be incorporated into an evolutionary classification system for medical diagnosis.... Sample PDF
An Evolutionary Misclassification Cost Minimization Approach for Medical Diagnosis
Chapter 4
Aaron Ceglar, John Roddick, Paul Calder
Knowledge discovery is the process of eliciting interesting knowledge from data repositories. Due to the inability of computers to understand... Sample PDF
Guiding Knowledge Discovery Through Interactive Data Mining
Chapter 5
Karim K. Hirji
There is an enormous amount of data generated by academic, business, and governmental organizations alike; however, only a small portion of the data... Sample PDF
A Proposed Process for Performing Data Mining Projects
Chapter 6
Chi Kin Chan, Heung Wong, Wan Kai Pang, Marvin D. Troutt
This chapter is a case study in combining forecasts for inventory management in which the need for data mining in combination forecasts is... Sample PDF
Data Mining for Optimal Combination Demand Forecasts
Chapter 7
David Paper, Kenneth B. Tingey, Wai Yin Mok
This chapter illustrates how IT-enabled business process reengineering can fail if top management fails to understand the underlying process... Sample PDF
The Myth of Enterprise Database Redesign
Chapter 8
Sudhakar Kuppuraju, Girish Subramanian
Recent interest in relationship management and relationship marketing has led many firms to consider how to improve customer retention rates. The... Sample PDF
New Information Technologies and Other Pertinent Issues Impacting the Strategic Dimension of CRM for Business Excellence
Chapter 9
James A. Rodger
Accounting information systems enable the process of internal control and external auditing to provide a first-line defense in detecting fraud... Sample PDF
Utilization of Data Mining Techniques to Detect and Predict Accounting Fraud: A Comparison of Neural Networks and Discriminant Analysis
Chapter 10
Jose Maria Cavero, Carmen Costilla, Esperanza Marcos, Mario G. Piattini, Adolfo Sanchez
Data warehousing and online analytical processing (OLAP) technologies have become growing interest areas in recent years. Specific issues such as... Sample PDF
A Multidimensional Data Warehouse Development Methodology
Chapter 11
Bahador Ghahramani
The telecommunications industry (TI) is challenged by a significant increase in the complexity of information transfer due to a recent proliferation... Sample PDF
A Telecommunications Model for Managing Complexity of Voice and Data Networks and Services
Chapter 12
Wan Kai Pang, Heung Wong, Chi Kin Chan, Marvin D. Troutt
This chapter proposes an approach to the combination of forecasts from a new perspective and uses a new estimation methodology. Concepts from... Sample PDF
Combination Forecasts Based on Markov Chain Monte Carlo Estimation of the Mode
Chapter 13
Roderick L. Lee
This chapter presents an overview of web mining. The three areas of web mining—Web content mining, Web usage mining, and Web structure mining—are... Sample PDF
Web Mining: Creating Structure out of Chaos
Chapter 14
Parag C. Pendharkar, Girish Subramanian
Mining information and knowledge from very large databases is recognized as a key research area in machine learning and expert systems. In the... Sample PDF
Connectionist and Evolutionary Models for Learning, Discovering and Forecasting Software Effort
About the Authors