Web Mining in Thematic Search Engines

Web Mining in Thematic Search Engines

Massimiliano Caramia (Istituto per le Applicazioni del Calcolo IAC-CNR, Italy) and Giovanni Felici (Istituto di Analisi dei Sistemi ed Informatica (IASI-CNR), Italy)
Copyright: © 2005 |Pages: 5
DOI: 10.4018/978-1-59140-557-3.ch226
OnDemand PDF Download:


The recent improvements of search engine technologies have made available to Internet users an enormous amount of knowledge that can be accessed in many different ways. The most popular search engines now provide search facilities for databases containing billions of Web pages, where queries are executed instantly. The focus is switching from quantity (maintaining and indexing large databases of Web pages and quickly selecting pages matching some criterion) to quality (identifying pages with a high quality for the user). Such a trend is motivated by the natural evolution of Internet users who are now more selective in their choice of the search tool and may be willing to pay the price of providing extra feedback to the system and to wait more time for their queries to be better matched. In this framework, several have considered the use of data-mining and optimization techniques, which are often referred to as Web mining (for a recent bibliography on this topic, see, e.g., Getoor, Senator, Domingos & Faloutsos, 2003), and Zaïane, Srivastava, Spiliopoulou, & Masand, 2002). Here, we describe a method for improving standard search results in a thematic search engine, where the documents and the pages made available are restricted to a finite number of topics, and the users are considered to belong to a finite number of user profiles. The method uses clustering techniques to identify, in the set of pages resulting from a simple query, subsets that are homogeneous with respect to a vectorization based on context or profile; then we construct a number of small and potentially good subsets of pages, extracting from each cluster the pages with higher scores. Operating on these subsets with a genetic algorithm, we identify the subset with a good overall score and a high internal dissimilarity. This provides the user with a few nonduplicated pages that represent more correctly the structure of the initial set of pages. Because pages are seen by the algorithms as vectors of fixed dimension, the role of the context- or profile-based vectorization is central and specific to the thematic approach of this method.

Complete Chapter List

Search this Book:
Editorial Advisory Board
Alexander Tuzhilin
John Wang
Chapter 1
Action Rules  (pages 1-5)
Zbigniew W. Ras, Angelina Tzacheva, Li-Shiang Tsay
There are two aspects of interestingness of rules that have been studied in data mining literature, objective and subjective measures (Liu, 1997;... Sample PDF
Action Rules
Chapter 2
Alexander Thomasian
Active disks allow the downloading of certain types of processing from the host computer onto disks, more specifically the disk controller, which... Sample PDF
Active Disks for Data Mining
Chapter 3
Ion Muslea
Inductive learning algorithms typically use a set of labeled examples to learn class descriptions for a set of user-specified concepts of interest.... Sample PDF
Active Learning with Multiple Views
Chapter 4
James E. Yao, Chang Liu, Qiyang Chen, June Lu
As internal and external demands on information from managers are increasing rapidly, especially the information that is processed to serve... Sample PDF
Administering and Managing a Data Warehouse
Chapter 5
Pasquale De Meo, Giovanni Quattrone, Giorgio Terracina, Domenico Ursino
An electronic service (e-service) can be defined as a collection of network-resident software programs that collaborate for supporting users in both... Sample PDF
Agent-Based Mining of User Profiles for E-Services
Chapter 6
Leonardo Tininini
An efficient query engine is certainly one of the most important components in data warehouses (also known as OLAP systems or multidimensional... Sample PDF
Aggregate Query Rewriting in Multidimensional Databases
Chapter 7
Claudia Perlich, Foster Provost
Most data mining and modeling techniques have been developed for data represented as a single table, where every row is a feature vector that... Sample PDF
Aggregation for Predictive Modeling with Relational Data
Chapter 8
Jaroslav Zendulka
Data mining technology just recently became actually usable in real-world scenarios. At present, the data mining models generated by commercial data... Sample PDF
API Standardization Efforts for Data Mining
Chapter 9
J. Ben Schafer
In a world where the number of choices can be overwhelming, recommender systems help users find and evaluate items of interest. They connect users... Sample PDF
Application of Data Mining to Recommender Systems
Chapter 10
Francesco Buccafurri, Gianluca Lax
Online analytical processing applications typically analyze a large amount of data by means of repetitive queries involving aggregate measures on... Sample PDF
Approximate Range Queries by Histograms in OLAP
Chapter 11
Rafael Marti
The design and implementation of intelligent systems with human capabilities is the starting point to design Artificial Neural Networks (ANNs). The... Sample PDF
Artificial Neural Networks for Prediction
Chapter 12
Yew-Kwong Woon, Wee-Keong Ng, Ee-Peng Lim
Association Rule Mining (ARM) is concerned with how items in a transactional database are grouped together. It is commonly known as market basket... Sample PDF
Association Rule Mining
Chapter 13
Raymond Chi-Wing Wong, Ada Wai-Chee fu
Association rule mining (Agrawal, Imilienski, & Swami, 1993) has been proposed for understanding the relationships among items in transactions or... Sample PDF
Association Rule Mining and Application to MPIS
Chapter 14
Anne Denton, Christopher Besemann
Most data of practical relevance are structured in more complex ways than is assumed in traditional data mining algorithms, which are based on a... Sample PDF
Association Rule Mining of Regional Data
Chapter 15
Martine Cadot, Jean-Baptiste Maj, Tarek Ziade
A manager would like to have a dashboard of his company without manipulating data. Usually, statistics have solved this challenge, but nowadays... Sample PDF
Assocation Rules and Statistics
Chapter 16
Brad Morantz
Preparing a dataset is a very important step in data mining. If the input to the process contains problems, noise, or errors, then the results will... Sample PDF
Automated Anomaly Detection
Chapter 17
Alicja A. Wieczorkowska
The aim of musical instrument sound classification is to process information from audio files by a classificatory system and accurately identify... Sample PDF
Automatic Muscial Instrument Sound Classification
Chapter 18
Bayesian Networks  (pages 89-93)
Ahmad Bashir, Latifur Khan, Mamoun Awad
A Bayesian network is a graphical model that finds probabilistic relationships among variables of a system. The basic components of a Bayesian... Sample PDF
Bayesian Networks
Chapter 19
Les Pang
Data warehousing has been a successful approach for supporting the important concept of knowledge management — one of the keys to organizational... Sample PDF
Best Practices in Data Warehousing from the Federal Perspective
Chapter 20
Scott Nicholson, Jeffrey Stanton
Most people think of a library as the little brick building in the heart of their community or the big brick building in the center of a college... Sample PDF
Bibliomining for Library Decision-Making
Chapter 21
Fang Chu, Lipo Wang
Accurate diagnosis of cancers is of great importance for doctors to choose a proper treatment. Furthermore, it also plays a key role in the... Sample PDF
Biomedical Data Mining Using RBF Neural Networks
Chapter 22
Hee Beng Kuan Tan, Yuan Zhao
Although the use of statistically probable properties is very common in the area of medicine, it is not so in software engineering. The use of such... Sample PDF
Building Empirical-Based Knowledge for Design Recovery
Chapter 23
Business Processes  (pages 118-123)
David Sundaram, Victor Portougal
The concept of a business process is central to many areas of business systems, specifically to business systems based on modern information... Sample PDF
Business Processes
Chapter 24
Fabiana Lorenzi, Francesco Ricci
Recommender systems are being used in e-commerce web sites to help the customers in selecting products more suitable to their needs. The growth of... Sample PDF
Case-Based Recommender Systems
Chapter 25
Maria Suzana Marc Amoretti
For some time, the fields of computer science and cognition have diverged. Researchers in these two areas know ever less about each other’s work... Sample PDF
Categorization Process and Data Mining
Chapter 26
Bin Zhang
Center-based clustering algorithms are generalized to more complex model-based, especially regression-model-based, clustering algorithms. This... Sample PDF
Center-Based Clustering and Regression Clustering
Chapter 27
Johannes Gehrke
It is the goal of classification and regression to build a data-mining model that can be used for prediction. To construct such a model, we are... Sample PDF
Classification and Regression Trees
Chapter 28
Classification Methods  (pages 144-149)
Aijun An
Generally speaking, classification is the action of assigning an object to a category according to the characteristics of the object. In data... Sample PDF
Classification Methods
Chapter 29
Luminita Dumitriu
Association rules, introduced by Agrawal, Imielinski and Swami (1993), provide useful means to discover associations in data. The problem of mining... Sample PDF
Closed-Itemset Incremental-Mining Problem
Chapter 30
Tom Burr
One data mining activity is cluster analysis, of which there are several types. One type deserving special attention is clustering that arises due... Sample PDF
Cluster Analysis in Fitting Mixtures of Curves
Chapter 31
Xiangji Huang
Clustering is the process of grouping a collection of objects (usually represented as points in a multidimensional space) into classes of similar... Sample PDF
Clustering Analysis and Algorithms
Chapter 32
Maribel Yasmina Santos, Adriano Moreira, Sofia Carneiro
Clustering is the process of grouping a set of objects into clusters so that objects within a cluster have high similarity with each other, but are... Sample PDF
Clustering in the Identification of Space Models
Chapter 33
Anne Denton
Time series data is of interest to most science and engineering disciplines and analysis techniques have been developed for hundreds of years. There... Sample PDF
Clustering of Time Series Data
Chapter 34
Clustering Techniques  (pages 176-179)
Sheng Ma, Tao Li
Clustering data into sensible groupings as a fundamental and effective tool for efficient data organization, summarization, understanding, and... Sample PDF
Clustering Techniques
Chapter 35
Frank Klawonn, Frank Rehm
For many applications in knowledge discovery in databases, finding outliers, which are rare events, is of importance. Outliers are observations that... Sample PDF
Clustering Techniques for Outlier Detection
Chapter 36
Mitja Lenic, Peter Kokol, Petra Povalej, Milan Zorman
The aggressive rate of growth of disk storage and, thus, the ability to store enormous quantities of data have far outpaced our ability to process... Sample PDF
Combining Induction Methods with the Multimethod Approach
Chapter 37
Zhi-Hua Zhou
Data mining attempts to identify valid, novel, potentially useful, and ultimately understandable patterns from huge volume of data. The mined... Sample PDF
Comprehensibility of Data Mining Algorithms
Chapter 38
Computation of OLAP Cubes  (pages 196-201)
Amin A. Abdulghani
The focus of Online Analytical Processing (OLAP) is to provide a platform for analyzing data (e.g., sales data) with multiple dimensions (e.g.... Sample PDF
Computation of OLAP Cubes
Chapter 39
Concept Drift  (pages 202-206)
Marcus A. Maloof
Traditional approaches to data mining are based on an assumption that the process that generated or is generating a data stream is static. Although... Sample PDF
Concept Drift
Chapter 40
Jean-Francois Boulicaut
Condensed representations have been proposed in Mannila and Toivonen (1996) as a useful concept for the optimization of typical data-mining tasks.... Sample PDF
Condensed Representations for Data Mining
Chapter 41
Timo R. Bretschneider, Odej Kao
Sensing and processing multimedia information is one of the basic traits of human beings: The audiovisual system registers and transports... Sample PDF
Content-Based Image Retrieval
Chapter 42
Edward J. Garrity, Joseph B. O’Donnell, G. Lawrence Sanders
Investor confidence in the financial markets has been rocked by recent corporate frauds and many in the investment community are searching for... Sample PDF
Continuous Auditing and Data Mining
Chapter 43
John M. Artz
Although data warehousing theory and technology have been around for well over a decade, they may well be the next hot technologies. How can it be... Sample PDF
Data Driven vs. Metric Driven Data Warehouse Design
Chapter 44
Xiong Wang
Data management in its general term refers to activities that involve the acquisition, storage, and retrieval of data. Traditionally, information... Sample PDF
Data Management in Three-Dimensional Structures
Chapter 45
Auroop R. Ganguly, Amar Gupta, Shiraj Khan
Information by itself is no longer perceived as an asset. Billions of business transactions are recorded in enterprise-scale data warehouses every... Sample PDF
Data Mining and Decision Support for Business and Science
Chapter 46
Andrew Kusiak, Shital C. Shah
Most processes in pharmaceutical industry are data driven. Company’s ability to capture the data and making use of it will grow in significance and... Sample PDF
Data Mining and Warehousing in Pharma Industry
Chapter 47
Ramdev Kanapady, Aleksandar Lazarevic
The process of implementing and maintaining a structural health monitoring system consists of operational evaluation, data processing, damage... Sample PDF
Data Mining for Damage Detection in Engineering Structures
Chapter 48
Aleksandar Lazarevic
Today computers control power, oil and gas delivery, communication systems, transportation networks, banking and financial services, and various... Sample PDF
Data Mining for Intrusion Detection
Chapter 49
Indranil Bose
Diabetes is a disease worrying hundreds of millions of people around the world. In the USA, the population of diabetic patients is about 15.7... Sample PDF
Data Mining in Diabetes Diagnosis and Detection
Chapter 50
Marvin D. Troutt, Lori K. Long
In this paper, we briefly review and update our earlier work (Long & Troutt, 2003) on the topic of data mining in the human resources area. To gain... Sample PDF
Data Mining in Human Resources
Chapter 51
Les Pang
Data mining has been a successful approach for improving the level of business intelligence and knowledge management throughout an organization.... Sample PDF
Data Mining In the Federal Government
Chapter 52
Pradip Kumar Bala, Shamik Sural, Rabindra Nath Banerjee
Data mining is a set of tools, techniques and methods that can be used to find new, hidden or unexpected patterns from a large volume of data... Sample PDF
Data Mining in the Soft Computing Paradigm
Chapter 53
Colleen Cunningham, Xiaohua Hu
Given the exponential growth rate of medical data and the accompanying biomedical literature, more than 10,000 documents per week (Leroy et al.... Sample PDF
Data Mining Medical Digital Libraries
Chapter 54
Lei Yu, Huan Liu
The advent of gene expression microarray technology enables the simultaneous measurement of expression levels for thousands or tens of thousands of... Sample PDF
Data Mining Methods for Microarray Data Analysis
Chapter 55
Amin A. Abdulghani
Much interest has been expressed in database mining by using association rules (Agrawal, Imielinski, & Swami, 1993). In this article, I provide a... Sample PDF
Data Mining with Cubegrades
Chapter 56
Hai Wang, Shouhong Wang
Survey is one of the common data acquisition methods for data mining (Brin, Rastogi & Shim, 2003). In data mining one can rarely find a survey data... Sample PDF
Data Mining with Incomplete Data
Chapter 57
Carla Marchetti, Massimo Mecella, Monica Scannapieco, Antoninio Virgillito
A Cooperative Information System (CIS) is a large-scale information system that interconnects various systems of different and autonomous... Sample PDF
Data Quality in Cooperative Information Systems
Chapter 58
William E. Winkler
Fayyad and Uthursamy (2002) have stated that the majority of the work (representing months or years) in creating a data warehouse is in cleaning up... Sample PDF
Data Quality in Data Warehouses
Chapter 59
Alexander Thomasian
Data compression is storing data such that it requires less space than usual. Data compression has been effectively used in storing data in a... Sample PDF
Data Reduction and Compression in Database Systems
Chapter 60
Alkis Simitsis, Dimitri Theodoratos
The back-end tools of a data warehouse are pieces of software responsible for the extraction of data from several sources, their cleansing... Sample PDF
Data Warehouse Back-End Tools
Chapter 61
Beixin (Betsy) Lin, Yu Hong, Zu-Hsu Lee
A data warehouse is a large electronic repository of information that is generated and updated in a structured manner by an enterprise over time to... Sample PDF
Data Warehouse Performance
Chapter 62
Richard Mathieu, Reuven R. Levary
Every finished product has gone through a series of transformations. The process begins when manufacturers purchase the raw materials that will be... Sample PDF
Data Warehousing and Mining in Supply Chains
Chapter 63
Hadrian Peter, Charles Greenidge
Modern database systems have incorporated the use of DSS (Decision Support Systems) to augment their decision-making business function and to allow... Sample PDF
Data Warehousing Search Engine
Chapter 64
Juha Kontio
Reporting is one of the basic processes in all organizations. It provides information for planning and decision making and, on the other hand... Sample PDF
Data Warehousing Solutions for Reporting Problems
Chapter 65
Lutz Hamel
Modern, commercially available relational database systems now routinely include a cadre of data retrieval and analysis tools. Here we shed some... Sample PDF
Database Queries, Data Mining, and OLAP
Chapter 66
Patricia E.N. Lutu
In data mining, sampling may be used as a technique for reducing the amount of data presented to a data mining algorithm. Other strategies for data... Sample PDF
Database Sampling for Data Mining
Chapter 67
Yao Chen, Luvai Motiwalla, M. Riaz Khan
The Internet has experienced a phenomenal growth in attracting people and commerce activities over the last decade—from a few thousand people in... Sample PDF
DEA Evaluation of Performance of E-Business Initiatives
Chapter 68
Decision Tree Inudction  (pages 353-358)
Roberta Siciliano, Claudio Conversano
Decision Tree Induction (DTI) is an important step of the segmentation methodology. It can be viewed as a tool for the analysis of large datasets... Sample PDF
Decision Tree Inudction
Chapter 69
Diabetic Data Warehouses  (pages 359-363)
Joseph L. Breault
The National Academy of Sciences convened in 1995 for a conference on massive data sets. The presentation on health care noted that “massive applies... Sample PDF
Diabetic Data Warehouses
Chapter 70
Takao Ito
One of the most important issues in data mining is to discover an implicit relationship between words in a large corpus and labels in a large... Sample PDF
Discovering an Effective Measure in Data Mining
Chapter 71
Richi Nayak
XML is the new standard for information exchange and retrieval. An XML document has a schema that defines the data definition and structure of the... Sample PDF
Discovering Knowledge from XML Documents
Chapter 72
Weiguo Fan, Praveen Pathak
The field of information retrieval deals with finding relevant documents from a large document collection or the World Wide Web in response to a... Sample PDF
Discovering Ranking Functions for Information Retrieval
Chapter 73
Jan H. Kroeze
A very large percentage of business and academic data is stored in textual format. With the exception of metadata, such as author, date, title and... Sample PDF
Discovering Unknown Patterns in Free Text
Chapter 74
Discovery Informatics  (pages 387-391)
William W. Agresti
Discovery informatics is an emerging methodology that brings together several threads of research and practice aimed at making sense out of massive... Sample PDF
Discovery Informatics
Chapter 75
Ying Yang, Geoffrey I. Webb
Discretization is a process that transforms quantitative data into qualitative data. Quantitative data are commonly involved in data mining... Sample PDF
Discretization for Data Mining
Chapter 76
Fabrice Muhlenbach, Ricco Rakotomalala
In the data-mining field, many learning methods — such as association rules, Bayesian networks, and induction rules (Grzymala-Busse & Stefanowski... Sample PDF
Discretization for Continuous Attributes
Chapter 77
Mafruz Zaman Ashrafi, David Taniar, Kate A. Smith
Data mining is an iterative and interactive process that explores and analyzes voluminous digital data to discover valid, novel, and meaningful... Sample PDF
Distributed Association Rule Mining
Chapter 78
Roberto Wolfler Calvo, Fabio de Luigi, Palle Haastrup, Vittorio Maniezzo
The increased human mobility, combined with high use of private cars, increases the load on the environment and raises issues about the quality of... Sample PDF
Distributed Data Management of Daily Car Pooling Problems
Chapter 79
Wen-Chi Hou, Hong Guo, Feng Yan, Qiang Zhu
Sampling has been used in areas like selectivity estimation (Hou & Ozsoyoglu, 1991; Haas & Swami, 1992, Jermaine, 2003; Lipton, Naughton &... Sample PDF
Drawing Representative Samples from Large Databases
Chapter 80
Leonardo Tininini
This paper reviews the main techniques for the efficient calculation of aggregate multidimensional views and data cubes, possibly using specifically... Sample PDF
Efficient Computation of Data Cubes and Aggregate Views
Chapter 81
Juan E. Vargas
In their simplest form, sensors are transducers that convert physical phenomena into electrical signals. By combining recent innovations in wireless... Sample PDF
Embedding Bayesian Networks in Sensor Grids
Chapter 82
Mohamed Salah Hamdi
Data-mining technology delivers two key benefits: (i) a descriptive function, enabling enterprises, regardless of industry or size, in the context... Sample PDF
Employing Neural Networks in Data Mining
Chapter 83
Ji-Rong Wen
Web query log is a type of file keeping track of the activities of the users who are utilizing a search engine. Compared to traditional information... Sample PDF
Enhancing Web Search through Query Log Mining
Chapter 84
Ji-Rong Wen
The Web is an open and free environment for people to publish and get information. Everyone on the Web can be either an author, a reader, or both.... Sample PDF
Enhancing Web Search through Web Structure Mining
Chapter 85
Nikunj C. Oza
Ensemble data mining methods, also known as committee methods or model combiners, are machine learning methods that leverage the power of multiple... Sample PDF
Ensemble Data Mining Methods
Chapter 86
Ethics Of Data Mining  (pages 454-458)
Jack Cook
Decision makers thirst for answers to questions. As more data is gathered, more questions are posed: Which customers are most likely to respond... Sample PDF
Ethics Of Data Mining
Chapter 87
Gary J. DeLorenzo
Ethnographic research offers an orientation to understand the process and structure of a social setting and employs research techniques consistent... Sample PDF
Ethnography to Define Requirements and Data Model
Chapter 88
Paolo Giudici
Several classes of computational and statistical methods for data mining are available. Each class can be parameterised so that models within the... Sample PDF
Evaluation of Data Mining Methods
Chapter 89
Rebecca Boon-Noi Tan
Aggregation is a commonly used operation in decision support database systems. Users of decision support queries are interested in identifying... Sample PDF
Evolution of Data Cube Computational Approaches
Chapter 90
William H. Hsu
A genetic algorithm (GA) is a procedure used to find approximate solutions to search problems through the application of the principles of... Sample PDF
Evolutionary Computation and Genetic Algorithms
Chapter 91
Laetitia Jourdan, Clarisse Dhaenens, El-Ghazali Talbi
Knowledge discovery from genomic data has become an important research area for biologists. Nowadays, a lot of data is available on the Web, but it... Sample PDF
Evolutionary Data Mining for Genomics
Chapter 92
Jorge Muruzabal
Ensemble rule based classification methods have been popular for a while in the machine-learning literature (Hand, 1997). Given the advent of... Sample PDF
Evolutionary Mining of Rule Ensembles
Chapter 93
Yiyu Yao, Yan Zhao
Data mining concerns theories, methodologies, and, in particular, computer systems for knowledge extraction or mining from large amounts of data... Sample PDF
Explanation-Oriented Data Mining
Chapter 94
Zu-Hsu Lee, Richard L. Peterson, Chen-Fu Chien, Ruben Xing
The rapid growth and advances of information technology enable data to be accumulated faster and in much larger quantities (i.e., data warehousing).... Sample PDF
Factor Analysis in Data Mining
Chapter 95
Roberto Kawakami Harrop Galvao, Victor M. Becerra, Magda Abou-Seada
Prediction of corporate financial distress is a subject that has attracted the interest of many researchers in finance. The development of... Sample PDF
Financial Ratio Selection for Distress Classification
Chapter 96
Hong Shen
The discovery of association rules showing conditions of data co-occurrence has attracted the most attention in data mining. An example of an... Sample PDF
Flexible Mining of Association Rules
Chapter 97
Jamil M. Saquer
Formal concept analysis (FCA) is a branch of applied mathematics with roots in lattice theory (Wille, 1982; Ganter & Wille, 1999). It deals with the... Sample PDF
Formal Concept Analysis Based Clustering
Chapter 98
Reinhard Viertl
The results of data warehousing and data mining are depending essentially on the quality of data. Usually data are assumed to be numbers or vectors... Sample PDF
Fuzzy Information and Data Analysis
Chapter 99
Michel Schneider
Basically, the schema of a data warehouse lies on two kinds of elements: facts and dimensions. Facts are used to memorize measures about situations... Sample PDF
General Model for Data Warehouses
Chapter 100
Genetic Programming  (pages 529-533)
William H. Hsu
Genetic programming (GP) is a subarea of evolutionary computation first explored by John Koza (1992) and independently developed by Nichael Lynn... Sample PDF
Genetic Programming
Chapter 101
Ingrid Fischer
As the beginning of the area of artificial neural networks, the introduction of the artificial neuron of McCulloch and Pitts is considered. They... Sample PDF
Graph Transformations and Neural Networks
Chapter 102
Graph-Based Data Mining  (pages 540-545)
Lawrence B. Holder, Diane J. Cook
Graph-based data mining represents a collection of techniques for mining the relational aspects of data represented as a graph. Two major approaches... Sample PDF
Graph-Based Data Mining
Chapter 103
Shichao Zhang, Chengqi Zhang
Multiple data source mining is the process of identifying potentially useful patterns from different data sources, or datasets (Zhang et al., 2003).... Sample PDF
Group Pattern Discovery Systems for Multiple Data Sources
Chapter 104
Benny Yiu-ming Fung, Vincent To-yee Ng
When classifying tumors using gene expression data, mining tasks commonly make use of only a single data set. However, classification models based... Sample PDF
Heterogeneous Gene Data for Classifying Tumors
Chapter 105
Benjamin C.M. Fung, Ke Wang, Martin Ester
Document clustering is an automatic grouping of text documents into clusters so that documents within a cluster have high similarity in comparison... Sample PDF
Hierarchical Document Clustering
Chapter 106
Tsau Young Lin
The principal focus is to examine the foundation of association (rule) mining (AM) via granular computing (GrC). The main results is: The set of all... Sample PDF
High Frequency Patterns in Data Mining
Chapter 107
Bhavani Thuraisingham
Data mining is the process of posing queries to large quantities of data and extracting information often previously unknown using mathematical... Sample PDF
Homeland Security Data Mining and Link Analysis
Chapter 108
Janet Delve
Data Warehousing is now a well-established part of the business and scientific worlds. However, up until recently, data warehouses were restricted... Sample PDF
Humanitites Data Warehousing
Chapter 109
Jörg Andreas Walter
For many tasks of exploratory data analysis, visualization plays an important role. It is a key for efficient integration of human expertise — not... Sample PDF
Hyperbolic Space for Interactive Visualization
Chapter 110
Frank Klawonn, Olga Georgieva
Most clustering methods have to face the problem of characterizing good clusters among noise data. The arbitrary noise points that just do not... Sample PDF
Identifying Single Clusters in Large Data Sets
Chapter 111
Xiaoqiang Liu, Henk Koppelaar, Ronald Hamers, Nico Bruining
Buried within the human body, the heart prohibits direct inspection, so most knowledge about heart failure is obtained by autopsy (in hindsight).... Sample PDF
Immersive Image Mining in Cardiology
Chapter 112
Marvin L. Brown, John F. Kros
Missing or inconsistent data has been a pervasive problem in data analysis since the origin of data collection. The management of missing data in... Sample PDF
Imprecise Data and the Data Mining Process
Chapter 113
Nilmini Wickramasinghe
Today’s economy is increasingly based on knowledge and information (Davenport & Grover, 2001). Knowledge is now recognized as the driver of... Sample PDF
Incorporating the People Perspective into Data mining
Chapter 114
Seokkyung Chung, Jongeun Jun, Dennis McLeod
With the rapid growth of the World Wide Web, Internet users are now experiencing overwhelming quantities of online information. Since manually... Sample PDF
Incremental Mining from News Streams
Chapter 115
Honghua Dai
Inexact fielding learning (IFL) (Ciesieski & Dai, 1994; Dai & Ciesieski, 1994a, 1994b, 1995, 2004; Dai & Li, 2001) is a rough-set, theory-based... Sample PDF
Inexact Field Learning Approach for Data Mining
Chapter 116
Min Song, Il-Yeol Song, Xiaohua Hu, Hyoil Han
Information extraction (IE) technology has been defined and developed through the US DARPA Message Understanding Conferences (MUCs). IE refers to... Sample PDF
Information Extraction in Biomedical Literature
Chapter 117
Instance Selection  (pages 621-624)
Huan Liu, Lei Yu
The amounts of data have become increasingly large in recent years as the capacity of digital data storage worldwide has significantly increased. As... Sample PDF
Instance Selection
Chapter 118
Andreas Koeller
Integration of data sources refers to the task of developing a common schema as well as data transformation solutions for a number of data sources... Sample PDF
Integration of Data Sources through Data Mining
Chapter 119
Intelligence Density  (pages 630-633)
David Sundaram, Victor Portougal
The amount of information that decision makers have to process has been increasing at a tremendous pace. A few years ago it was suggested that... Sample PDF
Intelligence Density
Chapter 120
Intelligent Data Analysis  (pages 634-638)
Xiaohui Liu
Intelligent Data Analysis (IDA) is an interdisciplinary study concerned with the effective analysis of data. IDA draws the techniques from diverse... Sample PDF
Intelligent Data Analysis
Chapter 121
Zbigniew W. Ras, Agnieszka Dardzinska
One way to make query answering system (QAS) intelligent is to assume a hierarchical structure of its attributes. Such systems have been... Sample PDF
Intelligent Query Answering
Chapter 122
Shouhong Wang, Hai Wang
In the data mining field, people have no doubt that high level information (or knowledge) can be extracted from the database through the use of... Sample PDF
Interactive Visual Data Mining
Chapter 123
Pasquale De Meo, Giorgio Terracina, Domenico Ursino
In this article, we illustrate a general approach for the semi-automatic construction and management of data warehouses. Our approach is... Sample PDF
Interscheme Properties' Role in Data Warehouses
Chapter 124
Ling Feng, Tharam Dillon
The discovery of association rules from large amounts of structured or semi-structured data is an important data-mining problem (Agrawal et al.... Sample PDF
Inter-Transactional Association Analysis for Prediction
Chapter 125
Pawan Lingras, Rui Yan, Mofreh Hogo, Chad West
The amount of information that is available in the new information age has made it necessary to consider various summarization techniques.... Sample PDF
Interval Set Representations of Clusters
Chapter 126
Huma Lodhi
Millions of people are suffering from fatal diseases such as cancer, AIDS, and many other bacterial and viral illnesses. The key issue is now how to... Sample PDF
Kernel Methods in Chemoinformatics
Chapter 127
Juan R. Rabuñal Dopico, Daniel Rivero Cebrian, Julián Dorado de la Calle, Nieves Pedreira Souto
The world of Data Mining (Cios, Pedrycz & Swiniarrski, 1998) is in constant expansion. New information is obtained from databases thanks to a wide... Sample PDF
Knowledge Discovery with Artificial Neural Networks
Chapter 128
Marco F. Ramoni, Paola Sebastiani
Born at the intersection of artificial intelligence, statistics, and probability, Bayesian networks (Pearl, 1988) are a representation formalism at... Sample PDF
Learning Bayesian Networks
Chapter 129
Chia-Hui Chang, Chun-Nan Hsu
The explosive growth and popularity of the World Wide Web has resulted in a huge number of information sources on the Internet. However, due to the... Sample PDF
Learning Information Extraction Rules for Web Data Mining
Chapter 130
Carlotta Domeniconi, Dimitrios Gunopulos
Pattern classification is a very general concept with numerous applications ranging from science, engineering, target marketing, medical diagnosis... Sample PDF
Locally Adaptive Techniques for Pattern Classification
Chapter 131
Logical Analysis of Data  (pages 689-692)
Endre Boros, Peter L. Hammer, Toshihide Ibaraki
The logical analysis of data (LAD) is a methodology aimed at extracting or discovering knowledge from data in logical form. The first paper in this... Sample PDF
Logical Analysis of Data
Chapter 132
Giovanni Felici, Klaus Truemper
The method described in this chapter is designed for data mining and learning on logic data. This type of data is composed of records that can be... Sample PDF
Lsquare System for Mining Logic Data
Chapter 133
Marketing Data Mining  (pages 698-704)
Victor S.Y. Lo
Data mining has been widely applied over the past two decades. In particular, marketing is an important application area. Many companies collect... Sample PDF
Marketing Data Mining
Chapter 134
Chien-Hsing Wu, Tzai-Zang Lee
Material acquisition is a time-consuming but important task for a library, because the quality of a library is not in the number of materials that... Sample PDF
Material Acquisitions Using Discovery Informatics Approach
Chapter 135
Giuseppe Sindoni
A hypertext view is a hypertext containing data from an underlying database. The materialization of such hypertexts, that is, the actual storage of... Sample PDF
Materialized Hypertext View Maintenance
Chapter 136
Giuseppe Sindoni
A materialized hypertext view can be defined as “a hypertext containing data coming from a database and whose pages are stored in files” (Sindoni... Sample PDF
Materialized Hypertext Views
Chapter 137
Dimitri Theodoratos, Alkis Simitsis
A data warehouse (DW) is a repository of information retrieved from multiple, possibly heterogeneous, autonomous, distributed databases and other... Sample PDF
Materialized View Selection for Data Warehouse Design
Chapter 138
Tom Burr
One data mining activity is cluster analysis, of which there are several types. One type deserving special attention is clustering that arises due... Sample PDF
Methods for Choosing Clusters in Phylogenetic Trees
Chapter 139
Microarray Data Mining  (pages 728-733)
Li M. Fu
Based on the concept of simultaneously studying the expression of a large number of genes, a DNA microarray is a chip on which numerous probes are... Sample PDF
Microarray Data Mining
Chapter 140
Richard S. Segall
Microarray informatics is a rapidly expanding discipline in which large amounts of multi-dimensional data are compressed into small storage units.... Sample PDF
Microarray Databases for Biotechnology
Chapter 141
Mine Rule  (pages 740-745)
Rosa Meo, Giuseppe Psaila
Mining of association rules is one of the most adopted techniques for data mining in the most widespread application domains. A great deal of work... Sample PDF
Mine Rule
Chapter 142
Soon M. Chung, Murali Mangamuri
Data mining from relations is becoming increasingly important with the advent of parallel database systems. In this paper, we propose a new... Sample PDF
Mining Associations Rules on a NCR Teradata System
Chapter 143
Nicolas Pasquier
In the domain of knowledge discovery in databases and its computational part called data mining, many works addressed the problem of association... Sample PDF
Mining Assocation Rules Using Frequent Closed Itemsets
Chapter 144
Mining Chat Discussions  (pages 758-762)
Stanley Loh, Daniel Licthnow, Thyago Borges, Tiago Primo, Rodrigo Branco Kockhofel, Gabriel Simoes, Gustavo Piltcher, Ramiro Saldana
According to Nonaka and Takeuchi (1995), the majority of the organizational knowledge comes from interactions between people. People tend to reuse... Sample PDF
Mining Chat Discussions
Chapter 145
Gabriele Kern-Isberner
Knowledge discovery refers to the process of extracting new, interesting, and useful knowledge from data and presenting it in an intelligible way to... Sample PDF
Mining Data with Group Theoretical Means
Chapter 146
Mining E-Mail Data  (pages 768-772)
Steffen Bickel, Tobias Scheffer
E-mail has become one of the most important communication media for business and private purposes. Large amounts of past e-mail records reside on... Sample PDF
Mining E-Mail Data
Chapter 147
Yu-Jin Zhang
The growth of the Internet and storage capability not only increasingly makes images a widespread information format on the World Wide Web (WWW)... Sample PDF
Mining for Image Classification Based on Feature Elements
Chapter 148
Yihua Philip Sheng, Wen-Chi Hou, Zhong Chen
The stock market, like other economic phenomena, is a very complex system. Many factors, such as company news, interest rates, macro economic data... Sample PDF
Mining for Profitable Patterns in the Stock Market
Chapter 149
Richi Nayak
A small shop owner builds a relationship with its customers by observing their needs, preferences and buying behaviour. A Web-enabled e-business... Sample PDF
Mining for Web-Enabled E-Business Applications
Chapter 150
Qinghua Zou, Wesley Chu
Pattern decomposition is a data-mining technology that uses known frequent or infrequent patterns to decompose a long itemset into many short ones.... Sample PDF
Mining Frequent Patterns Via Pattern Decomposition
Chapter 151
Mining Group Differences  (pages 795-799)
Shane M. Butler, Geoffrey I. Webb
Finding differences among two or more groups is an important data-mining task. For example, a retailer might want to know what the different is in... Sample PDF
Mining Group Differences
Chapter 152
Mining Historical XML  (pages 800-804)
Qiankun Zhao, Sourav Saha Bhowmick
Nowadays the Web poses itself as the largest data repository ever available in the history of humankind (Reis et al., 2004). However, the... Sample PDF
Mining Historical XML
Chapter 153
Terry Caelli
Most data warehousing and mining involves storing and retrieving data either in numerical or symbolic form, varying from tables of numbers to text.... Sample PDF
Mining Images for Structure
Chapter 154
Mining Microarray Data  (pages 810-814)
Nanxiang Ge, Li Liu
During the last 10 years and in particularly within the last few years, there has been a data explosion associated with the completion of the human... Sample PDF
Mining Microarray Data
Chapter 155
Hong Shen, Susumu Horiguchi
The problem of mining association rules from databases was introduced by Agrawal, Imielinski, & Swami (1993). In this problem, we give a set of... Sample PDF
Mining Quantitative and Fuzzy Association Rules
Chapter 156
Diego Liberati
In many fields of research as in everyday life, one has to face a huge amount of data, often not completely homogeneous and many times without an... Sample PDF
Model Indentification through Data Mining
Chapter 157
Hadrian Peter, Charles Greenidge
Good database design generates effective operational databases through which we can track customers, sales, inventories, and other variables of... Sample PDF
Modeling Web-Based Data in a Data Warehouse
Chapter 158
Kenneth W. Goodman
It has become a commonplace observation that scientific progress often, if not usually, outstrips or precedes the ethical analyses and tools that... Sample PDF
Moral Foundations of Data Mining
Chapter 159
Odej Kao, Ingo la Tendresse
A standard approach for content-based image retrieval (CBIR) is based on the extraction and comparison of features usually related to dominant... Sample PDF
Mosaic-Based Relevance Feedback for Image Retrieval
Chapter 160
Hrishikesh B. Aradhye, Chitra Dorai
The rapid adoption of broadband communications technology, coupled with ever-increasing capacity-to-price ratios for data storage, has made... Sample PDF
Multimodal Analysis in Multimedia Using Symbolic Kernels
Chapter 161
Sach Mukherjee
A number of important problems in data mining can be usefully addressed within the framework of statistical hypothesis testing. However, while the... Sample PDF
Multiple Hypothesis Testing for Data Mining
Chapter 162
Alicja A. Wieczorkowska
Music information retrieval is a multi-disciplinary research on retrieving information from music. This research involves scientists from... Sample PDF
Music Information Retrieval
Chapter 163
Olena Daly, David Taniar
Data Mining is a process of discovering new, unexpected, valuable patterns from existing databases (Chen, Han & Yu, 1996; Fayyad et. al., 1996;... Sample PDF
Negative Association Rules in Data Mining
Chapter 164
Kate A. Smith
Neural networks are simple computational tools for examining data and developing models that help to identify interesting patterns or structures.... Sample PDF
Neural Networks for Prediction and Classification
Chapter 165
Indrani Chakravarty, Nilesh Mishra, Mayank Vatsa, Richa Singh, P. Gupta
The most commonly used protection mechanisms today are based on either what a person possesses (e.g. an ID card) or what the person remembers (like... Sample PDF
Off-Line Signature Recognition
Chapter 166
Rebecca Boon-Noi Tan
Since its origin in the 1970’s research and development into databases systems has evolved from simple file storage and processing systems to... Sample PDF
Onling Analytical Processing Systems
Chapter 167
Indrani Chakravarty, Nilesh Mishra, Mayank Vatsa, Richa Singh, P. Gupta
Security is one of the major issues in today’s world and most of us have to deal with some sort of passwords in our daily lives; but, these... Sample PDF
Online Signature Recognition
Chapter 168
Hamid R. Nemati, Christopher D. Barko
Data mining is now largely recognized as a business imperative and considered essential for enabling the execution of successful organizational... Sample PDF
Organizational Data Mining
Chapter 169
Jorge Cardoso
Business process management systems (BPMSs) (Smith & Fingar, 2003) provide a fundamental infrastructure to define and manage business processes, Web... Sample PDF
Path Mining in Web Processes Using Profiles
Chapter 170
P. Viswanath, M. Narasimha Murty, Shalabh Bhatnagar
Two major problems in applying any pattern recognition technique for large and high-dimensional data are (a) high computational requirements and (b)... Sample PDF
Pattern Synthesis for Large-Scale Pattern Recognition
Chapter 171
Ladjel Bellatreche, Mukesh Mohania
Recently, organizations have increasingly emphasized applications in which current and historical data are analyzed and explored comprehensively... Sample PDF
Physical Data Warehousing Design
Chapter 172
D. R. Mani, Andrew L. Betz, James H. Drew
A structural conflict exists in businesses that sell services whose production costs are discontinuous and whose consumption is continuous but... Sample PDF
Predicting Resource Usage for Capital Efficient Marketing
Chapter 173
Yücel Saygin
Data regarding people and their activities have been collected over the years, which has become more pervasive with widespread usage of the... Sample PDF
Privacy and Confidentiality Issues in Data Mining
Chapter 174
Neha Jha, Shamik Sural
Data mining technology has emerged as a means for identifying patterns and trends from large sets of data. Mining encompasses various algorithms... Sample PDF
Privacy Protection in Association Rule Mining
Chapter 175
Profit Mining  (pages 930-934)
Senqiang Zhou, Ke Wang
A major obstacle in data mining applications is the gap between the statistic-based pattern extraction and the value-based decision-making. “Profit... Sample PDF
Profit Mining
Chapter 176
Pseudo Independent Models  (pages 935-940)
Yang Xiang
Graphical models such as Bayesian networks (BNs) (Pearl, 1988) and decomposable Markov networks (DMNs) (Xiang, Wong & Cercone, 1997) have been... Sample PDF
Pseudo Independent Models
Chapter 177
Marzena Kryszkiewicz
Discovering frequent patterns in large databases is an important data mining problem. The problem was introduced in (Agrawal, Imielinski, & Swami... Sample PDF
Reasoning about Frequent Patterns with Negation
Chapter 178
Hee Beng Kuan Tan, Yuan Zhao
Today, many companies have to deal with problems in maintaining legacy database applications, which were developed on old database technology. These... Sample PDF
Recovery of Data Dependencies
Chapter 179
Dan Zhu
With the explosive growth of information available on the World Wide Web, users must increasingly use automated tools to find, extract, filter, and... Sample PDF
Reinforcing CRM with Data Mining
Chapter 180
Dimitrios Katsaros, Gökhan Yavas, Alexandros Nanopoulos, Murat Karakaya, Özgür Ulusoy, Yannis Manolopoulos
During the past years, we have witnessed an explosive growth in our capabilities to both generate and collect data. Advances in scientific data... Sample PDF
Resource Allocation in Wireless Networks
Chapter 181
Luis M. de Campos, Juan M. Fernandez-Luna, Juan F. Huete
Bayesian networks (Jensen, 2001) are powerful tools for dealing with uncertainty. They have been successfully applied in a wide range of domains... Sample PDF
Retrieving Medical Records Using Bayesian Networks
Chapter 182
Brain C. Lovell, Shaokang Chen
While the technology for mining text documents in large databases could be said to be relatively mature, the same cannot be said for mining other... Sample PDF
Robust Face Recognition for Data Mining
Chapter 183
Jerzy W. Grzymala-Busse, Wojciech Ziarko
Discovering useful models capturing regularities of natural phenomena or complex systems until recently was almost entirely limited to finding... Sample PDF
Rough Sets and Data Mining
Chapter 184
Marco Muselli
One of the most relevant problems in artificial intelligence is allowing a synthetic device to perform inductive reasoning, i.e. to infer a set of... Sample PDF
Rule Generation Methods Based on Logic Synthesis
Chapter 185
Ivan Bruha
Within the past several years, research in decision-supporting systems has been investigating the possibilities of enhancing their overall... Sample PDF
Rule Qualities and Knowledge Combination for Decision-Making
Chapter 186
Gautam Das
In recent years, advances in data collection and management technologies have led to a proliferation of very large databases. These large data... Sample PDF
Sampling Methods in Approximate Query Answering Systems
Chapter 187
Mike Thelwall
Scientific Web Intelligence (SWI) is a research field that combines techniques from data mining, Web intelligence, and scientometrics to extract... Sample PDF
Scientific Web Intelligence
Chapter 188
Nils Pharo, Kalervo Jarvelin
In order to understand the nature of Web information search processes it is necessary to identify the interplay of factors at the micro-level, that... Sample PDF
Search Situations and Transitions
Chapter 189
Yehida Lindell
The increasing use of data-mining tools in both the public and private sectors raises concerns regarding the potentially sensitive nature of much of... Sample PDF
Secure Multiparty Computation for Privacy Preserving Data Mining
Chapter 190
Semantic Data Mining  (pages 1010-1014)
Protima Banerjee, Xiaohua Hu, Illhoi Yoo
Over the past few decades, data mining has emerged as a field of research critical to understanding and assimilating the large stores of data... Sample PDF
Semantic Data Mining
Chapter 191
Ludovic Denoyer, Patrick Gallinari
Document classification developed over the last 10 years, using techniques originating from the pattern recognition and machine-learning... Sample PDF
Semi-Structured Document Classification
Chapter 192
Semi-Supervised Learning  (pages 1022-1027)
Tobias Scheffer
For many classification problems, unlabeled training data are inexpensive and readily available, whereas labeling training data imposes costs.... Sample PDF
Semi-Supervised Learning
Chapter 193
Sequential Pattern Mining  (pages 1028-1032)
Florent Masseglia, Maguelonne Teisseire, Pascal Poncelet
Sequential pattern mining deals with data represented as sequences (a sequence contains sorted sets of items). Compared to the association rule... Sample PDF
Sequential Pattern Mining
Chapter 194
Software Warehouse  (pages 1033-1036)
Honghua Dai
A software warehouse is a facility providing an effective and yet efficient mechanism to store, manage, and utilize existing software resources... Sample PDF
Software Warehouse
Chapter 195
Wenyuan Li
With the rapid growth of the World Wide Web and the capacity of digital data storage, tremendous amount of data are generated daily from business... Sample PDF
Spectral Methods for Data Clustering
Chapter 196
Statistical Data Editing  (pages 1043-1047)
Claudio Conversano, Roberta Siciliano
Statistical Data Editing (SDE) is the process of checking data for errors and correcting them. Winkler (1999) defined it as the set of methods used... Sample PDF
Statistical Data Editing
Chapter 197
Maria Vardaki
The term metadata is frequently considered in many different sciences. Statistical metadata is a term generally used to denote data about data.... Sample PDF
Statistical Metadata in Data Processing and Interchange
Chapter 198
Xinjian Lu
A data warehouse stores and manages historical data for on-line analytical processing, rather than for on-line transactional processing. Data... Sample PDF
Storage Strategies in Data Warehouses
Chapter 199
Subgraph Mining  (pages 1059-1063)
Ingrid Fischer, Thorsten Meinl
The amount of available data is increasing very fast. With this data the desire for data mining is also growing. More and larger databases have to... Sample PDF
Subgraph Mining
Chapter 200
Support Vector Machines  (pages 1064-1070)
Mamoun Awad, Latifur Khan
The availability of reliable learning systems is of strategic importance, as many tasks cannot be solved by classical programming techniques... Sample PDF
Support Vector Machines
Chapter 201
David R. Musicant
In recent years, massive quantities of business and research data have been collected and stored, partly due to the plummeting cost of data storage.... Sample PDF
Support Vector Machines Illuminated
Chapter 202
Qiyang Chen, Alan Oppenheim, Dajin Wang
Survival analysis (SA) consists of a variety of methods for analyzing the timing of events and/or the times of transition among several states or... Sample PDF
Survival Analysis and Data Mining
Chapter 203
Symbiotic Data Mining  (pages 1083-1086)
Kuriakose Athappilly, Alan Rea
Symbiotic data mining is an evolutionary approach to how organizations analyze, interpret, and create new knowledge from large pools of data.... Sample PDF
Symbiotic Data Mining
Chapter 204
Symbolic Data Clustering  (pages 1087-1091)
Edwin Diday, M. Narasimha Murthy
In data mining, we generate class/cluster models from large datasets. Symbolic Data Analysis (SDA) is a powerful tool that permits dealing with... Sample PDF
Symbolic Data Clustering
Chapter 205
Hakikur Rahman
Today’s fast moving business world faces continuous challenges and abrupt changes in real-life situations at the context of data and information... Sample PDF
Synthesis with Data Warehouse Applications and Utilities
Chapter 206
Sherri K. Harms
The emergence of remote sensing, scientific simulation and other survey technologies has dramatically enhanced our capabilities to collect temporal... Sample PDF
Temporal Association Rule Mining in Event Sequences
Chapter 207
Víctor Fresno Fernandez, Luis Magdalena Layos
Since the creation of the Web until now, the Internet has become the greatest source of information available in the world. The Web is defined as a... Sample PDF
Text Content Approaches in Web Content Mining
Chapter 208
Dunja Mladenic
Intensive usage and growth of the World Wide Web and the daily increasing amount of text information in electronic form have resulted in a growing... Sample PDF
Text Mining-Machine Learning on Documents
Chapter 209
Han-Joon Kim
We have recently seen a tremendous growth in the volume of online text documents from networked resources such as the Internet, digital libraries... Sample PDF
Text Mining Methods for Hierarchical Document Indexing
Chapter 210
Mehmet Sayal
A time series is a sequence of data values that are recorded with equal or varying time intervals. Time series data usually includes timestamps that... Sample PDF
Time Series Analysis and Mining Techniques
Chapter 211
Time Series Data Forecasting  (pages 1125-1129)
Vincent Cho
Businesses are recognizing the value of data as a strategic asset. This is reflected by the high degree of interest in new technologies such as data... Sample PDF
Time Series Data Forecasting
Chapter 212
Hsin-Chang Yang, Chung-Hong Lee
Topic maps provide a general, powerful, and user-oriented way to navigate the information resources under consideration in any specific domain. A... Sample PDF
Topic Maps Generation by Text Mining
Chapter 213
Transferable Belief Model  (pages 1135-1139)
Philippe Smets
This note is a very short presentation of the transferable belief model (TBM), a model for the representation of quantified beliefs based on belief... Sample PDF
Transferable Belief Model
Chapter 214
Tree and Graph Mining  (pages 1140-1145)
Dimitrios Katsaros, Yannis Manolopoulos
During the past decade, we have witnessed an explosive growth in our capabilities to both generate and collect data. Various data mining techniques... Sample PDF
Tree and Graph Mining
Chapter 215
Anita Lee-Post, Haihao Jin
Web mining is the use of data mining techniques to automatically discover and extract information from Web documents and services. This area of... Sample PDF
Trends in Web Content and Structure Mining
Chapter 216
Trends in Web Usage Mining  (pages 1151-1154)
Anita Lee-Post, Haihao Jin
In this paper, we will discuss research efforts devoted to the remaining area of Web mining, namely Web usage mining. Taken together, a complete... Sample PDF
Trends in Web Usage Mining
Chapter 217
Diego Liberati, Sergio Bittanti, Simone Garatti
Micro-arrays technology has marked a substantial improvement in making available a huge amount of data about gene expression in pathophysiological... Sample PDF
Unsupervised Mining of Genes Classifying Leukemia
Chapter 218
Jan Owens, Suresh Chalasani, Jayavel Sounderpandian
The use of Radio Frequency Identification (RFID) is becoming prevalent in supply chains, with large corporations such as Wal-Mart, Tesco, and the... Sample PDF
Use of RFID in Supply Chain Data Processing
Chapter 219
Malcolm J. Beynon
The origins of Dempster-Shafer theory (DST) go back to the work by Dempster (1967) who developed a system of upper and lower probabilities.... Sample PDF
Using Dempster-Shafer Theory in Data Mining
Chapter 220
Jaroslav Zendulka
There are three standardization initiatives concerning application programming interfaces (API) for data mining — OLE DB for Data Mining (OLE DB for... Sample PDF
Using Standard APIs for Data Mining in Prediction
Chapter 221
Malcolm J. Beynon
The seminal work of Zadeh (1965), fuzzy set theory (FST) has developed into a methodology fundamental to analysis that incorporates vagueness and... Sample PDF
Utilizing Fuzzy Decision Trees in Decision Making
Chapter 222
Vertical Data Mining  (pages 1181-1184)
William Perrizo, Qiang Ding, Qin Ding, Taufik Abidin
The volume of data keeps increasing. There are many data sets that have become extremely large. It is of importance and a challenge to develop... Sample PDF
Vertical Data Mining
Chapter 223
Video Data Mining  (pages 1185-1189)
Jung Hwan Oh, Jeong Kyu Lee, Sae Hwang
Data mining, which is defined as the process of extracting previously unknown knowledge and detecting interesting patterns from a massive set of... Sample PDF
Video Data Mining
Chapter 224
Herna L. Viktor, Eric Paquet
The current explosion of data and information, mainly caused by data warehousing technologies as well as the extensive use of the Internet and its... Sample PDF
Visualization Techniques for Data Mining
Chapter 225
Cyrus Shahabi, Dimitris Sacharidis, Mehrdad Jahangiri
Following the constant technological advancements that provide more processing power and storage capacity, scientific applications have emerged as a... Sample PDF
Wavelets for Querying Multidimensional Datasets
Chapter 226
Massimiliano Caramia, Giovanni Felici
The recent improvements of search engine technologies have made available to Internet users an enormous amount of knowledge that can be accessed in... Sample PDF
Web Mining in Thematic Search Engines
Chapter 227
Web Mining Overview  (pages 1206-1210)
Bamshad Mobasher
In the span of a decade, the World Wide Web has been transformed from a tool for information sharing among researchers into an indispensable part of... Sample PDF
Web Mining Overview
Chapter 228
Anthony Scime
Data warehouses are constructed to provide valuable and current information for decision-making. Typically this information is derived from the... Sample PDF
Web Page Extension of Data Warehouses
Chapter 229
Web Usage Mining  (pages 1216-1220)
Bamshad Mobasher
With the continued growth and proliferation of e-commerce, Web services, and Web-based information systems, the volumes of clickstream and user data... Sample PDF
Web Usage Mining
Chapter 230
Yongjian Fu
With the rapid development of the World Wide Web or the Web, many organizations now put their information on the Web and provide Web-based services... Sample PDF
Web Usage Mining and Its Applications
Chapter 231
Bamshad Mobasher
Web usage mining refers to the automatic discovery and analysis of patterns in clickstream and associated data collected or generated as a result of... Sample PDF
Web Usage Mining Data Preparation
Chapter 232
Paolo Giudici, Paola Cerchiello
The aim of this contribution is to show how the information, concerning the order in which the pages of a Web site are visited, can be profitably... Sample PDF
Web Usage Mining through Associative Models
Chapter 233
Olfa Nasraoui
The Web information age has brought a dramatic increase in the sheer amount of information (Web content), in the access to this information (Web... Sample PDF
World Wide Web Personalization
Chapter 234
World Wide Web Usage Mining  (pages 1242-1248)
Wen-Chen Hu, Hung-Jen Yang, Chung-wei Lee, Jyh-haw Yeh
World Wide Web data mining includes content mining, hyperlink structure mining, and usage mining. All three approaches attempt to extract knowledge... Sample PDF
World Wide Web Usage Mining
About the Editor
Index of Key Terms