Count Models for Software Quality Estimation
Kehan Gao (Eastern Connecticut State University, USA) and Taghi M. Khoshgoftaar (Florida Atlantic University, USA)
Copyright © 2009. 7 pages.
DOI: 10.4018/978-1-60566-010-3.ch055
MLA
Gao, Kehan and Taghi M. Khoshgoftaar. "Count Models for Software Quality Estimation." Encyclopedia of Data Warehousing and Mining, Second Edition. IGI Global, 2009. 346-352. Web. 19 May. 2013. doi:10.4018/978-1-60566-010-3.ch055
APA
Gao, K., & Khoshgoftaar, T. M. (2009). Count Models for Software Quality Estimation. In J. Wang (Ed.), Encyclopedia of Data Warehousing and Mining, Second Edition (pp. 346-352). Hershey, PA: Information Science Reference. doi:10.4018/978-1-60566-010-3.ch055
Chicago
Gao, Kehan and Taghi M. Khoshgoftaar. "Count Models for Software Quality Estimation." In Encyclopedia of Data Warehousing and Mining, Second Edition, ed. John Wang, 346-352 (2009), accessed May 19, 2013. doi:10.4018/978-1-60566-010-3.ch055
Abstract
Timely and accurate prediction of the quality of software modules in the early stages of the software development life cycle is very important in the field of software reliability engineering. With such predictions, a software quality assurance team can assign its limited quality improvement resources to the areas that need them and prevent problems from occurring during system operation. Software metrics-based quality estimation models are tools that can achieve such predictions. They are generally of two types: a classification model that predicts the class membership of modules into two or more quality-based classes (Khoshgoftaar et al., 2005b), and a quantitative prediction model that estimates the number of faults (or some other quality factor) that are likely to occur in software modules (Ohlsson et al., 1998). In recent years, a variety of techniques have been developed for software quality estimation (Briand et al., 2002; Khoshgoftaar et al., 2002; Ohlsson et al., 1998; Ping et al., 2002), most of which are suited for either prediction or classification, but not for both. For example, logistic regression (Khoshgoftaar & Allen, 1999) can only be used for classification, whereas multiple linear regression (Ohlsson et al., 1998) can only be used for prediction. Some software quality estimation techniques, such as case-based reasoning (Khoshgoftaar & Seliya, 2003), can be used to calibrate both prediction and classification models; however, they require a distinct modeling approach for each type of model. In contrast to such software quality estimation methods, count models such as the Poisson regression model (PRM) and the zero-inflated Poisson (ZIP) regression model (Khoshgoftaar et al., 2001) can yield both types of model with a single modeling approach. Moreover, count models are capable of providing the probability that a module has a given number of faults.
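As an illustration of how one count model can serve both purposes, the following is a minimal sketch, not the authors' implementation. It assumes a log-linear Poisson mean, a fixed zero-inflation probability, and a simple threshold rule for classification; the coefficients, the two software metrics, and the threshold are all hypothetical values chosen only for the example.

```python
import math


def poisson_pmf(k: int, lam: float) -> float:
    """P(Y = k) for a Poisson-distributed fault count with mean lam."""
    return math.exp(-lam) * lam ** k / math.factorial(k)


def zip_pmf(k: int, lam: float, pi: float) -> float:
    """P(Y = k) under a zero-inflated Poisson model.

    With probability pi the module is a structural zero (fault-free);
    otherwise its fault count follows Poisson(lam).
    """
    base = (1.0 - pi) * poisson_pmf(k, lam)
    return pi + base if k == 0 else base


def expected_faults(metrics, coefs, intercept):
    """Log-linear mean: lam = exp(intercept + sum(b_j * x_j))."""
    return math.exp(intercept + sum(b * x for b, x in zip(coefs, metrics)))


def classify(lam: float, threshold: float = 1.0) -> str:
    """Hypothetical rule: fault-prone if the predicted count reaches threshold."""
    return "fault-prone" if lam >= threshold else "not fault-prone"


# Hypothetical fitted coefficients and one module's metrics
# (lines of code, cyclomatic complexity); not taken from the chapter.
coefs = [0.002, 0.05]
intercept = -1.0
module = [400.0, 10.0]

lam = expected_faults(module, coefs, intercept)   # quantitative prediction
label = classify(lam)                             # classification
p_two_faults = poisson_pmf(2, lam)                # probability of exactly 2 faults
p_zero_zip = zip_pmf(0, lam, pi=0.3)              # zero probability under ZIP
```

The same fitted mean `lam` drives the fault-count estimate, the class label, and the per-count probabilities, which is the property the abstract attributes to count models. In practice the coefficients (and, for ZIP, the zero-inflation probability) would be estimated from data rather than fixed by hand.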
Despite the attractiveness of calibrating software quality estimation models with count modeling techniques, we feel that their application in software reliability engineering has been very limited (Khoshgoftaar et al., 2001). This study can be used as a basis for assessing the usefulness of count models for predicting the number of faults and quality-based class of software modules.
Complete Chapter List
| 1. |
Zbigniew W. Ras (University of North Carolina, Charlotte, USA), Elzbieta Wyrzykowska (University of Information Technology and Management, Warsaw, Poland), Li-Shiang Tsay (North Carolina A&T State University, USA)
There are two aspects of interestingness of rules that have been studied in data mining literature, objective and subjective measures (Liu et al., 1997), (Adomaviciu...
| 2. |
Ion Muslea (SRI International, USA)
Inductive learning algorithms typically use a set of labeled examples to learn class descriptions for a set of user-specified concepts of interest. In practice, labe...
| 3. |
Xueping Li (University of Tennessee, Knoxville, USA)
The Internet has become a popular medium to disseminate information and a new platform to conduct electronic business (e-business) and electronic commerce (e-commerc...
| 4. |
Hadrian Peter (University of the West Indies, Barbados)
Data warehouses have established themselves as necessary components of an effective IT strategy for large businesses. To augment the streams of data being siphoned f...
| 5. |
Dan Zhu (Iowa State University, USA)
With the advent of technology, information is available in abundance on the World Wide Web. In order to have appropriate and useful information users must increasing...
| 6. |
Chun-Che Huang (National Chi Nan University, Taiwan), Tzu-Liang ("Bill") Tseng (The University of Texas at El Paso, USA)
Information technology and Internet techniques are developing rapidly. Interaction between enterprises and customers has dramatically changed. It becomes critica...
| 7. |
Lisa Friedland (University of Massachusetts Amherst, USA)
In traditional data analysis, data points lie in a Cartesian space, and an analyst asks certain questions: (1) What distribution can I fit to the data? (2) Which poi...
| 8. |
J. Ben Schafer (University of Northern Iowa, USA)
In a world where the number of choices can be overwhelming, recommender systems help users find and evaluate items of interest. They connect users with items to “con...
| 9. |
Gustavo Camps-Valls (Universitat de València, Spain), Manel Martínez-Ramón (Universidad Carlos III de Madrid, Spain), José Luis Rojo-Álvarez (Universidad Rey Juan Carlos, Spain)
In this chapter, we give a survey of applications of the kernel methods introduced in the previous chapter. We focus on different application domains that are partic...
| 10. |
Sandra Elizabeth González Císaro (Universidad Nacional del Centro de la Provincia de Buenos Aires, Argentina)
Much information stored in current databases is not always present at necessary different levels of detail or granularity for Decision-Making Processes (DMP). Some o...
| 11. |
Wenxue Huang (Generation5 Mathematical Technologies, Inc., Canada), Milorad Krneta (Generation5 Mathematical Technologies, Inc., Canada), Limin Lin (Generation5 Mathematical Technologies, Inc., Canada), Jianhong Wu (Mathematics and Statistics Department, Yo)
An association pattern describes how a group of items (for example, retail products) are statistically associated together, and a meaningful association pattern iden...
| 12. |
Vassilios S. Verykios (University of Thessaly, Greece)
The enormous expansion of data collection and storage facilities has created an unprecedented increase in the need for data analysis and processing power. Data minin...
| 13. |
Yew-Kwong Woon (Nanyang Technological University, Singapore)
Association Rule Mining (ARM) is concerned with how items in a transactional database are grouped together. It is commonly known as market basket analysis, because i...
| 14. |
Luminita Dumitriu (“Dunarea de Jos” University, Romania)
The concept of Quantitative Structure-Activity Relationship (QSAR), introduced by Hansch and co-workers in the 1960s, attempts to discover the relationship between t...
| 15. |
Anne Denton (North Dakota State University, USA)
Most data of practical relevance are structured in more complex ways than is assumed in traditional data mining algorithms, which are based on a single table. The co...
| 16. |
Martine Cadot (University of Henri Poincaré/LORIA, Nancy, France), Jean-Baptiste Maj (LORIA/INRIA, France), Tarek Ziadé (NUXEO, France)
A manager would like to have a dashboard of his company without manipulating data. Usually, statistics have solved this challenge, but nowadays, data have changed (J...
| 17. |
Zheng-Hua Tan (Aalborg University, Denmark)
The explosive increase in computing power, network bandwidth and storage capacity has largely facilitated the production, transmission and storage of multimedia data...
| 18. |
Gaël Richard (Ecole Nationale Supérieure des Télécommunications (TELECOM ParisTech), France)
The enormous amount of unstructured audio data available nowadays and the spread of its use as a data source in many applications are introducing new challenges to r...
| 19. |
Jamel Feki (Mir@cl Laboratory, Université de Sfax, Tunisia)
Within today’s competitive economic context, information acquisition, analysis and exploitation became strategic and unavoidable requirements for every enterprise. M...
| 20. |
Xiaoyan Yu (Virginia Tech, USA), Manas Tungare (Virginia Tech, USA), Weiguo Fan (Virginia Tech, USA), Manuel Pérez-Quiñones (Virginia Tech, USA), Edward A. Fox (Virginia Tech, USA), William Cameron (Villanova University, USA), Lillian Cassel (Villanova University, USA)
Starting with a vast number of unstructured or semistructured documents, text mining tools analyze and sift through them to present to users more valuable informatio...
| 21. |
Xin Zhang (University of North Carolina at Charlotte, USA)
Music information indexing based on timbre helps users to get relevant musical data in large digital music databases. Timbre is a quality of sound that distinguishes...
| 22. |
Shu-Chiang Lin (Purdue University, USA)
Many task analysis techniques and methods have been developed over the past decades, but identifying and decomposing a user’s task into small task components remains...
| 23. |
Yinghui Yang (University of California, Davis, USA)
Customer segmentation is the process of dividing customers into distinct subsets (segments or clusters) that behave in the same way or have similar needs. Because ea...
| 24. |
Les Pang (University of Maryland University College, USA)
Data warehousing has been a successful approach for supporting the important concept of knowledge management, one of the keys to organizational success at the enterp...
| 25. |
Scott Nicholson (Syracuse University School of Information Studies, USA)
Most people think of a library as the little brick building in the heart of their community or the big brick building in the center of a college campus. However, the...
| 26. |
Gustavo Camps-Valls (Universitat de València, Spain), Alistair Morgan Chalk (Eskitis Institute for Cell and Molecular Therapies, Griffiths University, Australia)
Bioinformatics is a new, rapidly expanding field that uses computational approaches to answer biological questions (Baxevanis, 2005). These questions are answered by...
| 27. |
Jieping Ye (Arizona State University, USA), Ravi Janardan (University of Minnesota, USA), Sudhir Kumar (Arizona State University, USA)
Understanding the roles of genes and their interactions is one of the central challenges in genome research. One popular approach is based on the analysis of microar...
| 28. |
Ladjel Bellatreche (Poitiers University, France)
Scientific databases and data warehouses store large amounts of data with several tables and attributes. For instance, the Sloan Digital Sky Survey (SDSS) astronomica...
| 29. |
Lei Tang (Arizona State University, USA), Huan Liu (Arizona State University, USA), Jiangping Zhang (The MITRE Corporation, USA)
The unregulated and open nature of the Internet and the explosive growth of the Web create a pressing need to provide various services for content categorization. Th...
| 30. |
Arla Juntunen (Department of Marketing and Management Helsinki School of Economics, Finland)
The high level objectives of public authorities are to create value at minimal cost, and achieve ongoing support and commitment from its funding authority. Similar t...
| 31. |
Johannes Gehrke (Cornell University, USA)
It is the goal of classification and regression to build a data mining model that can be used for prediction. To construct such a model, we are given a set of traini...
| 32. |
Aijun An (York University, Canada)
Generally speaking, classification is the action of assigning an object to a category according to the characteristics of the object. In data mining, classification...
| 33. |
Andrzej Dominik (Warsaw University of Technology, Poland)
Classification is a classical and fundamental data mining (machine learning) task in which individual items (objects) are divided into groups (classes) based on thei...
| 34. |
Xinghua Fan (Chongqing University of Posts and Telecommunications, China)
Text categorization (TC) is a task of assigning one or multiple predefined category labels to natural language texts. To deal with this sophisticated task, a variety...
| 35. |
Frank Klawonn (University of Applied Sciences Braunschweig/Wolfenbuettel, Germany), Frank Rehm (German Aerospace Center, Germany)
For many applications in knowledge discovery in databases finding outliers, rare events, is of importance. Outliers are observations, which deviate significantly fro...
| 36. |
Tom Burr (Los Alamos National Laboratory, USA)
One data mining activity is cluster analysis, which consists of segregating study units into relatively homogeneous groups. There are several types of cluster analys...
| 37. |
Dingxi Qiu (University of Miami, USA), Edward C. Malthouse (Northwestern University, USA)
Cluster analysis is a set of statistical models and algorithms that attempt to find “natural groupings” of sampling units (e.g., customers, survey respondents, plant...
| 38. |
Ricardo Vilalta (University of Houston, USA), Tomasz Stepinski (Lunar and Planetary Institute, USA)
Spacecraft orbiting a selected suite of planets and moons of our solar system are continuously sending long sequences of data back to Earth. The availability of suc...
| 39. |
Athman Bouguettaya (CSIRO ICT Center, Australia), Qi Yu (Virginia Tech, USA)
Clustering analysis has been widely applied in diverse fields such as data mining, access structures, knowledge discovery, software engineering, organization of info...
| 40. |
Joshua Zhexue Huang (The University of Hong Kong, Hong Kong)
A lot of data in real world databases are categorical. For example, gender, profession, position, and hobby of customers are usually defined as categorical attribute...
| 41. |
Mei Li (Microsoft Corporation, USA), Wang-Chien Lee (Pennsylvania State University, USA)
With the advances in network communication, many large scale network systems have emerged. Peer-to-peer (P2P) systems, where a large number of nodes self-form into a...
| 42. |
Anne Denton (North Dakota State University, USA)
Time series data is of interest to most science and engineering disciplines and analysis techniques have been developed for hundreds of years. There have, however, i...
| 43. |
Sheng Ma (Machine Learning for Systems IBM T. J. Watson Research Center, USA), Tao Li (School of Computer Science, Florida International University, USA)
Clustering data into sensible groupings, as a fundamental and effective tool for efficient data organization, summarization, understanding and learning, has been the...
| 44. |
Richard S. Segall (Arkansas State University, USA)
This chapter discusses four selected software packages for data mining that are not available as free open-source software. The four selected packages are SAS...
| 45. |
Eamonn Keogh (University of California - Riverside, USA), Li Keogh (Google, Inc., USA), John C. Handley (Xerox Innovation Group, USA)
Compression-based data mining is a universal approach to clustering, classification, dimensionality reduction, and anomaly detection. It is motivated by results in b...
| 46. |
Amin A. Abdulghani (Data Mining Engineer, USA)
The focus of online analytical processing (OLAP) is to provide a platform for analyzing data (e.g., sales data) with multiple dimensions (e.g., product, location, ti...
| 47. |
Elzbieta Malinowski (Universidad de Costa Rica, Costa Rica), Esteban Zimányi (Université Libre de Bruxelles, Belgium)
The advantages of using conceptual models for database design are well known. In particular, they facilitate the communication between users and designers since they...
| 48. |
Brad Morantz (Science Applications International Corporation, USA)
Mining a large data set can be time consuming, and without constraints, the process could generate sets or rules that are invalid or redundant. Some methods, for exa...
| 49. |
Carson Kai-Sang Leung (The University of Manitoba, Canada)
The problem of association rule mining was introduced in 1993 (Agrawal et al., 1993). Since then, it has been the subject of numerous studies. Most of these studies...
| 50. |
Francesco Bonchi (ISTI-C.N.R., Italy)
Devising fast and scalable algorithms, able to crunch huge amounts of data, was for many years one of the main goals of data mining research. But then we realized tha...
| 51. |
Alexander Smirnov (Institution of the Russian Academy of Sciences, St. Petersburg Institute for Informatics and Automation RAS, Russia)
Decisions in the modern world are often made in rapidly changing, sometimes unexpected, situations. Such situations require availability of systems / tools allowing...
| 52. |
Marko Robnik-Šikonja (University of Ljubljana, FRI)
The research in machine learning, data mining, and statistics has provided a number of methods that estimate the usefulness of an attribute (feature) for prediction...
| 53. |
Yi-Cheng Tu (University of South Florida, USA), Gang Ding (Olympus Communication Technology of America, Inc., USA)
Database administration (tuning) is the process of adjusting database configurations in order to accomplish desirable performance goals. This job is performed by hum...
| 54. |
Victor S. Sheng (New York University, USA), Charles X. Ling (The University of Western Ontario, Canada)
Classification is the most important task in inductive learning and machine learning. A classifier can be trained from a set of training examples with class labels,...
| 55. |
Kehan Gao (Eastern Connecticut State University, USA), Taghi M. Khoshgoftaar (Florida Atlantic University, USA)
Timely and accurate prediction of the quality of software modules in the early stages of the software development life cycle is very important in the field of softwa...
| 56. |
Christine W. Chan (University of Regina, Canada)
An economic evaluation of a new oil well is often required, and this evaluation depends heavily on how accurately production of the well can be estimated. Unfortunat...
| 57. |
Seunghyun Im (University of Pittsburgh at Johnstown, USA), Zbigniew W. Ras (University of North Carolina, Charlotte, USA)
This article discusses data security in Knowledge Discovery Systems (KDS). In particular, we present the problem of confidential data reconstruction by Chase (Dardz...
| 58. |
Alfredo Cuzzocrea (University of Calabria, Italy)
OnLine Analytical Processing (OLAP) research issues (Gray, Chaudhuri, Bosworth, Layman, Reichart & Venkatrao, 1997) such as data cube modeling, representation, index...
| 59. |
Junjie Wu (Tsinghua University, China), Jian Chen (Tsinghua University, China), Hui Xiong (Rutgers University, USA)
Cluster analysis (Jain & Dubes, 1988) provides insight into the data by dividing the objects into groups (clusters), such that objects in a cluster are more similar...
| 60. |
John M. Artz (The George Washington University, USA)
Although data warehousing theory and technology have been around for well over a decade, they may well be the next hot technologies. How can it be that a technology...
| 61. |
Esma Aïmeur (Université de Montréal, Canada)
With the emergence of Internet, it is now possible to connect and access sources of information and databases throughout the world. At the same time, this raises man...
| 62. |
Paola Cerchiello (University of Pavia, Italy)
The aim of this contribution is to show one of the most important applications of text mining. According to a wide part of the literature regarding the aforementioned...
| 63. |
Joaquín Ordieres-Meré (University of La Rioja, Spain), Manuel Castejón-Limas (University of León, Spain), Ana González-Marcos (University of León, Spain)
Industrial plants, beyond subsisting, strive to be leaders in increasingly competitive and dynamic markets. In this environment, quality management and technolog...
| 64. |
Soo Kim (Montclair State University, USA)
Some people say that “success or failure often depends not only on how well you are able to collect data but also on how well you are able to convert them into knowl...
| 65. |
Roberto Marmo (University of Pavia, Italy)
As a consequence of the expansion of modern technology, the number and scenarios of fraud are increasing dramatically. Therefore, the reputation blemish and losses caused...
| 66. |
Lior Rokach (Ben-Gurion University, Israel)
In many modern manufacturing plants, data that characterize the manufacturing process are electronically collected and stored in the organization’s databases. Thus,...
| 67. |
Luciana Dalla Valle (University of Milan, Italy)
The term “internationalization” refers to the process of international expansion of firms realized through different mechanisms such as export, strategic alliances a...
| 68. |
Silvia Figini (University of Pavia, Italy)
Customer lifetime value (LTV, see e.g. Bauer et al. 2005 and Rosset et al. 2003), which measures the profit generating potential, or value, of a customer, is increas...
| 69. |
Diego Liberati (Italian National Research Council, Italy)
In many fields of research, as well as in everyday life, it often turns out that one has to face a huge amount of data, without an immediate grasp of an underlying s...
| 70. |
Mª Dolores del Castillo (Instituto de Automática Industrial (CSIC), Spain)
Email is now an indispensable communication tool and its use is continually growing. This growth brings with it an increase in the number of electronic threats that...
| 71. |
Ramdev Kanapady (University of Minnesota, USA), Aleksandar Lazarevic (United Technologies Research Center, USA)
Structural health monitoring denotes the ability to collect data about critical engineering structural elements using various sensors and to detect and interpret adv...
| 72. |
Ng Yew Seng (National University of Singapore, Singapore), Rajagopalan Srinivasan (National University of Singapore and Institute of Chemical & Engineering Sciences, Singapore)
Advancements in sensors and database technologies have resulted in the collection of huge amounts of process data from chemical plants. A number of process quantitie...
| 73. |
Tom Burr (Los Alamos National Laboratory, USA)
The genetic basis for some human diseases, in which one or a few genome regions increase the probability of acquiring the disease, is fairly well understood. For exa...
| 74. |
Haipeng Wang (Institute of Computing Technology & Graduate University of Chinese Academy of Sciences, China)
Protein identification (sequencing) by tandem mass spectrometry is a fundamental technique for proteomics which studies structures and functions of proteins in large...
| 75. |
Aleksandar Lazarevic (United Technologies Research Center, USA)
In recent years, research in many security areas has gained a lot of interest among scientists in academia, industry, military and governmental organizations. Resear...
| 76. |
Gary Weiss (Fordham University, USA)
The telecommunications industry was one of the first to adopt data mining technology. This is most likely because telecommunication companies routinely generate and...
| 77. |
Les Pang (National Defense University, USA)
Data mining has been a successful approach for improving the level of business intelligence and knowledge management throughout an organization. This article identif...
| 78. |
Seung Ki Moon (The Pennsylvania State University, USA)
Many companies strive to maximize resource utilization by sharing and reusing distributed design knowledge and information when developing new products. By sharing a...
| 79. |
Qin Ding (East Carolina University, USA)
With the growing usage of XML data for data storage and exchange, there is an imminent need to develop efficient algorithms to perform data mining on semistructured...
| 80. |
Christophe Giraud-Carrier (Brigham Young University, USA)
It is sometimes argued that all one needs to engage in Data Mining (DM) is data and a willingness to “give it a try.” Although this view is attractive from the persp...
| 81. |
Amin A. Abdulghani (Data Mining Engineer, USA)
A lot of interest has been expressed in database mining using association rules (Agrawal, Imielinski, & Swami, 1993). In this chapter, we provide a different view of...
| 82. |
Hai Wang (Saint Mary’s University, Canada), Shouhong Wang (University of Massachusetts Dartmouth, USA)
Survey is one of the common data acquisition methods for data mining (Brin, Rastogi & Shim, 2003). In data mining one can rarely find a survey data set that contains...
| 83. |
Mohammed Alshalalfa (University of Calgary, Canada)
Data mining can be described as data processing using sophisticated data search capabilities and statistical algorithms to discover patterns and correlations in larg...
| 84. |
Magdi Kamel (Naval Postgraduate School, USA)
Practical experience of data mining has revealed that preparing data is the most time-consuming phase of any data mining project. Estimates of the amount of time and...
| 85. |
Vikram Sorathia (Dhirubhai Ambani Institute of Information and Communication Technology (DA-IICT), India)
In recent years, our sensing capability has increased manifold. The developments in sensor technology, telecommunication, computer networking and distributed computi...
| 86. |
William E. Winkler (U.S. Bureau of the Census, USA)
Fayyad and Uthurusamy (2002) have stated that the majority of the work (representing months or years) in creating a data warehouse is in cleaning up duplicates and re...
| 87. |
Richard Jensen (Aberystwyth University, UK)
Data reduction is an important step in knowledge discovery from data. The high dimensionality of databases can be reduced using suitable techniques, depending on the...
| 88. |
João Gama (University of Porto, Portugal), Pedro Pereira Rodrigues (University of Porto, Portugal)
Nowadays, data bases are required to store massive amounts of data that are continuously inserted, and queried. Organizations use decision support systems to identif...
| 89. |
Amitava Mitra (Auburn University, USA)
As the abundance of collected data on products, processes and service-related operations continues to grow with technology that facilitates the ease of data collecti...
| 90. |
Alkis Simitsis (National Technical University of Athens, Greece), Dimitri Theodoratos (New Jersey Institute of Technology, USA)
The back-end tools of a data warehouse are pieces of software responsible for the extraction of data from several sources, their cleansing, customization, and insert...
| 91. |
Beixin ("Betsy") Lin (Montclair State University, USA), Yu Hong (Colgate-Palmolive Company, USA), Zu-Hsu Lee (Montclair State University, USA)
A data warehouse is a large electronic repository of information that is generated and updated in a structured manner by an enterprise over time to aid business inte...
| 92. |
Richard Mathieu (Saint Louis University, USA)
Every finished product has gone through a series of transformations. The process begins when manufacturers purchase the raw materials that will be transformed into t...
| 93. |
Yuefeng Li (Queensland University of Technology, Australia)
With the phenomenal growth of electronic data and information, there are many demands for developments of efficient and effective systems (tools) to address the issu...
| 94. |
Lutz Hamel (University of Rhode Island, USA)
Modern, commercially available relational database systems now routinely include a cadre of data retrieval and analysis tools. Here we shed some light on the interre...
| 95. |
Patricia E.N. Lutu (University of Pretoria, South Africa)
In data mining, sampling may be used as a technique for reducing the amount of data presented to a data mining algorithm. Other strategies for data reduction include...
| 96. |
Edgar R. Weippl (Secure Business Austria, Austria)
In this article we will present an introduction to issues relevant to database security and statistical database security. We will briefly cover various security mod...
| 97. |
Martin Žnidaršič (Jožef Stefan Institute, Slovenia), Marko Bohanec (Jožef Stefan Institute, Slovenia), Blaž Zupan (University of Ljubljana, Slovenia, and Baylor College of Medicine, USA)
Computer models are representations of a problem environment that facilitate analysis with high computing power and representation capabilities. They can be either inf...
| 98. |
Roberta Siciliano (University of Naples, Federico II, Italy), Claudio Conversano (University of Cagliari, Italy)
Decision Tree Induction (DTI) is a tool to induce a classification or regression model from (usually large) datasets characterized by n objects (records), each one c...
| 99. |
Monica Maceli (Drexel University, USA), Min Song (New Jersey Institute of Technology & Temple University, USA)
With the increase in Web-based databases and dynamically generated Web pages, the concept of the “deep Web” has arisen. The deep Web refers to Web content that, whi...
| 100. |
Matteo Golfarelli (University of Bologna, Italy)
Conceptual modeling is widely recognized to be the necessary foundation for building a database that is well-documented and fully satisfies the user requirements. In...
| 101. |
Hanghang Tong (Carnegie Mellon University, USA), Yehuda Koren (AT&T Labs - Research, USA), Christos Faloutsos (Carnegie Mellon University, USA)
In many graph mining settings, measuring node proximity is a fundamental problem. While most existing measurements are (implicitly or explicitly) designed for und...
| 102. |
Takao Ito (Ube National College of Technology, Japan)
One of the most important issues in data mining is to discover an implicit relationship between words in a large corpus and labels in a large database. The relations...
| 103. |
Richi Nayak (Queensland University of Technology, Australia)
XML is the new standard for information exchange and retrieval. An XML document has a schema that defines the data definition and structure of the XML document (Abit...
| 104. |
Jan H Kroeze (University of Pretoria, South Africa)
A very large percentage of business and academic data is stored in textual format. With the exception of metadata, such as author, date, title and publisher, this da...
| 105. |
William W. Agresti (Johns Hopkins University, USA)
It is routine to hear and read about the information explosion, how we are all overwhelmed with data and information. Is it progress when our search tools report tha...
| 106. |
Haiquan Li (The Samuel Roberts Noble Foundation, Inc., USA), Jinyan Li (Nanyang Technological University, Singapore), Xuechun Zhao (The Samuel Roberts Noble Foundation, Inc., USA)
Physical interactions between proteins are important for many cellular functions. Since protein-protein interactions are mediated via their interaction sites, identi...
| 107. |
Vladimír Bartík (Brno University of Technology, Czech Republic)
Association rules are one of the most frequently used types of knowledge discovered from databases. The problem of discovering association rules was first introduced...
| 108. |
Mafruz Zaman Ashrafi (Monash University, Australia)
Data mining is an iterative and interactive process that explores and analyzes voluminous digital data to discover valid, novel, and meaningful patterns (Mohammed, 1...
| 109. |
Yu Chen (State University of New York – Binghamton, USA), Wei-Shinn Ku (Auburn University, USA)
Information technology has revolutionized almost every facet of our lives. Government, commercial, and educational organizations depend on computers and Internet...
| 110. |
Grigorios Tsoumakas (Aristotle University of Thessaloniki, Greece)
The continuous developments in information and communication technology have recently led to the appearance of distributed computing environments, which comprise sev...
| 111. |
José Ignacio Serrano (Instituto de Automática Industrial (CSIC), Spain)
Owing to the growing amount of digital information stored in natural language, systems that automatically process text are of crucial importance and extremely useful...
| 112. |
Richard Weber (University of Chile, Chile)
Since the First KDD Workshop back in 1989 when “Knowledge Mining” was recognized as one of the top 5 topics in future database research (Piatetsky-Shapiro 1991), man...
| 113. |
Chang-Chia Liu (University of Florida, USA), W. Art Chaovalitwongse (Rutgers University, USA), Panos M. Pardalos (University of Florida, USA), Basim M. Uthman (NF/SG VHS & University of Florida, USA)
Neurologists typically study brain activity through acquired biomarker signals such as Electroencephalograms (EEGs), which have been widely used to capture the in...
| 114. |
Diego Reforgiato Recupero (University of Catania, Italy)
Application domains such as bioinformatics and web technology represent complex objects as graphs where nodes represent basic objects (i.e. atoms, web pages etc.) an...
| 115. |
Xunkai Wei (University of Air Force Engineering, China)
As is well known, the cognition process is the instinctive learning ability of human beings. This process is perhaps one of the most complex human behaviors. It is a h...
| 116. |
Daniel Crabtree (Victoria University of Wellington, New Zealand)
Web search engines help users find relevant web pages by returning a result set containing the pages that best match the user’s query. When the identified pages have...
| 117. |
Ji-Rong Wen (Microsoft Research Asia, China)
A Web query log is a type of file that keeps track of the activities of users who are utilizing a search engine. Compared to traditional information retrieval setting...
| 118. |
Ji-Rong Wen (Microsoft Research Asia, China)
The Web is an open and free environment for people to publish and get information. Everyone on the Web can be either an author, a reader, or both. The language of th...
| 119. |
Nikunj C. Oza (NASA Ames Research Center, USA)
Ensemble Data Mining Methods, also known as Committee Methods or Model Combiners, are machine learning methods that leverage the power of multiple models to achieve...
| 120. |
Niall Rooney (University of Ulster, UK)
The concept of ensemble learning has its origins in research from the late 1980s/early 1990s into combining a number of artificial neural network (ANN) models for...
| 121. |
Jack Cook (Rochester Institute of Technology, USA)
Decision makers thirst for answers to questions. As more data is gathered, more questions are posed: Which customers are most likely to respond positively to a marke...
| 122. |
Paolo Giudici (University of Pavia, Italy)
Several classes of computational and statistical methods for data mining are available. Each class can be parameterised so that models within the class differ in ter...
| 123. |
Ivan Bruha (McMaster University, Canada)
A ‘traditional’ learning algorithm that can induce a set of decision rules usually represents a robust and comprehensive system that discovers knowledge from usual...
| 124. |
Caitlin Kelly Maurie (The Pennsylvania State University, USA)
Geospatial data and the technologies that drive them have altered the landscape of our understanding of the world around us. The data, software and services related...
| 125. |
Amit Saxena (Guru Ghasidas University, Bilaspur, India), Megha Kothari (St. Peter’s University, Chennai, India), Navneet Pandey (Indian Institute of Technology, Delhi, India)
An excess of data from voluminous storage and online devices has become a bottleneck to finding meaningful information therein, and we are information-wise rich...
| 126. |
William H. Hsu (Kansas State University, USA)
A genetic algorithm (GA) is a method used to find approximate solutions to difficult search, optimization, and machine learning problems (Goldberg, 1989) by applying...
| 127. |
Laetitia Jourdan (University of Lille, France)
Knowledge discovery from genomic data has become an important research area for biologists. Nowadays, a lot of data is available on the web, but the corresponding kn...
| 128. |
Daniel Rivero (University of A Coruña, Spain)
Artificial Neural Networks (ANNs) are learning systems from the Artificial Intelligence (AI) world that have been used for solving complex problems related to differ...
| 129. |
Jorge Muruzábal (University Rey Juan Carlos, Spain)
Ensemble rule based classification methods have been popular for a while in the machine-learning literature (Hand, 1997). Given the advent of low-cost, high-computin...
| 130. |
Yiyu Yao (University of Regina, Canada)
The objective of data mining is to discover new and useful knowledge, in order to gain a better understanding of nature. This in fact is the goal of scientists when...
| 131. |
Elzbieta Malinowski (Universidad de Costa Rica, Costa Rica), Esteban Zimányi (Université Libre de Bruxelles, Belgium)
Data warehouses keep large amounts of historical data in order to help users at different management levels to make more effective decisions. Conventional data wareh...
| 132. |
Rory A. Lewis (UNC-Charlotte, USA), Zbigniew W. Ras (University of North Carolina, Charlotte, USA)
Over the past decade Facial Recognition has become more cohesive and reliable than ever before. We begin with an analysis explaining why certain facial recognition m...
| 133. |
Seoung Bum Kim (The University of Texas at Arlington, USA)
Development of advanced sensing technology has multiplied the volume of spectral data, which is one of the most common types of data encountered in many research fie...
| 134. |
Shouxian Cheng (Planet Associates, Inc., USA), Frank Y. Shih (New Jersey Institute of Technology, USA)
The Support Vector Machine (SVM) (Cortes and Vapnik, 1995; Vapnik, 1995; Burges, 1998) is intended to generate an optimal separating hyperplane by minimizing the gen...
| 135. |
Damien François (Université catholique de Louvain, Belgium)
In many applications, like function approximation, pattern recognition, time series prediction, and data mining, one has to build a model relating some features desc...
| 136. |
Indranil Bose (The University of Hong Kong, Hong Kong)
Movement of stocks in the financial market is a typical example of financial time series data. It is generally believed that past performance of a stock can indicate...
| 137. |
Hong Shen (Japan Advanced Institute of Science and Technology, Japan)
The discovery of association rules showing conditions of data co-occurrence has attracted the most attention in data mining. An example of an association rule is the...
| 138. |
Jamil M. Saquer (Southwest Missouri State University, USA)
Formal concept analysis (FCA) is a branch of applied mathematics with roots in lattice theory (Wille, 1982; Ganter & Wille, 1999). It deals with the notion of a conc...
| 139. |
Xuan Hong Dang (Nanyang Technological University, Singapore), Wee-Keong Ng (Nanyang Technological University, Singapore), Kok-Leong Ong (Deakin University, Australia), Vincent Lee (Monash University, Australia)
In recent years, data streams have emerged as a new data type that has attracted much attention from the data mining community. They arise naturally in a number of a...
| 140. |
Eyke Hüllermeier (Philipps-Universität Marburg, Germany)
Tools and techniques that have been developed during the last 40 years in the field of fuzzy set theory (FST) have been applied quite successfully in a variety of ap...
| 141. |
Michel Schneider (Blaise Pascal University, France)
Basically, the schema of a data warehouse rests on two kinds of elements: facts and dimensions. Facts are used to memorize measures about situations or events. Dimens...
| 142. |
Ladjel Bellatreche (Poitiers University, France)
Decision support applications require complex queries, e.g., multi-way joins defined on huge warehouses usually modelled using star schemas, i.e., a fact table and...
| 143. |
William H. Hsu (Kansas State University, USA)
Genetic programming (GP) is a sub-area of evolutionary computation first explored by John Koza (1992) and independently developed by Nichael Lynn Cramer (1985). It i...
| 144. |
Alex A. Freitas (University of Kent, UK), Gisele L. Pappa (Federal University of Minas Gerais, Brazil)
At present there is a wide range of data mining algorithms available to researchers and practitioners (Witten & Frank, 2005; Tan et al., 2006). Despite the great div...
| 145. |
Marek Kretowski (Bialystok Technical University, Poland), Marek Grzes (Bialystok Technical University, Poland)
Decision trees are, besides decision rules, one of the most popular forms of knowledge representation in the Knowledge Discovery in Databases process (Fayyad, Piatetsky-...
| 146. |
Lawrence B. Holder (University of Texas at Arlington, USA)
Graph-based data mining represents a collection of techniques for mining the relational aspects of data represented as a graph. Two major approaches to graph-based da...
| 147. |
Carol J. Romanowski (Rochester Institute of Technology, USA)
Data mining has grown to include many more data types than the “traditional” flat files with numeric or categorical attributes. Images, text, video, and the internet...
| 148. |
Liang Xiong (Tsinghua University, China)
When we are faced with data, one common task is to learn the correspondence relationship between different data sets. More concretely, by learning data correspondenc...
| 149. |
Abdullah N. Arslan (University of Vermont, USA)
Sequence alignment is one of the most fundamental problems in computational biology. Ordinarily, the problem aims to align symbols of given sequences in a way to opt...
| 150. |
Benjamin C.M. Fung (Concordia University, Canada), Ke Wang (Simon Fraser University, Canada), Martin Ester (Simon Fraser University, Canada)
Document clustering is an automatic grouping of text documents into clusters so that documents within a cluster have high similarity in comparison to one another, bu...
| 151. |
Francesco Buccafurri (DIMET, Università di Reggio Calabria, Italy)
Histograms are an important tool for data reduction both in the field of data-stream querying and in OLAP, since they allow us to represent large amounts of data in a...
| 152. |
Bhavani Thuraisingham (The MITRE Corporation, USA)
Data mining is the process of posing queries to large quantities of data and extracting information often previously unknown using mathematical, statistical, and mac...
| 153. |
Janet Delve (University of Portsmouth, UK)
Data Warehousing is now a well-established part of the business and scientific worlds. However, up until recently, data warehouses were restricted to modeling essent...
| 154. |
Sancho Salcedo-Sanz (Universidad de Alcalá, Spain), Gustavo Camps-Valls (Universitat de València, Spain), Carlos Bousoño-Calzón (Universidad Carlos III de Madrid, Spain)
Genetic algorithms (GAs) are a class of problem solving techniques which have been successfully applied to a wide variety of hard problems (Goldberg, 1989). In spite...
| 155. |
Marvin L. Brown (Grambling State University, USA), John F. Kros (East Carolina University, USA)
Missing or inconsistent data has been a pervasive problem in data analysis since the origin of data collection. The management of missing data in organizations has r...
| 156. |
Abdelhamid Bouchachia (University of Klagenfurt, Austria)
Data mining and knowledge discovery is about creating a comprehensible model of the data. Such a model may take different forms going from simple association rules t...
| 157. |
Seokkyung Chung (University of Southern California, USA)
With the rapid growth of the World Wide Web, Internet users are now experiencing overwhelming quantities of online information. Since manually analyzing the data bec...
| 158. |
Honghua Dai (Deakin University, Australia)
Inexact field learning (IFL) (Ciesielski & Dai, 1994; Dai & Ciesielski, 1994a, 1994b, 1995, 2004; Dai & Li, 2001) is a rough-set-theory-based (Pawlak, 1982) machin...
| 159. |
Gary G. Yen (Oklahoma State University, USA)
Scientific literatures can be organized to serve as a roadmap for researchers by pointing to where the scientific community has been and where it is heading. They pr...
| 160. |
Benjamin Griffiths (Cardiff University, UK)
Rough Set Theory (RST), since its introduction in Pawlak (1982), continues to develop as an effective tool in data mining. Within a set theoretical structure, its re...
| 161. |
Huan Liu (Arizona State University, USA)
The amount of data has grown increasingly large in recent years as the capacity of digital data storage worldwide has significantly increased. As the size of data grow...
| 162. |
Stephan Meisel (University of Braunschweig, Germany)
Basically, Data Mining (DM) and Operations Research (OR) are two paradigms independent of each other. OR aims at optimal solutions of decision problems with respect...
| 163. |
Andreas Koeller (Montclair State University, USA)
Integration of data sources refers to the task of developing a common schema as well as data transformation solutions for a number of data sources with related conte...
| 164. |
Sai Moturu (Arizona State University, USA)
As John Muir noted, “When we try to pick out anything by itself, we find it hitched to everything else in the Universe” (Muir, 1911). In tune with Muir’s elegantly s...
| 165. |
P. Punitha (University of Glasgow, UK), D.S. Guru (University of Mysore, India)
‘A visual idea is more powerful than verbal idea’, ‘A picture is worth more than ten thousand words’, ‘No words can convey what a picture speaks’, ‘A picture has to...
| 166. |
Zbigniew W. Ras (University of North Carolina, Charlotte, USA), Agnieszka Dardzinska (Bialystok Technical University, Poland)
One way to make a Query Answering System (QAS) intelligent is to assume a hierarchical structure of its attributes. Such systems have been investigated by (Cuppens & D...
| 167. |
Zheng Zhao (Arizona State University, USA)
The high dimensionality of data poses a challenge to learning tasks such as classification. In the presence of many irrelevant features, classification algorithms te...
| 168. |
Yan Zhao (University of Regina, Canada)
Exploring and extracting knowledge from data is one of the fundamental problems in science. Data mining consists of important tasks, such as description, prediction...
| 169. |
Qi Li (Western Kentucky University, USA), Jieping Ye (Arizona State University, USA), Chandra Kambhamettu (University of Delaware, USA)
Visual media data such as an image is the raw data representation for many important applications, such as image retrieval (Mikolajczyk & Schmid 2001), video classif...
| 170. |
Gustavo Camps-Valls (Universitat de València, Spain), Manel Martínez-Ramón (Universidad Carlos III de Madrid, Spain), José Luis Rojo-Álvarez (Universidad Rey Juan Carlos, Spain)
Machine learning experienced a great advance in the eighties and nineties due to the active research in artificial neural networks and adaptive systems. These to...
| 171. |
Malcolm J. Beynon (Cardiff University, UK)
The essence of data mining is to search for pertinent information that may exist in data (often large data sets). The immeasurably large amount of data present...
| 172. |
Doina Caragea (Kansas State University, USA), Vasant Honavar (Iowa State University, USA)
Recent advances in sensors, digital storage, computing and communications technologies have led to a proliferation of autonomously operated, geographically distribut...
| 173. |
QingXiang Wu (University of Ulster at Magee, UK), Martin McGinnity (University of Ulster at Magee, UK), Girijesh Prasad (University of Ulster at Magee, UK), David Bell (Queen’s University, UK)
Data mining and knowledge discovery aim at finding useful information from typically massive collections of data, and then extracting useful knowledge from the infor...
| 174. |
Marco F. Ramoni (Harvard Medical School, USA), Paola Sebastiani (Boston University School of Public Health, USA)
Born at the intersection of artificial intelligence, statistics, and probability, Bayesian networks (Pearl, 1988) are a representation formalism at the cutting edge...
| 175. |
Rallou Thomopoulos (INRA/LIRMM, France)
This chapter deals with the problem of the cooperation of heterogeneous knowledge for the construction of a domain expertise, and more specifically for the discovery...
| 176. |
João Gama (University of Porto, Portugal), Pedro Pereira Rodrigues (University of Porto, Portugal)
In the last two decades, machine learning research and practice have focused on batch learning usually with small datasets. In batch learning, the whole training data...
| 177. |
Bojun Yan (George Mason University, USA)
As a recent emerging technique, semi-supervised clustering has attracted significant research interest. Compared to traditional clustering algorithms, which only use...
| 178. |
Feng Pan (University of Southern California, USA)
As an essential dimension of our information space, time plays a very important role in every aspect of our lives. Temporal information is necessarily required in ma...
| 179. |
Abdelhamid Bouchachia (University of Klagenfurt, Austria)
Recently the field of machine learning, pattern recognition, and data mining has witnessed a new research stream that is learning with partial supervision -LPS- (kno...
| 180. |
Kirsten Wahlstrom (University of South Australia, Australia), John F. Roddick (Flinders University, Australia), Rick Sarre (University of South Australia, Australia), Vladimir Estivill-Castro (Griffith University, Australia), Denise de Vries (Flinders University, Australia)
To paraphrase Winograd (1992), we bring to our communities a tacit comprehension of right and wrong that makes social responsibility an intrinsic part of our culture...
| 181. |
Yinghui Yang (University of California, Davis, USA), Balaji Padmanabhan (University of South Florida, USA)
Classification is a form of data analysis that can be used to extract models to predict categorical class labels (Han & Kamber, 2001). Data classification has proven...
| 182. |
Carlotta Domeniconi (George Mason University, USA), Dimitrios Gunopulos (University of California, USA)
Pattern classification is a very general concept with numerous applications ranging from science, engineering, target marketing, medical diagnosis and electronic com...
| 183. |
Xiang Zhang (University of Louisville, USA), Seza Orcun (Purdue University, USA), Mourad Ouzzani (Purdue University, USA), Cheolhwan Oh (Purdue University, USA)
Systems biology aims to understand biological systems on a comprehensive scale, such that the components that make up the whole are connected to one another and work...
| 184. |
Dimitri Theodoratos (New Jersey Institute of Technology, USA), Wugang Xu (New Jersey Institute of Technology, USA), Alkis Simitsis (National Technical University of Athens, Greece)
A Data Warehouse (DW) is a repository of information retrieved from multiple, possibly heterogeneous, autonomous, distributed databases and other information sources...
| 185. |
Jun Zhang (University of Kentucky, USA), Jie Wang (University of Kentucky, USA), Shuting Xu (Virginia State University, USA)
Data mining technologies have now been used in commercial, industrial, and governmental businesses, for various purposes, ranging from increasing profitability to en...
| 186. |
Raymond K. Pon (University of California - Los Angeles, USA), Alfonso F. Cardenas (University of California - Los Angeles, USA), David J. Buttler (Lawrence Livermore National Laboratory, USA)
An explosive growth of online news has taken place. Users are inundated with thousands of news articles, only some of which are interesting. A system to filter out u...
| 187. |
Miguel García Torres (Universidad de La Laguna, Spain)
Metaheuristics are general strategies for designing heuristic procedures with high performance. The term metaheuristic, which appeared in 1986 for the first time...
| 188. |
Christophe Giraud-Carrier (Brigham Young University, USA), Pavel Brazdil (University of Porto, Portugal), Carlos Soares (University of Porto, Portugal), Ricardo Vilalta (University of Houston, USA)
The application of Machine Learning (ML) and Data Mining (DM) tools to classification and regression tasks has become a standard, not only in research but also in ad...
| 189. |
Xinghua Fan (Chongqing University of Posts and Telecommunications, China)
Entity and relation recognition, i.e. assigning semantic classes (e.g., person, organization and location) to entities in a given sentence and determining the relati...
| 190. |
Li-Min Fu (Southern California University of Health Sciences, USA)
Based on the concept of simultaneously studying the expression of a large number of genes, a DNA microarray is a chip on which numerous probes are placed for hybridi...
| 191. |
Diego Liberati (Italian National Research Council, Italy)
In everyday life, it often turns out that one has to face a huge amount of data, often not completely homogeneous, often without an immediate grasp of an underlying...
| 192. |
Li Shen (University of Massachusetts Dartmouth, USA), Fillia Makedon (University of Texas at Arlington, USA)
Recent technological advances in 3D digitizing, noninvasive scanning, and interactive authoring have resulted in an explosive growth of 3D models in the digital worl...
| 193. |
Stanley Loh (Catholic University of Pelotas, Brazil), Daniel Lichtnow (Catholic University of Pelotas, Brazil), Thyago Borges, Tiago Primo (Lutheran University of Brazil, Brazil)
According to Nonaka & Takeuchi (1995), the majority of the organizational knowledge comes from interactions between people. People tend to reuse solutions from other...
| 194. |
Tamraparni Dasu (AT&T Labs, USA), Gary Weiss (Fordham University, USA)
When a space shuttle takes off, tiny sensors measure thousands of data points every fraction of a second, pertaining to a variety of attributes like temperature, acc...
| 195. |
Gabriele Kern-Isberner (University of Dortmund, Germany)
Knowledge discovery refers to the process of extracting new, interesting, and useful knowledge from data and presenting it in an intelligible way to the user. Roughl...
| 196. |
Steffen Bickel (Humboldt-Universität zu Berlin, Germany)
E-mail has become one of the most important communication media for business and private purposes. Large amounts of past e-mail records reside on corporate servers a...
| 197. |
Wen-Yang Lin (National University of Kaohsiung, Taiwan), Ming-Cheng Tseng (Institute of Information Engineering, Taiwan)
The mining of Generalized Association Rules (GARs) from a large transactional database in the presence of item taxonomy has been recognized as an important model for...
| 198. |
Doru Tanasa (INRIA Sophia Antipolis, France)
Web Usage Mining (WUM) includes all the Data Mining techniques used to analyze the behavior of a Web site’s users (Cooley, Mobasher & Srivastava, 1999, Spiliopoulou,...
| 199. |
Shane M. Butler (Monash University, Australia)
Finding differences among two or more groups is an important data-mining task. For example, a retailer might want to know what the difference is in customer purchasin...
| 200. |
Junsong Yuan (Northwestern University, USA)
One of the focused themes in data mining research is to discover frequent and repetitive patterns from the data. The success of frequent pattern mining (Han, Cheng,...
| 201. |
Bruno Agard (École Polytechnique de Montréal, Canada)
In large urban areas, smooth running public transit networks are key to viable development. Currently, economic and environmental issues are fueling the need for the...
| 202. |
David Lo (National University of Singapore, Singapore)
Software is a ubiquitous component in our daily life. It ranges from large software systems like operating systems to small embedded systems like vending machines, b...
| 203. |
Ramon F. Brena (Tecnológico de Monterrey, Mexico), Ana Maguitman (Tecnológico de Monterrey, Mexico)
The Internet has made available a large number of information services, such as file sharing, electronic mail, online chat, telephony and file transfer. However, servi...
| 204. |
Lutz Hamel (University of Rhode Island, USA)
Classification models and in particular binary classification models are ubiquitous in many branches of science and business. Consider, for example, classification m...
| 205. |
Claudia Perlich (IBM T.J. Watson Research, USA), Saharon Rosset (IBM T.J. Watson Research, USA), Bianca Zadrozny (Universidade Federal Fluminense, Brazil)
One standard Data Mining setting is defined by a set of n observations on a variable of interest Y and a set of p explanatory variables, or features, x = (x1,...,xp)...
| 206. |
Anca Doloc-Mihu (University of Louisiana at Lafayette, USA)
The goal of a web-based retrieval system is to find data items that meet a user’s request as fast and accurately as possible. Such a search engine finds items releva...
| 207. |
Vasudha Bhatnagar (University of Delhi, India), S. K. Gupta (IIT, Delhi, India)
Knowledge Discovery in Databases (KDD) is classically defined as the “nontrivial process of identifying valid, novel, potentially useful, and ultimately understandab...
| 208. |
Pasquale De Meo (Università degli Studi Mediterranea di Reggio Calabria, Italy), Giovanni Quattrone (Università degli Studi Mediterranea di Reggio Calabria, Italy), Giorgio Terracina (Università degli Studi Della Calabria, Italy), Domenico Ursino (Università degli Studi Medit)
An Electronic-Service (E-Service) can be defined as a collection of network-resident software programs that collaborate for supporting users in both accessing and se...
| 209. |
Chia Huey Ooi (Duke-NUS Graduate Medical School Singapore, Singapore)
Molecular classification involves the classification of samples into groups of biological phenotypes. Studies on molecular classification generally focus on cancer f...
| 210. |
Omar Boussaid (University of Lyon, France), Doulkifli Boukraa (University of Jijel, Algeria)
While classical databases aim at managing data within enterprises, data warehouses help them analyze data in order to drive their activities (Inmon, 2005)....
| 211. |
Fadime Üney Yüksektepe (Koç University, Turkey)
Data classification is a supervised learning strategy that analyzes the organization and categorization of data in distinct classes. Generally, a training set, in wh...
| 212. |
Amelia Zafra (University of Cordoba, Spain)
The multiple-instance problem is a difficult machine learning problem that appears in cases where knowledge about training examples is incomplete. In this problem, t...
| 213. |
Peter A. Chew (Sandia National Laboratories, USA)
The principles of text mining are fundamental to technology in everyday use. The world wide web (WWW) has in many senses driven research in text mining, and with the...
| 214. |
Gang Kou (University of Electronic Science and Technology of China, China), Yi Peng (University of Electronic Science and Technology of China, China), Yong Shi (CAS Research Center on Fictitious Economy and Data Sciences, China & U)
Multiple criteria optimization seeks to simultaneously optimize two or more objective functions under a set of constraints. It has a great variety of applications, r...
| 215. |
Sach Mukherjee (University of Oxford, UK)
A number of important problems in data mining can be usefully addressed within the framework of statistical hypothesis testing. However, while the conventional treat...
| 216. |
Alicja A. Wieczorkowska (Polish-Japanese Institute of Information Technology, Poland)
Music information retrieval (MIR) is a multi-disciplinary research on retrieving information from music, see Fig. 1. This research involves scientists from tradition...
| 217. |
Ingrid Fischer (University of Konstanz, Germany)
The introduction of the artificial neuron by McCulloch and Pitts is considered the beginning of the field of artificial neural networks. They were inspired by the...
| 218. |
Victor S.Y. Lo (Fidelity Investments, USA)
Data mining has been widely applied in many areas over the past two decades. In marketing, many firms collect large amounts of customer data to understand their needs...
| 219. |
Dilip Kumar Pratihar (Indian Institute of Technology, Kharagpur, India)
Most of the complex real-world systems involve more than three dimensions and it may be difficult to model these higher dimensional data related to their input-output...
| 220. |
Ioannis N. Kouris (University of Patras, Greece)
Research in association rule mining initially concentrated on solving the obvious problem of finding positive association rules; that is, rules among items that...
| 221. |
Indrani Chakravarty (Indian Institute of Technology, India)
The most commonly used protection mechanisms today are based on either what a person possesses (e.g. an ID card) or what the person remembers (like passwords and PIN...
| 222. |
Alfredo Cuzzocrea (University of Calabria, Italy), Svetlana Mansmann (University of Konstanz, Germany)
The problem of efficiently visualizing multidimensional data sets produced by scientific and statistical tasks/ processes is becoming increasingly challenging, and i...
| 223. |
Rebecca Boon-Noi Tan (Monash University, Australia)
Since its origin in the 1970s, research and development into database systems has evolved from simple file storage and processing systems to complex relational data...
| 224. |
Indrani Chakravarty (Indian Institute of Technology, India)
Security is one of the major issues in today’s world and most of us have to deal with some sort of passwords in our daily lives; but, these passwords have some probl...
| 225. |
James Geller (New Jersey Institute of Technology, USA)
The term “Ontology” was popularized in Computer Science by Thomas Gruber at the Stanford Knowledge Systems Lab (KSL). Gruber’s highly influential papers defined an o...
| 226. |
Ioannis N. Kouris (University of Patras, Greece)
Data mining has emerged over the last decade as probably the most important application in databases. To reproduce one of the most popular but accurate definitions f...
| 227. |
Sharanjit Kaur (University of Delhi, India)
Knowledge discovery in databases (KDD) is a nontrivial process of detecting valid, novel, potentially useful and ultimately understandable patterns in data (Fayyad,...
| 228. |
Fabrizio Angiulli (University of Calabria, Italy)
Data mining techniques can be grouped in four main categories: clustering, classification, dependency detection, and outlier detection. Clustering is the process of...
| 229. |
Jorge Cardoso (SAP AG, Germany), W.M.P. van der Aalst (Eindhoven University of Technology, The Netherlands)
Business process management systems (Smith and Fingar 2003) provide a fundamental infrastructure to define and manage business processes and workflows. These systems...
| 230. |
Andrew K.C. Wong (University of Waterloo, Canada), Yang Wang (Pattern Discovery Technology, Canada), Gary C.L. Li (University of Waterloo, Canada)
A basic task of machine learning and data mining is to automatically uncover patterns that reflect regularities in a data set. When dealing with a large database, es...
| 231. |
Hui Xiong (Rutgers University, USA), Michael Steinbach (University of Minnesota, USA), Pang-Ning Tan (Michigan State University, USA), Vipin Kumar (University of Minnesota, USA), Wenjun Zhou (Rutgers University, USA)
Clustering and association analysis are important techniques for analyzing data. Cluster analysis (Jain & Dubes, 1988) provides insight into the data by dividing obj...
| 232. |
P. Viswanath (Indian Institute of Technology-Guwahati, India), Narasimha M. Murty (Indian Institute of Science, India), Bhatnagar Shalabh (Indian Institute of Science, India)
Parametric methods first choose the form of the model or hypotheses and estimate the necessary parameters from the given dataset. The form, which is chosen, based o...
| 233. |
C. Radha (Indian Institute of Science, India)
An important problem in pattern recognition is that of pattern classification. The objective of classification is to determine a discriminant function which is consi...
| 234. |
Clifton Phua (Monash University, Australia), Vincent Lee (Monash University, Australia), Kate Smith-Miles (Deakin University, Australia)
Almost every person has a life-long personal name which is officially recognised and has only one correct version in their language. Each personal name typically has...
| 235. |
Konstantinos Kotis (University of the Aegean, Greece)
Current keyword-based Web search engines (e.g. Google) provide access to thousands of people for billions of indexed Web pages. Although the amount of irrelevant re...
| 236. |
Nilmini Wickramasinghe (Stuart School of Business, Illinois Institute of Technology, USA), Rajeev K. Bali (Coventry University, UK)
Today’s economy is increasingly based on knowledge and information (Davenport, & Grover 2001). Knowledge is now recognized as the driver of productivity and economic...
| 237. |
Ladjel Bellatreche (Poitiers University, France), Mukesh Mohania (IBM India Research Lab, India)
Recently, organizations have increasingly emphasized applications in which current and historical data are analyzed and explored comprehensively, identifying useful...
| 238. |
Xiao-Li Li (Institute for Infocomm Research, A* STAR, Singapore)
In traditional supervised learning, a large number of labeled positive and negative examples are typically required to learn an accurate classifier. However, in prac...
| 239. |
D. R. Mani (Massachusetts Institute of Technology and Harvard University, USA), Andrew L. Betz (Progressive Insurance, USA), James H. Drew (Verizon Laboratories, USA)
A structural conflict exists in businesses which sell services whose production costs are discontinuous and whose consumption is continuous but variable. A classic e...
| 240. |
Seung-won Hwang (Pohang University of Science and Technology (POSTECH), Korea)
As near-infinite amounts of data become accessible on the Web, it becomes more important to support intelligent personalized retrieval mechanisms, to help users...
| 241. |
Alfredo Cuzzocrea (University of Calabria, Italy), Vincenzo Russo (University of Calabria, Italy)
The problem of ensuring the privacy and security of OLAP data cubes (Gray et al., 1997) arises in several fields ranging from advanced Data Warehousing (DW) and Busi...
| 242. |
Stanley R.M. Oliveira (Embrapa Informática Agropecuária, Brazil)
Despite its benefits in various areas (e.g., business, medical analysis, scientific data analysis, etc), the use of data mining techniques can also result in new thr...
| 243. |
Laura Maruster (University of Groningen, The Netherlands)
As the on-line services and Web-based information systems proliferate in many domains of activities, it has become increasingly important to model user behaviour and...
| 244. |
Senqiang Zhou (Simon Fraser University, Canada)
A major obstacle in data mining applications is the gap between the statistic-based pattern extraction and the value-based decision-making. “Profit mining” aims to r...
| 245. |
Ioannis N. Kouris (University of Patras, Dept. of Computer Engineering and Informatics Greece)
Software development has various stages that can be conceptually grouped into two phases, namely development and production (Figure 1). The development phase include...
| 246. |
Minh Ngoc Ngo (Nanyang Technological University, Singapore)
Due to the need to reengineer and migrate aging software and legacy systems, reverse engineering has started to receive some attention. It has now been established...
| 247. |
Ping Deng (University of Illinois at Springfield, USA), Qingkai Ma (Utica College, USA), Weili Wu (The University of Texas at Dallas, USA)
Clustering can be considered as the most important unsupervised learning problem. It has been discussed thoroughly by both statistics and database communities due to...
| 248. |
Imad Khoury (School of Computer Science, McGill University, Canada), Godfried Toussaint (School of Computer Science, McGill University, Canada), Antonio Ciampi (Epidemiology & Biostatistics, McGill University, Canada), Isadora Antoniano (IIMAS-UNAM, Ciudad de Mexico, Mex)
Clustering is considered the most important aspect of unsupervised learning in data mining. It deals with finding structure in a collection of unlabeled data. One si...
| 249. |
Yang Xiang (University of Guelph, Canada)
Graphical models such as Bayesian networks (BNs) (Pearl, 1988; Jensen & Nielsen, 2007) and decomposable Markov networks (DMNs) (Xiang, Wong., & Cercone, 1997) have b...
| 250. |
Wen-Chi Hou (Southern Illinois University, USA)
Mining market basket data (Agrawal et al. 1993, Agrawal et al. 1994) has received a great deal of attention in the recent past, partly due to its utility and partly...
| 251. |
Andrew Hamilton-Wright (University of Guelph, Canada, & Mount Allison University, Canada), Daniel W. Stashuk (University of Waterloo, Canada)
A great deal of interesting real-world data is encountered through the analysis of continuous variables, however many of the robust tools for rule discovery and data...
| 252. |
Colin Cooper (Kings’ College, UK), Michele Zito (University of Liverpool, UK)
The association rule mining (ARM) problem is a well-established topic in the field of knowledge discovery in databases. The problem addressed by ARM is to identify a...
| 253. |
Brian C. Lovell (The University of Queensland, Australia), Shaokang Chen (NICTA, Australia), Ting Shan (NICTA, Australia)
Data mining is widely used in various areas such as finance, marketing, communication, web service, surveillance and security. The continuing growth in computing har...
| 254. |
Marzena Kryszkiewicz (Warsaw University of Technology, Poland)
Discovering frequent patterns in large databases is an important data mining problem. The problem was introduced in (Agrawal, Imielinski & Swami, 1993) for a sale...
| 255. |
Nicolas Lachiche (University of Strasbourg, France)
Receiver Operating Characteristic (ROC) curves have been used for years in decision making from signals, such as radar or radiology. Basically they plot the hit rate...
| 256. |
Juha Kontio (Turku University of Applied Sciences, Finland)
Reporting is one of the basic processes in all organizations. Reports should offer relevant information for guiding the decision-making. Reporting provides informati...
| 257. |
Brian C. Lovell (The University of Queensland, Australia), Shaokang Chen (NICTA, Australia), Ting Shan (NICTA, Australia)
While the technology for mining text documents in large databases could be said to be relatively mature, the same cannot be said for mining other important data type...
| 258. |
Jerzy W. Grzymala-Busse (University of Kansas, USA), Wojciech Ziarko (University of Regina, Canada)
Discovering useful models capturing regularities of natural phenomena or complex systems until recently was almost entirely limited to finding formulae fitting empir...
| 259. |
Gautam Das (The University of Texas at Arlington, USA)
In recent years, advances in data collection and management technologies have led to a proliferation of very large databases. These large data repositories typically...
| 260. |
V. Suresh Babu (Indian Institute of Technology-Guwahati, India), P. Viswanath (Indian Institute of Technology-Guwahati, India), Narasimha M. Murty (Indian Institute of Science, India)
Non-parametric methods like the nearest neighbor classifier (NNC) and the Parzen-Window based density estimation (Duda, Hart & Stork, 2000) are more general than par...
| 261. |
Mike Thelwall (University of Wolverhampton, UK)
Scientific Web Intelligence (SWI) is a research field that combines techniques from data mining, web intelligence and scientometrics to extract useful information fr...
| 262. |
Päivikki Parpola (Helsinki University of Technology, Finland)
Some parts of this text, namely “Co-operative Building, Adaptation, and Evolution of Abstract Models of a KB” and most subsections in “Performing Reasoning in SOOKAT...
| 263. |
Hadrian Peter (University of the West Indies, Barbados)
Over the past ten years or so data warehousing has emerged as a new technology in the database environment. “A data warehouse is a global repository that stores pre-...
| 264. |
Nils Pharo (Oslo University College, Norway)
Several studies of Web information searching (Agosto, 2002, Pharo & Järvelin, 2006, Prabha et al. 2007) have pointed out that searchers tend to satisfice. This means...
| 265. |
Shuguo Han (Nanyang Technological University, Singapore)
Rapid advances in automated data collection tools and data storage technology have led to the wide availability of huge amounts of data. Data mining can extract usefu...
| 266. |
Yehuda Lindell (Bar-Ilan University, Israel)
The increasing use of data mining tools in both the public and private sectors raises concerns regarding the potentially sensitive nature of much of the data being m...
| 267. |
Parvathi Chundi (University of Nebraska at Omaha, USA), Daniel J. Rosenkrantz (University of Albany, SUNY, USA)
Time series data is usually generated by measuring and monitoring applications, and accounts for a large fraction of the data available for analysis purposes. A time...
| 268. |
Yawei Wang (Montclair State University, USA)
The graying of America is one of the most significant demographic changes to the present and future of the United States (Moisey & Bichis, 1999). As more baby boomer...
| 269. |
Protima Banerjee (Drexel University, USA)
Over the past few decades, data mining has emerged as a field of research critical to understanding and assimilating the large stores of data accumulated by corporat...
| 270. |
Chrisa Tsinaraki (Technical University of Crete, Greece)
Several consumer electronic devices that allow capturing digital multimedia content (like mp3 recorders, digital cameras, DVD camcorders, smart phones etc.) are avai...
| 271. |
Ludovic Denoyer (University of Paris VI, France)
Document classification developed over the last ten years, using techniques originating from the pattern recognition and machine learning communities. All these meth...
| 272. |
Tobias Scheffer (Humboldt-Universität zu Berlin, Germany)
For many classification problems, unlabeled training data are inexpensive and readily available, whereas labeling training data imposes costs. Semi-supervised classi...
| 273. |
Cane W.K. Leung (The Hong Kong Polytechnic University, Hong Kong SAR)
Sentiment analysis is a kind of text classification that classifies texts based on the sentimental orientation (SO) of opinions they contain. Sentiment analysis of p...
| 274. |
Florent Masseglia (INRIA Sophia Antipolis, France), Maguelonne Teisseire (University of Montpellier II, France), Pascal Poncelet (Ecole des Mines d’ Alès, France)
Sequential pattern mining deals with data represented as sequences (a sequence contains sorted sets of items). Compared to the association rule problem, a study of s...
| 275. |
K. G. Srinivasa (M S Ramaiah Institute of Technology, Bangalore, India), K. R. Venugopal (University Visvesvaraya College of Engineering, Bangalore, India), L. M. Patnaik (Indian Institute of Science, Bangalore, India)
Efficient tools and algorithms for knowledge discovery in large data sets have been devised during the recent years. These methods exploit the capability of computer...
| 276. |
Liping Jing (Hong Kong Baptist University, Hong Kong), Michael K. Ng (Hong Kong Baptist University, Hong Kong), Joshua Zhexue Huang (The University of Hong Kong, Hong Kong)
High dimensional data is a phenomenon in real-world data mining applications. Text data is a typical example. In text mining, a text document is viewed as a vector o...
| 277. |
Seoung Bum Kim (The University of Texas at Arlington, USA), Chivalai Temiyasathit (The University of Texas at Arlington, USA), Sun-Kyoung Park (North Central Texas Council of Governments, USA), Victoria C.P. Chen (The University of Texas at Arlington, USA)
Vast amounts of data are being generated to extract implicit patterns of ambient air pollution. Because air pollution data are generally collected in a wide area of...
| 278. |
Wenyuan Li (Nanyang Technological University, Singapore)
With the rapid growth of the World Wide Web and the capacity of digital data storage, tremendous amounts of data are generated daily from business and engineering to...
| 279. |
Christophe Giraud-Carrier (Brigham Young University, USA)
With the growth and wide availability of the Internet, most retailers have successfully added the Web to their other, more traditional distribution channels (e.g., s...
| 280. |
Claudio Conversano (University of Cagliari, Italy), Roberta Siciliano (University of Naples, Federico II, Italy)
Statistical Data Editing (SDE) is the process of checking and correcting data for errors. Winkler (1999) defines it as the set of methods used to edit (clean-up) and im...
| 281. |
Maria Vardaki (University of Athens, Greece)
The term metadata is frequently used in many different sciences. Statistical metadata is generally used to denote “every piece of information required by a data user to...
| 282. |
Concetto Elvio Bonafede (University of Pavia, Italy)
A statistical model is a possible representation (not necessarily complex) of a situation of the real world. Models are useful to give a good knowledge of the princi...
| 283. |
Jun Zhu (Tsinghua University, China), Zaiqing Nie (Web Search and Mining Group Microsoft Research Asia, China), Bo Zhang (Tsinghua University, China)
The World Wide Web is a vast and rapidly growing repository of information. There are various kinds of objects, such as products, people, conferences, and so on, emb...
| 284. |
Alexander Thomasian (New Jersey Institute of Technology - NJIT, USA)
Data storage requirements have consistently increased over time. According to the latest WinterCorp survey (http://www/WinterCorp.com), “The size of the world’s larg...
| 285. |
Ingrid Fischer (University of Konstanz, Germany)
The amount of available data is increasing very fast. With this data, the desire for data mining is also growing. More and larger databases have to be searched to fi...
| 286. |
Jason Chen (Australian National University, Australia)
Clustering analysis is a tool used widely in the Data Mining community and beyond (Everitt et al. 2001). In essence, the method allows us to “summarise” the informat...
| 287. |
Mohammad Al Hasan (Rensselaer Polytechnic Institute, USA)
The research on mining interesting patterns from transactions or scientific datasets has matured over the last two decades. At present, numerous algorithms exist to...
| 288. |
Ullas Nambiar (IBM India Research Lab, India)
A query against incomplete or imprecise data in a database, or a query whose search conditions are imprecise can both result in answers that do not satisfy the quer...
| 289. |
Barak Chizi (Tel-Aviv University, Israel), Lior Rokach (Ben-Gurion University, Israel), Oded Maimon (Tel-Aviv University, Israel)
Dimensionality (i.e., the number of data set attributes or groups of attributes) constitutes a serious obstacle to the efficiency of most data mining algorithms (Mai...
| 290. |
Qiyang Chen (Montclair State University, USA)
Survival analysis (SA) consists of a variety of methods for analyzing the timing of events and/or the times of transition among several states or conditions. The eve...
| 291. |
Kuriakose Athappilly (Western Michigan University, USA)
Symbiotic data mining is an evolutionary approach to how organizations analyze, interpret, and create new knowledge from large pools of data. Symbiotic data miners a...
| 292. |
Silvia Casado Yusta (Universidad de Burgos, Spain), Joaquín Pacheco Bonrostro (Instituto de Empresa, Spain)
Variable selection plays an important role in classification. Before beginning the design of a classification method, when many variables are involved, only those va...