An Approach to Mining Crime Patterns

Sikha Bagui

doi:10.4018/978-1-60566-098-1.ch015

Source Title: Selected Readings on Database Technologies and Applications

Abstract

This paper presents a knowledge discovery effort to retrieve meaningful information about crime from a U.S. state database. The raw data were preprocessed, and data cubes were created using Structured Query Language (SQL). The data cubes were then used to derive quantitative generalizations and to support further analysis of the data. An entropy-based attribute relevance study was undertaken to determine the relevant attributes. WEKA, a machine learning software package, was used for mining association rules, developing a decision tree, and clustering. A self-organizing map (SOM) was used to view multidimensional clusters on a regular two-dimensional grid.
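
As a rough illustration of what an entropy-based attribute relevance study measures, the sketch below computes information gain for each attribute of a small table. It is not the chapter's procedure: the function names, the attribute names, and the toy records are all assumptions made for this example.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    counts = Counter(labels)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def information_gain(rows, attribute, target):
    """Reduction in entropy of `target` after splitting `rows` on `attribute`."""
    base = entropy([row[target] for row in rows])
    partitions = {}
    for row in rows:
        partitions.setdefault(row[attribute], []).append(row[target])
    remainder = sum(
        len(part) / len(rows) * entropy(part) for part in partitions.values()
    )
    return base - remainder


# Hypothetical toy records; not taken from the chapter's database.
records = [
    {"weapon": "firearm", "region": "north", "crime": "violent"},
    {"weapon": "firearm", "region": "south", "crime": "violent"},
    {"weapon": "none", "region": "north", "crime": "property"},
    {"weapon": "none", "region": "south", "crime": "property"},
]

gains = {a: information_gain(records, a, "crime") for a in ("weapon", "region")}
```

In this toy table the hypothetical `weapon` attribute separates the classes completely, while `region` carries no information about the class, so a relevance study of this kind would rank `weapon` above `region`.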

Chapter Preview

Introduction

Data mining applications are called upon to extract descriptive patterns, typically used for decision making, from the data held in traditional databases and, more recently, from unconventional information systems such as the Web.

Examples of these applications include market basket analysis, which extracts patterns such as association rules between purchased items; sequential pattern mining, which extracts temporal relationships between observed events; and classification, clustering, and link analysis, as in Quinlan (1993) and Agrawal, Imielinski, and Swami (1993), which underlie tasks such as user profiling, text mining, and graph mining. Furthermore, the extracted patterns can themselves be analysed in order to explain them. In this case the patterns are treated as data to be analysed, and not necessarily with the same analysis tool that was used to obtain them.
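
Association rules of the kind mentioned above are commonly evaluated by their support and confidence. The following minimal sketch computes both measures for a single rule; the basket contents and function names are hypothetical and serve only to make the definitions concrete.

```python
def support(transactions, itemset):
    """Fraction of transactions that contain every item in `itemset`."""
    itemset = set(itemset)
    return sum(1 for t in transactions if itemset <= set(t)) / len(transactions)


def confidence(transactions, antecedent, consequent):
    """Support of the whole rule divided by support of its antecedent."""
    joint = support(transactions, set(antecedent) | set(consequent))
    base = support(transactions, antecedent)
    return joint / base if base else 0.0


# Hypothetical market baskets, invented for illustration.
baskets = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
]

# Rule under test: bread => milk
rule_support = support(baskets, {"bread", "milk"})
rule_confidence = confidence(baskets, {"bread"}, {"milk"})
```

Here two of the four baskets contain both items, and two of the three baskets containing bread also contain milk. A mining tool would typically report only the rules whose support and confidence exceed user-chosen thresholds.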

Inductive databases were introduced by Imielinski and Mannila (1996) as general-purpose databases in which both the data and the patterns can be represented, retrieved, and manipulated, with the goal of assisting the deployment of the knowledge discovery process (KDD). Thus, KDD becomes a sequence of queries in a query language designed for a specific data mining problem (Boulicaut, Klemettinen, & Mannila, 1998). Consequently, an inductive database should integrate several heterogeneous data mining tools that deal with very different and complex data models. For example, raw source data may be represented as flat tables or, nowadays, as loosely structured documents containing data that come from the Web. The conceptual models also differ: classification tools usually adopt a classification tree as their data model, while basket analysis usually represents patterns by means of set enumeration models.

In this chapter, we propose a semi-structured data model specifically designed for inductive databases and, more generally, for knowledge discovery systems. This model is called XDM (XML for data mining). It is based on XML and is devised to cope with several distinctive features at the same time (Bray, Paoli, & Sperberg-McQueen, 1997).

• First, it is semi-structured, in order to be able to represent an a priori infinite set of data models.
• Second, it is based on two simple and clear concepts, named Data Item and Statement: a data item is a container of data and/or patterns; a statement is a description of an operator application.
• Third, with XDM the inductive database state is defined as the collection of data items and statements, and the knowledge discovery process is represented as a set of relationships between data items and statements.
• Fourth, it provides a definition of the database schema by means of the set of integrity constraints over the inputs and outputs of operators. Moreover, the schema constitutes the meta-data of the KDD process (i.e., it describes the kind of data produced by the operators). The database schema was obtained with the aid of XML-schema, which makes it possible to define constraints that must hold on specific data items or operators, thus ensuring a certain level of correctness of data and patterns. XML-schema specifications constrain the structure of XML documents and overcome the limitations of classical XML DTDs by adding the concept of data type for attributes. Refer to Thompson, Beech, Maloney, and Mendelson (2001) and Biron and Malhotra (2001) for detailed descriptions of XML-schema.
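
To make the notions of data item, statement, and database state more concrete, the sketch below builds a tiny XML document with Python's standard `xml.etree.ElementTree` module. This preview does not show XDM's actual syntax, so every element name, attribute name, the namespace URI, and the `mine_rules` operator are assumptions invented for illustration. The small reference check stands in for the kind of integrity constraint over operator inputs and outputs that the schema is described as enforcing.

```python
import xml.etree.ElementTree as ET

# Hypothetical namespace and element names; not XDM's real vocabulary.
XDM_NS = "http://example.org/xdm"
ET.register_namespace("xdm", XDM_NS)


def q(tag):
    """Qualify a tag name with the illustrative namespace."""
    return f"{{{XDM_NS}}}{tag}"


def data_item(name, payload):
    """A data item: a named container for data or patterns."""
    item = ET.Element(q("DATA-ITEM"), {"name": name})
    item.append(payload)
    return item


def statement(operator, inputs, outputs):
    """A statement: describes one application of an operator."""
    stmt = ET.Element(q("STATEMENT"), {"operator": operator})
    for role, names in (("INPUT", inputs), ("OUTPUT", outputs)):
        for name in names:
            ET.SubElement(stmt, q(role), {"ref": name})
    return stmt


def check_references(state):
    """Every input or output of a statement must name an existing data item."""
    names = {d.get("name") for d in state.findall(q("DATA-ITEM"))}
    return all(
        ref.get("ref") in names
        for stmt in state.findall(q("STATEMENT"))
        for ref in stmt
    )


# Raw data and extracted patterns, each wrapped in its own data item.
transactions = ET.Element("TRANSACTIONS")
ET.SubElement(transactions, "T", {"id": "1", "items": "bread milk"})
rules = ET.Element("RULES")
ET.SubElement(rules, "RULE", {"body": "bread", "head": "milk"})

# The database state is the collection of data items and statements.
state = ET.Element(q("DATABASE-STATE"))
state.append(data_item("purchases", transactions))
state.append(data_item("purchase_rules", rules))
state.append(statement("mine_rules", ["purchases"], ["purchase_rules"]))

xml_text = ET.tostring(state, encoding="unicode")
```

In this sketch the statement links the `purchases` data item to the `purchase_rules` data item, which is one way to read the claim that the knowledge discovery process is a set of relationships between data items and statements.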

The features of the model discussed above lay the foundations for operator interoperability within a single framework (provided that the APIs of the various operators are XML compliant). Finally, the adoption of XML as the syntactic format provides several benefits; in particular, the concept of namespace opens the way to the integration of several data formats and operators inside the same framework (Bray, Hollander, & Layman, 1999).

XDM provides several interesting features for inductive databases:
