Spam Mail Filtering Using Data Mining Approach: A Comparative Performance Analysis

Ajay Kumar Gupta

Source Title: Handling Priority Inversion in Time-Constrained Distributed Databases

DOI: 10.4018/978-1-7998-2491-6.ch015

Abstract

This chapter presents an overview of spam email as a serious problem in the internet world and builds a spam filter that reduces earlier weaknesses and provides better identification accuracy with less complexity. Because the J48 decision tree is a widely used classification technique, owing to its simple structure, higher classification accuracy, and lower time complexity, it is used here as the spam mail classifier. With lower complexity, however, it becomes difficult to achieve high accuracy on a large number of records. To overcome this problem, particle swarm optimization is used to optimize the spam base dataset, thereby optimizing the decision tree model and reducing the time complexity. Once the records have been standardized, the decision tree is used again to check the classification accuracy. The chapter presents a study of various spam-related issues, the filters in use, related work, and the potential scope of spam filtering.

Introduction

Spam (Attri, 2012) is the use of electronic messaging systems, including most broadcast media, to send unsolicited bulk messages indiscriminately to computers, mobile phones, PDAs, and other devices. E-mail spam, also known as junk e-mail or unsolicited bulk e-mail, is a subset of spam in which nearly identical e-mail messages are transmitted to a large number of recipients. As the number of messages in an inbox grows, removing the unwanted e-mail becomes a nuisance. Current surveys show an increasing trend in the amount of incoming spam, and scammer attacks are becoming more targeted and consequently more of a threat. When targeted attacks first emerged five years ago, Symantec MessageLabs Intelligence tracked between one and two attacks per week. Subsequently, attacks increased to 10 per day and then to 60 per day in 2010. The share of spam sent from European countries is expected to rise to 40 to 45 percent of all spam. These facts indicate that spam is a serious problem both today and for the future, and that it makes sense to investigate new, effective methods against it. The purpose of this work is to identify techniques for filtering spam from incoming e-mail. Spam filtering categorizes all incoming e-mails in a network into spam and ham (legitimate) messages. Here, the important issues related to spam filtering, the applicable classification steps, the methods, and the evaluation measures are discussed in detail. Much work has already been done in the spam-filtering domain, including Bayesian networks, decision trees, and k-nearest neighbor classifiers (Ma, 2009; Razmara, 2012), with extra features or additional methods.
As filtering techniques advance, spammers frequently change the external characteristics of their e-mails to mislead spam filtering systems. This creates a need for adaptive filtering systems that react quickly to such changes and provide fast, high-quality self-tuning with a new set of features. The studies so far show that many filtering techniques are based on text categorization methods, but none can claim to provide an ideal solution, i.e., zero percent false positives and zero percent false negatives. There is still considerable scope for research in classifying text messages as well as multimedia messages. It is not possible to maintain 100% accuracy and efficiency in spam filtering, but one should try to make the model as efficient, reliable, and accurate as possible. To be more accurate, a classifier should avoid the following two cases.

• Ham Misclassification: Genuine mail should not be classified as spam. With this kind of misclassification, the receiver may remain unaware of important mail, which can sometimes be very damaging and pose serious risks.
• Spam Misclassification: Spam should not be classified as legitimate mail, since it can cause considerable financial and behavioral damage.
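
The two misclassification cases above correspond to the false positives and false negatives mentioned earlier, and they can be measured separately once a filter's predictions are compared with the true labels. The following is a minimal sketch, not the chapter's implementation: the function name `evaluate_filter`, the `FilterErrors` container, and the label strings "spam" and "ham" are assumptions chosen for illustration.

```python
from dataclasses import dataclass


@dataclass
class FilterErrors:
    false_positive_rate: float  # share of ham wrongly flagged as spam
    false_negative_rate: float  # share of spam wrongly passed as ham
    accuracy: float             # share of all messages classified correctly


def evaluate_filter(actual, predicted):
    """Compare true labels with predicted labels ('spam' or 'ham')."""
    if len(actual) != len(predicted):
        raise ValueError("actual and predicted must have the same length")

    tp = fp = tn = fn = 0
    for a, p in zip(actual, predicted):
        if a == "spam" and p == "spam":
            tp += 1          # spam correctly caught
        elif a == "ham" and p == "spam":
            fp += 1          # ham misclassification
        elif a == "ham" and p == "ham":
            tn += 1          # ham correctly delivered
        elif a == "spam" and p == "ham":
            fn += 1          # spam misclassification
        else:
            raise ValueError(f"unexpected label pair: {a!r}, {p!r}")

    total = tp + fp + tn + fn
    return FilterErrors(
        # Guard against empty classes to avoid division by zero.
        false_positive_rate=fp / (fp + tn) if (fp + tn) else 0.0,
        false_negative_rate=fn / (fn + tp) if (fn + tp) else 0.0,
        accuracy=(tp + tn) / total if total else 0.0,
    )


if __name__ == "__main__":
    actual = ["spam", "spam", "ham", "ham", "ham"]
    predicted = ["spam", "ham", "ham", "spam", "ham"]
    print(evaluate_filter(actual, predicted))
```

Reporting the two error rates separately, rather than accuracy alone, makes it visible when a filter reaches high accuracy only by discarding legitimate mail.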
