Special Offers
- IGI Global’s New Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 17 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no additional cost for multi-user licensing, no embargo of content, full-text PDF & HTML format, and more.
  Learn More
- Open Access Book (Free Access) - Encyclopedia of Information Science and Technology, Sixth Edition (ISBN: 9781668473665)
  The Encyclopedia of Information Science and Technology, Sixth Edition) continues the legacy set forth by the first five editions by providing comprehensive coverage and up-to-date definitions of the most important issues, concepts, and trends pertaining to technological advancements and information management within a variety of settings and industries. The entire book is being published under open access.
  Read Now
- Open Access Book (Free Access) - Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries (ISBN: 9781668456293)
  Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries provides information on the recent technology, mitigation, and environmental protection that must be applied for food sustainability in developing countries. This book is being published under Platinum Open Access through funding from Diponegoro University, Indonesia.
  Read Now
- Open Access Book (Free Access) - New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY (ISBN: 9781668438091)
  The Walmart Corporation and the Lumina Foundation have provided funding to make New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY fully open access, completely removing any paywall between scholars in education and the latest research on new models for the future of higher education.
  Read Now
- Open Access Book (Free Access) - Handbook of Research on the Global View of Open Access and Scholarly Communications (ISBN: 9781799898054)
  Through a collaboration between IGI Global and the University of North Texas, the Handbook of Research on the Global View of Open Access and Scholarly Communications has been published as fully open access, completely removing any paywall between researchers of any field, and the latest research on the equitable and inclusive nature of Open Access and all of its complications.
  Read Now
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Journals by Field
e-Collections
OnDemand
Open Access
- View All Open Access Opportunities
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through IGI Global’s Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open accesss endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global to publish your work under open access? Review IGI Global’s open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us

A Hybrid Technique Using PCA and Wavelets in Network Traffic Anomaly Detection

Stevan Novakov, Chung-Horng Lung, Ioannis Lambadaris, Nabil Seddigh

Source Title: Research Methods: Concepts, Methodologies, Tools, and Applications

DOI: 10.4018/978-1-4666-7456-1.ch034

OnDemand:

(Individual Chapters)

Available

$37.50

Current Special Offers

No Current Special Offers

Abstract

Research into network anomaly detection has become crucial as a result of a significant increase in the number of computer attacks. Many approaches in network anomaly detection have been reported in the literature, but data or solutions typically are not freely available. Recently, a labeled network traffic flow dataset, Kyoto2006+, has been created and is publicly available. Most existing approaches using Kyoto2006+ for network anomaly detection apply various clustering techniques. This paper leverages existing well known statistical analysis and spectral analysis techniques for network anomaly detection. The first popular approach is a statistical analysis technique called Principal Component Analysis (PCA). PCA describes data in a new dimension to unlock otherwise hidden characteristics. The other well known spectral analysis technique is Haar Wavelet filtering analysis. It measures the amount and magnitude of abrupt changes in data. Both approaches have strengths and limitations. In response, this paper proposes a Hybrid PCA–Haar Wavelet Analysis. The hybrid approach first applies PCA to describe the data and then Haar Wavelet filtering for analysis. Based on prototyping and measurement, an investigation of the Hybrid PCA–Haar Wavelet Analysis technique is performed using the Kyoto2006+ dataset. The authors consider a number of parameters and present experimental results to demonstrate the effectiveness of the hybrid approach as compared to the two algorithms individually.

Chapter Preview

Top

Introduction

The way networks are being used is rapidly changing and a by-product of this change is that the types of computer attacks are rapidly evolving. For example, malicious attacks are no longer limited to desktop computer viruses, but can target a network itself (Estevez-Tapiador et al., 2004). These attacks are designed to create failures in the system. Depending on the network, these failures can cause mild inconvenience, loss of productivity, loss of economic activity, or even, loss of public well being. This paper applies statistical analysis and spectral analysis techniques to network traffic data in order to identify potential malicious network attacks. Specifically, this paper focuses on a hybrid technique to provide information to a network operator such that the source of malicious behavior can be isolated.

There is a need for effective and scalable approaches to maintain network stability and to detect anomalous network traffic behavior created by attacks. This need is increasingly being addressed through the use of flow-based protocols such as Cisco’s NetFlow protocol (Cisco, 2012). This protocol resides on routers and each packet that passes through is examined for a set of IP packet attributes. The output of NetFlow is a multi-tuple record, called a flow. Some core features of a flow are: Source IP address, Destination IP address, total bytes, etc. NetFlow does not indicate whether a flow is a part of abnormal or malicious behavior (NetFlow, 2012).

Current anomaly detection approaches can be classified into two main categories: knowledge base approaches to identify attacks through patterns for signatures (Bro Secucity, 2012) and approaches to detect patterns which do not conform to expected behavior (Campos & Milenova, 2005). Inspecting individual signatures or traces of known hazards based on a knowledge base is time consuming and inefficient. Furthermore, the turnaround from discovery to updating the knowledge base can be extensive. The second type for anomaly detection is not dependent on an existing knowledge base and identifies potential network threats by finding deviations from normal behavior. Some statistical models and signal processing algorithms have been used for this purpose. These methods can be applied relatively quickly to create relationships and discover patterns from a range of data types and sizes. However, a comprehensive anomaly detection system will require a significant amount of human expertise (Campos & Milenova, 2005).

This paper proposes a hybrid solution for network anomaly detection based on statistical and spectral analysis techniques, which provides the network administrator time slices containing potential network traffic anomalies. To the best of our knowledge, no such hybrid techniques have deployed by systems described in literature.

The statistical analysis studied is a modified or time shifted Principal Component Analysis (PCA) technique to determine abnormal network behavior (Brauckhoff, Salamatian, & May, 2009). Components are extracted by comparing feature data similarity. A ranked subset of components, selected by comparing the sparsity of projected data, is used to create a subspace that describes anomalous behavior. Time grouped data is projected onto this space and spectral analysis is applied. The feature with the most spread out data is considered in spectral analysis portion.

The spectral analysis technique adopted is Haar Wavelet decomposition (Barford, Kline, Plonka, & Ron, 2002). This type of wavelet decomposition uses a Haar basis function to decompose the input dataset set into core time functions. Thresholds are applied to remove noise and highlight network traffic anomaly characteristics. The signal is reconstructed and a weighted score to describe the magnitude of fluctuations within each time slice is calculated. A high score represents a large change in a time window and suggests to the network administrator that abnormal and potentially malicious behavior is present.

Complete Chapter List

Search this Book:

Reset

MLA

APA

Chicago

Export Reference

A Hybrid Technique Using PCA and Wavelets in Network Traffic Anomaly Detection

Abstract

Introduction

Complete Chapter List