A New MapReduce Approach with Dynamic Fuzzy Inference for Big Data Classification Problems

Shangzhu Jin, Jun Peng, Dong Xie

Source Title: International Journal of Cognitive Informatics and Natural Intelligence (IJCINI) 12(3)

DOI: 10.4018/IJCINI.2018070103

Abstract

Big data and its applications have become an emergent research topic. In practice, the MapReduce framework and its extensions are the most popular approaches for processing big data, and fuzzy system-based models stand out in many applications. However, in a big data environment, as in classical fuzzy inference, a given observation may have no overlap with the antecedent values of any rule; no rule can then be invoked, and therefore no consequence can be derived. Fortunately, fuzzy rule interpolation techniques can support inference in such cases. Combining traditional fuzzy reasoning with fuzzy interpolation may improve the accuracy of the inferred conclusions. Therefore, this article reports an initial investigation into a MapReduce framework with dynamic fuzzy inference/interpolation for big data applications (BigData-DFRI). The results of an experimental investigation of this method are presented, demonstrating the potential and efficacy of the proposed approach.

1. Introduction

Big data is a term for data sets that are so large or complex that traditional data processing applications have difficulty handling them. It often refers simply to the use of predictive analytics, user behavior analytics, or certain other advanced data analytics methods that extract value from data, and seldom to a particular size of data set (Madden, 2012; Luo et al., 2015; Wibig, 2010). The architecture of big data systems has been rebuilt at the storage, processing, and database levels to allow data to be distributed among different computers, supporting parallel access from different disks to increase processing speed. With more data available, the analysis and knowledge extraction process should benefit, and more accurate and precise information should be obtained (Wang & Peng, 2017). The frameworks typically used to handle big data involve some form of parallelization so that they can readily process and analyse the data. One of the most popular platforms for big data today, MapReduce (Dean & Ghemawat, 2004), is a programming model and an associated implementation for processing and generating large data sets. A MapReduce program is composed of two key operations: a map function that acts on a subset of the data, and a reduce function that integrates the results obtained by the map function.
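
The two operations described above can be illustrated with a minimal single-process sketch. It is not the Hadoop implementation and not the authors' code: the function names (`map_partition`, `reduce_counts`, `run_mapreduce`) and the toy data are assumptions made purely for illustration. Each partition is mapped to a partial result independently, and the partial results are then merged by the reduce step.

```python
from collections import Counter
from functools import reduce

def map_partition(records):
    """Map step: count class labels within one data partition."""
    return Counter(label for _, label in records)

def reduce_counts(left, right):
    """Reduce step: merge two partial counts into one."""
    return left + right

def run_mapreduce(partitions):
    # In a real cluster each partition would be mapped on a different node;
    # here the map calls simply run one after another.
    partials = [map_partition(p) for p in partitions]
    return reduce(reduce_counts, partials, Counter())

# Hypothetical toy data: (feature value, class label) pairs in two partitions.
partitions = [
    [(0.1, "a"), (0.4, "b")],
    [(0.7, "a"), (0.9, "a")],
]
totals = run_mapreduce(partitions)
```

Because the reduce function only merges partial results, the map calls do not depend on one another, which is what makes the scheme straightforward to parallelize.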

To manage the uncertainty caused by the variety and veracity of big data, a few works address this topic from the perspective of fuzzy modelling. So far, most of the existing approaches adopt the Hadoop MapReduce implementation. Most effort has gone into clustering algorithms, especially a scalable fuzzy c-means approach (Wasikowski & Chen, 2009; Garg & Trivedi, 2014). The results in terms of purity shown by this model were comparable to other hard and fuzzy clustering techniques (Xu et al., 2015). A parallel implementation of fuzzy clustering has also been applied to the organization of text documents (Goswami & Shishodia, 2013). To deal with classification tasks, a fuzzy rule-based classification system adapted to the MapReduce scheme was proposed (Río et al., 2015; Baciu et al., 2016). An extension of this implementation was developed in (López et al., 2015), in which a cost-sensitive approach was derived from the original approach in order to address classification with imbalanced datasets (López et al., 2013). However, a lack of data in the training partitions (Wasikowski & Chen, 2010), also known as the rare cases problem, may cause a low density in the problem domain. In these cases, the existing fuzzy big data models are not directly applicable to sparse rule-based big data systems, because they assume dense rule bases.

Depending on the nature of the rule base, either fuzzy inference, such as the compositional rule of inference (CRI), or fuzzy rule interpolation (FRI) may be employed to draw a conclusion. CRI methods rely on a dense rule base in which any observation can find at least one completely or partially matching rule. In many real-world problems, obtaining such a complete rule base is costly or even impractical. Interpolation is more robust when working on sparse rule bases. On the other hand, interpolated conclusions may not be as accurate as their inferred counterparts when a partial match between a given observation and the rule base can be established. To compensate for the drawbacks of these two techniques, this paper proposes an integrated reasoning system called dynamic fuzzy inference/interpolation for big data (BigData-DFRI), and reports an initial investigation into the feasibility of a dynamic fuzzy inference/interpolation-based classification system adapted to the MapReduce scheme. The method is also applicable to calculating crucial missing variables and intermediate variables by using backward fuzzy interpolation (Jin et al., 2014). In so doing, the overall BigData-DFRI model and its reasoning become more transparent and interpretable.
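
The idea of switching between inference and interpolation can be sketched for a single input variable as follows. This is a deliberately simplified illustration, not the BigData-DFRI method itself: the `Rule` class, the triangular antecedents, the crisp consequents, the weighted-average firing (a stand-in for CRI), and the linear interpolation between the two nearest rules (a stand-in for FRI) are all assumptions introduced here.

```python
from dataclasses import dataclass

@dataclass
class Rule:
    a: tuple  # antecedent triangle (left, peak, right); hypothetical layout
    b: float  # crisp consequent value, a simplification of a fuzzy consequent

def membership(tri, x):
    """Degree to which x belongs to a triangular fuzzy set."""
    left, peak, right = tri
    if x == peak:
        return 1.0
    if left < x < peak:
        return (x - left) / (peak - left)
    if peak < x < right:
        return (right - x) / (right - peak)
    return 0.0

def infer(rules, x):
    degrees = [membership(r.a, x) for r in rules]
    total = sum(degrees)
    if total > 0:
        # At least one rule matches: weighted average of fired consequents.
        return "inference", sum(d * r.b for d, r in zip(degrees, rules)) / total
    # No rule matches: interpolate between the nearest rule on each side.
    lo = max((r for r in rules if r.a[1] < x), key=lambda r: r.a[1], default=None)
    hi = min((r for r in rules if r.a[1] > x), key=lambda r: r.a[1], default=None)
    if lo is None or hi is None:
        raise ValueError("observation outside the rule base; extrapolation not covered")
    t = (x - lo.a[1]) / (hi.a[1] - lo.a[1])
    return "interpolation", (1 - t) * lo.b + t * hi.b

# A sparse rule base with a gap between x = 2 and x = 6.
rules = [Rule((0, 1, 2), 10.0), Rule((6, 7, 8), 40.0)]
fired = infer(rules, 1.5)   # overlaps the first antecedent
gap = infer(rules, 4.0)     # falls in the gap, so interpolation is used
```

In this sketch the observation at 1.5 overlaps the first antecedent and is handled by rule firing, whereas the observation at 4.0 matches no antecedent and is handled by interpolation, mirroring the dense versus sparse distinction drawn above.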
