Save 10% on All IGI Global Research Books
& OnDemand Individual Chapter & Article DownloadsAvailable exclusively on IGI Global’s Online Bookstore. Offer valid through October 31, 2024

Special Offers
- Save 10% on the IGI Global Online bookstore
  Now through October 31, 2024, save 10% on all IGI Global research books & OnDemand individual chapter & article downloads. IGI Global contributors may stack this discount with their exclusive 50% contributor discount, which is automatically applied when logged into a contributor portal account. Non-contributors may also combine the discount with one other discount, including coupon codes. Not valid on open access processing charges, e-collections, or videos. Discount is not applicable for distributors.
  Explore Books & Chapters
- IGI Global’s New Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 17 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no additional cost for multi-user licensing, no embargo of content, full-text PDF & HTML format, and more.
  Learn More
- Open Access Book (Free Access) - Encyclopedia of Information Science and Technology, Sixth Edition (ISBN: 9781668473665)
  The Encyclopedia of Information Science and Technology, Sixth Edition) continues the legacy set forth by the first five editions by providing comprehensive coverage and up-to-date definitions of the most important issues, concepts, and trends pertaining to technological advancements and information management within a variety of settings and industries. The entire book is being published under open access.
  Read Now
- Open Access Book (Free Access) - Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries (ISBN: 9781668456293)
  Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries provides information on the recent technology, mitigation, and environmental protection that must be applied for food sustainability in developing countries. This book is being published under Platinum Open Access through funding from Diponegoro University, Indonesia.
  Read Now
- Open Access Book (Free Access) - New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY (ISBN: 9781668438091)
  The Walmart Corporation and the Lumina Foundation have provided funding to make New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY fully open access, completely removing any paywall between scholars in education and the latest research on new models for the future of higher education.
  Read Now
- Open Access Book (Free Access) - Handbook of Research on the Global View of Open Access and Scholarly Communications (ISBN: 9781799898054)
  Through a collaboration between IGI Global and the University of North Texas, the Handbook of Research on the Global View of Open Access and Scholarly Communications has been published as fully open access, completely removing any paywall between researchers of any field, and the latest research on the equitable and inclusive nature of Open Access and all of its complications.
  Read Now
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education & Social Sciences
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education & Social Sciences
  - Journals by Field
e-Collections
OnDemand
Open Access
- View All Open Access Opportunities
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through IGI Global’s Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open accesss endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global to publish your work under open access? Review IGI Global’s open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us

Towards Convergence in Information Systems Design

Deepika Prakash

Source Title: Novel Approaches to Information Systems Design

DOI: 10.4018/978-1-7998-2975-1.ch011

OnDemand:

(Individual Chapters)

Available

$33.75

List Price: $37.50

Current Special Offers

10% Discount:-$3.75

TOTAL SAVINGS: $3.75

Abstract

Three technologies—business intelligence, big data, and machine learning—developed independently and address different types of problems. Data warehouses have been used as systems for business intelligence, and NoSQL databases are used for big data. In this chapter, the authors explore the convergence of business intelligence and big data. Traditionally, a data warehouse is implemented on a ROLAP or MOLAP platform. Whereas MOLAP suffers from having propriety architecture, ROLAP suffers from the inherent disadvantages of RDBMS. In order to mitigate the drawbacks of ROLAP, the authors propose implementing a data warehouse on a NoSQL database. They choose Cassandra as their database. For this they start by identifying a generic information model that captures the requirements of the system to-be. They propose mapping rules that map the components of the information model to the Cassandra data model. They finally show a small implementation using an example.

Chapter Preview

Top

Introduction

Business Intelligence (BI), Big Data and Machine Learning (ML) are three among the major technological developments in the last 15 years. Business Intelligence encompasses query reporting, data mining in the context of providing decision support. It is based on Data Warehouse (DW) technology. Traditionally, Data Warehouse (DW) star schemas are implemented either using a relational database which allows ROLAP operations or on a multi-dimensional database that allows MOLAP operations. While the data in the former is stored in relational tables, the data in the latter are stored in multidimensional databases (MDB). MDBs use either multi-dimensional array or hypercubes to store this data. A number of RDBMS offer support for building DW systems and for ROLAP queries. MOLAP engines have proprietary architectures. This results in niche servers and is often a disadvantage.

One of the early views of Big Data is that any data satisfying the properties of Velocity, Volume, Variety is big data; this was expanded to include Veracity. Clearly, based on this definition there are two major concerns (a) building a repository for storage of large amounts of data, (b) accommodating a variety of data. To address (a), there was a shift away from vertical scaling to what is called horizontal scaling. Unlike vertical scaling, horizontal scaling is done using commodity machines. Horizontal scaling leads to a repository of data which is distributed across nodes and datacenters. Now, to address (b), variety includes structured, semi-structured and unstructured data. While traditional relational databases are able to store structured data, unstructured data can be stored as a BLOB. The BLOB does not allow full range of querying and processing. Thus, a new model and architecture for databases was required that also provided horizontal scaling. The answer was found in NoSQL databases.

The third technological development is Machine Learning (ML). The area develops and applies algorithms enabling a system to learn. Notice, the system learns by itself without any additional explicit program being written. This may be done through learning patterns or inference rules. The aim of this learning is to gain insights and improve user experience. ML algorithms make no commitment to data storage and management.

If we compare the three technologies from the query viewpoint, we find that BI is oriented to provide business information; Big Data systems improve execution of unstructured and distributed data; and finally ML improves the quality of data in the hands of the user. The first relies on an explicit data storage and architecture of a data warehouse, the second relies on the NoSQL data storage and architecture whereas the ML de-emphasizes the data aspects but deals with the processing aspects almost exclusively. It can be seen that these three technologies reflect the tension between data orientation and process orientation in information systems with BI and Big Data at the data end and ML at the process end.

Figure 1.

The three technological islands

The three technologies developed independently and at different times (see figure 1): BI was the earliest followed by Big data and ML that were developed almost at the same time. Notice that these three technologies, in so far as they address different domains, are isolated from one another. Yet, there is no reason why these could not benefit from cross fertilization. Indeed there is a case for convergence of these three.

Notice that ML algorithms depend on “lots of data” to effectively run the algorithms. In fact, they not only need bulk storage of data but also historical data. For example, they may need voice data for the last 4 years for analysis. Further they require a system that enables quick random “reads”. Based on the white paper (TDWI, 2018) there are eight requirements for storage of ML data some of which are need for scalability, durability and parallel architecture. Notice, these requirements are satisfied by a Big Data system. The input to an ML algorithm can be unstructured or structured data. Outputs are typically smaller and output storage can be often handled easily.

Complete Chapter List

Search this Book:

Reset

MLA

APA

Chicago

Export Reference

Towards Convergence in Information Systems Design

Abstract

Introduction

Complete Chapter List