Creating a Data Lakehouse for a South African Government-Sector Learning Control Enforcing Quality Control for Incremental Extract-Load-Transform Pipe

Creating a Data Lakehouse for a South African Government-Sector Learning Control Enforcing Quality Control for Incremental Extract-Load-Transform Pipe

Dharmesh Dhabliya (Vishwakarma Institute of Information Technology, India), Vivek Veeraiah (Sri Siddharth Institute of Technology, Sri Siddhartha Academy of Higher Education, India), Sukhvinder Singh Dari (Symbiosis Law School, Symbiosis International University, India), Jambi Ratna Raja Kumar (Genba Sopanrao Moze College of Engineering, India), Ritika Dhabliya (ResearcherConnect, India), Sabyasachi Pramanik (Haldia Institute of Technology, India), and Ankur Gupta (Vaish College of Engineering, India)
Copyright: © 2024 |Pages: 22
DOI: 10.4018/979-8-3693-1582-8.ch004
This chapter was retracted

Key Terms in this Chapter

Data Ingestion: The process of collecting and importing information from a variety of sources into an archive, e.g. the Data Lakehouse, so that it can be made available for analysis.

Data Lakehouse: A unified data management architecture that enables organizations to store and analyze structured and unstructured data on a single platform, combining the benefits of Data Lakes and Data Warehouses.

Semi-Structured Data: Data which has an irregular and implicit structure that lacks a defined data model.

Data Orchestration: This is to ensure efficient data flows and integration into a system, often facilitated through automated working processes, by coordinating and managing different processing tasks.

Data Quality: The degree to which data are accurate, complete, timely, and consistent, providing reliable and meaningful information for decision making.

Complete Chapter List

Search this Book:
Reset