Search the World's Largest Database of Information Science & Technology Terms & Definitions
InfInfoScipedia LogoScipedia
A Free Service of IGI Global Publishing House
Below please find a list of definitions for the term that
you selected from multiple scholarly research resources.

What is Semi-Structured Data

Machine Learning and Data Science Techniques for Effective Government Service Delivery
Data which has an irregular and implicit structure that lacks a defined data model.
Published in Chapter:
Developing a Data Lakehouse for a South African Government-Sector Training Authority: Implementing Quality Control for Incremental Extract-Load-Transform Pipelines in the Ingestion Layer
Priyanka Govender (Durban University of Technology, South Africa), Nalindren Naicker (Durban University of Technology, South Africa), Sulaiman Saleem Patel (Durban University of Technology, South Africa), Seena Joseph (Durban University of Technology, South Africa), Devraj Moonsamy (Durban University of Technology, South Africa), Ayotuyi Tosin Akinola (Durban University of Technology, South Africa), Lavanya Madamshetty (Durban University of Technology, South Africa), and Thamotharan Prinavin Govender (Durban University of Technology, South Africa)
DOI: 10.4018/978-1-6684-9716-6.ch006
Abstract
The Durban University of Technology is undertaking a project to develop a data lakehouse system for a South African government-sector training authority. This system is considered critical to enhance the monitoring and evaluation capabilities of the training authority and ensure service delivery. Ensuring the quality of data ingested into the lakehouse is critical, as poor data quality deteriorates the efficiency of the lakehouse solution. This chapter studies quality control for ingestion-layer pipelines to propose a data quality framework. Metrics considered for data quality were completeness, accuracy, integrity, correctness, and timeliness. The framework was evaluated by practically applying it to a sample semi-structured dataset to gauge its effectiveness. Recommendations for future work include expanded integration, such as incorporating data from more varied sources and implementing incremental data ingestion triggers.
Full Text Chapter Download: US $37.50 Add to Cart
More Results
Machine Learning in Text Analysis
Data that have some organizational property, but not having some row and column relationship.
Full Text Chapter Download: US $37.50 Add to Cart
Reading Data Possibilities From an LMS Data Portal Data Dictionary
A mix of labeled and unlabeled data (such as a data table with imagery involved).
Full Text Chapter Download: US $37.50 Add to Cart
Full Text Chapter Download: US $37.50 Add to Cart
Promoting Social and Solidarity Economy through Big Data
Data that do not conform to fixed fields but contain tags and other markers to separate data elements. Contrast with structured data and unstructured data.
Full Text Chapter Download: US $37.50 Add to Cart
Big Data: Its Implications on Healthcare and Future Steps
Structured data that does not fully conform with the formal structure of data models associated with relational databases or other forms of data tables
Full Text Chapter Download: US $37.50 Add to Cart
Querying GML: A Pressing Need
Data with incomplete structure. Data are directly described using a simple syntax, e.g. XML, GML, etc.
Full Text Chapter Download: US $37.50 Add to Cart
eContent Pro Discount Banner
InfoSci OnDemandECP Editorial ServicesAGOSR