An Experimental Analysis to Learn Data Imbalance in Scholarly Data: A Case Study on ResearchGate

An Experimental Analysis to Learn Data Imbalance in Scholarly Data: A Case Study on ResearchGate

Mitali Desai, Rupa G. Mehta, Dipti P. Rana
DOI: 10.4018/978-1-7998-7371-6.ch014
OnDemand:
(Individual Chapters)
Available
$37.50
No Current Special Offers
TOTAL SAVINGS: $37.50

Abstract

Data imbalance is a key challenge in the majority of real-world classification problems. It refers to the disparity of data instances corresponding to either of the class labels. Data imbalance is studied in detail with respect to many data domains such as transaction data, medical data, e-commerce data, meteorological data, social media data, and web data. But the scholarly data domain is yet to be analyzed pertaining to data imbalance. In this chapter, the scholarly data domain is explored with a focus to study various forms of data imbalance. A well-known and popular scholarly platform, ResearchGate (RG), is targeted to extract real scholarly data. An extensive experimental analysis is performed on the extracted data in order to identify the existence of both data-level and network-level imbalance. The outcome contributes to the learning of various types of data imbalance that exist in scholarly data. Resolving the existing data imbalance will substantially help in achieving efficient and accurate outcomes in many real-world scholarly literature applications.
Chapter Preview
Top

Data imbalance problem has remained at the focus since a long and many researchers have contributed to this domain. In this section, the data domains that are explored in terms of analyzing data imbalance in recent research are briefly discussed.

Complete Chapter List

Search this Book:
Reset