An Experimental Analysis to Learn Data Imbalance in Scholarly Data: A Case Study on ResearchGate

Mitali Desai, Rupa G. Mehta, Dipti P. Rana
ISBN13: 9781799873716 | ISBN10: 1799873714 | ISBN13 Softcover: 9781799873723 | EISBN13: 9781799873730
DOI: 10.4018/978-1-7998-7371-6.ch014
Cite Chapter

MLA

Desai, Mitali, et al. "An Experimental Analysis to Learn Data Imbalance in Scholarly Data: A Case Study on ResearchGate." Data Preprocessing, Active Learning, and Cost Perceptive Approaches for Resolving Data Imbalance, edited by Dipti P. Rana and Rupa G. Mehta, IGI Global, 2021, pp. 242-254. https://doi.org/10.4018/978-1-7998-7371-6.ch014

APA

Desai, M., Mehta, R. G., & Rana, D. P. (2021). An Experimental Analysis to Learn Data Imbalance in Scholarly Data: A Case Study on ResearchGate. In D. Rana & R. Mehta (Eds.), Data Preprocessing, Active Learning, and Cost Perceptive Approaches for Resolving Data Imbalance (pp. 242-254). IGI Global. https://doi.org/10.4018/978-1-7998-7371-6.ch014

Chicago

Desai, Mitali, Rupa G. Mehta, and Dipti P. Rana. "An Experimental Analysis to Learn Data Imbalance in Scholarly Data: A Case Study on ResearchGate." In Data Preprocessing, Active Learning, and Cost Perceptive Approaches for Resolving Data Imbalance, edited by Dipti P. Rana and Rupa G. Mehta, 242-254. Hershey, PA: IGI Global, 2021. https://doi.org/10.4018/978-1-7998-7371-6.ch014

Abstract

Data imbalance is a key challenge in most real-world classification problems. It refers to a disparity in the number of data instances belonging to each class label. Data imbalance has been studied in detail in many data domains, such as transaction data, medical data, e-commerce data, meteorological data, social media data, and web data, but the scholarly data domain has yet to be analyzed in this respect. This chapter explores the scholarly data domain with a focus on its various forms of data imbalance. Real scholarly data is extracted from ResearchGate (RG), a popular scholarly platform. An extensive experimental analysis of the extracted data identifies both data-level and network-level imbalance. The outcome contributes to an understanding of the types of data imbalance that exist in scholarly data. Resolving this imbalance will substantially help in achieving efficient and accurate outcomes in many real-world scholarly literature applications.
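
As a rough illustration of the data-level imbalance the abstract describes, the following minimal sketch computes an imbalance ratio (majority class count divided by minority class count) over a list of class labels. The function name `imbalance_ratio` and the sample labels are hypothetical and are not taken from the chapter or its ResearchGate data; the chapter may measure imbalance differently.

```python
from collections import Counter


def imbalance_ratio(labels):
    """Return majority-class count divided by minority-class count.

    A value of 1.0 means the classes are perfectly balanced;
    larger values indicate stronger imbalance.
    """
    counts = Counter(labels)
    if len(counts) < 2:
        raise ValueError("need at least two classes to measure imbalance")
    return max(counts.values()) / min(counts.values())


# Hypothetical labels for illustration only (assumption, not chapter data).
labels = ["low"] * 90 + ["high"] * 10
ratio = imbalance_ratio(labels)
print(ratio)
```

A ratio well above 1 suggests that a classifier trained on such labels may favor the majority class unless the imbalance is addressed.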
