Weighting Imputation for Categorical Data

Weighting Imputation for Categorical Data

Liang-Ting Tsai, Chih-Chien Yang, Timothy Teo
Copyright: © 2014 |Pages: 11
ISBN13: 9781466652026|ISBN10: 1466652020|EISBN13: 9781466652033
DOI: 10.4018/978-1-4666-5202-6.ch241
Cite Chapter Cite Chapter

MLA

Tsai, Liang-Ting, et al. "Weighting Imputation for Categorical Data." Encyclopedia of Business Analytics and Optimization, edited by John Wang, IGI Global, 2014, pp. 2706-2716. https://doi.org/10.4018/978-1-4666-5202-6.ch241

APA

Tsai, L., Yang, C., & Teo, T. (2014). Weighting Imputation for Categorical Data. In J. Wang (Ed.), Encyclopedia of Business Analytics and Optimization (pp. 2706-2716). IGI Global. https://doi.org/10.4018/978-1-4666-5202-6.ch241

Chicago

Tsai, Liang-Ting, Chih-Chien Yang, and Timothy Teo. "Weighting Imputation for Categorical Data." In Encyclopedia of Business Analytics and Optimization, edited by John Wang, 2706-2716. Hershey, PA: IGI Global, 2014. https://doi.org/10.4018/978-1-4666-5202-6.ch241

Export Reference

Mendeley
Favorite

Abstract

This article aims to propose the Learning Vector Quantization (LVQ) approach to impute missing group membership and sampling weights in inferring the accuracy of population parameters of confirmatory factor analysis (CFA) models with categorical questionnaires. Survey data with missing group memberships, for example, gender, age, or ethnicity, are very familiar. However, the group memberships of examinees are critical for calculating the stratum sampling weights. Asparouhov (2005), Tsai and Yang (2008), and Yang and Tsai (2008) have described that appropriate imputation can further improve the precision of CFA model estimations. Questionnaires with categorical responses are not well established yet. In this study, a Monte Carlo simulation was conducted to compare the LVQ method with the other three existing methods (e.g., listwise-deletion, weighting-class adjustment, non-weighted). Four experimental factors, such as missing data rates, sampling sizes, disproportionate sampling, and different populations, were used to examine the performance of these four methods. The results showed that the LVQ method outperformed the other three methods in terms of accuracy of parameters of CFA model with binary or 5-category responses. The conclusion and discussion sections of this article provide for some practical guidelines.

Request Access

You do not own this content. Please login to recommend this title to your institution's librarian or purchase it from the IGI Global bookstore.