Open Access Open Access  Restricted Access Subscription Access

Methods of Analysing Missing Values in a Regression Model


Affiliations
1 Department of Mathematics and Statistics University of Port Harcourt, Nigeria
2 Department of Mathematics and Statistics University of Port Harcourt
 

Different methods of imputation are adopted in this study to compensate for missing values encountered in the data collected. The imputation methods considered are the overall mean value, Random Overall, Logistic Regression, Linear Regression, Predictive Match, Multiple Imputations and the Hot Deck Imputation. The various values obtained by the methods were analysed and compared using Bartlett's test statistic for equality of variances among groups (Mean Square Errors of the seven methods). The software packages used for this research work are Winmice, Solas and SAS. Different values were estimated applying the various methods. However, results obtained from the test showed that the variances among the groups have no significant differences, that is, any of the imputation methods could be used. Further test using relative variance revealed that the multiple imputation method may be preferred.

Keywords

Missing at Random, Imputation, Bartlett's Test, Coefficient of Relative Variance
User

  • Afifi AA and Elashoff RM (1966) Missing observations in multivariate statistics. J. Am. Stat. Asso. 61(2), 595–604.
  • Kim J and Curry J (1977) The Treatment of Missing Data in Multivariate Analysis. Sociol. Methods & Res. 6, 215–240.
  • Lepkowski JM, J Richard Landis, Sharon A Stehouwer (1987) Strategies for Analyzing of Imputed Data from Sample Survey. The Med. Care utilization & Expenditure. 25(8), 705-715.
  • Little RJA and Rubin DB (2002) Statistical analysis with missing data 2nd Edition, Wiley, NY.
  • Rubin DB (1986) Statistical matching using file concentration with adjusted weights and multiple imputation. J. Bus. & Econ. Stat. 4, 87-94.
  • SAS Learning Edition Version 4.1. Bringing the power of data analytics to individuals. http://www.amazon. com/SAS-Learning-4-1-Little-nterprise/dp/1590479173.

Abstract Views: 408

PDF Views: 94




  • Methods of Analysing Missing Values in a Regression Model

Abstract Views: 408  |  PDF Views: 94

Authors

P. Ogoke Uchenna
Department of Mathematics and Statistics University of Port Harcourt, Nigeria
E. C. Nduka
Department of Mathematics and Statistics University of Port Harcourt

Abstract


Different methods of imputation are adopted in this study to compensate for missing values encountered in the data collected. The imputation methods considered are the overall mean value, Random Overall, Logistic Regression, Linear Regression, Predictive Match, Multiple Imputations and the Hot Deck Imputation. The various values obtained by the methods were analysed and compared using Bartlett's test statistic for equality of variances among groups (Mean Square Errors of the seven methods). The software packages used for this research work are Winmice, Solas and SAS. Different values were estimated applying the various methods. However, results obtained from the test showed that the variances among the groups have no significant differences, that is, any of the imputation methods could be used. Further test using relative variance revealed that the multiple imputation method may be preferred.

Keywords


Missing at Random, Imputation, Bartlett's Test, Coefficient of Relative Variance

References





DOI: https://doi.org/10.17485/ijst%2F2012%2Fv5i2%2F30333