Open Access
Subscription Access
Methods of Analysing Missing Values in a Regression Model
Different methods of imputation are adopted in this study to compensate for missing values encountered in the data collected. The imputation methods considered are the overall mean value, Random Overall, Logistic Regression, Linear Regression, Predictive Match, Multiple Imputations and the Hot Deck Imputation. The various values obtained by the methods were analysed and compared using Bartlett's test statistic for equality of variances among groups (Mean Square Errors of the seven methods). The software packages used for this research work are Winmice, Solas and SAS. Different values were estimated applying the various methods. However, results obtained from the test showed that the variances among the groups have no significant differences, that is, any of the imputation methods could be used. Further test using relative variance revealed that the multiple imputation method may be preferred.
Keywords
Missing at Random, Imputation, Bartlett's Test, Coefficient of Relative Variance
User
Information
- Afifi AA and Elashoff RM (1966) Missing observations in multivariate statistics. J. Am. Stat. Asso. 61(2), 595–604.
- Kim J and Curry J (1977) The Treatment of Missing Data in Multivariate Analysis. Sociol. Methods & Res. 6, 215–240.
- Lepkowski JM, J Richard Landis, Sharon A Stehouwer (1987) Strategies for Analyzing of Imputed Data from Sample Survey. The Med. Care utilization & Expenditure. 25(8), 705-715.
- Little RJA and Rubin DB (2002) Statistical analysis with missing data 2nd Edition, Wiley, NY.
- Rubin DB (1986) Statistical matching using file concentration with adjusted weights and multiple imputation. J. Bus. & Econ. Stat. 4, 87-94.
- SAS Learning Edition Version 4.1. Bringing the power of data analytics to individuals. http://www.amazon. com/SAS-Learning-4-1-Little-nterprise/dp/1590479173.
Abstract Views: 408
PDF Views: 94