Outliers are a well-known problem in survey estimation, and a variety of approaches have been suggested for dealing with them in this context. However, when the focus is on small area estimation using the survey data, much less is known – even though outliers within a small area sample are clearly much more influential than they are in the larger overall sample. To the best of our knowledge, Chambers and Tzavidis (2006) is the only published methodological approach to small area estimation to date that has explicitly addressed the issue of outlier robustness, using an approach based on fitting outlier robust M-quantile models to the survey data. Recently, Sinha and Rao (2008) have also addressed this issue from the perspective of linear mixed models. Both these approaches, however, use plug-in robust prediction. That is, they replace parameter estimates in optimal, but outlier sensitive, predictors by outlier robust versions. As such, they essentially represent the application of the well-known approach to outlier robust survey estimation that excludes outliers from the prediction component of a survey estimate. Unfortunately, this approach may involve an unacceptable prediction bias (but a low prediction variance) in situations where the outliers are drawn from a distribution that has a different mean to the rest of the survey data (Chambers, 1986), which then leads to the suggestion that outlier robust prediction should include an additional term that makes a correction for this bias. In this paper, we explore the extension of this idea to the small area estimation situation, and use simulation based on realistic outlier contaminated data to evaluate how this extended approach compares with the plug-in robust methods described earlier.

Outlier Robust Small Area Estimation

SALVATI, NICOLA;
2009-01-01

Abstract

Outliers are a well-known problem in survey estimation, and a variety of approaches have been suggested for dealing with them in this context. However, when the focus is on small area estimation using the survey data, much less is known – even though outliers within a small area sample are clearly much more influential than they are in the larger overall sample. To the best of our knowledge, Chambers and Tzavidis (2006) is the only published methodological approach to small area estimation to date that has explicitly addressed the issue of outlier robustness, using an approach based on fitting outlier robust M-quantile models to the survey data. Recently, Sinha and Rao (2008) have also addressed this issue from the perspective of linear mixed models. Both these approaches, however, use plug-in robust prediction. That is, they replace parameter estimates in optimal, but outlier sensitive, predictors by outlier robust versions. As such, they essentially represent the application of the well-known approach to outlier robust survey estimation that excludes outliers from the prediction component of a survey estimate. Unfortunately, this approach may involve an unacceptable prediction bias (but a low prediction variance) in situations where the outliers are drawn from a distribution that has a different mean to the rest of the survey data (Chambers, 1986), which then leads to the suggestion that outlier robust prediction should include an additional term that makes a correction for this bias. In this paper, we explore the extension of this idea to the small area estimation situation, and use simulation based on realistic outlier contaminated data to evaluate how this extended approach compares with the plug-in robust methods described earlier.
2009
978-90-73592-29-2
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11568/134859
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact