[Statlist] seminar 29.11.2023

MATEI Alina Gabriela A||n@@M@te| @end|ng |rom un|ne@ch
Thu Nov 16 11:31:25 CET 2023


The Institute of Statistics, University of Neuchâtel is organizing on Wednesday, November 29, 2023 at 11:00 the seminar "Measuring impact and reducing the dimension of auxiliary variables in the calibration estimator" by Alessio Guandalini (Italian National Institute of Statistics).

Room : GB31-33, Faculté des Sciences, Bâtiment G, Av. des Bellevaux 51, 2000, Neuchâtel

Abstract: In multipurpose surveys several interest variables and a very large number of auxiliary variables are collected. Auxiliary variables are usually considered in calibration (Deville and Särndal, 1992) for improving estimates. But, very often, some of them are included for the sole purpose of increasing consistency. Consistency is an important point for National Statistical Institutes especially as a means for promoting credibility in published statistics. As a direct result, the number of auxiliary variables considered in calibration continue to grow over time. In literature, several methods show how to manage many auxiliary variables in order to prevent some unpleasant consequences on the accuracy of estimates. They consist mainly in variable selection or dimension reduction and they are very useful for deriving calibrated estimates more accurately. Among these methods, there are forward and stepwise selection (Chambers et al., 1999 and Clark and Chambers, 2008), lasso (McConville et al., 2017), the partial least squares (Swold et al., 2001), the principal component analysis (PCA) (Cardot et al., 2017) and, recently, even machine learning techniques. However, looking at them, it is not easy to infer how much the contribution of each auxiliary variable is, especially when there are plenty of them. The Shapley decomposition (Shapley, 1953) applied in the calibration context could be a useful tool to better understand the net effects of auxiliary variables, and, in addition, it provides further information for supporting researchers in choosing the best calibration system. It provides a direct measure of the change with respect to Horvitz-Thompson (Horvitz and Thompson, 1952) estimates and to related sampling variances due to the introduction of each auxiliary variable in the calibration. An application on real data of the Italian Labour Force Survey (LFS) that makes an extensive use of auxiliary variables in calibration will be shown.


Main references
Cardot, H., C. Goga, and M.-A. Shehzad. 2017. Calibration and partial calibration on principal components when the number of auxiliary variables is large. Stat. Sin.: 243-260.
Chambers, R. L.. 1996. Robust case-weighting for multipurpose establishment surveys. J. Off. Stat. 12(1): 3-32.
Chambers, R. L., C. Skinner, S. Wang. 1999. Intelligent calibration. Bull. Int. Stat. Inst. 58(2): 321-324.
Clark, R.G, R.L. Chambers. 2008. Adaptive calibration for prediction of finite population totals. Surv. Methodol. 34(2): 163-172.
Deville, J.-C., C.-E. Särndal. 1992. Calibration estimators in survey sampling. J. Am. Stat. Assoc. 87(418): 376-382.
Horvitz, D.G. and D. J. Thompson. 1952. A generalization of sampling without replacement from a finite universe. J Am Stat Assoc.  47(260): 663-685.
McConville, K.S., Breidt F.J., Lee T.M.C., Moisen G.G.. 2017. Model-assisted survey regression estimation with the LASSO. J. Surv. Stat. Methodol. 5:131-158.
Shapley, L.S.. 1953. A value for n-person games. Contributions to the Theory of Games 2(28): 307-317.
Swold, S., M. Sjöström, L. Eriksson. PLS-regression: a basic tool of chemometrics. Chemom. Intell. Lab. Syst 58(2): 109-130.



More information about the Statlist mailing list