Creating a synthetic database for research in migration and subjective well-being:
Statistical Matching techniques for combining the basic and complementary questionnaires of the Hungarian Microcensus 2016
In 2016, with the aim of tracking social trends between full-scope censuses, the Hungarian Central Statistical Office (HCSO) carried out Microcensus, a population survey based on an unusually large sample covering 10 percent of the Hungarian households. Apart from the basic questionnaires on dwellings and personal information, selected households were asked to fill in one of the following complementary surveys on a) international migration, b) subjective well-being, c) social stratification, d) occupational prestige, and e) health problems.
From a methodological point of view, the Microcensus dataset with the above described structure of basic and complementary questionnaires invites for performing a statistical – or synthetic – matching exercise. This method, in accordance with the UNECE Data Integration Guide [1], “involves the integration of data sources with usually distinct samples from the same target population, in order to study and provide information on the relationship of variables not jointly observed in the data sets”. That is, the statistical matching exercise resembles an “imputation problem of the target variables from a donor to a recipient survey” on the basis of common variables [2].
The HCSO methodologists and experts of population and migration statistics embarked on creating such a synthetic Microcensus database of the complementary modules on the basis of the variables from the basic questionnaires. The resulting dataset – unique in terms of sample size – will contain, apart from the information obtained in the basic questionnaires, the estimated/imputed data from each of the complementary sets of questions. Thus, it will serve to develop analytic richness through making possible the study of the diverse relationships between a wide range of variables never observed together in Hungary. In their joint effort, as a first step, the multidisciplinary project team is currently working on identifying the best solutions to combine the basic and two of the complementary questionnaires: the one on migration and another on the aspects of subjective well-being. This presentation – instead of entering in the details of the analytical results – focuses on the methodological questions of creating such a synthetic database and the evaluation of the usability and potential of the output dataset. It also gives an overview of the first phase of and the lessons already learnt from the ongoing experiment and insight into the next steps.
Reference:
CPS09-002
Session:
Measuring Well-Being
Presenter/s:
Zoltán Csányi
Presentation type:
Oral presentation
Room:
JENK
Chair:
Gaby Umbach, European University Institute, Italy, (Email)
Date:
Thursday, 14 March
Time:
11:30 - 12:30
Session times:
11:30 - 12:30