11:30 - 12:30
Tu-STS02
Chair:
Samuel Stolton
From raw telco data to final statistics: a modular process with synthetic network event data
David Salgado, (Email) 1, 2, Luis Sanguiao, (Email) 1, Bogdan Oancea, (Email) 3, 4, Sandra Barragán, (Email) 1, Marian Necula, (Email) 3
1 Statistics Spain (INE)
2 Complutense University of Madrid
3 Statistics Romania (INS)
4 University of Bucharest

We present an end-to-end process from raw telecommunication data to final outputs (basically population counts). The process is illustrated with synthetic network event data from a simulator developed in the ESSnet on Big Data. This is part of a strategy to develop the ESS Reference Methodological Framework for Mobile Network Data even in the blocked scenario of access to real data. The process deals with aspects of the estimation (geolocation, deduplication, aggregation, and inference to the target population). Each step is developed with an R package implementing the methodology as a proof-of-concept. Steps are integrated via the total probability theorem.