11:00 - 12:00
We-IPS05
Chair:
Ioannis XIROUCHAKIS (Eurostat, Luxembourg)
Organiser:
Claude LAMBORAY (Eurostat, Other)
Classifying transaction and web scraped data
Ingolf Boettcher, (Email), et al.
Statistics Austria

A mapping must be established between individual products that are available in scanner/web scraped data sets and the classification used for the statistical product, such as price indices. Paper and presentation introduce the range of available (semi-) automated methods (incl. their pros and cons and technical pre-conditions) and describe in detail an advanced approach taken to classify the data for CPI compilation using machine learning techniques.