Classifying transaction and web scraped data
A mapping must be established between individual products that are available in scanner/web scraped data sets and the classification used for the statistical product, such as price indices. Paper and presentation introduce the range of available (semi-) automated methods (incl. their pros and cons and technical pre-conditions) and describe in detail an advanced approach taken to classify the data for CPI compilation using machine learning techniques.