Email), Elena Goni, (Email), Marina Ayestaran, (Email), Javier San Vicente"> Email)">
15:45 - 16:45
Contributed Paper Session
Room: JENK
Chair:
Martin Karlberg, Eurostat, Luxembourg, (Email)
Discussant:
Jens Mehrhoff, European Commission (Eurostat), Luxembourg, (Email)
BIG DATA: EUSTAT EXPERIENCE, DEVELOPMENT OF THE PILOT PROJECT TOWARDS PRODUCTION
Jorge Aramendi, (Email), Elena Goni, (Email), Marina Ayestaran, (Email), Javier San Vicente, (Email)
EUSTAT - Basque Statistical Office, Vitoria-Gasteiz
“Big Data” represents a paradigmatic change for Official Statistics and Eustat, aware of this, has created a cross-sector group for all the institutions which have relied on the University of the Basque Country´s collaboration. In 2015, a data-capture pilot to establish hotel prices on the Internet was proposed, allowing Eustat to develop its own online data-capture programme, to examine data purging tasks in greater detail and to confront the challenge of storing large volumes of data. We are currently dealing with the analysis of data gathered between August 2017 up to the present. The analysis includes three new products and we are using Machine Learning techniques to carry it out. The three new products are: an alternative to ADR (Average Daily Rate) monthly estimates by region that Eustat publishes using data from the Tourism Survey, a spatial analysis of prices and hotel patterns depending on seasonal price variations. Python is the software that has been used for data collection and R for the analysis of the results. The following sources of information were used to analyse hotel prices in the Basque Country: prices scraped from Booking, the Eustat Survey on Tourist Establishments and the Tourism Directory.


Reference:
CPS11-004
Session:
Experimental Statistics
Presenter/s:
Jorge Aramendi
Presentation type:
Oral presentation
Room:
JENK
Chair:
Martin Karlberg, Eurostat, Luxembourg, (Email)
Date:
Thursday, 14 March
Time:
15:45 - 16:45
Session times:
15:45 - 16:45