Publishing georeferenced statistical data using linked open data technologies
In January 2018, Statistics Poland concluded the “Development of guidelines for publishing statistical data as linked open data” project. The project aimed to take a thorough inventory of data sources and to investigate technologies that could be used to publish georeferenced statistics as linked open data.
Data samples from statistical databases and geospatial datasets were selected for transformation into linked open data RDF triples, and a dataset catalogue was set up and encoded in RDF. A pilot triple store was established with a SPARQL endpoint (a query interface). Besides being machine-readable, all data created in the pilot was also published internally as human-readable web pages using linked open data frontend software.
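To make the idea of a statistical observation expressed as RDF triples more concrete, the following is a minimal sketch and not the pilot's actual encoding. It assumes, purely for illustration, the W3C RDF Data Cube vocabulary and the SDMX dimension and measure namespaces; the abstract itself leaves the choice of vocabularies open. The base URI, the identifiers and the figures are hypothetical, and the helper names (observation_triples, to_ntriples) are invented for this example.

```python
# Sketch: encode one statistical observation as RDF triples (N-Triples syntax).
# Assumption: the RDF Data Cube and SDMX vocabularies are used only as an
# illustration; the source does not say which vocabularies the pilot chose.

RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
QB = "http://purl.org/linked-data/cube#"
SDMX_DIMENSION = "http://purl.org/linked-data/sdmx/2009/dimension#"
SDMX_MEASURE = "http://purl.org/linked-data/sdmx/2009/measure#"
XSD = "http://www.w3.org/2001/XMLSchema#"

# Hypothetical base URI, not a real Statistics Poland address.
BASE = "http://example.org/stat/"


def iri(value):
    """Wrap a URI in angle brackets, as N-Triples requires."""
    return f"<{value}>"


def typed_literal(value, datatype):
    """Build a typed literal such as "1000"^^<...#integer>."""
    return f'"{value}"^^<{datatype}>'


def observation_triples(obs_id, dataset_id, area_id, year, value):
    """Return (subject, predicate, object) triples for one observation."""
    subject = iri(f"{BASE}observation/{obs_id}")
    return [
        (subject, iri(RDF_TYPE), iri(QB + "Observation")),
        (subject, iri(QB + "dataSet"), iri(f"{BASE}dataset/{dataset_id}")),
        # The reference area links the figure to a spatial unit.
        (subject, iri(SDMX_DIMENSION + "refArea"), iri(f"{BASE}area/{area_id}")),
        (subject, iri(SDMX_DIMENSION + "refPeriod"), typed_literal(year, XSD + "gYear")),
        (subject, iri(SDMX_MEASURE + "obsValue"), typed_literal(value, XSD + "integer")),
    ]


def to_ntriples(triples):
    """Serialise triples, one statement per line, each ending with ' .'."""
    return "\n".join(f"{s} {p} {o} ." for s, p, o in triples)


# Made-up sample values, for illustration only.
example = observation_triples("o1", "population", "A01", 2017, 1000)
print(to_ntriples(example))
```

Each printed line is one subject-predicate-object statement. A triple store loaded with such statements could then be queried through a SPARQL endpoint, for example to list every observation that refers to a given area.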
The pilot linked open data implementation was a valuable exercise that answered many questions but also raised new ones: Is there a reference implementation for statistical data? Which vocabularies should be used? What should we link to? How should geospatial data be encoded to make it most usable? Most implementations are technically correct, but are they of good quality?
Reference:
CPS03-002
Session:
Linked Open Data
Presenter/s:
Mirosław Migacz
Presentation type:
Oral presentation
Room:
JENK
Chair:
Benjamin Sakarovitch, INSEE, France
Date:
Tuesday, 12 March
Time:
17:15 - 18:15
Session times:
17:15 - 18:15