Reproducible analysis with Renku
RENKU is an open-source platform for data analytics designed with reproducibility, reusability and the ability to collaborate as its main concerns. As the researcher perform their analysis, data lineage is automatically recorded and seamlessly captures both the workflow within and across projects, allowing any derived data to be unambiguously traced back to the original raw data sources in a manner that is fully transparent. The results are verified by an automated build, allowing scientists to detect problems with reproducibility early in the process.
To simplify the adoption of Renku and to ensure that the data, metadata and code captured by the platform is interoperable with other systems, we rely on frequently used open-source tools like git, GitLab, the Common Workflow Language, Docker and JSON-LD schemas. The platform allows users to perform analysis either off-line using a RENKU python client, or in a (self-)hosted cloud environment. While each project is a self-contained repository, workflows may seamlessly reference workflow steps from other projects and even projects on other RENKU instances.
Reference:
CPS10-003
Session:
Open, transparent and reproducible
Presenter/s:
Roskar Rok
Presentation type:
Oral presentation
Room:
JENK
Chair:
Per Nymand-Andersen, European Central Bank, Germany, (Email)
Date:
Thursday, 14 March
Time:
14:30 - 15:30
Session times:
14:30 - 15:30