About this project
This project is led by Workstream 5 of the UN Task Team on Scanner data, part of the UN Committee of Experts on Big Data and Data Science for Official Statistics (UN-CEBD).
About the project
What this project aims to do
Research in the price statistics discipline is not as reproducible as it could be. Most researchers utilize proprietary datasets and don’t publish the code alongside the research so that a specific methodology or finding can be easily reproduced. The idea of this project is to help in both topics! We aim to:
- Provide clear and approachable guidance on how researchers in price statistics can make their projects reproducible (including by learning new skills, by understanding how to publish and cite code, and many other topics).
- Support cataloging open datasets that can be used as benchmarks to use for research purposes. The idea is to have all findings trialed and demonstrated on open benchmark datasets.
The project also builds on great efforts of early promoters of open science in the discipline!1 We are aiming to continue to build momentum and make it easier for researchers to work openly!
What this project does not aim to do
Not all team members are listed, we will update this section soon.
As there is a lot of great guidance on the topic already, including The Turing Way, Reproducible Analytical Pipelines or RAP, and many more resources—the idea is to distill key information for the price statistics community, not to create a new standard.
Project team members
The following are current or historic members of Workstream 5 of the UN Task Team.
Serge Goussev
- Role: Workstream lead (2024 - current)
- GitHub id: sergegoussev
- Email: serge.goussev@statcan.gc.ca
Steve Martin
- Role: Contributor (2024 - current)
- GitHub id: marberts
Claude Lamboray
- Role: Contributor (2024 - current)
Christophe Bontemps
- Role: Contributor (2024 - current)
Tanya Flower
- Role: Contributor (2024 - current)
Ben Hillman
- Role: Contributor (2024 - current)
Jens Mehrhoff
- Role: Contributor (2024 - current)
- Email: jens.mehrhoff@bundesbank.de
Footnotes
Notable early efforts include but are not limited to Frances Krsinich publishing the FEWS package with open data in 2018, Jens Mehrhoff showcasing the open Dominick’s Finer Foods scanner dataset in 2019, Erwin Dielert and Chihiro Shimizu opening a dataset of japanese laptop sales, or Jacek Białek including datasets with the PriceIndices R package.↩︎