Dataset

nz-electronics-modified
Open Data Contract Standard v3.1.0
scanner electronics-and-applicances

Fundamentals

Basic information about the data contract

Name
New Zealand Electronics Modified Dataset
Version
1.0
Tenant
StatisticsNZ
Purpose
A modified dataset based on real scanner data obtained from GfK. The dataset contains monthly sales (both quantity and total sales) for 459 unique products in a single consumer electronics category over 26 months (2017-01-01 to 2019-02-01). The dataset includes 11 quality characteristics, all obfuscated to hide their true values. The distribution of the quality characteristics and their relationships to quality adjustment were maintained during the modification process.

The dataset facilitates evaluation of quality adjustment methods with scanner data.
Usage
Open dataset, shared under "GNU General Public License" by StatisticsNZ.

Links:
How to cite
APA style:
TBC

Chicago style
TBC

Bibtex citation

Entity Relationship Diagram

Visual representation of data model relationships

                    erDiagram
	"**Dataset**" {
	month_num object
	prodid_num number
	char1 number
	char2 number
	char3 object
	char4 object
	char5 object
	char6 object
	char7 object
	char8 object
	char9 object
	char10 object
	char11 object
	quantity number
	value number
}


                  

Schema

The data schema and structure

Dataset
The original dataset was modified for publication and its variable names were obfuscated. prodid_num was a unique identifier for each product. However, several variables were dropped when the synthetic dataset was published, so some rows share the same prodid_num and the same quality characteristics but have different quantities or values. Aggregating such rows (for example with a group-by) before using the data is recommended.
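The group-by recommendation above can be sketched as follows. This is a minimal illustration using only the Python standard library, not the publisher's own procedure: the function name `aggregate_duplicates`, the helper `make_row`, and the sample rows are assumptions made for the example; only the column names come from the schema.

```python
from collections import defaultdict

# Grouping key: month, product ID and all 11 quality characteristics (names from the schema).
KEY_COLUMNS = ["month_num", "prodid_num"] + [f"char{i}" for i in range(1, 12)]

def aggregate_duplicates(rows):
    """Sum quantity and value over rows that share every key column."""
    totals = defaultdict(lambda: {"quantity": 0.0, "value": 0.0})
    for row in rows:
        key = tuple(row[col] for col in KEY_COLUMNS)
        totals[key]["quantity"] += float(row["quantity"])
        totals[key]["value"] += float(row["value"])
    # Rebuild one dict per group, restoring the key columns next to the sums.
    return [dict(zip(KEY_COLUMNS, key), **sums) for key, sums in totals.items()]

def make_row(month, prodid, quantity, value):
    """Hypothetical helper: builds a row with placeholder characteristics."""
    row = {"month_num": month, "prodid_num": prodid,
           "quantity": quantity, "value": value}
    row.update({f"char{i}": "val_a" for i in range(1, 12)})
    return row

# Invented sample: the first two rows describe the same product in the same month.
sample = [
    make_row("2017-01-01", 3, 280, 196420),
    make_row("2017-01-01", 3, 20, 13580),
    make_row("2017-02-01", 3, 100, 70000),
]

aggregated = aggregate_duplicates(sample)
for row in aggregated:
    print(row["month_num"], row["prodid_num"], row["quantity"], row["value"])
```

Grouping on the full set of characteristics as well as prodid_num is a cautious choice; if prodid_num alone is trusted to identify a product, the key could be reduced to month_num and prodid_num.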
| Property | Business Name | Type | Required | Description |
| --- | --- | --- | --- | --- |
| month_num | Reporting month | object | No | Reporting month (month 1 to 26 of the series, starting from January 2017), given as an ISO 8601 date (YYYY-MM-DD) in which the day is always the first of the month. Example: '2017-01-01' |
| prodid_num | Product identifier | number | No | Distinct product identifier number assigned during the modification process to simulate a product ID (such as a GTIN). There are a total of 459 unique products in the dataset. Note on data quality: the same prodid_num (with identical quality characteristics) can appear more than once within a month. This is because other variables that separated those products were omitted during the data modification process, and product IDs were assigned at the end. Aggregating (group by) the data before use is recommended. |
| char1 | Quality characteristic 1 | number | No | Modified quality characteristic 1 (obfuscated to hide true value). Example: '10.6' |
| char2 | Quality characteristic 2 | number | No | Modified quality characteristic 2 (obfuscated to hide true value). Example: '16006' |
| char3 | Quality characteristic 3 | object | No | Modified quality characteristic 3 (obfuscated to hide true value). Example: 'val_w' |
| char4 | Quality characteristic 4 | object | No | Modified quality characteristic 4 (obfuscated to hide true value). Example: 'val_a' |
| char5 | Quality characteristic 5 | object | No | Modified quality characteristic 5 (obfuscated to hide true value). Example: 'val_a' |
| char6 | Quality characteristic 6 | object | No | Modified quality characteristic 6 (obfuscated to hide true value). Example: 'PRG566' |
| char7 | Quality characteristic 7 | object | No | Modified quality characteristic 7 (obfuscated to hide true value). Example: 'CCC' |
| char8 | Quality characteristic 8 | object | No | Modified quality characteristic 8 (obfuscated to hide true value). Example: '150D' |
| char9 | Quality characteristic 9 | object | No | Modified quality characteristic 9 (obfuscated to hide true value). Example: 'B230' |
| char10 | Quality characteristic 10 | object | No | Modified quality characteristic 10 (obfuscated to hide true value). Example: 'ted' |
| char11 | Brand | object | No | Modified quality characteristic 11 (obfuscated to hide true value). Represents the brand of the product. Example: 'brand_a' |
| quantity | Quantity | number | No | Quantity of products sold during the period (month). Example: '280' |
| value | Total sales | number | No | Total sales value of products sold during the period (month), in New Zealand dollars (NZD); also referred to as turnover. Example: '196420' |
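As an illustration of how the schema might be applied when reading the data, the sketch below casts the columns typed as number and converts month_num into the month position (1 to 26) that its description mentions. It assumes rows arrive as dictionaries of strings, as `csv.DictReader` would produce; the names `month_index` and `parse_row` are hypothetical and not part of the contract or of any tool.

```python
from datetime import date

FIRST_MONTH = date(2017, 1, 1)   # first reporting month stated in the contract
N_MONTHS = 26                    # 2017-01-01 to 2019-02-01
# Columns the schema types as "number"; all others are kept as strings.
NUMERIC_COLUMNS = ["prodid_num", "char1", "char2", "quantity", "value"]

def month_index(month_num):
    """Convert a 'YYYY-MM-DD' string to a 1-based month index (1 = January 2017)."""
    d = date.fromisoformat(month_num)
    if d.day != 1:
        raise ValueError(f"expected the first day of a month, got {month_num}")
    index = (d.year - FIRST_MONTH.year) * 12 + (d.month - FIRST_MONTH.month) + 1
    if not 1 <= index <= N_MONTHS:
        raise ValueError(f"{month_num} is outside the {N_MONTHS}-month range")
    return index

def parse_row(raw):
    """Return a copy of raw with numeric columns cast to float and month_index added."""
    row = dict(raw)
    for col in NUMERIC_COLUMNS:
        row[col] = float(raw[col])
    row["month_index"] = month_index(raw["month_num"])
    return row

# Values reuse the examples listed in the schema; the combination is illustrative.
raw = {
    "month_num": "2019-02-01", "prodid_num": "3", "char1": "10.6", "char2": "16006",
    "char3": "val_w", "char4": "val_a", "char5": "val_a", "char6": "PRG566",
    "char7": "CCC", "char8": "150D", "char9": "B230", "char10": "ted",
    "char11": "brand_a", "quantity": "280", "value": "196420",
}
parsed = parse_row(raw)
print(parsed["month_index"], parsed["quantity"], parsed["value"])
```

Raising on a date outside the stated range is a design choice for the sketch; a reader who expects the dataset to be extended may prefer to drop that check.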
Created at 24 Feb 2026 05:02:51 UTC with Data Contract CLI v0.11.5
version: '1.0'
kind: DataContract
apiVersion: v3.1.0
id: nz-electronics-modified
name: New Zealand Electronics Modified Dataset
tenant: StatisticsNZ
tags:
- scanner
- electronics-and-applicances
status: |-
  Draft
  Target documentation level: 2
description:
  usage: |
    Open dataset, shared under "GNU General Public License" by StatisticsNZ.

    Links:
  purpose: |
    A modified dataset based on real scanner data obtained from GfK. The dataset contains monthly sales (both quantity and total sales) for 459 unique products in a single consumer electronics category over 26 months (2017-01-01 to 2019-02-01). The dataset includes 11 quality characteristics, all obfuscated to hide their true values. The distribution of the quality characteristics and their relationships to quality adjustment were maintained during the modification process.

    The dataset facilitates evaluation of quality adjustment methods with scanner data.
  limitations: |
    APA style:
    TBC

    Chicago style
    TBC

    Bibtex citation
    Click here for raw form
domain: Consumer Price Statistics
schema:
- name: Dataset
  description: |
    The original dataset was modified for publication and its variable names were obfuscated. prodid_num was a unique identifier for each product. However, several variables were dropped when the synthetic dataset was published, so some rows share the same prodid_num and the same quality characteristics but have different quantities or values. Aggregating such rows (for example with a group-by) before using the data is recommended.
  businessName: ''
  properties:
  - name: month_num
    description: |
      Reporting month (month 1 to 26 of the series, starting from January 2017), given as an ISO 8601 date (YYYY-MM-DD) in which the day is always the first of the month.
      Example: '2017-01-01'
    businessName: Reporting month
    logicalType: object
    examples:
    - '2017-01-01'
  - name: prodid_num
    description: |
      Distinct product identifier number assigned during the modification process to simulate a product ID (such as a GTIN).

      There are a total of 459 unique products in the dataset.

      Note on data quality: the same prodid_num (with identical quality characteristics) can appear more than once within a month. This is because other variables that separated those products were omitted during the data modification process, and product IDs were assigned at the end. Aggregating (group by) the data before use is recommended.
    businessName: Product identifier
    logicalType: number
    examples:
    - '3'
  - name: char1
    description: |-
      Modified quality characteristic 1 (obfuscated to hide true value).
      Example: '10.6'
    businessName: Quality characteristic 1
    logicalType: number
    examples:
    - '10.6'
  - name: char2
    description: |-
      Modified quality characteristic 2 (obfuscated to hide true value).
      Example: '16006'
    businessName: Quality characteristic 2
    logicalType: number
    examples:
    - '16006'
  - name: char3
    description: |-
      Modified quality characteristic 3 (obfuscated to hide true value).
      Example: 'val_w'
    businessName: Quality characteristic 3
    logicalType: object
    examples:
    - val_w
  - name: char4
    description: |-
      Modified quality characteristic 4 (obfuscated to hide true value).
      Example: 'val_a'
    businessName: Quality characteristic 4
    logicalType: object
    examples:
    - val_a
  - name: char5
    description: |-
      Modified quality characteristic 5 (obfuscated to hide true value).
      Example: 'val_a'
    businessName: Quality characteristic 5
    logicalType: object
    examples:
    - val_a
  - name: char6
    description: |-
      Modified quality characteristic 6 (obfuscated to hide true value).
      Example: 'PRG566'
    businessName: Quality characteristic 6
    logicalType: object
    examples:
    - PRG566
  - name: char7
    description: |-
      Modified quality characteristic 7 (obfuscated to hide true value).
      Example: 'CCC'
    businessName: Quality characteristic 7
    logicalType: object
    examples:
    - CCC
  - name: char8
    description: |-
      Modified quality characteristic 8 (obfuscated to hide true value).
      Example: '150D'
    businessName: Quality characteristic 8
    logicalType: object
    examples:
    - 150D
  - name: char9
    description: |-
      Modified quality characteristic 9 (obfuscated to hide true value).
      Example: 'B230'
    businessName: Quality characteristic 9
    logicalType: object
    examples:
    - B230
  - name: char10
    description: |-
      Modified quality characteristic 10 (obfuscated to hide true value).
      Example: 'ted'
    businessName: Quality characteristic 10
    logicalType: object
    examples:
    - ted
  - name: char11
    description: |-
      Modified quality characteristic 11 (obfuscated to hide true value). Represents the brand of the product.
      Example: 'brand_a'
    businessName: Brand
    logicalType: object
    examples:
    - brand_a
  - name: quantity
    description: |-
      Quantity of products sold during the period (month).
      Example: '280'
    businessName: Quantity
    logicalType: number
    examples:
    - '280'
  - name: value
    description: |-
      Total sales value of products sold during the period (month), in New Zealand dollars (NZD); also referred to as turnover.
      Example: '196420'
    businessName: Total sales
    logicalType: number
    examples:
    - '196420'