Dataset
nz-electronics-modified
Open Data Contract Standard v3.1.0
scanner
electronics-and-applicances
Fundamentals
Basic information about the data contract
- Name
- New Zealand Electronics Modified Dataset
- Version
- 1.0
- Status
- Draft
Target documentation level: 2
- Tenant
- StatisticsNZ
- Purpose
- A modified dataset based on real scanner data obtained from GfK. The dataset contains sales (both quantity and total sales) by month for 459 unique products in a single consumer electronics category over a total of 26 months (2017-01-01 to 2019-02-01). The dataset includes 11 different quality characteristics (all are obfuscated to hide their true values). The distribution of the quality characteristics and their relationships to quality adjustment were maintained during the modification process.
The dataset facilitates evaluation of quality adjustment methods with scanner data.
- Usage
- Open dataset, shared under "GNU General Public License" by StatisticsNZ.
Links:
- CSV version of the dataset: See FEWS package (on the UN Global Platform GitLab)
- Rdata version of the dataset: See StatisticsNZ Multilateral package
- How to cite
- APA style:
TBC
Chicago style
TBC
Bibtex citation
Entity Relationship Diagram
Visual representation of data model relationships
erDiagram
"**Dataset**" {
month_num object
prodid_num number
char1 number
char2 number
char3 object
char4 object
char5 object
char6 object
char7 object
char8 object
char9 object
char10 object
char11 object
quantity number
value number
}
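As a rough illustration of the field types in the diagram above, the sketch below converts one raw CSV row (all strings) into typed values. It assumes that "number" can be read as a float and that "object" can be kept as a string; the contract does not state more precise types. The helper `parse_row` and the sample row are hypothetical: the sample values are the per-field examples from the schema, combined with a made-up `prodid_num`, and do not represent an actual row of the dataset.

```python
# Assumption: "number" fields are parsed as float, "object" fields stay as str.
NUMBER_FIELDS = ("prodid_num", "char1", "char2", "quantity", "value")
OBJECT_FIELDS = ("month_num", "char3", "char4", "char5", "char6",
                 "char7", "char8", "char9", "char10", "char11")


def parse_row(raw: dict) -> dict:
    """Convert one CSV row (all values are strings) to typed values."""
    row = {}
    for name in OBJECT_FIELDS:
        row[name] = raw[name]
    for name in NUMBER_FIELDS:
        row[name] = float(raw[name])
    return row


# Illustrative row only; prodid_num "1" is invented for this example.
SAMPLE_RAW = {
    "month_num": "2017-01-01", "prodid_num": "1",
    "char1": "10.6", "char2": "16006", "char3": "val_w", "char4": "val_a",
    "char5": "val_a", "char6": "PRG566", "char7": "CCC", "char8": "150D",
    "char9": "B230", "char10": "ted", "char11": "brand_a",
    "quantity": "280", "value": "196420",
}

typed_row = parse_row(SAMPLE_RAW)
```

A row read with `csv.DictReader` has the same shape as `SAMPLE_RAW`, so the helper could be applied to each row of the CSV version in turn.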
Schema
The data schema and structure
|
Dataset
None
The original dataset was modified for publication and its variable names were obfuscated. prodid_num is a unique identifier for each unique product. However, several variables were dropped when the synthetic dataset was published, so some rows share the same prodid_num and the same quality characteristics but have different quantities or values. It is recommended to aggregate the data (groupby) before use.
| Property | Business Name | Type | Required | Description |
|---|---|---|---|---|
| month_num | Reporting month | object | No | Sales month (one of 26 months, starting from January 2017), given as an ISO 8601 date (YYYY-MM-DD) in which the day is always the first of the month. Example: '2017-01-01' |
| prodid_num | Product identifier | number | No | Distinct product identifier number assigned during the modification process to simulate a product ID (such as a GTIN). There are a total of 459 unique products in the dataset. Note on data quality: at times the same prodid_num (and all other quality characteristics) repeats within a month. This is because other variables that separated those products were omitted during the data modification process, and product IDs were assigned at the end. It is recommended to aggregate the data (groupby) before use. |
| char1 | Quality characteristic 1 | number | No | Modified quality characteristic 1 (obfuscated to hide true value). Example: '10.6' |
| char2 | Quality characteristic 2 | number | No | Modified quality characteristic 2 (obfuscated to hide true value). Example: '16006' |
| char3 | Quality characteristic 3 | object | No | Modified quality characteristic 3 (obfuscated to hide true value). Example: 'val_w' |
| char4 | Quality characteristic 4 | object | No | Modified quality characteristic 4 (obfuscated to hide true value). Example: 'val_a' |
| char5 | Quality characteristic 5 | object | No | Modified quality characteristic 5 (obfuscated to hide true value). Example: 'val_a' |
| char6 | Quality characteristic 6 | object | No | Modified quality characteristic 6 (obfuscated to hide true value). Example: 'PRG566' |
| char7 | Quality characteristic 7 | object | No | Modified quality characteristic 7 (obfuscated to hide true value). Example: 'CCC' |
| char8 | Quality characteristic 8 | object | No | Modified quality characteristic 8 (obfuscated to hide true value). Example: '150D' |
| char9 | Quality characteristic 9 | object | No | Modified quality characteristic 9 (obfuscated to hide true value). Example: 'B230' |
| char10 | Quality characteristic 10 | object | No | Modified quality characteristic 10 (obfuscated to hide true value). Example: 'ted' |
| char11 | Brand | object | No | Modified quality characteristic 11 (obfuscated to hide true value). Represents the brand of the product. Example: 'brand_a' |
| quantity | Quantity | number | No | Quantity of products sold during the period (month). Example: '280' |
| value | Total sales | number | No | Total sales of products sold during the period (month) in New Zealand dollars (NZD); also referred to as turnover. Example: '196420' |
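The groupby recommendation in the schema notes could be followed along these lines. This is a minimal sketch, not the publisher's method: it assumes that duplicate rows should be combined by summing quantity and value for each (month_num, prodid_num) pair, which the contract implies but does not spell out. The function name `aggregate` and the sample rows are hypothetical and do not come from the dataset.

```python
from collections import defaultdict


def aggregate(rows):
    """Sum quantity and value over rows sharing (month_num, prodid_num)."""
    totals = defaultdict(lambda: {"quantity": 0.0, "value": 0.0})
    for row in rows:
        # Assumption: month and product ID together identify a product-month.
        key = (row["month_num"], row["prodid_num"])
        totals[key]["quantity"] += row["quantity"]
        totals[key]["value"] += row["value"]
    return dict(totals)


# Made-up rows: the first two repeat the same product in the same month.
SAMPLE_ROWS = [
    {"month_num": "2017-01-01", "prodid_num": 1, "quantity": 280, "value": 196420},
    {"month_num": "2017-01-01", "prodid_num": 1, "quantity": 20, "value": 13580},
    {"month_num": "2017-02-01", "prodid_num": 1, "quantity": 100, "value": 70000},
]

monthly_totals = aggregate(SAMPLE_ROWS)
```

Because repeated rows also share all quality characteristics, the characteristics of any one of the duplicate rows can be carried over to the aggregated record unchanged.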
Created at 24 Feb 2026 05:02:51 UTC with Data Contract CLI v0.11.5