Dataset

dominicks-scanner-dataset

Info

Information about the dataset

Title
Dominick's Finer Foods scanner dataset
Version
0.0.1
Description
Nine years of transaction (scanner) data on products in 26 categories sold in the 1990s at 100 stores of Dominick's Finer Foods, a now-defunct Chicago retailer.
Owner
Chicago Booth School of Business

Terms

Terms and conditions for using the dataset

Usage
Dataset can be used for research and training purposes
Limitations
Dataset must not be republished
Billing
None

Data Model

The logical data model

movement table
The movement (or turnover) data. The schema is identical across categories, but the data is provided separately for each of the 26 categories.
STORE
int
Store number (references store data on https://github.com/eurostat/dff/blob/master/CSV/stores.csv)
Example: 76
primary primaryKey:True
UPC
bigint
Universal Product Code for sold product
Example: 1192603016
primary primaryKey:True
WEEK
int
Week number representing the sales (references weeks data on https://github.com/eurostat/dff/blob/master/CSV/weeks.csv)
Example: 306
primary primaryKey:True
MOVE
int
Number of units sold
Example: 1
primaryKey:False
QTY
int
Number of items bundled together at PRICE (e.g. 3 for a '3 for $2' offer)
Example: 1
primaryKey:False
PRICE
float
Price a product (i.e. the UPC) was sold for
Example: 2.99
primaryKey:False
SALES
float
Derived variable for sales per UPC (SALES = PRICE x MOVE / QTY)
Example: 2.99
primaryKey:False
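As a rough illustration of the SALES formula above, the sketch below recomputes SALES from PRICE, MOVE and QTY. The function name `derived_sales` is hypothetical. The first call uses the example values listed for the fields; the second call assumes QTY is the number of items in a multi-item price offer (a made-up "3 for 2.00" case, not taken from the data).

```python
def derived_sales(price: float, move: int, qty: int) -> float:
    """Recompute SALES = PRICE x MOVE / QTY for one movement row."""
    if qty <= 0:
        # A zero or negative QTY would make the formula undefined or meaningless.
        raise ValueError("QTY must be a positive integer")
    return price * move / qty


# Field example values from the movement table: PRICE 2.99, MOVE 1, QTY 1.
single_item = derived_sales(price=2.99, move=1, qty=1)

# Hypothetical bundle (assumption): 6 units sold under a "3 for 2.00" offer.
bundle = derived_sales(price=2.00, move=6, qty=3)

print(single_item, bundle)
```

Rows where QTY is missing or zero would need to be filtered out or flagged before applying the formula.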
upc table
List of unique products (identified by UPC) sold in each category
UPC
bigint
Universal Product Code for sold product
Example: 1192603016
primary primaryKey:True
DESCRIP
text
Description of the product (i.e. the UPC)
Example: CAFFEDRINE CAPLETS 1
primaryKey:False
NITEM
bigint
Item code (attempt at tracking products across multiple UPCs)
Example: 7342431
primaryKey:False
COM_CODE
int
Commodity code (Dominick's version of categories). A finer level of granularity than the category files, so one file may contain more than one COM_CODE
Example: 953
primaryKey:False
SIZE
text
Size of the product
Example: 16 CT
primaryKey:False
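Since the movement and upc tables share the UPC field, product attributes can be attached to sales rows by a lookup on UPC. The following is a minimal sketch under that assumption. `attach_product_info` is a hypothetical helper, and the sample rows are assembled from the per-field examples above rather than copied from a real record.

```python
from typing import Any

Row = dict[str, Any]

# Sample rows built from the per-field examples (illustrative only).
movement_rows: list[Row] = [
    {"STORE": 76, "UPC": 1192603016, "WEEK": 306,
     "MOVE": 1, "QTY": 1, "PRICE": 2.99, "SALES": 2.99},
]
upc_rows: list[Row] = [
    {"UPC": 1192603016, "DESCRIP": "CAFFEDRINE CAPLETS 1",
     "NITEM": 7342431, "COM_CODE": 953, "SIZE": "16 CT"},
]


def attach_product_info(movement: list[Row], upc: list[Row]) -> list[Row]:
    """Left-join movement rows to upc rows on the UPC field."""
    by_upc = {row["UPC"]: row for row in upc}
    joined = []
    for sale in movement:
        # Movement rows with no matching UPC keep None for product attributes.
        product = by_upc.get(sale["UPC"], {})
        joined.append({
            **sale,
            "DESCRIP": product.get("DESCRIP"),
            "COM_CODE": product.get("COM_CODE"),
            "SIZE": product.get("SIZE"),
        })
    return joined


for row in attach_product_info(movement_rows, upc_rows):
    print(row["STORE"], row["WEEK"], row["DESCRIP"], row["SIZE"], row["SALES"])
```

Because both tables are supplied per category, the lookup would normally be built from the upc file of the same category as the movement file being processed.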
Created at 16 Apr 2025 04:50:07 UTC with Data Contract CLI v0.10.18
dataContractSpecification: 1.1.0
id: dominicks-scanner-dataset
info:
  title: Dominick's Finer Foods scanner dataset
  version: 0.0.1
  description: Nine years of transaction (scanner) data on products in 26 categories
    sold in the 1990s at 100 stores of Dominick's Finer Foods, a now-defunct Chicago
    retailer.
  owner: Chicago Booth School of Business
  contact:
    name: Chicago Booth School of Business
    url: https://www.chicagobooth.edu/research/kilts/research-data/dominicks
terms:
  usage: Dataset can be used for research and training purposes
  limitations: Dataset must not be republished
  billing: None
models:
  movement:
    description: The movement (or turnover) data. The schema is identical across categories,
      but the data is provided separately for each of the 26 categories.
    type: table
    fields:
      STORE:
        type: int
        required: false
        primary: true
        unique: false
        description: Store number (references store data on https://github.com/eurostat/dff/blob/master/CSV/stores.csv)
        example: '76'
        primaryKey: true
      UPC:
        type: bigint
        required: false
        primary: true
        unique: false
        description: Universal Product Code for sold product
        example: '1192603016'
        primaryKey: true
      WEEK:
        type: int
        required: false
        primary: true
        unique: false
        description: Week number representing the sales (references weeks data on
          https://github.com/eurostat/dff/blob/master/CSV/weeks.csv)
        example: '306'
        primaryKey: true
      MOVE:
        type: int
        required: false
        unique: false
        description: Number of units sold
        example: '1'
        primaryKey: false
      QTY:
        type: int
        required: false
        unique: false
        description: Number of items bundled together at PRICE (e.g. 3 for a '3 for $2' offer)
        example: '1'
        primaryKey: false
      PRICE:
        type: float
        required: false
        unique: false
        description: Price a product (i.e. the UPC) was sold for
        example: '2.99'
        primaryKey: false
      SALES:
        type: float
        required: false
        unique: false
        description: Derived variable for sales per UPC (SALES = PRICE x MOVE / QTY)
        example: '2.99'
        primaryKey: false
  upc:
    description: List of unique products (identified by UPC) sold in each category
    type: table
    fields:
      UPC:
        type: bigint
        required: false
        primary: true
        unique: false
        description: Universal Product Code for sold product
        example: '1192603016'
        primaryKey: true
      DESCRIP:
        type: text
        required: false
        unique: false
        description: Description of the product (i.e. the UPC)
        example: CAFFEDRINE CAPLETS 1
        primaryKey: false
      NITEM:
        type: bigint
        required: false
        unique: false
        description: Item code (attempt at tracking products across multiple UPCs)
        example: '7342431'
        primaryKey: false
      COM_CODE:
        type: int
        required: false
        unique: false
        description: Commodity code (Dominick's version of categories). A finer level of
          granularity than the category files, so one file may contain more than one COM_CODE
        example: '953'
        primaryKey: false
      SIZE:
        type: text
        required: false
        unique: false
        description: Size of the product
        example: 16 CT
        primaryKey: false