Pdi Transaction Items
PDI Transaction Items
Overview
The PDI Transaction Items dataset is a line-item-level transaction table providing SKU-level point-of-sale (POS) data from independent convenience stores across the United States. [Source: Page 2, "Dataset description"] This dataset is maintained by PDI Technologies (formerly Skupos), which operates the largest panel of POS data from independent convenience stores in the United States, covering roughly 80% of the independent convenience-store market. [Source: Page 1, "Company Description"]
The dataset contains one row per item line within a transaction, capturing GTIN, POS-assigned description, unit price, unit quantity, discount amount applied at the line, taxable amount, tax rate, grand total, and scan versus non-scan capture method. [Source: Page 2, "Dataset description"] Non-scan items carry NACS category/subcategory/detail labels. [Source: Page 2, "Dataset description"] This transaction data provides insight into consumer behavior, economic trends, and other research topics. [Source: Page 1, "Partner description"]
The Transaction Items dataset is one of several relational tables in the PDI Convenience Store Transaction Dataset, sharing unique identifiers across stores, transactions, payments, items, discounts, and shoppers. [Source: Page 1, "Partner description"] Standard join keys include STORE_ID, TRANSACTION_SET_ID, TRANSACTION_ITEM_ID, PAYMENT_ID, and GTIN. [Source: Page 1, "Partner description"] The dataset joins to Master GTIN via GTIN for product metadata and joins to Transaction Sets via TRANSACTION_SET_ID for basket context; it is a child of the transaction set. [Source: Page 2, "Dataset description"]
Data Description
The observation level is one row per line item within a transaction, keyed by TRANSACTION_ITEM_ID. [Source: Page 2, "Observation Level"]
Each row captures the following elements:
- GTIN (Global Trade Item Number)
- POS-assigned description
- Unit price
- Unit quantity
- Discount amount applied at the line
- Taxable amount
- Tax rate
- Grand total
- Scan versus non-scan capture method
[Source: Page 2, "Dataset description"]
For non-scan items, the dataset includes NACS (National Association of Convenience Stores) category, subcategory, and detail labels. [Source: Page 2, "Dataset description"]
The dataset supports joins to other tables in the PDI Convenience Store Transaction Dataset using the following identifiers:
- STORE_ID
- TRANSACTION_SET_ID
- TRANSACTION_ITEM_ID
- PAYMENT_ID
- GTIN
[Source: Page 1, "Partner description"]
Coverage
Geographic Coverage: United States; tens of thousands of independent convenience stores covering roughly 80% of the independent convenience-store market. [Source: Page 2, "Coverage"]
Temporal Coverage: Continuous SKU-level POS coverage from 2023 onward; new transactions ingested incrementally as POS t-logs are delivered. [Source: Page 2, "Coverage"]
Methodology
The dataset is parsed from POS t-log files. [Source: Page 2, "Collection Methodology"] PDI captures every item-line entry as recorded by the POS, including scanned items, manually-entered items, and non-scan department keys. [Source: Page 2, "Collection Methodology"] NACS category/subcategory/detail tags are applied by PDI for non-scan rows. [Source: Page 2, "Collection Methodology"]
Additional Notes
This dataset joins to the rest of the PDI Convenience Store Transaction Dataset via the shared identifiers listed in the Partner description. [Source: Page 2, "Additional Notes"]
Timestamps are wall-clock at the store; researchers should combine with store ZIP/state information to localize timestamps. [Source: Page 2, "Additional Notes"]
POS vendor coverage includes Verifone, Gilbarco, Clover, and NCR; some fields are vendor-specific. [Source: Page 2, "Additional Notes"]
Additional documentation is available at https://help.skupos.com/. [Source: Page 2, "Documentation links"]
Updated 3 days ago