US Catalog Quality Audit
Date: 2026-04-18
Issue: BUY-3140
Status: Complete
Summary
| Source | Total | Missing Image | Missing Price | Missing Title | Flagged |
|---|---|---|---|---|---|
| amazon_us | 1,382 | 0 (0.0%) | 104 (7.5%) | 0 (0.0%) | No |
| nike_us | 40 | 0 (0.0%) | 10 (25.0%) | 0 (0.0%) | YES |
| zappos_us | 712 | 0 (0.0%) | 0 (0.0%) | 0 (0.0%) | No |
| (null source) | 1,728 | 0 (0.0%) | 0 (0.0%) | 1,728 (100.0%) | YES |
Total USD products: 3,862
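The percentages and the Flagged column above can be reproduced from the raw counts. The sketch below is illustrative only: it assumes the 10% threshold is applied to each field separately (the worst single field decides), which is one reading of the rule and gives the same result as the table for this data. The names `COUNTS`, `THRESHOLD`, and `is_flagged` are hypothetical and not taken from the audit tooling.

```python
# Counts copied from the summary table:
# source -> (total, missing_image, missing_price, missing_title)
COUNTS = {
    "amazon_us": (1382, 0, 104, 0),
    "nike_us": (40, 0, 10, 0),
    "zappos_us": (712, 0, 0, 0),
    "(null source)": (1728, 0, 0, 1728),
}

THRESHOLD = 0.10  # assumption: flag when any single field exceeds 10% missing


def is_flagged(total, *missing_counts, threshold=THRESHOLD):
    """Return True if any missing-field rate is strictly above the threshold."""
    if total == 0:
        return False  # nothing to measure for an empty source
    return any(count / total > threshold for count in missing_counts)


flags = {source: is_flagged(*row) for source, row in COUNTS.items()}
grand_total = sum(row[0] for row in COUNTS.values())

for source, (total, *missing) in COUNTS.items():
    rates = ", ".join(f"{count / total:.1%}" for count in missing)
    print(f"{source}: {rates} -> {'YES' if flags[source] else 'No'}")
print(f"Total USD products: {grand_total:,}")
```

Because the comparison is strict, a source sitting at exactly 10% would not be flagged under this sketch; if the intended rule is "10% or more", the `>` would need to become `>=`.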
Flagged Sources (more than 10% of products missing an image, price, or title)
1. nike_us — 25% missing price
- Issue: 10 of 40 products have a NULL or zero price
- Action required: Re-scrape the Nike US feed to fill in the missing prices, or flag the affected products for manual review
2. (null source) — 100% missing title
- Issue: 1,728 products have neither a source attribution nor a title
- Action required: These are orphaned records, likely the result of an ingestion pipeline error. Investigate, then purge or re-attribute them before launch.
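The checks behind these two findings can be sketched as a per-record pass. This is a minimal illustration, not the audit's actual implementation: the record shape and the field names `source`, `title`, `price`, and `image_url` are assumptions, and a price counts as missing when it is NULL (`None`) or zero, matching the definition used for nike_us.

```python
from collections import defaultdict


def audit(records):
    """Count missing fields per source for an iterable of product dicts."""
    stats = defaultdict(
        lambda: {"total": 0, "missing_image": 0, "missing_price": 0, "missing_title": 0}
    )
    for rec in records:
        # Records without a source are grouped under one orphan bucket.
        source = rec.get("source") or "(null source)"
        row = stats[source]
        row["total"] += 1
        if not rec.get("image_url"):
            row["missing_image"] += 1
        price = rec.get("price")
        if price is None or price == 0:  # NULL or zero price
            row["missing_price"] += 1
        if not (rec.get("title") or "").strip():
            row["missing_title"] += 1
    return dict(stats)


# Hypothetical sample records, for illustration only.
sample = [
    {"source": "nike_us", "title": "Shoe A", "price": 0, "image_url": "https://example.com/a.jpg"},
    {"source": "nike_us", "title": "Shoe B", "price": 89.99, "image_url": "https://example.com/b.jpg"},
    {"source": None, "title": None, "price": 12.5, "image_url": "https://example.com/c.jpg"},
]
result = audit(sample)
```

Running the same pass after remediation would give per-source counts that can be compared directly against the summary table.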
Clean Sources (Within Threshold)
- amazon_us: 7.5% missing price (104 of 1,382), within the 10% threshold and acceptable for launch
- zappos_us: No data quality issues detected
Recommendation
- Block nike_us from the US catalog launch until its price coverage is remediated
- Investigate null-source ingestion: 1,728 records with no source or title indicate a pipeline bug, and they should be purged or re-attributed before launch
- amazon_us and zappos_us are cleared for the US catalog launch
Next Steps
- Assign the nike_us re-scrape to the scraping team
- Assign the null-source pipeline investigation to data engineering
- Re-run the audit after remediation