← Back to documentation

catalog-quality-audit

US Catalog Quality Audit

Date: 2026-04-18 Issue: BUY-3140 Status: Complete

Summary

SourceTotalMissing ImageMissing PriceMissing TitleFlagged
amazon_us1,3820 (0.0%)104 (7.5%)0 (0.0%)No
nike_us400 (0.0%)10 (25.0%)0 (0.0%)YES
zappos_us7120 (0.0%)0 (0.0%)0 (0.0%)No
(null source)1,7280 (0.0%)0 (0.0%)1,728 (100.0%)YES

Total USD products: 3,862

Flagged Sources (>10% data quality issues)

1. nike_us — 25% missing price

  • Issue: 10 of 40 products have NULL or zero price
  • Action required: Re-scrape Nike US feed to fill in missing prices, or flag for manual review

2. (null source) — 100% missing title

  • Issue: 1,728 products have no source attribution and no title
  • Action required: These are orphaned records with no source. Likely ingestion pipeline error. Investigate and purge or re-attribute before launch.

Clean Sources (Within Threshold)

  • amazon_us: 7.5% missing price — within 10% threshold, acceptable for launch
  • zappos_us: No data quality issues detected

Recommendation

  1. Block nike_us from US catalog launch until price coverage is remediated
  2. Investigate null source ingestion — 1,728 records with no source/title indicate a pipeline bug
  3. amazon_us and zappos_us are cleared for US catalog launch

Next Steps

  • Assign nike_us re-scrape to scraping team
  • Investigate null source pipeline bug (assign to data eng)
  • Re-run audit after remediation