Anti-Bot Evasion Scraper Inventory

Issue: BUY-3400
Status: Active
Last Updated: 2026-04-19
Owner: Parse (Scraping Engineer)


Overview

This document tracks every scraper that uses an anti-bot evasion technique. Each of these scrapers depends on an external service (ScraperAPI) or a third-party library (Playwright, cloudscraper) to get past the anti-bot protection deployed by e-commerce platforms.


Evasion Methods

| Method | Description | External Dependency |
| --- | --- | --- |
| ScraperAPI | Proxy service with built-in anti-bot bypass and JavaScript rendering | ScraperAPI account + API key |
| Playwright | Browser automation for JavaScript-rendered content and stealth browsing | Playwright + browser binaries |
| cloudscraper | Cloudflare bypass library | cloudscraper library |

Scrapers with Anti-Bot Evasion

ScraperAPI-based Scrapers

| Platform | Module | Evasion Method | Notes |
| --- | --- | --- | --- |
| Amazon SG | scrapers.amazon_sg_scraperapi | ScraperAPI render=true | Bypasses Amazon anti-bot |
| Shopee SG | scrapers.shopee_sg_scraperapi | ScraperAPI render=true | Bypasses Shopee anti-bot |
| Lazada SG | scrapers.lazada_sg_scraperapi | ScraperAPI | Bypasses Lazada anti-bot |
| RedMart SG | scrapers.redmart_sg_scraperapi | ScraperAPI | Bypasses Lazada anti-bot (RedMart runs on Lazada) |
| Watsons SG | scrapers.watsons_sg_scraperapi | ScraperAPI | Bypasses Watsons anti-bot |
| Carousell SG | scrapers.carousell_sg_scraperapi | ScraperAPI | Bypasses Carousell anti-bot |
| Tokopedia ID | scrapers.tokopedia_id | ScraperAPI | Bypasses Tokopedia anti-bot |
| Lazada TH | scrapers.lazada_th_scraperapi | ScraperAPI | Bypasses Lazada TH anti-bot |
| Lazada VN | scrapers.lazada_vn_scraperapi | ScraperAPI | Bypasses Lazada VN anti-bot |
| Shopee TH | scrapers.shopee_th | ScraperAPI | Bypasses Shopee TH anti-bot |
| Giant SG | scrapers.giant_sg_proxy | ScraperAPI | Bypasses Giant anti-bot |
| Nordstrom US | scrapers.nordstrom_us_scraperapi | ScraperAPI | Bypasses Nordstrom anti-bot |
| Macy's US | scrapers.macys_us_scraperapi | ScraperAPI | Bypasses Macy's anti-bot |
| Sephora US | scrapers.sephora_us_scraperapi | ScraperAPI + premium proxies | Bypasses Sephora anti-bot |
| Gap US | scrapers.gap_us_scraperapi | ScraperAPI | Bypasses Gap anti-bot |
| CVS US | scrapers.cvs_us_scraperapi | ScraperAPI | Bypasses CVS anti-bot |
| Kohl's US | scrapers.kohls_us | ScraperAPI | Bypasses Kohl's anti-bot |
| Target US | scrapers.target_us_scraperapi | ScraperAPI | Bypasses Target anti-bot |
| Yahoo Shopping JP | scrapers.yahoo_shopping_jp | ScraperAPI | Bypasses Yahoo Shopping JP anti-bot |
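
For reference, a minimal sketch of how the "render=true" and "premium proxies" variants in the table map onto a ScraperAPI request. The endpoint and the `api_key`, `url`, `render` and `premium` query parameters follow ScraperAPI's public documentation; the helper name `build_scraperapi_url` is hypothetical, and the scraper modules listed above may assemble their requests differently.

```python
from urllib.parse import urlencode

SCRAPERAPI_ENDPOINT = "https://api.scraperapi.com/"


def build_scraperapi_url(api_key, target_url, render=False, premium=False):
    """Return the ScraperAPI request URL that proxies target_url.

    Hypothetical helper: the name and signature are illustrative only.
    """
    params = {"api_key": api_key, "url": target_url}
    if render:
        # Ask ScraperAPI to execute JavaScript before returning the page.
        params["render"] = "true"
    if premium:
        # Route the request through ScraperAPI's premium proxy pool.
        params["premium"] = "true"
    # urlencode percent-escapes the target URL so its own query string survives.
    return SCRAPERAPI_ENDPOINT + "?" + urlencode(params)
```

The returned URL can be passed to any HTTP client. Keeping URL construction separate from the network call makes the per-scraper flags easy to audit against this inventory.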

Playwright-based Scrapers

| Platform | Module | Notes |
| --- | --- | --- |
| Lazada SG | scrapers.lazada_sg | Browser automation for JavaScript rendering |
| Watsons SG Playwright | scrapers.watsons_sg_playwright | Handles Watsons JS-rendered content |
| Watsons SG Hybrid | scrapers.watsons_sg_hybrid | Playwright + API hybrid approach |
| Castlery SG | scrapers.castlery_sg_playwright | Bypasses anti-bot via browser |
| Carousell SG | scrapers.carousell_sg_home_appliances | Enhanced anti-bot handling |
| Harvey Norman SG | scrapers.harvey_norman_sg_v2 | Optional ScraperAPI fallback |
| Harvey Norman SG Full | scrapers.harvey_norman_sg_full | Optional ScraperAPI fallback |
| iPrice SG | scrapers.iprice_sg | JavaScript rendering for anti-bot |
| Temu SG | scrapers.temu_sg | Playwright-based bypass |
| Naiise SG | scrapers.naiise_sg | Browser automation |
| Wayfair US | scrapers.wayfair_us | PerimeterX anti-bot bypass |
| Ulta US | scrapers.ulta_us | JavaScript rendering + anti-bot |
| Walgreens US | scrapers.walgreens_us | JavaScript rendering + anti-bot |
| Walgreens US Playwright | scrapers.walgreens_us_playwright | Enhanced stealth browsing |
| Home Depot US | scrapers.homedepot_us_playwright | Anti-bot with stealth config |
| Home Depot US Undetected | scrapers.homedepot_us_undetected | Undetected-chromedriver |
| Best Buy US | scrapers.bestbuy_us_playwright | Browser automation |
| Best Buy US Sitemap | scrapers.bestbuy_us_sitemap | Playwright for sitemap |
| B&H Photo US | scrapers.bhphoto_us_playwright | Browser automation |
| REI US | scrapers.rei_us | JavaScript rendering |
| Costco US V2 | scrapers.costco_us_v2 | Queue-it anti-bot handling |
| Ulta US Sitemap | scrapers.ulta_us_sitemap | Bypasses waiting room anti-bot |
| Ulta US Sitemap Fixed | scrapers.ulta_us_sitemap_fixed | Enhanced anti-bot bypass |
| Ulta US Undetected | scrapers.ulta_us_undetected | Full anti-bot bypass |
| Chewy US Playwright | scrapers.chewy_us_playwright | Kasada (KPSDK) anti-bot |
| Chewy US Undetected | scrapers.chewy_us_undetected | Kasada anti-bot bypass |
| eBay US Playwright | scrapers.ebay_us_playwright | JavaScript + anti-bot |
| Tokopedia ID Playwright | scrapers.tokopedia_id_playwright | Bypasses anti-bot |
| Bukalapak ID Playwright | scrapers.bukalapak_id_playwright | Browser automation |
| Target US Playwright V2 | scrapers.target_us_playwright_v2 | Enhanced anti-bot bypass |
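
As a rough illustration of the "browser automation for JavaScript rendering" entries, the sketch below loads a page in headless Chromium through Playwright's synchronous API and returns the rendered HTML. The function name `fetch_rendered_html` and its defaults are assumptions; the modules above may add stealth settings, waits, or retries that are not shown here.

```python
def fetch_rendered_html(url, timeout_ms=30_000):
    """Load url in headless Chromium and return the HTML after scripts have run.

    Hypothetical helper, not taken from the scraper modules.
    """
    # Imported lazily so this file can be imported on hosts where
    # Playwright and its browser binaries are not installed.
    from playwright.sync_api import sync_playwright

    with sync_playwright() as p:
        browser = p.chromium.launch(headless=True)
        try:
            page = browser.new_page()
            # "networkidle" waits until network activity has settled,
            # which is a common proxy for "client-side rendering finished".
            page.goto(url, wait_until="networkidle", timeout=timeout_ms)
            return page.content()
        finally:
            # Always release the browser process, even if navigation fails.
            browser.close()
```

Running this requires the `playwright` package plus a one-time `playwright install chromium`, which is the browser-binary dependency noted in the Evasion Methods table.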

cloudscraper-based Scrapers

| Platform | Module | Notes |
| --- | --- | --- |
| eBay US | scrapers.ebay_us | Cloudflare bypass |
| Amazon US Sports | scrapers.amazon_us_sports | Cloudflare + anti-bot |
| Amazon US Books | scrapers.amazon_us_books | Cloudflare + anti-bot |
| Amazon US Toys | scrapers.amazon_us_toys | Cloudflare + anti-bot |
| Amazon US Health | scrapers.amazon_us_health | Cloudflare + anti-bot |
| Amazon SG Fashion | scrapers.amazon_sg_fashion | cloudscraper for Cloudflare |
| Amazon SG Grocery | scrapers.amazon_sg_grocery | cloudscraper for Cloudflare |
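
A minimal sketch of the cloudscraper pattern, assuming the library's documented `create_scraper()` entry point, which returns a `requests`-style session. The wrapper `fetch_via_cloudscraper` and its optional `scraper` argument are hypothetical; the argument exists only so that a stub session can be substituted in tests.

```python
def fetch_via_cloudscraper(url, scraper=None, timeout=30):
    """Fetch url through a cloudscraper session and return the response body.

    Hypothetical helper: pass scraper to reuse a session or inject a stub.
    """
    if scraper is None:
        # Imported lazily so the dependency is only needed for real requests.
        import cloudscraper

        scraper = cloudscraper.create_scraper()
    response = scraper.get(url, timeout=timeout)
    # Surface 4xx/5xx responses as exceptions instead of returning an error page.
    response.raise_for_status()
    return response.text
```

Reusing one session across requests to the same platform keeps cookies between calls, so the optional `scraper` argument is useful in production as well as in tests.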

Blocked Upstream (Requires API Credentials)

The following scrapers are blocked upstream because the platform requires official API access:

| Platform | Module | Blocker Issue |
| --- | --- | --- |
| Shopee SG | scrapers.shopee_sg | BUY-480 - Requires Shopee Open Platform API |
| Lazada SG | scrapers.lazada_sg | BUY-480 - Requires Lazada Open Platform API |
| Lazada SG eng08 | scrapers.lazada_sg_eng08 | BUY-480 - Same Lazada geo-block issue |

Evasion Dependency Status

Critical (Required for scraping)

These platforms cannot currently be scraped without anti-bot evasion:

  • Amazon (all regions)
  • Lazada (all regions)
  • Shopee (all regions)
  • Carousell
  • Watsons
  • Tokopedia
  • Bukalapak
  • Nordstrom
  • Macy's
  • Target
  • CVS
  • Walgreens
  • Costco
  • Chewy
  • Ulta

High Risk (High dependency on external service)

  • All ScraperAPI-dependent scrapers (service availability risk)
  • All Playwright-dependent scrapers (browser binary / memory risk)
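
One way to reduce the single-service risk listed above is an ordered fallback, similar in spirit to the "Optional ScraperAPI fallback" noted for the Harvey Norman scrapers. The sketch below is an assumption about how such a chain could look, not a description of the existing modules; `fetch_with_fallback` and `AllFetchersFailed` are hypothetical names.

```python
from typing import Callable, Sequence, Tuple


class AllFetchersFailed(RuntimeError):
    """Raised when every fetcher in the chain has failed."""


def fetch_with_fallback(
    url: str,
    fetchers: Sequence[Tuple[str, Callable[[str], str]]],
) -> Tuple[str, str]:
    """Try each (name, fetch) pair in order; return (name, html) of the first success."""
    errors = []
    for name, fetch in fetchers:
        try:
            return name, fetch(url)
        except Exception as exc:  # any failure moves on to the next method
            errors.append(f"{name}: {exc}")
    # Report every failure so operators can see which services were down.
    raise AllFetchersFailed("; ".join(errors))
```

Returning the name of the method that succeeded lets the scheduler log how often the fallback path is taken, which is an early signal that the primary service is degrading.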

Configuration

In scripts/scraper_scheduler.py, each scraper entry that uses evasion sets the field:

"uses_evasion": True  # Indicates anti-bot evasion is used

This allows operations to:

  1. Track which scrapers depend on external evasion services
  2. Plan for redundancy if evasion services fail
  3. Monitor costs associated with ScraperAPI usage
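
The three uses above can be sketched as a small report over the scheduler configuration. Only the `uses_evasion` field comes from this document; the dictionary layout, the `evasion_method` key, the `scrapers.example_plain` entry and both helper functions are assumptions made for illustration.

```python
from collections import Counter

# Hypothetical excerpt of the scheduler config; the real structure in
# scripts/scraper_scheduler.py may differ.
SCRAPERS = {
    "amazon_sg_scraperapi": {
        "module": "scrapers.amazon_sg_scraperapi",
        "uses_evasion": True,
        "evasion_method": "scraperapi",  # assumed key, not in the source
    },
    "shopee_sg_scraperapi": {
        "module": "scrapers.shopee_sg_scraperapi",
        "uses_evasion": True,
        "evasion_method": "scraperapi",
    },
    "watsons_sg_playwright": {
        "module": "scrapers.watsons_sg_playwright",
        "uses_evasion": True,
        "evasion_method": "playwright",
    },
    "ebay_us": {
        "module": "scrapers.ebay_us",
        "uses_evasion": True,
        "evasion_method": "cloudscraper",
    },
    # Made-up entry without the flag, to show the default behaviour.
    "example_plain": {"module": "scrapers.example_plain"},
}


def evasion_scrapers(config):
    """Return the sorted names of scrapers flagged with uses_evasion."""
    return sorted(
        name for name, entry in config.items() if entry.get("uses_evasion", False)
    )


def count_by_method(config):
    """Count flagged scrapers per evasion method (for redundancy and cost planning)."""
    return Counter(
        entry.get("evasion_method", "unknown")
        for entry in config.values()
        if entry.get("uses_evasion", False)
    )
```

With a per-method count in hand, the ScraperAPI figure gives a first estimate of how many scrapers contribute to ScraperAPI spend, and the totals show how many scrapers stop working if one method becomes unavailable.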

Related Issues

  • BUY-480: Platform API credentials required for Shopee/Lazada
  • BUY-3400: Anti-bot evasion inventory (this document)

Last Review

  • 2026-04-19: Initial inventory created by Parse