BUY-2880-logging-plan

Plan: Centralized Logging Aggregation for All Microservices

BUY-2880 - Implement centralized logging aggregation for all microservices

In Progress

BuyWhere has multiple microservices (API, scrapers, MCP, background jobs) that currently log in different formats, which makes it difficult to aggregate and query logs across services.

docs/logging-schema.md - Standardized logging schema specification
app/logging_centralized.py - Centralized logger for API service
app/request_logging.py - Request logging middleware with structured output
scrapers/scraper_logging.py & base_scraper.py - Scraper-specific structured logging
docker-compose.prod.yml - Loki + Fluent Bit + Grafana stack
k8s/production/ & k8s/staging/ - Fluent Bit and Loki Kubernetes configurations
grafana/provisioning/dashboards/loki-logs.json - Loki logs dashboard
k8s/production/loki-alerts-configmap.yaml - Loki alerting rules

Inconsistent log formats: Scrapers use platform field; API uses service field
Missing service labels: Docker Compose scraper services lack labels for Fluent Bit filtering
No unified job label: Loki queries reference job="buywhere-api" but scrapers use job="scraper-fleet"
Missing log level standardization: Log levels not consistently applied
Grafana dashboard limited: Current dashboard only shows API logs, not scraper fleet logs

1.1 Update scrapers/base_scraper.py to use centralized logging

Import get_logger from app.logging_centralized
Replace StructuredLogger with centralized logger
Map scraper-specific fields to schema fields:
- platform → service
- error_type → include in metadata
Add scraper_name as service identifier
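
The field mapping in 1.1 could be sketched roughly as follows. This is a minimal illustration, not the contents of base_scraper.py: the helper name map_scraper_fields is hypothetical, and the precedence rule (scraper_name wins over platform when both are present) is an assumption, since the plan does not say how the two should be reconciled.

```python
from typing import Any, Dict


def map_scraper_fields(record: Dict[str, Any]) -> Dict[str, Any]:
    """Return a copy of a scraper log record using the shared schema field names.

    Hypothetical helper for illustration only.
    """
    mapped = dict(record)

    # platform -> service; scraper_name is used as the service identifier.
    # Assumption: scraper_name takes precedence when both are present.
    platform = mapped.pop("platform", None)
    scraper_name = mapped.pop("scraper_name", None)
    service = scraper_name or platform
    if service is not None:
        mapped["service"] = service

    # error_type is not a top-level schema field here; move it into metadata.
    error_type = mapped.pop("error_type", None)
    if error_type is not None:
        metadata = dict(mapped.get("metadata") or {})
        metadata["error_type"] = error_type
        mapped["metadata"] = metadata

    return mapped
```

Returning a copy rather than mutating the input keeps the mapping safe to apply inside a logging wrapper that may reuse the original record.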

1.2 Update app/logging_centralized.py

The log_scraper_progress function already exists; ensure it is properly integrated with the scraper logging path.

2.1 Update docker-compose.yml scraper services

Add logging labels to all scraper services:

logging:
  driver: json-file
  options:
    max-size: "50m"
    max-file: "5"
    labels: "service,environment"

2.2 Add labels to all service definitions
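
One way to check steps 2.1 and 2.2 is a small script that lists services still missing the labels. The sketch below assumes the Compose file has already been parsed into a Python dict (for example with a YAML library) and that the required label keys are service and environment, as in the logging options above. The function names are hypothetical.

```python
from typing import Any, Dict, List, Set

# Label keys taken from the logging options in 2.1.
REQUIRED_LABELS = ("service", "environment")


def _label_keys(labels: Any) -> Set[str]:
    """Compose allows labels as a mapping or as a list of "key=value" strings."""
    if isinstance(labels, dict):
        return set(labels)
    if isinstance(labels, list):
        return {str(item).split("=", 1)[0] for item in labels}
    return set()


def services_missing_log_labels(compose: Dict[str, Any]) -> List[str]:
    """Return names of services that do not both define and export the required labels."""
    missing = []
    for name, svc in (compose.get("services") or {}).items():
        svc = svc or {}
        defined = _label_keys(svc.get("labels"))
        options = (svc.get("logging") or {}).get("options") or {}
        # The json-file driver takes a comma-separated list of label keys.
        exported = {
            key.strip()
            for key in str(options.get("labels", "")).split(",")
            if key.strip()
        }
        if not all(k in defined and k in exported for k in REQUIRED_LABELS):
            missing.append(name)
    return sorted(missing)
```

A service is reported when a required key is absent either from its labels or from the logging driver's labels option, since both are needed for the label to appear in the log record.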

3.1 Update parsers.conf for unified parsing

3.2 Add service label extraction

4.1 Update Loki schema

4.2 Verify retention policies

5.1 Update loki-logs.json dashboard

6.1 Test log flow
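
Part of the log flow test could be a check that each emitted line is valid JSON and carries the schema's required fields. The sketch below is illustrative only: the field list is an assumption (the authoritative list belongs in docs/logging-schema.md), and missing_fields is a hypothetical helper.

```python
import json
from typing import List

# Assumption: placeholder field names; replace with those in docs/logging-schema.md.
REQUIRED_FIELDS = ("timestamp", "level", "service", "message")


def missing_fields(line: str) -> List[str]:
    """Return the required fields absent from one JSON log line.

    A line that is not a JSON object is treated as missing every field.
    """
    try:
        record = json.loads(line)
    except json.JSONDecodeError:
        return list(REQUIRED_FIELDS)
    if not isinstance(record, dict):
        return list(REQUIRED_FIELDS)
    return [field for field in REQUIRED_FIELDS if field not in record]
```

Running this over a sample of lines from each service before they reach Fluent Bit would show which services still emit the old format.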

6.2 Verify alerting

scrapers/base_scraper.py - Use centralized logging
scrapers/scraper_logging.py - Keep for backwards compatibility, but delegate to the centralized logger
docker-compose.yml - Add logging labels to all services
docker-compose.prod.yml - Ensure all services have proper labels
k8s/production/fluent-bit-configmap.yaml - Update parsers for unified format
k8s/staging/fluent-bit-configmap.yaml - Same updates
grafana/provisioning/dashboards/loki-logs.json - Add multi-service support