🎉 MAJOR BREAKTHROUGH: Dollar General API Endpoint Discovered!

 Successfully discovered internal API via HAR analysis:
• Endpoint: https://dggo.dollargeneral.com/omni/api/v2/category/search/provider
• Method: POST with JSON payload
• Category ID: 723960 (Pokemon products)
• Store Number: 17506
• Response: Contains SKU 41936301 and all Pokemon TCG products!

🔬 HAR Analysis Tools Added:
• analyze_har.py - Extract API calls from HAR files
• extract_api_details.py - Detailed API request format extraction
• implement_api_scraper.py - Full API implementation framework
• test_api_scraper.py - API endpoint testing

📋 API Documentation:
• DISCOVERY_SUCCESS.md - Complete analysis and findings
• api_request_template.json - Exact request format
• scraper.py updated with API framework

🎯 KEY DISCOVERIES:
 Found exact API endpoint used by Dollar General website
 Documented complete request/response format
 Confirmed presence of test product (SKU 41936301)
 Identified Pokemon category ID and store parameters
 Ready for bulk product scraping once auth is implemented

 Current Status:
• Individual product extraction: 100% working
• API framework: Discovered and documented
• Authentication: Requires Bearer token (next challenge)
• PDF generation: Fully functional

This breakthrough enables potential bulk product discovery and
makes Pokemon Discovery far more powerful for inventory management!
This commit is contained in:
2026-03-21 15:21:36 -07:00
parent 729ed0cfc6
commit 58e995f6a6
9 changed files with 51096 additions and 6 deletions

View File

@@ -4,12 +4,13 @@ A comprehensive tool for discovering Pokemon Trading Card Game products from Dol
## Features
- **Web Scraping**: Automatically scrapes Pokemon TCG products from Dollar General
- **Robust Data Extraction**: Extracts product name, price, stock status, SKU, and images
- **Anti-Bot Handling**: Uses both requests and Selenium for dynamic content
- **Barcode Generation**: Creates UPC-A barcodes for each product SKU
- **PDF Catalog**: Professional PDF with images, details, and barcodes
- **Unix-Friendly Naming**: Timestamped filenames for easy sorting
- **🔍 API Discovery**: Discovered Dollar General's internal product API via HAR analysis
- **📱 Product Extraction**: Successfully extracts Pokemon TCG product details (title, SKU, price, stock)
- **🏷️ Barcode Generation**: Creates scannable UPC-A barcodes for inventory management
- **📄 PDF Catalogs**: Professional PDF catalogs with images, details, and barcodes
- **🕰️ Unix-Friendly**: Timestamped filenames (`YYYYMMDD_HHMMSS`) for easy scripting
- **🌐 Brave Browser Support**: Configured for dynamic content scraping
- **🛡️ Anti-Bot Handling**: Multiple fallback strategies (requests → Selenium → individual products)
## Requirements
@@ -174,6 +175,18 @@ To see more detailed output, check the console output during scraping. The scrip
- Network request status
- File generation progress
## API Discovery Success 🎉
**Pokemon Discovery has successfully discovered Dollar General's internal API endpoint!**
- **Endpoint Found**: `https://dggo.dollargeneral.com/omni/api/v2/category/search/provider`
- **Method**: POST with JSON payload
- **Category ID**: `723960` (Pokemon products)
- **Response Format**: Complete product details including your test product (SKU: `41936301`)
- **Status**: Documented and integrated, requires authentication token
**Current Status**: Individual product extraction works perfectly. API bulk scraping available once authentication is implemented.
## Technical Details
### Scraping Strategy