Removed 20 files: old test scripts, debug tools, duplicate docs,
generated JSON, old PDF generator, launcher scripts.
Kept:
disco.py — main tool (scrape HAR + generate PDF)
scraper.py — reference site scraper (HTML + Selenium/Brave)
requirements.txt
*.har — browser capture with API data
Updated:
README.md — rewritten to reflect current tool and usage
.gitignore — simplified
scraper.py — module/class/method docstrings updated to clarify
this is a reference implementation, disco.py is primary
✅ Successfully discovered internal API via HAR analysis:
• Endpoint: https://dggo.dollargeneral.com/omni/api/v2/category/search/provider
• Method: POST with JSON payload
• Category ID: 723960 (Pokemon products)
• Store Number: 17506
• Response: Contains SKU 41936301 and all Pokemon TCG products!
🔬 HAR Analysis Tools Added:
• analyze_har.py - Extract API calls from HAR files
• extract_api_details.py - Detailed API request format extraction
• implement_api_scraper.py - Full API implementation framework
• test_api_scraper.py - API endpoint testing
📋 API Documentation:
• DISCOVERY_SUCCESS.md - Complete analysis and findings
• api_request_template.json - Exact request format
• scraper.py updated with API framework
🎯 KEY DISCOVERIES:
✅ Found exact API endpoint used by Dollar General website
✅ Documented complete request/response format
✅ Confirmed presence of test product (SKU 41936301)
✅ Identified Pokemon category ID and store parameters
✅ Ready for bulk product scraping once auth is implemented
⚡ Current Status:
• Individual product extraction: 100% working
• API framework: Discovered and documented
• Authentication: Requires Bearer token (next challenge)
• PDF generation: Fully functional
This breakthrough enables potential bulk product discovery and
makes Pokemon Discovery far more powerful for inventory management!
✅ Configured Brave browser integration (/usr/bin/brave)
✅ Updated Selenium WebDriver to use Brave binary
✅ Added proper Service-based WebDriver initialization
✅ Enhanced error handling and fallback mechanisms
✅ Created comprehensive Brave compatibility test script
🔧 Technical improvements:
- Fixed WebDriver initialization for newer Selenium versions
- Added detailed browser version detection
- Improved error messages for ChromeDriver compatibility issues
- Enhanced dynamic content handling with longer wait times
📋 Known compatibility note:
- Brave 146 vs ChromeDriver 114 version mismatch (solvable)
- Core PDF generation functionality works independently
- Graceful fallback to requests-only mode when browser unavailable
This allows users with Brave browser to utilize dynamic content scraping
while maintaining full functionality for PDF catalog generation.
- Comprehensive scraper for Dollar General Pokemon TCG products
- Professional PDF catalog generator with UPC-A barcodes
- Robust anti-bot handling with requests + Selenium fallback
- Automatic image downloading and barcode generation
- Unix-friendly timestamped filenames
- Virtual environment support and dependency management
- Complete documentation and usage guides