Removed 20 files: old test scripts, debug tools, duplicate docs,
generated JSON, old PDF generator, launcher scripts.
Kept:
disco.py — main tool (scrape HAR + generate PDF)
scraper.py — reference site scraper (HTML + Selenium/Brave)
requirements.txt
*.har — browser capture with API data
Updated:
README.md — rewritten to reflect current tool and usage
.gitignore — simplified
scraper.py — module/class/method docstrings updated to clarify
this is a reference implementation, disco.py is primary
- Fixed double .png extension issue in barcode generation
- Added test data file for demonstrating functionality
- Updated gitignore to allow test data while excluding output files
- Comprehensive testing of PDF generation pipeline
- All core features working: barcode generation, PDF creation, data processing
- Added detailed test results documentation
Test summary:
✅ Virtual environment setup
✅ Python dependencies installation
✅ UPC-A barcode generation (3-6KB PNG files)
✅ Professional PDF catalog generation (161KB output)
✅ Markdown formatting and file organization
✅ Error handling and fallbacks
- Comprehensive scraper for Dollar General Pokemon TCG products
- Professional PDF catalog generator with UPC-A barcodes
- Robust anti-bot handling with requests + Selenium fallback
- Automatic image downloading and barcode generation
- Unix-friendly timestamped filenames
- Virtual environment support and dependency management
- Complete documentation and usage guides