A complete web scraping framework built on Python stdlib. CSS-style selectors, automatic pagination, rate limiting, proxy rotation, and multi-format export — no pip installs needed.
Browse the actual product documentation and code examples included in this toolkit.
Key features of Web Scraper
• CSS-like selectors — Extract elements by tag, `.class`, or `#id` • Automatic pagination — Follow next-page links across multiple pages • Rate limiting — Configurable delay between requests (be a polite scraper) • Proxy rotation — Round-robin through a list of proxy servers • Retry with backoff — Exponential backoff on 429/5xx errors • Multi-format export — Save to JSON, CSV, or SQLite
CSS-like selectors — Extract elements by tag, `.class`, or `#id`
Automatic pagination — Follow next-page links across multiple pages
Rate limiting — Configurable delay between requests (be a polite scraper)
Proxy rotation — Round-robin through a list of proxy servers
Retry with backoff — Exponential backoff on 429/5xx errors
Multi-format export — Save to JSON, CSV, or SQLite
Configure Web Scraper parameters to see how the product works.
# Scrape all h2 elements from a page
python src/web_scraper.py --url "https://example.com" --selector "h2"
# Scrape product cards with pagination
python src/web_scraper.py --url "https://example.com/shop" --selector ".product" \
--follow --next-selector "a.next" --max-pages 5
# Export to CSV
p