shot-power-scraper

A command-line utility for taking automated screenshots of websites, powered by nodriver for enhanced stealth and anti-detection capabilities.

This is a fork of Simon Willison's excellent shot-scraper, migrated from Playwright to nodriver. This provides powerful, built-in bypass capabilities for CAPTCHAs and services like Cloudflare.

Experimental: This tool works! But it is still a work in progress and there appear to be some underlying bugs in the nodriver library that can lead to chrome crashing. Use with caution in production environments.

Installation

The easiest way to install this tool is with uv:

uv tool install git+https://github.com/elidickinson/shot-power-scraper.git

Then run the install command to test browser detection and set up the correct user agent for stealth mode:

shot-power-scraper install

Requirements: Google Chrome or Chromium must be installed on your system. No separate driver installation is required.

Testing: The test suite runs on macOS CI in GitHub Actions and includes both unit tests and browser integration tests.

Taking your first screenshot

You can take a screenshot of a web page like this:

shot-power-scraper https://datasette.io/

This will create a screenshot in a file called datasette-io.png.

Different Output Formats

Beyond screenshots, shot-power-scraper supports multiple output formats:

Screenshots (default)

shot-power-scraper https://example.com/          # Creates example-com.png

PDF Documents

shot-power-scraper pdf https://example.com/      # Creates example-com.pdf

HTML Source

shot-power-scraper html https://example.com/     # Outputs HTML to stdout
shot-power-scraper html https://example.com/ -o page.html

MHTML Web Archives

shot-power-scraper mhtml https://example.com/    # Creates example-com.mhtml
shot-power-scraper mhtml https://example.com/ -o archive.mhtml

MHTML (MIME HTML) archives contain the complete web page including all embedded resources like images, CSS, and JavaScript in a single file - perfect for offline viewing or archival purposes.

Anti-Detection Features

This fork includes comprehensive stealth capabilities that make it much harder to detect than standard automation tools.

Enhanced Stealth with nodriver

Unlike Playwright and other automation frameworks, nodriver provides built-in anti-detection that bypasses most bot detection systems by removing automation markers, simulating natural browser behavior, and masking its fingerprint.

Required: Set Up Stealth User Agent

For stealth features to work when running in headless mode (the default) you must run the install command once to set up the correct user agent:

shot-power-scraper install

Headless vs Headful Mode

By default, shot-power-scraper runs in headless mode (browser is invisible). You can run with a visible browser using:

# Run with visible browser (no interaction pause)
shot-power-scraper --headful https://example.com

# Or use the alias
shot-power-scraper --no-headless https://example.com

This is different from -i/--interactive mode which shows the browser AND pauses for manual interaction before taking the screenshot.

Ad and Popup Blocking

This fork uses uBlock Origin Lite for content blocking during screenshot capture. Use --ad-block to enable blocking, and add --ublock-lists to enable additional filter lists for blocking popups, cookie notices, and other annoyances.

shot-power-scraper --ad-block https://example.com
shot-power-scraper --ad-block --ublock-lists annoyances-cookies,annoyances-overlays https://example.com

This can be enabled by default using the config command. For detailed information about available filter lists and customization, see EXTENSIONS.md.

Building uBlock Origin Lite

Our custom build includes "Complete" filtering mode (maximum blocking), annoyance filters, and custom rules support:

# Build extension with latest filter lists (2-3 minutes)
./shot_power_scraper/extensions/update-ublock.sh

# Build faster using cached filter lists
./shot_power_scraper/extensions/update-ublock.sh --use-cache

The script automatically clones/updates uBlock Origin, enables selected filter lists, sets "Complete" mode as default, and installs to shot_power_scraper/extensions/ublock-lite-custom/.

⚠️ Important: Differences from Original shot-scraper

This fork has some important differences from the original. It only supports Chrome/Chromium and some features aren't fully implemented.

🚫 Commands & Features That Don't Work

shot-power-scraper accessibility - Not implemented.
--log-requests option is not implemented.
--quality to specify JPEG quality not implemented.

🔄 Commands With Limited Functionality

Console logging (--log-console) - Basic CDP implementation, may miss some message types.
Browser selection (--browser) - Only Chrome/Chromium is supported.
HAR recording (har command) - Content bodies not included in the archive.

📋 Command Status

✅ shot: Fully Implemented (except --log-requests)
✅ multi: Fully Implemented
✅ pdf: Fully Implemented
✅ javascript: Fully Implemented
✅ html: Fully Implemented
✅ mhtml: Fully Implemented - Create MHTML web page archives
✅ har: Implemented (limited) - Record HTTP Archive files (content bodies not included)
✅ auth: Fully Implemented
✅ install: Fully Implemented - also sets up user agent for stealth mode
✅ config: Fully Implemented

Configuration & Defaults

shot-power-scraper stores default settings in ~/.config/shot-power-scraper/config.json. These settings are used unless overridden by command-line options.

Configuration Commands

# Set default ad blocking
shot-power-scraper config --ad-block true

# View current settings
shot-power-scraper config --show

# Clear all settings
shot-power-scraper config --clear

Examples

The following examples demonstrate concepts that can be adapted for shot-power-scraper.

Examples of similar usage patterns can be found in projects that use the original shot-scraper as a reference
The concepts demonstrated in shot-scraper-demo can be adapted for shot-power-scraper
The Datasette Documentation shows how screenshots can be integrated into documentation workflows
Projects like @newshomepages demonstrate automated screenshot workflows
scrape-hacker-news-by-domain shows JavaScript execution patterns that can be adapted

Code Architecture: `shot-power-scraper shot` Execution Path

This section outlines the major code path and functions called when executing shot-power-scraper shot ....

Entry Point and Flow

CLI Entry (cli.py:shot()) - Parse arguments, create centralized ShotConfig object with all parameters
Browser Command (cli.py:run_browser_command()) - Orchestrate browser lifecycle using shot_config
Extension Setup (browser.py:setup_blocking_extensions()) - Configure ad blocking based on shot_config
Browser Context (browser.py:create_browser_context()) - Initialize nodriver browser using shot_config parameters
Screenshot Execution (cli.py:execute_shot()) - Handle interactive mode and viewport
Core Screenshot (screenshot.py:take_shot()) - Main screenshot logic with shot_config
Page Setup (page_utils.py:create_tab_context() + navigate_to_url()) - Create tab context, navigate, wait, handle errors using shot_config
Screenshot Capture (screenshot.py:_save_screenshot()) - Take and save image
Browser Cleanup (browser.py:cleanup_browser()) - Stop browser and cleanup
Async Wrapper (cli.py:run_nodriver_async()) - Setup nodriver event loop

Key Modules and Responsibilities

cli.py - Main entry point, CLI parsing, command orchestration
shot_config.py - Centralized configuration object with all parameters (browser, screenshot, execution options) and config file management
browser.py - Browser instance management, extension setup, cleanup
screenshot.py - Core screenshot logic, selector handling, image capture
page_utils.py - Page navigation, error detection, Cloudflare handling, JavaScript execution
utils.py - Utility functions for filename generation, URL processing, GitHub script loading

Architecture Design

Centralized Configuration: All parameters (browser options, screenshot settings, execution flags) are consolidated in ShotConfig
Simplified Interfaces: run_browser_command() takes just command_func and shot_config parameters
Config File Integration: Configuration file loading and defaults are handled directly in ShotConfig.__init__()
Consistent Pattern: All CLI commands follow the same ShotConfig → run_browser_command() pattern

Major Operations

Configuration parsing, validation, and config file fallback handling
Browser context initialization with anti-detection features using consolidated configuration
Optional extension loading for ad blocking (via uBlock Lite)
Page navigation with error detection and Cloudflare bypass
JavaScript execution and custom waiting conditions
Element selector processing (CSS/JS selectors)
Screenshot capture (full page or element-specific)
Optional HTML content saving
Comprehensive cleanup of browser and temporary files

The architecture is fully async-based using nodriver for enhanced stealth capabilities and automatic anti-detection. All configuration is centralized through ShotConfig for consistency and maintainability.

How It Works: Understanding Execution Order

Standard Screenshot Sequence

CLI Parsing (cli.py:shot()) - Parse command-line arguments and create ShotConfig
Browser Initialization (browser.py:create_browser_context()) - Start nodriver browser with stealth features
Tab Creation (page_utils.py:create_tab_context()) - Create new tab and configure user agent
Page Navigation (page_utils.py:navigate_to_url()) - Navigate to target URL and wait for load
Viewport Setup (page_utils.py:navigate_to_url()) - Set viewport dimensions if width/height explicitly specified (not full page)
Error Detection - Check for Chrome error pages and DNS failures
Cloudflare Handling - Detect and wait for Cloudflare challenge bypass
Wait Operations - Apply --wait delay and --wait-for conditions
JavaScript Execution - Execute any provided JavaScript code
Lazy Loading (page_utils.py:trigger_lazy_load()) - Trigger lazy-loaded content if requested
Viewport Expansion - Apply viewport expansion when blocking extensions are enabled
Screenshot Capture (screenshot.py:_save_screenshot()) - Set final viewport and capture screenshot
HTML Saving - Save HTML content if --save-html specified
Browser Cleanup (browser.py:cleanup_browser()) - Stop browser and clean up temporary files

Feature Interaction Notes

Dual Viewport Approach:
- Window Size (set_window_size) - Controls physical browser window dimensions (important for --interactive, --headful, and --devtools modes)
- Viewport Metrics (set_device_metrics_override) - Controls page layout dimensions for rendering and screenshot capture
Viewport Timing: Viewport metrics are set immediately after navigation (step 5) if width/height explicitly specified (not full page), aiding lazy loading of images
Extension Effects: Ad/popup blocking may require viewport expansion to fix intersection observer behavior (step 11)
Lazy Loading: Only runs if --trigger-lazy-load is specified, after viewport setup but before final screenshot capture
Full Page Screenshots: Skip early viewport setup; use calculated document height for viewport dimensions during screenshot capture (step 12)
Selector Screenshots: Process JavaScript selectors before taking element-specific screenshots

Error Handling Flow

HTTP errors are checked after navigation and can trigger --skip (exit silently) or --fail (exit with error)
Navigation errors are detected and can be handled with the same skip/fail logic
Cloudflare challenges are automatically detected and waited for (unless disabled)
All errors fail loudly with exceptions for debugging unless explicitly configured otherwise

Name		Name	Last commit message	Last commit date
Latest commit History 417 Commits
.github		.github
.stfolder		.stfolder
shot_power_scraper		shot_power_scraper
tests		tests
.gitignore		.gitignore
API_README.md		API_README.md
CONFIG.md		CONFIG.md
EXTENSIONS.md		EXTENSIONS.md
LICENSE		LICENSE
MHTML.md		MHTML.md
Pipfile		Pipfile
README.md		README.md
api_server.py		api_server.py
crop_cwg.sh		crop_cwg.sh
crop_params.txt		crop_params.txt
cwg-style.js		cwg-style.js
main.py		main.py
pyproject.toml		pyproject.toml
pytest.ini		pytest.ini
requirements-api.txt		requirements-api.txt
shot		shot
shot-power-scraper-api.service		shot-power-scraper-api.service
shot-scraper-api.plist.example		shot-scraper-api.plist.example
shotbox.api.plist		shotbox.api.plist
uv.lock		uv.lock

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

shot-power-scraper

Installation

Taking your first screenshot

Different Output Formats

Screenshots (default)

PDF Documents

HTML Source

MHTML Web Archives

Anti-Detection Features

Enhanced Stealth with nodriver

Required: Set Up Stealth User Agent

Headless vs Headful Mode

Ad and Popup Blocking

Building uBlock Origin Lite

⚠️ Important: Differences from Original shot-scraper

🚫 Commands & Features That Don't Work

🔄 Commands With Limited Functionality

📋 Command Status

Configuration & Defaults

Configuration Commands

Examples

Code Architecture: shot-power-scraper shot Execution Path

Entry Point and Flow

Key Modules and Responsibilities

Architecture Design

Major Operations

How It Works: Understanding Execution Order

Standard Screenshot Sequence

Feature Interaction Notes

Error Handling Flow

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Sponsor this project

Uh oh!

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Code Architecture: `shot-power-scraper shot` Execution Path

Packages