Website Screenshot Crawler
A template for automated website screenshot capturing. This actor takes screenshots of websites from specified URLs, uploads them to Apify Key-Value Store, and provides screenshot URLs in a dataset. It is ideal for monitoring website changes, archiving web content, or capturing visuals for reports. The actor uses Pyppeteer for browser automation and screenshot generation.
Source Code
You can find the source code for this actor in my GitHub account: https://github.com/DZ-ABDLHAKIM
Included Features
- Apify SDK - A toolkit for building Apify Actors and scrapers in Python.
- Pyppeteer - A Python port of Puppeteer, an open-source tool for automating web browsers using a high-level API.
- Key-Value Store - Store screenshots and metadata for easy retrieval.
- Dataset - Structured storage for results like screenshot URLs and metadata.
- Cookie and Viewport Support - Allows setting cookies and specifying the viewport dimensions before capturing screenshots.
Input
The input for this actor should be a JSON object containing the necessary configuration. The only required field is `link_urls`, which must be an array of website URLs; all other fields are optional. Here's a detailed description of the input fields:
| Field | Type | Description | Allowed Values |
|---|---|---|---|
| `link_urls` | Array | An array of website URLs to capture screenshots of. | Any valid URL |
| `Sleep` | Number | Duration (in seconds) to wait after the page has loaded before taking a screenshot. | Minimum: 0, Maximum: 3600 |
| `waitUntil` | String | Event to wait for before taking the screenshot. | One of: `"load"`, `"domcontentloaded"`, `"networkidle2"`, `"networkidle0"` |
| `cookies` | Array | Cookies to set for the browser session. | Array of cookie objects |
| `fullPage` | Boolean | Whether to capture the full page or just the viewport. | `true` or `false` |
| `window_Width` | Number | Width of the browser viewport (in pixels). | Minimum: 100, Maximum: 3840 |
| `window_Height` | Number | Height of the browser viewport (in pixels). | Minimum: 100, Maximum: 2160 |
| `scrollToBottom` | Boolean | Whether the browser should scroll to the bottom of the page before taking a screenshot. | `true` or `false` |
| `distance` | Number | Distance (in pixels) to scroll down for each scroll action. | Minimum: 0 |
| `delay` | Number | Delay (in milliseconds) between scroll actions. | Minimum: 0, Maximum: 3600000 |
| `delayAfterScrolling` | Number | Delay (in milliseconds) after scrolling to the bottom of the page before taking a screenshot. | Minimum: 0, Maximum: 3600000 |
| `waitUntilNetworkIdleAfterScroll` | Boolean | Whether to wait for the network to become idle after scrolling to the bottom of the page. | `true` or `false` |
| `waitUntilNetworkIdleAfterScrollTimeout` | Number | Maximum wait time (in milliseconds) for the network to become idle after scrolling. | Minimum: 1000, Maximum: 3600000 |
For more information about the `waitUntil` parameter, please refer to the Puppeteer `page.goto` function documentation.
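For example, an input capturing full-page screenshots of two sites might look like the following. The field names come from the table above; the values (and the cookie object) are purely illustrative:

```json
{
    "link_urls": ["https://example.com", "https://apify.com"],
    "Sleep": 2,
    "waitUntil": "networkidle2",
    "cookies": [
        { "name": "session", "value": "abc123", "domain": "example.com" }
    ],
    "fullPage": true,
    "window_Width": 1920,
    "window_Height": 1080,
    "scrollToBottom": true,
    "distance": 500,
    "delay": 200
}
```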
Output
Once the actor finishes, it stores a screenshot of each website as a file in the Key-Value Store associated with the run. The screenshot URLs are also stored in a dataset for easy access.
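A dataset record might look like the following. The field names here are illustrative, not the actor's exact schema; the record URL follows the Apify platform's standard Key-Value Store record pattern:

```json
{
    "url": "https://example.com",
    "screenshot_url": "https://api.apify.com/v2/key-value-stores/<storeId>/records/<key>"
}
```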
How It Works
- Input Configuration: The actor reads the input data as specified above.
- Browser Automation: The actor launches a headless browser using Pyppeteer, loads the target URLs, and captures screenshots.
- Setting Cookies and Viewport: Before navigating to each link, specified cookies are set using `page.setCookie()`, and the viewport is configured with the specified width and height.
- Page Navigation: The actor navigates to each URL using `page.goto()`, waiting for the specified `waitUntil` event.
- Scrolling (Optional): If the `scrollToBottom` option is enabled, the actor executes a scrolling script that scrolls down the page by the defined `distance` in pixels.
- Screenshot Capture: After the page has fully loaded, the actor waits for the `Sleep` duration before capturing the screenshot and saves it with a random filename.
- Uploading Screenshots: The captured screenshots are read as binary data and uploaded to the Apify Key-Value Store using `Actor.set_value()`, with their URLs stored in the dataset.
- Logging and Error Handling: The actor logs the success or failure of each URL and continues processing the remaining URLs even if one fails.
- Cleanup: After processing all URLs, the actor closes the browser.
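The following is a minimal sketch of that flow using the Apify Python SDK and Pyppeteer. It is condensed and illustrative rather than the actor's exact source: the default values, the random-filename scheme, and the dataset field names are assumptions, and the screenshot URL is built from the platform's standard record-URL pattern.

```python
import asyncio
import os
import random
import string

from apify import Actor
from pyppeteer import launch


async def main() -> None:
    async with Actor:
        actor_input = await Actor.get_input() or {}

        browser = await launch(headless=True)
        page = await browser.newPage()

        # Configure the viewport and cookies before navigating.
        await page.setViewport({
            'width': actor_input.get('window_Width', 1280),
            'height': actor_input.get('window_Height', 720),
        })
        for cookie in actor_input.get('cookies', []):
            await page.setCookie(cookie)

        for url in actor_input.get('link_urls', []):
            try:
                # Navigate and wait for the configured lifecycle event.
                await page.goto(url, {'waitUntil': actor_input.get('waitUntil', 'load')})

                # Optionally scroll to the bottom in `distance`-pixel steps.
                if actor_input.get('scrollToBottom'):
                    await page.evaluate(
                        '''async (distance, delay) => {
                            while (window.scrollY + window.innerHeight < document.body.scrollHeight) {
                                window.scrollBy(0, distance);
                                await new Promise(r => setTimeout(r, delay));
                            }
                        }''',
                        actor_input.get('distance', 500),
                        actor_input.get('delay', 100),
                    )

                # Wait the configured `Sleep` duration, then capture.
                await asyncio.sleep(actor_input.get('Sleep', 0))
                screenshot = await page.screenshot({'fullPage': actor_input.get('fullPage', False)})

                # Save under a random filename and record the public record URL.
                key = ''.join(random.choices(string.ascii_lowercase + string.digits, k=12)) + '.png'
                await Actor.set_value(key, screenshot, content_type='image/png')
                store_id = os.environ['APIFY_DEFAULT_KEY_VALUE_STORE_ID']
                await Actor.push_data({
                    'url': url,
                    'screenshot_url': f'https://api.apify.com/v2/key-value-stores/{store_id}/records/{key}',
                })
                Actor.log.info(f'Captured {url}')
            except Exception:
                # Log the failure and continue with the remaining URLs.
                Actor.log.exception(f'Failed to capture {url}')

        # Cleanup: close the browser once all URLs are processed.
        await browser.close()


if __name__ == '__main__':
    asyncio.run(main())
```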
This open-source actor effectively automates the process of capturing and storing screenshots of multiple web pages, making it a valuable tool for monitoring website changes, archiving content, or generating visual reports.
Getting Started
To get started with this actor:
- Build the Actor: Define your input URLs and configure optional settings like scrolling and sleep duration.
- Run the Actor: Execute the actor on the Apify platform or locally using the Apify CLI.
Pull the Actor for Local Development
To develop this actor locally, follow these steps:
- Install `apify-cli`:

  Using Homebrew:

  ```bash
  brew install apify-cli
  ```

  Using NPM:

  ```bash
  npm install -g apify-cli
  ```

- Pull the Actor using its unique `<ActorId>`:

  ```bash
  apify pull <ActorId>
  ```
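Once pulled, you can run the actor locally with the CLI. This assumes your input is saved in the CLI's default local storage location (`storage/key_value_stores/default/INPUT.json`):

```bash
apify run
```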
Example Use Cases
- Website Monitoring: Capture screenshots periodically to monitor changes to web pages.
- Visual Archiving: Store visual representations of websites over time for research or archival purposes.
- Reporting: Automatically capture visuals for reports or presentations.
Documentation Reference
- Apify SDK for Python
- Apify Platform Documentation
- Pyppeteer API Documentation
- Join the Apify Developer Community
Contact Information
For any inquiries, you can reach me at:
Email: fridaytechnolog@gmail.com
GitHub: https://github.com/DZ-ABDLHAKIM
Twitter: https://x.com/DZ_45Omar