No credit card required

Playwright Scraper

apify/playwright-scraper

No credit card required

Crawls websites with the headless Chromium, Chrome, or Firefox browser and Playwright library using a provided server-side Node.js code. Supports both recursive crawling and a list of URLs. Supports login to a website.

Do you want to learn more about this Actor?

Get a demo

You can access the Playwright Scraper programmatically from your own JavaScript applications by using the Apify API. You can also choose the language preference from below. To use the Apify API, you’ll need an Apify account and your API token, found in Integrations settings in Apify Console.

1import { ApifyClient } from 'apify-client';
2
3// Initialize the ApifyClient with your Apify API token
4// Replace the '<YOUR_API_TOKEN>' with your token
5const client = new ApifyClient({
6    token: '<YOUR_API_TOKEN>',
7});
8
9// Prepare Actor input
10const input = {
11    "startUrls": [
12        {
13            "url": "https://crawlee.dev"
14        }
15    ],
16    "globs": [
17        {
18            "glob": "https://crawlee.dev/*/*"
19        }
20    ],
21    "pseudoUrls": [],
22    "excludes": [
23        {
24            "glob": "/**/*.{png,jpg,jpeg,pdf}"
25        }
26    ],
27    "linkSelector": "a",
28    "pageFunction": async function pageFunction(context) {
29        const { page, request, log } = context;
30        const title = await page.title();
31        log.info(`URL: ${request.url} TITLE: ${title}`);
32        return {
33            url: request.url,
34            title
35        };
36    },
37    "proxyConfiguration": {
38        "useApifyProxy": true
39    },
40    "initialCookies": [],
41    "launcher": "chromium",
42    "waitUntil": "networkidle",
43    "preNavigationHooks": `// We need to return array of (possibly async) functions here.
44        // The functions accept two arguments: the "crawlingContext" object
45        // and "gotoOptions".
46        [
47            async (crawlingContext, gotoOptions) => {
48                const { page } = crawlingContext;
49                // ...
50            },
51        ]`,
52    "postNavigationHooks": `// We need to return array of (possibly async) functions here.
53        // The functions accept a single argument: the "crawlingContext" object.
54        [
55            async (crawlingContext) => {
56                const { page } = crawlingContext;
57                // ...
58            },
59        ]`,
60    "customData": {}
61};
62
63// Run the Actor and wait for it to finish
64const run = await client.actor("apify/playwright-scraper").call(input);
65
66// Fetch and print Actor results from the run's dataset (if any)
67console.log('Results from dataset');
68console.log(`💾 Check your data here: https://console.apify.com/storage/datasets/${run.defaultDatasetId}`);
69const { items } = await client.dataset(run.defaultDatasetId).listItems();
70items.forEach((item) => {
71    console.dir(item);
72});
73
74// 📚 Want to learn more 📖? Go to → https://docs.apify.com/api/client/js/docs

Playwright Scraper API in JavaScript

The Apify API client for JavaScript is the official library that allows you to use Playwright Scraper API in JavaScript or TypeScript, providing convenience functions and automatic retries on errors.

Install the apify-client

npm install apify-client

Other API clients include:

Playwright Scraper API in Python

Playwright Scraper API through CLI

Playwright Scraper API

Developer

Apify

Actor metrics

63 monthly users
14 stars
99.4% runs succeeded
21 days response time
Created in Aug 2022
Modified 3 months ago

Categories

Developer tools

For creators

Puppeteer Scraper

apify/puppeteer-scraper

Crawls websites with the headless Chrome and Puppeteer library using a provided server-side Node.js code. This crawler is an alternative to apify/web-scraper that gives you finer control over the process. Supports both recursive crawling and list of URLs. Supports login to website.

Apify

4.2k

Facebook Marketplace

shmlkv/facebook-marketplace

This is a simple scraper for Facebook Marketplace. It uses Playwright to scrape the data

Andre Sh

432

Redfin Fast Scraper

mantisus/redfin-fast-scraper

Redfin: Scrape fast, stay light! Skip bloated browser tools. My Redfin scraper extracts property data in a flash, no heavy lifting is needed. Scrape/monitor listings with ease, all without Puppeteer or Playwright. ⚡️

Maksym Bohomolov

Thefork Fast Scraper

mantisus/thefork-fast-scraper

Scrape TheFork.com quickly and easily! Skip bloated browser tools. This scraper extracts restaurant data in a flash, no heavy lifting is needed. Scrape and monitor data with ease, all without Puppeteer or Playwright. ⚡️

Maksym Bohomolov

Redfin Fast Scraper Per Results

mantisus/redfin-fast-scraper-per-results

Maksym Bohomolov

Thefork Fast Scraper Per Result

mantisus/thefork-fast-scraper-per-result

Maksym Bohomolov

Zoopla.co.uk Fast Scraper

mantisus/zoopla-actor

Zoopla.co.uk: Scrape fast, stay light! Skip bloated browser tools. My Zoopla scraper extracts property data in a flash, no heavy lifting is needed. Scrape/monitor listings with ease, all without Puppeteer or Playwright. ⚡️

Maksym Bohomolov

X Crawler

lumen_limitless/x-crawler

This project is a web scraper designed to extract user data and tweets from X (formerly known as Twitter) using Crawlee and Playwright.

lumen limitless

Zalando Price Comparator

mantisus/zalando-price-comparator

Zalando: scrape without stressing! Skip the bloated browser-based tools. My Zalando scraper extracts price and stock data from all Zalando stores. Scrape prices and remainders, all without Puppeteer or Playwright. ⚡️

Maksym Bohomolov

Website Content Crawler

apify/website-content-crawler

Crawl websites and extract text content to feed AI models, LLM applications, vector databases, or RAG pipelines. The Actor supports rich formatting using Markdown, cleans the HTML, downloads files, and integrates well with 🦜🔗LangChain, LlamaIndex, and the wider LLM ecosystem.

Apify

21.2k

472