NS

Nordstrom Scraper

Try for free

2 hours trial then $35.00/month - No credit card required now

Go to Store
NS

Nordstrom Scraper

runtime/nordstrom-scraper
Try for free

2 hours trial then $35.00/month - No credit card required now

Nordstrom Scraper is an Apify Actor that scrapes product data from Nordstrom’s search results pages. It extracts key product details such as title, brand, image URL, current price, and previous price.

Nordstrom Scraper is an Apify Actor that scrapes product data from Nordstrom’s search results pages. It extracts key product details such as title, brand, image URL, current price, and previous price. The scraper supports pagination and configurable delays to ensure that lazy-loaded content is captured, and it can be run behind a proxy to help bypass bot detection.

🚀 Features

  • Scrapes product details: Extracts product title, brand, image URL, current price, and previous price (with currency symbols removed).
  • Pagination: Iterates over multiple pages using the configurable maxPages parameter.
  • Configurable delay: Waits a set number of seconds (default is 2s) after page load to allow lazy-loaded content to render.
  • Proxy and Stealth support: Uses Apify Proxy (if configured) and a stealth plugin with Playwright for realistic browsing behavior.
  • Structured output: Stores each product as an individual entry in an Apify Dataset for easy export and further processing.

📥 Input Parameters

The Actor accepts the following input parameters:

ParameterTypeDescriptionDefault Value
nordstromUrlstringThe base URL of the Nordstrom search results page (e.g., for heels search)."https://www.nordstrom.com/sr?origin=keywordsearch&keyword=heels"
maxPagesintegerThe maximum number of pages to scrape.10
useApifyProxybooleanWhether to use the Apify Proxy.true
proxyConfigurationobjectProxy configuration object (can be customized or use Apify Proxy defaults).{ "useApifyProxy": true, "apifyProxyGroups": ["DATACENTER"] }

Example Input

1{
2  "nordstromUrl": "https://www.nordstrom.com/sr?origin=keywordsearch&keyword=heels",
3  "maxPages": 10,
4  "useApifyProxy": true,
5  "proxyConfiguration": {
6    "useApifyProxy": true,
7    "apifyProxyGroups": ["DATACENTER"]
8  }
9}

Result

The output dataset includes the following fields for each job listing, as specified in dataset_schema.json:

title (string): The title of the job listing. company (string): The name of the company offering the job. location (string): The location of the job. jobLink (string): The URL link to the job listing. responsibilities (array of strings): A list of responsibilities or job description items.

Example Result

1[
2  {
3    "title": "Software Engineer",
4    "company": "Tech Corp",
5    "location": "San Francisco, CA",
6    "jobLink": "https://www.indeed.com/viewjob?jk=12345",
7    "responsibilities": [
8      "Develop and maintain software applications.",
9      "Collaborate with product managers and designers."
10    ]
11  },
12  {
13    "title": "Frontend Developer",
14    "company": "Web Solutions",
15    "location": "San Francisco, CA",
16    "jobLink": "https://www.indeed.com/viewjob?jk=67890",
17    "responsibilities": [
18      "Implement UI/UX designs.",
19      "Work closely with the backend team."
20    ]
21  }
22]

⚙️ How It Works

Page Processing

Pagination:
A request queue is populated with URLs for each page (from page 1 up to the maxPages value) by appending or replacing the page parameter in the base URL.

Data Extraction

Product Details:
Each product is extracted from an <article> element on the page. The Actor scrapes:

  • Title: From an <h3> element containing an <a> tag.
  • Brand: Using a heuristic from surrounding <div> elements.
  • Image URL: From the product’s <img> element.
  • Current Price and Previous Price: Extracted from the page text using regex, with currency symbols (€ and $) removed.

Output Format:
Each product is stored as an individual object in the Apify Dataset, resulting in a flat array of products.

Result Storage:
All scraped product data is pushed to an Apify Dataset as individual entries, making it easy to review, export, or process further.

📤 Output

The Actor outputs a flat JSON dataset with each product structured as follows:

1{
2  "title": "Ce-Soire Slide Sandal (Women)",
3  "brand": "Jeffrey Campbell",
4  "imageUrl": "https://n.nordstrommedia.com/it/173749f5-29fb-4bf2-a7e2-d93ec5cc684d.jpeg?h=365&w=240&dpr=2",
5  "currentPrice": "147,47",
6  "previousPrice": "147,52"
7}

This project is intended for educational and research purposes only. When using this Actor, please comply with Nordstrom’s Terms of Service and relevant robots.txt policies.
Use this tool responsibly and avoid aggressive scraping that could negatively impact Nordstrom’s website infrastructure.

Developer
Maintained by Community

Actor Metrics

  • 2 monthly users

  • 0 No bookmarks yet

  • >99% runs succeeded

  • Created in Feb 2025

  • Modified 2 days ago