Nordstrom Scraper
2 hours trial then $35.00/month - No credit card required now
Nordstrom Scraper
2 hours trial then $35.00/month - No credit card required now
Nordstrom Scraper is an Apify Actor that scrapes product data from Nordstrom’s search results pages. It extracts key product details such as title, brand, image URL, current price, and previous price.
Nordstrom Scraper is an Apify Actor that scrapes product data from Nordstrom’s search results pages. It extracts key product details such as title, brand, image URL, current price, and previous price. The scraper supports pagination and configurable delays to ensure that lazy-loaded content is captured, and it can be run behind a proxy to help bypass bot detection.
🚀 Features
- ✅ Scrapes product details: Extracts product title, brand, image URL, current price, and previous price (with currency symbols removed).
- ✅ Pagination: Iterates over multiple pages using the configurable
maxPages
parameter. - ✅ Configurable delay: Waits a set number of seconds (default is 2s) after page load to allow lazy-loaded content to render.
- ✅ Proxy and Stealth support: Uses Apify Proxy (if configured) and a stealth plugin with Playwright for realistic browsing behavior.
- ✅ Structured output: Stores each product as an individual entry in an Apify Dataset for easy export and further processing.
📥 Input Parameters
The Actor accepts the following input parameters:
Parameter | Type | Description | Default Value |
---|---|---|---|
nordstromUrl | string | The base URL of the Nordstrom search results page (e.g., for heels search). | "https://www.nordstrom.com/sr?origin=keywordsearch&keyword=heels" |
maxPages | integer | The maximum number of pages to scrape. | 10 |
useApifyProxy | boolean | Whether to use the Apify Proxy. | true |
proxyConfiguration | object | Proxy configuration object (can be customized or use Apify Proxy defaults). | { "useApifyProxy": true, "apifyProxyGroups": ["DATACENTER"] } |
Example Input
1{ 2 "nordstromUrl": "https://www.nordstrom.com/sr?origin=keywordsearch&keyword=heels", 3 "maxPages": 10, 4 "useApifyProxy": true, 5 "proxyConfiguration": { 6 "useApifyProxy": true, 7 "apifyProxyGroups": ["DATACENTER"] 8 } 9}
Result
The output dataset includes the following fields for each job listing, as specified in dataset_schema.json:
title (string): The title of the job listing. company (string): The name of the company offering the job. location (string): The location of the job. jobLink (string): The URL link to the job listing. responsibilities (array of strings): A list of responsibilities or job description items.
Example Result
1[ 2 { 3 "title": "Software Engineer", 4 "company": "Tech Corp", 5 "location": "San Francisco, CA", 6 "jobLink": "https://www.indeed.com/viewjob?jk=12345", 7 "responsibilities": [ 8 "Develop and maintain software applications.", 9 "Collaborate with product managers and designers." 10 ] 11 }, 12 { 13 "title": "Frontend Developer", 14 "company": "Web Solutions", 15 "location": "San Francisco, CA", 16 "jobLink": "https://www.indeed.com/viewjob?jk=67890", 17 "responsibilities": [ 18 "Implement UI/UX designs.", 19 "Work closely with the backend team." 20 ] 21 } 22]
⚙️ How It Works
Page Processing
Pagination:
A request queue is populated with URLs for each page (from page 1 up to the maxPages
value) by appending or replacing the page parameter in the base URL.
Data Extraction
Product Details:
Each product is extracted from an <article>
element on the page. The Actor scrapes:
- Title: From an
<h3>
element containing an<a>
tag. - Brand: Using a heuristic from surrounding
<div>
elements. - Image URL: From the product’s
<img>
element. - Current Price and Previous Price: Extracted from the page text using regex, with currency symbols (€ and $) removed.
Output Format:
Each product is stored as an individual object in the Apify Dataset, resulting in a flat array of products.
Result Storage:
All scraped product data is pushed to an Apify Dataset as individual entries, making it easy to review, export, or process further.
📤 Output
The Actor outputs a flat JSON dataset with each product structured as follows:
1{ 2 "title": "Ce-Soire Slide Sandal (Women)", 3 "brand": "Jeffrey Campbell", 4 "imageUrl": "https://n.nordstrommedia.com/it/173749f5-29fb-4bf2-a7e2-d93ec5cc684d.jpeg?h=365&w=240&dpr=2", 5 "currentPrice": "147,47", 6 "previousPrice": "147,52" 7}
⚠️ Legal Disclaimer
This project is intended for educational and research purposes only. When using this Actor, please comply with Nordstrom’s Terms of Service and relevant robots.txt policies.
Use this tool responsibly and avoid aggressive scraping that could negatively impact Nordstrom’s website infrastructure.
Actor Metrics
2 monthly users
-
0 No bookmarks yet
>99% runs succeeded
Created in Feb 2025
Modified 2 days ago