
Taobao Product Scraper
Pricing
Pay per usage
Go to Apify Store

Taobao Product Scraper
Extract detailed product data from Taobao, including title, price, reviews, specifications, images, options, and categories. Perfect for market research, e-commerce insights, and data-driven decision-making
0.0 (0)
Pricing
Pay per usage
1
50
25
Issues response
3.6 hours
Last modified
23 days ago
🛒 Taobao Product Scraper
This Actor scrapes detailed product data from Taobao using Playwright. It’s perfect for market research, competitor tracking, and e-commerce automation, providing all data as clean JSON.
✨ What it does
- Fetches product title, ID, and URL.
- Extracts price and sales details.
- Collects all product images and video (if available).
- Scrapes brand, material, size, style, and full specifications.
- Gathers categories, reviews, and ratings.
- Captures all product options/variants with images.
- Outputs everything as structured JSON.
🧾 Example Output
{"product_id": "624164131692","product_url": "[https://item.taobao.com/item.htm?id=624164131692](https://item.taobao.com/item.htm?id=624164131692)","product_title": "Authentic kaws puzzle high transparent acrylic display magnetic DIY picture frame modern trend decorative ornaments","price_info": "95.00","sales_info": null,"image_urls": ["[https://img.alicdn.com/imgextra/i1/1822521250/O1CN01GCWTQ11L6Xh9zvtAQ_!!1822521250.jpg_q50s50.jpg_.webp](https://img.alicdn.com/imgextra/i1/1822521250/O1CN01GCWTQ11L6Xh9zvtAQ_!!1822521250.jpg_q50s50.jpg_.webp)","... more images ..."],"categories": "front page > Home accessories > Photo frame/photo frame/picture frame","review_count": "16","details": {"Brand": "Neverislang.","Material": "Acrylic","Style": "simple and modern","Origin": "Mainland China"},"product_options": [{"label": "size","options": [{"text": "Other sizes"}]}]}
🛠️ How to Use
Run the Actor via the API and automatically download the JSON file with all the product data.
- Replace your credentials: Add your
API_TOKEN
andACTOR_ID
to the script. - Set the product URL: Update the
PRODUCT_URL
variable. - Run the script: The script will start the Actor, wait for it to finish, and save the results to
taobao_product.json
.
import osimport timeimport jsonimport requestsAPI_TOKEN = "YOUR_APIFY_API_KEY"ACTOR_ID = "pizani~taobao-product-scraper"PRODUCT_URL = "https://item.taobao.com/item.htm?id=624164131692"# Simple folder pathoutput_dir = "./scraped_results"os.makedirs(output_dir, exist_ok=True)output_file = os.path.join(output_dir, "product_data.json")print("Starting the Actor...")run_resp = requests.post(f"https://api.apify.com/v2/acts/{ACTOR_ID}/runs?token={API_TOKEN}",json={"product_url": PRODUCT_URL})run_data = run_resp.json()run_id = run_data.get("data", {}).get("id")if not run_id:raise Exception(f"Failed to start Actor: {run_data}")print(f"Actor started successfully! Run ID: {run_id}")status_url = f"https://api.apify.com/v2/actor-runs/{run_id}?token={API_TOKEN}"while True:status_resp = requests.get(status_url).json()status = status_resp.get("data", {}).get("status")print(f"Current Actor status: {status}")if status in {"SUCCEEDED", "FAILED", "ABORTED", "TIMED-OUT"}:breaktime.sleep(5)print("Fetching the result from the Key-Value Store...")item_url = f"https://api.apify.com/v2/key-value-stores/{status_resp['data']['defaultKeyValueStoreId']}/records/OUTPUT?disableRedirect=true&token={API_TOKEN}"result = requests.get(item_url).json()# Save JSON file locallywith open(output_file, "w", encoding="utf-8") as f:json.dump(result, f, ensure_ascii=False, indent=4)print("\n--- Final Scraped Product Data ---")print(result)print(f"\nJSON file saved at: {os.path.abspath(output_file)}")
📊 Use Cases
- Market Research: Track competitors and product trends.
- E-commerce Automation: Build catalogs for Shopify, WooCommerce, etc.
- Price Tracking: Save historical data for analysis.
- Data Analytics: Integrate JSON with BI dashboards.
On this page
Share Actor: