Scrape Alibaba Products avatar
Scrape Alibaba Products

Pricing

$19.99 / 1,000 results

Go to Apify Store
Scrape Alibaba Products

Scrape Alibaba Products

Developed by

shareze

shareze

Maintained by Community

Scrape product information from the product details page, product category page. The product category page includes `productgrouplist`, `featureproductlist`, `productlist`, etc. Currently using an excellent proxy tool, making data scraping more stable.

0.0 (0)

Pricing

$19.99 / 1,000 results

6

101

15

Last modified

5 days ago

This Apify Actor is designed to scrape product data from Alibaba. It retrieves detailed information about products, including categories, prices, descriptions, and more.

How to Use

  1. Input Configuration: The Actor requires the following input parameters, which are defined in the input_schema.json file:

    • size (integer, required): The total number of products to scrape. Default is 10.
    • group_urls (array, optional): A list of Alibaba category URLs to scrape products from.
    • detail_urls (array, optional): A list of specific product detail URLs to scrape.

    Example input:

    {
    "size": 10,
    "group_urls": [
    { "url": "https://ycsolarenergy.en.alibaba.com/productlist-1.html" }
    ],
    "detail_urls": [
    { "url": "https://www.alibaba.com/product-detail/Teddy-Bear-I-Love-You-Valentines_1600505498261.html" }
    ]
    }
  2. Run the Actor: You can run the Actor on the Apify platform or locally using the Apify SDK. Provide the required input parameters in JSON format.

  3. Output: The Actor will return a JSON array containing the scraped product data. Each product includes fields such as:

    • url: The product URL.
    • fromUrl: The source URL where the product was found.
    • productId: The unique ID of the product.
    • subject: The product title.
    • categories: A list of categories the product belongs to.
    • mediaItems: Media items (e.g., images) associated with the product.
    • moq: Minimum order quantity.
    • price: The product price.
    • sku: Stock keeping unit information.
    • sample: Sample availability.
    • sampleInfo: Additional information about samples.
    • productHtmlDescription: The product description in HTML format.
    • productBasicProperties: Basic properties of the product.
    • productKeyIndustryProperties: Key industry-specific properties.
    • productOtherProperties: Other properties of the product.

    Example output:

    [
    {
    "url": "https://example.com/product/123",
    "fromUrl": "https://example.com/category",
    "productId": "123456",
    "subject": "High-Quality Product",
    "categories": ["Category1", "Category2"],
    "mediaItems": ["https://example.com/image1.jpg"],
    "moq": "100 pieces",
    "price": "$10.00",
    "sku": "SKU123",
    "sample": true,
    "sampleInfo": "Sample available",
    "productHtmlDescription": "<p>This is a high-quality product...</p>",
    "productBasicProperties": {},
    "productKeyIndustryProperties": {},
    "productOtherProperties": {}
    }
    ]

Purpose of the Output Fields

  • url: Direct link to the product.
  • fromUrl: The URL from which the product was scraped.
  • productId: Unique identifier for the product.
  • subject: Title or name of the product.
  • categories: Categories the product belongs to.
  • mediaItems: Links to product images or media.
  • moq: Minimum order quantity required.
  • price: Price of the product.
  • sku: Stock keeping unit for inventory tracking.
  • sample: Indicates if a sample is available.
  • sampleInfo: Additional details about the sample.
  • productHtmlDescription: HTML-formatted description of the product.
  • productBasicProperties: General properties of the product.
  • productKeyIndustryProperties: Industry-specific properties.
  • productOtherProperties: Miscellaneous properties.