Amazon Scraper

Pricing

$10.00 / 1,000 results

Developed by

Junglee

Maintained by Apify

Gets you product data from Amazon. Unofficial API. Scrapes and downloads product information without using the Amazon API, including reviews, prices, descriptions, and ASIN.

4.4 (14)

113

Monthly users: 603

Runs succeeded: 98%

Response time: 20 hours

Last modified: 4 days ago

Duplicate issue when scraping via the API

Closed

sinclairgsm opened this issue
4 months ago

Hello,

First of all, thank you for your work. I am using the scraper via the API, and I am encountering an issue: several products that have already been scraped are appearing again. This becomes problematic when I scrape around 100 products from a page and then scrape another 100 products from the same page, resulting in duplicates.

As a beginner, I would like to know if it is possible to configure the scraper to avoid these duplicates.

Thank you in advance for your response.

lukas.prusa

Hi, thanks for opening this issue!

Unfortunately, duplicates are a normal part of web scraping and are almost impossible to avoid. Do you want to simply filter them out, or to not scrape them at all? A simple solution is to use a tool like the Duplications Checker and filter the results by each product's ASIN.

If you want to avoid scraping them at all (essentially, to not spend any credits on them), that is sadly not possible. Simply put, Amazon offers no such functionality, so we are forced to search their pages "aimlessly".

I hope this helps, thanks and happy scraping!
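For anyone who prefers to filter in their own code instead of using a separate tool, the ASIN-based filtering described above can be sketched roughly as follows. This is a minimal illustration, not the Duplications Checker's implementation: it assumes each scraped item is a dictionary with an `asin` field, and both the `dedupe_by_asin` helper and the sample ASINs are hypothetical placeholders.

```python
from typing import Iterable, Iterator


def dedupe_by_asin(items: Iterable[dict]) -> Iterator[dict]:
    """Yield each item the first time its ASIN is seen; skip later repeats."""
    seen: set[str] = set()
    for item in items:
        asin = item.get("asin")  # assumed field name, adjust to the real output
        if asin is None:
            # No key to compare on, so pass the item through rather than drop it.
            yield item
            continue
        if asin in seen:
            continue
        seen.add(asin)
        yield item


# Placeholder data standing in for two runs over the same page.
first_run = [
    {"asin": "B000000001", "title": "A"},
    {"asin": "B000000002", "title": "B"},
]
second_run = [
    {"asin": "B000000002", "title": "B"},
    {"asin": "B000000003", "title": "C"},
]

unique = list(dedupe_by_asin(first_run + second_run))
print([item["asin"] for item in unique])
# prints ['B000000001', 'B000000002', 'B000000003']
```

Note that this only cleans up the results after the fact; as explained above, the duplicates are still scraped.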

sinclairgsm

4 months ago

Thank you for your reply. I will filter the scraping results to avoid duplicates in my database. Have a nice day!

lukas.prusa

Great, filtering in a database is the proper way to do it ;) Good luck and have a nice day too!
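One possible way to do the database-side filtering mentioned in this exchange is to make the ASIN a unique key and let the database ignore repeats. The sketch below uses Python's built-in `sqlite3` module purely as an example; the `products` table, its columns, the `save_products` helper, and the sample ASINs are all assumptions for illustration, and the `asin` and `title` field names should be checked against the actual scraper output.

```python
import sqlite3


def save_products(conn: sqlite3.Connection, items: list[dict]) -> int:
    """Insert items, skipping ASINs already stored. Returns the number of new rows."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS products (asin TEXT PRIMARY KEY, title TEXT)"
    )
    before = conn.execute("SELECT COUNT(*) FROM products").fetchone()[0]
    # INSERT OR IGNORE silently skips rows that would violate the primary key.
    conn.executemany(
        "INSERT OR IGNORE INTO products (asin, title) VALUES (:asin, :title)",
        items,
    )
    conn.commit()
    after = conn.execute("SELECT COUNT(*) FROM products").fetchone()[0]
    return after - before


conn = sqlite3.connect(":memory:")  # in-memory database, for demonstration only

added_first = save_products(conn, [
    {"asin": "B000000001", "title": "A"},
    {"asin": "B000000002", "title": "B"},
])
added_second = save_products(conn, [
    {"asin": "B000000002", "title": "B"},  # already stored, will be ignored
    {"asin": "B000000003", "title": "C"},
])
print(added_first, added_second)
# prints 2 1
```

With this design the second run can be loaded as-is, and only products that were not stored before end up as new rows.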

Pricing

Pricing model

Pay per result 

This Actor is paid per result. You are not charged for Apify platform usage; you pay only a fixed price per 1,000 items in the Actor's output dataset.

Price per 1,000 items

$10.00
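As a rough illustration of what this pricing means for the duplicate scenario discussed in the issue, the calculation below estimates the cost of a run from its result count. It assumes the price is prorated linearly per item and that duplicate items count as results like any others; neither detail is confirmed by the pricing text, and `estimate_cost` is a hypothetical helper.

```python
from decimal import Decimal

PRICE_PER_1000 = Decimal("10.00")  # USD, taken from the pricing section


def estimate_cost(result_count: int, price_per_1000: Decimal = PRICE_PER_1000) -> Decimal:
    """Estimate the cost in USD, assuming linear per-item proration."""
    if result_count < 0:
        raise ValueError("result_count must not be negative")
    return (price_per_1000 * result_count / 1000).quantize(Decimal("0.01"))


# Two runs of 100 results each, as in the issue above.
print(estimate_cost(100 + 100))
# prints 2.00
```

Under these assumptions, two runs of 100 results cost about $2.00 in total, whether or not some of the second run's items repeat the first.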