AliExpress Scraper avatar

AliExpress Scraper

Try for free

3 days trial then $30.00/month - No credit card required now

View all Actors
AliExpress Scraper

AliExpress Scraper

epctex/aliexpress-scraper
Try for free

3 days trial then $30.00/month - No credit card required now

Effortlessly extract descriptions, images, feedback, questions, prices, and shipping information from AliExpress. Customize country, language, and region preferences for enhanced data gathering.

IR

Scraper keeps getting blocked

Closed

impressive_rainforest opened this issue
2 years ago

All recent runs are being blocked and not successfully scraping any product information

TU

tugkan

2 years ago

Hey Matt Payne,

Can you please send us one/two run IDs so that we can investigate?

Best

IR

impressive_rainforest

2 years ago

IQWJXg8EsBYVMdWJk 0WpeK3TF0NC8imd6K mhZn6mwrgUZvFxGT0 WTfrpp8KU3ubofjfm

IR

impressive_rainforest

2 years ago

A more recent run: qovTyrNWbcOHCtcMb

we are still encountering this error: Reclaiming failed request back to the list or queue. We got blocked. Retrying.

IR

impressive_rainforest

2 years ago

I went back in my run history, on 2023-03-01 22:47 run number "6llGGtPE3UEcKA9Fi" was the last successful run I had. This was using version 1.0.162. I tried using this older version again (new run ID: "hPoCgXzXU7ssYCgqQ"), but this also failed.

On top of these requests failing, they are all fasely reporting as successful with the exit codes you are using

TU

tugkan

2 years ago

Hey Matt Payne!

Thank you very much for using this public actor. If possible, can you please add https:// at the beginning of the startUrls and try it again?

Awaiting your response. Best

IR

impressive_rainforest

2 years ago

yep that works, why is there no error handling / better error messages or documentation on this? feel like that is an easy one to catch

IR

impressive_rainforest

2 years ago

I am still getting intermittent issues after adding the https:// at the beginning of the startUrls. Here are two run numbers Wdmk6seNwWlHxIGj5. XAHVWIyZdUi3DykS4. This error is causing great delay in the time it takes to scrape these pages

TU

tugkan

2 years ago

Hey Matt Payne,

The error handling is completely our fault. I'll pass this problem as a ticket to the team. About the intermittent issues, that is expected. All the public actors have the potential to get blocked. That's why it contains the retry mechanisms and proxies in place. About the delay on the scrape, I checked the logs of the run and it seems like; scraping takes 40 seconds but the build time of the actor takes 2 minutes. Unfortunately, this is completely up to the Apify platform and there is not much to do on our side.

Best

IR

impressive_rainforest

2 years ago

Is there a private option?

TU

tugkan

2 years ago

Do you mean getting the same service from somewhere else? If so, unfortunately not.

Developer
Maintained by Community
Actor metrics
  • 52 monthly users
  • 13 stars
  • 86.3% runs succeeded
  • 5.5 hours response time
  • Created in Oct 2019
  • Modified about 7 hours ago