Tripadvisor Scraper

Pay $3.00 for 1,000 results

maxcopell/tripadvisor

This unofficial Tripadvisor API is a data extraction tool able to get data on hotels, restaurants, things to do, vacation rentals, attractions, tours, and public trips. Get pricing, contact details, amenities, awards, ratings, and more. Download your data in Excel, JSON, CSV, and other formats.

Few results

Open

Luke0092 opened this issue
22 days ago

Hello, I tried to scrape the page https://www.tripadvisor.com/Hotels-g187791-Rome_Lazio-Hotels.html. As you can see, there are 8,340 properties, but I get only 3,980 results. Can you help me?

lukas.prusa

Hi Luke, thanks for opening this issue!

It seems the scraper failed on the listing page at offset 3990 and got 0 results from it. We've seen a similar thing happen in the past, so we have a good idea of what is causing it. We will fix this.

If you want to finish the rest of the scrape, just continue with this URL, which already has the pagination set in it: https://www.tripadvisor.com/Hotels-g187791-oa3990-Rome_Lazio-Hotels.html

I will keep you updated here, thanks!

Luke0092

21 days ago

Thank you for your reply. If I restart the scrape with this link, https://www.tripadvisor.com/Hotels-g187791-oa3990-Rome_Lazio-Hotels.html, will I pay again for the same 3,980 results I have already scraped? How can I avoid scraping these results again? Thank you very much.

lhotanok

Hello, don't worry - if you provide https://www.tripadvisor.com/Hotels-g187791-oa3990-Rome_Lazio-Hotels.html as a start URL, you won't be charged for the results from previous pages as they won't be scraped at all. Just do a new run with this start URL.

For each run, you're charged for the actual number of results stored in the dataset of that particular run. If you start on the 134th page (which corresponds to the URL with offset oa3990), the Actor won't crawl the listings from the previous pages 1-133. It will directly open the 134th page, scrape the results from there, and then continue with the 135th page (https://www.tripadvisor.com/Hotels-g187791-oa4020-Rome_Lazio-Hotels.html).
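
If you need to build such a start URL yourself, here is a rough Python sketch of the page-to-offset mapping described above. The helper names are made up for illustration, and the page size of 30 is only inferred from the oa3990 to oa4020 step in this thread, so treat it as an assumption rather than a documented constant.

```python
import re

# Assumption: 30 listings per page, inferred from oa3990 -> oa4020 in this thread.
PAGE_SIZE = 30


def page_to_offset(page: int, page_size: int = PAGE_SIZE) -> int:
    """Convert a 1-based page number to the offset used in the -oaN- URL segment."""
    if page < 1:
        raise ValueError("page numbers start at 1")
    return (page - 1) * page_size


def offset_to_page(offset: int, page_size: int = PAGE_SIZE) -> int:
    """Convert an offset back to its 1-based page number."""
    return offset // page_size + 1


def with_offset(listing_url: str, offset: int) -> str:
    """Return the listing URL with its -oaN- segment set to the given offset.

    Hypothetical helper: it assumes the URL shape seen in this thread,
    where the offset segment follows the -g<id> segment.
    """
    # Drop any existing offset segment first.
    url = re.sub(r"-oa\d+(?=-)", "", listing_url, count=1)
    if offset == 0:
        return url
    # Re-insert the offset right after the -g<id> segment.
    return re.sub(r"(-g\d+)(?=-)", rf"\1-oa{offset}", url, count=1)


base = "https://www.tripadvisor.com/Hotels-g187791-Rome_Lazio-Hotels.html"
resume_url = with_offset(base, page_to_offset(134))
print(resume_url)
```

With the Rome URL from this issue, page 134 maps to offset 3990, which gives the same start URL as suggested above.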

You can take a look at the example run: https://console.apify.com/view/runs/GTwglY6AJEX7ude0Q

I used the suggested start URL https://www.tripadvisor.com/Hotels-g187791-oa3990-Rome_Lazio-Hotels.html and the Actor logged the following messages:

Loaded listing page, estimated total number of results: 8341 {"url":"https://www.tripadvisor.com/Hotels-g187791-oa3990-Rome_Lazio-Hotels.html","loadedOffset":3990,"desiredOffset":3990}
Enqueued next listing page {"nextPage":"https://www.tripadvisor.com/Hotels-g187791-oa4020-Rome_Lazio-Hotels.html","nextPageUserData":{"inputQueryOrUrl":"https://www.tripadvisor.com/Hotels-g187791-oa3990-Rome_Lazio-Hotels.html","hasOnlyNearbyResults":false,"label":"WEB_LISTINGS"}}
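
To check a resumed run from its log, something like the following Python sketch could work. It assumes the log lines have the shape shown above (a message followed by a JSON object), and it uses the $3.00 per 1,000 results price from the Actor page. The function names are hypothetical, and the total is only the Actor's estimate, so the resulting cost is a rough guess, not a quote.

```python
import json
import re

# Price taken from the Actor page: $3.00 for 1,000 results.
PRICE_PER_1000_USD = 3.00


def parse_log_payload(line: str) -> dict:
    """Return the JSON object that trails the log message (hypothetical helper)."""
    return json.loads(line[line.index("{"):])


def parse_estimated_total(line: str) -> int:
    """Extract the estimated total number of results from the message text."""
    match = re.search(r"estimated total number of results: (\d+)", line)
    if match is None:
        raise ValueError("no estimated total found in log line")
    return int(match.group(1))


log_line = (
    'Loaded listing page, estimated total number of results: 8341 '
    '{"url":"https://www.tripadvisor.com/Hotels-g187791-oa3990-Rome_Lazio-Hotels.html",'
    '"loadedOffset":3990,"desiredOffset":3990}'
)

payload = parse_log_payload(log_line)
# The run resumed where intended if the loaded offset matches the desired one.
resumed_correctly = payload["loadedOffset"] == payload["desiredOffset"]

# Results still to be scraped, and a rough cost estimate for the new run.
remaining = parse_estimated_total(log_line) - payload["loadedOffset"]
estimated_cost = remaining / 1000 * PRICE_PER_1000_USD
print(resumed_correctly, remaining, round(estimated_cost, 2))
```

The actual charge depends on the number of results stored in the run's dataset, which may differ from the estimate.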

Luke0092

18 days ago

Perfect! thank you very much!

Developer
Maintained by Apify
Actor metrics
  • 387 monthly users
  • 58 stars
  • 97.0% runs succeeded
  • 3.1 days response time
  • Created in Nov 2019
  • Modified 3 days ago