Tripadvisor Reviews Scraper avatar

Tripadvisor Reviews Scraper

Try for free

Pay $2.00 for 1,000 reviews

View all Actors
Tripadvisor Reviews Scraper

Tripadvisor Reviews Scraper

maxcopell/tripadvisor-reviews
Try for free

Pay $2.00 for 1,000 reviews

Get and download reviews for chosen places on Tripadvisor. Extract the review text, URL, rating, date of travel, published date, basic reviewer info, owner's response, helpful votes, images, review language, place details. Download reviews in XML, JSON, CSV.

Do you want to learn more about this Actor?

Get a demo
TP

The values for the Subratings Doesn't match

Closed

tasty_providence opened this issue
4 months ago
  1. The values for the subratings doesn't match
  2. The Json dump shows empty subratings for few reviews
TP

tasty_providence

4 months ago

Also i have attached the Json file for your referenve

lukas.prusa avatar

Hi Sojern,

thanks a lot for reporting this! It looks like the crawler is working as expected.

Some of the reviews, in fact, don't have any subratings. See the attached images.

As for the values that don't match, I wasn't able to find any and all of them look right. Could you please give me a concrete example of such review?

I hope this helps, thanks!

TO

tasty_providence-owner

4 months ago

Sure, the below attached images is a snapshot for the subratings in the site https://www.tripadvisor.com/Hotel_Review-g33252-d84460-Reviews-Le_Parc_at_Melrose-West_Hollywood_California.html, The other two screenshot are from the crawler results where one of the subratings is blank and other on has mismatched values.

Also I don't see anywhere in website for the subrating value except in the beginning. Thanks.

lukas.prusa avatar

Oh, you are mistaking review specific subratings with the overall average hotel subratings. I think the attached image will make the mistake quickly clear ;)

If you wish to extract the averaged hotel subratings, please use hotels Tripadvisor Scraper for it.

See an example run with this hotel: https://console.apify.com/view/runs/Sgt2NrORTVrxIXUIM

You can find the hotel review subratings under categoryReviewScores (it's a bit confusing with the switched names...)

TO

tasty_providence-owner

4 months ago

Thank you, but the actor doesn't provide me with other details which include customer reviews and management replies, which are most important to me. So, is there a way where you can make the changes to the crawler? It would be really helpful if you can look into it, as it is really important for our analysis.

TO

tasty_providence-owner

4 months ago

Tripadvisor Scraper https://console.apify.com/actors/dbEyMBriog95Fv8CW/console doesn't provide more information with detailed reviews and management response, so is it possible to add a category to the maxcopell/tripadvisor-reviews, in this way it would be really helpful for us to do our analysis.

lukas.prusa avatar

The Actors are currently split intentionally, because each of them scrape different pages and overall work differently. We would like to keep them separate.

Of course, you can manually run both Actors and then merge the datasets together, but that's not ideal. Generally, this kind of stuff should be handled automatically on the user's workflow end. I'm not sure how technical you are, but this would require some coding skills to create a new “orchestrator” Actor, which would manage running and merging of both of the TripAdvisor Actors.

I will discuss it with the team, because this could be a more common problem for TripAdvisor so we might add this Actor. If not then you will probably have to reach out to some freelancers on our discord.

I will keep you updated here, thanks!

TO

tasty_providence-owner

4 months ago

Thank you luke for sharing your inputs, while you discuss with the rest of the team regarding the solution for the problem, can you please share any documentation on how to Orchestrate an actor, I will look into it and get back to you if i have any questions. your help is much appreciated

Thanks.

lukas.prusa avatar

Unfortunately, it isn't really in our interest to add such an Actor to the store. Obviously, if more users would be interested, we might reconsider it.

Either way, here are some general steps and documentation, on how to set up such an orchestrator Actor:

  1. Create a new Actor: https://docs.apify.com/academy/getting-started/creating-actors
  2. In the code, call both of the Tripadvisor scrapers and get their run's default dataset IDs using actor.callTask (or alternatively actor.run, that's just preference): https://docs.apify.com/sdk/js/reference/next/class/Actor#callTask
  3. Merge the datasets. This depends on how you want to merge the data, either do it yourself in the code, or you can simply use the utility Merge, Dedup & Transform Datasets Actor with the previously finished dataset IDs.

I hope this helps, let me know if you have further questions regarding this matter. Good luck and happy scraping!

lukas.prusa avatar

Also, here is an absolutely most bare-bones example of calling another Actor from Actor: https://apify.com/apify/example-call/source-code

1const Apify = require('apify');
2
3Apify.main(async () => {
4    const run = await Apify.call('apify/hello-world', { greeting: 'Hello sir!' });
5    console.log('Apify.call() result:');
6    console.dir(run);
7});

Replace apify/hello-world with Tripadvisor scraper1, add call for the other one, afterward just call Merge, Dedup & Transform Datasets with the finished dataset IDs (which you can find under run.defaultDatasetId).

TO

tasty_providence-owner

4 months ago

Hi Luke,

Thank you for getting back to me regarding this issue, but i will have to use another actor which is going to affect my cost and i have around 600 links for which i need to scrape the data, So that's the reason i'm trying to use one single actor. Also it doesn't make any sense while not scraping the average ratings for the links while scraping individual ones.

So is there someone in your management i can talk to?

lukas.prusa avatar

I see your point, and it makes sense, it would be easier to have everything in one place. However, this would not be feasible for our current pricing, as we scrape reviews differently and independently to the place details.

Hence, I suggested you the possible data processing step to join the Actors together.

I will forward this issue further. Thank you!

lukas.prusa avatar

Hi again, we've decided to implement this under our Actor to Actor integration. Expect it early next ;)

I will keep you updated here, thanks!

lukas.prusa avatar

I'm happy to inform you that we've just updated the scraper with this feature ;)

To set it up, simply select the Reviews Scraper integration under the main TripAdvisor scraper here: https://console.apify.com/actors/dbEyMBriog95Fv8CW/webhooks

Do not add anything to start URLs as they will be used automatically. You can change the other settings as you wish.

The place place data will be available under the placeInfo output field.

Try it out and let me know how it works, thanks and happy scraping!

TO

tasty_providence-owner

4 months ago

Hi Lucas,

First of all, thank you for finding a solution as soon as possible. I tried to run the actor but encountered some issue with the same.

I have attached a link and Screenshot for the issue over here for your reference. ActorRun https://console.apify.com/organization/v9hqAJiffZGKLkDhd/actors/dbEyMBriog95Fv8CW/runs/abt9rNRAm5V2SaG53 .

Developer
Maintained by Apify
Actor metrics
  • 298 monthly users
  • 29 stars
  • 99.3% runs succeeded
  • 3.1 days response time
  • Created in Jan 2023
  • Modified 3 days ago
Categories