Reddit Scraper avatar

Reddit Scraper

Try for free

1 day trial then $45.00/month - No credit card required now

View all Actors
Reddit Scraper

Reddit Scraper

trudax/reddit-scraper
Try for free

1 day trial then $45.00/month - No credit card required now

Unlimited Reddit web scraper to crawl posts, comments, communities, and users without login. Limit web scraping by number of posts or items and extract all data in a dataset in multiple formats.

OK

Getting multiple duplicates for the same comment

Closed

ons_kharrat opened this issue
a month ago

I have scraped 20k items which are mostly comments. When performing data cleaning, I have noticed that only 1k of these items are unique and the other 19K are just multiple duplicates of the same unique items. Some comment appeared more than 100 times! How can I avoid this issue?

trudax avatar

Can you share your run ID so I can take a look at what is happening?

OK

ons_kharrat

a month ago

Hey! My run id is V9An8uU7tUqWgw58R I have also run the actor again to get more data, will let you know if I face any issues with the new data.

OK

ons_kharrat

a month ago

Hey, I have faced another issue with my new scraping, it is scraping comments, but not scraping the posts these comments belong to. Is there a way to avoid this?

trudax avatar

I have fixed the duplication issue.

Developer
Maintained by Community
Actor metrics
  • 318 monthly users
  • 52 stars
  • 99.9% runs succeeded
  • 1 days response time
  • Created in Feb 2022
  • Modified about 11 hours ago
Categories