Github Search Scraper avatar

Github Search Scraper

Try for free

1 day trial then $25.00/month - No credit card required now

Go to Store
Github Search Scraper

Github Search Scraper

saswave/github-search-scraper
Try for free

1 day trial then $25.00/month - No credit card required now

Github search scraper. Get all data from search results list

LE

Fails on page 10 every time

Closed

aleksandrmoshkov opened this issue
2 months ago

Please see the actor run. I made a resurrect attempt, but it fails at the same page.

saswave avatar

SASWAVE (saswave)

2 months ago

We are looking into it, probably anti bot protections

LE

aleksandrmoshkov

2 months ago

I successfully added my cookies, which improved the process, but there were still some errors. In the end, I managed to run 100 pages. I think it would be helpful to include a line in the README stating that the antibot sometimes fails and that it's better to add a cookie.

saswave avatar

SASWAVE (saswave)

2 months ago

We found the issue, nothing related to anti bot protections:

When not providing cookies, github sometimes returns 429 responses to slow the requests made to the server

Initially, we ask cookies only if you need to search a url with type=code (mandatory for the extraction to work)

Now you should be able to handle the 100 pages without cookies

Developer
Maintained by Community

Actor Metrics

  • 3 monthly users

  • 4 stars

  • 89% runs succeeded

  • 10 hours response time

  • Created in Mar 2024

  • Modified 2 months ago