Website Content Crawler avatar
Website Content Crawler

Pricing

Pay per usage

Go to Store
Website Content Crawler

Website Content Crawler

Developed by

Apify

Apify

Maintained by Apify

Crawl websites and extract text content to feed AI models, LLM applications, vector databases, or RAG pipelines. The Actor supports rich formatting using Markdown, cleans the HTML, downloads files, and integrates well with 🦜🔗 LangChain, LlamaIndex, and the wider LLM ecosystem.

4.0 (41)

Pricing

Pay per usage

1638

Total users

65K

Monthly users

8.5K

Runs succeeded

>99%

Issues response

6.9 days

Last modified

3 days ago

TR

Really high failure rate

Closed

tree3 opened this issue
12 days ago

Hi, I've been using this actor for a couple of months now. Today, starting around 11am EST I'm seeing really high failure rates. I'm running 100's of crawls a day and although i see a couple fail today I'm seeing the majority fail.

Can you help?

TR

tree3

12 days ago

I switched from residential proxies to datacenter and that seems to have fixed it - you might wanna take a look at your residential proxy setup.

nick_slam avatar

We are very sorry for the inconvenience!

It's true that performance of residential proxies has suffered in the last day. It was due to some issues in our upstream proxy provider.

At the moment most of the performance is restored and it should be safe to switch back from datacenter proxies.

I'm going to close the issue for now, but if you still experience any significant issues, feel free to reopen it.