
Website Content Crawler
Pricing
Pay per usage

Website Content Crawler
Crawl websites and extract text content to feed AI models, LLM applications, vector databases, or RAG pipelines. The Actor supports rich formatting using Markdown, cleans the HTML, downloads files, and integrates well with 🦜🔗 LangChain, LlamaIndex, and the wider LLM ecosystem.
4.0 (41)
Pricing
Pay per usage
1638
Total users
65K
Monthly users
8.5K
Runs succeeded
>99%
Issues response
6.9 days
Last modified
3 days ago
Really high failure rate
Closed
Hi, I've been using this actor for a couple of months now. Today, starting around 11am EST I'm seeing really high failure rates. I'm running 100's of crawls a day and although i see a couple fail today I'm seeing the majority fail.
Can you help?
tree3
I switched from residential proxies to datacenter and that seems to have fixed it - you might wanna take a look at your residential proxy setup.
We are very sorry for the inconvenience!
It's true that performance of residential proxies has suffered in the last day. It was due to some issues in our upstream proxy provider.
At the moment most of the performance is restored and it should be safe to switch back from datacenter proxies.
I'm going to close the issue for now, but if you still experience any significant issues, feel free to reopen it.