Website Content Crawler
No credit card required
Website Content Crawler
No credit card required
Crawl websites and extract text content to feed AI models, LLM applications, vector databases, or RAG pipelines. The Actor supports rich formatting using Markdown, cleans the HTML, downloads files, and integrates well with 🦜🔗 LangChain, LlamaIndex, and the wider LLM ecosystem.
Do you want to learn more about this Actor?
Get a demoHi, I am trying to crawl a page which clearly has links on it. My setting also allows it to enqueue upto depth 10. but the crawler isn't. I tried multiple things like changing the crawler type, proxy settings etc. but not able to get it to work.
My start URL is the issue actually. I think I can fix it.
Yes, that was the issue. I will close this.
- 3.8k monthly users
- 616 stars
- 99.9% runs succeeded
- 3.4 days response time
- Created in Mar 2023
- Modified 3 days ago