XiaoHongShu Profile Scraper
1 day trial then $20.00/month - No credit card required now
This tool scrapes Xiaohongshu user profile pages and the detail pages of the notes those users publish. [小红书](https://www.xiaohongshu.com/) (Xiaohongshu) is a popular social e-commerce platform that combines user-generated content with online shopping, catering to young Chinese consumers.
After changing the value of startUrls, the crawl result is still the default profile's data.
Which start URL did you use? Give me an example start URL or a link to your run so I can check.
Here is my code snippet:

    const input = {
      startUrls: [
        {
          url: "https://www.xiaohongshu.com/user/profile/5bfb3280e7444b0001520768",
        },
      ],
      scrape_detail_page: false,
      download_image: false,
    };
    // Run the Actor and wait for it to finish
    const run = await client.actor("kuaima/xiaohongshu-profile").call(input);

I'm trying this URL now. By the way, is this your first time using this scraper?
I've made some updates. Try the latest version again and let me know if it works for you.
Yes, this is my first time using this scraper.
Did you try the latest version I just released? I think it should work for your URL now.
It works now. One more question: can the scraper also get the video link from the profile page?
It can get the video link if the post is a video, but I need to double-check; it may need some adjustment. It would be helpful if you could provide a sample profile link that includes a video post.
Sample profile link: https://www.xiaohongshu.com/user/profile/63761ee0000000001f01b7a4
Got it. I'll try this profile first.
Video link support is added now. You just need to set the input parameter scrape_detail_page to true.
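For anyone following along, here is a minimal sketch of an input with scrape_detail_page turned on. The field names (startUrls, scrape_detail_page, download_image) and the Actor ID come from the snippet earlier in this thread; the helper name buildInput is only for illustration and is not part of the Actor.

```javascript
// Build the Actor input for a single profile URL.
// Field names are copied from the snippet posted earlier in this thread;
// buildInput itself is a hypothetical helper, not part of the Actor.
function buildInput(profileUrl, { scrapeDetailPage = false, downloadImage = false } = {}) {
  return {
    startUrls: [{ url: profileUrl }],
    scrape_detail_page: scrapeDetailPage,
    download_image: downloadImage,
  };
}

// Enable detail-page scraping so video links can be collected.
const videoInput = buildInput(
  "https://www.xiaohongshu.com/user/profile/63761ee0000000001f01b7a4",
  { scrapeDetailPage: true },
);
console.log(JSON.stringify(videoInput, null, 2));

// With an authenticated apify-client instance named `client`,
// the run would then be started the same way as in the earlier snippet:
// const run = await client.actor("kuaima/xiaohongshu-profile").call(videoInput);
```

Only scrape_detail_page changes compared with the earlier snippet; download_image stays false here.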
Thanks, the video link works fine. Does the scraper support fetching notes by date range or by page, or does the default scraping behaviour try to get all notes from the profile?
Filtering notes by date range is possible. It needs some changes to the logic and code flow. I'll try it later.
It can now filter by date range. You need to enable scrape_detail_page first. Let me know if it works.
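The input parameter names for the built-in date filter are not given in this thread, so they are not shown here. As a fallback, the results can also be filtered after the run. The sketch below does that on the client side; the field name publishTime is an assumption for illustration only, so check the Actor's actual output for the real timestamp field.

```javascript
// Keep only items whose timestamp falls within [from, to], inclusive.
// `timeField` defaults to "publishTime", which is a HYPOTHETICAL field name;
// replace it with whatever timestamp field the Actor's output really contains.
function filterByDateRange(items, from, to, timeField = "publishTime") {
  const start = new Date(from).getTime();
  const end = new Date(to).getTime();
  return items.filter((item) => {
    const t = new Date(item[timeField]).getTime();
    // Items with a missing or unparseable timestamp are dropped.
    return !Number.isNaN(t) && t >= start && t <= end;
  });
}

// Made-up sample items, only to show the shape of the call.
const sampleNotes = [
  { id: "a", publishTime: "2023-06-01T00:00:00Z" },
  { id: "b", publishTime: "2023-08-15T00:00:00Z" },
  { id: "c" }, // no timestamp
];
const inRange = filterByDateRange(sampleNotes, "2023-07-01T00:00:00Z", "2023-12-31T23:59:59Z");
console.log(inRange.map((note) => note.id)); // prints [ 'b' ]
```

Filtering after the run still scrapes every note, so the Actor's own date filter is the better choice when it covers the need.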
If you need any other function or have another requirement, please open a new issue to discuss it.
Closing this issue as the problem is solved.
- 10 monthly users
- 4 stars
- 100.0% runs succeeded
- 65 days response time
- Created in Jun 2023
- Modified 4 days ago