Parsera

This Actor is paid per event

parsera-labs/parsera

Extract Data from ANY website with Parsera.org

Parsera Actor

Extract structured data from any website using Parsera's AI-powered data extraction API.

Example

Enter the URL you want to scrape in Basic Configuration > Target URL, and list the columns to extract in Extraction Settings > Extraction Attributes. For example, to extract the list of articles from https://news.ycombinator.com/, put that URL into Target URL and fill Extraction Attributes with:

[
    {
        "description": "News title",
        "name": "title"
    },
    {
        "description": "Number of points",
        "name": "points"
    },
    {
        "description": "Number of comments",
        "name": "nr_comments"
    }
]

At the end, you'll get a table that looks like this:

nr_comments  points  title
1141The Inevitability of the Borrow Checker
119When Louis Armstrong Conquered Chicago
448689Meta torrented & seeded 81.7 TB dataset containing copyrighted data
.........
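
The Hacker News example above can also be started programmatically. The following is a minimal sketch, assuming Apify's v2 REST endpoint `run-sync-get-dataset-items` and the actor ID `parsera-labs~parsera` (the `~` form of `parsera-labs/parsera`). The helper names `build_request` and `run_actor` are hypothetical, and the assumption that each returned item is a dict keyed by attribute name is inferred from the table above, not confirmed by this page.

```python
import json
import urllib.parse
import urllib.request

# Assumption: Apify API v2 endpoint that runs an actor and returns its
# dataset items in one call; "~" replaces "/" in the actor ID.
API_URL = (
    "https://api.apify.com/v2/acts/parsera-labs~parsera/"
    "run-sync-get-dataset-items"
)

# Same input as the Hacker News example in this section.
HN_INPUT = {
    "url": "https://news.ycombinator.com/",
    "attributes": [
        {"name": "title", "description": "News title"},
        {"name": "points", "description": "Number of points"},
        {"name": "nr_comments", "description": "Number of comments"},
    ],
}


def build_request(actor_input: dict, token: str) -> urllib.request.Request:
    """Prepare (but do not send) the POST request that starts a run."""
    query = urllib.parse.urlencode({"token": token})
    return urllib.request.Request(
        f"{API_URL}?{query}",
        data=json.dumps(actor_input).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def run_actor(actor_input: dict, token: str, timeout: float = 300.0) -> list:
    """Send the request and return the parsed JSON response."""
    request = build_request(actor_input, token)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

Calling `run_actor(HN_INPUT, token)` with a valid Apify API token should return the extracted rows. Because the run is paid per event, it is worth trying it on a single page first.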

📝 Input Configuration

The actor accepts the following input parameters:

Field         | Type    | Required | Description
url           | String  | Yes      | The target URL to extract data from
attributes    | Array   | Yes      | List of data attributes to extract
proxyCountry  | String  | No       | Country for proxy IP (defaults to United States)
cookies       | Array   | No       | Cookies to inject into the request
precisionMode | Boolean | No       | Enable high-precision extraction mode
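
As an illustration of how these parameters fit together, here is a small sketch that assembles an input object and checks the types listed above. The function `build_input` and its Python-side argument names are hypothetical. Only the five field names come from this page. The shape of individual cookie entries and the accepted `proxyCountry` values are not specified here, so they are passed through unchanged.

```python
from typing import Optional


def build_input(
    url: str,
    attributes: list,
    proxy_country: Optional[str] = None,
    cookies: Optional[list] = None,
    precision_mode: Optional[bool] = None,
) -> dict:
    """Assemble an actor input dict, checking the documented field types."""
    if not isinstance(url, str) or not url:
        raise ValueError("url is required and must be a non-empty string")
    if not isinstance(attributes, list):
        raise ValueError("attributes is required and must be a list")

    actor_input = {"url": url, "attributes": attributes}

    # Optional fields are only included when set, so that the actor's own
    # defaults apply otherwise.
    if proxy_country is not None:
        if not isinstance(proxy_country, str):
            raise TypeError("proxyCountry must be a string")
        actor_input["proxyCountry"] = proxy_country
    if cookies is not None:
        if not isinstance(cookies, list):
            raise TypeError("cookies must be a list")
        actor_input["cookies"] = cookies
    if precision_mode is not None:
        if not isinstance(precision_mode, bool):
            raise TypeError("precisionMode must be a boolean")
        actor_input["precisionMode"] = precision_mode

    return actor_input
```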

Attributes Structure

Each attribute in the attributes array should have:

  • name: Identifier for the extracted data
  • description: Natural language description of what to extract
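
When there are many columns, it can be convenient to keep them as a simple name-to-description mapping and convert that into the attributes array. A minimal sketch follows. The helper `make_attributes` is hypothetical, and rejecting blank names or descriptions is an assumption of this sketch, not a documented rule.

```python
def make_attributes(columns: dict) -> list:
    """Turn {name: description} into the list-of-objects form shown above."""
    attributes = []
    for name, description in columns.items():
        # Assumption: both parts should be non-blank to be useful.
        if not name.strip() or not description.strip():
            raise ValueError(f"blank name or description for {name!r}")
        attributes.append({"name": name, "description": description})
    return attributes


hn_attributes = make_attributes({
    "title": "News title",
    "points": "Number of points",
    "nr_comments": "Number of comments",
})
```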

💡 Tips

  • Use precise, detailed descriptions in your attributes for better extraction accuracy
  • Enable precisionMode for highest accuracy (uses more credits)
  • Test your extraction pattern on a few pages before running large-scale scrapes
  • Response time depends mainly on the LLM output, so it increases when you're collecting a lot of data. We're working on a code generation system to return data instantly, so stay tuned and sign up for news at https://parsera.org!
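
One way to act on the "test on a few pages" tip is to check how often each attribute comes back empty in a small sample before scaling up. The sketch below assumes each result row is a dict keyed by attribute name. The helper `missing_rates` is hypothetical and the sample rows are made up for illustration.

```python
def missing_rates(rows: list, names: list) -> dict:
    """Share of rows (0.0 to 1.0) in which each attribute is absent or empty."""
    if not rows:
        return {name: 1.0 for name in names}
    rates = {}
    for name in names:
        missing = sum(1 for row in rows if row.get(name) in (None, ""))
        rates[name] = missing / len(rows)
    return rates


# Made-up sample rows, not real extraction output.
sample = [
    {"title": "A", "points": "10", "nr_comments": "3"},
    {"title": "B", "points": "", "nr_comments": None},
]
print(missing_rates(sample, ["title", "points", "nr_comments"]))
# {'title': 0.0, 'points': 0.5, 'nr_comments': 0.5}
```

A high missing rate for one attribute usually suggests rewording its description or trying precisionMode on the same sample.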

📊 Usage Limits

🤝 Support

Developer
Maintained by Community

Actor Metrics

  • 4 monthly users

  • 4 bookmarks

  • >99% runs succeeded

  • Created in Nov 2024