Page Rank Actor avatar
Page Rank Actor

Under maintenance

Pricing

Pay per usage

Go to Store
Page Rank Actor

Page Rank Actor

Under maintenance

Developed by

Ludvík Prokopec

Maintained by Community

Page rank Actor ranks URLs that satisfy the glob patterns.

0.0 (0)

Pricing

Pay per usage

0

Monthly users

1

Runs succeeded

>99%

Last modified

a month ago

Start URLs

startUrlsarrayOptional

URLs to start with.

Alpha

alphastringOptional

The damping factor used in the PageRank algorithm, which determines the probability that a user will continue to follow links.

Default value of this property is "0.85"

Epsilon

epsilonstringOptional

A small positive value used to control the precision of the PageRank algorithm, helping to determine when the algorithm has converged. It is used as a threshold for stopping the iterative computation process. The default value is Number.EPSILON * 10.

Default value of this property is "2.220446049250313e-15"

Include URLs (globs)

includeUrlGlobsarrayOptional

Glob patterns matching URLs of pages that will be included in crawling.

Setting this option will disable the default Start URLs based scoping and will allow you to customize the crawling scope yourself. Note that this affects only links found on pages, but not Start URLs - if you want to crawl a page, make sure to specify its URL in the Start URLs field.

For example https://{store,docs}.example.com/** lets the crawler to access all URLs starting with https://store.example.com/ or https://docs.example.com/, and https://example.com/**/*\?*foo=* allows the crawler to access all URLs that contain foo query parameter with any value.

Learn more about globs and test them here.

Exclude URLs (globs)

excludeUrlGlobsarrayOptional

Glob patterns matching URLs of pages that will be excluded from crawling. Note that this affects only links found on pages, but not Start URLs, which are always crawled.

For example https://{store,docs}.example.com/** excludes all URLs starting with https://store.example.com/ or https://docs.example.com/, and https://example.com/**/*\?*foo=* excludes all URLs that contain foo query parameter with any value.

Learn more about globs and test them here.

Max pages

maxCrawlPagesintegerOptional

The maximum number pages to crawl. It includes the start URLs, pagination pages, pages with no content, etc. The crawler will automatically finish after reaching this number. This setting is useful to prevent accidental crawler runaway.

Default value of this property is 9999999

Max crawling depth

maxCrawlDepthintegerOptional

The maximum number of links starting from the start URL that the crawler will recursively follow. The start URLs have depth 0, the pages linked directly from the start URLs have depth 1, and so on.

This setting is useful to prevent accidental crawler runaway. By setting it to 0, the Actor will only crawl the Start URLs.

Default value of this property is 20

Link selector

linkSelectorstringOptional

A CSS selector matching links to be enqueued.

Default value of this property is "a"

Pricing

Pricing model

Pay per usage

This Actor is paid per platform usage. The Actor is free to use, and you only pay for the Apify platform usage.