Page Rank Actor
Under maintenance
Pricing
Pay per usage
Page Rank Actor
Under maintenance
Page rank Actor ranks URLs that satisfy the glob patterns.
0.0 (0)
Pricing
Pay per usage
0
Monthly users
1
Runs succeeded
>99%
Last modified
a month ago
Alpha
alpha
stringOptional
The damping factor used in the PageRank algorithm, which determines the probability that a user will continue to follow links.
Default value of this property is "0.85"
Epsilon
epsilon
stringOptional
A small positive value used to control the precision of the PageRank algorithm, helping to determine when the algorithm has converged. It is used as a threshold for stopping the iterative computation process. The default value is Number.EPSILON * 10
.
Default value of this property is "2.220446049250313e-15"
Include URLs (globs)
includeUrlGlobs
arrayOptional
Glob patterns matching URLs of pages that will be included in crawling.
Setting this option will disable the default Start URLs based scoping and will allow you to customize the crawling scope yourself. Note that this affects only links found on pages, but not Start URLs - if you want to crawl a page, make sure to specify its URL in the Start URLs field.
For example https://{store,docs}.example.com/**
lets the crawler to access all URLs starting with https://store.example.com/
or https://docs.example.com/
, and https://example.com/**/*\?*foo=*
allows the crawler to access all URLs that contain foo
query parameter with any value.
Learn more about globs and test them here.
Exclude URLs (globs)
excludeUrlGlobs
arrayOptional
Glob patterns matching URLs of pages that will be excluded from crawling. Note that this affects only links found on pages, but not Start URLs, which are always crawled.
For example https://{store,docs}.example.com/**
excludes all URLs starting with https://store.example.com/
or https://docs.example.com/
, and https://example.com/**/*\?*foo=*
excludes all URLs that contain foo
query parameter with any value.
Learn more about globs and test them here.
Max pages
maxCrawlPages
integerOptional
The maximum number pages to crawl. It includes the start URLs, pagination pages, pages with no content, etc. The crawler will automatically finish after reaching this number. This setting is useful to prevent accidental crawler runaway.
Default value of this property is 9999999
Max crawling depth
maxCrawlDepth
integerOptional
The maximum number of links starting from the start URL that the crawler will recursively follow. The start URLs have depth 0
, the pages linked directly from the start URLs have depth 1
, and so on.
This setting is useful to prevent accidental crawler runaway. By setting it to 0
, the Actor will only crawl the Start URLs.
Default value of this property is 20
Pricing
Pricing model
Pay per usageThis Actor is paid per platform usage. The Actor is free to use, and you only pay for the Apify platform usage.