Page Rank Actor

Under maintenance

Developed by

Ludvík Prokopec

Maintained by Community

Page rank Actor ranks URLs that satisfy the glob patterns.

0.0 (0)

0

Monthly users

1

Runs succeeded

>99%

Last modified

25 days ago

Apify Page rank Actor

Apify Page rank Actor is a crawler that ranks web pages using their inlink and outlink data. It crawls the web pages, calculates the PageRank of each page, and keeps only the pages that meet the specified restrictions. The final output includes each page's URL, rank, inlinks, and outlinks, sorted by rank in descending order.
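
The restriction step can be pictured as a simple glob match over URLs. The sketch below is only an illustration and not the Actor's code: the function name `filter_urls` and the sample URLs are hypothetical, and Python's `fnmatch` is used as a stand-in whose wildcard rules (for example, `*` also matches `/`) may differ from the glob syntax the Actor accepts.

```python
from fnmatch import fnmatchcase


def filter_urls(urls, patterns):
    """Keep only the URLs that match at least one glob pattern (hypothetical helper)."""
    return [u for u in urls if any(fnmatchcase(u, p) for p in patterns)]


# Sample input, chosen for illustration only.
urls = [
    "https://apify.com/",
    "https://apify.com/store",
    "https://console.apify.com/sign-in",
]

# Assumed pattern: everything under https://apify.com/
kept = filter_urls(urls, ["https://apify.com/*"])
print(kept)
```

Only the two URLs under `https://apify.com/` pass this filter; the console URL is dropped because it does not match the pattern.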

Use Cases

  • SEO Analysis: Identify the most influential pages on a website based on page rank.
  • Website Optimization: Discover important pages with weak internal linking and improve site structure.
  • Competitor Research: Analyze competitor websites to understand their link hierarchy and page importance.
  • Content Strategy: Prioritize high-ranking pages for content updates or linking strategies.
  • Web Archiving: Rank and filter pages efficiently for web scraping and archiving purposes.
  • Data Science & Research: Use page rank data for academic studies or data-driven decision-making.
  • Social Media Profiles: Use page rank data to measure the popularity of social media profiles.

Features

  • Crawl Web Pages: The Actor crawls a set of web pages.
  • PageRank Calculation: The PageRank algorithm ranks the pages based on their incoming links (inlinks) and outgoing links (outlinks).
  • Restriction-based Filtering: Only pages that meet the specified restrictions (such as URL glob patterns) are processed.
  • Sorted Output: The output is a list of web pages sorted by PageRank, showing each page's URL, calculated PageRank, inlinks, and outlinks.
  • Inlinks/Outlinks Tracking: The Actor keeps track of the inlinks and outlinks of each page.
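
For readers unfamiliar with PageRank, the calculation can be sketched as a power iteration over the link graph. This is a minimal textbook version under stated assumptions, not the Actor's implementation: the damping factor of 0.85, the fixed iteration count, the treatment of pages without outlinks, and the example.com graph are all assumptions made for illustration.

```python
def pagerank(outlinks, damping=0.85, iterations=50):
    """Textbook power-iteration PageRank.

    `outlinks` maps each page to the list of pages it links to.
    Every link target must also appear as a key.
    """
    pages = list(outlinks)
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        # Rank held by pages without outlinks is spread evenly over all pages.
        dangling = sum(rank[p] for p in pages if not outlinks[p])
        new_rank = {p: (1 - damping) / n + damping * dangling / n for p in pages}
        for p in pages:
            targets = outlinks[p]
            for t in targets:
                # Each page passes an equal share of its rank to every outlink.
                new_rank[t] += damping * rank[p] / len(targets)
        rank = new_rank
    return rank


# Hypothetical three-page site: both subpages link back to the home page.
graph = {
    "https://example.com/": ["https://example.com/store", "https://example.com/docs"],
    "https://example.com/store": ["https://example.com/"],
    "https://example.com/docs": ["https://example.com/"],
}

ranks = pagerank(graph)
# Sort by rank in descending order, as the Actor's output is described.
ranked = sorted(ranks.items(), key=lambda kv: kv[1], reverse=True)
for url, score in ranked:
    print(url, round(score, 4))
```

In this toy graph the home page ends up first because both other pages link to it, and the scores sum to 1.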

Output

The output consists of a list of crawled pages, sorted by PageRank, and includes the following details for each page:

Property     Type       Description
url          String     The URL of the crawled webpage.
importance   Integer    A ranking score where a lower value means higher importance.
pageRank     Float      The calculated PageRank score, representing the authority of the page.
inlinks      String[]   An array of URLs linking to this page (incoming links).
outlinks     String[]   An array of URLs that this page links out to (outgoing links).

Example Output

Datasets

{
    "url": "https://apify.com/",
    "importance": 1,
    "pageRank": 0.00011726428097546502,
    "inlinks": [
        "https://apify.com/store",
        "https://apify.com/actors",
        ...
    ],
    "outlinks": [
        "https://console.apify.com/sign-up",
        "https://console.apify.com/sign-in",
        ...
    ]
}
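
As one way to use records of this shape, the sketch below looks for highly ranked pages that have few inlinks, in the spirit of the Website Optimization use case. The field names follow the output table; the helper `weakly_linked`, its thresholds, and every value in `items` are made up for illustration and are not real Actor output.

```python
# Hypothetical dataset items in the documented shape (values are invented).
items = [
    {
        "url": "https://example.com/",
        "importance": 1,
        "pageRank": 0.5,
        "inlinks": ["https://example.com/a", "https://example.com/b"],
        "outlinks": ["https://example.com/a"],
    },
    {
        "url": "https://example.com/a",
        "importance": 2,
        "pageRank": 0.3,
        "inlinks": ["https://example.com/"],
        "outlinks": ["https://example.com/"],
    },
    {
        "url": "https://example.com/b",
        "importance": 3,
        "pageRank": 0.2,
        "inlinks": [],
        "outlinks": ["https://example.com/"],
    },
]


def weakly_linked(items, top_n, max_inlinks):
    """Return URLs among the top_n most important pages with at most max_inlinks inlinks."""
    # A lower importance value means a more important page.
    top = sorted(items, key=lambda it: it["importance"])[:top_n]
    return [it["url"] for it in top if len(it["inlinks"]) <= max_inlinks]


candidates = weakly_linked(items, top_n=2, max_inlinks=1)
print(candidates)
```

Here the second-ranked page is flagged because only one page links to it, which could make it a candidate for additional internal links.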

HTML Graph

The Actor generates an interactive HTML page that visualizes a graph of web pages and their connections.

  • Nodes represent individual web pages.
  • Edges represent links between pages (inlinks and outlinks).
  • Node size is based on its PageRank value—larger nodes indicate more important pages.
  • Clicking a node toggles the visibility of all its inlinks and outlinks.
  • Hovering over a node displays the URL of that page.

HTML Graph Screenshot

GEXF Graph

The Actor also generates a GEXF file, an XML-based format that represents the graph structure.

This file can be used to create custom visualizations, analyze network data, or import the graph into various tools supporting GEXF.

<?xml version="1.0" encoding="UTF-8"?>
<gexf version="1.2" xmlns="http://www.gexf.net/1.2draft" xmlns:viz="http:///www.gexf.net/1.1draft/viz">
  <meta/>
  <graph defaultedgetype="directed">
    <attributes class="edge">
      <attribute id="source" title="source" type="string"/>
      <attribute id="target" title="target" type="string"/>
      <attribute id="type" title="type" type="string"/>
      <attribute id="hidden" title="hidden" type="boolean"/>
    </attributes>
    <nodes>
      <node id="n0" label="https://apify.com/">
        <viz:color r="74" g="201" b="104"/>
        <viz:size value="30"/>
        <viz:position x="-121.61664581298828" y="-51.96864318847656"/>
      </node>
      ...
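
A GEXF file like this can be read with any XML parser. The sketch below uses Python's standard `xml.etree.ElementTree` to list nodes and their sizes. It assumes the namespace URIs exactly as they appear in the excerpt above (including the `http:///` spelling of the viz namespace), so adjust them if the generated file differs. The `SAMPLE` document is a shortened, hypothetical stand-in: the second node, its size, and the edge are invented so that the snippet is complete and parseable.

```python
import xml.etree.ElementTree as ET

# Namespace URIs copied verbatim from the excerpt above (an assumption about the real file).
NS = {
    "gexf": "http://www.gexf.net/1.2draft",
    "viz": "http:///www.gexf.net/1.1draft/viz",
}

# Hypothetical, shortened GEXF document used only for this example.
SAMPLE = """<?xml version="1.0" encoding="UTF-8"?>
<gexf version="1.2" xmlns="http://www.gexf.net/1.2draft" xmlns:viz="http:///www.gexf.net/1.1draft/viz">
  <graph defaultedgetype="directed">
    <nodes>
      <node id="n0" label="https://apify.com/">
        <viz:size value="30"/>
      </node>
      <node id="n1" label="https://apify.com/store">
        <viz:size value="12"/>
      </node>
    </nodes>
    <edges>
      <edge id="e0" source="n1" target="n0"/>
    </edges>
  </graph>
</gexf>"""


def read_nodes(gexf_bytes):
    """Return id, label (the page URL), and viz size for every node."""
    root = ET.fromstring(gexf_bytes)
    nodes = []
    for node in root.iterfind(".//gexf:node", NS):
        size = node.find("viz:size", NS)
        nodes.append({
            "id": node.get("id"),
            "label": node.get("label"),
            "size": float(size.get("value")) if size is not None else None,
        })
    return nodes


nodes = read_nodes(SAMPLE.encode("utf-8"))
for n in nodes:
    print(n["id"], n["label"], n["size"])
```

For heavier analysis, the same file can be imported into graph tools that support GEXF instead of being parsed by hand.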

Pricing

Pricing model

Pay per usage

This Actor is paid per platform usage: the Actor itself is free to use, and you pay only for the Apify platform usage it consumes.