Glassdoor ~ Indeed Jobs Scraper | Remove Duplicate Jobs | PPR

Pricing

$3.00 / 1,000 results

Try for free

Go to Apify Store

Glassdoor ~ Indeed Jobs Scraper | Remove Duplicate Jobs | PPR

Try for free

Developed by

cheap_scraper

Maintained by Community

Glassdoor ~ Indeed Jobs Scraper | Remove Duplicate Jobs. The Jobs Scraper allows you to collect jobs by providing one or more start URLs. 🔹 Jobs are solely scraped from Glassdoor, however Indeed and Glassdoor belong to the same company and share the exact same job postings.

1.7 (2)

Pricing

$3.00 / 1,000 results

Last modified

2 months ago

Jobs

You can access the Glassdoor ~ Indeed Jobs Scraper | Remove Duplicate Jobs | PPR programmatically from your own applications by using the Apify API. You can also choose the language preference from below. To use the Apify API, you’ll need an Apify account and your API token, found in Integrations settings in Apify Console.

Python

JavaScript

CLI

OpenAPI

HTTP

MCP

{
  "openapi": "3.0.1",
  "info": {
    "version": "0.0",
    "x-build-id": "baFI2XOCgoELryudZ"
  },
  "servers": [
    {
      "url": "https://api.apify.com/v2"
    }
  ],
  "paths": {
    "/acts/cheap_scraper~glassdoor-job-scraper/run-sync-get-dataset-items": {
      "post": {
        "operationId": "run-sync-get-dataset-items-cheap_scraper-glassdoor-job-scraper",
        "x-openai-isConsequential": false,
        "summary": "Executes an Actor, waits for its completion, and returns Actor's dataset items in response.",
        "tags": [
          "Run Actor"
        ],
        "requestBody": {
          "required": true,
          "content": {
            "application/json": {
              "schema": {
                "$ref": "#/components/schemas/inputSchema"
              }
            }
          }
        },
        "parameters": [
          {
            "name": "token",
            "in": "query",
            "required": true,
            "schema": {
              "type": "string"
            },
            "description": "Enter your Apify token here"
          }
        ],
        "responses": {
          "200": {
            "description": "OK"
          }
        }
      }
    },
    "/acts/cheap_scraper~glassdoor-job-scraper/runs": {
      "post": {
        "operationId": "runs-sync-cheap_scraper-glassdoor-job-scraper",
        "x-openai-isConsequential": false,
        "summary": "Executes an Actor and returns information about the initiated run in response.",
        "tags": [
          "Run Actor"
        ],
        "requestBody": {
          "required": true,
          "content": {
            "application/json": {
              "schema": {
                "$ref": "#/components/schemas/inputSchema"
              }
            }
          }
        },
        "parameters": [
          {
            "name": "token",
            "in": "query",
            "required": true,
            "schema": {
              "type": "string"
            },
            "description": "Enter your Apify token here"
          }
        ],
        "responses": {
          "200": {
            "description": "OK",
            "content": {
              "application/json": {
                "schema": {
                  "$ref": "#/components/schemas/runsResponseSchema"
                }
              }
            }
          }
        }
      }
    },
    "/acts/cheap_scraper~glassdoor-job-scraper/run-sync": {
      "post": {
        "operationId": "run-sync-cheap_scraper-glassdoor-job-scraper",
        "x-openai-isConsequential": false,
        "summary": "Executes an Actor, waits for completion, and returns the OUTPUT from Key-value store in response.",
        "tags": [
          "Run Actor"
        ],
        "requestBody": {
          "required": true,
          "content": {
            "application/json": {
              "schema": {
                "$ref": "#/components/schemas/inputSchema"
              }
            }
          }
        },
        "parameters": [
          {
            "name": "token",
            "in": "query",
            "required": true,
            "schema": {
              "type": "string"
            },
            "description": "Enter your Apify token here"
          }
        ],
        "responses": {
          "200": {
            "description": "OK"
          }
        }
      }
    }
  },
  "components": {
    "schemas": {
      "inputSchema": {
        "type": "object",
        "required": [
          "country"
        ],
        "properties": {
          "country": {
            "title": "Country",
            "enum": [
              "US",
              "AR",
              "AU",
              "AT",
              "BE",
              "BR",
              "CA",
              "DE",
              "ES",
              "FR",
              "HK",
              "IN",
              "IE",
              "IT",
              "MX",
              "NL",
              "NZ",
              "SA",
              "SG",
              "CH",
              "GB",
              "AE"
            ],
            "type": "string",
            "description": "Select country (default is USA) Enhance accuracy by selecting the associated country. Data is drawn directly from country-specific glassdoor.com platforms, ensuring relevant outcomes for your location. JSON field name: `country`"
          },
          "startUrls": {
            "title": "Start URLs",
            "type": "array",
            "description": "One or more URLs of the pages where the crawler will start. Note that the Actor will additionally crawl job page data.",
            "items": {
              "type": "object",
              "required": [
                "url"
              ],
              "properties": {
                "url": {
                  "type": "string",
                  "title": "URL of a web page",
                  "format": "uri"
                }
              }
            }
          },
          "saveOnlyUniqueItems": {
            "title": "Save Only Unique Items",
            "type": "boolean",
            "description": "If enabled, only unique items will be saved. Default is false. JSON field name: `saveOnlyUniqueItems`",
            "default": false
          },
          "maxItems": {
            "title": "Maximum Items",
            "type": "integer",
            "description": "Maximum number of items to scrape. Default is none, it will scrape all the jobs it finds. JSON field name: `maxItems`"
          },
          "includeNoSalaryJob": {
            "title": "Include No Salary Job",
            "type": "boolean",
            "description": "Include jobs with no salary. Default is true. JSON field name `includeNoSalaryJob`",
            "default": true
          }
        }
      },
      "runsResponseSchema": {
        "type": "object",
        "properties": {
          "data": {
            "type": "object",
            "properties": {
              "id": {
                "type": "string"
              },
              "actId": {
                "type": "string"
              },
              "userId": {
                "type": "string"
              },
              "startedAt": {
                "type": "string",
                "format": "date-time",
                "example": "2025-01-08T00:00:00.000Z"
              },
              "finishedAt": {
                "type": "string",
                "format": "date-time",
                "example": "2025-01-08T00:00:00.000Z"
              },
              "status": {
                "type": "string",
                "example": "READY"
              },
              "meta": {
                "type": "object",
                "properties": {
                  "origin": {
                    "type": "string",
                    "example": "API"
                  },
                  "userAgent": {
                    "type": "string"
                  }
                }
              },
              "stats": {
                "type": "object",
                "properties": {
                  "inputBodyLen": {
                    "type": "integer",
                    "example": 2000
                  },
                  "rebootCount": {
                    "type": "integer",
                    "example": 0
                  },
                  "restartCount": {
                    "type": "integer",
                    "example": 0
                  },
                  "resurrectCount": {
                    "type": "integer",
                    "example": 0
                  },
                  "computeUnits": {
                    "type": "integer",
                    "example": 0
                  }
                }
              },
              "options": {
                "type": "object",
                "properties": {
                  "build": {
                    "type": "string",
                    "example": "latest"
                  },
                  "timeoutSecs": {
                    "type": "integer",
                    "example": 300
                  },
                  "memoryMbytes": {
                    "type": "integer",
                    "example": 1024
                  },
                  "diskMbytes": {
                    "type": "integer",
                    "example": 2048
                  }
                }
              },
              "buildId": {
                "type": "string"
              },
              "defaultKeyValueStoreId": {
                "type": "string"
              },
              "defaultDatasetId": {
                "type": "string"
              },
              "defaultRequestQueueId": {
                "type": "string"
              },
              "buildNumber": {
                "type": "string",
                "example": "1.0.0"
              },
              "containerUrl": {
                "type": "string"
              },
              "usage": {
                "type": "object",
                "properties": {
                  "ACTOR_COMPUTE_UNITS": {
                    "type": "integer",
                    "example": 0
                  },
                  "DATASET_READS": {
                    "type": "integer",
                    "example": 0
                  },
                  "DATASET_WRITES": {
                    "type": "integer",
                    "example": 0
                  },
                  "KEY_VALUE_STORE_READS": {
                    "type": "integer",
                    "example": 0
                  },
                  "KEY_VALUE_STORE_WRITES": {
                    "type": "integer",
                    "example": 1
                  },
                  "KEY_VALUE_STORE_LISTS": {
                    "type": "integer",
                    "example": 0
                  },
                  "REQUEST_QUEUE_READS": {
                    "type": "integer",
                    "example": 0
                  },
                  "REQUEST_QUEUE_WRITES": {
                    "type": "integer",
                    "example": 0
                  },
                  "DATA_TRANSFER_INTERNAL_GBYTES": {
                    "type": "integer",
                    "example": 0
                  },
                  "DATA_TRANSFER_EXTERNAL_GBYTES": {
                    "type": "integer",
                    "example": 0
                  },
                  "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                    "type": "integer",
                    "example": 0
                  },
                  "PROXY_SERPS": {
                    "type": "integer",
                    "example": 0
                  }
                }
              },
              "usageTotalUsd": {
                "type": "number",
                "example": 0.00005
              },
              "usageUsd": {
                "type": "object",
                "properties": {
                  "ACTOR_COMPUTE_UNITS": {
                    "type": "integer",
                    "example": 0
                  },
                  "DATASET_READS": {
                    "type": "integer",
                    "example": 0
                  },
                  "DATASET_WRITES": {
                    "type": "integer",
                    "example": 0
                  },
                  "KEY_VALUE_STORE_READS": {
                    "type": "integer",
                    "example": 0
                  },
                  "KEY_VALUE_STORE_WRITES": {
                    "type": "number",
                    "example": 0.00005
                  },
                  "KEY_VALUE_STORE_LISTS": {
                    "type": "integer",
                    "example": 0
                  },
                  "REQUEST_QUEUE_READS": {
                    "type": "integer",
                    "example": 0
                  },
                  "REQUEST_QUEUE_WRITES": {
                    "type": "integer",
                    "example": 0
                  },
                  "DATA_TRANSFER_INTERNAL_GBYTES": {
                    "type": "integer",
                    "example": 0
                  },
                  "DATA_TRANSFER_EXTERNAL_GBYTES": {
                    "type": "integer",
                    "example": 0
                  },
                  "PROXY_RESIDENTIAL_TRANSFER_GBYTES": {
                    "type": "integer",
                    "example": 0
                  },
                  "PROXY_SERPS": {
                    "type": "integer",
                    "example": 0
                  }
                }
              }
            }
          }
        }
      }
    }
  }
}

Glassdoor ~ Indeed Jobs Scraper | Remove Duplicate Jobs | PPR OpenAPI definition

OpenAPI is a standard for designing and describing RESTful APIs, allowing developers to define API structure, endpoints, and data formats in a machine-readable way. It simplifies API development, integration, and documentation.

OpenAPI is effective when used with AI agents and GPTs by standardizing how these systems interact with various APIs, for reliable integrations and efficient communication.

By defining machine-readable API specifications, OpenAPI allows AI models like GPTs to understand and use varied data sources, improving accuracy. This accelerates development, reduces errors, and provides context-aware responses, making OpenAPI a core component for AI applications.

You can download the OpenAPI definitions for Glassdoor ~ Indeed Jobs Scraper | Remove Duplicate Jobs | PPR from the options below:

OpenAPI.json

If you’d like to learn more about how OpenAPI powers GPTs, read our blog post.

You can also check out our other API clients:

Glassdoor ~ Indeed Jobs Scraper | Remove Duplicate Jobs | PPR API in Python

Glassdoor ~ Indeed Jobs Scraper | Remove Duplicate Jobs | PPR API in JavaScript

Glassdoor ~ Indeed Jobs Scraper | Remove Duplicate Jobs | PPR API through CLI

Glassdoor ~ Indeed Jobs Scraper | Remove Duplicate Jobs | PPR API

Glassdoor Jobs Scraper

getdataforme/glassdoor-jobs-scraper

Glassdoor Jobs Scraper is an Apify Actor that automatically retrieves job listings from Glassdoor using your specified title and location. It gathers details such as job title, employer, location, salary estimates, ratings, and application URLs, returning structured JSON for effortless integration.

GetDataForMe

Indeed Company Jobs Scraper

dtrungtin/indeed-company-jobs-scraper

Indeed Company Jobs Scraper

Tin

Linkedin Indeed Glassdoor Job Scraper

gauravsaran/linkedin-indeed-glassdoor-job-scraper

🚀 Find your dream job instantly! Search Indeed, LinkedIn & Glassdoor simultaneously. Get hundreds of jobs with salary data, remote filters & company details in seconds. Perfect for job seekers, recruiters & HR teams. Works globally in 60+ countries. Fast, reliable & easy to use!

ScrapeForge

Glassdoor Jobs Scraper

silo/glassdoor-jobs-scraper

Introducing the Glassdoor Jobs Scraper for Apify! This cutting-edge tool simplifies job listing extraction from Glassdoor. Ideal for job seekers, recruiters, and market analysts, it provides extensive customization to meet diverse job-hunting and research requirements effortlessly.

Silo

196

🌍All Jobs Scraper - LinkedIn, Indeed, Glassdoor

agentx/all-jobs-scraper

Professional job scraping API extracts comprehensive job listings with salary ranges, company details, job requirements, benefits, remote options, and location data from Indeed, LinkedIn, Glassdoor. Advanced recruitment intelligence platform for job market analysis.

AgentX

5.0

🔥 Fast Indeed Jobs Scraper

memo23/apify-indeed-cheerio

Web scraper for Indeed.com job listings. Add Indeed URLs to customize searches. Set max jobs, concurrency, and other parameters to optimize performance while respecting site resources. Efficiently extracts job data, balancing speed with ethical practices. Ideal for gathering targeted job market info

Muhamed Didovic

331

4.4

Indeed Company: Reviews, Interview, Salary, Jobs, About Scraper

memo23/apify-indeed-reviews

Unlock 360° workforce intelligence - scrape reviews, salaries, jobs, interviews, company profiles, and cultural metrics from Indeed in one click. Transform raw data into recruitment strategies, competitive analysis, and market trends with enterprise-grade HR analytics.

Muhamed Didovic

158

5.0