Indeed Comprehensive Crawler avatar

Indeed Comprehensive Crawler

Try for free

2 hours trial then $15.00/month - No credit card required now

View all Actors
Indeed Comprehensive Crawler

Indeed Comprehensive Crawler

memo23/apify-indeed
Try for free

2 hours trial then $15.00/month - No credit card required now

Unearth every job gem on Indeed with laser precision. Our advanced scraper dives deep, extracting rich data from listings, reviews, and company profiles. Empower your recruitment strategy or market analysis with unparalleled insights, all at the speed of automation.

Overview

The Ultimate Indeed Scraper is a powerful tool designed for the Apify platform, tailored to bypass protections and efficiently extract job listing information directly from Indeed. It's engineered to navigate and parse data from Indeed's job listings, offering users a seamless experience in collecting valuable job market insights.

Features

  • Browser Automation: Uses advanced browser automation techniques to navigate Indeed's website and extract data reliably.
  • Customizable Searches: Input URLs directly from Indeed to target specific job searches or categories.
  • Comprehensive Data Extraction: Gathers detailed information from job listings, including job titles, company names, locations, salary estimates, job descriptions, and more.

How to Use

  1. Set Up: Ensure you have an Apify account and access to the Indeed Scraper Browser Automation actor.
  2. Input Configuration:
    • Start URLs: Enter your target Indeed job listing URLs. For example: https://de.indeed.com/jobs?q=medizinische+Fachangestellte&sc=0bf%3Aexrec()%2Ck
    • Max jobs to scrape (optional): Set the maximum number of job listings to scrape. Default is 20.
  3. Proxy Configuration: Configure proxy settings for enhanced reliability and to avoid IP blocks.
  4. Run the Scraper: Launch the scraper on the Apify platform.
  5. Data Collection: Extracted data will be available in your preferred format (JSON, CSV, etc.).

Input Example

1{
2    "startUrls": [
3        { "url": "https://de.indeed.com/jobs?q=medizinische+Fachangestellte&sc=0bf%3Aexrec()%2Ck" }
4    ],
5    "maxJobsToScrape": 20,
6    "proxyConfiguration": {
7        "useApifyProxy": true
8    }
9}

Output Structure

The scraper's output includes detailed information for each job listing. Here's a sample output with explanations for each field:

1{
2  "source_link": "",  // Original source link if the job is aggregated from another site
3  "adBlob": "...",  // Encoded string containing additional job information
4  "adId": "433403349",  // Unique identifier for the advertisement
5  "additionalLocationLink": {
6    "label": "+3 Orte",  // Label for additional locations
7    "url": "..."  // URL to view jobs in additional locations
8  },
9  "advn": "625754421564269",  // Advertiser identification number
10  "company": "Ober Scharrer Gruppe GmbH'",  // Name of the company offering the job
11  "companyRating": 0,  // Company rating on Indeed (if available)
12  "companyReviewCount": 0,  // Number of company reviews on Indeed
13  "createDate": 1721054703000,  // Timestamp of when the job was created
14  "displayTitle": "MFA / Medizinische Fachangestellte (w/m/d) für den OP",  // Job title as displayed
15  "formattedLocation": "76532 Baden-Baden",  // Formatted job location
16  "formattedRelativeTime": "vor 30+ Tagen",  // Relative time since job was posted
17  "jobkey": "ea2f2dbc75f8a438",  // Unique identifier for the job listing
18  "jobTypes": [  // Types of employment offered
19    "Vollzeit",
20    "Teilzeit"
21  ],
22  "locationCount": 4,  // Number of locations this job is available in
23  "pubDate": 1721019600000,  // Timestamp of when the job was published
24  "salarySnippet": {  // Salary information (if available)
25    "currency": "EUR",
26    "salaryTextFormatted": false
27  },
28  "snippet": "...",  // Brief excerpt or summary of the job description
29  "title": "MFA / Medizinische Fachangestellte (w/m/d) für den OP"  // Job title
30}

Field Explanations

  • source_link: If the job is aggregated from another site, this field contains the original link.
  • adBlob: An encoded string that may contain additional job information.
  • adId: A unique identifier for the job advertisement.
  • additionalLocationLink: Information about other locations where this job is available.
  • advn: The advertiser's identification number.
  • company: The name of the company offering the job.
  • companyRating: The company's rating on Indeed, if available.
  • companyReviewCount: The number of reviews the company has on Indeed.
  • createDate: The timestamp when the job listing was created.
  • displayTitle: The job title as it's displayed on the listing.
  • formattedLocation: The job's location in a formatted string.
  • formattedRelativeTime: How long ago the job was posted, in a human-readable format.
  • jobkey: A unique identifier for the specific job listing.
  • jobTypes: An array of job types (e.g., full-time, part-time) offered for this position.
  • locationCount: The number of locations where this job is available.
  • pubDate: The timestamp when the job was published.
  • salarySnippet: Information about the salary, if provided.
  • snippet: A brief excerpt or summary of the job description.
  • title: The title of the job position.

Support

Additional Services

Developer
Maintained by Community
Actor metrics
  • 3 monthly users
  • 2 stars
  • 100.0% runs succeeded
  • Created in Jan 2024
  • Modified 17 days ago