πŸ“§βœ¨ Extract Emails From Any Website avatar

πŸ“§βœ¨ Extract Emails From Any Website

Try for free

1 day trial then $20.00/month - No credit card required now

Go to Store
πŸ“§βœ¨ Extract Emails From Any Website

πŸ“§βœ¨ Extract Emails From Any Website

logical_scrapers/extract-email-from-any-website
Try for free

1 day trial then $20.00/month - No credit card required now

(fastest) An advanced Actor for extracting email addresses from websites. This tool is perfect for web scraping, contact collection, and lead generation.

Extract Emails From a Website

An advanced Apify Actor for extracting email addresses from websites, including JavaScript-rendered pages. This tool is perfect for web scraping, contact collection, and lead generation.

Features

  • πŸš€ High Accuracy: Extracts valid and relevant email addresses using robust filtering.
  • 🌐 JavaScript Support: Handles JS-rendered pages via Playwright.
  • πŸ”— Contact Page Detection: Prioritizes contact-related pages for better results.
  • ⚑ Concurrency: Processes multiple websites simultaneously with a configurable limit.
  • πŸ”’ Error Handling: Comprehensive error logging ensures reliability.

Use Cases

  1. Lead Generation: Collect contact information from company websites.
  2. Market Research: Gather emails for outreach campaigns.
  3. Web Crawling: Efficiently scan and extract emails from multiple pages.

How It Works

The actor performs the following steps:

  1. Fetches the HTML content of the given URLs using HTTPX.
  2. If required, renders the page using Playwright to handle JavaScript.
  3. Extracts all email addresses from the page content using regex patterns.
  4. Scans all internal links (prioritizing contact-related pages).
  5. Outputs a consolidated list of unique and valid email addresses.

Input Format

The actor accepts a JSON object with the following structure:

1{
2  "urls": [
3    "https://example.com",
4    "https://another-example.com"
5  ]
6}
  • urls (required): An array of website URLs to scrape.

Output Format

The actor produces an output dataset in the following format:

1[
2  {
3    "url": "https://example.com",
4    "emails": ["info@example.com", "support@example.com"],
5    "scanned_pages": [
6      "https://example.com",
7      "https://example.com/contact"
8    ],
9    "status": "success",
10    "error": null
11  },
12  {
13    "url": "https://another-example.com",
14    "emails": [],
15    "scanned_pages": ["https://another-example.com"],
16    "status": "error",
17    "error": "Timeout while fetching the URL"
18  }
19]
  • url: The processed website URL.
  • emails: List of extracted email addresses.
  • scanned_pages: Pages scanned during the process.
  • status: "success" or "error".
  • error: Details of any encountered error (if applicable).

Example Use Cases

1. Extract Emails from Multiple Websites

Provide a list of URLs in the input JSON, and the actor will output all the emails it finds.

2. Scan Contact Pages for Specific Emails

The actor prioritizes scanning Contact Us or About Us pages for higher relevance.

3. Automate Email Extraction for Marketing

Integrate this actor into your automation pipeline to gather contact leads regularly.


Keywords

Email Extractor, Web Scraper, Lead Generation Tool, Apify Actor, Playwright Email Extraction, Contact Page Scanner, JavaScript Rendering Scraper, Bulk Email Scraping.


For more information or support, feel free to open an issue or reach out on Apify Community Forum.

Happy Scraping! 😊

Developer
Maintained by Community

Actor Metrics

  • 6 monthly users

  • 0 No stars yet

  • >99% runs succeeded

  • Created in Jan 2025

  • Modified 6 days ago