Pricing

$5.00 / 1,000 results

Go to Store

Website Email Scraper

Try for free

Developed by

thenetaji

Extract videos, images, audio, APKs & emails from websites. This Apify actor crawls pages to discover media links with configurable depth, proxy support & domain filtering. Boost content research & lead gen.

5.0 (2)

Pricing

$5.00 / 1,000 results

Total users

183

Monthly users

Runs succeeded

>99%

Last modified

2 days ago

Lead generation

Developer tools

Automation

Website Email Extractor - Most efficieent

🔍 Overview

Media Link Extractor is a powerful Apify actor that automatically crawls websites to discover and extract various types of media links including videos, images, audio files, APK files, and email addresses. Perfect for content aggregation, SEO research, lead generation, and digital asset management.

Media Link Extractor Banner

✨ Key Features

Multi-Media Support: Extract various media types (videos, images, audio, APKs, emails)
Configurable Crawling: Set crawl depth, concurrency, and URL limits to suit your needs
Smart Extraction: Uses multiple detection methods including URL patterns, HTML tags, and CSS selectors
Proxy Support: Optional Apify proxy integration for better scraping success rates
Domain Filtering: Stays on the same domain to focus crawling on relevant content
Detailed Output: Organized dataset with source URLs, timestamps, and media metadata
Rate Limiting Protection: Built-in mechanisms to avoid overloading target websites

🎯 Use Cases

Content Creators: Find media resources for projects and presentations
Digital Marketers: Discover image and video assets for competitor analysis
App Developers: Locate APK distribution points for competitive research
Lead Generation: Extract email addresses for business outreach campaigns
SEO Specialists: Analyze media usage patterns across websites
Researchers: Gather media files for analysis and archiving projects

🛠️ Input Parameters

{
  "startUrls": [{ "url": "https://example.com" }],
  "mediaType": "all",
  "maxCrawlDepth": 1,
  "maxConcurrency": 10,
  "maxRequestRetries": 3,
  "maxUrlsToCrawl": 100,
  "useProxy": {
    "useApifyProxy": false,
    "apifyProxyGroups": [],
    "apifyProxyCountry": ""
  }
}

Parameter Details

Parameter	Type	Description
`startUrls`	Array	List of URLs where the crawler will begin
`mediaType`	String	Type of media to extract: `video`, `audio`, `image`, `apk`, `email`, or `all`
`maxCrawlDepth`	Number	How many links deep the crawler will go
`maxConcurrency`	Number	Maximum parallel requests
`maxRequestRetries`	Number	Number of retry attempts for failed requests
`maxUrlsToCrawl`	Number	Maximum number of URLs to process
`useProxy`	Object	Configuration for Apify proxy usage

📊 Output Format

The actor stores results in the default dataset with this structure:

{
  "sourceUrl": "https://example.com/page",
  "pageTitle": "Example Page Title",
  "mediaLinks": [
    {
      "url": "https://example.com/video.mp4",
      "sourceUrl": "https://example.com/page",
      "title": "Example Page Title",
      "type": "video",
      "foundAt": "2025-04-10T06:40:01.000Z"
    }
  ],
  "timestamp": "2025-04-10T06:40:01.000Z"
}

⚙️ Technical Implementation

Media Link Extractor uses a combination of techniques to find media resources:

CSS Selectors: Targets specific HTML elements containing media
URL Pattern Matching: Identifies file extensions and URL patterns
Context Analysis: Examines surrounding elements for media indicators
Domain Adherence: Maintains focus on the original domain

💡 Best Practices

Start Small: Begin with a low maxUrlsToCrawl value to test results
Respect Websites: Use reasonable maxConcurrency values to avoid overloading sites
Optimize Depth: Most valuable media is often found within 1-2 levels of crawl depth
Target Specific Media: Use the appropriate mediaType parameter instead of "all" for more focused results

📚 Examples

Extract Videos from a Website

{
  "startUrls": [{ "url": "https://example.com/videos" }],
  "mediaType": "video",
  "maxCrawlDepth": 2,
  "maxUrlsToCrawl": 50
}

Find Email Addresses for Lead Generation

{
  "startUrls": [{ "url": "https://company.com/about" }],
  "mediaType": "email",
  "maxCrawlDepth": 3,
  "maxUrlsToCrawl": 200
}

Collect APK Files from Android Sites

{
  "startUrls": [{ "url": "https://apksite.com" }],
  "mediaType": "apk",
  "maxCrawlDepth": 2,
  "maxUrlsToCrawl": 100
}

📈 Performance Considerations

Processing speed depends on website complexity and response times
Typical extraction rates: 5-10 pages per second without proxy, 2-5 pages per second with proxy
Memory usage scales with concurrency and page complexity
Consider using Apify proxy for rate-limited or IP-blocking websites

🔗 Integration Ideas

Connect with Apify Storage for permanent dataset archiving
Combine with Google Sheets integration for easy team collaboration
Use with Zapier or Make to automate workflows with extracted media
Export data to S3 or other cloud storage for batch processing

On this page

Website Email Extractor - Most efficieent

Share Actor:

Free Email Domain Scraper - Extract Emails From Any Website

s-r/free-email-domain-scraper

Extract Emails From Any Website. No monthly costs. Contact discovery, employing a two-pass search strategy, advanced filtering (remove generic and malformed emails), user-agent rotation, and configurable limits per domain. Ideal for lead generation and market research.

199

5.0

Email and Contact Us Page Scraper

moving_beacon-owner1/my-actor-12

This advanced email scraper crawls websites to identify and extract valid email addresses while filtering out unwanted ones. It gathers emails from both main pages and contact pages for effective outreach and analysis.

Jamshaid Arif

Extract Contact Details from Any Website – Email, Phone, Social

creative_tablecloth/extract-email-phone-social-media-from-any-website

Discover our powerful scraper that effortlessly extracts emails, phone numbers, and social media links from any website. Ideal for marketers and businesses seeking to enhance their contact database quickly and efficiently.

Jinny Kim

1.8K

3.0

📧✨ Extract Emails, Socials and Contacts from Any Website

logical_scrapers/extract-email-from-any-website

(fastest) An advanced Actor for extracting email addresses, social links and contact details from websites. This tool is perfect for web scraping, contact collection, and lead generation.

Goldmine

472

5.0

Contact scraper Extracts Email Phone Social Media from website

giovannibiancia/website-actor

LeadFinder Scraper is a powerful lead generation tool designed to extract emails, phone numbers, and social media profiles from websites. Perfect for B2B and B2C businesses, market research, and seamless CRM data integration.

Giovanni Bianciardi

291

Email Scraper

ib4ngz/email-scraper

This actor scrapes email addresses from a list of provided URLs. It recursively crawls pages, extracts unique emails, and stores them in a dataset. The actor supports DNS validation to ensure domain authenticity and allows filtering based on custom crawling depth.

Iqbal R

160

Scrape Emails Websites

techionik9993/static-websites-email-scraper

This Actor is a powerful and scalable solution designed to extract email addresses from static websites in a reliable and efficient manner. It leverages Python’s requests and BeautifulSoup libraries to parse HTML pages.

Techionik

5.0

Email, Phone & Socials Extractor

louisdeconinck/email-extractor

Easily extract emails, phone numbers, and social media links from any website. It crawls the specified start URLs and their subpages, collecting contact information and social media profiles.

Louis Deconinck

141

5.0

Website Emails Scraper

maximedupre/website-emails-scraper

It goes to a website and extracts every email addresses. Super simple.

Maxime

Advanced Website Email, Phone and Social Media Scraper

perfectscrape/actor

This advanced contact scraper is an ALL-IN-ONE scraper that navigates pages likely to contain contact data, extracting emails, phone numbers, and social media links, with precision and speed. This scraper can bypass cloudflare and captchas. Very good scraper for lead generation.