Zoominfo Scrapper avatar
Zoominfo Scrapper

Pricing

Pay per event

Go to Store
Zoominfo Scrapper

Zoominfo Scrapper

Developed by

HappiTap

HappiTap

Maintained by Community

ZoomInfo Company Scraper: An Apify actor that extracts comprehensive company information from ZoomInfo company pages, including basic details, financials, similar companies, funding data, and social network links.

0.0 (0)

Pricing

Pay per event

0

Total users

1

Monthly users

1

Runs succeeded

0%

Last modified

a day ago

ZoomInfo Company Scraper

An Apify actor that extracts comprehensive company information from ZoomInfo company pages, including basic details, financials, similar companies, funding data, and social network links.

What It Does

This scraper extracts structured company data from ZoomInfo including:

FieldDescription
idCompany identifier
nameCompany name
descriptionCompany description
revenueAnnual revenue information
websiteCompany website URL
stock_symbolStock ticker symbol
addressCompany headquarters address
employeesEmployee count
phoneContact phone number
founding_yearYear company was founded
industriesIndustry classifications
similar_companiesArray of similar companies with details
fundingFunding rounds and total funding
social_networksLinkedIn, Twitter, Facebook URLs

Supported Page Types

  • Company Pages: ZoomInfo company profile pages (e.g., /c/company-name/123456)
  • Similar Companies: Extracts related companies when enabled
  • Funding Information: Investment rounds and financial data
  • Social Networks: LinkedIn, Twitter, Facebook profile links

Use Cases

  • Lead Generation: Extract company contact information and details
  • Market Research: Analyze companies in specific industries
  • Competitive Intelligence: Gather competitor information and similar companies
  • Sales Prospecting: Build comprehensive company databases
  • Investment Research: Track funding rounds and company financials

Input

The actor accepts the following input format:

{
"max_retries_per_url": 2,
"include_similar_companies": true,
"urls_or_companies_names": [
"https://www.zoominfo.com/c/walmart-inc/155353090",
"https://www.zoominfo.com/c/google-llc/16400573"
],
"proxy": {
"useApifyProxy": true,
"apifyProxyGroups": ["RESIDENTIAL"],
"apifyProxyCountry": "US"
}
}

Input Parameters

ParameterTypeRequiredDefaultDescription
urls_or_companies_namesArrayYes-Array of ZoomInfo company URLs to scrape
max_retries_per_urlNumberNo2Number of retry attempts per URL
include_similar_companiesBooleanNotrueWhether to extract similar companies data
proxyObjectNo-Proxy configuration for residential proxies

Supported ZoomInfo URLs

The scraper works with ZoomInfo company profile pages:

Company Profile Pages:

  • https://www.zoominfo.com/c/walmart-inc/155353090
  • https://www.zoominfo.com/c/google-llc/16400573
  • https://www.zoominfo.com/c/microsoft-corporation/138614
  • https://www.zoominfo.com/c/apple-inc/138617

URL Format:

  • Pattern: https://www.zoominfo.com/c/{company-name}/{company-id}
  • The scraper extracts both basic company data and related information

Output

The actor outputs structured data for each company found:

{
"id": "155353090",
"name": "Walmart Inc.",
"description": "Walmart Inc. operates as a multinational retail corporation...",
"revenue": "$611.3 billion",
"website": "https://www.walmart.com",
"stock_symbol": "WMT",
"address": {
"street": "702 SW 8th Street",
"city": "Bentonville",
"state": "Arkansas",
"country": "United States",
"zipCode": "72716"
},
"employees": "2,100,000",
"phone": "+1-479-273-4000",
"founding_year": "1962",
"industries": ["Retail", "E-commerce", "Grocery"],
"similar_companies": [
{
"name": "Target Corporation",
"url": "https://www.zoominfo.com/c/target-corporation/138615",
"employees": "409,000"
}
],
"funding": {
"total_funding": "$0",
"rounds": []
},
"social_networks": {
"linkedin": "https://www.linkedin.com/company/walmart",
"twitter": "https://twitter.com/walmart",
"facebook": "https://www.facebook.com/walmart"
},
"url": "https://www.zoominfo.com/c/walmart-inc/155353090",
"scrapedAt": "2024-01-01T00:00:00.000Z"
}

Example Usage

Scrape Single Company

{
"max_retries_per_url": 3,
"include_similar_companies": true,
"urls_or_companies_names": [
"https://www.zoominfo.com/c/walmart-inc/155353090"
],
"proxy": {
"useApifyProxy": true,
"apifyProxyGroups": ["RESIDENTIAL"],
"apifyProxyCountry": "US"
}
}

Scrape Multiple Companies

{
"max_retries_per_url": 2,
"include_similar_companies": false,
"urls_or_companies_names": [
"https://www.zoominfo.com/c/walmart-inc/155353090",
"https://www.zoominfo.com/c/google-llc/16400573",
"https://www.zoominfo.com/c/microsoft-corporation/138614"
]
}

Basic Configuration

{
"urls_or_companies_names": [
"https://www.zoominfo.com/c/apple-inc/138617"
]
}

How It Works

  1. URL Validation: Validates ZoomInfo company URLs and extracts company identifiers
  2. Anti-Detection: Implements advanced anti-bot measures including user agent rotation and headers
  3. Data Extraction: Uses specialized selectors to extract comprehensive company information
  4. Similar Companies: Optionally extracts related companies and their details
  5. Funding Data: Scrapes investment rounds and funding information
  6. Social Networks: Extracts LinkedIn, Twitter, and Facebook profile links
  7. Structured Output: Returns clean, structured company data ready for analysis

Features

  • Comprehensive Data: Extracts 14+ fields including financials, contact info, and social networks
  • Similar Companies: Optional extraction of related companies with details
  • Anti-Bot Protection: Advanced stealth measures to bypass ZoomInfo's detection
  • Proxy Support: Built-in residential proxy support for reliable scraping
  • Retry Logic: Configurable retry attempts for failed requests
  • Error Handling: Graceful error handling with detailed logging
  • Data Validation: Ensures data quality with validation checks
  • Funding Tracking: Extracts investment rounds and total funding amounts

Installation

  1. Clone this repository
  2. Install dependencies: npm install
  3. Run the actor: npm start

Development

  • npm start - Run the actor
  • npm run format - Format code with Prettier
  • npm run lint - Run ESLint
  • npm run lint:fix - Fix ESLint issues

Architecture

  • src/main.js - Main entry point and input validation
  • src/routes.js - Request routing and page type detection
  • src/handlers/amazonProductPage.js - Individual product page scraping logic
  • src/handlers/amazonSearchResults.js - Search results and category page scraping logic
  • src/puppeteerLauncher.js - Puppeteer browser configuration with stealth mode

Notes

  • The scraper is designed to be respectful of Amazon's servers and includes appropriate delays
  • Results may vary based on Amazon's page structure changes
  • The scraper automatically handles different Amazon page layouts and product formats
  • All extracted data is timestamped for tracking purposes
  • Product pages return single items, while search/category pages return multiple products