An actor that crawls a website to identify key decision‑maker names and job titles, then uses NER‑powered matching to extract and pair their email addresses for streamlined lead generation and B2B data enrichment.
Custom Request Headers: Added defaultRequestHeaders input to allow custom HTTP headers for all requests
Language Filtering: Added languageCountryCode input to specify language/country codes for URL filtering
Improved Error Handling: Added more robust error handling throughout the pipeline
Changed
Code Refactoring: Restructured codebase for better maintainability and clarity
Name Extraction Improvements: Enhanced name validation and extraction logic to reduce false positives
Email Matching Optimization: Improved accuracy of email-person matching algorithm
URL Processing: Optimized URL processing and domain extraction logic
HTML Parsing: Enhanced BeautifulSoup parsing with better context detection
[0.6.0—alpha] — 2025-07-19
Added
New user inputs for explorationMode, depth, pagecount, priorityLinkKeywords, useCstomJobTitles and customJobTitles.
Chnaged
Improvements to the name extraction and validation.
[0.5.1—alpha] — 2025-07-15
Fixed
Minor Bug Fixes: Addressed issues in name extraction logic to ensure more consistent and accurate results.
[0.5.0—alpha] — 2025-07-13
Added
New Input Settings: Introduced jobTitles and obeyRobotstxt settings to enhance customization and compliance.
jobTitles: Allows users to specify target job titles for more precise extraction.
obeyRobotstxt: Ensures adherence to website robots.txt files for ethical crawling.
Changed
Updated Job Title Lists: Refined and expanded the internal job title database to improve accuracy and relevance of extracted decision-maker information.
Fixed
Extraction and Processing Improvements: Resolved multiple bugs affecting data extraction and processing, resulting in more reliable and accurate outputs.
[0.4.0—alpha] — 2025-07-11
Changed
Enhanced Name Processing: Optimized the handling of extracted names to improve accuracy, reduce false positives, and better support complex name formats.
Fixed
Minor Bug Fixes: Addressed issues in name extraction logic to ensure more consistent and accurate results.
[0.3.0—alpha] — 2025-07-11
Changed:
Expanded FALSE_POSITIVE_SELECTORS to include broader matching for testimonial-like content, including elements with class or ID containing "customer", "client", "review", "quote", "press", and related keywords.
[0.2.0—alpha] — 2025-07-09
Changed:
Replaced the NER model with a slightly smaller one from Hugging Face. This new model offers better performance and improved support for multilingual names, paving the way for future internationalization features.
Added:
Support for bulk URL input. Each URL is processed independently.
[0.1.0—alpha] — 2025-07-08
Added:
Initial alpha release of the Decision Maker Name & Email Extractor.