Merge, Dedup & Transform Datasets

lukaskrivka/dedup-datasets
The ultimate dataset processor. Extremely fast merging, deduplications & transformations all in a single run.

It's not merging and deduping like it used to

Open

Moudi opened this issue
7 days ago

I have 2 datasets that I want to merge and dedup, but it is not working. Usually this Actor works perfectly fine, but with this large dataset for some reason it is just adding them together and not merging them at all. I spent $200 on Google Maps Scraper and the email extractor; if this doesn't work, then that data is useless. Please help me fix it.

lukaskrivka

Hello,

Thanks for the report.

  1. You don't have anything in "Fields for deduplication", so the Actor doesn't know what it should dedup by.
  2. You are deduping between Contact Details and Google Maps. That will not merge the data, because this Actor doesn't know how to combine matching items from the two datasets. That is actually what Google Maps Email Extractor does. So instead you should resurrect the Google Maps Email Extractor run that you probably aborted; it will pick up the Google and contact runs and merge them.
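
To illustrate the two points above, here is a minimal Python sketch of the difference between deduplicating by a field and merging by a field. It is not the code of this Actor or of Google Maps Email Extractor; the function names `dedup_by_fields` and `merge_by_fields`, the sample items, and the field names `title`, `phone`, and `emails` are all assumptions made for the example.

```python
import json
from typing import Any

Item = dict[str, Any]


def _key(item: Item, fields: list[str]) -> str:
    # Serialize the chosen field values so that lists and dicts can be compared too.
    return json.dumps([item.get(f) for f in fields], sort_keys=True, default=str)


def dedup_by_fields(items: list[Item], fields: list[str]) -> list[Item]:
    """Keep only the first item seen for each combination of field values."""
    if not fields:
        # With no dedup fields there is nothing to compare,
        # so the datasets are simply concatenated.
        return list(items)
    seen: set[str] = set()
    unique: list[Item] = []
    for item in items:
        key = _key(item, fields)
        if key not in seen:
            seen.add(key)
            unique.append(item)
    return unique


def merge_by_fields(items: list[Item], fields: list[str]) -> list[Item]:
    """Combine the fields of all items that share the same key (later values win)."""
    merged: dict[str, Item] = {}
    for item in items:
        merged.setdefault(_key(item, fields), {}).update(item)
    return list(merged.values())


# Hypothetical sample data: one place item and one contact item for the same business.
places = [{"title": "Cafe Uno", "phone": "555-0100"}]
contacts = [{"title": "Cafe Uno", "emails": ["hi@cafeuno.example"]}]
combined = places + contacts

print(dedup_by_fields(combined, []))          # both items kept, nothing removed
print(dedup_by_fields(combined, ["title"]))   # only the first item survives
print(merge_by_fields(combined, ["title"]))   # one item with phone and emails
```

In this sketch, deduplicating on `title` drops the second item entirely, so its `emails` field is lost, while merging on `title` produces a single item that carries the fields of both.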
Moudi

3 days ago

OK, I followed your steps, finished all the Actors, and put the title as the field to dedup by. It did what I wanted, but the issue I have now is that I'm not getting any emails in the data. This is the most important thing for me.

Here is the run: https://console.apify.com/organization/wTBD9nCgifroPF7eY/actors/runs/H20HED8u2Yy7IKpsa#output

lukaskrivka

You should only run the Google Maps Email Extractor Actor, because it has the correct logic to merge the data. Just let me know the IDs of the runs I should rework and I will process them.

Developer
Maintained by Apify
Actor metrics
  • 831 monthly users
  • 49 stars
  • 99.8% runs succeeded
  • 3.6 days response time
  • Created in Apr 2020
  • Modified 9 days ago