Merge, Dedup & Transform Datasets avatar

Merge, Dedup & Transform Datasets

Try for free

No credit card required

View all Actors
Merge, Dedup & Transform Datasets

Merge, Dedup & Transform Datasets

lukaskrivka/dedup-datasets
Try for free

No credit card required

The ultimate dataset processor. Extremely fast merging, deduplications & transformations all in a single run.

Do you want to learn more about this Actor?

Get a demo
NV

Select contact details

Closed

nico_v opened this issue
5 months ago

Hello,

Given the (very) large amount of column the contact details output for this run (1,070 columns in total), I'd like only the first 2 Contact Details of each sub-category (e.g. emails/0 + emails/1 ; Facebooks/0 + Facebooks/1, ...).

How exactly can I make this happen please? I've tried in the JSON input, using the Fileds to Load as e.g. 'contactDetails/facebooks/0',, but it wouldn't output any of these.

Happy to hear from you on that, thanks.

lukaskrivka avatar

Hello,

I can do a small script for you to do this in this actor. Another option would be to join all emails to one column separated by ;, e.g. instead of emails/0 = john@apify.com and emails/1 = peter@apify.com, it would be emails = john@apify.com; peter@apify.com. And it could join any number (you can decide a limit). Would you prefer that or your original idea?

NV

nico_v

5 months ago

Hello Lukáš,

Thank you for the suggestions. It would be great if you could indeed have contactDetails 0 AND 1 in one column for each contactDetails type (e.g. Facebook 0 + Facebook 1 into one column, separated by a comma).

How should I proceed?

Also, I would need this not for this run in partuclar, but anytime (as I plan many different datasets merges).

Thanks for your help !

lukaskrivka avatar

Hello,

I created a task "Merge Google Maps - Trim to 2 contacts" for you. There is just a simple transform function that cleans the contacts. See the run below. https://console.apify.com/actors/tasks/WyOhdrUQvylK2G1f6/runs/wONYnadbFx2JeWNY3#output

I also added placeId to deduplication fields so you get rid of duplicate places, it goes down to 31053 places now.

Developer
Maintained by Apify
Actor metrics
  • 831 monthly users
  • 49 stars
  • 99.8% runs succeeded
  • 3.6 days response time
  • Created in Apr 2020
  • Modified 9 days ago