Instagram Scraper

Pricing

$29.99/month + usage

Developed by

DataFusionX

Maintained by Community

Scrape Instagram users' profile details, along with email and phone details.

5.0 (9)

55

Total users: 1.3K

Monthly users: 153

Runs succeeded: >99%

Issues response: 0.71 hours

Last modified: 3 days ago

Output Size Exceeds Limit

Open

Pedlar opened this issue
4 days ago

Hi, I’m using this Apify actor through Clay and ran it on over 12,000 accounts. I just noticed that for more than 1,600 results, I can’t access the output because of the following error: “Cell data size exceeds limit (200KB).”

Is there a way to adjust the output format to reduce the size? I only need the following fields:

• Number of followers
• Last post date
• Number of posts
• Username

Pedlar

4 days ago

This is the input format I am giving:

{
  "directUrls": [
    "https://www.instagram.com/blindbarber"
  ],
  "proxy": {
    "useApifyProxy": true,
    "groups": [
      "RESIDENTIAL"
    ]
  }
}

devil_port369-owner avatar

Dear Pedlar,

This looks like a Clay limitation, but I think the steps below could help. If not, please ping me and we will sort it out.

You can extract only the needed fields (followers, last post date, posts, username):

1. In Clay, open the Source Column for your Apify data.
2. Create columns for each field.
3. Use formulas like =SOURCE_CELL["followers_count"], =SOURCE_CELL["last_post_date"], etc. (confirm JSON keys).

Optionally, tweak the Apify actor to output only these fields. Share a JSON sample for exact formulas or contact Clay support via “Talk to a person.”

Best, DataFusionX
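For readers following along, the "output only these fields" idea can be sketched in Python as below. This is only an illustration, not the actor's code: the key names in KEEP_KEYS are hypothetical (the reply itself says to confirm the JSON keys), the sample values are placeholders, and the byte limit assumes Clay measures the 200KB against the serialized JSON text.

```python
import json

# Assumption: the 200KB cell limit applies to the UTF-8 encoded JSON text.
CELL_LIMIT_BYTES = 200 * 1000

# Hypothetical key names; confirm them against a real dataset item.
KEEP_KEYS = ("username", "followers_count", "posts_count", "last_post_date")


def slim_profile(item: dict) -> dict:
    """Return a new dict holding only KEEP_KEYS (missing keys become None)."""
    return {key: item.get(key) for key in KEEP_KEYS}


def json_size_bytes(record: dict) -> int:
    """Size of the record once serialized as compact UTF-8 JSON."""
    text = json.dumps(record, separators=(",", ":"), ensure_ascii=False)
    return len(text.encode("utf-8"))


if __name__ == "__main__":
    # Placeholder item: a few small fields plus one large field we do not need.
    full_item = {
        "username": "blindbarber",
        "followers_count": 0,
        "posts_count": 0,
        "last_post_date": None,
        "latest_posts": ["x" * 1000] * 300,
    }
    slim = slim_profile(full_item)
    print(json_size_bytes(full_item) > CELL_LIMIT_BYTES)  # full item is over the limit
    print(json_size_bytes(slim) <= CELL_LIMIT_BYTES)      # slim record fits
```

Dropping the large nested fields before the data reaches Clay is what keeps each cell under the limit; the four kept values are only a few dozen bytes.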

Pedlar

4 days ago

This is what the Clay agent suggested:

We currently limit the amount of data that can be stored in a single cell to 200KB. We don't have a current workaround that lets us exceed that cell limit. If you're using Apify or AI prompts that return large JSON or HTML blocks, you might consider:

• Reducing the number of fields you return by limiting the output fields in your Apify configuration, meaning only pick the fields you really require. Use parameters like: { "maxItems": 1, "onlyTotal": true, "outputFields": ["title", "date"], "resultsLimit": 1 }
• Extracting only key insights (vs. full page scrapes or documents)
• Returning a link to the full result + summary instead of everything inline

Use comma separation for multiple fields. Use dot notation for nested data (object1.object2.name). Let me know if you still have questions.
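To make the comma-separated and dot-notation hints concrete, here is a small Python sketch of how such a field list could be resolved against nested JSON. How Clay actually resolves these paths is not shown in the thread, so the helper names get_path and pick_fields are purely illustrative.

```python
from typing import Any


def get_path(data: dict, path: str, default: Any = None) -> Any:
    """Follow a dotted path such as 'object1.object2.name' through nested dicts."""
    current: Any = data
    for part in path.split("."):
        if not isinstance(current, dict) or part not in current:
            return default
        current = current[part]
    return current


def pick_fields(data: dict, spec: str) -> dict:
    """Resolve a comma-separated list of dotted paths into a flat dict keyed by path."""
    paths = [p.strip() for p in spec.split(",") if p.strip()]
    return {path: get_path(data, path) for path in paths}


if __name__ == "__main__":
    sample = {"title": "t", "object1": {"object2": {"name": "n"}}}
    print(pick_fields(sample, "title, object1.object2.name"))
```

A path that does not exist resolves to the default (None) instead of raising, which is usually what you want when some profiles lack a field.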

Pedlar

4 days ago

I am not sure how to do that in Apify.

devil_port369-owner avatar

I think you need to configure the Clay Apify integration to include only the required data points (column fields) from the Apify dataset in Clay:

{ "maxItems": 1, "onlyTotal": true, "outputFields": ["username", "followers", "last_post_date"], "resultsLimit": 1 }

Like this.
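Outside of Clay, one way to pull only selected columns from an Apify dataset is the `fields` query parameter of the dataset items endpoint in the Apify API. The sketch below only builds the request URL; the dataset ID is a placeholder, the field names are hypothetical (confirm them against a real item), and authentication is left out.

```python
from typing import Optional, Sequence
from urllib.parse import urlencode

API_BASE = "https://api.apify.com/v2"


def dataset_items_url(
    dataset_id: str, fields: Sequence[str], limit: Optional[int] = None
) -> str:
    """Build a dataset items URL that asks only for the listed top-level fields."""
    params = {"format": "json", "fields": ",".join(fields)}
    if limit is not None:
        params["limit"] = str(limit)
    return f"{API_BASE}/datasets/{dataset_id}/items?{urlencode(params)}"


if __name__ == "__main__":
    # "DATASET_ID" is a placeholder; the field names are assumptions.
    print(dataset_items_url("DATASET_ID", ["username", "followers_count"], limit=1))
```

Note that urlencode percent-encodes the comma between field names, which the server decodes back into a comma-separated list.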

devil_port369-owner avatar

If nothing works by tomorrow, let me know and I will add another actor for you with only your required data points and format.

Thank you.

Pedlar

4 days ago

The Clay team couldn't help me with that, so if you can, please do it.

devil_port369-owner avatar

Sure, I will look into it.

devil_port369-owner avatar

Please share the final data points and the data format you want. Also, may I know at what rate you normally process URLs/usernames?

Pedlar

3 days ago

The final data points that I want are: Username, IG Last Post Date, Followings, Followers, Business Address, Posts number

This is the current input that I am using:

{
  "directUrls": [
    "Instagram URL"
  ],
  "proxy": {
    "useApifyProxy": true,
    "groups": [
      "RESIDENTIAL"
    ]
  }
}
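For anyone generating this input for many accounts, a minimal Python sketch of a builder is shown below. The directUrls and proxy keys are taken from the input above; the helper name build_instagram_input is made up, and whether the actor handles several URLs in one run is an assumption based only on directUrls being an array.

```python
from typing import Iterable

PROFILE_BASE = "https://www.instagram.com/"


def build_instagram_input(handles: Iterable[str]) -> dict:
    """Turn Instagram handles into an actor input shaped like the one in this thread."""
    # Strip whitespace and a leading "@", and skip blank entries.
    urls = [PROFILE_BASE + h.strip().lstrip("@") for h in handles if h.strip()]
    return {
        "directUrls": urls,
        "proxy": {"useApifyProxy": True, "groups": ["RESIDENTIAL"]},
    }


if __name__ == "__main__":
    print(build_instagram_input(["@blindbarber"]))
```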

devil_port369-owner avatar

Okay got it. Thank you

Pedlar

3 days ago

Hi there,

That worked perfectly — thank you so much!

I wanted to ask if you could help us build a new Apify actor that scrapes the Facebook Ads Library. We only need one data point: the total number of ads a page is running. If it’s zero, just return 0 — no other data is needed.

We’re currently using an existing actor, but we’re running into the same “data size” issue. If you’re able to build a more efficient version for us, we’d be happy to switch and use it.

Here’s the current input we’re using:

{
  "isDetailsPerAd": false,
  "onlyTotal": true,
  "startUrls": [
    {
      "url": "https://www.facebook.com/ads/library/?active_status=active&ad_type=all&country=ALL&is_targeted_country=false&media_type=all&q=bghs.org&search_type=keyword_unordered",
      "method": "GET"
    }
  ]
}
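As a rough illustration of the request, the Python sketch below rebuilds the input above for any search keyword and reduces a result item to a single number, falling back to 0. The input keys and URL parameters come from the input quoted above; the result key "totalCount" is hypothetical, since the thread does not show the existing actor's output shape.

```python
from typing import Optional
from urllib.parse import urlencode

ADS_LIBRARY_URL = "https://www.facebook.com/ads/library/"


def build_ads_input(query: str) -> dict:
    """Build an input like the one quoted above for a given search keyword."""
    params = {
        "active_status": "active",
        "ad_type": "all",
        "country": "ALL",
        "is_targeted_country": "false",
        "media_type": "all",
        "q": query,
        "search_type": "keyword_unordered",
    }
    url = f"{ADS_LIBRARY_URL}?{urlencode(params)}"
    return {
        "isDetailsPerAd": False,
        "onlyTotal": True,
        "startUrls": [{"url": url, "method": "GET"}],
    }


def total_ads(result: Optional[dict], key: str = "totalCount") -> int:
    """Return the ad count from a result item, or 0 when it is missing or empty.

    The default key name is an assumption, not a documented output field.
    """
    if not result:
        return 0
    value = result.get(key)
    return int(value) if value else 0


if __name__ == "__main__":
    print(build_ads_input("bghs.org")["startUrls"][0]["url"])
    print(total_ads({}))
```

Returning a bare integer per page keeps each cell tiny, which sidesteps the data size issue described in this thread.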

devil_port369-owner avatar

Wow 🤩 great to hear it works for you. Sure, I will publish it very soon.

Pedlar

2 days ago

Hey, any update on this?

devil_port369-owner avatar

Hi there, I will publish it by tomorrow EOD. Thank you.

Pedlar

3 hours ago

Hi there, have you published it already? Is anything wrong?