Pay $0.30 for 1,000 tweets

🏯 Tweet Scraper V2 (Pay Per Result) - X / Twitter Scraper

apidojo/tweet-scraper

Pay $0.30 for 1,000 tweets

⚡️ Lightning-fast search, URL, list, and profile scraping, with customizable filters. At $0.30 per 1000 tweets, and 30-80 tweets per second, it is ideal for researchers, entrepreneurs, and businesses! Get comprehensive insights from Twitter (X) now!

Back to issues Create new issue

Scraper a lot of duplicate tweet.

Closed

relishable_finance opened this issue

For this run:https://console.apify.com/actors/runs/hcZD7Xyb7rBkXEQj7 id 1833141861617221670 duplicated for 5 times

And for this run : https://console.apify.com/actors/61RPP7dywgiy0JPD0/runs/0GBM6p0QcgxdmVLl9 id:1833197842661576793 duplicated for 104 times.

That cause my scraper fees about 100 times higher

Attach is result of hcZD7Xyb7rBkXEQj7

API Dojo (apidojo)

Hello,

Our engineering team checked the runs and we couldn't find any issues on our end. Our scraper uses the query you give to it with Twitter and returns you whatever it can get from it. There is no additional logic, including removing duplicate. It never alters the output.

Twitter changes behaviour constantly and it is better to test your queries on twitter web UI before running the actor. Another reason can be using multiple from queries withORs. Are you sure that works with Twitter? If you are using ORs, I think the best approach would be to create separate runs for each profile.

Does that make sense?

Cheers!

relishable_finance

I tried agian with only one from, but there is many duplicate as well。 https://console.apify.com/actors/61RPP7dywgiy0JPD0/runs/8tXLNNdSegCRGCYue

relishable_finance

https://console.apify.com/actors/61RPP7dywgiy0JPD0/runs/kShmSwwrGcfhDee0V this run duplicate as well

topical_summer

results are duplicated 27 times for my last run.

API Dojo (apidojo)

Hey hey,

As I mentioned, this is something we have no control over. Our actor uses Twitter search and paginates as long as it can get the pagination. And when you try to fetch lots of tweets, this is inevitable since Twitter acts very weird with long paginations.

For fetching a profile, I suggest you to use a similar approach as we explained here:

1{
2    "includeSearchTerms": false,
3    "onlyImage": false,
4    "onlyQuote": false,
5    "onlyTwitterBlue": false,
6    "onlyVerifiedUsers": false,
7    "onlyVideo": false,
8    "searchTerms": [
9        "from:NASA since:2023-01-01 until:2023-02-01",
10        "from:NASA since:2023-02-01 until:2023-03-01",
11        "from:NASA since:2023-03-01 until:2023-04-01",
12        "from:NASA since:2023-04-01 until:2023-05-01",
13        "from:NASA since:2023-05-01 until:2023-06-01",
14        "from:NASA since:2023-06-01 until:2023-07-01",
15        "from:NASA since:2023-07-01 until:2023-08-01",
16        "from:NASA since:2023-08-01 until:2023-09-01",
17        "from:NASA since:2023-09-01 until:2023-10-01",
18        "from:NASA since:2023-10-01 until:2023-11-01",
19        "from:NASA since:2023-11-01 until:2023-12-01"
20    ],
21    "sort": "Latest",
22    "tweetLanguage": "en"
23}

That way, you will have less paginations and your results will have less duplicates.

Cheers!

relishable_finance

https://console.apify.com/actors/61RPP7dywgiy0JPD0/runs/sJdAPAajvSfCpQpud

I've tried this way.. but when there is no reuslt return. it still show 33 outputs.

API Dojo (apidojo)

Hello,

Yes, that is expected. When your query returns 0 results, you get a zeroResult object for us to cover the cost of the run. You can check the output of the dataset in order to seet his.

Cheers!

Add comment

Developer

API Dojo

Actor metrics

1.4k monthly users
194 stars
97.7% runs succeeded
6.4 hours response time
Created in Nov 2023
Modified about 1 hour ago

Categories

Social media

Business

Lead generation

Website Content Crawler

apify/website-content-crawler

Crawl websites and extract text content to feed AI models, LLM applications, vector databases, or RAG pipelines. The Actor supports rich formatting using Markdown, cleans the HTML, downloads files, and integrates well with 🦜🔗LangChain, LlamaIndex, and the wider LLM ecosystem.

Apify

21.2k

472

Google Maps Extractor

compass/google-maps-extractor

Extract data from hundreds of places fast. Scrape Google Maps by keyword, category, location, URLs & other filters. Get addresses, contact info, opening hours, popular times, prices, menus & more. Export scraped data, run the scraper via API, schedule and monitor runs, or integrate with other tools.

Compass

9.8k

163

Facebook Posts Scraper

apify/facebook-posts-scraper

Extract data from hundreds of Facebook posts from one or multiple Facebook pages and profiles. Get post URL, post text, page or profile URL, timestamp, number of likes, shares, comments, and more. Download the data in JSON, CSV, and Excel and use it in apps, spreadsheets, and reports.

Apify

12.4k

129

Instagram Scraper

apify/instagram-scraper

Scrape and download Instagram posts, profiles, places, hashtags, photos, and comments. Get data from Instagram using one or more Instagram URLs or search queries. Export scraped data, run the scraper via API, schedule and monitor runs or integrate with other tools.

Apify

51.3k

311

TikTok Data Extractor

clockworks/free-tiktok-scraper

Extract data about videos, users, and channels based on hashtags or scrape full user profiles including posts, total likes, name, nickname, numbers of comments, shares, followers, following, and more.

Clockworks

14.6k

122

Contact Details Scraper

vdrmota/contact-info-scraper

Free email extractor to extract and download emails, phone numbers, Facebook, Twitter, LinkedIn, and Instagram profiles from any website. Extract contact information at scale from lists of URLs and download the data as Excel, CSV, JSON, HTML, and XML.

Vojta Drmota

21.9k

124

Google Maps Scraper

compass/crawler-google-places

Extract data from hundreds of Google Maps locations and businesses. Get Google Maps data including reviews, images, contact info, opening hours, location, popular times, prices & more. Export scraped data, run the scraper via API, schedule and monitor runs, or integrate with other tools.

Compass

74.7k

409

Instagram Profile Scraper

apify/instagram-profile-scraper

Scrape all Instagram profile info. Just add one or more Instagram usernames and extract number of followers&follows, URLs, bio, posts, likes, counts, related profiles, captions, highlight reels. Export scraped data, run the scraper via API, schedule and monitor runs or integrate with other tools.

Apify

34.6k

149

Facebook Groups Scraper

apify/facebook-groups-scraper

Extract data from one or multiple public Facebook groups. Get group and post URLs, post text, comments, timestamp, likes and comments count, and basic commentator info. Download the data in JSON, CSV, and Excel and use it in apps, spreadsheets, and reports.

Apify

6.1k

Instagram Post Scraper

apify/instagram-post-scraper

Scrape Instagram posts. Just add one or more Instagram usernames and get your data in seconds including text, hashtags, mentions, comments, images, URLs, likes, locations, and metadata. Export scraped data, run the scraper via API, schedule and monitor runs or integrate with other tools.

Apify

18.1k

How to scrape Twitter (X.com) data using Python without Twitter API

How to download tweets from Twitter in 2024

Build new tools

Are you a developer? Build your own Actors and run them on Apify.

Learn more

Get a custom solution

Get a custom web scraping or RPA solution.

Book a demo

🏯 Tweet Scraper V2 (Pay Per Result) - X / Twitter Scraper

🏯 Tweet Scraper V2 (Pay Per Result) - X / Twitter Scraper