Fast News Scraper

Pay $3.00 for 1,000 articles

timgreen/fast-news-scraper

Extract full article text and metadata from popular news sites like The New York Times, Bloomberg, Reuters, BBC, CNBC, and Wired. Scrape thousands of articles in just a few minutes. Scrape a single site, or provide a list of article URLs to scrape.

Site

site (Enum, Optional)

The website to scrape. It must be one of the supported options. To request a website that's not supported, create an issue in the Issues tab.

Value options:

  • "nytimes.com": string
  • "washingtonpost.com": string
  • "bloomberg.com": string
  • "cnn.com": string
  • "bbc.com": string
  • "reuters.com": string
  • "seekingalpha.com/market-news": string
  • "wired.com": string
  • "cnbc.com": string

Default value of this property is "cnn.com"

Query

query (string, Optional)

The search term for the website. Not all websites support this, and some allow an empty value. See the README for more details.

Default value of this property is ""

Sort by

sort (Enum, Optional)

Sort by date or relevance. Not all websites support both. See the README for more details.

Value options:

  • "date": string
  • "relevance": string

Default value of this property is "date"

Max Items

maxItems (integer, Optional)

The approximate maximum number of articles to return.

Default value of this property is 500
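
The search-mode fields above (site, query, sort, maxItems) can be assembled into an input object. The following is a minimal sketch, not the Actor's own code: the helper name `build_search_input` is hypothetical, the allowed values and defaults are copied from this page, and the shape of the `proxy` object is an assumption based on the usual Apify proxy input.

```python
# Values copied from the "Value options" lists on this page.
SUPPORTED_SITES = {
    "nytimes.com", "washingtonpost.com", "bloomberg.com", "cnn.com",
    "bbc.com", "reuters.com", "seekingalpha.com/market-news",
    "wired.com", "cnbc.com",
}
SORT_OPTIONS = {"date", "relevance"}


def build_search_input(site="cnn.com", query="", sort="date", max_items=500):
    """Return a search-mode input dict using the documented defaults.

    Hypothetical helper: it only validates values locally and does not
    contact Apify.
    """
    if site not in SUPPORTED_SITES:
        raise ValueError(f"unsupported site: {site!r}")
    if sort not in SORT_OPTIONS:
        raise ValueError(f"unsupported sort: {sort!r}")
    if max_items < 1:
        raise ValueError("max_items must be a positive integer")
    return {
        "site": site,
        "query": query,
        "sort": sort,
        "maxItems": max_items,
        # Assumption: the standard Apify proxy object. The page only says
        # that "proxy" is a required object.
        "proxy": {"useApifyProxy": True},
    }


example = build_search_input(site="reuters.com", query="interest rates", max_items=100)
```

Validating locally catches a misspelled site before a paid run starts. Whether a given site accepts a query or both sort orders is described in the README, so this sketch does not check that.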

Article URLs

articleURLs (array, Optional)

A list of article URLs from which to extract data. If any URLs are provided, site, query, and sort will be ignored and only those URLs will be scraped.
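
The precedence rule above can be illustrated with a small local check. This is a sketch under stated assumptions: `describe_mode` is a hypothetical helper, the URL is a placeholder, and `articleURLs` is assumed to be a list of plain URL strings (the page only says "array").

```python
# Fields that the page says are ignored once any article URL is provided.
SEARCH_ONLY_FIELDS = ("site", "query", "sort")


def describe_mode(run_input):
    """Report which mode an input selects and which fields would be ignored."""
    urls = run_input.get("articleURLs") or []
    if urls:
        ignored = [key for key in SEARCH_ONLY_FIELDS if key in run_input]
        return {"mode": "urls", "ignored": ignored}
    return {"mode": "search", "ignored": []}


mixed_input = {
    "site": "bbc.com",
    "query": "climate",
    # Placeholder URL, for illustration only.
    "articleURLs": ["https://www.bbc.com/news/example-article"],
}
result = describe_mode(mixed_input)
```

Here `result` reports URL mode with `site` and `query` listed as ignored, which mirrors the behavior described above for inputs that mix both styles.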

Dataset Name

datasetName (string, Optional)

If set, results will be stored in the named dataset.

Request Queue Name

requestQueueName (string, Optional)

If set, requests will be stored in the named queue. This helps avoid scraping the same articles multiple times.

Begin Date

beginDate (string, Optional)

Only supported for some websites. Extract articles published on or after this date.

End Date

endDate (string, Optional)

Only supported for some websites. Extract articles published on or before this date.
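
A date window can be prepared as two string fields. This sketch assumes ISO 8601 dates (`YYYY-MM-DD`); the page does not state the expected format, so check the README before relying on it. The helper name `date_range_fields` is hypothetical.

```python
from datetime import date


def date_range_fields(begin, end):
    """Return beginDate/endDate strings for an inclusive date window.

    Assumption: the Actor accepts ISO 8601 dates (YYYY-MM-DD).
    """
    if begin > end:
        raise ValueError("begin date must not be after end date")
    return {"beginDate": begin.isoformat(), "endDate": end.isoformat()}


window = date_range_fields(date(2024, 5, 1), date(2024, 5, 31))
```

The resulting dict can be merged into the rest of the input. Sites that do not support date filtering may not honor these fields.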

Proxy configuration

proxy (object, Required)

Select proxies to be used by your crawler.
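
Putting the fields together, a run could be started from Python with the `apify-client` package. This is a sketch, not an official example: the Actor ID is taken from this page, `make_run_input` and `run_scraper` are hypothetical helpers, and `{"useApifyProxy": True}` is an assumed proxy shape based on the usual Apify proxy input. The client import sits inside the function so the input can be built without the package installed.

```python
ACTOR_ID = "timgreen/fast-news-scraper"


def make_run_input(site="bbc.com", max_items=50):
    """Build a small input; "proxy" is the only field marked Required."""
    return {
        "site": site,
        "maxItems": max_items,
        # Assumption: the standard Apify proxy object.
        "proxy": {"useApifyProxy": True},
    }


def run_scraper(token, run_input):
    """Start the Actor, wait for it to finish, and return the scraped items.

    Requires `pip install apify-client` and a valid Apify API token.
    Reads the run's default dataset; if datasetName is set in the input,
    the results may be in that named dataset instead.
    """
    from apify_client import ApifyClient

    client = ApifyClient(token)
    run = client.actor(ACTOR_ID).call(run_input=run_input)
    return list(client.dataset(run["defaultDatasetId"]).iterate_items())


run_input = make_run_input()
```

Calling `run_scraper("<YOUR_API_TOKEN>", run_input)` would start a billable run, so it is left out of the snippet itself.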

Developer
Maintained by Community
Actor metrics
  • 16 monthly users
  • 3 stars
  • 99.0% runs succeeded
  • 5.2 hours response time
  • Created in May 2024
  • Modified 2 months ago
Categories