Smart Article Extractor avatar

Smart Article Extractor

Try for free

No credit card required

View all Actors
Smart Article Extractor

Smart Article Extractor

lukaskrivka/article-extractor-smart
Try for free

No credit card required

📰 Smart Article Extractor extracts articles from any scientific, academic, or news website with just one click. The extractor crawls the whole website and automatically distinguishes articles from other web pages. Download your data as HTML table, JSON, Excel, RSS feed, and more.

Do you want to learn more about this Actor?

Get a demo
CAn I get the article in HTML format so I can paste it Wordpress?

Opened a month ago by maxorank, last comment a month ago by Milan Knoll (milunnn)

Sometimes task is not running even after i had scheduled it earlier.

Opened 2 months ago by bigdot, last comment 2 months ago by Lukáš Křivka (lukaskrivka)

Inserting article urls from file works wrong

Opened 2 months ago by meaningful_cheerleader, last comment 2 months ago by Lukáš Křivka (lukaskrivka)

Not returning any results for article on a particular website

Opened 2 months ago by ralic, last comment 2 months ago by Milan Knoll (milunnn)

Wrt previous isse

Opened 3 months ago by pzubkiewicz, last comment 3 months ago by Lukáš Křivka (lukaskrivka)

Article has too few words

Opened 3 months ago by pzubkiewicz, last comment 3 months ago by Lukáš Křivka (lukaskrivka)

Got an error "unknown exception"

Opened 4 months ago by MarkDB, last comment 4 months ago by Lukáš Křivka (lukaskrivka)

Empty results from medium.com

Opened 6 months ago by pzubkiewicz, last comment 5 months ago by Lukáš Křivka (lukaskrivka)

Needless text in results

Opened 6 months ago by pzubkiewicz, last comment 5 months ago by Lukáš Křivka (lukaskrivka)

Is it possible to crawl based on keyword?

Opened 6 months ago by assuring_fade, last comment 6 months ago by Zuzka Pelechová (zuzka)

Date to Timestamp?

Opened 6 months ago by japan_ravel, last comment 6 months ago by japan_ravel

Does not return text of articles from Russian sites

Opened 8 months ago by nxlex87, last comment 8 months ago by Lukáš Křivka (lukaskrivka)

Date Filter Not Working

Opened a year ago by dev1_labdsi, last comment a year ago by Lukáš Křivka (lukaskrivka)

pls fix doesnt work with fox news

Opened a year ago by cardinal_lobster, last comment a year ago by Lukáš Křivka (lukaskrivka)

Is it possible to set a max nr of articles per startUrl

Opened a year ago by ybierens, last comment a year ago by Míša Fialová (misa)

The tags are not always scraped properly

Opened a year ago by tblobaum, last comment a year ago by Lukáš Křivka (lukaskrivka)

Make it possible to know startURL of scraped article

Opened a year ago by ybierens, last comment a year ago by Lukáš Křivka (lukaskrivka)

Why isn't it scraping articles from the following website?

Opened a year ago by ybierens, last comment a year ago by ybierens

error article is not valid

Opened a year ago by admin.explicar, last comment a year ago by Lukáš Křivka (lukaskrivka)

failed running actor from kompas.com

Opened a year ago by admin.explicar, last comment a year ago by Lukáš Křivka (lukaskrivka)

Developer
Maintained by Apify
Actor metrics
  • 202 monthly users
  • 56 stars
  • 99.6% runs succeeded
  • 1.4 days response time
  • Created in Nov 2019
  • Modified about 2 months ago
Categories