🧪 Blog & Content Extractor (Testing Mode – $0.01 for now!)

Under maintenance

Pricing

Pay per usage

Developed by

Jeff Halverson

Maintained by Community

Please enjoy; feedback is greatly appreciated! I'll leave the price at a penny for a week or two, then switch once the bugs are worked out and the actor is polished. Once real monetization starts, I'll still undercut everyone else on the platform by half while offering a great product, since I'll base the price on reality, not greed.

0.0 (0)

0

Monthly users

4

Runs succeeded

>99%

Last modified

2 days ago

📄 Blog & Media Content Scraper

This actor crawls blogs, news, and content-heavy websites to extract valuable structured content, including:

  • ✅ Page Title
  • 🧠 Meta Description, Keywords, and Publish Date
  • ✍️ Readable article sections (smart detection of .article, .post, .entry)
  • 🖼️ Image URLs
  • 📹 Videos grouped by file type (e.g., mp4, mkv, flv, 3gp)
  • 📤 Structured JSON + CSV output via Apify dataset
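The "videos grouped by file type" feature can be pictured with a short Python sketch. This is not the actor's source code: the function name `group_videos_by_type` is hypothetical, and the extension list is an assumption copied from the examples on this page (mp4, flv, mkv, 3gp).

```python
from posixpath import splitext
from urllib.parse import urlparse

# Assumed extension set, taken from the examples on this page.
# The actor's real list may differ.
VIDEO_TYPES = ("mp4", "flv", "mkv", "3gp")


def group_videos_by_type(urls):
    """Return {extension: [urls]} with one (possibly empty) list per known type."""
    grouped = {ext: [] for ext in VIDEO_TYPES}
    for url in urls:
        # Inspect only the URL path so a query string such as ?x=1
        # does not hide the file extension.
        path = urlparse(url).path
        ext = splitext(path)[1].lstrip(".").lower()
        # Skip unknown extensions and avoid listing the same URL twice.
        if ext in grouped and url not in grouped[ext]:
            grouped[ext].append(url)
    return grouped
```

Keeping an empty list for every known type gives each dataset item the same shape, which makes downstream CSV or JSON processing simpler.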

💡 Use Cases

  • 🧠 Fine-tune GPT models with your blog or publication data
  • ✍️ Build summarizers, translators, SEO tools, or content aggregators
  • 🕵️ Monitor competitor or client site content
  • 📰 Collect articles and media for newsletters, research, or knowledge bases
  • 🔍 Turn messy blog pages into clean, machine-readable content

📦 ChangeLog

🆕 v1.0 – Full Release

  • ✅ Removed testing mode – now production-ready
  • 📹 Video extraction is now grouped by file type (e.g., mp4, mkv, 3gp)
  • 🔗 Extracts all page links into the links field
  • 🧠 Auto-tags media-rich pages if more than 30 images/videos are found
  • 📤 Includes direct CSV export links for images and videos
  • 🔍 Improved article detection and meta extraction
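Two of the changelog items, auto-tagging and CSV export links, are simple enough to sketch in Python. The names `auto_tag` and `export_links` are hypothetical. The changelog does not say whether images and videos are counted together or separately, so this sketch assumes a combined count. The URL pattern follows the Apify dataset items endpoint with `format=csv` and a `fields` filter.

```python
from urllib.parse import urlencode

# From the changelog: pages with "more than 30 images/videos" are tagged.
MEDIA_THRESHOLD = 30


def auto_tag(images, videos, threshold=MEDIA_THRESHOLD):
    """Return {"type": "media"} when the combined media count exceeds the threshold.

    `images` is a list of URLs; `videos` maps a file type to a list of URLs.
    Summing both counts is an assumption, not documented behavior.
    """
    video_count = sum(len(urls) for urls in videos.values())
    if len(images) + video_count > threshold:
        return {"type": "media"}
    return None


def export_links(dataset_id):
    """Build CSV export URLs for the images and videos fields of a dataset."""
    base = f"https://api.apify.com/v2/datasets/{dataset_id}/items"
    return {
        "videoCsv": f"{base}?{urlencode({'format': 'csv', 'fields': 'videos'})}",
        "imageCsv": f"{base}?{urlencode({'format': 'csv', 'fields': 'images'})}",
    }
```

For example, `export_links("<your-dataset-id>")` returns two URLs that can be opened in a browser or fetched with any HTTP client once the run has finished.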

📥 Input

The actor accepts the following input:

```json
{
  "startUrls": [{ "url": "https://example.com" }],
  "maxRequestsPerCrawl": 50,
  "maxConcurrency": 5
}
```

📤 Output

The actor publishes the following structured JSON output:

```json
{
  "url": "https://sample-videos.com",
  "title": "Download Sample Videos / Dummy Videos For Demo Use",
  "meta": {
    "description": "Download sample video or test video for your testing purpose. We provides you different types of video formats with different size.",
    "keywords": "download sample clip, free clip download, download demo clip, download demo video, download dummy video, download video, download sample video, download video for developer, download mp4 video, download flv video, download mkv video",
    "publishDate": null
  },
  "articles": [
    {
      "title": "Download Sample Videos",
      "link": "https://sample-videos.com/",
      "summary": "Are you looking for videos of different resolution and sizes to test while designing or developing a mobile app?",
      "author": null,
      "date": null
    }
  ],
  "images": [
    "https://sample-videos.com/images/imgw.png"
  ],
  "videos": {
    "mp4": [
      "https://sample-videos.com/video321/mp4/720/big_buck_bunny_720p_10mb.mp4"
    ],
    "flv": [],
    "mkv": [],
    "3gp": []
  },
  "links": [
    "https://sample-videos.com/index.php#sample-mp4-video",
    "https://sample-videos.com/download-sample-jpg-image.php"
  ],
  "exports": {
    "videoCsv": "https://api.apify.com/v2/datasets/iphP0fH83dPWkUFbG/items?format=csv&fields=videos",
    "imageCsv": "https://api.apify.com/v2/datasets/iphP0fH83dPWkUFbG/items?format=csv&fields=images"
  },
  "autoTag": {
    "type": "media"
  }
}
```
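
Once dataset items are downloaded as JSON, the nested `images` and `videos` fields may be easier to work with as flat rows. The following Python sketch shows one way to do that with the standard library only. The helper names `media_rows` and `to_csv` and the column names are hypothetical; they are not part of the actor.

```python
import csv
import io


def media_rows(item):
    """Yield one (page_url, kind, file_type, media_url) tuple per image or video."""
    page = item.get("url")
    for image_url in item.get("images", []):
        yield (page, "image", "", image_url)
    # "videos" maps a file type (e.g. "mp4") to a list of URLs.
    for file_type, urls in item.get("videos", {}).items():
        for video_url in urls:
            yield (page, "video", file_type, video_url)


def to_csv(items):
    """Flatten a list of dataset items into a single CSV string."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["page_url", "kind", "file_type", "media_url"])
    for item in items:
        writer.writerows(media_rows(item))
    return buffer.getvalue()
```

Passing a list of parsed dataset items to `to_csv` produces one row per media file, so empty groups such as `"flv": []` simply contribute no rows.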

Pricing

Pricing model

Pay per usage

This Actor is paid per platform usage. The Actor is free to use, and you only pay for the Apify platform usage.