Pricing

$4.99/month + usage

Go to Store

Audio and Video Transcript (OpenAI Whisper)

Try for free

Developed by

Vít Tuhý

This Actor transcribes audio or video files from publicly accessible URLs using OpenAI's Whisper API. To use this Actor, you'll need to provide your own OpenAI API key. It supports multiple languages and highly customizable parameters, enabling precise control over the transcription process.

1.9 (2)

Pricing

$4.99/month + usage

Total users

Monthly users

Runs succeeded

>99%

Issues response

30 days

Last modified

5 months ago

Videos

Automation

Audio and Video Transcript

This Apify actor transcribes audio or video files from publicly accessible URLs using OpenAI's Whisper API. To use this actor, you'll need to provide your own OpenAI API key. It supports multiple languages and highly customizable parameters, enabling precise control over the transcription process. The actor processes each provided URL, downloads the corresponding audio or video files, transcribes them via OpenAI, and securely stores the resulting transcripts in Apify's Storage under the Key-Value Store.

🚀 Features

Automatic language detection or manual language specification from an extensive list.
Capability to process multiple audio or video URLs simultaneously.
Versatile output formats including plain text, JSON, SRT, VTT, and verbose JSON.
Optional inclusion of timestamps for individual words (when using verbose JSON format).
Fine-tuning through parameters such as temperature, compression ratio thresholds, and speech detection thresholds.
Secure handling of your OpenAI API key, hidden from logs for added safety.

🔧 Input Configuration

Configure your actor with the following parameters:

Parameter	Description	Required
`url`	Array of publicly accessible audio/video file URLs	✅
`language`	Language selection or set to `Auto-detect`	❌
`temperature`	Floating-point temperature to control variability in transcription	❌
`response_format`	Desired transcript format (`text`, `srt`, `vtt`, `json`, `verbose_json`)	❌
`word_timestamps`	Include timestamps per word (only valid when using `verbose_json` format)	❌
`prompt`	Additional textual context to enhance transcription accuracy	❌
`temperature_increment_on_fallback`	Increment in temperature if the initial transcription attempt fails	❌
`compression_ratio_threshold`	Maximum allowable compression ratio for transcript acceptance	❌
`logprob_threshold`	Minimum log probability required for transcript segments	❌
`no_speech_threshold`	Probability threshold to detect segments with no speech	❌
`openai_api_key`	Your personal OpenAI API key (kept secure and hidden)	✅

📥 Example Input

{
  "url": [
    { "url": "https://example.com/sample-audio.mp3" }
  ],
  "language": "Auto-detect",
  "temperature": "0.0",
  "response_format": "text",
  "word_timestamps": false,
  "prompt": "",
  "temperature_increment_on_fallback": 0,
  "compression_ratio_threshold": 2,
  "logprob_threshold": -1,
  "no_speech_threshold": 1,
  "openai_api_key": "YOUR_OPENAI_API_KEY"
}

📤 Output

Transcription results are securely stored within Apify's Storage under the Key-Value Store. Each transcript is saved individually with an identifiable key for convenient access.

On this page

Audio and Video Transcript

Share Actor:

Tiktok Video Transcirpt Using OpenAI Whisper API

linen_snack/tiktok-video-transcirpt-using-openai-whisper-api

This Apify actor uses the OpenAI Whisper API to either transcribe Tiktok video into its original language or translate it into English. It's built to be robust, automatically handling video-to-audio conversion and compression to stay within API limits.

ius iyb

Audio & Video to Text

donjuan_mime/audio-video-to-text

Transcribes video and audio files into plain text and subtitle formats (TXT, SRT, VTT, TSV, JSON) using OpenAI's Whisper model. Supports preloaded tiny, base, and small models.

Donjuan

Audio And Video Transcriber (OpenAI GPT-4o-transcribe)

stanvanrooy6/audio-video-transcriber

Downloads videos from public URLs, extracts audio, and transcribes them using OpenAI

Stan Van Rooy

5.0

Video to Text Transcription

aizen0/video-to-text-transcription

Convert video speech to text in bulk. Supports Only Twitter/Instagram, auto-detects languages, handles large files automatically. Uses OpenAI Whisper for high accuracy.

Pratham Yadav

Instagram reel transcript

linen_snack/instagram-videos-transcipt-subtitles-and-translate

Effortlessly convert any public Instagram reels videos into accurate text, subtitles, or translations with this powerful OpenAI Whisper API actor.

ius iyb

113

Twitter subtitles transcript

linen_snack/twitter-subtitles-transcript

Effortlessly convert any public Twitter/X video into accurate text, subtitles, or translations with this powerful OpenAI Whisper API actor.

ius iyb

Text-to-Speech Generator (OpenAI voice generator)

stanvanrooy6/text-to-speech-generator-openai-voice-generator

Convert text to speech effortlessly with our OpenAI voice generator. Choose from 6 English-optimized voices, customize settings, and get high-quality audio files fast. Simple to use, integrates with your OpenAI API key.

Stan Van Rooy

5.0

Free Large Video Converter

lukaskrivka/audio-video-converter

Flexible and powerful conversion tool using the popular ffmpeg program ideal for very large video and audio files. Convert any audio or video file to a different format and adjust any settings. Automatically recognizes the source format.

Lukáš Křivka

124

Video to Text Pro🔥

marketingme/video-to-text-pro

🎬 Convert videos to text from 1000+ platforms. YouTube, TikTok, Twitter/X, Instagram... Supports 12+ languages: English, Chinese, Japanese, Korean, Spanish, French, German, Portuguese, Russian, Arabic, Hindi, Italian with translation capabilities.