Question 1

Can I transcribe a 2-hour episode?

Accepted Answer

Yes, but split it first. Our per-file cap is 30 minutes free, or 60 minutes once you sign in. For a 2-hour episode, split into two or three parts and transcribe each. Our audio splitter guide walks through how to do it in 60 seconds with ffmpeg or Audacity.

Question 2

Do I get speaker labels (host vs guest)?

Accepted Answer

Not automatically right now. Whisper itself does not do speaker diarization. If you have separate tracks per speaker (common in Riverside, SquadCast, Zencastr), upload each one separately and label them yourself in the final transcript. We are looking at adding diarization, but only when we can do it well.

Question 3

How does it handle accents and bilingual podcasts?

Accepted Answer

Whisper large-v3 was trained on 680,000 hours of multilingual audio. Non-native English, regional accents, and code-switching all work better than smaller models. For a podcast that switches between English and Spanish mid-episode, pick "Auto-detect" as the language and Whisper will follow along.

Question 4

What audio formats do you support for podcasts?

Accepted Answer

MP3, M4A, WAV, FLAC, OGG, WEBM, and AAC. Plus video files like MP4 and MOV (we extract the audio). If your podcast host gives you a download in any of these, you are set. AIFF and ALAC are not supported directly, convert to WAV first.

Question 5

Is there a per-episode word limit?

Accepted Answer

No word limit. The only limit is the file size (25 MB free, 60 MB signed in) and duration (30 min free, 60 min signed in). A typical 60-minute episode produces around 9000 to 11000 words.

Question 6

How accurate is podcast transcription compared to human transcribers?

Accepted Answer

For clean studio audio, Whisper large-v3 typically lands at 5 to 10 percent word error rate. Human transcribers are around 3 to 5 percent. For most show notes and blog repurposing work, AI is good enough. For court testimony or academic citation, hire a human.

Question 7

Will my episode be stored on your servers?

Accepted Answer

No. We pipe the audio straight to the transcription provider (Groq, with OpenAI as backup). They process it and we discard it. We never write your podcast file to our database or our object storage.

Question 8

Can I download as SRT for subtitles?

Accepted Answer

Yes. After transcription, hit the SRT download button. Use it directly in YouTube Studio, Premiere Pro, DaVinci Resolve, or any video editor.

Question 9

Do you charge per minute?

Accepted Answer

No. Transcription on Mictoo is free. We are funded by ads at the moment, with a paid Pro tier coming later for users who need longer files or batch uploads.

Question 10

My episode has explicit language. Will it get censored?

Accepted Answer

No filtering. The transcript reflects exactly what was said. If you want to edit profanity for a clean version, do that yourself after download.

Question 11

Can I edit the transcript before downloading?

Accepted Answer

Yes. There is a basic editor in the result view. Fix any wrong words, then download the edited version as TXT or SRT.

Question 12

Is podcast transcription on Mictoo compliant with GDPR?

Accepted Answer

We do not store the audio or the transcript on our servers after you leave the page. We are based in Europe, and our providers (Groq US, OpenAI US) have DPAs in place. For specific compliance questions, see our Privacy Policy or email info@mictoo.com.

Podcast Transcription
Free Podcast Transcript Generator

How it works

Drop the episode

AI does the work

Copy, download, or edit

Why podcasters use Mictoo

Long episodes are fine

Accents and crosstalk hold up

Music beds do not break it

No subscription

Your audio is not stored

AI summary for free after every episode

What podcasters actually do with the transcript

Show notes and blog posts

Episode quote cards for social

Searchable archive for your back catalog

YouTube auto-captions replacement

Accessibility transcript link

Pro tips for cleaner podcast transcripts

Strip the music intro and outro first

Export at 64 kbps mono if your raw file is huge

For interviews with bad guest audio, transcribe each track separately

Set the language explicitly for short episodes

Punctuation will be imperfect. Fix the first 10 lines, then leave the rest

Use SRT export even if you do not need subtitles

Frequently asked questions

Ready to transcribe?

Podcast TranscriptionFree Podcast Transcript Generator