Question 1

Can I transcribe a WAV for free?

Accepted Answer

Yes. Mictoo is free for files up to 60 MB. No signup needed, no watermark on exports, no upsell after the first transcription. For long studio bounces or multi-hour recordings, downsample to 16 kHz mono or re-encode to a short MP3 to stay under the cap.

Question 2

Is WAV better than MP3 for transcription accuracy?

Accepted Answer

For clean speech at any reasonable MP3 bitrate (128 kbps or above), no meaningful difference. For noisy, low-gain, or otherwise marginal recordings, WAV can sometimes recover words a low-bitrate MP3 would miss. Most podcast and interview audio falls in the first category.

Question 3

What are the best WAV settings for transcription?

Accepted Answer

16 kHz mono, 16-bit PCM is the practical sweet spot. Whisper resamples to that internally anyway. Higher sample rates and bit depths make the file larger without improving the transcript. Keep your original studio-quality WAV in your project folder, and use the downsampled version only for upload.

Question 4

Do you support 24-bit and 32-bit float WAV?

Accepted Answer

Yes. Both work directly. Internally we normalise to 16-bit before sending to the speech model, which matches what Whisper expects. The extra bit depth gives you editing headroom in your DAW, but does not change the transcript.

Question 5

Do you support Broadcast Wave (BWF) files?

Accepted Answer

Yes. BWF is a standard WAV with extra metadata chunks (bext, iXML, chna). We read the audio and ignore the metadata. The original file on your drive stays untouched, including all timecode and scene/take info.

Question 6

Will WAV files from my Zoom, Tascam, or Sound Devices recorder work?

Accepted Answer

Yes. Zoom H1n, H5, H6, H8, Tascam DR-40X, DR-100mkIII, Portacapture X8, and Sound Devices MixPre / Scorpio all default to standard or Broadcast Wave. Drop the file straight in, no conversion needed.

Question 7

What about exports from Pro Tools, Logic, Reaper, or Audacity?

Accepted Answer

All four export standard PCM WAV by default. Pro Tools and Logic typically write 24-bit at session sample rate, Reaper similar, Audacity writes whatever depth you configured. Mictoo accepts all of them as-is.

Question 8

My WAV is over the 60 MB limit, what do I do?

Accepted Answer

WAV does not compress, so size scales with sample rate, bit depth, channel count, and duration. A 30-minute stereo 24-bit 48 kHz file is around 250 MB. Three fixes, in order: (1) downsample to 16 kHz mono 16-bit, which typically drops the file 10-12x with no transcript quality loss for clean speech; (2) trim leading and trailing silence with Audacity Truncate Silence; (3) for very long files, re-encode to a 64 kbps mono MP3 just for the upload. See our compress-audio and split-audio guides for exact steps.

Question 9

Can I export SRT or VTT subtitles?

Accepted Answer

Yes. After transcription finishes you can download SRT or VTT with timestamps every few seconds. Both formats align with your original audio timeline, so they drop straight into your video editor or subtitle workflow.

Question 10

Can I get timestamps in the transcript?

Accepted Answer

Yes. The default transcript view shows segment-level timestamps you can click to jump to that moment in the audio. Download as VTT or JSON for word-level granularity, or as SRT for segment-level subtitle format.

Question 11

How accurate is the transcript for a noisy WAV?

Accepted Answer

Background noise (wind, HVAC, traffic, tape hiss) reduces accuracy noticeably. Run the WAV through Audacity → Effect → Noise Reduction or the free Adobe Podcast Enhance tool before uploading. The cleaned version typically transcribes much better.

Question 12

Will my original WAV file be changed in any way?

Accepted Answer

No. The file you upload is read by our backend, sent to the transcription provider, and discarded after the response comes back. Your original file on your computer is never modified. We never write a transformed copy back to you.

Question 13

What can I do with the transcript after it is generated?

Accepted Answer

Edit wrong words inline before exporting. Then download as TXT (plain text), SRT or VTT (subtitle format with timestamps), or DOCX (Word document). Copy directly to clipboard if you just need to paste somewhere. The AI summary appears alongside the transcript automatically.

WAV to Text
Transcribe any WAV in seconds

How it works

Upload your WAV

AI transcribes the speech

Edit and export

Why use Mictoo for WAV files

Direct WAV transcription, no manual conversion

PCM and Broadcast Wave (BWF) both work

Sample rates and bit depths we actually handle

Useful exports out of the box

Practical guidance for large WAV files

Where WAV files come from

Interviews

Podcasts

Lectures

Field recordings

DAW and studio bounces

Archival audio

Recommended WAV settings for transcription

Aim for 16 kHz mono, 16-bit PCM

Trim silence at the start and end

Keep the original WAV in your project folder

For very long files, use a temporary MP3

For noisy WAVs, denoise before upload

WAV files in plain language

Why WAV is so large

What this means for speech recognition

When uncompressed actually helps

The Broadcast Wave (BWF) variant

WAV vs other audio formats for transcription

WAV

MP3 →

FLAC →

M4A →

Frequently asked questions

Upload your WAV and get an editable transcript

WAV to TextTranscribe any WAV in seconds