mictoo
M4A · AAC · iPhone · Free

M4A to Text
iPhone Voice Memos and Apple audio

AirDrop a Voice Memo, drop a GarageBand bounce, share an Apple Podcasts download. We turn M4A files into editable transcripts with timestamps and clean exports.

AI summaryTranslate, 28 langsOpenAI Whisper

Drop your file here

or click to browse

MP3 · MP4 · WAV · M4A · OGG · WEBM · FLAC  ·  Max 25MB  ·  Max 30 min (60 min · Sign in)

Got a bigger file? See how to compress.

Got a longer recording? See how to split.

Almost every audio file your iPhone, iPad, or Mac creates is an M4A: Voice Memos, GarageBand bounces, FaceTime audio exports, podcasts saved for offline listening. Drop one in here and the transcript comes back in seconds, with timestamps you can click to jump into the audio.

No need to convert to MP3 first. We accept the M4A directly, the way Apple wrote it. Works just as well with M4A files produced outside the Apple ecosystem, like recordings from Discord, some Android apps, or web-recorder tools that pick AAC.

Free for files up to 60 MB, which covers most Voice Memos (typical 30-minute Voice Memo is around 15 MB). Long recordings? See how to compress audio or how to split audio.

How it works

📱

AirDrop or upload your M4A

From the Voice Memos app share sheet, AirDrop the file to your Mac and drag it in. Or upload directly from iPhone Safari by tapping the share icon next to the memo.

Whisper reads the AAC track

We decode the AAC audio inside the MP4 container directly. No re-encoding step. A 20-minute Voice Memo usually transcribes in 15-30 seconds end to end.

✍️

Edit and send

Fix names Whisper misheard, then export as TXT, SRT, VTT, or DOCX. Most people copy the cleaned text straight into Notes, Slack, or an email draft.

Why use Mictoo for M4A files

iPhone Voice Memos work without exporting first

You do not need to open Files, convert to MP3, or send to a different app. Share the Voice Memo to Safari, drop it on this page, done. The original Voice Memo stays in your iPhone library.

Handles all the M4A relatives

Standard .m4a from Voice Memos and GarageBand, .m4b audiobook variant, .m4r ringtone variant, and .mp4 files that contain only audio. Same AAC codec under different file extensions, same workflow.

Stereo or mono, any AAC bitrate

iPhone Voice Memos default to mono at 32 kbps in Lossy mode and stereo PCM in Lossless mode. GarageBand exports usually stereo at 256 kbps. Both transcribe with the same accuracy.

Works on M4A from outside Apple too

Discord voice notes saved as M4A on Android, web recorder tools that pick AAC for compatibility, audiobooks downloaded from libraries in M4B. All accepted.

Files are processed and discarded

Your M4A streams to the transcription provider, gets read once, gets dropped from memory. We do not save the audio to disk. The text transcript is only stored if you sign in and choose to.

Where M4A files come from

iPhone Voice Memos

Far and away the most common source. Interview recordings, song idea memos, lecture captures, voice notes to yourself. Default format is M4A, exact codec depends on your Voice Memos settings (Compressed AAC by default, Lossless ALAC if enabled).

GarageBand and Logic exports

When you export a project from GarageBand on iPhone or Mac, the default share format is M4A AAC at 256 kbps. Useful for transcribing podcast episodes you recorded in GarageBand before sending to a host.

Apple Podcasts offline downloads

Podcasts you have downloaded for offline listening in the Apple Podcasts app are stored as M4A. You can transcribe them for show notes, study material, or to search a specific quote.

FaceTime audio call exports

If you record a FaceTime audio call (through QuickTime Audio Recording or a third-party screen recorder), the audio export is M4A. Useful for transcribing remote interviews or family memory recordings.

Discord voice notes

Discord saves voice messages as M4A by default on iOS and Android. Right-click in the desktop client to download the file, then transcribe to keep a text record of important messages.

Web recorder tools and meeting exports

Many browser-based recording apps pick AAC inside MP4 (so .m4a) for cross-platform compatibility. If you exported a recording from a meeting tool and got an M4A, it works the same way as a Voice Memo here.

M4B audiobooks and downloaded study material

M4B is the audiobook variant of M4A, with chapter markers. We treat it like any M4A, transcribe the audio, ignore the chapter metadata. Useful for creating searchable text versions of educational audiobooks.

M4A tips that save time

1

Use Voice Memos at Lossy quality, not Lossless

Voice Memos has a Lossless setting hidden in Settings, Voice Memos, Audio Quality. Lossless writes ALAC inside an M4A and triples the file size with zero transcription benefit. Lossy AAC at 32 kbps mono is plenty for clean speech recognition.

2

Trim silence in Voice Memos before exporting

Open the recording in Voice Memos, tap the waveform, tap the trim icon at the top right. Drag the handles to cut leading and trailing dead air. The trimmed memo is smaller and uploads faster, with no transcript content lost.

3

For multi-hour interviews, share via Files not AirDrop

AirDrop times out on files over about 500 MB on slow connections. Save the M4A from Voice Memos to Files (iCloud Drive or local), then upload from your laptop. More reliable for long recordings.

4

If transcription returns the wrong language, set it manually

Whisper auto-detects language, but for files under five minutes or files where the speaker pauses a lot at the start, detection can mis-fire. Set the language explicitly in the language dropdown before upload.

What an M4A file actually is

M4A is, technically, a regular MP4 file that happens to contain only an audio track. The file extension is just a convention (Apple started using .m4a to distinguish audio-only MP4s from video MP4s, so iTunes and Music could filter properly). Open the same file with the .mp4 extension and most players will treat it identically.

The audio inside an M4A is almost always AAC (Advanced Audio Coding), the codec that succeeded MP3 in efficiency. Sometimes it is ALAC (Apple Lossless), which preserves audio bit-for-bit like FLAC does. Voice Memos picks based on your Audio Quality setting. GarageBand always writes AAC for shared exports. Apple Music streaming uses AAC.

The .m4a, .m4b, .m4r, .mp4 family

Same container, different file extensions, different intent. .m4a is plain audio. .m4b adds chapter markers for audiobooks. .m4r is the same as .m4a but Apple uses the extension to mark a file as a ringtone (so iTunes and Music would put it in the right place). .mp4 with only an audio track is what some non-Apple tools write instead of .m4a. Mictoo treats them all as M4A and decodes the audio normally.

Why your iPhone Voice Memo is so small

iPhone Voice Memos default to AAC at 32 kbps mono. That works out to roughly 240 KB per minute, so a one-hour interview is about 14 MB. The same hour as WAV would be 600 MB or more. AAC achieves this by removing audio information humans cannot perceive: very high frequencies, masked sounds, redundant information across channels.

For transcription this almost never matters. Whisper transcribes 32 kbps mono AAC about as well as it transcribes uncompressed WAV of the same speech. Where AAC compression starts to lose words is in heavy background noise or very quiet speech, where the encoder may have already removed the signal Whisper needed.

AAC vs ALAC inside the M4A container

If you have Voice Memos set to Lossless, the audio inside the M4A is ALAC instead of AAC. We handle both. Transcription quality is the same. The only practical difference is file size: an ALAC Voice Memo is roughly 10-15 times larger than the AAC equivalent. For everyday voice work, stick with Lossy (AAC). For situations where the audio will be processed further later (DAW import, restoration, archival), Lossless is fine but unnecessary for transcription.

M4A vs other audio formats

All four work here. Pick the page that matches the file you actually have.

M4A

What it is
AAC audio in MP4 container
Typical source
iPhone, Apple ecosystem
File size
Small (efficient AAC)
For transcription
Direct, no conversion needed

MP3 →

What it is
Older lossy codec, no container
Typical source
Podcast distribution, legacy files
File size
Slightly larger than M4A
For transcription
Same accuracy as M4A in practice

WAV →

What it is
Uncompressed PCM
Typical source
DAW, field recorders
File size
Largest by far
For transcription
Use only if you already have it

FLAC →

What it is
Lossless compressed audio
Typical source
Audiophile or archive workflows
File size
About half of WAV
For transcription
Same accuracy as WAV

Frequently asked questions

How do I send an iPhone Voice Memo to Mictoo?

Open the Voice Memos app, tap the recording, tap the share icon (square with up arrow), then tap Safari. Safari opens mictoo.com and the M4A is attached to the share. Drop it in the upload zone. Alternatively, AirDrop the file to a Mac and upload from there.

Will M4A files from GarageBand work?

Yes. GarageBand on iPhone and Mac exports finished projects as M4A AAC at 256 kbps by default. Drop the file in, the transcript comes back in seconds. Useful for podcast hosts who record in GarageBand before sending the file to a podcast host.

My Voice Memo is in Lossless mode (ALAC). Does that work?

Yes. We accept ALAC inside M4A the same way we accept AAC. Transcription quality is identical between the two. The Lossless file is just 10-15 times larger, which only matters if you are near the 60 MB upload limit.

What is the difference between .m4a, .m4b, .m4r, and .mp4?

They are all the same container (MP4) with different extensions to signal intent. .m4a is plain audio. .m4b is an audiobook with chapter markers. .m4r is a ringtone. .mp4 with only an audio track is what some tools write instead. We handle all four the same way.

Can I transcribe a podcast I downloaded in the Apple Podcasts app?

In principle yes, but extracting the file is the catch. Apple Podcasts stores downloaded episodes in a protected location and the in-app share sheet does not always expose them as M4A. Easiest path: re-download the episode directly from the podcast feed URL in Safari, then upload here.

Why does my M4A not play on Windows?

Older Windows versions do not include an AAC codec by default. The M4A itself is fine, you just need a player that can decode AAC (VLC works everywhere, free). For transcription this does not matter, we decode the AAC ourselves on the server.

Will Discord voice messages work?

Yes. Discord saves voice messages as M4A on iOS and Android. In the Discord desktop client, right-click the voice message and choose Download. Then upload the saved file here. Mobile Discord lets you forward the file to a notes or files app, then upload.

My recording is too long for the 60 MB limit, what now?

Three options, in order of effort. (1) Trim leading or trailing silence in Voice Memos using the trim handles. (2) Re-export at Lossy quality (Settings, Voice Memos, Audio Quality) if it is currently Lossless. (3) Split the file in two using the Voice Memos edit feature and upload each half separately.

How accurate is the transcription for a noisy Voice Memo?

Background noise (cafe chatter, car interior, wind) hurts accuracy noticeably. For important recordings, use the Adobe Podcast Enhance free web tool to clean the audio before uploading. Set the iPhone microphone close to the speaker (under 30 cm) for cleaner recordings to start with.

Will my iPhone Voice Memo be deleted from my phone?

No. Uploading a Voice Memo here only sends a copy to the transcription service. Your original recording stays in the Voice Memos app on your iPhone, in iCloud sync if you have that enabled, and nowhere else changes.

Can I get the transcript with speaker labels (interviewer vs guest)?

Not yet in the free tool. Whisper does not separate speakers out of the box. If you have separate Voice Memo recordings for each speaker (each person recorded on their own iPhone), transcribe them separately and label by hand. Speaker diarization is on the roadmap for the future Pro tier.

What languages does M4A transcription support?

Over 50 languages with automatic detection. For short Voice Memos (under five minutes) where detection sometimes wobbles, pick the language explicitly in the dropdown before uploading. Common picks: English, Spanish, French, German, Portuguese, Italian, Russian, Polish, Japanese, Korean.

Drop an M4A and get the text back

iPhone Voice Memos, GarageBand exports, FaceTime recordings, Apple Podcasts downloads. All work.

Transcribe an M4A now