Zoom recording mechanics, in detail
Zoom has two recording modes with quite different file outputs and storage paths. Understanding which one your meeting used is the only friction step in getting the recording to Mictoo.
Cloud Recording (Pro and higher)
When a meeting is recorded to Cloud, Zoom stores the recording on its own servers under your account quota. To download: log in to zoom.us, open Recordings in the left menu, find the meeting (sorted by date), click the title, and the file list appears. You will see:
- Shared Screen with Speaker View / Gallery View: MP4 video, typically 100-500 MB per hour
- Audio Only: M4A file, typically 15-30 MB per hour
- Chat File: TXT of the in-meeting chat
- Closed Caption / Transcript: VTT or TXT if Zoom audio transcript is enabled on your account
- Audio Transcript: separate JSON if AI Companion summaries are enabled
For Mictoo, download the Audio Only M4A. It is by far the smallest file and transcribes with identical quality.
Local Recording (any account, including Free)
Free Zoom accounts can record locally to the host machine with the Record button → Record on this Computer option. File path:
- Mac:
~/Documents/Zoom/<date> <meeting-name>/ - Windows:
Documents\Zoom\<date> <meeting-name>\ - Linux:
~/Documents/Zoom/<date> <meeting-name>/
Inside that folder: zoom_0.mp4 (video),audio_only.m4a (audio extracted), andchat.txt. Upload the M4A directly.
Cloud Recording quota and what to delete
Zoom Cloud Recording storage is metered per tier (Pro: 5 GB; Business: 5 GB per user; Enterprise: 10 GB+ per user depending on contract; unlimited cloud storage is an add-on). When you hit quota, new recordings either replace old ones or fail to save depending on settings.
If quota matters: download the M4A, transcribe with Mictoo, then either keep the M4A locally (no cloud quota) or delete the entire Cloud Recording (it is now redundant with your local copy + Mictoo transcript). The Zoom auto-transcript JSON / VTT files inside the recording also count toward quota; deleting them reduces footprint without losing the recording itself.
Built-in Zoom audio transcript vs Whisper
Zoom audio transcript (the VTT inside Cloud Recording) is generated by Zoom internal ASR pipeline. It includes speaker labels (a meaningful advantage) but is noticeably weaker than Whisper large-v3 on proper nouns, technical terms, and accented speakers. For a quick scan with speaker labels, the built-in transcript is fine. For accuracy when the transcript is the actual deliverable (article, citation, training material), Whisper output is cleaner.
Zoom AI Companion summary vs Mictoo AI summary
Zoom AI Companion is a separately licensed add-on that generates meeting summaries inside Zoom. The quality is reasonable for the most common meeting shape (status check-ins, project syncs). Mictoo AI summary runs over the Whisper transcript and tends to be more detailed because the underlying transcript is cleaner. The trade-off: Zoom AI Companion appears inside your Zoom workflow automatically; Mictoo requires the upload step.
Speaker layout and audio quality
The recording layout you picked (Speaker View, Gallery View, side-by-side) affects the video composition only. The audio track is the same across layouts. Audio quality in Zoom is constrained by the original mic setup, network conditions, and Zoom Original Sound for Musicians option (which preserves higher-fidelity audio if enabled before recording).