mictoo
OGG · Vorbis · Opus · .oga

OGG to Text
Telegram voice and open-source audio

Drag a Telegram voice export, a Linux desktop recording, or any OGG file with Vorbis, Opus, or FLAC inside. We figure out the codec and return an editable transcript.

AI summaryTranslate, 28 langsOpenAI Whisper

Drop your file here

or click to browse

MP3 · MP4 · WAV · M4A · OGG · WEBM · FLAC  ·  Max 25MB  ·  Max 30 min (60 min · Sign in)

Got a bigger file? See how to compress.

Got a longer recording? See how to split.

OGG is a container that can hold several different audio codecs. Vorbis is the classic. Opus is the modern voice-friendly one that powers Telegram voice messages and most WebRTC calls. FLAC also lives inside OGG sometimes. We detect what is in the file automatically and decode the right way.

Most people who land here are transcribing a Telegram voice message (usually saved with a .oga extension) or audio from a Linux recording app. Both work directly, no conversion to MP3 first. The transcript comes back with timestamps and exports as TXT, SRT, VTT, or DOCX.

Free for files up to 60 MB. Opus voice messages are very small (a 10-minute Telegram voice memo is around 1 MB), so the cap is rarely an issue here.

How it works

🐧

Drop the OGG (any codec)

Vorbis, Opus, or FLAC inside an OGG container, file extensions .ogg, .oga, or .opus. We sniff the actual codec from the file headers, you do not need to know in advance.

🎤

We decode and transcribe

Our backend pulls the audio out of the OGG container, decodes the codec (Vorbis or Opus is most common), and feeds the raw audio to Whisper. Adds a second or two over a WAV.

💬

Read, edit, share

Open the transcript inline, fix any wrong names, copy the cleaned text into Telegram, email, or Notes. Export TXT, SRT, VTT, or DOCX when you need a file.

Why use Mictoo for OGG and .oga files

Telegram voice messages work directly

When you export a Telegram voice message from the desktop or web client, the file extension is usually .oga (Ogg Audio). The codec inside is Opus. Drop the .oga file here as-is. No rename to .ogg needed, no conversion to MP3 needed.

All three OGG codecs supported

Vorbis (the original, late 90s), Opus (2012, voice-optimised, what Telegram and WebRTC use), and FLAC inside OGG (sometimes used in archival workflows). We detect the codec and decode appropriately.

Opus voice quality is genuinely good

Opus at the bitrates Telegram uses (~32 kbps for voice) sounds clean and transcribes well. Whisper handles Opus as accurately as it handles uncompressed audio of the same speech.

Plays nice with Linux audio workflows

Many Linux desktop recording apps, podcast publishing tools, and Mastodon voice posts use OGG. We accept all of them. Same for KDE Connect audio shares, Wikipedia audio samples, and free-software podcast networks.

No tracking, no signup

Upload, transcribe, copy. The file is decoded on the server, the audio is dropped from memory after transcription. We do not log the audio or save it to disk.

Where OGG and .oga files come from

Telegram voice messages

By far the most common source landing on this page. Telegram Desktop and Web save voice messages as .oga (Opus inside OGG). Useful for keeping a written record of long voice notes from friends, family, or work groups.

Linux audio recordings

GNOME Sound Recorder, KDE Audiotube, and similar Linux desktop apps default to OGG (usually Opus or Vorbis). Common for free-software-aligned podcasters, journalists, researchers.

Mastodon and Fediverse voice posts

Some Fediverse audio posting apps use OGG Opus for compatibility with the broader free-software ecosystem. Drop the file here to transcribe for accessibility or your own archive.

Wikipedia and archive.org audio samples

Wikipedia pronunciation guides, IPA samples, and short historic recordings are usually OGG Vorbis (sometimes FLAC inside OGG). Useful for researchers and language students transcribing reference audio.

Discord voice on certain platforms

Discord chose AAC for its mobile voice messages but uses Opus for live voice channels. Older or third-party Discord export tools sometimes produce OGG files.

WebRTC call recordings

Many browser-based meeting and call recorders save audio as Opus inside OGG (or WebM, see below). Mictoo handles both directly.

OGG and Opus tips worth knowing

1

Rename .oga to .ogg only if a player complains

Both extensions are valid OGG containers. Some media players insist on .ogg for legacy reasons. Mictoo accepts both, so no rename needed for transcription. Only rename if you also need to play the file in a stubborn third-party app.

2

For Telegram exports, use Desktop or Web, not mobile

Mobile Telegram does not always expose voice messages as downloadable files. Telegram Desktop and Telegram Web both let you right-click a voice message and save it. From there, drop directly here.

3

Opus already optimises for voice, no pre-processing needed

Unlike WAV or MP3 where downsampling helps, Opus is already engineered for voice at low bitrates. The file is small (10-minute voice memo is around 1 MB), the audio is clear, just upload and go.

4

If your OGG is actually a video container, see /webm-to-text

Some screen recorder tools confusingly save audio-plus-video files with .ogg or .ogv extension. If your file has video, use the WebM page instead, we extract just the audio there too.

OGG is a container, not a codec (and why that matters)

A common confusion: "OGG audio quality" does not really exist as a single thing. OGG is a container, like MP4 or WebM. The quality depends on what codec is packed inside. Three codecs account for almost every OGG file in the wild: Vorbis (the original lossy audio codec, late 1990s), Opus (the modern voice-and-music codec, 2012), and FLAC (lossless, sometimes shipped inside OGG instead of bare).

That is why your friend can send you a "voice message OGG file" from Telegram and it is dramatically smaller than the OGG audio sample you download from Wikipedia: Telegram is using Opus at 32 kbps, Wikipedia is using Vorbis at 128 kbps or FLAC inside OGG. Same container, different codec, very different file sizes.

The Xiph.org and open-source story

OGG and its codecs were created by Xiph.Org Foundation as an explicit open-source alternative to the patent-encumbered MP3 and AAC formats that dominated the early 2000s. The original promise: high-quality audio nobody owes royalties on. Vorbis shipped in 2000 and quickly became the de-facto choice for free-software desktops, games, and projects allergic to MPEG patents.

By 2012 the world had moved on. Patents on MP3 expired, AAC became universal in mobile, and Xiph released Opus as a new general-purpose codec, this time co-designed with the IETF specifically for low-latency voice. Opus replaced Vorbis in most new applications. Today, almost any new OGG file you encounter contains Opus, not Vorbis.

Why Telegram chose .oga and Opus

Telegram needed a voice codec that was small enough to send quickly on slow networks, clear enough to be useful for spoken words, and free of patent claims they would have to license. Opus inside OGG was the obvious pick. The .oga extension (Ogg Audio) is the official IANA-registered file extension for OGG files containing only audio (vs .ogv for video or .ogg historically for either). In practice, most players treat .oga and .ogg as the same thing for audio-only files.

FLAC inside OGG: an unusual but real combination

FLAC is normally shipped as bare .flac. But the FLAC specification also defines a way to pack the same lossless audio inside an OGG container (file extension .oga or .ogg). Some archival workflows prefer this because OGG containers are seekable in a slightly different way that some legacy archival tools handle better. Mictoo decodes FLAC inside OGG the same way as bare FLAC, no extra step needed.

OGG vs other audio formats

OGG is the container most associated with open-source and Telegram voice. Here is how it stacks up.

OGG (Opus)

Codec
Opus (or Vorbis/FLAC)
Typical source
Telegram voice, Linux apps
File size
Very small (~1 MB / 10 min)
For transcription
Direct, clean accuracy

M4A →

Codec
AAC (or ALAC)
Typical source
iPhone Voice Memos, Apple
File size
Small
For transcription
Same accuracy as OGG

MP3 →

Codec
MP3 (no container)
Typical source
Podcasts, general audio
File size
Small
For transcription
Same accuracy at 128 kbps+

WebM →

Codec
Opus + VP8/VP9 video
Typical source
Loom, OBS, browser recording
File size
Bigger (often contains video)
For transcription
We strip video, decode audio

Frequently asked questions

My Telegram voice export has .oga extension, will it work?

Yes. .oga is the official extension for OGG files that contain only audio (Telegram voice messages, podcasts shipped as audio-only OGG). No rename to .ogg needed. We accept the .oga file as-is and decode the Opus codec inside.

How do I save a Telegram voice message to a file?

Easiest path is Telegram Desktop or Web (not mobile). Right-click the voice message in the chat and choose Save As. The saved file is an .oga (Opus inside OGG). Drop it in here. Mobile Telegram does not expose voice messages as files reliably.

What codecs inside OGG do you support?

All three common ones: Vorbis (the original, used in older OGG files and Wikipedia samples), Opus (modern, used by Telegram and most new OGG files), and FLAC inside OGG (sometimes used in archival workflows). We detect the codec from the file header automatically.

Is Opus better than MP3 for voice quality?

At the same bitrate, yes, especially at low bitrates. Opus at 32 kbps mono sounds noticeably better than MP3 at the same bitrate, which is why voice apps standardised on Opus. For transcription specifically, Whisper handles both with similar accuracy.

Why is my Telegram voice file so small (under 1 MB)?

Telegram uses Opus voice mode at very low bitrates (~32 kbps mono) because voice messages need to send quickly on slow mobile networks. A 10-minute voice memo lands around 1 MB. Quality is fine for transcription, no pre-processing needed.

Can I transcribe OGG files from Wikipedia or other reference sources?

Yes. Wikipedia pronunciation samples, IPA reference recordings, and archive.org audio are typically OGG Vorbis or FLAC inside OGG. All work. Useful for language students, researchers, and people building speech datasets.

My OGG plays in VLC but not in iPhone Voice Memos, why?

iPhone Music and Voice Memos do not include OGG codecs by default. The file is fine, it just needs a player that knows OGG (VLC on iOS works, free). For transcription this does not matter, we decode OGG ourselves on the server.

What is the difference between .ogg, .oga, .opus, and .ogv?

All four are OGG containers. .ogg is the historical generic extension (audio or video). .oga is the modern extension for audio-only OGG. .opus is a special case for Opus-codec-only OGG files. .ogv is for video. Mictoo accepts .ogg, .oga, and .opus directly. For .ogv with audio inside, see our WebM page.

Will my Telegram voice message be deleted after upload?

No. We never modify or delete the file you uploaded. Your original Telegram voice message stays in the chat, your saved .oga copy stays on your computer. We only read a copy to transcribe it, then drop it from memory.

Can I transcribe a long Telegram voice message in another language?

Yes, over 50 languages with auto-detect. For very short voice messages (under five minutes) where detection sometimes mis-fires on the first few seconds of silence, set the language manually in the dropdown.

Does the transcript come back instantly for a small Opus file?

Almost. A 10-minute Telegram Opus voice message (around 1 MB) usually finishes transcribing in 5-15 seconds total, including upload. The OGG decode step adds maybe a second over a WAV of the same audio.

I have an OGG file with FLAC codec inside. Does that work?

Yes. FLAC inside OGG is detected automatically and decoded the same way as a bare .flac file. Most users will not have this combination unless they came from an archival workflow that prefers OGG-wrapped FLAC.

Send your OGG or .oga, get the text

Telegram voice messages, Linux recordings, Wikipedia samples. Free up to 60 MB.

Transcribe an OGG now