WhatsApp Audio Format: Opus, MP3 & AAC Explained (2026)

WhatsApp records every voice note as an .opus file — Opus codec, 16 kHz mono, approximately 16 kbps. That explains why the file is tiny, why it sounds perfectly clear inside WhatsApp, and why it refuses to play in Windows Media Player or older car stereos. You can convert it to MP3, but if what you actually want is the spoken content as searchable, shareable text, that is transcription — a different tool entirely.

WhatsApp voice note waveform showing Opus codec badge alongside MP3 and AAC format badges on a neutral background

What audio format does WhatsApp use?

Every voice message recorded inside WhatsApp — on iOS or Android — is encoded as Opus and wrapped in an OGG container. The file extension you see when you export a chat is .opus. This is not an accident or a quirk; it is a deliberate engineering choice that Meta made when WhatsApp was a voice-heavy messaging app competing on battery life and data cost.

Opus codec

Opus is an open, royalty-free audio codec standardised as RFC 6716 (IETF, 2012). It is engineered for voice-over-IP: excellent intelligibility at bitrates as low as 6–16 kbps, very low algorithmic delay (as little as 2.5 ms), and a variable bitrate mode that handles silence cheaply. WhatsApp encodes voice notes at approximately 16 kbps, 16 kHz sample rate, mono channel — enough for clear speech with a file that stays well under 200 KB for a one-minute note.

Anatomy of a WhatsApp opus audio file: OGG container, Opus codec RFC 6716, 16 kHz mono, 16 kbps, ~120 KB per minute

The 16 kHz sample rate is worth understanding. Human speech lives almost entirely in the 300 Hz–3,400 Hz band — the classic "telephone band". WhatsApp captures up to 8 kHz (the Nyquist limit of a 16 kHz sample rate), which covers everything you need for clear voice intelligibility and throws away the frequencies above that, keeping the file small.

What this means practically: a one-minute voice note in WhatsApp is typically 110–160 KB as .opus. The same one minute as an MP3 at 128 kbps would be around 960 KB. As AAC at 256 kbps it would be around 1.9 MB. Opus at 16 kbps is roughly 6× smaller than MP3 and 12× smaller than typical AAC — and it sounds almost identical to both for speech.

Bar chart: WhatsApp opus audio 120 KB vs MP3 128 kbps 960 KB vs AAC 128 kbps 720 KB for one minute of speech

The catch is compatibility. Opus/OGG is natively supported in Chrome, Firefox, and Android. It is NOT natively supported in Safari (iOS/macOS) outside of the WhatsApp app itself, Windows Media Player, iTunes, or most consumer audio hardware. That is the reason people want to convert: not because the audio quality is lacking inside WhatsApp, but because the file will not play in the places they want to send or archive it.

Matrix showing whatsapp opus format support: Chrome, Firefox, Android yes — Safari, iTunes, Windows Media Player no

Opus vs MP3 vs AAC — the real differences

Here is what the three formats actually trade off against each other. I am using typical WhatsApp-relevant bitrates and use cases, not audiophile music scenarios.

Comparison table of Opus vs MP3 vs AAC: compression, compatibility, audio quality and use case for each format
Opus vs MP3 vs AAC for WhatsApp voice notes
PropertyOpus (WhatsApp default)MP3AAC
Typical bitrate (voice)~16 kbps128–192 kbps96–256 kbps
File size (1 min speech)~120 KB~960 KB~720 KB – 1.9 MB
Voice intelligibilityExcellent (VoIP-optimised)Excellent at 128+Excellent at 96+
Native browser supportChrome, Firefox, AndroidNearly universalNearly universal
macOS / iOS (Safari)Not natively supportedSupportedSupported (preferred)
Windows Media PlayerNot supportedSupportedSupported
Editing / DAW supportLimited (via FFmpeg import)UniversalVery wide
Transcription engine supportExcellent (Deepgram, Whisper)ExcellentExcellent
LicenceRoyalty-free (IETF RFC 6716)Historically patentedPatented (Via Licensing)
Best use caseVoIP, streaming, WhatsAppUniversal sharing, archivingApple ecosystem, streaming

The important thing to notice: converting .opus to MP3 does NOT improve audio quality. The original Opus file was recorded at 16 kHz, 16 kbps. Converting it to MP3 at 128 kbps encodes the same audio at a higher bitrate — but the audio information that was discarded during the original Opus encoding is gone permanently. You get a larger file with the same underlying audio, but now in a format that plays everywhere. That is the only reason to convert.

How to convert WhatsApp .opus to MP3

Before you can convert the file, you need to get it out of WhatsApp. The .opus file lives inside the ZIP that WhatsApp generates when you use the Export Chat function. Export the chat with media (so voice notes are included), then extract the ZIP to find files named something like PTT-20260501-WA0003.opus. Those are your voice notes.

Four-step flow diagram for converting WhatsApp opus voice notes to MP3 using VLC or FFmpeg after exporting the chat ZIP
  1. Export the WhatsApp chat with media

    On iPhone: open the chat → tap the contact or group name at the top → scroll to the bottom → Export Chat → Including Media → Save to Files. On Android: open the chat → three-dot menu → More → Export Chat → Include Media → share to your preferred storage. You need "Including Media" — without it the voice note files are not in the ZIP.

  2. Extract the ZIP and locate the .opus files

    Move the ZIP to your desktop (drag from iPhone Files via Finder if you used iCloud Drive, or cable-transfer for Android). Unzip it. Inside the flat folder you will find _chat.txt and a collection of media files. Voice notes have the .opus extension and names beginning with PTT- (Push-To-Talk). Note that in newer WhatsApp versions on Android the voice notes inside the ZIP may also appear as .opus but with a different naming convention — same format regardless.

  3. Convert with VLC (easiest, no install on most machines)

    VLC Media Player supports .opus natively and can batch-convert to MP3. On Windows or macOS: open VLC → Media menu → Convert/Save → Add the .opus file(s) → Convert/Save button → pick "Audio - MP3" as the profile → set a destination file path → Start. VLC re-encodes the audio from Opus to LAME MP3. For a single voice note this takes a few seconds. VLC is free, open-source, and does not require an account or internet connection for the conversion.

  4. Alternative: FFmpeg via command line (best for batches)

    If you are comfortable with a terminal, FFmpeg converts an entire folder of .opus files to MP3 in a single command. Install FFmpeg from ffmpeg.org, then run: for f in *.opus; do ffmpeg -i "$f" "${f%.opus}.mp3"; done in the folder containing the .opus files. FFmpeg is free and handles any batch size. On macOS you can install it via Homebrew (brew install ffmpeg). On Windows, download the static binary from ffmpeg.org and add it to your PATH.

  5. Verify playback before discarding originals

    Open one of the converted .mp3 files in the player you need it to work in (iTunes, Windows Media Player, car stereo via USB, etc.) and confirm it plays correctly. Keep the original .opus files until you have verified the conversions — re-export from WhatsApp is the only recovery path if something goes wrong, and you may not want to repeat the export.

A note on online converters: there are dozens of browser-based .opus-to-MP3 tools. They work, but you are uploading personal voice notes from private conversations to a third-party server. For casual recordings that is probably fine; for anything sensitive — business conversations, legal discussions, medical details — use a local tool (VLC or FFmpeg) that never sends your audio anywhere.

Bitrate and quality — why WhatsApp audio sounds the way it does

Chart showing audio quality versus bitrate with WhatsApp Opus at 16 kbps, MP3 at 128 kbps, and AAC at 192 kbps

WhatsApp voice notes sound clear and natural inside the app but often sound slightly "telephone-like" when played on speakers or through headphones. That is the 16 kbps ceiling doing its work. Here is why.

Bitrate

Bitrate is the number of bits used to represent one second of audio. A higher bitrate means more data per second, which generally means more frequency information preserved, less audible compression artefact, and a larger file. For audio, bitrate is measured in kilobits per second (kbps). At 16 kbps, Opus can represent voice speech very intelligibly — but it discards most of the audio above roughly 8 kHz and uses aggressive perceptual coding below that to stay within the budget.

The tradeoff WhatsApp made is a deliberate product decision. A one-minute voice note at 16 kbps is roughly 120 KB. At 128 kbps MP3 it would be 960 KB — 8× larger. For a messaging app used heavily in regions with expensive or slow mobile data, that difference matters enormously. Opus at 16 kbps also sounds better than MP3 at 16 kbps for speech, because Opus was specifically designed for this scenario. An MP3 at 16 kbps sounds quite degraded; Opus at 16 kbps is surprisingly listenable.

What you can and cannot hear at 16 kbps Opus:

For transcription accuracy, 16 kbps Opus is workable but noticeably more fragile in noisy conditions. In my own testing with Deepgram Nova-3, clean quiet recordings produce word error rates in the 5–10% range, while the same voice notes recorded in a moving vehicle or a loud café can push toward 25–30% WER. Converting the file to a higher-bitrate MP3 does not improve the transcription result, because the frequency information discarded during Opus encoding is not recovered by the conversion.

Don't want the file — want the words?

Everything above is about the audio file as a file — its format, its codec, how to convert it so it plays somewhere. But a lot of people searching "WhatsApp audio format" are not really trying to play the file anywhere. They want the spoken content as text: for a record, for search, for a legal document, for a quick summary of a long voice thread.

Converting .opus to MP3 gets you a different file. It does not get you the words. That is transcription, and it is a completely different operation.

Decision split: convert whatsapp audio to MP3 for playback compatibility vs transcribe with ChatToPDF to extract words

The way ChatToPDF handles this is by running transcription inline on the voice notes that are inside your exported chat ZIP — the same ZIP you would use for any WhatsApp-to-PDF conversion. You do not have to extract the files, run FFmpeg, upload them to a transcription API, and stitch the output back into a document. You upload the ZIP once, pick the $49 Premium+Voice per chat tier, and the resulting PDF includes every voice note transcribed in-place, at the correct timestamp, attributed to the correct sender. The transcript is inside the PDF alongside the surrounding text messages, so the context is preserved.

If what you wanted was "I have a WhatsApp export with 40 voice notes and I want to know what was said in all of them without listening to each one" — that is the tier and the path.

The distinction between conversion and transcription is worth being clear about because the search intent behind "WhatsApp audio format" splits roughly three ways:

  1. Format curiosity — "I just want to know what .opus is" (answered above)
  2. Compatibility fix — "I want the file to play somewhere it doesn't currently play" (convert to MP3, steps above)
  3. Content extraction — "I want the words" (transcription, not conversion)

If you are in category 3, conversion will not help you. Transcription will.

FAQ

Can I play .opus files from a WhatsApp export?

Yes, with the right player. Chrome browser plays .opus natively — drag the file onto a Chrome tab. Firefox also plays .opus natively. VLC Media Player on Windows or macOS plays .opus. What does NOT play .opus natively: Safari, iTunes, Windows Media Player, QuickTime, and most consumer audio devices connected via Bluetooth or USB. If you need to play the file in those environments, convert to MP3 or AAC first using VLC or FFmpeg (free tools, no account required).

Why does WhatsApp use Opus instead of MP3?

Three reasons. First, Opus is royalty-free (MP3 was encumbered by Fraunhofer patents until 2017 and AAC still has licensing costs); using Opus avoids paying per-user codec royalties at WhatsApp's scale. Second, Opus was specifically designed for real-time voice-over-IP: it encodes faster than real-time even on modest hardware, has very low latency, and sounds excellent for speech at low bitrates. Third, the resulting files are dramatically smaller than MP3 at equivalent perceptual quality for speech — important for users on metered mobile data. WhatsApp's engineers made the pragmatic call: Opus wins on all the dimensions that matter for a messaging app. Full technical specification is available at opus-codec.org.

Does converting .opus to MP3 improve audio quality?

No. Converting .opus to MP3 does not recover any audio information. The Opus encoder discarded the frequencies above 8 kHz and applied perceptual coding to everything below when the voice note was recorded. That information is gone. Converting to MP3 at 128 kbps re-encodes the same audio with different maths and produces a larger file, but the underlying audio quality is identical. You might theoretically introduce a small additional quality loss from the second encoding step (called "generation loss"), though at 128 kbps MP3 this is usually inaudible. Convert only because you need a specific playback format — not expecting a quality improvement.

How do I get the text of a WhatsApp voice note?

That is transcription, not format conversion. The standard approach is: export the WhatsApp chat as a ZIP (Including Media so voice notes are in the ZIP), then either use a dedicated transcription service or upload the ZIP to ChatToPDF and choose the $49 Premium+Voice per chat tier. ChatToPDF runs Deepgram Nova-3 on every .opus file in the ZIP and inserts the transcript at the correct position in the chat timeline, attributed to the sender, in the final PDF. You get the full conversation — text messages and voice notes — as a single searchable document. The transcribe WhatsApp audio guide covers the full pipeline in detail.

What is the difference between .opus and .ogg files from WhatsApp?

Technically, .opus is Opus audio wrapped in an OGG transport container — so a .ogg file containing Opus audio and a .opus file are essentially the same thing with a different file extension convention. WhatsApp exports voice notes with the .opus extension on most versions, but some older WhatsApp Android versions exported them as .ogg. Both are the same codec and same container; the extension is the only difference. VLC, FFmpeg, and modern browsers handle both identically.

Key takeaways

  • WhatsApp records every voice note as Opus codec, wrapped in an OGG container, at approximately 16 kHz, 16 kbps — this keeps files tiny (around 120 KB per minute) and intelligible, but limits playback compatibility outside of Chrome, Firefox, and Android.
  • Opus is not MP3: .opus files do not play natively in Safari, iTunes, Windows Media Player, QuickTime, or most consumer audio hardware — that is the practical reason to convert.
  • Converting .opus to MP3 does NOT improve audio quality; it changes the format so it plays in more places, but the original 16 kHz audio information is already baked in. Use VLC or FFmpeg (both free, both local) for privacy-safe conversion.
  • Bitrate tells the story: Opus at 16 kbps is 8× smaller than MP3 at 128 kbps for speech, with equivalent intelligibility — a deliberate WhatsApp engineering decision for low-data-cost markets.
  • If your real goal is extracting the spoken content as text, that is transcription — not format conversion. Converting to MP3 does not give you the words.
  • ChatToPDF's $49 Premium+Voice per chat tier transcribes every voice note inline inside your exported chat, so you get a single searchable PDF with text messages and voice transcripts in conversation order.
  • Internal links: transcribe WhatsApp audio full guide · WhatsApp voice to text · best WhatsApp transcription tools · WhatsApp to PDF conversion overview
Paul, founder of ChatToPDF
Paul · ChatToPDF

I'm Paul. I built ChatToPDF after watching a friend try to print a 4-year-old WhatsApp chat across forty-something one-page PDFs. I write here about exporting WhatsApp chats, converting them to PDF, transcribing voice notes, and the messy edge cases nobody else writes about (40,000-message export limits, broken emojis, RTL Arabic, Samsung Secure Folder).

Published 2026-05-21