Should I use Zoom's built-in audio recording instead of extracting from video?

Yes, if you configure Zoom to record audio separately. In Zoom's Recording settings, you can enable separate audio file recording, which produces an M4A file of the meeting audio in addition to the MP4 video. This M4A is already the audio track without the video processing overhead, and it skips the extraction step. However, Zoom's separate audio file is an AAC-compressed M4A, while the Video Audio Extractor produces an uncompressed WAV. For quality-conscious post-production, extracting from the MP4 gives you a WAV that is a better editing master.

Why does my podcast audio sound different from what I heard during the recording?

Several factors can cause this discrepancy. Zoom and other video call platforms apply real-time noise suppression, echo cancellation, and automatic gain control during the call, but the local recording may bypass some of these processing steps. You may also be hearing artifacts from internet connection quality during the call that were present in real time but less noticeable in the moment. The recording captures what was transmitted and received, which may include compression artifacts, not what your microphone originally captured. Dedicated podcast recording platforms like Riverside.fm avoid this by recording locally on each participant's device.

How large will my podcast MP3 files be?

At 128 kbps mono, a podcast episode is approximately 1 MB per minute. A 45-minute episode will be about 45 MB as a 128 kbps mono MP3. At 192 kbps stereo, the same episode would be approximately 65 MB. These are typical sizes for podcast episodes uploaded to hosting platforms. Most podcast hosts charge based on storage and bandwidth, so keeping episodes at 128 kbps mono is both the quality-appropriate and cost-efficient choice for speech podcasts.

video6 min readvideo-audio-extractor

Save Podcast Audio from Video Files

By the WikiPlus Editorial Team

Researched with the help of AI tools, edited and reviewed for accuracy by Sergio Robles (Founder, WikiPlus).

Published November 22, 2025Last reviewed May 23, 2026

Millions of podcasters record their shows on video platforms — Zoom, Riverside.fm, Squadcast, Google Meet — and need to publish audio-only episodes to podcast platforms. The video file is the source; the MP3 is the distribution format. Getting from one to the other involves extracting the audio track, cleaning it up, and encoding it correctly for podcast hosting. This guide walks through the complete workflow, from video file to published podcast episode, using free browser-based tools.

Why Podcasters Record on Video Platforms

Recording a podcast via Zoom or a dedicated remote recording platform like Riverside.fm or Squadcast has become standard practice for remote interviews and co-hosted shows. These platforms provide synchronized recording, automatic level balancing, and a backup stream in case the connection drops. The video recording also serves as a second content format: the same session becomes a YouTube video and a podcast episode. The challenge is that these platforms output video files (MP4) even when the visual component is not the primary product. Podcast hosts need audio files. The first step in the post-production workflow is therefore always extracting the audio from the video. Recording quality on these platforms has improved dramatically. Riverside.fm and Squadcast record each participant's audio locally on their own device and upload after the session, avoiding the compression and dropout artifacts that used to plague remote podcast recordings. The extracted audio from these local recording tools is genuinely broadcast-quality, making post-processing simpler. Zoom recordings, by contrast, mix all participant audio in real-time and record the mixed stream. This means you cannot separate individual speakers after the fact. For podcast production where you want to edit each speaker individually, Zoom's mixed audio is a limitation. Dedicated podcast recording platforms are preferable specifically because they provide separate audio tracks per participant. For most podcasters, however, Zoom is what their guests have and are comfortable with, making it the pragmatic choice despite the mixed-audio limitation.

Extracting Audio from Your Podcast Recording

The extraction step is the first step in your podcast production workflow. Here is how to do it efficiently. Open the WikiPlus Video Audio Extractor in your browser. The tool is free, requires no account, and processes your file locally — your recording never leaves your device. Load your video file. Zoom local recordings are MP4. Riverside.fm provides separate MP4 files per participant for video, plus separate WAV files per participant for audio (if you configure this in Riverside's settings). If you used Riverside's dedicated audio recording, you may already have a WAV and can skip the extraction step. For a Zoom MP4, the extraction yields a single mixed WAV containing all participant audio and any system audio that was captured. The duration of the WAV will match the video exactly. Download the WAV file. Note the file size — a one-hour recording will be roughly 300 MB as WAV. This is normal and expected for uncompressed audio. At this point you have your editing master — the uncompressed WAV of your podcast session. Import this into your audio editor for the post-production steps: editing out pauses and mistakes, normalizing levels, adding intro/outro music, and applying noise reduction if needed. Do all editing in WAV format. Export to MP3 only as the final delivery step. This ensures no additional quality loss from intermediate re-encoding.

Audio Post-Production Basics for Podcasters

After extracting the WAV from your video recording, a few post-production steps dramatically improve the listener experience. Noise reduction: use Audacity's Noise Reduction (Effect > Noise Reduction) to remove consistent background noise. Find a 0.5-second sample of pure background noise (no speech), select it, click Get Noise Profile, then select all audio and apply the reduction at a moderate setting (10–12 dB). More aggressive settings can make voices sound processed. Volume normalization: go to Effect > Normalize in Audacity and set the peak amplitude to -1 dB. This ensures the loudest moment in your recording reaches near-maximum volume without clipping, and that all speakers are at a consistent level relative to each other. Edit out filler and mistakes: use Audacity's silence detection or simply listen through and select and delete sections that should be cut — long pauses, stumbled sentences, tangents, and technical interruptions. The cut-and-close edit (removing the selected section and closing the gap) creates a seamless listening experience. Intro and outro music: import your music files as additional tracks in Audacity, align them at the beginning and end of your main audio, and use the fade-in and fade-out effects to transition smoothly between music and speech. Keep intro music under 15 seconds for listener retention. Export to MP3: when editing is complete, go to File > Export > Export as MP3 in Audacity. Use 128 kbps for speech-only episodes and 192 kbps for music-heavy shows. Export as mono for interview-style shows (halves file size with no quality impact for speech). Add ID3 tags (title, episode number, description, artwork) during the export dialog.

Podcast Format Requirements and Hosting

Each podcast hosting platform has slightly different requirements, but the industry has settled on consistent standards that work everywhere. File format: MP3 is universally required. While some platforms accept AAC or WAV, MP3 is the safe, universal choice that every podcast app and platform handles correctly. Bitrate: 128 kbps mono is the standard for interview and speech podcasts. 192 kbps stereo is appropriate for music podcasts or shows with significant music components. Going higher than 192 kbps for a speech podcast wastes bandwidth with no listener benefit. Sample rate: 44100 Hz is the standard. Some podcast hosts specify this explicitly. Audacity exports at 44100 Hz by default for MP3. File naming: use descriptive, hyphenated filenames without spaces. Podcast-EP042-Guest-Name.mp3 is a better filename than podcast episode 42.mp3. Some hosting platforms use the filename as the episode URL slug, so clean filenames matter for SEO. Episode artwork: most podcast platforms require artwork in the square JPEG or PNG format, typically 3000x3000 pixels at 72 dpi. The artwork is embedded in the MP3 file's ID3 tags and displayed in podcast apps. Publishing to RSS: Buzzsprout, Anchor (Spotify for Podcasters), Podbean, Castos, Transistor, and similar hosting platforms all provide an interface for uploading your MP3, filling in episode metadata (title, description, notes), and publishing to your RSS feed. The RSS feed is what podcast apps subscribe to — Spotify, Apple Podcasts, and all other apps pull your episodes from this feed.

Frequently Asked Questions

Should I use Zoom's built-in audio recording instead of extracting from video?: Yes, if you configure Zoom to record audio separately. In Zoom's Recording settings, you can enable separate audio file recording, which produces an M4A file of the meeting audio in addition to the MP4 video. This M4A is already the audio track without the video processing overhead, and it skips the extraction step. However, Zoom's separate audio file is an AAC-compressed M4A, while the Video Audio Extractor produces an uncompressed WAV. For quality-conscious post-production, extracting from the MP4 gives you a WAV that is a better editing master.
Why does my podcast audio sound different from what I heard during the recording?: Several factors can cause this discrepancy. Zoom and other video call platforms apply real-time noise suppression, echo cancellation, and automatic gain control during the call, but the local recording may bypass some of these processing steps. You may also be hearing artifacts from internet connection quality during the call that were present in real time but less noticeable in the moment. The recording captures what was transmitted and received, which may include compression artifacts, not what your microphone originally captured. Dedicated podcast recording platforms like Riverside.fm avoid this by recording locally on each participant's device.
How large will my podcast MP3 files be?: At 128 kbps mono, a podcast episode is approximately 1 MB per minute. A 45-minute episode will be about 45 MB as a 128 kbps mono MP3. At 192 kbps stereo, the same episode would be approximately 65 MB. These are typical sizes for podcast episodes uploaded to hosting platforms. Most podcast hosts charge based on storage and bandwidth, so keeping episodes at 128 kbps mono is both the quality-appropriate and cost-efficient choice for speech podcasts.

Save Podcast Audio from Video Files

Why Podcasters Record on Video Platforms

Extracting Audio from Your Podcast Recording

Audio Post-Production Basics for Podcasters

Podcast Format Requirements and Hosting

Frequently Asked Questions

Related articles

Best Free Video Audio Extractor Online (No Upload, MP3 and WAV)

How to Extract Audio From a Zoom or Teams Recording (Free Guide)

Complete Guide to Extracting Audio From Video: Tools and Methods