Can I separate microphone audio from system audio in a screen recording?

The browser-based Video Audio Extractor outputs the mixed audio track from the video file. If your screen recording software mixed microphone and system audio into a single stream (as Windows Game Bar and most common tools do by default), the extracted WAV will contain both mixed together, and separating them afterward is not possible without source isolation technology. For recordings where you need separate tracks, configure your screen recording software to record microphone and system audio as separate tracks before recording. Tools like OBS Studio on Windows and macOS support multi-track audio recording.

Does the extractor work with Zoom cloud recordings?

Zoom cloud recordings are accessible through the Zoom web portal and can be downloaded as MP4 files to your device. Once downloaded, they work with the Video Audio Extractor exactly like local recordings. If you only have a Zoom share link and not the downloaded file, you will need to download the MP4 first through the Zoom portal before using the browser-based tool.

Why does my extracted meeting audio have an echo or reverb?

Echo in meeting recordings typically comes from participants who joined without headphones and whose microphone picked up the audio playing through their speakers. This acoustic echo creates a delayed repeat of speech that is baked into the recording. The extraction process faithfully reproduces this echo because it was in the source. Some AI-powered audio processing tools offer acoustic echo cancellation that can reduce this effect in post-production — Adobe Podcast's Enhance Speech feature, Krisp, and NVIDIA RTX Voice all offer echo removal capabilities that can be applied to the extracted WAV.

video6 min readvideo-audio-extractor

How to Extract Audio from Screen Recordings

By the WikiPlus Editorial Team

Researched with the help of AI tools, edited and reviewed for accuracy by Sergio Robles (Founder, WikiPlus).

Published May 6, 2025Last reviewed May 23, 2026

Screen recordings capture two things: the visual content on screen and the audio that accompanies it. Often the audio is the more valuable of the two — a meeting discussion, a walkthrough narration, a webinar presentation, an interview conducted over video call. Extracting that audio separately makes it accessible for transcription, archiving, podcast distribution, and reference without requiring anyone to watch a video. This guide covers how to extract audio from screen recordings made with the most common tools, and what to do with the WAV output.

Common Screen Recording Formats and Their Audio

Before extracting audio, it helps to know what audio is in your screen recording file. Windows Game Bar recordings (MP4): Game Bar records system audio and microphone audio mixed into a single AAC stereo stream. When you extract the audio from a Game Bar recording, you get both sources mixed together — there is no way to separate system sounds from microphone audio at this stage. If you need the microphone audio only, use a recording tool that captures them as separate tracks. macOS QuickTime screen recordings (MOV): QuickTime records the internal microphone (or an external mic) as a mono or stereo AAC track. System audio is not captured by default in macOS screen recordings through QuickTime — for system audio capture on macOS, you need a third-party tool like Loopback or BlackHole. Zoom local recordings (MP4): Zoom local recordings contain all participant audio mixed together in a single AAC stream. The extracted audio is the same as what you hear when you play the recording. Zoom also offers separate audio recording (an M4A file) if you configure this in your Zoom settings before the call — that M4A is already the audio track without any video. Microsoft Teams recordings (MP4): Teams meeting recordings follow the same mixed-audio model as Zoom. The extracted audio contains all participants. ShareX (MKV or MP4): ShareX can record microphone and system audio together or separately depending on configuration. If you recorded with separate audio tracks, the extraction may only yield one track. MKV format may require conversion to MP4 before the browser-based extractor can process it.

Step-by-Step: Extracting Audio from a Zoom or Teams Recording

Zoom and Teams recordings are the most common screen recording audio extraction target for business users. Here is the full workflow. Locate your recording. Zoom local recordings are saved in a folder configurable in Zoom's settings — typically Documents/Zoom on Windows and ~/Documents/Zoom on macOS. Each meeting gets its own folder with the date and meeting ID. The MP4 file is the recording you want. Open the WikiPlus Video Audio Extractor in your browser. Click to select the MP4 file or drag it onto the drop zone. The file stays local — no upload. Wait for the extraction to complete. A one-hour meeting recording typically processes in 5–15 seconds. The tool's progress indicator shows the decoding stage. Download the WAV file. The extracted audio contains the full meeting audio — all participants, system notifications, and background sounds that were audible during the recording. For transcription: upload the WAV directly to a transcription service. Otter.ai, Whisper, Descript, and Rev all accept WAV files and handle multi-speaker meeting audio well. Speaker diarization (identifying who said what) is a feature available in most of these services. For archiving: store the WAV alongside the original MP4. The WAV provides audio-only access for future reference without requiring a video player. For sharing a meeting summary: extract the audio, transcribe it, and use the transcript to generate a written summary. This workflow — video recording to audio to transcript to written summary — is far more efficient than re-watching the recording when reviewing meeting outcomes.

Improving Audio Quality from Screen Recordings

Screen recording audio often has quality issues: background noise, echo, varying volume between speakers, and audio dropouts from poor internet connections in video calls. These issues cannot be reversed by the extraction process — they exist in the source recording. But they can be mitigated after extraction with basic audio processing. Noise reduction: Audacity (free, cross-platform) offers a Noise Reduction effect that samples a section of background noise (a moment of silence between speakers) and subtracts that noise profile from the entire track. This works well for steady background sounds like HVAC hum, computer fan noise, or air conditioning. Volume normalization: multiple participants in a meeting recording often have very different volume levels. Audacity's Normalize effect can bring all levels to a consistent maximum, making quieter speakers easier to hear. For a more sophisticated approach, use dynamic compression to reduce the gap between the loudest and quietest moments. Trim silence: meeting recordings often start and end with dead time before and after the meeting proper begins. Trimming these silent sections before archiving or transcribing saves storage space and speeds up transcription. Export clean audio: after processing in Audacity, export as WAV (for archiving) or MP3 (for sharing) at your chosen quality settings. The processing steps do not change the core extraction — they are post-extraction editing that you apply to the WAV output. For future recordings: the best audio quality improvements happen before recording. A USB microphone, a headset, or even earbuds with an inline mic produce dramatically better audio than a laptop's built-in microphone in a reverberant room.

Use Cases: What to Do with Extracted Screen Recording Audio

Extracting audio from screen recordings enables a range of valuable downstream workflows. Meeting minutes and documentation: a Zoom recording's audio, transcribed by an AI tool, provides the raw material for accurate meeting minutes without requiring manual note-taking. The transcript can be edited into a summary, searched for specific points, and archived as a permanent record of decisions made. Webinar and course content distribution: a recorded webinar or online course session often has value as a podcast episode or audio course. Extract the audio, clean it up in Audacity, and publish it as an audio-only version for learners who prefer to listen rather than watch. Compliance and record-keeping: regulated industries (financial services, healthcare, legal) often require archiving of client communications. Extracting and archiving audio from recorded meetings provides a more accessible format than video for compliance review. Language and communication training: sales teams and customer service departments often review call recordings for training purposes. Extracting the audio from recorded calls and making it available as an audio file makes it easier to listen and re-listen without a video player, and enables analysis tools that work with audio files. Accessibility: extracting audio from a screen recording tutorial and providing it as an audio track alongside captions makes the content accessible to users with visual impairments who use screen readers or audio players rather than visual media.

Frequently Asked Questions

Can I separate microphone audio from system audio in a screen recording?: The browser-based Video Audio Extractor outputs the mixed audio track from the video file. If your screen recording software mixed microphone and system audio into a single stream (as Windows Game Bar and most common tools do by default), the extracted WAV will contain both mixed together, and separating them afterward is not possible without source isolation technology. For recordings where you need separate tracks, configure your screen recording software to record microphone and system audio as separate tracks before recording. Tools like OBS Studio on Windows and macOS support multi-track audio recording.
Does the extractor work with Zoom cloud recordings?: Zoom cloud recordings are accessible through the Zoom web portal and can be downloaded as MP4 files to your device. Once downloaded, they work with the Video Audio Extractor exactly like local recordings. If you only have a Zoom share link and not the downloaded file, you will need to download the MP4 first through the Zoom portal before using the browser-based tool.
Why does my extracted meeting audio have an echo or reverb?: Echo in meeting recordings typically comes from participants who joined without headphones and whose microphone picked up the audio playing through their speakers. This acoustic echo creates a delayed repeat of speech that is baked into the recording. The extraction process faithfully reproduces this echo because it was in the source. Some AI-powered audio processing tools offer acoustic echo cancellation that can reduce this effect in post-production — Adobe Podcast's Enhance Speech feature, Krisp, and NVIDIA RTX Voice all offer echo removal capabilities that can be applied to the extracted WAV.

How to Extract Audio from Screen Recordings

Common Screen Recording Formats and Their Audio

Step-by-Step: Extracting Audio from a Zoom or Teams Recording

Improving Audio Quality from Screen Recordings

Use Cases: What to Do with Extracted Screen Recording Audio

Frequently Asked Questions

Related articles

Best Free Video Audio Extractor Online (No Upload, MP3 and WAV)

How to Extract Audio From a Zoom or Teams Recording (Free Guide)

Complete Guide to Extracting Audio From Video: Tools and Methods