Audio Extractor: Separate Audio from Video

Extract audio from video or audio in Descript

01
Upload your audio or video file
Drag the file into your Descript project, or upload it from Media > Files. This quick step sets you up for using our audio extractor from audio or video content.
02
Extract the audio you want to isolate
With video files, right-click the clip in your timeline and select detach audio to split it into a new track. If you're working with speaker-only audio, open audio effects and switch on Studio Sound to reduce noise and sharpen voices. It's a reliable way to extract sound effects from audio or remove background clutter.
03
Download your extracted audio file
Choose Publish, then export your new audio in M4A, WAV, or MP3 format. You can keep editing in your Descript timeline or download the file to use however you like.

Rip audio from other media formats for free

Detach audio from video with ease
Separate audio from a saved YouTube video, TikTok post, or Instagram Reel, then manage that audio track independently in the timeline.
Extract voices from audio files
Studio Sound uses AI to draw out speaker voices while cutting background noise. It creates separate voice layers and can improve clarity beyond the original recording.
Convert extracted audio to multiple formats
Extract audio, then convert it to MP4, WAV, MP3, or other outputs like audiogram videos, subtitles, and transcripts.

More than an audio extractor

Descript is a flexible AI-powered editing platform for audio and video, letting you create or refine content simply by working with text.

Studio Sound
Enhance rough audio with AI-driven voice regeneration for a crisper result.
AI voice cloning
Turn typed text into speech that mirrors your own voice, perfect for consistent narration or announcements.
Podcasting
Descript provides all you need to record, edit, and host podcasts, including interview recording, collaborative editing, and quick-share clips.
Remove filler words
Descript’s AI filler-word detection pinpoints every “um” and “like” and scrubs them from both your audio and transcript in one go.

Don’t just take our word for it

With a 4.6-out-of-5-star rating and a bunch of distinctions on G2, Descript’s users have declared it an industry standard in the video and podcasting world.

2025

Best Software

Video Editing

AI Video Generators

Screen and Video Capture

Text to Speech

“With Descript I'll be able to at least double my content output since editing is taking one-quarter the time it used to.”

Donna B. |

“With Descript we can create videos for our YouTube channel and our LinkedIn page much faster and with high quality.”

Balázs N.

“Descript has made cleaning up and creating my educational videos into professional presentations [possible] without needing extensive technical computer skills.”

Barbara C.

“Descript makes recording and editing audio and video a breeze. It's advanced features have streamlined my workflows, saving me a lot of time usually spent editing.”

Roderick F.

“The collaborative tools streamline teamwork, allowing my team and me to work efficiently together on projects. Overall, Descript enhances productivity and simplifies the editing process.”

Aldrich M.

“Transcription-based editing makes the process much faster…All in all, a must have editor for most audiences, especially in SaaS marketing.”

Nidhin M.

Surely there’s one for you

Free

no credit card required

Start your journey with text-based editing

Get started

1 transcription hour / month

Export 720p, with watermarks

Limited trial of Basic AI Actions

Limited trial of AI Speech

Hobbyist

$16

per person / month, billed annually

Elevate your projects, watermark-free

Get started

10 transcription hours / month

Export 1080p, watermark-free

20 uses / month of Basic AI Actions suite including Filler word removal, Studio sound, Draft show notes, Create clips, and more

30 minutes / month of AI speech with stock AI speakers and custom voice clones

5 minutes / month of avatars

Questions? We have answers

What is an audio extractor?
An Audio Extractor isolates an audio track from any audio or video file. Typically, you upload your file to a platform like Descript, which processes and separates the sound. Once it's done, you can convert it to another format, download it, or do extra editing.
Can Descript help me rip just the speaker audio from an audio or video file?
Yes. Descript includes Studio Sound and other audio effects that peel away unwanted noise so you can focus purely on speaker voices.
Can I extract audio from a YouTube video with Descript?
Yes. Once you download the YouTube video to your device, place it into Descript, right-click on the video layer, and detach the audio to get a dedicated track for editing.
What file types can I convert my extracted audio to with Descript?
You can export your extracted audio as M4A, WAV, or MP3, as well as SRT or VTT subtitles, MP4 audiograms, and text-based formats like Markdown, rich text, or HTML.
Does Descript preserve the quality of the audio after extracting?
Yes. Descript keeps the original audio fidelity intact while offering AI enhancements for clarity and polish.