Add audio files to any MP4 and edit them in just a few clicks. Modify videos and include voiceovers, music, effects, or whatever sound suits you.
01
Drag and drop your MP4 video into a Descript project. After a brief wait, Descript creates a transcript that lets you edit your video simply by editing text. It’s a quick path to adjusting or trimming sections without typical timeline hassle.
02
Once your MP4 and transcript are ready in Descript, pick any music track, narration, background track, or other audio file to add to your MP4. Drag it into the exact spot in the transcript where you’d like it to play. You can also highlight transcript segments, right-click, and select Add layer to add your own audio or pick something from our built-in media library.
03
Review your project and adjust timing or volume in the timeline as needed. When satisfied, click Publish to export your final MP4 with its added sound. Feel free to export directly to YouTube or save the file locally for sharing on Instagram, TikTok, or any platform you prefer.
Descript makes it straightforward to place MP3 files in MP4 videos, and it also supports WAV, AAC, AIFF, M4A, and FLAC audio. Looking to add audio or music to other video formats? That works too: Descript accepts MOV, GIF, MPEG, and more.
Whether you want to add sound to MP4 files, convert MP3 to MP4, or refine voice quality, Descript’s AI features can clean up background noise and polish the overall audio. It’s a fitting choice for creators aiming for pro content minus hefty studio costs.

Incorporate background music or audio effects into your MP4 from our large stock media library, or create voice overs using text-to-speech and voice cloning. Just select the right section of text and click.
Descript is an AI-powered audio and video editing tool that lets you edit podcasts and videos like a doc.
Save time editing MP4 video and audio files with a unique text-based editor.
Tap into a vast stock media library of music, sound effects, videos, images, and GIFs.
Cut out filler words like “um” from voice recordings and captions to keep your MP4 dialogue smooth.
Publish your videos directly to supported platforms or share a web link for anyone to view or download your MP4.
With a 4.6-out-of-5-star rating and a bunch of distinctions on G2, Descript’s users have declared it an industry standard in the video and podcasting world.
“With Descript I'll be able to at least double my content output since editing is taking one-quarter the time it used to.”
Donna B.
“With Descript we can create videos for our YouTube channel and our LinkedIn page much faster and with high quality.”
Balázs N.
“Descript has made cleaning up and creating my educational videos into professional presentations [possible] without needing extensive technical computer skills.”
Barbara C.
“Descript makes recording and editing audio and video a breeze. Its advanced features have streamlined my workflows, saving me a lot of time usually spent editing.”
Roderick F.
“The collaborative tools streamline teamwork, allowing my team and me to work efficiently together on projects. Overall, Descript enhances productivity and simplifies the editing process.”
Aldrich M.
“Transcription-based editing makes the process much faster…All in all, a must have editor for most audiences, especially in SaaS marketing.”
Nidhin M.
Surely there’s one for you
$0
$0
per person / month
Start your journey with text-based editing
1 media hour / month
100 AI credits / month
Export 720p, watermark-free
Limited use of Underlord, our agentic video co-editor and AI tools
Limited trial of AI Speech
$24
$16
per person / month
1 person included
Elevate your projects, watermark-free
10 media hours / month
400 AI credits / month
Export 1080p, watermark-free
Access to Underlord, our AI video co-editor
AI tools including Studio Sound, Remove Filler Words, Create Clips, and more
AI Speech with custom voice clones and video regenerate
Most Popular
$35
$24
per person / month
Scale to a team of 3 (billed separately)
Unlock advanced AI-powered creativity
30 media hours / month
+5 bonus hours
800 AI credits / month
+500 bonus credits
Export 4K, watermark-free
Full access to Underlord, our AI video co-editor and 20+ more AI tools
Generate video with the latest AI models
Unlimited access to royalty-free stock media library
Access to top ups for more media hours and AI credits
Bring your MP4 into a Descript project, highlight where you’d like the MP3 to appear, then add or paste it there. If needed, use trimming or alignment tools to fit everything nicely. Once it looks and sounds good, export your project—done! You now have a new MP4 ready for use.
Record or upload a voice recording into your project alongside your MP4 video, or use text-to-speech based on a written script. Choose an AI voice or your own voice clone if desired. Drag the resulting voice track into position, and you’re set.
In Descript, dropping music (MP3, WAV, or other supported formats) into the timeline along with your MP4 is effortless. Then reposition or trim the audio track for the perfect fit, and tweak volume levels or transitions. It’s fast, flexible, and great for layering tracks just the way you want.