Convert text or a script to audio with our realistic speech features. Pick from many AI voices or build a custom voice clone in just a few minutes. Perfect for podcast intros, voiceovers, videos without on-screen hosts, and more.
01
Type or paste in your text
In a new Descript project, type or paste your script into the text editor, or use the Ask AI command in the Actions menu to draft text based on your chosen parameters.
02
Choose an AI voice or clone your own
Press ‘@’ to add a speaker to your script. You can create a new speaker name and then Enable speech generation to clone your voice. Or select Browse stock AI speakers to pick from a wide range of realistic options, including various styles and tones.
03
Generate your AI speech
A loading icon appears briefly in your script while the audio is generated. When it's ready, review the generated speech, keep building your audio or video project, or export it by clicking Publish.
Generate and edit voice audio by typing
Descript lets you turn your script into audio and edit it by typing. Export the final result as MP3, WAV, or other common formats, all within one tool.
20+ realistic AI voices, emotions, and styles
Descript’s text-to-speech (TTS) features rely on advanced AI to create authentic voices. Pick from casual or formal tones to fit your project.
Create AI voice clones in minutes
Design and share personalized AI voices for ongoing work. Let AI manage voiceovers or subtle updates so you don’t need another recording session.
Descript is an AI-enhanced audio and video editing tool that makes creating podcasts and videos as straightforward as editing text.
Captions & subtitles
Add captions and subtitles to any text-to-speech project to make it accessible to everyone.
Regenerate
Create a tailored voice clone to correct misreads or re-record lines in your own voice, without another recording session.
Podcasting
Produce, release, and share your audio or video podcast without complex steps.
Studio Sound
Enhance your audio by removing filler words and other issues for a polished final result.
30 minutes / month of dubbing in 20+ languages
Can someone else replicate my voice in Descript?
No. Nobody can clone your voice without your explicit approval. Your voice data remains secure, and you can remove it at any point. We place user privacy first and follow our detailed code of ethics.
Can I use Descript's TTS generator for free?
You can make up to 5 minutes of text-to-speech audio at no charge. Once you pass that limit, you need a paid plan: for example, the $24/month plan gives you 120 minutes of TTS each month and other AI features.
Is there a difference between text to speech generated with a free subscription vs. a paid plan?
The free plan includes 5 minutes of text-to-speech audio and 5 Regenerate uses. Paid plans raise the monthly limits: for example, the $12/month plan includes 30 TTS minutes and 10 Regenerate uses, plus extra benefits.
How can I improve the quality of my text-to-speech voice clone?
Record in a quiet spot, speak clearly, and use reliable equipment. Following the recording suggestions Descript shows in the prompt also improves the result.