How to write a transcript: 9 tips for beginners

Master the art of transcript writing with our comprehensive guide. Learn how to write a transcript and pick up tips and tricks for accurate and effective transcription.
November 13, 2023
Elise Dopson
In this article
Start editing audio & video
This makes the editing process so much faster. I wish I knew about Descript a year ago.
Matt D., Copywriter
Sign up

What type of content do you primarily create?

Videos
Podcasts
Social media clips
Transcriptions
Start editing audio & video
This makes the editing process so much faster. I wish I knew about Descript a year ago.
Matt D., Copywriter
Sign up

What type of content do you primarily create?

Videos
Podcasts
Social media clips
Transcriptions

Producing a high-quality podcast or video can take hours. You’ve recorded it, edited it, and published it online. But if you don’t have a transcript, your effort isn’t reaching as many people as it could. Limiting your content formats to video or audio-only means your content isn’t easily accessible to some internet users. 

That said, there’s more to writing a transcript than publishing a word-for-word copy of your recording. In this guide, we’ll cover the different use cases for a transcript, alongside a step-by-step guide on how to write accurate transcripts that are accessible and enjoyable to read.

Why write a transcript?

Before we dive into the tutorial on how to write a transcript, let’s cover why it’s important to create one for podcasts and video content:

  • Makes your content more accessible. Not everyone can listen to audio or watch a video. A transcript makes your content more accessible. People who are deaf, hard of hearing, or use a screen reader can still interact with your content if there’s an accompanying transcript. 
  • Helps your podcast or videos rank in search engines. Text-based search engines like Google and Bing can’t understand video content. Help them understand what your video or audio content is—and therefore, where it should rank—by sharing the transcript on your website. 
  • Makes content easy to repurpose. As a creator, your time is finite. Get the most mileage from your video or audio content by transcribing it. You can go back to the transcript and pull snippets to reuse on social media without having to rewatch the entire thing and pull out key points. 

Take Being Freelance, a podcast with over 300 episodes in its archive. Host Steve Folland says: “The podcasts become an absolute treasure trove of quotes on various aspects of freelancing. By having them transcribed I can easily search for and pull out quotes to use when both promoting or repurposing the podcast.”

Steve adds that these transcriptions also helped him make more money as a creator: “I also used this technique when creating my course for new freelancers. I could search on a topic and use quotes as evidence to help people learn and make the course all the richer for it.”

“I create transcripts as we had listeners who are hearing impaired who requested transcripts. I also prefer reading to listening as I find it easier to digest some content visually rather than when its absorbed audibly, and I need a low or zero sound option when in spaces with others.” —Dr Naomi Murphy, co-host of Locked Up Living

🧠 Read more: How to optimize your podcast show notes for SEO (from an SEO expert)

4 types of transcription

Now we know the benefits of producing a transcript, let’s explore the four different methods you can use to do it:

  • Verbatim transcription. This type of transcription is a word-for-word copy of any conversation. It’s usually the hardest type to read since it captures natural voices and nonverbal communication such as stutters, filler words, and false starts. 
  • Clean or edited transcription. A clean transcript has been manually edited to remove filler words, fix grammatical errors, and simplify complex sentences. The edits don’t change the words said—they just make the content easier to read. 
  • Intelligent or smart transcription. An intelligent transcription uses artificial intelligence (AI) to transcribe audio. The software can also remove filler words, awkward pauses, and distracting background noise from a transcript. 
  • Time-stamped transcription. This type of transcript ties words to a specific point in the audio or video. They’re useful because they help people scan the video transcription and rewind the recording to re-listen to the exact point something was said.

Tools you need to start writing a transcription

Before you start transcribing a video or podcast episode from scratch, get these four tools to make the process easier and faster. 

Transcription software

Manually typing out a conversation is time-consuming. Transcription software helps you speed up the process by automatically transcribing audio into text. Instead of listening to the audio recording and manually typing out each word, tools like Descript allow you to create an intelligent transcription that writes the text for you with up to 95% accuracy

Descript’s transcription tool can also:

  • Use speaker detection to separate multiple voices—ideal for interview transcriptions
  • Color-code and label different speakers in your automated transcript 
  • Transcribe audio in different 23 languages
“I've explored various transcription packages in my quest for the one that makes my job simplest. I settled on Descript as it allows me to edit my video and transcript at the same time.” —Dr Naomi Murphy, co-host of Locked Up Living

Example of a transcript auto-generated by Descript

There are also professional transcription services you can use if you want that human touch.

Noise-canceling headphones

Anyone who is transcribing an audio or video file needs complete concentration on the recording—especially if you’re doing the process manually. Noise-canceling headphones help you tune out background noise that could interfere with your transcript. The last thing you want to do is muddle the transcript with words from someone else in your workplace. (Just don’t use them for audio editing—for that, check out our list of the best podcast headphones.)

Audio and video player

If you’re not using software to transcribe video or audio, you’ll need a media player that allows you to pause, rewind, and fast-forward a recording.

Here are some other popular options to play audio and video, depending on the device you’re using:

  • Mac or iPhone: iMovie, GarageBand, Voice Memos, or QuickTime Player
  • Windows: Windows Media Player or VLC Media Player
  • Android: MediaPlayer, VLC for Android, or RealPlayer 

Foot pedal

While not essential, a foot pedal is a smart purchase if you plan to do a lot of manual transcription. A foot pedal is a piece of equipment that allows you to set the playback speed of your recording. You can also use it to create keyboard shortcuts that fast-forward, skip, or rewind parts of the conversation if you need extra clarity while transcribing. 

How to write a transcript in 5 steps

In our opinion, the most efficient way to produce a transcript is by using transcription apps—especially Descript. But if you’re determined to write a transcript manually, here’s how to do it.

  1. Prepare your tools and environment
  2. Listen and transcribe audio
  3. Format and structure
  4. Proofread and edit your transcript
  5. Add timestamps

Step 1. Prepare your tools and environment

The first step to transcribing audio is to gather your tools. At the very least, you’ll need some sort of media player, an open text editor, and quiet workspace to hear audio clearly and minimize errors in the transcript. 

Step 2. Listen and transcribe audio

Next, press play on the media player. Unless your typing speed is world class, it might help to change the dictation playback speed to 0.75x or 0.5x. Slowing down the recording will help you type alongside the speaker in real-time. 

At this point, don’t worry about fixing any typos or grammatical errors. The goal is to get down a solid first draft that you can polish later on. 

Step 3. Format and structure

Now you’ve got a first draft of the transcript, go back through the content and format it for readability. Lines upon lines of text aren’t easy to digest, so you should help readers follow the written conversation by separating out topics and speakers into their own paragraphs. Depending on your purposes, you may also want to add:

  • Bullet points
  • Pull quotes
  • Headings
  • Bold and italic text 

This summary of an episode of the Freelance Writing Coach Podcast, for example, uses headings, numbered lists, and pull quotes to emphasize key points in the transcription: 

Example of a podcast transcription turned into a summary

Step 4. Proofread and edit your transcript

Keep mistakes from distracting your readers (and creating a negative perception of you) by cleaning up your transcript.  

Editing tools like Grammarly and ProWritingAid can come in handy here. They use artificial intelligence to spot grammatical errors, spell-check words, and suggest missed words without having to scan for them manually. 

It’s always a good idea to read through the transcript in a different font, size, or on a different device. The change in view can keep your brain from anticipating what’s ahead—and missing spelling or grammar mistakes that ruin the credibility of your transcript. 

Step 5. Add timestamps 

If someone is reading the write-up to find a specific point in your audio or video recording, timestamps can make it easier for people to reference specific points in the conversation. 

Timestamps are especially useful if you’re transcribing a podcast or YouTube video, since you can use them to help people navigate your conversation. In the video podcast for this episode of The Break, for example, viewers can click the timestamp in the transcription to fast forward the video. 

Example of YouTube podcast video with timestamps on the transcript

Transcript writing: 4 bonus tips to save time

No matter what transcription process you’re using, you’ll likely need to make a few final tweaks to your transcription before sharing it with the world. Here are four bonus tips to save time and publish high-quality transcripts.

Prioritize clean, clear audio

Your job as a transcriptionist is made ten times harder if you’re working with bad sound. Even automated transcription software can accidentally “transcribe” unwanted background noise, which you’ll need to go back in and remove manually. 

Before you start writing a transcript, improve the audio quality of the original file. Features like Descript’s Studio Sound do this automatically. It uses artificial intelligence to remove unwanted background noise, leaving you with a high-quality audio file that you can use to create an error-free transcript.

Familiarize yourself with unfamiliar accents and dialects

Depending on the topic of the discussion you’re transcribing, it might be difficult to understand the speakers. This is especially true if you’re manually transcribing someone speaking in a different accent or dialect.

Transcription tools are usually better at picking up these differences, but if you’re writing a transcript manually (or editing one that’s been automatically generated), it’s a good idea to familiarize yourself with the speaker, subject matter, and common abbreviations. It’ll be easier to turn text into audio when you know what the speaker intended to say.

Remove filler words

Being Freelance’s Steve Folland uses transcription software to produce transcripts. He goes back and removes any filler words that muddle up the text in his first draft.

“I lose the filler words like ums and ahs,” Steve says. “Then some guests have their own filler phrases like ‘you know’—I can search and delete these. Then I go through making it readable. Humans don’t really speak as fluently as we like to think we do, so do I go to the trouble of tidying up the whole interview just as a magazine journalist or editor would.”

🧠Pro tip: Let Descript do this for you. The filler word removal feature removes awkward pauses, “ums,” and “uhs” from both the video and the automated transcript—so you can publish clutter-free content that keeps viewers (or readers) engaged.

Image of user using Descript’s filler word removal tool

How to write a transcript FAQs

What is a transcript?

A transcript is a word-for-word write-up of a discussion. You might create them for audio or video files to help people engage with the content without having to listen to it out loud.

What should a transcript include?

A transcript should include the words spoken by someone in an audio recording. It should also name the different speakers and attach sentences to a timestamp for easy reference.

How are transcripts made?

There are two ways to make a transcript: manually typing out each word or using transcription software. Descript, for example, uses AI to transcribe video and audio files with up to 95% accuracy.

Elise Dopson
Elise Dopson is a freelance writer, creator, and fierce Descript user. She's also the co-founder of Peak Freelance.
Start creating
The all-in-one video & podcast editor, easy as a doc.
Sign up
Start creating—for free
Sign up
Join millions of others creating with Descript

How to write a transcript: 9 tips for beginners

Producing a high-quality podcast or video can take hours. You’ve recorded it, edited it, and published it online. But if you don’t have a transcript, your effort isn’t reaching as many people as it could. Limiting your content formats to video or audio-only means your content isn’t easily accessible to some internet users. 

That said, there’s more to writing a transcript than publishing a word-for-word copy of your recording. In this guide, we’ll cover the different use cases for a transcript, alongside a step-by-step guide on how to write accurate transcripts that are accessible and enjoyable to read.

A fully powered tool that does everything you’d want an editing suite to do. So you can spend less time on the technical grind and more time creating something great.
Descript makes editing easier, faster, and more fun.

Why write a transcript?

Before we dive into the tutorial on how to write a transcript, let’s cover why it’s important to create one for podcasts and video content:

  • Makes your content more accessible. Not everyone can listen to audio or watch a video. A transcript makes your content more accessible. People who are deaf, hard of hearing, or use a screen reader can still interact with your content if there’s an accompanying transcript. 
  • Helps your podcast or videos rank in search engines. Text-based search engines like Google and Bing can’t understand video content. Help them understand what your video or audio content is—and therefore, where it should rank—by sharing the transcript on your website. 
  • Makes content easy to repurpose. As a creator, your time is finite. Get the most mileage from your video or audio content by transcribing it. You can go back to the transcript and pull snippets to reuse on social media without having to rewatch the entire thing and pull out key points. 

Take Being Freelance, a podcast with over 300 episodes in its archive. Host Steve Folland says: “The podcasts become an absolute treasure trove of quotes on various aspects of freelancing. By having them transcribed I can easily search for and pull out quotes to use when both promoting or repurposing the podcast.”

Steve adds that these transcriptions also helped him make more money as a creator: “I also used this technique when creating my course for new freelancers. I could search on a topic and use quotes as evidence to help people learn and make the course all the richer for it.”

“I create transcripts as we had listeners who are hearing impaired who requested transcripts. I also prefer reading to listening as I find it easier to digest some content visually rather than when its absorbed audibly, and I need a low or zero sound option when in spaces with others.” —Dr Naomi Murphy, co-host of Locked Up Living

🧠 Read more: How to optimize your podcast show notes for SEO (from an SEO expert)

4 types of transcription

Now we know the benefits of producing a transcript, let’s explore the four different methods you can use to do it:

  • Verbatim transcription. This type of transcription is a word-for-word copy of any conversation. It’s usually the hardest type to read since it captures natural voices and nonverbal communication such as stutters, filler words, and false starts. 
  • Clean or edited transcription. A clean transcript has been manually edited to remove filler words, fix grammatical errors, and simplify complex sentences. The edits don’t change the words said—they just make the content easier to read. 
  • Intelligent or smart transcription. An intelligent transcription uses artificial intelligence (AI) to transcribe audio. The software can also remove filler words, awkward pauses, and distracting background noise from a transcript. 
  • Time-stamped transcription. This type of transcript ties words to a specific point in the audio or video. They’re useful because they help people scan the video transcription and rewind the recording to re-listen to the exact point something was said.

Tools you need to start writing a transcription

Before you start transcribing a video or podcast episode from scratch, get these four tools to make the process easier and faster. 

Transcription software

Manually typing out a conversation is time-consuming. Transcription software helps you speed up the process by automatically transcribing audio into text. Instead of listening to the audio recording and manually typing out each word, tools like Descript allow you to create an intelligent transcription that writes the text for you with up to 95% accuracy

Descript’s transcription tool can also:

  • Use speaker detection to separate multiple voices—ideal for interview transcriptions
  • Color-code and label different speakers in your automated transcript 
  • Transcribe audio in different 23 languages
“I've explored various transcription packages in my quest for the one that makes my job simplest. I settled on Descript as it allows me to edit my video and transcript at the same time.” —Dr Naomi Murphy, co-host of Locked Up Living

Example of a transcript auto-generated by Descript

There are also professional transcription services you can use if you want that human touch.

Noise-canceling headphones

Anyone who is transcribing an audio or video file needs complete concentration on the recording—especially if you’re doing the process manually. Noise-canceling headphones help you tune out background noise that could interfere with your transcript. The last thing you want to do is muddle the transcript with words from someone else in your workplace. (Just don’t use them for audio editing—for that, check out our list of the best podcast headphones.)

Audio and video player

If you’re not using software to transcribe video or audio, you’ll need a media player that allows you to pause, rewind, and fast-forward a recording.

Here are some other popular options to play audio and video, depending on the device you’re using:

  • Mac or iPhone: iMovie, GarageBand, Voice Memos, or QuickTime Player
  • Windows: Windows Media Player or VLC Media Player
  • Android: MediaPlayer, VLC for Android, or RealPlayer 

Foot pedal

While not essential, a foot pedal is a smart purchase if you plan to do a lot of manual transcription. A foot pedal is a piece of equipment that allows you to set the playback speed of your recording. You can also use it to create keyboard shortcuts that fast-forward, skip, or rewind parts of the conversation if you need extra clarity while transcribing. 

How to write a transcript in 5 steps

In our opinion, the most efficient way to produce a transcript is by using transcription apps—especially Descript. But if you’re determined to write a transcript manually, here’s how to do it.

  1. Prepare your tools and environment
  2. Listen and transcribe audio
  3. Format and structure
  4. Proofread and edit your transcript
  5. Add timestamps

Step 1. Prepare your tools and environment

The first step to transcribing audio is to gather your tools. At the very least, you’ll need some sort of media player, an open text editor, and quiet workspace to hear audio clearly and minimize errors in the transcript. 

Step 2. Listen and transcribe audio

Next, press play on the media player. Unless your typing speed is world class, it might help to change the dictation playback speed to 0.75x or 0.5x. Slowing down the recording will help you type alongside the speaker in real-time. 

At this point, don’t worry about fixing any typos or grammatical errors. The goal is to get down a solid first draft that you can polish later on. 

Step 3. Format and structure

Now you’ve got a first draft of the transcript, go back through the content and format it for readability. Lines upon lines of text aren’t easy to digest, so you should help readers follow the written conversation by separating out topics and speakers into their own paragraphs. Depending on your purposes, you may also want to add:

  • Bullet points
  • Pull quotes
  • Headings
  • Bold and italic text 

This summary of an episode of the Freelance Writing Coach Podcast, for example, uses headings, numbered lists, and pull quotes to emphasize key points in the transcription: 

Example of a podcast transcription turned into a summary

Step 4. Proofread and edit your transcript

Keep mistakes from distracting your readers (and creating a negative perception of you) by cleaning up your transcript.  

Editing tools like Grammarly and ProWritingAid can come in handy here. They use artificial intelligence to spot grammatical errors, spell-check words, and suggest missed words without having to scan for them manually. 

It’s always a good idea to read through the transcript in a different font, size, or on a different device. The change in view can keep your brain from anticipating what’s ahead—and missing spelling or grammar mistakes that ruin the credibility of your transcript. 

Step 5. Add timestamps 

If someone is reading the write-up to find a specific point in your audio or video recording, timestamps can make it easier for people to reference specific points in the conversation. 

Timestamps are especially useful if you’re transcribing a podcast or YouTube video, since you can use them to help people navigate your conversation. In the video podcast for this episode of The Break, for example, viewers can click the timestamp in the transcription to fast forward the video. 

Example of YouTube podcast video with timestamps on the transcript

Transcript writing: 4 bonus tips to save time

No matter what transcription process you’re using, you’ll likely need to make a few final tweaks to your transcription before sharing it with the world. Here are four bonus tips to save time and publish high-quality transcripts.

Prioritize clean, clear audio

Your job as a transcriptionist is made ten times harder if you’re working with bad sound. Even automated transcription software can accidentally “transcribe” unwanted background noise, which you’ll need to go back in and remove manually. 

Before you start writing a transcript, improve the audio quality of the original file. Features like Descript’s Studio Sound do this automatically. It uses artificial intelligence to remove unwanted background noise, leaving you with a high-quality audio file that you can use to create an error-free transcript.

Familiarize yourself with unfamiliar accents and dialects

Depending on the topic of the discussion you’re transcribing, it might be difficult to understand the speakers. This is especially true if you’re manually transcribing someone speaking in a different accent or dialect.

Transcription tools are usually better at picking up these differences, but if you’re writing a transcript manually (or editing one that’s been automatically generated), it’s a good idea to familiarize yourself with the speaker, subject matter, and common abbreviations. It’ll be easier to turn text into audio when you know what the speaker intended to say.

Remove filler words

Being Freelance’s Steve Folland uses transcription software to produce transcripts. He goes back and removes any filler words that muddle up the text in his first draft.

“I lose the filler words like ums and ahs,” Steve says. “Then some guests have their own filler phrases like ‘you know’—I can search and delete these. Then I go through making it readable. Humans don’t really speak as fluently as we like to think we do, so do I go to the trouble of tidying up the whole interview just as a magazine journalist or editor would.”

🧠Pro tip: Let Descript do this for you. The filler word removal feature removes awkward pauses, “ums,” and “uhs” from both the video and the automated transcript—so you can publish clutter-free content that keeps viewers (or readers) engaged.

Image of user using Descript’s filler word removal tool

How to write a transcript FAQs

What is a transcript?

A transcript is a word-for-word write-up of a discussion. You might create them for audio or video files to help people engage with the content without having to listen to it out loud.

What should a transcript include?

A transcript should include the words spoken by someone in an audio recording. It should also name the different speakers and attach sentences to a timestamp for easy reference.

How are transcripts made?

There are two ways to make a transcript: manually typing out each word or using transcription software. Descript, for example, uses AI to transcribe video and audio files with up to 95% accuracy.

Featured articles:

AI for Creators

The best prompt for AI brainstorming, according to a study

AI tools are notorious for coming up with a lot of similar ideas. Researchers have now discovered a prompt to make those ideas more diverse.

AI for Creators

How to write AI prompts: 5 expert tips

Having trouble creating good content with AI? Discover how to write AI prompts with these examples, tips, and tricks.

Articles you might find interesting

Video

The 7 best laptops for video editing, ranked and reviewed

Having a laptop with the right specs, processing speed, graphics, and other attributes is a game changer when a large percentage of your time is spent editing video footage. In this article, we’ll cover the seven best video editing laptops and how to choose the right one for you.

Podcasting

Burned out? Here's how to take a hiatus

Everyone who’s ever committed to a creative project has had the thought at some point: Am I sure this is worth it? We asked two successful podcasters how to step away gracefully.

Podcasting

How The Podglomerate uses Descript to produce award-winning podcasts: A #MadeinDescript Q&A

Descript is just as useful for podcast production powerhouses as it is for solo podcasters— which is why we wanted to ask The Podglomerate how Descript helps them produce their many critically acclaimed, award-winning podcasts.

Video

How to Watermark a Video: The Benefits of Using a Watermark

A watermark is a type of logo, pattern, or symbol printed onto work to denote who made it and who owns it, you can it everywhere from stock photos to Youtube videos.

Product Updates

Ultra Fast Audio Synthesis with MelGAN

In this post, we introduce MelGAN, a new generative model of raw audio waveforms created by the Lyrebird team that is capable of generating natural sounding speech at a rate of more than 2,500,000 audio samples per second — more than 100x faster than real time, and 10x faster than alternative methods on similar hardware.We believe that MelGAN paves the way for taking many real-time speech applications onto smaller devices. Imagine, for example, in the not too distant future, having real-time text-to-speech translation on your mobile device without the internet. And, it’s application to music translation brings us one step closer to AI-assisted music composing.We’ve open sourced MelGAN and we encourage interested machine learning developers and researchers to check out our code base.

Related articles:

Share this article

Get started for free →