Subtitles vs captions: what's the difference?
Subtitles and captions are often used interchangeably, but they're not the same. Here's the actual difference — and which one you need for short-form video.
The short answer
The traditional distinction: subtitles transcribe or translate the spoken dialogue, assuming the viewer can hear the audio. Captions include the dialogue plus non-speech information — sound effects, music cues, speaker labels — for viewers who can't hear the audio at all. Captions are an accessibility feature; subtitles are primarily a language or translation aid.
Where the terms blur
In everyday short-form use, people use 'captions' and 'subtitles' interchangeably to mean 'the words on screen.' On TikTok, Reels, and Shorts, the on-screen text creators add is usually a transcription of what's said — technically subtitles, but everyone calls them captions. So in practice the distinction matters more for accessibility and broadcast than for a typical creator clip.
Open vs closed (a separate axis)
Don't confuse subtitles-vs-captions with open-vs-closed. 'Open' (burned-in) means the text is permanent and always shows; 'closed' means it's a toggleable track. You can have closed captions or burned-in subtitles — the two distinctions are independent.
Which do you need?
For short-form clips, you want burned-in on-screen text that transcribes the speech and is styled to be readable on mute — whatever you call it. For accessibility compliance or broadcast, use true captions that include sound cues and speaker labels. For reaching a foreign-language audience, use translated subtitles.
FAQ
What's the difference between subtitles and captions?
Subtitles transcribe or translate dialogue and assume you can hear the audio; captions also include non-speech sounds and speaker labels for viewers who can't hear. Captions are an accessibility feature; subtitles are mainly a translation aid.
Are subtitles and captions the same thing?
Technically no, but in everyday short-form use the terms are used interchangeably for the on-screen text. The distinction matters most for accessibility and broadcast.
Do I need captions or subtitles for TikTok and Reels?
You want burned-in on-screen text transcribing the speech, styled to read on mute. Most creators call these captions regardless of the technical definition.
What's the difference between open and closed captions?
Open (burned-in) captions are permanent and always show; closed captions are a toggleable track. That's a separate distinction from subtitles vs captions.