WikiPlus

How to Add Subtitles to Videos Using Transcription

Subtitles and captions dramatically increase video reach, engagement, and accessibility. Studies consistently show that captioned videos get more views, better retention, and wider sharing across social platforms — partly because most social media video is watched without sound. Creating subtitles used to require specialized software and manual timing work. AI transcription makes the process faster and more accessible than ever, and this guide walks you through it from start to finish.

Understanding Subtitle and Caption File Formats

Before diving into the how-to, it helps to understand what subtitle files actually are. A subtitle or caption file is a plain text document that contains the transcript text broken into short segments, each with a start time and end time specifying when it should appear on screen. The most common formats are: SRT (SubRip Text): The most universal format. Widely supported by video players, editing software, and hosting platforms. Each caption block has a sequence number, a time range (in HH:MM:SS,milliseconds format), and the caption text. VTT (Web Video Text Tracks): Very similar to SRT but designed for web browsers and HTML5 video. Supported by YouTube, Vimeo, and most streaming platforms. Slightly different time format (HH:MM:SS.milliseconds with a period instead of comma). ASS/SSA (Advanced SubStation Alpha): A more complex format supporting styling, positioning, and effects. Used in anime fansubs and professional broadcast workflows but overkill for most creator needs. SMPTE Timed Text (TTML): Used in broadcast and streaming industry workflows. For most creators, SRT or VTT is sufficient. When you upload subtitles to YouTube, they accept SRT, VTT, SBV, and several other formats. For local video playback, SRT is the safest bet for compatibility. The WikiPlus Video Transcriptor produces plain text output which you then format into an SRT or VTT file. For workflows where the tool produces time-coded output directly, you can skip the manual timing step. For plain text output, timing must be added — which can be done with subtitle editors.

From Transcript to Timed Subtitle File

Converting a plain text transcript into a properly timed subtitle file is the core technical challenge of the caption creation workflow. Here are the main approaches. Approach 1: Automated time-coding using subtitle software. Tools like Subtitle Edit (free, Windows), Aegisub (free, cross-platform), and SubtitleBee (web-based, freemium) can take a plain text transcript and automatically synchronize it to the audio of your video using speech alignment algorithms. This is the fastest approach and works well when the transcript is accurate. In Subtitle Edit, for example: import your video file, paste your transcript text, and use the 'Auto-sync' feature to align the text to the speech. The software analyzes the audio, identifies speech segments, and maps your transcript lines to those segments. The result is an SRT file ready for upload or embedding. Approach 2: Manual timing in a subtitle editor. For highest accuracy, especially with complex audio or multiple speakers, manually timing subtitles gives full control. Open your video in Subtitle Edit or Aegisub, paste your transcript, then for each subtitle block, mark the in-point and out-point by pressing hotkeys while listening to the audio. Tedious but highly accurate. Approach 3: Cloud subtitle services. Web tools like Kapwing, VEED, and Clideo offer caption generation that combines transcription and timing in one step — you upload the video, the service transcribes it and generates a timed SRT file that you can review, edit, and download. These typically cost money beyond a free tier but offer a very streamlined workflow. Approach 4: Upload to YouTube and edit captions there. If your video is going on YouTube, uploading it and then editing the auto-generated captions within YouTube Studio is a fully integrated workflow. The auto-timing from YouTube's system is generally very good — you just need to correct the text errors.

Hardcoded vs Softcoded Subtitles

There are two fundamentally different ways to add subtitles to a video, each appropriate for different distribution scenarios. Softcoded subtitles (also called external subtitles or soft subtitles) exist as separate files that accompany the video. The viewer can turn them on or off. On YouTube and other platforms, these are the standard caption tracks uploaded alongside the video. In video players like VLC, an SRT file in the same folder as the video is loaded automatically. Advantages: viewers can disable them if they prefer; multiple language tracks can be offered simultaneously; the video file itself is not modified; corrections can be made by replacing the subtitle file. Disadvantages: the subtitles do not travel with the video if it is shared outside the platform; they do not appear in video clips or screenshots; they may not be supported by all players or embedding contexts. Hardcoded subtitles (also called burned-in subtitles or open captions) are permanently embedded into the video image. They cannot be turned off. They appear in all contexts: direct video shares, embeds, screenshots, and every player. For social media distribution — Instagram, TikTok, LinkedIn, Facebook — hardcoded subtitles are strongly recommended because many users scroll feeds with sound off and autoplay captions only show if the platform supports them. Hardcoded captions ensure every viewer can follow the content regardless of the viewing context. To burn subtitles into a video, you need video editing software (Adobe Premiere, Final Cut Pro, DaVinci Resolve, or free tools like Handbrake + SRT track or the open-source FFmpeg command line tool).

Platform-Specific Subtitle Upload Guide

Different platforms have different workflows for adding subtitles to uploaded videos. YouTube: Go to YouTube Studio, select the video, click Subtitles, then Add Language. You can either upload an SRT/VTT file directly or type/paste captions manually. YouTube also lets you edit the auto-generated captions it produces, which is often the fastest workflow — let YouTube auto-generate the timing, then correct the text errors using the inline editor. Instagram: Instagram does not accept subtitle file uploads for feed posts or Reels. The only way to display subtitles on Instagram is to burn them into the video before upload. Instagram has an auto-captions feature for Reels in some regions, but it is AI-generated with limited accuracy and language support. For professional-quality captions, burn them in. TikTok: TikTok has an auto-captions feature available in the post creation interface, but like Instagram, file uploads are not supported. Burned-in captions are recommended for quality and consistency. LinkedIn: LinkedIn supports SRT file uploads for native video posts. Upload the video, then in the post creation or video settings area, there is an option to add a caption file. LinkedIn's auto-caption feature is also available but less accurate than platform leaders like YouTube. Facebook: Facebook accepts SRT file uploads for videos on Pages and personal profiles. Go to the video in Creator Studio or Page settings, find the captions option, and upload your SRT file. Vimeo: Vimeo's paid plans support uploading multiple SRT caption files and allow viewers to toggle between languages. Upload under the video's settings, Captions section.

Frequently Asked Questions

What is the difference between subtitles and closed captions?
Subtitles are intended for viewers who can hear the audio but need text translation — typically used when the video language differs from the viewer's language. Closed captions are designed for deaf or hard-of-hearing viewers and include not just dialogue but also descriptions of non-speech sounds like [music playing] or [applause]. In practice, online platforms use the terms interchangeably. On YouTube, what is labeled 'subtitles/captions' serves both purposes — it is a single text track accessible by all viewers regardless of whether they need it for translation or accessibility.
Can I add subtitles to a video without re-encoding it?
Yes, for softcoded subtitles (external SRT/VTT files). Simply uploading an SRT file to YouTube, LinkedIn, or Vimeo adds captions without modifying the video file. For local playback, placing an SRT file with the same filename as the video in the same folder allows most media players to load it automatically. For hardcoded (burned-in) subtitles, re-encoding is required because the text is rendered into the video frames themselves.
How many characters should each subtitle line contain?
Best practice for subtitle readability is a maximum of 42 characters per line and a maximum of two lines per subtitle block. This ensures the text is readable without obscuring too much of the video frame. Each subtitle block should represent a natural pause or sentence break in speech, with a duration of approximately 1–7 seconds. Blocks shorter than 0.5 seconds are too brief to read. These are conventions, not strict rules — many professional productions use slightly different guidelines.