How to transcribe Instagram Reels, TikToks, and YouTube Shorts for free
Paste any public Reel, TikTok, or YouTube Short URL and get a timestamped transcript in under 30 seconds. No download, no account, no daily cap. Here's the flow plus the platform-specific gotchas.
You're watching a Reel, a TikTok, or a YouTube Short. Someone explains something useful, drops a hook so clean you want to take it apart, or talks through a story you'd quote somewhere. You want the text.
Your options: transcribe by hand (slow), download the video and run it through a general tool (annoying), or paste the URL and have something do it for you in seconds.
This is the third option. The same flow handles all three platforms. Below is the walkthrough, plus the platform-specific things that trip people up.
What the tool actually does
Lomero takes a public short-form video URL and returns a timestamped transcript in under 30 seconds. It works on Instagram Reels, TikTok, and YouTube Shorts. No account is required for basic transcription. No file upload, no browser extension, no daily cap.
The transcript comes back with a start time on every line, so you can cite the exact moment something was said. That matters when you're pulling quotes, writing a follow-up post, or trying to figure out where a creator lost you.
The MP4 comes down with it. If the file was what you wanted in the first place, the player has a download button. The download walkthrough covers that flow end-to-end.
How to transcribe a video in under a minute
The flow is the same on all three platforms:
- Open the video. Tap the share icon and copy the link.
- Go to lomero.app/analyze.
- Paste the URL into the input field.
- Press Analyze.
The transcript appears in the output panel in 10 to 30 seconds, depending on length. Copy it, or scroll into the structure breakdown underneath.
The platform is detected from the URL. You don't pick a mode or a tab. Paste an Instagram link and you get an Instagram pipeline, paste a TikTok link and you get the TikTok pipeline.
Instagram Reels
The Reel URL format is https://www.instagram.com/reel/[id]/. Tap the paper-airplane share icon, then "Copy link." Paste it into Lomero.
What works: public Reels, long-form Reels (the new format that allows up to 90 seconds), and Reels with music in the background where the speech still gets transcribed cleanly.
What doesn't work: Reels from private accounts, since the platform refuses to serve them to anonymous requests. Stories and Close Friends content are walled off by Instagram, so no public tool can reach them. Reels that have been deleted by the creator are obviously gone too.
A quick test before you paste: open the Reel in an incognito window. If it loads without asking you to sign in, the transcription will work. If it asks for a login, it won't.
TikTok
TikTok hands you two URL formats. Both work.
- Long format:
https://www.tiktok.com/@username/video/[id] - Short format:
https://vm.tiktok.com/[id]
The short URL gets resolved automatically, so you don't have to expand it manually.
A few TikTok-specific things to know:
Live replays transcribe normally once the creator saves them as a video. Real-time live streams aren't supported.
Slideshow posts (the photo-carousel format with a voiceover) work as long as there's spoken audio.
Duets and stitches transcribe whichever audio track is playing. For duets that mix both participants' audio, expect some overlap artifacts in the transcript.
Region-locked videos that show "not available in your region" to anonymous requests can't be reached. Same with age-gated videos, which require login on TikTok's side.
Music-only TikToks with no voiceover give you an empty transcript. The tool reads speech, not lyrics.
The TikTok retention curve drops harder in the opening seconds than Instagram's. Reading the first sentence in isolation tells you whether the hook is doing real work or whether the creator was leaning on visuals and music to carry it. The hook patterns post breaks down what holds attention in those first three seconds.
YouTube Shorts
The Shorts URL format is https://www.youtube.com/shorts/[id] or sometimes the shortened https://youtu.be/[id]. Both work.
A few YouTube-specific notes:
Shorts up to 3 minutes transcribe in 10 to 30 seconds. Older Shorts at the original 60-second cap finish faster.
Regular YouTube videos (non-Shorts) also transcribe. Up to about an hour of audio works reliably. The structural analysis is short-form-specific, so the hook score on a 45-minute podcast returns a number, but it isn't meaningful.
Live streams aren't supported in real time. Once the stream is saved as a regular video, the tool can read it. Premieres become standard videos when they end and work normally after that.
Members-only and age-restricted videos can't be reached because YouTube demands a sign-in for them. Unlisted videos are also off-limits, since they're served only to people with the direct link, not to anonymous fetchers.
YouTube's auto-captions are already pretty good, which is worth saying. If you only want raw text, YouTube's transcript panel works fine. Where Lomero earns its slot on Shorts is the analysis layer (segment labels, hook score, structural fixes), which YouTube's built-in tool doesn't do. Plus the cross-platform consistency: same flow, same output, same format on Reels and TikTok.
The SEO angle is worth a separate note. YouTube indexes the transcript, which means hooks that include the target keyword verbatim get a ranking benefit TikTok doesn't give you. Pulling the transcript of a top-ranking Short in your niche tells you whether the creator is actually saying the keywords the video ranks for. Often the answer is yes, and that's the lesson.
What a transcript is actually for
The obvious case is accessibility (captions, translations, search). The more useful ones:
Swiping hooks at scale. Read the opening line of ten Reels in your niche in the time it takes to watch one. The patterns surface faster on the page than on the screen.
Post-mortems on your own videos. When one underperforms, reading the transcript makes the structural problems obvious in a way rewatching never does. You stop sympathizing with your own performance and see the words for what they are.
Quoting creators accurately. If you're writing a newsletter or citing a Reel in a post, paraphrasing from memory is how misquotes happen. Paste the URL, quote the transcript.
Training Claude or GPT on content patterns. Batch transcripts are training data for a custom model or a well-prompted Claude. Run patterns across 50 TikToks and ask the model what they share. You'll surface structural patterns faster than studying the feed manually. The Claude MCP guide covers this workflow specifically.
Fact-checking AI summaries. YouTube's auto-summaries are fine but inexact. A transcript is the actual record.
What the tool adds beyond the transcript
Each paste also returns:
Segment labels. Each part of the video is tagged as hook, context, problem, reveal, or CTA. On a 30-second video, you see exactly where the opening ended and the payoff started without rewatching. The anatomy of a viral short post explains the segment framework.
A hook score from 0 to 100. This rates the opening seconds against patterns that hold attention. The score is calibrated per-platform, since TikTok's retention curve is different from Reels, which is different from Shorts. The hook scoring breakdown explains the rubric.
Specific fixes. Not "your hook is weak" but something concrete like "viewers tend to drop at 0:08 when the payoff is delayed past 10 seconds. Pull the reveal forward."
If you only want the transcript, you can ignore the rest. Most people run the analysis anyway because it's already there.
Is it really free?
Transcription and the basic structure breakdown are free, no watermark, no daily cap at present. Deep audits and export are part of the paid plans, currently on a waitlist while the paid product gets finalized.
Nothing stops you from transcribing one video, then another, then another. If you hit a cap, that's a bug.
What this isn't
Lomero gives you text, not burned-in subtitles. So if you want automatic captions on a video you're publishing, use CapCut or Submagic. If you want to repurpose a long video into Shorts clips, Opus Clip handles that. Lomero works on existing short-form video: pull the text, see the structure.
Frequently asked questions
Is it actually free?
Yes. Transcription and a basic structure breakdown are free with no watermark and no daily cap. Deep audits and export are part of the paid plans, currently waitlisted.
What languages does it support?
Forty-plus languages, including English, Spanish, Portuguese, French, German, Italian, Indonesian, Thai, Tagalog, Hindi, Mandarin, Japanese, Korean, and Arabic. Accuracy is highest on the major languages Whisper handles well.
Can I transcribe a private Reel, TikTok, or unlisted Short?
No. If the platform won't serve the video to an unauthenticated request, the tool can't reach it. Private accounts, friends-only posts, age-gated content, and unlisted videos are all off-limits.
Does the transcript include timestamps?
Yes. Every line has a start time so you can cite or clip specific moments.
How accurate is it?
Above 95% on quiet audio. Accuracy drops slightly with heavy background music, overlapping speakers, or thick accents. The model is Whisper-based, fine-tuned for short-form pacing.
Will Lomero save my transcripts?
Only if you save them yourself. Unsaved transcripts are purged after the session.
What if the video has no spoken words?
The transcript will be empty or very short. The tool works on speech, not music or instrumental audio.
Can I download the transcript as a file?
Export is part of the paid plans, currently waitlisted. On the free tier you can copy the text and paste it into a document.
Does this work on regular YouTube videos, not just Shorts?
Yes. Transcription handles long YouTube videos too. The hook score and segment analysis are tuned for short-form pacing, so the structural scoring is less meaningful on a 30-minute video. Use it for the transcript and ignore the score.
How does this compare to YouTube's built-in transcript panel?
The raw text is similar. YouTube's transcription is solid. Lomero adds the structural analysis on top, which the built-in tool doesn't do. If you only want text to paste into a document, YouTube's panel works. If you want to understand why the Short is working structurally, this gives you more.
Try it: paste a URL at lomero.app/analyze. For a deeper look at what comes back after the transcript, the five beats of a viral short-form video walks through the segment framework, and how hook scoring works explains the 0-to-100 number.