On-screen text is the highest-leverage element you control on a Reel because most people watch with the sound off or scroll before audio even registers. The words you put on the frame in the first second decide whether someone keeps watching or swipes. This guide covers exactly where to place text, how to time it, and the formatting tricks that lower your skip rate and lift retention.
What is on-screen text for Reels?
On-screen text is any caption, title card, or word overlay burned into the video frame - distinct from the post caption below the video. It serves three jobs: it states the hook before sound loads, it keeps silent viewers oriented, and it adds a second information channel so the viewer's brain has more reason to stay. Roughly 70 to 85 percent of short-form views happen muted at first, so text is often the only message that lands in the critical opening.
Why text drives retention (in reach-weight order)
Every retention lever on a Reel is weighted differently by the algorithm. The canonical order of what moves reach is: skip rate first (your hook in the first 3 seconds), then shares, then likes, then saves, then reposts, then comments. On-screen text touches the top of that list directly.
- Skip rate (the hook): a clear text hook on frame one gives muted scrollers a reason to stop before they ever hear you. This is where text earns most of its value.
- Shares: a punchy, screenshot-worthy text frame - a bold claim or a list someone wants to send a friend - drives the share that the algorithm rewards next.
- Likes and saves: text that promises a payoff (a tip, a recipe, a number) gets saved for later, which signals depth.
- Comments: a single on-screen question or a deliberately incomplete statement can pull replies without you begging for them in the caption.
Where to place your hook text
Put the hook text in the upper-middle third of the frame, not dead center and not at the bottom. The bottom 15 percent gets covered by the username, caption, and UI buttons on Instagram and TikTok. The dead center competes with your subject's face. Upper-middle stays readable in the feed preview and in full view.
- 1Frame 1 (0.0s): the hook line appears instantly - no fade-in. A 0.5s fade costs you the exact moment a scroller decides.
- 2First 3 seconds: keep the hook text on screen the entire time, not just a flash. Skimmers need it to persist.
- 3Body: switch to dynamic captions (word-by-word or phrase-by-phrase) synced to your voice to hold the eye.
- 4Payoff: reintroduce a bold text card at the value moment so saves and shares trigger on something concrete.
Formatting rules that actually move the numbers
- Contrast: always use a solid or semi-opaque background bar behind text, or a heavy stroke. Plain white text over busy footage is unreadable on a phone in sunlight and people just leave.
- Size: hook text should fill most of the frame width. If you can comfortably read it from arm's length at 30 percent zoom, it's big enough.
- Length: keep the hook to 5 to 9 words. Anything longer than one line forces reading time the scroller won't give you.
- Font: one clean sans-serif font for everything. Mixing decorative fonts looks amateur and slows reading.
- Caption style: word-by-word captions consistently outperform full-sentence blocks for body retention because each new word is a micro-reason to keep watching.
- Spelling and punctuation in the hook: a typo on frame one reads as low-effort and quietly raises skip rate.
Hook text formulas you can copy
- Curiosity gap: "The reason your Reels stop at 3 seconds" - names a problem and withholds the fix.
- Specific number: "I tested 40 hooks. These 3 won." - numbers feel concrete and credible.
- Direct callout: "Stop posting Reels until you read this" - speaks to one person, breaks the scroll.
- Contrarian claim: "Posting more is killing your reach" - tension makes people stay to argue or learn.
- Mid-action open: "Here's what happened when I deleted my best post" - starts inside a story.
Common on-screen text mistakes
- Bottom-anchored hooks hidden behind UI buttons.
- Hook text that fades or slides in, wasting the first second.
- Auto-captions left in default tiny styling with no contrast.
- Burying the hook on the second clip instead of frame one.
- Text that just repeats the voiceover verbatim instead of adding a second message.
If you're not sure whether your text is helping or hurting, the fastest way to know is to look at where viewers actually drop off. Reelyze analyzes a Reel frame-by-frame and shows you the exact second the skip rate spikes - so you can see if people leave the moment your hook text appears (or before it does). That turns guesswork about text placement into a fix you can measure on the next post.
A 60-second pre-post checklist
- 1Is the hook text on screen at 0.0s with no animation?
- 2Can you read it in a feed-sized thumbnail?
- 3Is it in the upper-middle third, clear of UI?
- 4Does it promise a specific payoff in under 9 words?
- 5Do body captions move word-by-word with your voice?
- 6Is there one bold text card at the value moment to earn the share or save?