AI Short-Video Script Tutorial: Hook to CTA in 90 Seconds

Use AI to draft short-video scripts (Reels / Shorts / TikTok) that respect the platform's rhythm.

The biggest failure mode of AI-written short-video scripts isn’t bad writing — it’s that they read fine on the page and die when spoken aloud. AI drafts in essay-pace, with full sentences, polite transitions, and zero camera awareness. Short-video viewers leave at 3 seconds if the hook hasn’t landed. This workflow gives you a 6-section prompt brief, a 5-hook variant rule, and the read-aloud trim that converts AI prose into something you can actually film.

What this covers

Use AI to draft short-video scripts (Reels / Shorts / TikTok) that respect the platform’s rhythm: 3-second hook, 45-60 second middle, 10-second payoff, 5-second CTA. Plus the direction-notes layer that AI never adds and is the difference between “I have a script” and “I have a shot list”.

Who this is for

Solo creators batching weekly content, founders running their own brand short-form (less polish, more authenticity), UGC creators making spec ads, and small marketing teams who can’t afford a copywriter per script.

When to reach for it

Weekly batch script writing for a content calendar, reacting to a new format / trend, building hook variants for A/B testing, and prepping a backlog of ideas before a filming day so the camera spends zero minutes waiting for inspiration.

When this is NOT the right tool

Highly stylized creator-personality content where your voice is the product — AI can scaffold but can’t replicate. Anything legally sensitive (medical, financial claims, etc). Long-form video (YouTube 5+ minute) — that’s a different workflow entirely.

Before you start

  • Pull 3-5 reference videos in the style you want. Drop the URLs (or transcripts) into the AI session — “match this energy” with examples beats prose.
  • Decide platform first: Reels and Shorts skew 60-90s with mid-paced energy; TikTok skews 15-30s with faster cuts; each has different hook conventions.
  • Know your goal: entertain, educate, or sell. AI defaults to “educate” and will produce dry voice unless told otherwise.
  • Have your CTA ready before scripting. “Follow for part 2” / “DM the keyword” / “Link in bio” — the script should end on the CTA, not invent it last.

Step by step

  1. Brief AI with 6 components: niche / target viewer, platform (Reels 60s / TikTok 30s / Shorts 45s), goal (entertain / educate / sell), one-sentence promise the video delivers, your voice / tone (one adjective), CTA. Skip any of these and AI guesses badly.
  2. Generate 5 hook variants. Never use the first hook AI writes — it’s almost always the safest, weakest one. Specify hook patterns: “specific number”, “contrarian take”, “before-after gap”, “calling out the viewer”, “question that demands an answer”.
  3. Pick 1 hook. Generate the full 60-90 second script in the structure: hook (3s) → middle (45-60s of value or story) → payoff (10s of insight or resolution) → CTA (5s).
  4. Read it aloud and time it. AI scripts run 20% too long. Cut by reading and trimming words that don’t change meaning when removed.
  5. Add direction notes inline: “cut on word X” / “B-roll: phone hand-holding” / “text overlay on word Y” / “zoom-in on word Z” / “music drops here”. AI never adds these; you must.
  6. Build a shot list from the direction notes. This is what you actually look at while filming — the script is for the audio track.
  7. Film, edit, post. Save the winning hook + script as a template; the next script in this style takes 15 minutes instead of 60.

First-run exercise

  1. Pick one real video you want to film this week. Not a hypothetical “creator persona” exercise.
  2. Run the full 6-component brief, generate 5 hooks, pick 1, generate the script, read aloud, trim, add direction notes.
  3. Film it (even rough). Real filming surfaces script problems no read-aloud catches.
  4. Post and check the 3-second retention. That’s your AI hook’s actual report card.

Quality check

  • Does the hook land in 3 seconds? Read just the first sentence aloud — if it’s setup, not payoff, the hook is too slow.
  • Does the script SPEAK well? Sentences over 12 words die in delivery. Contractions everywhere. Strip “however” and “additionally”.
  • Does each line earn its place? Cut any line where you can remove it and the meaning still lands.
  • Are direction notes in the script? Otherwise you’re improvising on set or your editor improvises in post — both reduce quality.
  • Does the CTA make sense for the platform and audience? “Subscribe” on TikTok lands worse than “follow for part 2”.

How to reuse this workflow

  • Save the 6-component brief template per content pillar. “Educational mini-explainer”, “behind-the-scenes story”, “before-after demo” — each has its own brief.
  • Build a hook variant library by pattern. After 20 scripts, you’ll know which patterns work for your audience and can skip variants that historically lose.
  • Save winning scripts as templates with one variable (topic / number / claim) to swap. Weekly batch becomes 30 minutes for 5 scripts.
  • Re-evaluate every quarter — platforms change format conventions (TikTok length norms shift, hooks get faster).

6-component brief → 5 hook variants → pick 1 → full 60-90s script → read-aloud trim → direction notes inline → shot list extracted → film → post → track 3-second retention.

Common mistakes

  • No hook variants — the first hook AI writes is almost never the best. Always generate 5.
  • Scripts that read but don’t SPEAK well — AI defaults to essay pace. Read aloud and cut.
  • No direction notes — editor (or future you) forgets the intended timing of cuts, B-roll, and text overlays.
  • Generic CTAs — “subscribe” on TikTok, “smash that like button” anywhere. Pick a CTA that fits the platform.
  • Ignoring 3-second retention data — your hook’s report card. Use it to retire weak patterns.
  • One script length for all platforms — Reels 60s and TikTok 15s need fundamentally different pacing.

FAQ

  • What’s the right script length per platform?: Reels 60-90s with energetic mid-pace, TikTok 15-30s with fast cuts, Shorts 45-60s, LinkedIn video 30-60s, Twitter / X 30-90s. Test within those bands.
  • How do I make AI sound like me?: Feed AI 3-5 of your past best-performing scripts as reference. “Write in this voice: [examples]”. Without examples, AI defaults to generic LinkedIn-influencer voice.
  • Should I use AI voiceover?: For utility lines yes; for the main delivery, your real voice always outperforms. Audiences detect AI voice in under 2 seconds.
  • Can AI suggest hooks I haven’t thought of?: Yes, when you specify patterns. “Give me 5 contrarian-take hooks” produces different output than “5 hooks for this video”.
  • How long until I have a usable workflow?: First script: 60 minutes. After 10 scripts: 15-20 minutes per script with templates.

Tags: #Tutorial #Content creation #Short video #Script