YouTube Script Outline Template
Below are three script outline templates you can copy directly and fill in with your own content. Each one includes word count targets, time allocations, and the retention reasoning behind the structure. Pick the one that matches your video format.
Template 1: The Standard 10-Minute Video
This is the most versatile template. It works for commentary, analysis, opinion pieces, and most educational content.
``` HOOK (100 words / 40 seconds): [Specific claim or result]: "I tested/discovered/found [specific thing]" [One piece of evidence]: "Here is the data/proof/comparison" [Why this matters to YOU]: "If you are [viewer situation], this changes [specific outcome]"
SECTION 1 (250 words / 90 seconds): [Mini-hook]: Why this section matters — one sentence [Core content]: The main point with evidence or example [Transition]: "But this only works if..." / "The problem is..."
SECTION 2 (250 words / 90 seconds): [Mini-hook]: Why this section matters [Core content]: Main point with evidence [Transition]: Bridge to the re-engagement beat
RE-ENGAGEMENT (50 words / 20 seconds): [Stakes escalation]: "This next part is where it gets [interesting/dangerous/surprising]"
SECTION 3 (250 words / 90 seconds): [Mini-hook]: Why this section matters [Core content]: Main point with evidence [Transition]: Set up your strongest point
SECTION 4 (250 words / 90 seconds): [Your STRONGEST point — save it for here, not Section 1] [Core content]: Best evidence, most compelling argument [Transition to conclusion]
CONCLUSION (150 words / 50 seconds): [Key insight — not a summary, a new perspective on everything above] [One More Thing]: Unexpected bonus tip the viewer did not expect [CTA]: "If you want to [outcome], [action]" ```
**Why this structure works**: The hook is capped at 100 words because retention data shows front-loading context kills early retention. Sections are 250 words each because segments longer than 300 words show measurable retention dips. Your strongest point is in Section 4 (not Section 1) because placing it late creates a retention peak right before the conclusion — viewers who make it past the midpoint are rewarded. The re-engagement beat at the halfway mark counters the natural mid-video drop-off.
Template 2: The List/Numbered Video
Use this for "X Mistakes," "X Tips," "X Things You Need to Know" formats. The item ordering is critical.
``` HOOK (80 words / 30 seconds): [State the number]: "There are [X] [mistakes/tips/things]..." [Stakes]: "...and number [X-1] is the one most people get wrong" [Tease]: Give a one-sentence preview of the most surprising item
ITEM 1 (200 words / 70 seconds): [Strong item — build credibility early] [Specific example or data point] [Quick transition to Item 2]
ITEM 2 (200 words / 70 seconds): [Another strong item — confirm viewer is in the right place] [Evidence or story] [Transition]
ITEM 3 (150 words / 55 seconds): [Medium item — carried by the momentum of Items 1-2] [Keep it tight, do not overstay] [Transition with curiosity: "but this next one..."]
ITEM 4 (250 words / 90 seconds): [Your STRONGEST item — this is the peak moment] [Go deeper here: more evidence, more examples, more detail] [This is where viewers screenshot or take notes]
ITEM 5 (180 words / 65 seconds): [Medium-strong item to close on] [Include a "one more thing" bonus tip attached to this item]
CONCLUSION (100 words / 35 seconds): [The overarching principle connecting all items] [Not a recap — the single insight that ties everything together] [CTA] ```
**Why this order matters**: Placing your strongest item at position 4 (not last) creates a peak-end effect. Viewers remember peaks and endings. If Item 5 is medium-strong and includes a bonus, the ending feels generous rather than anticlimactic. Starting with two strong items builds trust — viewers decide whether to stay within the first two items, so those need to deliver. Item 3 can be weaker because momentum from 1-2 carries the viewer through.
Template 3: The Tutorial
Use this for how-to content, walkthroughs, and step-by-step guides. The key principle: show the result before you teach the process.
``` HOOK (50 words / 15 seconds): [Show the end result immediately — the finished product, the completed project, the final outcome] "Here is what we are building/creating/setting up today"
STEP 1 (200 words / 70 seconds): [Start instruction immediately — no context preamble, no "before we begin" section] [Clear action]: "First, [do this specific thing]" [Expected result]: "You should now see..."
MILESTONE 1 (40 words / 15 seconds): "If you have done this right, you should see [specific result]. If not, check [common issue]."
STEP 2 (200 words / 70 seconds): [Continue instruction] [Vary your teaching mode: if Step 1 was tell-then-show, make Step 2 show-then-tell]
STEP 3 (200 words / 70 seconds): [Continue — include one "why" explanation here] [Viewers need to understand the reason behind at least one step]
MILESTONE 2 (50 words / 20 seconds): [Show the intermediate result] "At this point your [project] should look like this" [Compare to the final result from the hook]
STEP 4 (200 words / 70 seconds): [Continue instruction] [Include one "common mistake" warning]: "A lot of people do [wrong thing] here — instead, do [right thing]"
STEP 5 (180 words / 65 seconds): [Final step to reach the result shown in the hook] [Confirm: "And now you have [the thing from the hook]"]
LEVEL UP (100 words / 35 seconds): [One advanced tip beyond the tutorial scope] "Once you have this working, you can also..." [CTA: link to advanced tutorial or tool] ```
**Why tutorials are structured differently**: Tutorial retention follows a different pattern than entertainment content. Viewers drop off at confusion points, not boredom points. Milestones prevent confusion drop-off by giving viewers a checkpoint — if their result matches yours, they feel confident to continue. Showing the end result first (not last) is critical because it gives viewers a mental model of where they are headed. Without it, each step feels disconnected. The "common mistake" warning in Step 4 serves as a re-engagement beat — viewers pay extra attention because they do not want to make the mistake.
How to Use These Templates
- Pick the template that matches your video format
- Copy it into your document
- Fill in each bracketed section with your specific content
- Check that each section hits the word count target (within 20%)
- [Analyze your completed outline](/upload) to catch retention issues before recording
You do not need to follow these templates rigidly. They are starting structures. Once you have made 10-15 videos with a template, you will internalize the rhythm and start modifying it to fit your style.
For more on why these structures work and the retention data behind them, see the [How to Write a YouTube Script](/guides/how-to-write-a-youtube-script) guide and the [Storytelling Structures](/guides/youtube-storytelling-structures) breakdown. You can also browse the full [Script Templates Library](/templates) for additional formats.
Related Articles
Script vs Bullets vs Freestyle
We compared retention data across 3 scripting approaches. Full scripts averaged 44% retention, bullet points 39%, and freestyle 34%. But the best approach depends on your format.
YouTube Script Length: Words per Min
The exact word count targets for every video length from 5 to 20 minutes, based on speaking pace data. Includes a calibration method to find your personal words-per-minute rate.
YouTube Scripts If You Hate Writing
Four practical methods to produce scripts without staring at a blank page — talk-first, bullet-to-script, templates, and the 3-draft speed method. Total time: 35 minutes.