VSLs

The Video Sales Letter. VSL, is the modern incarnation of the long-form direct-response sales letter. 10 to 60 minutes of video selling a single offer. Structure-wise, it's the sales letter in video form. Done well, it outconverts a text-based equivalent for most high-ticket offers in 2026. Done poorly, it's a long way to lose attention.

Why VSLs work

The structure, same as a sales letter

  1. 0:00โ€“0:30 Hook. Pattern interrupt. A surprising claim, a specific scene, a direct question. Earn the next 30 seconds.
  2. 0:30โ€“2:00 Promise. State the big promise, what they'll learn or be able to do by the end of the video / by buying.
  3. 2:00โ€“5:00 Problem + agitation. The problem they're facing. Why it's worse than they thought. Consequences.
  4. 5:00โ€“7:00 Introduction. Who you are, why you're qualified. Brief. Credibility, not autobiography.
  5. 7:00โ€“15:00 Mechanism. The novel approach. Why this works when others don't. Explain the thing.
  6. 15:00โ€“22:00 Proof. Case studies. Testimonials. Data. Screenshots. Specific outcomes.
  7. 22:00โ€“28:00 Benefits / "here's what you get." The offer stack unfolded.
  8. 28:00โ€“32:00 Price + bonuses. Value anchoring, reveal, bonus stack.
  9. 32:00โ€“35:00 Guarantee. Risk reversal.
  10. 35:00โ€“37:00 Urgency. Deadline, reason, consequence of waiting.
  11. 37:00โ€“40:00 CTA + close. Specific next step. Clear button on the screen.

The times are approximations, a tight VSL might be 18 minutes; a long one 60. Same sections, different pacing.

Pacing rules

Hook formulas that work

The script

VSLs are scripted. Word-for-word. Not ad-libbed. The reason: every sentence earns the next; there's no room for tangents, dead air, or self-editing in the moment.

Writing process:

  1. Draft the full script. 8,000โ€“15,000 words for a 30โ€“45 minute VSL
  2. Read aloud. Time each section.
  3. Cut anything that doesn't directly move the viewer forward
  4. Mark in beats (visual cues, b-roll moments, on-screen text)
  5. Rehearse. 3โ€“5 read-throughs before recording
  6. Record in takes; splice together

Production quality

The bar has risen. What used to work (slide-based, faceless VSLs) now underperforms in most categories. Current norms:

"Good enough" production now means: natural light, face visible, clear audio, reasonable cuts. You don't need a film crew; you do need to not look amateur.

Delivery platforms

The CTA layer

Under the video, clear CTAs:

Some VSLs have a "lock" where the CTA only appears at a certain timestamp. This forces viewing. Effective at scale but reads as manipulative, decide based on your audience.

Metrics to track

The iteration loop

VSLs aren't one-and-done:

  1. Ship v1
  2. Analyze retention curve, major drop-offs
  3. Rewrite problem sections
  4. A/B test hook variants
  5. Test price reveals at different timestamps
  6. Iterate every 30โ€“60 days

A mature VSL is usually on iteration 5+ before it hits its best conversion.

Related: Sales letter structure ยท Long form vs short form ยท Story selling