What to test

There are hundreds of things you could test. Most aren't worth the effort. The highest-leverage tests come from the top of a specific list, market, offer, headline, and decay from there. This page is the checklist, ordered by expected impact.

Tier 1. Offer-level tests (5–200% lift possible)

Price

Offer structure

Guarantee

Urgency / scarcity

Tier 2. Headline + hook (5–50% lift)

Tier 3. Landing page structure (5–30% lift)

Tier 4. Ad creative (paid channels; 10–100% CTR lift possible)

Tier 5. Email

Tier 6. Audience / targeting (paid)

Tier 7. Form / checkout

Tier 8. Small design / copy

The "impact estimate" filter

Before running a test, estimate: if this wins, how much lift would it produce?

If you're consistently running tests in the < 5% range, you're optimizing the wrong things. Go back up the tier list.

Test one variable, not three

The temptation: "let's test a new headline, new image, and new button all at once and compare to the old." Result: you learn nothing about which element actually drove the change.

Test one thing at a time. If you've already found a winning combination through isolated tests, then test the stacked combination against the old stack, but even then, understand you're testing systems, not variables.

Sequential vs, concurrent

You can only run one test per page at a time (if you run two, they contaminate each other). But you can run tests on different pages simultaneously. A mature operation runs 3–6 independent tests in parallel across different funnels.

The "we already know what works" trap

The moment a team says "we know our headline is best, no need to test," conversion stops improving. Markets shift. Audiences shift. What won in Q1 often loses in Q4. The control is always subject to being dethroned.

Multi-armed bandit vs. A/B

Advanced: multi-armed bandit algorithms dynamically shift traffic toward winners during the test. Good for long-running optimization where you value ongoing performance over clean A/B comparisons. Most teams are better off with straight A/B until they've exhausted obvious tests.

Related: Scientific testing · Controls + challengers · Measurement