![B2B hypothesis - the first step in a well defined experiment](/images/2019-06-b2b-hypothesis.png) The hypothesis is what we’re testing. It must be a clear statement that is falsifiable. It should also be something that is risky. A good experiment will generate an invalid hypothesis about half the time. Our hypothesis might be about our customer, our channels, even our key resources. We can use the Business Model Canvas or other tools to help identify a hypothesis. Some good examples: - Bernie, our customer persona, has foot pain because of running for exercise three days a week. - If we add logos of our existing customers to our landing page, it will convey trust as a key value proposition to Bernie, our customer persona, and he will sign up. - Potential partners will fill out a lead sheet when they view a partner page explaining our three key value propositions. Hypotheses should be clear statements that indicate a causal relationship with a clear actor (i.e. customer). Some bad examples: - Some people will sign up if we launch this landing page. This is almost certainly true. _Somebody_ always signs up for every landing page. - We believe consumers will enjoy this feature. Firstly, we're supposed to be testing reality, not trying to test our beliefs. (Do you think users will enjoy this feature? Yes. Hypothesis validated!) Secondly, "consumers" is vague. Which consumers? Will my 80-year-old grandmother use it? Be specific. Thirdly, we can only test things that are observable. I can't test whether two magnets dislike each other. I can test if they will physically move away from one another. Make your hypothesis strong and the rest of the experiment will write itself.

![Lean Startup Template - Every experiment needs a leaning goal](/images/2019-06-leaning-goal.png)We've found that occasionally we run experiments which don't seem to have a clear purpose, or are tangential to our critical business decisions. Sometimes things just sneak in as pet projects. Sometimes we just thought they were important and looking back it seems that the share button on our blog wasn't really that critical. To counter this, we add a Learning Goal to our template. The learning goal is broader than a hypothesis. If you’re familiar with Agile, you can think of this as the epic. Typical learning goals might be: - Does Sally, our customer persona, need help getting through her homework assignments? - Is LinkedIn Advertising a good channel to reach Sally? - Will adding an option to video chat with a tutor make sally use our product more? These things are not specific enough to be hypotheses. They are not falsifiable. They are very broad and are likely to be big questions which have a number of possible answers, each of which we can test in succession. This field is useful so that we always remember to tie our experiments back to a critical component of our business such as the business model canvas. Many other templates have a similar component, some even use the Business Model Canvas as the Learning goal.

Lean Startup Template: Why Ours Works (And Why Yours Should Differ)

Q: Metric![Lean startup template - metrics for measurement](/images/2019-06-metrics-for-measurement.png)

The metric is what we will measure in order to [invalidate](success-metric-vs-fail-condition-to-the-pain/) that hypothesis. That data can be quantitative or, in some cases, qualitative. Plenty has been written about this subject elsewhere, so just make sure it's not a vanity metric such as "number of likes." (Also see, "Failure Condition" below.)

Q: Fail Condition

![Lean Startup Template - Have a fail condition for every experiment](/images/2015-07-very-disappointed-customer.png) Generally we're pretty good at setting metrics for our experiments, but we occasionally get a result and don't know what to do with it. Is 7.6% conversion rate on that landing page good or bad? If we don't know, then the outcome of this experiment isn't going to help us decide what to do next. The Fail Condition is the resulting measurement which would convince us beyond any reasonable doubt that our hypothesis is invalid. This is more useful to us than success criteria because we tend to be overly lenient in success criteria. If we consider success to be a 50% conversion…49% will do just fine. 45% isn’t too bad either. We can probably optimize 30%. 15%? Well…maybe we hit the wrong target market or maybe we should wait until the school year starts. By setting a fail condition, we are acknowledging a basic tenant of science: If we can’t agree on a fail condition, our hypothesis is not falsifiable and our test is meaningless. If our fail condition is <10% conversion rate…then a 9.999% conversion rate is still a fail and we are forced to acknowledge that.

Q: Time Box

![Lean Startup Template - A time box helps prevent feature creep](/images/2019-06-temporary-completeness-clock.png)It’s important for us to constrain the amount of time the experiment will run to keep us moving fast. If we argue that in order to gather sufficient data for a statistically significant sample size we will have to run the experiment for four weeks, then we should probably define a hypothesis we can test in a shorter time, a better metric, or start cheating or hacking our way to a larger sample size however we can. Agile practices also use time boxes, often in the form of a set sprint length. It helps constrain feature creep. If the experiment will last longer than a week, we are probably not being aggressive enough.

Q: Results

![Lean Startup Template - Results are the only outcome that matters](/images/2019-06-experiment-results.jpg)Any experiment template really ought to have a results box. Some people may be able to remember everything with crystal clear precision. We cannot. We always want to be able to look up what we did six months ago if that data becomes relevant in another context. If we’re not getting results, we’re obviously not learning.

Q: Next Step

![Lean Startup Template - If there is no next step, then the experiment was flawed](/images/2019-06-fork-in-the-road-decisions.jpg)Similarly, if the results of the experiment don’t inform what our next steps should be, something went wrong with our experiment. At the very least, the experiment should tell us, “Don’t do that again.” Preferably, the results should suggest the next experiment to perform which may be based on the same epic, may be a higher fidelity test of the same hypothesis, or it might indicate that the hypothesis is now “less risky” and it’s time to investigate another element of our business model. If no clear step is apparent, it's a warning sign that we may have been experimenting on something irrelevant or low priority.

Q: WTF,s!

Credit for this goes to [Kenny Nguyen](http://www.linkedin.com/in/kennythanhnguyen) although we’ve seen a few teams implement something similar. **WTF,s!** stands for “What the fuck, stop!” Something has gone fundamentally wrong with our experiment and it’s time to turn it off early and retro.

Q: ![Lean Startup Experiment - Evaluate a clear hypothesis with an experiment](/images/evaluative-experiment.png)

This is our opportunity to look for additional elements to minimize the experiment and find smaller ways of getting valid data. For example, if we are testing the sign-up rate on a landing page and we’re going to drive 1,000 visitors to the landing page via our newsletter, we could set a wtf,s! condition as “If the open rate on our newsletter is <50% with the first 100 sends…wtf,s!” Since our benchmark for other newsletters is 60%, did we fail to install tracking? Is the subject line utterly horrible? We might only have 2,000 people on our mailing list, and sending too many emails to the same list will burn out the list. It’s a limited resource. So let’s send out our email in small batches and make sure we haven’t done something dumb. Some dumb things I’ve personally done: - 0% conversion? Ooops…broke the analytics by deploying a small change straight to production. - Sent a templated email with a subject line for a workshop that tested really well on Twitter but is marginally offensive via email. (Sorry to everyone I told "Your MVP sucks!") - Forgot to turn adwords off.

We hate templates. So naturally, we made one.

By Tristan Kromer · Sep 16, 2014 · 11 min read

Lean startup experiment template overview Every few weeks, someone asks us if there’s a lean startup template that they should use to define their experiments…and we say no. We’re not big fans of templates in the broad sense of a ‘one size fits all’ approach to define experiments regardless of the context. We ARE huge fans of having a repeatable process. We (Kromatic) do use templates that work well for us. That doesn’t necessarily mean they’ll work well for someone else. Templates are often a means of asserting order over the innovation process in order to measure team velocity in an overly stringent way before it’s appropriate or necessary to do so. So in an effort to satisfy the requests and not give overly broad advice, here’s the template we use and why we designed it this way. If you’d like to cut to the chase, you can download the template below. It’s licensed under a Creative Commons Attribution-ShareAlike 4.0 International License so feel free to modify and hack to make your own.

Quick Answer: A lean startup template should start with just three elements — hypothesis, metric, and plan — and only add complexity when experiments fail to produce clear learnings. As product managers, we should resist one-size-fits-all templates and instead iterate on our process by adding components like fail conditions (not success criteria, since we rationalize bad results), time boxes (keep experiments under one week), and learning goals that tie back to critical business decisions. The template should provoke harder thinking, not replace it.

Download Learn S.M.A.R.T. Template

The Basics

Any template, framework, or checklist is there to provide us with a repeatable process. A lean startup template should provoke us to think harder, not remove the need for critical thought. It should remind us of important points to consider and frame the question in a way that makes sense. The basic criteria for a lean startup experiment is always:

Hypothesis
Metric
Experiment

Most templates such as Ash Maurya’s & The Validation Board have these elements. This is a great place to start when we define our own template. Just this and no more. Run a few experiments, did we learn from them? Is the team in agreement on the results? If yes…that’s all we need. If we find there is some disagreement as to how to interpret the results, such as: the experiment lasted longer than it should have or we’re not learning the things that are critical to our business, then it’s time to modify the template.

Hypothesis

B2B hypothesis - the first step in a well defined experiment The hypothesis is what we’re testing. It must be a clear statement that is falsifiable. It should also be something that is risky. A good experiment will generate an invalid hypothesis about half the time. Our hypothesis might be about our customer, our channels, even our key resources. We can use the Business Model Canvas or other tools to help identify a hypothesis. Some good examples:

Bernie, our customer persona, has foot pain because of running for exercise three days a week.
If we add logos of our existing customers to our landing page, it will convey trust as a key value proposition to Bernie, our customer persona, and he will sign up.
Potential partners will fill out a lead sheet when they view a partner page explaining our three key value propositions.

Hypotheses should be clear statements that indicate a causal relationship with a clear actor (i.e. customer). Some bad examples:

Some people will sign up if we launch this landing page.

This is almost certainly true. Somebody always signs up for every landing page.

We believe consumers will enjoy this feature.

Firstly, we’re supposed to be testing reality, not trying to test our beliefs. (Do you think users will enjoy this feature? Yes. Hypothesis validated!) Secondly, “consumers” is vague. Which consumers? Will my 80-year-old grandmother use it? Be specific. Thirdly, we can only test things that are observable. I can’t test whether two magnets dislike each other. I can test if they will physically move away from one another. Make your hypothesis strong and the rest of the experiment will write itself.

Metric

The metric is what we will measure in order to invalidate that hypothesis. That data can be quantitative or, in some cases, qualitative. Plenty has been written about this subject elsewhere, so just make sure it’s not a vanity metric such as “number of likes.” (Also see, “Failure Condition” below.)

Plan

The plan is a clear description of the steps we must take to gather that data. Sometimes this might include wireframes or interview scripts. We only put in enough information such that everyone agrees that they could execute the experiment without further questions.

Hacking the Lean Startup Template

Unfortunately, sometimes the simple template isn’t sufficient and we find ourselves in the infamous “Build-Measure-Flail” loop. Lean Startup Template - An early stop condition prevents flawed experiments from going too far and consuming resources Don’t panic! Do a retro, then modify your template so that situation doesn’t occur again.

Learning Goal

Lean Startup Template - Every experiment needs a leaning goal We’ve found that occasionally we run experiments which don’t seem to have a clear purpose, or are tangential to our critical business decisions. Sometimes things just sneak in as pet projects. Sometimes we just thought they were important and looking back it seems that the share button on our blog wasn’t really that critical. To counter this, we add a Learning Goal to our template. The learning goal is broader than a hypothesis. If you’re familiar with Agile, you can think of this as the epic. Typical learning goals might be:

Does Sally, our customer persona, need help getting through her homework assignments?
Is LinkedIn Advertising a good channel to reach Sally?
Will adding an option to video chat with a tutor make sally use our product more?

These things are not specific enough to be hypotheses. They are not falsifiable. They are very broad and are likely to be big questions which have a number of possible answers, each of which we can test in succession. This field is useful so that we always remember to tie our experiments back to a critical component of our business such as the business model canvas. Many other templates have a similar component, some even use the Business Model Canvas as the Learning goal.

Fail Condition

Lean Startup Template - Have a fail condition for every experiment Generally we’re pretty good at setting metrics for our experiments, but we occasionally get a result and don’t know what to do with it. Is 7.6% conversion rate on that landing page good or bad? If we don’t know, then the outcome of this experiment isn’t going to help us decide what to do next. The Fail Condition is the resulting measurement which would convince us beyond any reasonable doubt that our hypothesis is invalid. This is more useful to us than success criteria because we tend to be overly lenient in success criteria. If we consider success to be a 50% conversion…49% will do just fine. 45% isn’t too bad either. We can probably optimize 30%. 15%? Well…maybe we hit the wrong target market or maybe we should wait until the school year starts. By setting a fail condition, we are acknowledging a basic tenant of science: If we can’t agree on a fail condition, our hypothesis is not falsifiable and our test is meaningless. If our fail condition is <10% conversion rate…then a 9.999% conversion rate is still a fail and we are forced to acknowledge that.

Time Box

It’s important for us to constrain the amount of time the experiment will run to keep us moving fast. If we argue that in order to gather sufficient data for a statistically significant sample size we will have to run the experiment for four weeks, then we should probably define a hypothesis we can test in a shorter time, a better metric, or start cheating or hacking our way to a larger sample size however we can. Agile practices also use time boxes, often in the form of a set sprint length. It helps constrain feature creep. If the experiment will last longer than a week, we are probably not being aggressive enough.

Results

Lean Startup Template - Results are the only outcome that matters Any experiment template really ought to have a results box. Some people may be able to remember everything with crystal clear precision. We cannot. We always want to be able to look up what we did six months ago if that data becomes relevant in another context. If we’re not getting results, we’re obviously not learning.

Next Step

Lean Startup Template - If there is no next step, then the experiment was flawed Similarly, if the results of the experiment don’t inform what our next steps should be, something went wrong with our experiment. At the very least, the experiment should tell us, “Don’t do that again.” Preferably, the results should suggest the next experiment to perform which may be based on the same epic, may be a higher fidelity test of the same hypothesis, or it might indicate that the hypothesis is now “less risky” and it’s time to investigate another element of our business model. If no clear step is apparent, it’s a warning sign that we may have been experimenting on something irrelevant or low priority.

WTF,s!

Credit for this goes to Kenny Nguyen although we’ve seen a few teams implement something similar. WTF,s! stands for “What the fuck, stop!” Something has gone fundamentally wrong with our experiment and it’s time to turn it off early and retro.

This is our opportunity to look for additional elements to minimize the experiment and find smaller ways of getting valid data. For example, if we are testing the sign-up rate on a landing page and we’re going to drive 1,000 visitors to the landing page via our newsletter, we could set a wtf,s! condition as “If the open rate on our newsletter is <50% with the first 100 sends…wtf,s!” Since our benchmark for other newsletters is 60%, did we fail to install tracking? Is the subject line utterly horrible? We might only have 2,000 people on our mailing list, and sending too many emails to the same list will burn out the list. It’s a limited resource. So let’s send out our email in small batches and make sure we haven’t done something dumb. Some dumb things I’ve personally done:

0% conversion? Ooops…broke the analytics by deploying a small change straight to production.
Sent a templated email with a subject line for a workshop that tested really well on Twitter but is marginally offensive via email. (Sorry to everyone I told “Your MVP sucks!”)
Forgot to turn adwords off.

BYOT - Bring Your Own Template

If this template works for you, great. Go for it. We use it for both doing research and running experiments:

Download Learn S.M.A.R.T. Template

If another one works better, great! Better yet, it’s licensed under a Creative Commons Attribution-ShareAlike 4.0 International License so feel free to modify and hack it to make your own. But please please please do not force your hapless startups or innovation teams to use an arbitrary template just so you can measure how fast they are innovating. It makes us cry.

Frequently Asked Questions

What should a lean startup template include at minimum?

At minimum, a lean startup template needs three elements: a hypothesis (a clear, falsifiable statement of what we’re testing), a metric (what we’ll measure to validate or invalidate it), and a plan (the steps to gather that data). As product managers, we should start simple and only add complexity when we find ourselves disagreeing on results or failing to learn from experiments.

What’s the difference between a hypothesis and a learning goal in lean experiments?

A learning goal is a broad question tied to a critical business decision — like “Is LinkedIn a good channel to reach our persona?” It’s the equivalent of an epic in Agile. A hypothesis is a specific, falsifiable statement under that goal, such as “LinkedIn ads with value proposition X will generate a 5% click-through rate from our target persona.” We use learning goals to ensure every experiment ties back to something that actually matters for our business model.

Why should I use a fail condition instead of success criteria for experiments?

We tend to be overly lenient with success criteria — if we set 50% conversion as success, we’ll rationalize 30% as “close enough.” A fail condition forces us to define the measurement that would convince us beyond reasonable doubt that our hypothesis is invalid. If we can’t agree on a fail condition, our hypothesis isn’t truly falsifiable and the test is meaningless. This keeps us honest and prevents endless goalpost-moving.

What does “WTF,s!” mean in lean startup experiments?

WTF,s! stands for “What the fuck, stop!” — it’s an early stop condition we set before running an experiment to catch fundamental problems before they waste resources. For example, if we’re testing a landing page via newsletter, we might set a WTF,s! condition like “if the open rate is below 50% in the first 100 sends, stop immediately.” This helps us catch broken analytics, bad targeting, or other dumb mistakes before burning through limited resources.

How long should a lean startup experiment take to run?

We recommend time-boxing experiments to no more than one week. If we find ourselves arguing that we need four weeks for statistical significance, that’s a signal to redefine a simpler hypothesis, choose a better metric, or find ways to hack our way to a larger sample size faster. Just like sprint lengths in Agile, time boxes prevent feature creep and keep the team moving aggressively through the learning cycle.

Written by

Tristan Kromer

Tristan Kromer is an innovation coach and the founder of Kromatic. He helps enterprise companies build innovation ecosystems and works with startups and intrapreneurs worldwide to create better products for real people. Editor of The Real Startup Book (2nd edition, 2026), a free field guide to 51 experiments for finding product/market fit. Speaker and longtime advocate for lean startup and innovation accounting methods, now focused on how AI changes (and does not change) the customer-discovery work that decides whether a startup lives.

𝕏 @TriKro LinkedIn Website

Comments

Loading comments…

Lean Startup Template: Why Ours Works (And Why Yours Should Differ)

The Basics

Hypothesis

Metric

Plan

Hacking the Lean Startup Template

Learning Goal

Fail Condition

Time Box

Results

Next Step

WTF,s!

BYOT - Bring Your Own Template

Frequently Asked Questions

What should a lean startup template include at minimum?

What’s the difference between a hypothesis and a learning goal in lean experiments?

Why should I use a fail condition instead of success criteria for experiments?

What does “WTF,s!” mean in lean startup experiments?

How long should a lean startup experiment take to run?

Get innovation insights in your inbox

Comments

Leave a comment