Innovation Metric: Why Experiment Velocity Still Matters

It's gamable. So is everything else. Here's how to use it anyway.

By Tristan Kromer·Jan 22, 2018

As corporate innovation gets more trendy, businesses are keen to put clear KPIs in place to measure the effectiveness of innovation teams and hold them accountable for making progress (innovation metric). CEOs yearn for a nice, clean number that keeps rising and tells them that the innovation program is succeeding. VPs of innovation are also desperate to prove the effectiveness of their programs so that they can request more funding. While most innovation programs have successfully argued against using traditional metrics like ROI on an early-stage product, there is a rush to replace those KPIs with more actionable/innovation metric such as experiment velocity. However, we have to use innovation metric in the right context for them to be useful. Our friend Dan Toma, author of The Corporate Startup, recently proposed that experiment velocity, one of our favorite measurements of innovation metric, was a bad metric and prone to gaming. Dan is correct. Experiment velocity is a vanity metric and, therefore, gamable. But how do we measure progress without it?

Experiment Velocity

Put simply, it’s how much stuff the team is doing while trying to learn something about their business model. It is not a measure of how much the team actually accomplishes or whether the business model is viable. I often use the innovation metric with early-stage innovation teams to understand if they are ready to focus on other metrics. (We’ll come back to how to use it in detail in another post.)

Gaming the System

Dice: Gaming the System Dan’s main argument is that experiment velocity can be gamed by teams trying to make themselves look good. And that is correct.

Quick Answer: Experiment velocity is a gamable vanity metric, but that doesn’t mean we should abandon it — every alternative metric is equally gamable. As product managers, we should use experiment velocity as a threshold, not a scorecard: measure but don’t count. A simple traffic light system (green = at least one experiment and learning per week, yellow = experiments but no learnings, red = no experiments at all) helps coaches identify struggling teams without incentivizing anyone to inflate their numbers.

“…purposefully or not. Product teams knowing that they are having their ‘experiment velocity’ measured might claim that every tiny thing they do is an experiment. Or, giving them the benefit of the doubt, they don’t know how to design the right experiments so although their velocity is high, their impact is low.” - Dan Toma, Experiment Velocity vs. Learning Velocity

Teams can easily run dozens of small experiments with little or no outcome in terms of knowledge generated. If team Alpha reports six experiments run in the last week and team Beta reports only one, that says nothing about which team is doing better. All experiments are not created equal. For example, we can run 20 comprehension tests in a week iterating on a value proposition. But one good concierge test might generate one critical piece of information about which features to build.

Learning Velocity

Dan suggests that focussing on learning velocity can be more productive. Truthfully, that innovation metric is just as gamable. The number of learnings itself is not important. It’s whether those learnings validate or invalidate a critical element of the business model. Some teams come back from one experiment with a list of 20 learnings, many of which are utterly useless. Sometimes learning that the customer enjoys the color blue used in our logo is useful, sometimes it’s irrelevant. (Usually it’s irrelevant.) How can we compare team Charlie that learns 20 minor ways to tweak value propositions that don’t work vs. team Delta that simply learns their value proposition doesn’t work at all? It’s far too easy to come back from a customer discovery interview and label all of the different notes as individual learnings and report back a raft of 15 brand new nuggets of knowledge. When it comes down to it, if we’re incentivized based on any metric, we will game it. If our holiday bonus depends on it, “Customers like free coffee” is valuable learning.

Combining Gamed Metrics

Dice: Combining Gamed Innovation Metric Beyond either of these metrics, Dan then proposed using experiment/learning ratio as a diagnostic tool. He has some nice ideas about this innovation metric that are worth reading but are a bit beside the point. If experiment velocity and learning velocity are both gamable, then the experiment learning ratio is too. If even one of those metrics are gamable, then of course the combined metric will be gamable. We can’t complain about one metric being fallible and then combine two vanity metrics to somehow create an actionable metric. We need to fix the problem.

Experiment Points

Alternatively, we could measure progress by applying the story point system of agile to experiments to give them proportional weight. A comprehension test might be just 1 point, while a value proposition experiment (or learning) can be valued at 3 points. In this system, the points would be assigned by team members using planning poker, T-shirt sizing, or some other collaborative estimation system. scrum poker cards Agile Planning Cards for Estimation We don’t have personal experience with this, but we see some flaws in this method as well. The only reason to spend time doing this is if we’ve become obsessed with measuring teams against one another. It’s a way of saying that our learning is great and that other team is learning useless things. A value proposition test on an incremental innovation is very different than the same test run on a radical innovation. How should we compare the two? Perhaps it’s possible to generate an elaborate system of evaluating risk and comparing learnings, but what is the benefit? The purpose of any of these proposed innovation metrics is not to set someone’s bonus for the year and certainly not to compare one team to another. The purpose is to provide a warning sign when teams are going off the rails and get them help.

Necessary But Insufficient

We need to focus on learning—not running experiments in order to make progress. In order to learn, we run experiments and research. Running experiments does not guarantee knowledge, but it is impossible to generate knowledge without them. Crystal balls and product manager “remote viewing” clairvoyants do not count. Running experiments is a necessary but insufficient condition for successful innovation teams. Dan points out that we can run a lot of experiments and not learn anything. But we can’t learn anything without running at least a few. So if we’re trying to understand if the team is functioning well, we don’t need to know how many experiments they are running…so long as that number is higher than one. That’s why we try and run at least one experiment per week. Some growth teams doing rapid A/B testing can run a dozen experiments in a week; some can only run a couple. But all teams must run at least one. Note: If you’re looking for some guidance on how to design lean experiments, download our Learn SMART Experiment Template.

Measure, But Don’t Count

We need one, so I'd say we have enough For both experiment and learning velocity, the better approach is to measure but not count. Running three experiments or thirty doesn’t prove that everything is going well, but zero experiments indicates something is wrong. Same goes with learning velocity. A team reporting 20 learnings doesn’t indicate everything is going right, but reporting zero learnings indicates something is wrong. These metrics can be used as a threshold to tell us when things are going well or going poorly. A simple traffic light system can be helpful:

Green - At least one experiment and learning each week.
Yellow - One experiment, but no learnings. The team is doing something, but it’s not generating knowledge. Something needs to be fixed.
Red - Problems. The team isn’t even getting itself out the door.

(Note: This is not the complete system we use at Kromatic when coaching teams, but it’s a good first step.) traffic light Moreover, if we’re not forcing teams to compete on the total number of experiments run, we don’t have to worry as much about this metric being gamed. We just need to make sure that this metric is being used to help teams and not as a means to set bonuses or punish teams.

Traffic Light Triage

Measuring progress can be tricky. Red Light: When dealing with a small number of teams, we don’t need a fancy accounting system. We just need to focus first on the teams that aren’t getting out of the building. Those teams need a swift kick in the ass. And we can only help the teams that actually want a kick in the ass. Yellow Light: Next, we deal with the teams that are putting in the effort, but aren’t generating knowledge. We can help those teams by showing them the right experiment to run and helping them run the experiment right. Then and only then can we start making real progress on the business model. This is something that Dan, Esther, and Tendayi call Validation Velocity in The Corporate Startup. This is not only generating knowledge, but generating knowledge about the right things.

…The real question is when it comes to innovation how do you measure progress. This is where Validation Velocity comes in. This is based on businesses having an innovation framework that has key components (e.g. customer need, solution, business model etc). Ultimately, every innovation project has to answer the same broad questions: Is there a real customer need? Do we have the right solution? Have we found the right price point? Do we have the right channels? Does our growth engine work? If you organize these questions in a hierarchy of importance then all that matters is whether teams are running experiments that provide definitive answers to these questions.” - Tendayi Viki

Coaching Metrics

The metrics we’re discussing here shouldn’t be mistaken for progress on a business model. They are metrics for coaches. coach They are useful as a mental framework to focus our efforts on helping teams or in a high-level dashboard to see overall progress. But ultimately they don’t matter to the individual teams, and most innovation programs don’t have enough teams to worry about creating a dashboard. Just go talk to the teams! So if you’re an innovation team, ignore this stuff. Just go learn about your business. If you’re a coach or innovation portfolio manager, use these concepts to make sure the team is making progress, and remember the rudder fallacy.

Lessons Learned

Experiment velocity is a vanity metric and is gamable.
So are most other metrics.
Use a threshold on such metrics and measure, but don’t count.
Don’t mistake coaching metrics for business progress.

Frequently Asked Questions

What is experiment velocity as an innovation metric?

Experiment velocity measures how many experiments a team runs in a given period — essentially, how much stuff the team is doing while trying to learn about their business model. It is not a measure of what the team actually accomplishes or whether the business model is viable. As product managers, we should treat it as a threshold indicator rather than a scorecard.

Why is experiment velocity considered a vanity metric?

Experiment velocity is gamable because teams can run dozens of small, low-impact experiments to inflate their numbers without generating meaningful knowledge. A team reporting six experiments per week isn’t necessarily outperforming one that reports only one — all experiments are not created equal. One well-designed concierge test can generate more critical insight than 20 quick comprehension tests.

Is learning velocity a better innovation metric than experiment velocity?

Not necessarily. Learning velocity is just as gamable as experiment velocity. Teams can easily label every minor observation from a customer interview as a separate “learning” and report back inflated numbers. As the article points out, if we’re incentivized based on any metric, we will game it — so combining two gamable metrics doesn’t magically create an actionable one.

How should innovation teams measure progress without gaming metrics?

The recommended approach is to measure but don’t count. Use a simple traffic light system: green means at least one experiment and one learning per week, yellow means experiments are happening but no learnings are generated, and red means the team isn’t running experiments at all. This threshold-based approach identifies teams that need help without incentivizing teams to inflate their numbers.

What is the difference between coaching metrics and business model progress?

Coaching metrics like experiment velocity and learning velocity help coaches and portfolio managers spot struggling teams — they’re diagnostic tools, not measures of business viability. True business model progress comes from what Dan Toma and Tendayi Viki call “validation velocity,” which tracks whether teams are generating definitive answers to critical questions about customer need, solution fit, pricing, channels, and growth engines.

Written by

Tristan Kromer

Tristan Kromer is an innovation coach and the founder of Kromatic. He helps enterprise companies build innovation ecosystems and works with startups and intrapreneurs worldwide to create better products for real people. Editor of The Real Startup Book (2nd edition, 2026), a free field guide to 51 experiments for finding product/market fit. Speaker and longtime advocate for lean startup and innovation accounting methods, now focused on how AI changes (and does not change) the customer-discovery work that decides whether a startup lives.

𝕏 @TriKro LinkedIn Website

Comments

Loading comments…