The Evidence-Based Management Guide

Authors Scrum.org

Plaintext

Scrum.org

The Evidence-Based Management
Guide
Measuring Value to Enable Improvement and Agility

September 2020
Purpose of the EBM Guide
Evidence-Based Management (EBM) is an empirical approach that helps organizations to
continuously improve customer outcomes, organizational capabilities, and business results
under conditions of uncertainty. It provides a framework for organizations to improve their
ability to deliver value in an uncertain world, seeking a path toward strategic goals. Using
intentional experimentation and evidence (measures), EBM enables organizations to
systematically improve their performance over time and refine their goals based on better
information.

By measuring current conditions, setting performance goals, forming small experiments for
improvement that can be run quickly, measuring the effect of the experiment, and inspecting
and adapting goals and next steps, EBM helps organizations to take into account the best
available evidence to help them make decisions on ways to improve.

This Guide defines EBM, its concepts, and its application.

This publication is offered for license under the Attribution Share-Alike license of Creative Commons,
accessible at http://creativecommons.org/licenses/by-sa/4.0/legalcode and also described in
summary form at http://creativecommons.org/licenses/by-sa/4.0/. By utilizing this EBM Guide, you
acknowledge and agree that you have read and agree to be bound by the terms of the Attribution
Share-Alike license of Creative Commons.

2
EBM Helps Organizations Seek toward Their Goals
in a Complex World Using Empiricism
Complex problems defy easy solutions, but instead require organizations seek toward their
goals in a series of small steps, inspecting the results of each step, and adapting their next
actions based on feedback (see Figure 1).1

This model has several key elements:
A Strategic Goal, which is something important that the organization would like to achieve.
This goal is so big and far away, with many uncertainties along the journey that the
organization must use empiricism. Because the Strategic Goal is aspirational and the
path to it is uncertain, the organization needs a series of practical targets, like
Intermediate Goals, achievements of which will indicate that the organization is on the path
to its Strategic Goal. The path to the Intermediate Goal is often still somewhat uncertain,
but not completely unknown.
Immediate Tactical Goals, critical near-term objectives toward which a team or group of
teams will work help toward Intermediate Goals.
A Starting State, which is where the organization is relative to the Strategic Goal when it
starts its journey.
A Current State, which is where the organization is relative to the Strategic Goal at the
present time.

In order to progress toward the Strategic Goal, organizations run experiments which involve
forming hypotheses that are intended to advance the organization toward their current
Intermediate Goal. As they run these experiments and gather results, they use the evidence
they obtain to evaluate their goals and determine their next steps to advance toward these
goals.

1
For more on complexity, see the Scrum Theory section of the Scrum Guide at
https://www.scrumguides.org/scrum-guide.html

3
Figure 1: Reaching strategic goals requires experimenting, inspecting, and adapting2

Setting Goals

When setting goals, organizations must define specific measures that will indicate that the goal
is achieved. Goals, measures, and experiments should be made transparent in order to
encourage organizational alignment.

Consider the case of the response to an infectious disease:
● The Strategic Goal is to eradicate the effects of the disease, as measured by the
number of people who fall ill and suffer significant illness. Measurement is
important; in this example, the goal is focused on the effects of the disease, and
not on the means for achieving the desired impact. For example, the goal is not
to vaccinate a certain percentage of the population against the disease; that may
be an activity necessary to achieving the Strategic Goal, but it is not the Strategic
Goal.

2
Figure adapted from Mike Rother’s Improvement Kata (http://www-
personal.umich.edu/~mrother/The_Improvement_Kata.html)

4
● An example of an Intermediate Goal is the successful completion of a trial of a
vaccine against the disease. This is still ambitious and measurable, and
achieving it may require the completion of many different activities, but it is seen
as a necessary step on the path to achieving the Strategic Goal.
● Examples of immediate tactical goals may include activities like isolating
symptoms, evaluating a therapy, sequencing the DNA of a virus or bacterium,
and so forth.

The Strategic Goal is usually focused on achieving a highly desirable but unrealized outcome
for a specific group of people that results in improved happiness, safety, security, or well-being
of the recipients of some product or service. In EBM, we refer to this as Unrealized Value, which
is the satisfaction gap between a beneficiary’s desired outcome and their current experience.
Unrealized Value is described in greater detail below, in the Key Value Areas section.

Understanding What Is Valuable
Organizations measure many different kinds of things. Broadly speaking, measures fall into
three categories:
● Activities. These are things that people in the organization do, such as perform work,
go to meetings, have discussions, write code, create reports, attend conferences, and so
forth.
● Outputs. These are things that the organization produces, such as product releases
(including features), reports, defect reports, product reviews, and so on.
● Outcomes. These are desirable things that a customer or user of a product
experiences. They represent some new or improved capability that the customer or user
was not able to achieve before. Examples include being able to travel to a destination
faster than before, or being able to earn or save more money than before. Outcomes
can also be negative, as in the case where the value a customer or user experiences
declines from previous experiences, for example when a service they previously relied
upon is no longer available.

The problem most organizations face, which is often reflected in the things they measure, is that
measuring activities and outputs is easy, while measuring outcomes is difficult. Organizations
may gather a lot of data with insufficient information about their ability to deliver value. However,
delivering valuable outcomes to customers is essential if organizations are to reach their goals.
For example, working more hours (activities) and delivering more features (outputs) does not
necessarily lead to improved customer experiences (outcomes).

EBM Focuses on Four Key Value Areas
In addition to using hypotheses and experiments to move toward goals, EBM provides a set of
perspectives on value and the organization’s ability to deliver value. These perspectives are
called Key Value Areas (KVAs). These areas examine the goals of the organization (Unrealized
Value), the current state of the organization relative to those goals (Current Value), the

5
responsiveness of the organization in delivering value (Time-to-Market), and the effectiveness
of the organization in delivering value (Ability-to-Innovate). Focusing on these four dimensions
enables organizations to better understand where they are and where they need to go (see
Figure 2).

Figure 2: EBM focuses on four Key Value Areas (KVAs).

Each KVA focuses on a different aspect of either value, or the ability of the organization to
deliver value. Delivering business value (Current Value) is important, but organizations must
also show that they can respond to change (Time-to-Market) while being able to sustain
innovation over time (Ability-to-Innovate). And they must be able to continually make progress
toward their long-term goals (Unrealized Value) or they risk succumbing to stagnation and
complacency.

Example Key Value Measures (KVMs) for each KVA are described in the Appendix.

Current Value (CV)

The value that the product delivers today
The purpose of looking at CV is to understand the value that an organization delivers to
customers and stakeholders at the present time; it considers only what exists right now, not the
value that might exist in the future. Questions that organizations need to continually re-evaluate
for current value are:

1. How happy are users and customers today? Is their happiness improving or declining?

6
2. How happy are your employees today? Is their happiness improving or declining?
3. How happy are your investors and other stakeholders today? Is their happiness
improving or declining?

Considering CV helps an organization understand the value that their customers or users
experience today.

Example: While profit, one way to measure investor happiness, will tell you the
economic impact of the value that you deliver, knowing whether customers are happy
with their purchase will tell you more about where you may need to improve to keep
those customers. If your customers have few alternatives to your product, you may have
high profit even though customer satisfaction is low. Considering CV from several
perspectives will give you a better understanding of your challenges and opportunities.
Customer happiness and investor happiness also do not tell the whole story about your
ability to deliver value. Considering employee attitudes recognizes that employees are
ultimately the producers of value. Engaged employees that know how to maintain,
sustain and enhance the product are one of the most significant assets of an
organization, and happy employees are more engaged and productive.

Unrealized Value (UV)

The potential future value that could be realized if the organization met the
needs of all potential customers or users
Looking at Unrealized Value helps an organization to maximize the value that it realizes from a
product or service over time. When customers, users, or clients experience a gap between their
current experience and the experience that they would like to have, the difference between the
two represents an opportunity; this opportunity is measured by Unrealized Value.

Questions that organizations need to continually re-evaluate for UV are:

1. Can any additional value be created by our organization in this market or other markets?
2. Is it worth the effort and risk to pursue these untapped opportunities?
3. Should further investments be made to capture additional Unrealized Value?

The consideration of both CV and UV provides organizations with a way to balance present and
possible future benefits. Strategic Goals are formed from some satisfaction gap and an
opportunity for an organization to decrease UV by increasing CV.

Example: A product may have low CV, because it is an early version being used to test
the market, but very high UV, indicating that there is great market potential. Investing in
the product to try to boost CV is probably warranted, given the potential returns, even
though the product is not currently producing high CV.

7
Conversely, a product with very high CV, large market share, no near competitors, and
very satisfied customers may not warrant much new investment; this is the classic cash
cow product that is very profitable but nearing the end of its product investment cycle
with low UV.

Time-to-Market (T2M)

The organization’s ability to quickly deliver new capabilities, services, or
products
The reason for looking at T2M is to minimize the amount of time it takes for the organization to
deliver value. Without actively managing T2M, the ability to sustainably deliver value in the
future is unknown. Questions that organizations need to continually re-evaluate for T2M are:

1. How fast can the organization learn from new experiments and information?
2. How fast can you adapt based on the information?
3. How fast can you test new ideas with customers?

Improving T2M helps improve the frequency at which an organization can potentially change
CV.

Example: Reducing the number of features in a product release can dramatically
improve T2M; the smallest release possible is one that delivers at least some
incremental improvement in value to some subset of the customers/users of the product.
Many organizations also focus on removing non value-added activities from the product
development and delivery process to improve their T2M.

Ability to Innovate (A2I)

The effectiveness of an organization to deliver new capabilities that might
better meet customer needs
The goal of looking at the A2I is to maximize the organization’s ability to deliver new capabilities
and innovative solutions. Organizations should continually re-evaluate their A2I by asking:

1. What prevents the organization from delivering new value?
2. What prevents customers or users from benefiting from that innovation?

Improving A2I helps an organization become more effective in ensuring that the work that it
does improves the value that its products or services deliver to customers or users.

Example: A variety of things can impede an organization from being able to deliver new
capabilities and value: spending too much time remedying poor product quality, needing
to maintain multiple variations of a product due to lack of operational excellence, lack of

8
decentralized decision-making, inability to hire and inspire talented, passionate team-
members, and so on.
As low-value features and systemic impediments accumulate, more budget and time is
consumed maintaining the product or overcoming impediments, reducing its available
capacity to innovate. In addition, anything that prevents users or customers from
benefiting from innovation, such as hard to assemble/install products or new versions of
products, will also reduce A2I.

Progress toward Goals in A Series of Small Steps
The first step in the journey toward a Strategic Goal is understanding your Current State. If your
focus is to achieve a Strategic Goal related to Unrealized Value (UV), as is typically the case,
then measuring the Current Value (CV) your product or service delivers is where you should
start (of course, if your product or service is new then its CV will be zero). To understand where
you need to improve, you may also need to understand your effectiveness (A2I), and your
responsiveness (T2M).

The Experiment Loop (shown in Figure 1) helps organizations move from their Current State
toward their Next Target Goal, and ultimately their Strategic Goal, by taking small, measured
steps, called experiments, using explicit hypotheses.3 This loop consists of:
● Forming a hypothesis for improvement. Based on experience, form an idea of
something you think will help you move toward your Next Target Goal, and decide how
you will know whether this experiment succeeded based on measurement.
● Running your experiments. Make the change you think will help you to improve and
gather data to support or refute your hypothesis.
● Inspecting your results. Did the change you made improve your results based on the
measurements you have made? Not all changes do; some changes actually make things
worse.
● Adapting your goals or your approach based on what you learned. Both your goals
and your improvement experiments will likely evolve as you learn more about customers,
competitors, and your organization's capabilities. Goals can change because of outside
events, and your tactics to reach your goals may need to be reconsidered and revised.
Was the Intermediate Goal the right goal? Is the Strategic Goal still relevant? If you
achieved the Intermediate Goal, you will need to choose a new Intermediate Goal. If
you did not achieve it, you will need to decide whether you need to persevere, stop, or
pivot toward something new. If your Strategic Goal is no longer relevant, you will need to
either adapt it, or replace it.

3
The Experiment Loop is a variation on the Shewhart Cycle, popularized by W. Edwards Deming, also
sometimes called the PDCA (Plan-Do-Check-Act) cycle; see https://en.wikipedia.org/wiki/PDCA.

9
Hypotheses, Experiments, Features, and Requirements
Features are “distinguishing characteristics of a product”4, while a requirement is, practically
speaking, something that someone thinks would be desirable in a product. A feature description
is one kind of requirement.

Organizations can spend a lot of money implementing features and other requirements in
products, only to find that customers don’t share the company’s opinion on their value; beliefs in
what is valuable are merely assumptions until they are validated by customers. This is where
hypotheses and experiments are useful.

In simplified terms, a hypothesis is a proposed explanation for some observation that has not
yet been proven (or disproven). In the context of requirements, it is a belief that doing something
will lead to something else, such as delivering feature X will lead to outcome Y. An experiment
is a test that is designed to prove or reject some hypothesis.

Every feature and every requirement really represent a hypothesis about value. One of the
goals of an empirical approach is to make these hypotheses explicit and to consciously design
experiments that explicitly test the value of the features and requirements. The entire feature or
requirement need not actually be built to determine whether it is valuable; it may be sufficient for
a team to simply build enough of it to validate critical assumptions that would prove or disprove
its value.

Explicitly forming hypotheses, measuring results, and inspecting and adapting goals based on
those results are implicit parts of an agile approach. Making this work explicit and transparent is
what EBM adds to the organizational improvement process.

End Note
Evidence-Based Management is free and offered in this Guide. Although implementing only
parts of EBM is possible, the result is not Evidence-Based Management.

Acknowledgements
Evidence-Based Management was collaboratively developed by Scrum.org, the Professional
Scrum Trainer Community, Ken Schwaber and Christina Schwaber.

4
Adapted from the IEEE 829 specification

10
Appendix: Example Key Value Measures
To encourage adaptability, EBM defines no specific Key Value Measures (KVMs). KVMs listed
below are presented to show the kinds of measures that might help an organization to
understand its current state, desired future state, and factors that influence its ability to improve.

Current Value (CV)
KVM Measuring:

Revenue per Employee The ratio (gross revenue / # of employees) is a key competitive
indicator within an industry. This varies significantly by industry.

Product Cost Ratio Total expenses and costs for the product(s)/system(s) being
measured, including operational costs compared to revenue.

Employee Satisfaction Some form of sentiment analysis to help gauge employee
engagement, energy, and enthusiasm.

Customer Satisfaction Some form of sentiment analysis to help gauge customer
engagement and happiness with the product.

Customer Usage Index Measurement of usage, by feature, to help infer the degree to
which customers find the product useful and whether actual usage
meets expectations on how long users should be taking with a
feature.

Unrealized Value (UV)
KVM Measuring:

Market Share The relative percentage of the market not controlled by the
product; the potential market share that the product might achieve
if it better met customer needs.

Customer or User The difference between a customer or user’s desired experience
Satisfaction Gap and their current experience.

Desired Customer A measure that indicates the experience that the customer would
Experience or like to have
satisfaction

11
Time-to-Market (T2M)
KVM Measuring:

Build and Integration The number of integrated and tested builds per time period. For a
Frequency team that is releasing frequently or continuously, this measure is
superseded by actual release measures.

Release Frequency The number of releases per time period, e.g. continuously, daily,
weekly, monthly, quarterly, etc. This helps reflect the time needed
to satisfy the customer with new and competitive products.

Release Stabilization The time spent correcting product problems between the point the
Period developers say it is ready to release and the point where it is
actually released to customers. This helps represent the impact of
poor development practices and underlying design and code
base.

Mean Time to Repair The average amount of time it takes from when an error is
detected and when it is fixed. This helps reveal the efficiency of
an organization to fix an error.

Customer Cycle Time The amount of time from when work starts on a release until the
point where it is actually released. This measure helps reflect an
organization’s ability to reach its customer.

Lead Time The amount of time from when an idea is proposed, or a
hypothesis is formed until a customer can benefit from that idea.
This measure may vary based on customer and product. It is a
contributing factor for customer satisfaction.

Lead Time for Changes The amount of time to go from code-committed to code
successfully running in production. For more information, see the
DORA 2019 report.

Deployment Frequency The number of times that the organization deployed (released) a
new version of the product to customers/users. For more
information, see the DORA 2019 report.

Time to Restore Service The amount of time between the start of a service outage and the
restoration of full availability of the service. For more information,
see the DORA 2019 report.

Time-to-Learn The total time needed to sketch an idea or improvement, build it,
deliver it to users, and learn from their usage.

12
Time to remove The average amount of time from when an impediment is raised
Impediment until when it is resolved. It is a contributing factor to lead time and
employee satisfaction.

Time to Pivot A measure of true business agility that presents the elapsed time
between when an organization receives feedback or new
information and when it responds to that feedback; for example,
the time between when it finds out that a competitor has delivered
a new market-winning feature to when the organization responds
with matching or exceeding new capabilities that measurably
improve customer experience.

Ability to Innovate (A2I)
KVM Measuring:

Innovation Rate The percentage of effort or cost spent on new product capabilities, divided
by total product effort or cost. This provides insight into the capacity of the
organization to deliver new product capabilities.

Defect Trends Measurement of change in defects since last measurement. A defect is
anything that reduces the value of the product to a customer, user, or to the
organization itself. Defects are generally things that don’t work as intended.

On-Product Index The percentage of time teams spend working on product and value.

Installed Version The number of versions of a product that are currently being supported.
Index This reflects the effort the organization spends supporting and maintaining
older versions of software.

Technical Debt A concept in programming that reflects the extra development and testing
work that arises when “quick and dirty” solutions result in later remediation.
It creates an undesirable impact on the delivery of value and an avoidable
increase in waste and risk.

Production The number of times in a given period that the Development Team was
Incident Count interrupted to fix a problem in an installed product. The number and
frequency of Production Incidents can help indicate the stability of the
product.

13
Active Product The number of different versions (or variants) of a product or service.
(Code) Branches Provides insight into the potential impact of change and the resulting
complexity of work.

Time Spent The amount of time spent applying changes across different versions of a
Merging Code product or service. Provides insight into the potential impact of change and
Between the resulting complexity of work.
Branches

Time Spent Examples include time lost to interruptions caused by meetings or calls,
Context-Switching time spent switching between tasks, and time lost when team members are
interrupted to help people outside the team can give simple insight into the
magnitude of the problem.

Change Failure The percentage of released product changes that result in degraded
Rate service and require remediation (e.g. hotfix, rollback, patch). For more
information, see the DORA 2019 report.