Span Programs and Quantum Space Complexity

Author: Stacey Jeffery
                          Theory of Computing, Volume 18 (11), 2022, pp. 1–49
                                       www.theoryofcomputing.org




       Span Programs and Quantum Space
                  Complexity
                                                   Stacey Jeffery∗
                 Received October 7, 2020; Revised November 3, 2021; Published May 24, 2022




       Abstract. While quantum computers hold the promise of significant computational
       speedups, the limited size of early quantum machines motivates the study of
       space-bounded quantum computation. We relate the quantum space complexity of
       computing a function 𝑓 with one-sided error to the logarithm of its span program size
       over the reals, a classical quantity that is well-studied in attempts to prove formula
       size lower bounds.
           In the more natural bounded error model, we show that the amount of space needed
       for a unitary quantum algorithm (i. e., an algorithm that makes no measurements
       until the final step) to compute 𝑓 with bounded (two-sided) error is at least the
       logarithm of its approximate span program size. Approximate span programs have been
       introduced in the field of quantum algorithms but not studied classically. However,
       the approximate span program size of a function is a natural generalization of its
       span program size.

     A conference version of this paper appeared in the Proceedings of the 11th Innovations in Theoretical Computer
Science Conference, 2020 [17].
   ∗ Supported by an NWO WISE Fellowship, an NWO Veni Innovational Research Grant under project number
639.021.752, and QuantERA project QuantAlgo 680-91-03. SJ is a CIFAR Fellow in the Quantum Information Science
Program.


ACM Classification: F.1.1, F.1.3
AMS Classification: 81P68
Key words and phrases: quantum computing, quantum space complexity, span programs


© 2022 Stacey Jeffery
Licensed under a Creative Commons Attribution License (CC-BY). DOI: 10.4086/toc.2022.v018.a011

          While no non-trivial lower bound is known on the span program size (or
      approximate span program size) of any explicit function, a number of lower bounds
      are known on the monotone span program size. We show that the approximate
      monotone span program size of 𝑓 is a lower bound on the space needed by quantum
      algorithms of a particular form, called monotone phase estimation algorithms, to compute
      𝑓 . We then give the first non-trivial lower bound on the approximate monotone span
      program size of an explicit function.


1    Introduction
While quantum computers hold the promise of significant speedups for a number of problems,
building them is a serious technological challenge, and it is expected that early quantum
computers will have quantum memories of very limited size. This motivates the theoretical
question: what problems could we solve faster on a quantum computer with limited space?
Or similarly, what is the minimum number of qubits needed to solve a given problem (and
hopefully still get a speedup)?
     We take a modest step towards answering such questions, by relating the space complexity
of a function 𝑓 to its span program size (see Definition 3.3), which is a measure that has received
significant attention in theoretical computer science over the past few decades. Span programs
are a model of computation introduced by Karchmer and Wigderson [20] in an entirely classical
setting; they defined the span program size of a function in order to lower bound the size of
counting branching programs. Some time later, Reichardt and Špalek [28] related span programs
to quantum algorithms, and introduced the new measure of span program complexity (see
Definition 3.4). The importance of span programs in quantum algorithms stems from the ability
to compile any span program for a function 𝑓 into a bounded error quantum algorithm for 𝑓
[27]. In particular, there is a tight correspondence between the span program complexity of 𝑓 ,
and its quantum query complexity—a rather surprising and beautiful connection for a model
originally introduced outside the realm of quantum computing. In contrast, the classical notion
of span program size had received no attention in the quantum computing literature before now.
     Ref. [15] defined the notion of an approximate span program for a function 𝑓 , and showed
that even an approximate span program for 𝑓 can be compiled into a bounded error quantum
algorithm for 𝑓 . In this paper, we further relax the definition of an approximate span program
for 𝑓 , making analysis of such algorithms significantly easier (see Definition 3.6).
     Let S𝑈 ( 𝑓 ) denote the bounded error unitary space complexity of 𝑓 , or the minimum space needed
by a unitary quantum algorithm—i. e., an algorithm that makes no measurements until the final
step—that computes 𝑓 with bounded error (see Definition 2.3). In [10] and [12], independently,
it was shown that S𝑈 ( 𝑓 ) = S( 𝑓 ) (up to constants), where S( 𝑓 ) denotes the bounded error space
complexity of 𝑓 , without the restriction to algorithms that are unitary. Our results are proven for
$\mathsf{S}_U(f)$, but the results of [10, 12] imply that they also apply to $\mathsf{S}(f)$. A similar statement is not
known for the one-sided error unitary quantum space complexity, $\mathsf{S}_U^1(f)$, though we suspect that
it also holds, and a proof of this would strengthen our results about $\mathsf{S}_U^1(f)$ to also hold for $\mathsf{S}^1(f)$.
     For a function $f : \{0,1\}^n \to \{0,1\}$, we can assume that the input is accessed by queries, so
that we do not need to store the full 𝑛-bit input in working memory, but we need at least log 𝑛
bits of memory to store an index into the input. Thus, a lower bound of 𝜔(log 𝑛) on S( 𝑓 ) for
some 𝑓 would be considered non-trivial.
    Letting $\mathsf{SP}(f)$ denote the minimum size of a span program deciding 𝑓 , and $\widetilde{\mathsf{SP}}(f)$ the
minimum size of a span program approximating 𝑓 (see Definition 3.7), we have the following
(see Theorem 4.1):

Theorem 1.1 (Informal). For any Boolean function 𝑓 , if $\mathsf{S}(f)$ denotes its bounded error quantum space
complexity, and $\widetilde{\mathsf{SP}}(f)$ its approximate span program size, then

    $$\mathsf{S}(f) \geq \log \widetilde{\mathsf{SP}}(f).$$

Similarly, if $\mathsf{S}_U^1(f)$ denotes its one-sided error unitary space complexity, and $\mathsf{SP}(f)$ its span program size,
then

    $$\mathsf{S}_U^1(f) \geq \log \mathsf{SP}(f).$$

    In the case of bounded (two-sided) error, this lower bound is tight in the following sense
(corollary of Theorems 3.1 and 3.2):

Theorem 1.2 (Informal). The class of languages decidable in bounded error by a quantum algorithm
with space $O(S)$ and $2^{O(S)}$ queries¹ is equal to the class of languages approximated by a span program of
size and complexity $2^{O(S)}$.

     The relationship between span program size and quantum space complexity is rather natural,
as the span program size of 𝑓 is known to lower bound the minimum size of a symmetric
branching program for 𝑓 , and the logarithm of the branching program size of a function 𝑓
characterizes its classical deterministic space complexity.
     The inequality $\mathsf{S}_U^1(f) \geq \log \mathsf{SP}(f)$ in Theorem 1.1 follows from a construction of [27] for
converting a one-sided error quantum algorithm for 𝑓 into a span program for 𝑓 . We adapt this
construction to show how to convert a bounded (two-sided) error unitary quantum algorithm
for 𝑓 with query complexity $T$ and space complexity $S \geq \log T$ into an approximate span
program for 𝑓 with complexity $\Theta(T)$ and size $2^{\Theta(S)}$, proving $\mathsf{S}_U(f) \geq \Omega(\log \widetilde{\mathsf{SP}}(f))$, and thus
$\mathsf{S}(f) \geq \Omega(\log \widetilde{\mathsf{SP}}(f))$. The connection between $\mathsf{S}(f)$ and $\log \widetilde{\mathsf{SP}}(f)$ is tight up to an additive term
of the logarithm of the minimum complexity of any span program for 𝑓 with optimal size,
yielding Theorem 1.2. This follows from the fact that an approximate span program can be
compiled into a quantum algorithm in a way that similarly preserves the correspondence between
space complexity and (logarithm of) span program size, as well as the correspondence between
query complexity and span program complexity (see Theorem 3.1). While the preservation of
the correspondence between query complexity and span program complexity (in both directions)
is not necessary for our results, it may be useful in future work for studying lower bounds on
time and space simultaneously.

   ¹ Depending on the precise model of computation, it is without loss of generality to assume that the space is at
least logarithmic in the number of queries. In our model of unitary quantum algorithms (see Section 2), this is a
reasonable assumption since we would need to use a counter of size at least logarithmic in the query complexity to
know which unitary to apply. In the case of a quantum Turing machine that halts absolutely, if there is ever a pair of
time steps $t \neq t'$ such that the state of the machine at step $t$ and the state at step $t'$ are non-orthogonal, then some
(exponentially decreasing) branch of the computation will run forever, which is a contradiction.

    The significance of Theorem 1.1 is that span program size has received extensive attention in
theoretical computer science. Using results from [5], the connection in Theorem 1.1 immediately
implies the following (Theorem 4.2):

Theorem 1.3. For almost all Boolean functions 𝑓 on 𝑛 bits, $\mathsf{S}_U^1(f) = \Omega(n)$.

    If we make a uniformity assumption that the quantum space complexity of an algorithm is at
least the logarithm of its time complexity, then Theorem 1.3 would follow from a lower bound of
$\Omega(2^n)$ on the quantum time complexity of almost all 𝑛-bit Boolean functions. Notwithstanding,
the proof via span program size is evidence of the power of the technique.
    In the pursuit of lower bounds on span program size of explicit functions, several nice
expressions lower bounding SP( 𝑓 ) have been derived. By adapting one such lower bound on
$\mathsf{SP}(f)$ to $\widetilde{\mathsf{SP}}(f)$, we get the following (see Lemma 4.6):

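Theorem 1.4 (Informal). For any Boolean function 𝑓 , and partial matrix² $M \in (\mathbb{R} \cup \{\star\})^{f^{-1}(0) \times f^{-1}(1)}$
with $\|M\|_\infty \leq 1$:

    $$\mathsf{S}(f) \geq \Omega\left(\log\left(\frac{\tfrac{1}{2}\text{-rank}(M)}{\max_{i \in [n]} \mathrm{rank}(M \circ \Delta_i)}\right)\right),$$

where $\circ$ denotes the entrywise product, and $\Delta_i[x, y] = 1$ if $x_i \neq y_i$ and 0 else.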

    Above, $\tfrac{1}{2}$-rank denotes the approximate rank, or the minimum rank of any matrix $\widetilde{M}$ such
that $|M[x, y] - \widetilde{M}[x, y]| \leq \tfrac{1}{2}$ for each $x, y$ such that $M[x, y] \neq \star$. If we replace $\tfrac{1}{2}$-rank$(M)$ with
rank$(M)$, we get the logarithm of an expression called the rank measure, introduced by Razborov
[25]. The rank measure was shown by Gàl to be a lower bound on span program size, $\mathsf{SP}$ [11],
and thus, our results imply that the log of the rank measure is a lower bound on $\mathsf{S}_U^1$. It is
straightforward to extend this proof to the approximate case to get Theorem 1.4.
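To make the quantity in Theorem 1.4 concrete, here is a minimal Python sketch that evaluates the exact rank measure, rank(M) / max_i rank(M ∘ Δ_i), by brute force for a small function. It is an illustration under stated assumptions, not the paper's method: it uses the exact rank rather than the 1/2-rank, it takes M to have no ★ entries, and the names `rank_measure` and `maj3` as well as the all-ones choice of M are hypothetical.

```python
import itertools
import numpy as np

def rank_measure(f, n, entry):
    """Return rank(M) / max_i rank(M ∘ Δ_i), where M[x, y] = entry(x, y).

    Rows are indexed by the 0-inputs of f and columns by its 1-inputs.
    Assumes M has no undefined (star) entries and that some M ∘ Δ_i is nonzero.
    """
    inputs = list(itertools.product([0, 1], repeat=n))
    zeros = [x for x in inputs if f(x) == 0]  # row labels
    ones = [y for y in inputs if f(y) == 1]   # column labels
    M = np.array([[entry(x, y) for y in ones] for x in zeros], dtype=float)
    denominators = []
    for i in range(n):
        # Δ_i[x, y] = 1 exactly when x and y differ in position i.
        delta_i = np.array([[1.0 if x[i] != y[i] else 0.0 for y in ones]
                            for x in zeros])
        denominators.append(np.linalg.matrix_rank(M * delta_i))
    return np.linalg.matrix_rank(M) / max(denominators)

def maj3(x):
    """Majority of three bits (an illustrative choice of f)."""
    return int(sum(x) >= 2)

# All-ones M: rank(M) = 1, and each M ∘ Δ_i = Δ_i has rank 2 for this f.
ratio = rank_measure(maj3, 3, lambda x, y: 1.0)
print(ratio)
```

For this choice the ratio is 1/2, so the logarithm is negative and the bound is vacuous. A useful M would have to make rank(M) large while keeping every M ∘ Δ_i of low rank, which is exactly what is hard to achieve for explicit functions.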
    Theorem 1.4 seems to give some hope of proving a non-trivial—that is, 𝜔(log 𝑛)—lower
bound on the quantum space complexity of some explicit 𝑓 , by exhibiting a matrix 𝑀 for which
the (approximate) rank measure is $2^{\omega(\log n)}$. In [25], Razborov showed that the rank measure is a
lower bound on the Boolean formula size of 𝑓 , motivating significant attempts to prove lower
bounds on the rank measure of explicit functions. The bad news is, circuit lower bounds have
been described as “Complexity theory’s Waterloo” [4]. Despite significant effort, no non-trivial
lower bound on span program size for any 𝑓 is known.
    Due to the difficulty of proving explicit lower bounds on span program size, earlier work has
considered the easier problem of lower bounding monotone span program size, $\mathsf{mSP}(f)$. For a
monotone function 𝑓 , the monotone span program size of 𝑓 , $\mathsf{mSP}(f)$, is the minimum size of any
monotone span program for 𝑓 (see Definition 5.1). We can similarly define the approximate monotone
span program size of 𝑓 , $\widetilde{\mathsf{mSP}}(f)$ (see Definition 5.1). Although $\log \widetilde{\mathsf{mSP}}(f)$ is not a lower bound
on $\mathsf{S}(f)$, even for monotone 𝑓 , it is a lower bound on the space complexity of any algorithm
obtained by compiling a monotone span program. We show that such algorithms are equivalent
to a more natural class of algorithms called monotone phase estimation algorithms. Informally, a
phase estimation algorithm is an algorithm that works by performing phase estimation of some
unitary that makes one query to the input, and estimating the amplitude on a 0 in the phase
register (see Definition 5.12). Any quantum algorithm can be put into this form in a way that
preserves its space, query, and even time complexity. A monotone phase estimation algorithm
is a phase estimation algorithm where, loosely speaking, adding 0s to the input can only make
the algorithm more likely to reject (see Definition 5.13). This includes, for example, the phase
estimation variant of Grover’s algorithm. We can then prove the following (see Theorem 5.14):

   ² Note that $M$ depends on 𝑓 in that it is indexed by the 0- and 1-inputs of 𝑓 .

Theorem 1.5 (Informal). For any Boolean function 𝑓 , any bounded error monotone phase estimation
algorithm for 𝑓 has space complexity at least $\log \widetilde{\mathsf{mSP}}(f)$, and any one-sided error monotone phase
estimation algorithm for 𝑓 has space complexity at least $\log \mathsf{mSP}(f)$.

    Fortunately, non-trivial lower bounds for the monotone span program complexity are
known for explicit functions. In Ref. [5], Babai, Gàl and Wigderson showed a lower bound
of $\mathsf{mSP}(f) \geq 2^{\Omega(\log^2(n)/\log\log(n))}$ for some explicit function 𝑓 , which was later improved to
$\mathsf{mSP}(f) \geq 2^{\Omega(\log^2(n))}$ by Gàl [11]. In Ref. [29], a function 𝑓 was exhibited with $\mathsf{mSP}(f) \geq 2^{n^{\epsilon}}$
for some constant $\epsilon \in (0, 1)$, and in the strongest known result, Pitassi and Robere exhibited a
function 𝑓 with $\mathsf{mSP}(f) \geq 2^{\Omega(n)}$ [24]. Combined with our results, each of these implies a lower
bound on the space complexity of one-sided error monotone phase estimation algorithms. For
example, the result of [24] implies a lower bound of Ω(𝑛) on the space complexity of one-sided
error monotone phase estimation algorithms for a certain satisfiability problem 𝑓 . This lower
bound, and also the one in [29], are proven by choosing 𝑓 based on a constraint satisfaction
problem with high refutation width, which is a measure related to the space complexity of certain
classes of SAT solvers, so it is intuitively not surprising that these problems should require a
large amount of space to solve with one-sided error.
    For the case of bounded error space complexity, we also prove the following (see Theorem 5.3,
Corollary 5.15):

Theorem 1.6 (Informal). There exists a function $f : \{0,1\}^n \to \{0,1\}$ such that any bounded error
monotone phase estimation algorithm for 𝑓 has space complexity $(\log n)^{2-o(1)}$.

   This lower bound is non-trivial, although much less so than the best known lower bound
of $\Omega(n)$ for the one-sided case. The approximate monotone span program lower bound
from which Theorem 1.6 follows also implies a new lower bound of $2^{(\log n)^{2-o(1)}}$ on the
(non-approximate) monotone span program size of the function 𝑓 in Theorem 1.6 (although, as
previously mentioned, there are much better lower bounds for monotone span program size of
other explicit functions).
   To prove the lower bound in Theorem 1.6, we apply a new technique that leverages the
best possible gap between the certificate complexity and approximate polynomial degree of a
function, employing a function $g : \{0,1\}^{m^{2+o(1)}} \to \{0,1\}$ from [8],³ whose certificate complexity
is $m^{1+o(1)}$, and whose approximate degree is $m^{2-o(1)}$. Following a strategy of [29], we use this $g$
to construct a pattern matrix [30] (see Definition 5.8) and use this matrix in a monotone version
of Theorem 1.4 (see Theorem 5.4). The fact that certificate complexity and approximate degree
of total functions are related by $\widetilde{\deg}_{1/3}(g) \leq C(g)^2$ for all $g$ is a barrier to proving a lower bound
better than $(\log n)^2$ using this technique, but we also give a generalization that has the potential
to prove significantly better lower bounds (see Lemma 5.11).


Discussion and open problems The most conspicuous open problem of this work is to prove
a lower bound of $\omega(\log n)$ on $\mathsf{S}(f)$ or even $\mathsf{S}_U^1(f)$ for some explicit decision function 𝑓 . It is
known that any space $S$ quantum Turing machine can be simulated by a deterministic classical
algorithm in space $S^2$ [31], so a lower bound of $\omega(\log^2 n)$ on classical space complexity would
also give a non-trivial lower bound on quantum space complexity. If anything, the relationship
to span program size is evidence that this task is extremely difficult.
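The step from a classical space lower bound to a quantum one can be spelled out as follows. This is only a sketch: $\mathsf{S}_{\mathrm{cl}}(f)$ is a symbol introduced here for illustration (it does not appear in the paper) for the deterministic classical space complexity of 𝑓 , and constant factors are suppressed.

```latex
\mathsf{S}_{\mathrm{cl}}(f) \leq O\big(\mathsf{S}(f)^2\big)
\quad\Longrightarrow\quad
\mathsf{S}(f) \geq \Omega\Big(\sqrt{\mathsf{S}_{\mathrm{cl}}(f)}\Big),
\qquad\text{so}\qquad
\mathsf{S}_{\mathrm{cl}}(f) = \omega(\log^2 n)
\;\Longrightarrow\;
\mathsf{S}(f) = \omega(\log n).
```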
     We have shown a lower bound of $2^{(\log n)^{2-o(1)}}$ on the approximate monotone span program
complexity of an explicit monotone function 𝑓 , which gives a lower bound of $(\log n)^{2-o(1)}$ on
the bounded error space complexity needed by a quantum algorithm of a very specific form: a
monotone phase estimation algorithm. This is much worse than the best bound we can get in
the one-sided case: a lower bound of Ω(𝑛) for some explicit function. An obvious open problem
is to try to get a better lower bound on the approximate monotone span program complexity of
some explicit function.
     Our lower bound of $(\log n)^{2-o(1)}$ only applies to the space complexity of monotone phase
estimation algorithms and does not preclude the existence of a more space-efficient algorithm
of a different form for 𝑓 . We do know that phase estimation algorithms are fully general, in
the sense that every function has a space-optimal phase estimation algorithm. Does something
similar hold for monotone functions and monotone phase estimation algorithms? This would
imply that $\log \widetilde{\mathsf{mSP}}(f)$ is a lower bound on $\mathsf{S}(f)$ for all monotone functions 𝑓 .
     In this paper, we define approximate versions of the rank method and the monotone rank
method and, in the case of the monotone rank method, give an explicit non-trivial lower bound. The
rank method is known to give lower bounds on formula size, and the monotone rank method
on monotone formula size. An interesting question is whether the approximate rank method
also gives lower bounds on some complexity theoretic quantity related to (classical) formulas.
     Our results are a modest first step towards understanding unitary quantum space complexity,
but even if we could lower bound the unitary quantum space complexity of an explicit function,
there are several obstacles limiting the practical consequences of such a result. First, while an
early quantum computer will have a small quantum memory, it is simple to augment it with
a much larger classical memory. Thus, in order to achieve results with practical implications,
we would need to study computational models that make a distinction between quantum and
classical memories. We leave this as an important challenge for future work.

   ³ An earlier version of this paper used a function described in [1] with a 7/6-separation between certificate
complexity and approximate degree. We thank Robin Kothari for pointing us to the improved result of [8].


    Second, we are generally only interested in running quantum algorithms when we get an
advantage over classical computers in the time complexity, so results that give a lower bound on
the quantum space required if we wish to keep the time complexity small, such as time-space
lower bounds, are especially interesting. While we do not address time-space lower bounds in
this paper, one advantage of the proposed quantum space lower bound technique, via span
programs, is that span programs are also known to characterize quantum query complexity,
which is a lower bound on time complexity. We leave exploration of this connection for future
work.
    We mention two previous characterizations of S( 𝑓 ). Ref. [19] showed that S( 𝑓 ) is equal to
the logarithm of the minimum width of a matchgate circuit computing 𝑓 , and thus our results
imply that this minimum matchgate width is approximately equal to the approximate span
program size of 𝑓 . Separately, in Ref. [9], Fefferman and Lin showed that for every function
$k$, inverting $2^{k(n)} \times 2^{k(n)}$ matrices is complete for the class of problems 𝑓 such that $\mathsf{S}(f) \leq k(n)$.
Our results imply that evaluating an approximate span program of size $2^{k(n)}$ (for some suitable
definition of the problem) is similarly complete for this class. Evaluating an approximate span
program boils down to deciding if $\|A(x)^+ |w_0\rangle\|$, for some matrix $A(x)$ partially determined by
the input $x$, and some initial state $|w_0\rangle$, is below a certain threshold, so these results are not
unrelated.⁴ We leave exploring these connections as future work.

Organization The remainder of this paper is organized as follows. In Section 2, we present
the necessary notation and quantum algorithmic preliminaries, and define quantum space
complexity. In Section 3, we define span programs, and describe how they correspond to
quantum algorithms. In particular, we describe how a span program can be “compiled” into
a quantum algorithm (Section 3.2), and how a quantum algorithm can be turned into a span
program (Section 3.3), with both transformations more or less preserving the relationships
between span program size and algorithmic space, and between span program complexity and
query complexity. From this correspondence, we obtain, in Section 4, expressions that lower
bound the quantum space complexity of a function. While we do not know how to instantiate
any of these expressions to get a non-trivial lower bound for an explicit function, in Section 5, we
consider to what extent monotone span program lower bounds are meaningful lower bounds
on variants of quantum space complexity, and give the first non-trivial lower bound on the
approximate monotone span program size of a function.


2     Preliminaries
We begin with some miscellaneous notation. For a vector $|v\rangle$, we let $\||v\rangle\|$ denote its $\ell_2$-norm. In
the following, let $A$ be a matrix with $i$ and $j$ indexing its rows and columns. Define:

    $$\|A\|_\infty = \max_{i,j} |A_{i,j}|, \quad\text{and}\quad \|A\| = \max\{\|A|v\rangle\| : \||v\rangle\| = 1\}.$$

    ⁴ Here, $A(x) = A\Pi_{H(x)}$, where $A$ is as in Definition 3.3, $|w_0\rangle = A^+|\tau\rangle$ for $|\tau\rangle$ as in Definition 3.3, and $H(x)$ is as
in Definition 3.4. $A(x)^+$ denotes the pseudo-inverse of $A(x)$. Then one can verify that $w_+(x) = \|A(x)^+|w_0\rangle\|^2$ (see
Definition 3.4).

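Following [2], define the $\varepsilon$-rank of a matrix $A$ as the minimum rank of any matrix $B$ such that
$\|A - B\|_\infty \leq \varepsilon$. For a matrix $A$ with singular value decomposition $A = \sum_k \sigma_k |v_k\rangle\langle u_k|$, where we
assume $\forall k$, $\sigma_k > 0$, define:

    $$\mathrm{col}(A) = \mathrm{span}\{|v_k\rangle\}_k, \quad \mathrm{row}(A) = \mathrm{span}\{|u_k\rangle\}_k, \quad \ker(A) = \mathrm{row}(A)^\perp, \quad A^+ = \sum_k \frac{1}{\sigma_k} |u_k\rangle\langle v_k|.$$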
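The objects just defined can be computed numerically from a singular value decomposition. The following NumPy sketch is illustrative only: the function names and the tolerance `tol` are assumptions made here, and `eps_rank_upper_bound` returns only an upper bound on the ε-rank (obtained by truncating the SVD), since computing the ε-rank exactly is a harder optimization problem.

```python
import numpy as np

def svd_objects(A, tol=1e-10):
    """Return orthonormal bases for col(A), row(A), ker(A), and the pseudo-inverse A^+."""
    # NumPy factors A = V @ diag(sigma) @ Uh: the columns of V play the role
    # of |v_k> and the rows of Uh play the role of <u_k|.
    V, sigma, Uh = np.linalg.svd(A)
    r = int(np.sum(sigma > tol))          # number of nonzero singular values
    col = V[:, :r]
    row = Uh[:r, :].conj().T
    ker = Uh[r:, :].conj().T              # orthogonal complement of row(A)
    pinv = row @ np.diag(1.0 / sigma[:r]) @ col.conj().T
    return col, row, ker, pinv

def eps_rank_upper_bound(A, eps):
    """Smallest k such that the rank-k SVD truncation B has max |A - B| <= eps."""
    V, sigma, Uh = np.linalg.svd(A, full_matrices=False)
    for k in range(len(sigma) + 1):
        B = (V[:, :k] * sigma[:k]) @ Uh[:k, :]
        if np.max(np.abs(A - B)) <= eps:
            return k
    return len(sigma)

A = np.array([[1.0, 0.0, 0.0],
              [0.0, 2.0, 0.0]])
col, row, ker, pinv = svd_objects(A)
bound = eps_rank_upper_bound(np.diag([3.0, 2.0, 0.1]), 0.5)
```

In the second example the matrix has rank 3, but dropping the singular value 0.1 changes no entry by more than 0.5, so its 1/2-rank is at most 2.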

The following lemma, from [22], is useful in the analysis of quantum algorithms.

Lemma 2.1 (Effective spectral gap lemma). Fix orthogonal projectors $\Pi_A$ and $\Pi_B$. Let $U =
(2\Pi_A - I)(2\Pi_B - I)$, and let $\Pi_\Theta$ be the orthogonal projector onto the $e^{i\theta}$-eigenspaces of $U$ such that
$|\theta| \leq \Theta$. Then if $\Pi_A|u\rangle = 0$, then $\|\Pi_\Theta \Pi_B |u\rangle\| \leq \frac{\Theta}{2} \||u\rangle\|$.

In general, we will let Π𝑉 denote the orthogonal projector onto 𝑉, for a subspace 𝑉.
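Lemma 2.1 can be sanity-checked numerically on random instances. The sketch below is a spot check under assumptions made here, not a proof: the dimensions, ranks, threshold, and function names are arbitrary illustrative choices. To build $\Pi_\Theta$ it uses the fact that the Hermitian matrix $(U + U^\dagger)/2$ has eigenvalue $\cos\theta$ on the $e^{\pm i\theta}$-eigenspaces of $U$, so the eigenphases with $|\theta| \leq \Theta$ are exactly those with $\cos\theta \geq \cos\Theta$.

```python
import numpy as np

def random_projector(dim, rank, rng):
    """Orthogonal projector onto a random rank-dimensional real subspace."""
    q, _ = np.linalg.qr(rng.standard_normal((dim, rank)))
    return q @ q.T

def low_phase_projector(U, theta_max):
    """Projector onto the span of eigenvectors of U with eigenphase |theta| <= theta_max."""
    H = (U + U.conj().T) / 2              # eigenvalue cos(theta) on each eigenspace
    vals, vecs = np.linalg.eigh(H)
    keep = vecs[:, vals >= np.cos(theta_max) - 1e-12]
    return keep @ keep.conj().T

def worst_violation(dim=8, rank_a=3, rank_b=4, theta_max=0.3, trials=200, seed=0):
    """Largest observed value of ||Pi_Theta Pi_B u|| - Theta/2 over random instances."""
    rng = np.random.default_rng(seed)
    identity = np.eye(dim)
    worst = -np.inf
    for _ in range(trials):
        PA = random_projector(dim, rank_a, rng)
        PB = random_projector(dim, rank_b, rng)
        U = (2 * PA - identity) @ (2 * PB - identity)
        u = (identity - PA) @ rng.standard_normal(dim)   # ensures Pi_A u = 0
        u /= np.linalg.norm(u)
        lhs = np.linalg.norm(low_phase_projector(U, theta_max) @ PB @ u)
        worst = max(worst, lhs - theta_max / 2)
    return worst

violation = worst_violation()
```

If the lemma holds, `violation` is never positive beyond floating-point error.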

Unitary quantum algorithms and space complexity A unitary quantum algorithm $\mathcal{A} = \{\mathcal{A}_n\}_{n \in \mathbb{N}}$
is a family (parametrized by $n$) of sequences of $2^{s(n)}$-dimensional unitaries $U_1^{(n)}, \dots, U_{T(n)}^{(n)}$, for
some $s(n) \geq \log n$ and $T(n)$. (We will generally dispense with the explicit parametrization by $n$.)
For $x \in \{0,1\}^n$, let $\mathcal{O}_x$ be the unitary that acts as $\mathcal{O}_x|j\rangle = (-1)^{x_j}|j\rangle$ for $j \in [n]$, and $\mathcal{O}_x|0\rangle = |0\rangle$.
We let $\mathcal{A}(x)$ denote the random variable obtained from measuring

    $$U_T \mathcal{O}_x U_{T-1} \cdots \mathcal{O}_x U_1 |0\rangle$$

with some two-outcome measurement that should be clear from context. We call 𝑇(𝑛) the
query complexity of the algorithm, and 𝑆(𝑛) = 𝑠(𝑛) + log 𝑇(𝑛) the space complexity. By including
a log 𝑇(𝑛) term in the space complexity, we are implicitly assuming that the algorithm must
maintain a counter to know which unitary to apply next. This is a fairly mild uniformity
assumption (that is, any uniformly generated algorithm uses Ω(log 𝑇) space), and it will make
the statement of our results much simpler. The requirement that 𝑠(𝑛) ≥ log 𝑛 is to ensure that
the algorithm has enough space to store an index 𝑖 ∈ [𝑛] into the input.
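The model above can be simulated directly for tiny instances. The following sketch is a toy illustration under assumptions made here: the oracle is taken to act as the identity on basis states beyond $|n\rangle$ (the text does not specify this), and the example algorithm, which decides whether two input bits are equal using $T = 2$ unitaries on a 4-dimensional space, is chosen purely for illustration. All function names are hypothetical.

```python
import math
import numpy as np

def phase_oracle(x, dim):
    """O_x|j> = (-1)^{x_j}|j> for j in 1..n; identity on |0> and (by assumption) on j > n."""
    diag = np.ones(dim)
    for j, bit in enumerate(x, start=1):
        diag[j] = (-1) ** bit
    return np.diag(diag)

def final_state(unitaries, x):
    """Compute U_T O_x U_{T-1} ... O_x U_1 |0>."""
    dim = unitaries[0].shape[0]
    oracle = phase_oracle(x, dim)
    state = np.zeros(dim)
    state[0] = 1.0
    state = unitaries[0] @ state
    for U in unitaries[1:]:
        state = U @ (oracle @ state)
    return state

def space_complexity(dim, T):
    """S = s + log T, where the unitaries act on 2^s = dim dimensions."""
    return math.log2(dim) + math.log2(T)

def reflection_mapping_zero_to(v):
    """Householder reflection (a real unitary) that swaps |0> and the unit vector v."""
    e0 = np.zeros(len(v))
    e0[0] = 1.0
    w = e0 - v
    w = w / np.linalg.norm(w)
    return np.eye(len(v)) - 2 * np.outer(w, w)

# Toy algorithm: U_1 = U_2 = R with R|0> = (|1> + |2>)/sqrt(2).
v = np.array([0.0, 1.0, 1.0, 0.0]) / math.sqrt(2)
R = reflection_mapping_zero_to(v)
prob_zero = {x: abs(final_state([R, R], x)[0]) ** 2
             for x in [(0, 0), (0, 1), (1, 0), (1, 1)]}
S = space_complexity(dim=4, T=2)
```

Measuring whether the final state is $|0\rangle$ succeeds with probability 1 when the two bits agree and 0 when they differ, and the space complexity in the sense above is $s + \log T = 2 + 1 = 3$.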

Remark 2.2. Since $T$ is the number of queries made by the algorithm, we may be tempted to
assume that it is at most $n$. However, while every $n$-bit function can be computed in $n$ queries,
this may not be the case when space is restricted. For example, it is difficult to imagine an
algorithm that uses $O(\log n)$ space and $o(n^{3/2})$ quantum queries to solve the following problem
on $[q]^n \equiv \{0,1\}^{n \log q}$: Decide whether there exist distinct $i, j, k \in [n]$ such that $x_i + x_j + x_k = 0 \bmod q$.
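For concreteness, the problem in Remark 2.2 can be stated as a short classical routine. This is only a brute-force illustration (it is classical, not quantum, and the function name and query accounting are assumptions made here): it keeps just three indices and a counter in working memory, but in the worst case it reads the input far more than $n$ times.

```python
def has_zero_sum_triple(x, q):
    """Return (answer, queries): whether some distinct i, j, k have x_i + x_j + x_k = 0 mod q.

    The working memory is three indices and a counter, i.e. O(log n) bits,
    but the number of input lookups grows like n^3 in the worst case.
    """
    n = len(x)
    queries = 0
    for i in range(n):
        for j in range(i + 1, n):
            for k in range(j + 1, n):
                queries += 3  # one lookup each for x[i], x[j], x[k]
                if (x[i] + x[j] + x[k]) % q == 0:
                    return True, queries
    return False, queries

found, used = has_zero_sum_triple([1, 2, 3, 4], q=6)      # 1 + 2 + 3 = 6
missing, worst = has_zero_sum_triple([1, 1, 1, 1], q=5)   # every triple sums to 3
```

On the second input all four triples are examined, for 12 lookups on an input of length 4.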

   For a (partial) function $f : D \to \{0,1\}$ for $D \subseteq \{0,1\}^n$, we say that 𝒜 computes 𝑓 with
bounded error if for all 𝑥 ∈ 𝐷, 𝒜(𝑥) = 𝑓 (𝑥) with probability at least 2/3. We say that 𝒜
computes 𝑓 with one-sided error if in addition, for all 𝑥 such that 𝑓 (𝑥) = 1, 𝒜(𝑥) = 𝑓 (𝑥) with
probability 1.

Definition 2.3 (Unitary Quantum Space). For a family of functions $f : D \to \{0,1\}$ for $D \subseteq \{0,1\}^n$,
the unitary space complexity of 𝑓 , $\mathsf{S}_U(f)$, is the minimum $S(n)$ such that there is a family of
unitary quantum algorithms with space complexity $S(n)$ that computes 𝑓 with bounded error.
Similarly, $\mathsf{S}_U^1(f)$ is the minimum $S(n)$ such that there is a family of unitary quantum algorithms
with space complexity $S(n)$ that computes 𝑓 with one-sided error.
    In general, quantum algorithms need not be of the strict unitary form described above, as a
quantum computer is not restricted to only measure at the end of the algorithm. If one only cares
about time complexity, then it is without loss of generality to assume that all measurements
happen in the final step of the algorithm, because one can simply set aside any register that
is to be measured, to be used as a “read-only” register (that is, strictly as a control) for the
remainder of the computation. However, it is not obvious that this would not increase the space
complexity, since any register that should have been measured is not available for re-use. It
was recently shown that, in fact, even for space complexity, there is no loss of generality in
considering unitary quantum algorithms [10, 12]; if we let $\mathsf{S}(f)$ denote the minimum space
complexity of any quantum algorithm that computes 𝑓 with bounded error, then $\mathsf{S}(f) = \mathsf{S}_U(f)$ (up to constants). Thus,
we can restrict our attention to unitary quantum algorithms for the remainder of this article, but
all of our results in the bounded error setting also hold for non-unitary algorithms [10, 12]. At
the time of writing, there is no analogous result for $\mathsf{S}_U^1(f)$, but we suspect it holds along similar
lines.

Phase estimation For a unitary $U$ acting on $H$ and a state $|\psi\rangle \in H$, we will say we perform $T$
steps of phase estimation of $U$ on $|\psi\rangle$ when we compute

    $$\frac{1}{\sqrt{T}} \sum_{t=0}^{T-1} |t\rangle U^t |\psi\rangle,$$

and then perform a quantum Fourier transform over ℤ/𝑇 ℤ on the first register, called the phase
register. This procedure was introduced in [21]. It is easy to see that the complexity (either query
or time) of phase estimation is 𝑂(𝑇) times the complexity of implementing a controlled call to
𝑈. The space complexity of phase estimation is log 𝑇 + log dim(𝐻).
    Informally: we will use the fact that if $U|\psi\rangle = |\psi\rangle$, then performing $T$ steps of phase
estimation of $U$ on $|\psi\rangle$ and measuring the phase register results in outcome 0 with probability
1; and if $U|\psi\rangle = e^{i\theta}|\psi\rangle$ for some $\theta \in (-\pi, \pi]$ with $|\theta| > 0$, then performing sufficiently large
$T = \Omega(1/|\theta|)$ steps of phase estimation results in outcome 0 with probability bounded by a
constant below 1. Formally: for the results in Section 3.2, we refer to the proof of [15, Lemma
3.2] where formal results about phase estimation are exploited; for the results in Section 5.2,
we prove the specific properties of phase estimation needed for our purposes in Lemmas 5.18
and 5.19.
    We note that we can increase the success probability to any constant by adding some constant
number 𝑘 of phase registers, and doing phase estimation 𝑘 times in parallel, still using a single
register for 𝑈, and taking the majority. This still has space complexity log dim 𝐻 + 𝑂(log 𝑇).
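
As a rough numerical illustration of the informal statement above, the following sketch computes the probability of seeing 0 in the phase register when $|\psi\rangle$ is an eigenvector of $U$ with eigenphase $\theta$. It assumes the standard closed form for the amplitude on $|0\rangle$ after the Fourier transform, $\frac{1}{T}\sum_t e^{i\theta t}$; the function name `prob_outcome_zero` and the sample values of $\theta$ and $T$ are hypothetical choices, not taken from the paper.

```python
import cmath

def prob_outcome_zero(theta: float, T: int) -> float:
    """Probability of measuring 0 in the phase register after T steps of
    phase estimation on an eigenvector with eigenvalue e^{i*theta}."""
    # Amplitude on |0> after the Fourier transform: (1/T) * sum_t e^{i*theta*t}.
    amp = sum(cmath.exp(1j * theta * t) for t in range(T)) / T
    return abs(amp) ** 2

# Eigenvalue 1 (theta = 0): outcome 0 is certain.
p_fixed = prob_outcome_zero(0.0, 32)

# theta = 0.1 with T = 32 >= pi/theta: outcome 0 is clearly less likely.
p_large_T = prob_outcome_zero(0.1, 32)

# theta = 0.1 with only T = 2 steps: the phase is barely resolved.
p_small_T = prob_outcome_zero(0.1, 2)

print(p_fixed, p_large_T, p_small_T)
```

The last case shows why $T$ must scale like $1/|\theta|$: with too few steps, a small nonzero phase is nearly indistinguishable from phase 0.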

Amplitude estimation For a unitary $U$ acting on $H$, a state $|\psi\rangle \in H$, and an orthogonal
projector $\Pi$ on $H$, we will say we perform $M$ steps of amplitude estimation of $U$ on $|\psi\rangle$ with respect
to $\Pi$ when we perform $M$ steps of phase estimation of

\[ U(2|\psi\rangle\langle\psi| - I)U^\dagger(2\Pi - I) \]

on $U|\psi\rangle$, then, if the phase register contains some $t \in \{0, \dots, M-1\}$, compute $\tilde{p} = \sin^2\frac{\pi t}{2M}$,
which is an estimate of $\|\Pi U|\psi\rangle\|^2$ in a new register. The (time or query) complexity of this is
$O(M)$ times the complexity of implementing a controlled call to $U$, implementing a controlled
call to $2\Pi - I$, and generating $|\psi\rangle$. The space complexity is $\log M + \log\dim H + O(1)$. We have
the following guarantee [7]:

Lemma 2.4. Let $p = \|\Pi U|\psi\rangle\|^2$. There exists $\Delta = \Theta(1/M)$ such that when $\tilde{p}$ is obtained as above from
$M$ steps of amplitude estimation, with probability at least 1/2, $|\tilde{p} - p| \le \Delta$.

We will thus also refer to 𝑀 steps of amplitude estimation as amplitude estimation to precision 1/𝑀.
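
To make the "precision $1/M$" terminology concrete, the sketch below lists the values $\tilde{p} = \sin^2\frac{\pi t}{2M}$ that the phase register can encode and measures the largest gap between neighbouring values. It only illustrates the resolution of the output grid, not the probability guarantee of Lemma 2.4; the helper names are hypothetical.

```python
import math

def estimate_grid(M: int) -> list[float]:
    """All values p~ = sin^2(pi*t/(2M)) that the phase register t can encode."""
    return [math.sin(math.pi * t / (2 * M)) ** 2 for t in range(M)]

def max_adjacent_gap(M: int) -> float:
    """Largest distance between consecutive grid values."""
    grid = estimate_grid(M)
    return max(b - a for a, b in zip(grid, grid[1:]))

for M in (8, 64, 512):
    # The derivative of sin^2 is at most 1, so the gap is at most pi/(2M).
    print(M, max_adjacent_gap(M), math.pi / (2 * M))
```

The gap shrinks proportionally to $1/M$, which is consistent with $\Delta = \Theta(1/M)$.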


3     Span programs and quantum algorithms
In Section 3.1, we will define a span program, its size and complexity, and what it means for a
span program to approximate a function 𝑓 . In Section 3.2, we will prove the following, which
implies that the first part of Theorem 1.1 is essentially tight.

Theorem 3.1. Let $f : D \to \{0,1\}$ for $D \subseteq \{0,1\}^n$, and let $P$ be a span program that $\kappa$-approximates
$f$ with size $K$ and complexity $C$, for some constant $\kappa \in (0,1)$. Then there exists a unitary quantum
algorithm $\mathcal{A}_P$ that decides $f$ with bounded error in space $S = O(\log K + \log C)$ using $T = O(C)$ queries
to $x$.

    Finally, in Section 3.3, we prove the following theorem, which implies Theorem 1.1:

Theorem 3.2. Let $f : D \to \{0,1\}$ for $D \subseteq \{0,1\}^n$ and let $\mathcal{A}$ be a unitary quantum algorithm using $T$
queries, and space $S$ to compute $f$ with bounded error. Then for any constant $\kappa \in (0,1)$, there is a span
program $P_{\mathcal{A}}$ with size $s(P_{\mathcal{A}}) \le 2^{O(S)}$ that $\kappa$-approximates $f$ with complexity $C_\kappa \le O(T)$. If $\mathcal{A}$ decides
$f$ with one-sided error, then $P_{\mathcal{A}}$ decides $f$.

    A statement similar to Theorem 3.1 for the case of exact (𝜅 = 0) span programs⁵ was proven in
[27]. Later this was generalized to the case of approximate span programs [15], but a slightly more
constrained notion of approximation was used, which would not allow us to prove Theorem 3.2.
Neither of these works explicitly mentioned space complexity, although the analysis of the space
complexity follows easily.
    Theorem 3.2 is proven by exhibiting a construction that maps a bounded-error quantum
algorithm for 𝑓 to a span program that approximates it. This is based on a similar construction
in [27] that maps a one-sided error quantum algorithm for 𝑓 to a span program that decides
it exactly. Interestingly, the fact that span program complexity is a lower bound on query
complexity was known even without a mapping from bounded-error quantum algorithms to
span programs. This was proven by showing [27] that the semidefinite minimization problem
whose solution is the minimum span program complexity of a span program that decides 𝑓 is
the dual of a semidefinite program that was known to be a lower bound on quantum query
complexity [3, 6, 14].
    5 See Section 3.1 for definitions of exact and approximate span programs.

3.1     Span programs
Span programs were first introduced in the context of classical complexity theory in [20], where
they were used to study counting classes for nondeterministic logspace machines. While span
programs can be defined with respect to any field, we will consider span programs over ℝ (or
equivalently, ℂ, when convenient, see Remark 3.10). We use the following definition, slightly
modified from [20]:

Definition 3.3 (Span Program and Size). A span program on $\{0,1\}^n$ consists of:

      • Finite inner product spaces $\{H_{j,b}\}_{j \in [n], b \in \{0,1\}} \cup \{H_{\mathrm{true}}, H_{\mathrm{false}}\}$. We define
        $H = \bigoplus_{j,b} H_{j,b} \oplus H_{\mathrm{true}} \oplus H_{\mathrm{false}}$, and for every $x \in \{0,1\}^n$,
        $H(x) = H_{1,x_1} \oplus \cdots \oplus H_{n,x_n} \oplus H_{\mathrm{true}}$.⁶

      • A vector space $V$.

      • A target vector $|\tau\rangle \in V$.⁷

      • A linear map $A : H \to V$.

We specify this span program by $P = (H, V, |\tau\rangle, A)$, and leave the decomposition of $H$ implicit.
The size of the span program is $s(P) = \dim H$.

    To recover the classical definition from [20], we can view 𝐴 as a matrix, with each of its
columns labelled by some (𝑗, 𝑏) ∈ [𝑛] × {0, 1} (or “true” or “false”).
    Span programs were introduced to the study of quantum query complexity in [28]. In the
context of quantum query complexity, 𝑠(𝑃) is no longer the relevant measure of the complexity
of a span program. Instead, [28] introduce the following measures:

Definition 3.4 (Span Program Complexity and Witnesses). For a span program $P = (H, V, |\tau\rangle, A)$
on $\{0,1\}^n$ and input $x \in \{0,1\}^n$, we say $x$ is accepted by the span program if there exists $|w\rangle \in H(x)$
such that $A|w\rangle = |\tau\rangle$, and otherwise we say $x$ is rejected by the span program. Let $P_0$ and $P_1$ be
respectively the set of rejected and accepted inputs to $P$. For $x \in P_1$, define the positive witness
complexity of $x$ as:

\[ w_+(x, P) = w_+(x) = \min\{ \||w\rangle\|^2 : |w\rangle \in H(x),\ A|w\rangle = |\tau\rangle \}. \]

Such a $|w\rangle$ is called a positive witness for $x$. For a domain $D \subseteq \{0,1\}^n$, we define the positive
complexity of $P$ (with respect to $D$) as:

\[ W_+(P, D) = W_+ = \max_{x \in P_1 \cap D} w_+(x, P). \]

    For $x \in P_0$, define the negative witness complexity of $x$ as:

\[ w_-(x, P) = w_-(x) = \min\{ \|\langle\omega|A\|^2 : \langle\omega| \in \mathcal{L}(V, \mathbb{R}),\ \langle\omega|\tau\rangle = 1,\ \langle\omega|A\Pi_{H(x)} = 0 \}. \]

Above, $\mathcal{L}(V, \mathbb{R})$ denotes the set of linear functions from $V$ to $\mathbb{R}$. Such an $\langle\omega|$ is called a negative
witness for $x$. We define the negative complexity of $P$ (with respect to $D$) as:

\[ W_-(P, D) = W_- = \max_{x \in P_0 \cap D} w_-(x, P). \]

    Finally, we define the complexity of $P$ (with respect to $D$) by $C(P, D) = \sqrt{W_+ W_-}$.

    For $f : D \to \{0,1\}$, we say a span program $P$ decides $f$ if $f^{-1}(0) \subseteq P_0$ and $f^{-1}(1) \subseteq P_1$.

    6 We remark that while $H_{\mathrm{true}}$ and $H_{\mathrm{false}}$ may be convenient in constructing a span program, they are not necessary.
We can always consider a partial function $f'$ defined on $(n+1)$-bit strings of the form $(x, 1)$ for $x$ in the domain of $f$,
as $f(x)$, and let $H_{n+1,1} = H_{\mathrm{true}}$ and $H_{n+1,0} = H_{\mathrm{false}}$.
    7 Although $V$ has no meaningful inner product, we use Dirac notation, such as $|\tau\rangle$ and $\langle\omega|$ for the sake of our
fellow quantum computing researchers.
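
As a small worked instance of Definitions 3.3 and 3.4, the sketch below evaluates the witness sizes of a standard span program for OR on $n$ bits. This example is not taken from the paper: it assumes $H_{j,1} = \mathrm{span}\{|j\rangle\}$, $H_{j,0} = \{0\}$, $V = \mathbb{R}$, $|\tau\rangle = 1$ and $A|j\rangle = 1$ for every $j$, and it uses the closed-form minimizers rather than a general solver. The function name is hypothetical.

```python
import math
from itertools import product

def or_span_program_witnesses(n: int):
    """Witness sizes for an illustrative span program for OR on n bits:
    H_{j,1} = span{|j>}, H_{j,0} = {0}, V = R, |tau> = 1, A|j> = 1."""
    w_plus, w_minus = {}, {}
    for x in product((0, 1), repeat=n):
        weight = sum(x)
        if weight > 0:
            # Accepted: minimize sum_j w_j^2 subject to sum_{j: x_j=1} w_j = 1.
            # The minimizer spreads weight uniformly, giving 1/|x|.
            w_plus[x] = 1.0 / weight
        else:
            # Rejected: <omega| = 1 is the only candidate, and ||<omega|A||^2 = n.
            w_minus[x] = float(n)
    return w_plus, w_minus

n = 4
w_plus, w_minus = or_span_program_witnesses(n)
W_plus = max(w_plus.values())    # positive complexity
W_minus = max(w_minus.values())  # negative complexity
complexity = math.sqrt(W_plus * W_minus)
print(W_plus, W_minus, complexity)
```

Under these assumptions the program has size $s(P) = n$ and complexity $\sqrt{n}$, so size and complexity are visibly different measures of the same object.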

Definition 3.5. We define the span program size of a function 𝑓 , denoted SP( 𝑓 ), as the minimum
𝑠(𝑃) over families of span programs that decide 𝑓 .

    We note that originally, in [20], span program size was defined as

\[ s'(P) = \sum_{j,b} \dim(\mathrm{col}(A\Pi_{H_{j,b}})) = \sum_{j,b} \dim(\mathrm{row}(A\Pi_{H_{j,b}})). \]

This could differ from $s(P) = \dim(H) = \sum_{j,b} \dim(H_{j,b})$, because $\dim(H_{j,b})$ might be much larger
than $\dim(\mathrm{row}(A\Pi_{H_{j,b}}))$. However, if a span program has $\dim(H_{j,b}) > \dim(\mathrm{row}(A\Pi_{H_{j,b}}))$ for
some $j, b$, then it is a simple exercise to show that the dimension of $H_{j,b}$ can be reduced
without altering the witness size of any $x \in \{0,1\}^n$, so the definition of $\mathrm{SP}(f)$ is the same as
if we had used $s'(P)$ instead of $s(P)$. In any case, we will not be relying on previous results
about the span program size as a black-box, and will rather prove all required statements, so
this difference has no impact on our results.
    While span program size has only previously been relevant outside the realm of quantum
algorithms, the complexity of a span program deciding 𝑓 has a fundamental correspondence
with the quantum query complexity of 𝑓 . Specifically, a span program 𝑃 can be turned into a
quantum algorithm for 𝑓 with query complexity 𝐶(𝑃, 𝐷), and moreover, for every 𝑓 , there exists
a span program such that the algorithm constructed in this way is optimal [27]. This second
direction is not constructive: there is no known method for converting a quantum algorithm with
query complexity 𝑇 to a span program with complexity 𝐶(𝑃, 𝐷) = Θ(𝑇). However, if we relax
the definition of which functions are decided by a span program, then such a construction is
possible, as we will show in Section 3.3. The following is a slight relaxation of [15, Definition 2.6].⁸
    8 Which was already a relaxation of the notion of a span program deciding a function.

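Definition 3.6 (A Span Program that Approximately Decides a Function). Let $f : D \to \{0,1\}$
for $D \subseteq \{0,1\}^n$ and $\kappa \in (0,1)$. We say that a span program $P$ on $\{0,1\}^n$ $\kappa$-approximates $f$ if
$f^{-1}(0) \subseteq P_0$, and for every $x \in f^{-1}(1)$, there exists an approximate positive witness $|\hat{w}\rangle$ such that
$A|\hat{w}\rangle = |\tau\rangle$, and $\|\Pi_{H(x)^\perp}|\hat{w}\rangle\|^2 \le \frac{\kappa}{W_-}$. We define the approximate positive complexity as

\[ \widehat{W}_+ = \widehat{W}_+^{\kappa}(P, D) = \max_{x \in f^{-1}(1)} \min\left\{ \||\hat{w}\rangle\|^2 : A|\hat{w}\rangle = |\tau\rangle,\ \|\Pi_{H(x)^\perp}|\hat{w}\rangle\|^2 \le \frac{\kappa}{W_-} \right\}. \]

If $P$ $\kappa$-approximates $f$, we define the complexity of $P$ (wrt. $D$ and $\kappa$) as $C_\kappa(P, D) = \sqrt{\widehat{W}_+ W_-}$.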

   If $\kappa = 0$, the span program in Definition 3.6 decides $f$ (exactly), and $\widehat{W}_+ = W_+$. By [15,
Theorem 2.10], for any $x$,

\[ \min\left\{ \|\Pi_{H(x)^\perp}|\hat{w}\rangle\|^2 : A|\hat{w}\rangle = |\tau\rangle \right\} = \frac{1}{w_-(x)}. \]

Thus, since $W_- = \max_{x \in f^{-1}(0)} w_-(x)$, for every $x \in f^{-1}(0)$, there does not exist an approximate
positive witness with $\|\Pi_{H(x)^\perp}|\hat{w}\rangle\|^2 < \frac{1}{W_-}$. Thus, when a span program $\kappa$-approximates $f$, there
is a gap of size $\frac{1-\kappa}{W_-}$ between the smallest positive witness error $\|\Pi_{H(x)^\perp}|\hat{w}\rangle\|^2$ of $x \in f^{-1}(1)$, and the
smallest positive witness error of $x \in f^{-1}(0)$.


Definition 3.7. We define the $\kappa$-approximate span program size of a function $f$, denoted $\widetilde{\mathrm{SP}}_\kappa(f)$, as
the minimum $s(P)$ over families of span programs that $\kappa$-approximate $f$. We let $\widetilde{\mathrm{SP}}(f) = \widetilde{\mathrm{SP}}_{1/4}(f)$.

    We note that the choice of $\kappa = 1/4$ in $\widetilde{\mathrm{SP}}(f)$ is arbitrary, as it is possible to modify a span
program to reduce any constant $\kappa$ to any other constant without changing the complexity or the
logarithm of the size asymptotically. This convenient observation is formalized in the following
claim.
Claim 3.8. Let $P$ be a span program that $\kappa$-approximates $f : D \to \{0,1\}$ for some constant $\kappa$. For any
constant $\kappa' \le \kappa$, there exists a span program $P'$ that $\kappa'$-approximates $f$ with

\[ s(P') = (s(P) + 2)^{2\frac{\log(1/\kappa')}{\log(1/\kappa)}}, \qquad (3.1) \]

and $C_{\kappa'}(P', D) \le O(C_\kappa(P, D))$.
   We prove Claim 3.8 shortly in Section 3.1.1. We have the following corollary that will be
useful later, where $\mathrm{m}\widetilde{\mathrm{SP}}_\kappa$ is the monotone approximate span program size, defined in Definition 5.1:
Corollary 3.9. For any $\kappa, \kappa' \in (0,1)$ with $\kappa' < \kappa$, and any Boolean function $f$,

\[ \widetilde{\mathrm{SP}}_\kappa(f) \ge \widetilde{\mathrm{SP}}_{\kappa'}(f)^{\frac{1}{2}\frac{\log(1/\kappa)}{\log(1/\kappa')}} - 2. \]

If $f$ is monotone, we also have

\[ \mathrm{m}\widetilde{\mathrm{SP}}_\kappa(f) \ge \mathrm{m}\widetilde{\mathrm{SP}}_{\kappa'}(f)^{\frac{1}{2}\frac{\log(1/\kappa)}{\log(1/\kappa')}} - 2. \]

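Remark 3.10. It can sometimes be useful to construct a span program over $\mathbb{C}$. However, for
any span program over $\mathbb{C}$, $P$, there is a span program over $\mathbb{R}$, $P'$, such that for all $x \in P_0$,
$w_-(x, P') \le w_-(x, P)$, for all $x \in P_1$, $w_+(x, P') \le w_+(x, P)$, and $s(P') \le 2s(P)$. We define $P'$
as follows. Without loss of generality, suppose $H_{j,b} = \mathrm{span}_{\mathbb{C}}\{|j, b, k\rangle : k \in S_{j,b}\}$. Define
$H'_{j,b} = \mathrm{span}_{\mathbb{R}}\{|j, b, k, a\rangle : k \in S_{j,b}, a \in \{0,1\}\}$. Define

\[ A'|j, b, k, 0\rangle = \mathrm{Re}(A|j, b, k\rangle)|0\rangle + \mathrm{Im}(A|j, b, k\rangle)|1\rangle \]

\[ A'|j, b, k, 1\rangle = \mathrm{Re}(A|j, b, k\rangle)|1\rangle - \mathrm{Im}(A|j, b, k\rangle)|0\rangle. \]

Finally, let $|\tau'\rangle = |\tau\rangle|0\rangle$.
   Suppose $|w\rangle$ is a witness in $P$. Then

\[ \begin{aligned} |\tau\rangle = A|w\rangle &= A\,\mathrm{Re}(|w\rangle) + iA\,\mathrm{Im}(|w\rangle) \\ &= \mathrm{Re}(A\,\mathrm{Re}(|w\rangle)) + i\,\mathrm{Im}(A\,\mathrm{Re}(|w\rangle)) + i\,\mathrm{Re}(A\,\mathrm{Im}(|w\rangle)) - \mathrm{Im}(A\,\mathrm{Im}(|w\rangle)). \end{aligned} \]

Since we can assume $|\tau\rangle$ is real, we have

\[ |\tau\rangle = \mathrm{Re}(A\,\mathrm{Re}(|w\rangle)) - \mathrm{Im}(A\,\mathrm{Im}(|w\rangle)) \quad\text{and}\quad \mathrm{Im}(A\,\mathrm{Re}(|w\rangle)) + \mathrm{Re}(A\,\mathrm{Im}(|w\rangle)) = 0. \]

Define $|w'\rangle = \mathrm{Re}(|w\rangle)|0\rangle + \mathrm{Im}(|w\rangle)|1\rangle$. Then

\[ A'|w'\rangle = \mathrm{Re}(A\,\mathrm{Re}(|w\rangle))|0\rangle + \mathrm{Im}(A\,\mathrm{Re}(|w\rangle))|1\rangle + \mathrm{Re}(A\,\mathrm{Im}(|w\rangle))|1\rangle - \mathrm{Im}(A\,\mathrm{Im}(|w\rangle))|0\rangle = |\tau\rangle|0\rangle = |\tau'\rangle. \]

Note that we have $\||w\rangle\| = \||w'\rangle\|$. A similar argument holds for negative witnesses.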
   Thus, we will restrict our attention to real span programs, but still allow constructions of
span programs over ℂ (in particular, in Section 3.3 and Section 5.2.1).
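
The conversion in Remark 3.10 can be checked on a toy example. The sketch below builds $A'$ from a small complex matrix $A$ and verifies that $|w'\rangle = \mathrm{Re}(|w\rangle)|0\rangle + \mathrm{Im}(|w\rangle)|1\rangle$ maps to $|\tau\rangle|0\rangle$ with the same norm. The matrix, the witness, the index convention (index $2k + a$ for basis state $|k, a\rangle$) and the helper names are assumptions made only for this illustration, and the decomposition of $H$ into the spaces $H_{j,b}$ is ignored.

```python
def realify(A):
    """Map a complex matrix A (list of rows) to the real matrix A' of Remark 3.10.
    Column (k, a) of A' has index 2*k + a; row (v, c) has index 2*v + c."""
    rows, cols = len(A), len(A[0])
    A_real = [[0.0] * (2 * cols) for _ in range(2 * rows)]
    for v in range(rows):
        for k in range(cols):
            re, im = A[v][k].real, A[v][k].imag
            A_real[2 * v][2 * k] = re          # Re(A|k>)|0> part of A'|k,0>
            A_real[2 * v + 1][2 * k] = im      # Im(A|k>)|1> part of A'|k,0>
            A_real[2 * v + 1][2 * k + 1] = re  # Re(A|k>)|1> part of A'|k,1>
            A_real[2 * v][2 * k + 1] = -im     # -Im(A|k>)|0> part of A'|k,1>
    return A_real

def realify_witness(w):
    """|w'> = Re(|w>)|0> + Im(|w>)|1>, using the same index convention."""
    out = []
    for amp in w:
        out.extend([amp.real, amp.imag])
    return out

def matvec(M, v):
    """Plain matrix-vector product on nested lists."""
    return [sum(m * x for m, x in zip(row, v)) for row in M]

# Toy complex data (hypothetical): A|w> = |tau> with |tau> = (3, 0) real.
A = [[1j, 1 + 0j], [1 + 0j, 0.5j]]
w = [-1j, 2 + 0j]
tau = matvec(A, w)

tau_real = matvec(realify(A), realify_witness(w))  # should equal |tau>|0>
norm_sq = sum(abs(a) ** 2 for a in w)
norm_sq_real = sum(a * a for a in realify_witness(w))
print(tau, tau_real, norm_sq, norm_sq_real)
```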

3.1.1   Proof of Claim 3.8

In this section, we prove Claim 3.8. The proof is somewhat technical, and may be skipped
without compromising the reader’s understanding of the remainder of the paper. We restate
Claim 3.8 below.

Claim 3.8. Let $P$ be a span program that $\kappa$-approximates $f : D \to \{0,1\}$ for some constant
$\kappa$. For any constant $\kappa' \le \kappa$, there exists a span program $P'$ that $\kappa'$-approximates $f$ with
$s(P') = (s(P) + 2)^{2\frac{\log(1/\kappa')}{\log(1/\kappa)}}$, and $C_{\kappa'}(P', D) \le O(C_\kappa(P, D))$.

    Let $|w_0\rangle = A^+|\tau\rangle$. We say a span program is normalized if $\||w_0\rangle\| = 1$. A span program can
easily be normalized by scaling $|\tau\rangle$, which also scales all positive witnesses and inverse scales
all negative witnesses. However, we sometimes want to normalize a span program, while also
keeping all negative witness sizes bounded by a constant. We can accomplish this using the
following construction, from [15].

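Theorem 3.11. Let $P = (H, V, |\tau\rangle, A)$ be a span program on $\{0,1\}^n$, and let $N = \||w_0\rangle\|^2$. For a
positive real number $\beta$, define a span program $P^\beta = (H^\beta, V^\beta, |\tau^\beta\rangle, A^\beta)$ as follows, where $|\hat{0}\rangle$ and $|\hat{1}\rangle$ are
not in $H$ or $V$:

\[ H^\beta_{j,b} = H_{j,b}, \quad H^\beta_{\mathrm{true}} = H_{\mathrm{true}} \oplus \mathrm{span}\{|\hat{1}\rangle\}, \quad H^\beta_{\mathrm{false}} = H_{\mathrm{false}} \oplus \mathrm{span}\{|\hat{0}\rangle\} \]

\[ V^\beta = V \oplus \mathrm{span}\{|\hat{1}\rangle\}, \quad A^\beta = \beta A + |\tau\rangle\langle\hat{0}| + \frac{\sqrt{\beta^2 + N}}{\beta}|\hat{1}\rangle\langle\hat{1}|, \quad |\tau^\beta\rangle = |\tau\rangle + |\hat{1}\rangle. \]

Then we have the following:

   • $\|(A^\beta)^+|\tau^\beta\rangle\| = 1$;

   • for all $x \in P_1$, $w_+(x, P^\beta) = \frac{1}{\beta^2} w_+(x, P) + 2$;

   • for all $x \in P_0$, $w_-(x, P^\beta) = \beta^2 w_-(x, P) + 1$.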
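
The first property of Theorem 3.11 can be sanity-checked numerically in a very restricted setting. The sketch below assumes a toy span program in which $V$ is one-dimensional, so that $A$ is a single row vector `a` and $|\tau\rangle$ is a scalar `t`; in that case $A^\beta$ has two orthogonal rows and its minimum-norm solution has a simple closed form. The function name and the sample inputs are hypothetical, and the check says nothing about the witness-size properties.

```python
import math

def scaled_program_w0_norm_sq(a, t, beta):
    """||(A^beta)^+ |tau^beta>||^2 for a toy span program with one-dimensional V:
    A is the row vector a, |tau> is the scalar t (both hypothetical inputs)."""
    a_norm_sq = sum(ai * ai for ai in a)
    N = t * t / a_norm_sq                    # N = ||A^+ |tau>||^2
    c = math.sqrt(beta * beta + N) / beta    # coefficient of |1^><1^| in A^beta
    # Columns are ordered as: basis of H, then |0^>, then |1^>.
    row1 = [beta * ai for ai in a] + [t, 0.0]
    row2 = [0.0] * len(a) + [0.0, c]
    target = [t, 1.0]                        # |tau^beta> = |tau> + |1^>
    # The two rows are orthogonal, so the minimum-norm solution is
    # the sum over rows of (target entry / ||row||^2) * row.
    w0 = [0.0] * len(row1)
    for row, tgt in ((row1, target[0]), (row2, target[1])):
        scale = tgt / sum(r * r for r in row)
        w0 = [w + scale * r for w, r in zip(w0, row)]
    return sum(w * w for w in w0)

results = {beta: scaled_program_w0_norm_sq([1.0, 2.0, -0.5], 3.0, beta)
           for beta in (0.1, 1.0, 7.5)}
print(results)
```

In this toy setting the squared norm works out to $\frac{N}{\beta^2 + N} + \frac{\beta^2}{\beta^2 + N} = 1$ for every $\beta$, which is what the printed values should reflect up to rounding.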

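Corollary 3.12. Let $P$ be a span program on $\{0,1\}^n$, and $P^\beta$ be defined as above for $\beta = \frac{1}{\sqrt{W_-(P)}}$. If $P$
$\kappa$-approximates $f$, then $P^\beta$ $\sqrt{\kappa}$-approximates $f$, with $W_-(P^\beta) \le 2$, $\widehat{W}_+(P^\beta) \le W_-(P)\widehat{W}_+(P) + 2$ and
$s(P^\beta) \le s(P) + 2$.

Proof. First note that by Theorem 3.11, $W_-(P^\beta) \le 2$. Let $|w\rangle$ be an approximate positive witness
for $x$ in $P$, with $\|\Pi_{H(x)^\perp}|w\rangle\|^2 \le \frac{\kappa}{W_-(P)}$ and $\||w\rangle\|^2 \le \widehat{W}_+(P)$. Define

\[ |w'\rangle = \frac{1}{\beta(1+\kappa)}|w\rangle + \frac{\beta}{\sqrt{\beta^2 + N}}|\hat{1}\rangle + \frac{\kappa}{1+\kappa}|\hat{0}\rangle. \]

One can check that $A^\beta|w'\rangle = |\tau^\beta\rangle$.

\[ \begin{aligned} \|\Pi_{H^\beta(x)^\perp}|w'\rangle\|^2 &= \frac{1}{\beta^2(1+\kappa)^2}\|\Pi_{H(x)^\perp}|w\rangle\|^2 + \frac{\kappa^2}{(1+\kappa)^2} \le \frac{1}{\beta^2(1+\kappa)^2}\frac{\kappa}{W_-(P)} + \frac{\kappa^2}{(1+\kappa)^2} \\ &= \frac{\kappa + \kappa^2}{(1+\kappa)^2} \le \frac{2\kappa(1+\kappa)}{W_-(P^\beta)(1+\kappa)^2} = \frac{1}{W_-(P^\beta)}\frac{2\kappa}{1+\kappa} \le \frac{\sqrt{\kappa}}{W_-(P^\beta)}, \end{aligned} \]

where we have used $W_-(P^\beta) \le 2$. We upper bound $\widehat{W}_+(P^\beta)$ by noting that:

\[ \begin{aligned} \||w'\rangle\|^2 &\le \frac{1}{\beta^2(1+\kappa)^2}\widehat{W}_+(P) + \frac{\beta^2}{\beta^2 + N} + \frac{\kappa^2}{(1+\kappa)^2} \\ &\le W_-(P)\widehat{W}_+(P) + 2. \end{aligned} \]

Finally, $s(P^\beta) = s(P) + 2$ because of the two extra degrees of freedom $|\hat{0}\rangle$ and $|\hat{1}\rangle$. □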

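Proof of Claim 3.8. We will first show how, given a span program $P$ such that $\||w_0\rangle\|^2 \le 1$, and $P$
$\kappa$-approximates $f$, we can get a span program $P'$ such that $\||w'_0\rangle\|^2 \le 1$, $W_-(P') \le W_-(P)^2$, $P'$
$\kappa^2$-approximates $f$, $\widehat{W}_+(P') \le 4\widehat{W}_+(P)$, and $s(P') = s(P)^2$.
    Define $P'$ as follows, where $S$ is a swap operator, which acts as $S(|u\rangle|v\rangle) = |v\rangle|u\rangle$ for all
$|u\rangle, |v\rangle \in H$:

\[ H'_{j,b} = H_{j,b} \otimes H, \qquad A' = (A \otimes A)\left(\frac{I_{H \otimes H} + S}{2}\right), \qquad |\tau'\rangle = |\tau\rangle|\tau\rangle. \]

Observe that for any $|u\rangle, |v\rangle \in H$, we have

\[ A'(|u\rangle|v\rangle - |v\rangle|u\rangle) = 0, \quad\text{and}\quad A'|u\rangle|u\rangle = A|u\rangle \otimes A|u\rangle. \]

Note that $A'(|w_0\rangle|w_0\rangle) = |\tau'\rangle$, so $\|A'^+|\tau'\rangle\| \le \||w_0\rangle|w_0\rangle\| \le 1$.
   If $\langle\omega|$ is a negative witness for $x$ in $P$, it is easily verified that $\langle\omega'| = \langle\omega| \otimes \langle\omega|$ is a negative
witness in $P'$, and

\[ \|\langle\omega'|A'\|^2 = \left\| \frac{1}{2}(\langle\omega|A) \otimes (\langle\omega|A) + \frac{1}{2}(\langle\omega|A) \otimes (\langle\omega|A) \right\|^2 = \|\langle\omega|A\|^4, \]

so $w_-(x, P') \le w_-(x, P)^2$, and $W_-(P') \le W_-(P)^2$.
   If $|w\rangle$ is an approximate positive witness for $x$ in $P$, then define

\[ |w'\rangle = |w\rangle|w\rangle - \Pi_{H(x)^\perp}|w\rangle\Pi_{H(x)}|w\rangle + \Pi_{H(x)}|w\rangle\Pi_{H(x)^\perp}|w\rangle - \Pi_{H(x)}|w\rangle\Pi_{\ker(A)}|w\rangle. \]

We have

\[ A'|w'\rangle = A|w\rangle A|w\rangle - \frac{1}{2}\left( A\Pi_{H(x)}|w\rangle \otimes A\Pi_{\ker(A)}|w\rangle + A\Pi_{\ker(A)}|w\rangle \otimes A\Pi_{H(x)}|w\rangle \right) = |\tau\rangle|\tau\rangle = |\tau'\rangle. \]

We can bound the error as:

\[ \begin{aligned} \|\Pi_{H'(x)^\perp}|w'\rangle\|^2 &= \|(\Pi_{H(x)^\perp} \otimes I)|w'\rangle\|^2 = \|\Pi_{H(x)^\perp}|w\rangle|w\rangle - \Pi_{H(x)^\perp}|w\rangle\Pi_{H(x)}|w\rangle\|^2 \\ &= \|\Pi_{H(x)^\perp}|w\rangle\Pi_{H(x)^\perp}|w\rangle\|^2 \le \frac{\kappa^2}{W_-(P)^2} \le \frac{\kappa^2}{W_-(P')}. \end{aligned} \]

   Next, observe that

\[ \begin{aligned} &(\Pi_{H(x)} + \Pi_{H(x)^\perp}) \otimes (\Pi_{H(x)} + \Pi_{H(x)^\perp}) - \Pi_{H(x)^\perp} \otimes \Pi_{H(x)} + \Pi_{H(x)} \otimes \Pi_{H(x)^\perp} \\ &\quad= \Pi_{H(x)} \otimes \Pi_{H(x)} + \Pi_{H(x)} \otimes \Pi_{H(x)^\perp} + \Pi_{H(x)^\perp} \otimes \Pi_{H(x)^\perp} + \Pi_{H(x)} \otimes \Pi_{H(x)^\perp} \\ &\quad= \Pi_{H(x)} \otimes I + I \otimes \Pi_{H(x)^\perp} \end{aligned} \]

so $|w'\rangle = \Pi_{H(x)}|w\rangle \otimes |w\rangle + |w\rangle \otimes \Pi_{H(x)^\perp}|w\rangle - \Pi_{H(x)}|w\rangle \otimes \Pi_{\ker(A)}|w\rangle$.

Thus, using the assumption $\||w_0\rangle\| \le 1$, and the fact that $\Pi_{\mathrm{row}(A)}|w\rangle = |w_0\rangle$:

\[ \begin{aligned} \||w'\rangle\|^2 &= \left\| \Pi_{H(x)}|w\rangle|w\rangle + |w\rangle\Pi_{H(x)^\perp}|w\rangle - \Pi_{H(x)}|w\rangle\Pi_{\ker(A)}|w\rangle \right\|^2 \\ &= \left\| \Pi_{H(x)}|w\rangle\Pi_{\mathrm{row}(A)}|w\rangle + |w\rangle\Pi_{H(x)^\perp}|w\rangle \right\|^2 \\ &= \left\| \Pi_{H(x)}|w\rangle|w_0\rangle \right\|^2 + \left\| |w\rangle\Pi_{H(x)^\perp}|w\rangle \right\|^2 + 2\left\| \Pi_{H(x)}|w\rangle \right\|^2 \langle w_0|\Pi_{H(x)^\perp}|w\rangle \\ &\le \widehat{W}_+(P) + \widehat{W}_+(P)\frac{\kappa}{W_-(P)} + 2\widehat{W}_+(P)\sqrt{\frac{\kappa}{W_-(P)}} \le (1 + \kappa + 2\sqrt{\kappa})\widehat{W}_+(P). \end{aligned} \]

Note that we could assume that $W_-(P) \ge 1$ because $\||w_0\rangle\| \le 1$.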
    We complete the proof by extending to the general case. Let $P$ be any span program that
$\kappa$-approximates $f$. By applying Theorem 3.11 and Corollary 3.12, we can get a span program, $P_0$,
with $\||w_0\rangle\| = 1$, $W_-(P_0) \le 2$, $\widehat{W}_+(P_0) \le C(P)^2 + 2$, and $s(P_0) = s(P) + 2$, that $\sqrt{\kappa}$-approximates $f$.
We can then apply the construction described above, iteratively, $d$ times, to get a span program
$P_d$ that $\sqrt{\kappa}^{\,2^d} = \kappa^{2^{d-1}}$-approximates $f$, with

\[ s(P_d) = s(P_0)^{2^d} = (s(P) + 2)^{2^d}, \]

\[ W_-(P_d) \le 2^{2^d}, \quad\text{and}\quad \widehat{W}_+(P_d) \le 4^d \widehat{W}_+(P_0) \le 4^d C(P)^2 + 2 \cdot 4^d. \]

Setting $d = \log\left(\frac{\log(1/\kappa')}{\log(1/\kappa)}\right) + 1$ gives the desired $\kappa'$. □
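
The parameter bookkeeping at the end of the proof can be sketched in a few lines. The helpers below (hypothetical names) pick the smallest integer $d \ge 1$ with $\kappa^{2^{d-1}} \le \kappa'$, which is an assumption about how to round $d$ when the ratio of logarithms is not a power of two, and then evaluate $s(P_d) = (s(P) + 2)^{2^d}$. Exact rational arithmetic avoids floating-point edge cases.

```python
from fractions import Fraction

def squaring_rounds(kappa: Fraction, kappa_target: Fraction) -> int:
    """Smallest d >= 1 with kappa^(2^(d-1)) <= kappa_target, i.e. the number of
    tensor-squaring rounds applied to a sqrt(kappa)-approximating program."""
    d = 1
    while kappa ** (2 ** (d - 1)) > kappa_target:
        d += 1
    return d

def size_after_rounds(size: int, d: int) -> int:
    """s(P_d) = (s(P) + 2)^(2^d)."""
    return (size + 2) ** (2 ** d)

kappa, kappa_target = Fraction(1, 4), Fraction(1, 256)
d = squaring_rounds(kappa, kappa_target)
print(d, kappa ** (2 ** (d - 1)), size_after_rounds(3, d))
```

For $\kappa = 1/4$ and $\kappa' = 1/256$ the ratio $\log(1/\kappa')/\log(1/\kappa)$ is 4, so the exponent $2^d$ equals $2 \cdot 4 = 8$, in agreement with (3.1); only the logarithm of the size grows, and only by a constant factor.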

3.2   From span programs to quantum algorithms
In this section, we will prove Theorem 3.1, which states that if a span program approximately
decides a function 𝑓 , then we can compile it to a quantum algorithm for 𝑓 . While we hope that
Theorem 3.1 will have applications in designing span program algorithms, its only relevance
to the contents of this paper is its implications with respect to the tightness of the first lower
bound expression in Theorem 4.1, and so this section can be safely skipped.
    Theorem 3.1 is similar to [15, Lemma 3.6]; the difference here is that we let an approximate
positive witness for $x$ be any witness with error, $\|\Pi_{H(x)^\perp}|w\rangle\|^2$, at most $\kappa/W_-$, whereas in [15], it
is required to have error as small as possible. This relaxation could potentially decrease the
positive complexity $\widehat{W}_+$, since we now have more freedom in selecting positive witnesses, but
more importantly, it makes it easier to analyze a span program, because we need not find the
approximate positive witness with the smallest possible error. Importantly, this change in how
we define a span program that approximates 𝑓 does not change the most important property of
such a span program: that it can be compiled into a quantum algorithm for 𝑓 . To show this,
we now modify the proof of [15, Lemma 3.6] to fit the new definition. We will restrict to span
programs on binary strings $\{0,1\}^n$, but the proof also works for span programs on $[q]^n$ for $q > 2$.

Proof of Theorem 3.1. For a span program $P$ on $\{0,1\}^n$ and $x \in \{0,1\}^n$, define

\[ U(P, x) = (2\Pi_{\ker(A)} - I)(2\Pi_{H(x)} - I), \]

which acts on $H$. To prove Theorem 3.1, we will show that by performing phase estimation of
$U(P, x)$ on initial state $|w_0\rangle = A^+|\tau\rangle$, and estimating the amplitude on having $|0\rangle$ in the phase
register, we can distinguish 1- and 0-inputs of $f$ with bounded error.
    By Corollary 3.12 and Claim 3.8, we can assume without loss of generality that $P$ has been
scaled so that it $\kappa$-approximates $f$ for some $\kappa < 1/4$, $|w_0\rangle = A^+|\tau\rangle$ is a unit vector, and $W_- \le 2$.
The scaled span program still has size $K^{O(1)}$ and complexity $O(C)$.
    We first modify the proof of [15, Lemma 3.2] to get the following lemma:

Lemma 3.13. Let 𝑃 be a span program that 𝜅-approximates 𝑓 , with k|𝑤0 ik 2 = 1. Fix any Θ ∈ (0, 𝜋),
and let ΠΘ be the projector onto the e𝑖𝜃 -eigenspaces of 𝑈(𝑃, 𝑥) with |𝜃| ≤ Θ. For any 𝑥 ∈ 𝑓 −1 (1),

                                                                    4𝜅
                                        kΠΘ |𝑤 0 ik 2 ≤ Θ 2𝑊
                                                           b+ +        .
                                                                    𝑊−

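Proof. Suppose $x \in f^{-1}(1)$ and let $|\hat{w}_x\rangle$ be an approximate positive witness with
$\|\Pi_{H(x)^\perp}|\hat{w}_x\rangle\|^2 \le \frac{\kappa}{W_-}$ and $\||\hat{w}_x\rangle\|^2 \le \widehat{W}_+$.
Note that since $A|\hat{w}_x\rangle = |\tau\rangle$,
$\Pi_{\mathrm{row}(A)}|\hat{w}_x\rangle = A^+A|\hat{w}_x\rangle = A^+|\tau\rangle = |w_0\rangle$, so

\[
  \Pi_{\mathrm{row}(A)}\Pi_{H(x)}|\hat{w}_x\rangle + \Pi_{\mathrm{row}(A)}\Pi_{H(x)^\perp}|\hat{w}_x\rangle = |w_0\rangle.
\]

Since $\Pi_{H(x)^\perp}\Pi_{H(x)}|\hat{w}_x\rangle = 0$, we have, by the effective spectral gap lemma (Lemma 2.1):

\begin{align*}
  \|\Pi_\Theta \Pi_{\mathrm{row}(A)}\Pi_{H(x)}|\hat{w}_x\rangle\|^2
    &\le \frac{\Theta^2}{4}\,\|\Pi_{H(x)}|\hat{w}_x\rangle\|^2 \\
  \|\Pi_\Theta(|w_0\rangle - \Pi_{\mathrm{row}(A)}\Pi_{H(x)^\perp}|\hat{w}_x\rangle)\|^2
    &\le \frac{\Theta^2}{4}\,\||\hat{w}_x\rangle\|^2 \\
  \|\Pi_\Theta|w_0\rangle\|^2 + \|\Pi_\Theta\Pi_{\mathrm{row}(A)}\Pi_{H(x)^\perp}|\hat{w}_x\rangle\|^2
    - 2\langle w_0|\Pi_\Theta\Pi_{\mathrm{row}(A)}\Pi_{H(x)^\perp}|\hat{w}_x\rangle
    &\le \frac{\Theta^2}{4}\,\widehat{W}_+ \\
  \|\Pi_\Theta|w_0\rangle\|^2 - 2\,\|\Pi_\Theta|w_0\rangle\|\,\|\Pi_{H(x)^\perp}|\hat{w}_x\rangle\|
    &\le \frac{\Theta^2}{4}\,\widehat{W}_+ \\
  \|\Pi_\Theta|w_0\rangle\|^2 - 2\,\|\Pi_\Theta|w_0\rangle\|\sqrt{\frac{\kappa}{W_-}}
    &\le \frac{\Theta^2}{4}\,\widehat{W}_+ .
\end{align*}

This is satisfied only when

\begin{align*}
  \|\Pi_\Theta|w_0\rangle\| &\le \sqrt{\frac{\kappa}{W_-}} + \sqrt{\frac{\kappa}{W_-} + \frac{\Theta^2}{4}\widehat{W}_+}
    \le 2\sqrt{\frac{\Theta^2}{4}\widehat{W}_+ + \frac{\kappa}{W_-}} \\
  \|\Pi_\Theta|w_0\rangle\|^2 &\le \Theta^2\widehat{W}_+ + \frac{4\kappa}{W_-}. \qquad\Box
\end{align*}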
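
The last step of the proof above solves a quadratic inequality. As a sketch of that step, write $y = \|\Pi_\Theta|w_0\rangle\|$, $a = \sqrt{\kappa/W_-}$ and $b = \frac{\Theta^2}{4}\widehat{W}_+$; the names $y$, $a$, $b$ are shorthand introduced only for this illustration and are not notation from the paper.

```latex
\begin{align*}
  y^2 - 2ay \le b
    &\iff (y - a)^2 \le a^2 + b \\
    &\implies y \le a + \sqrt{a^2 + b} \le 2\sqrt{a^2 + b}
      && \text{(since } a \le \sqrt{a^2 + b}\text{)} \\
    &\implies y^2 \le 4a^2 + 4b = \frac{4\kappa}{W_-} + \Theta^2\widehat{W}_+ .
\end{align*}
```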

    We will let $\Theta^2 = \frac{1-4\kappa}{2\widehat{W}_+ W_-}$. Then when $f(x) = 0$, we have

\[
  \|\Pi_0|w_0\rangle\|^2 = \frac{1}{w_-(x)} \ge \frac{1}{W_-} =: q_0,
\]


by [15, Lemma 3.3]. On the other hand, when $f(x) = 1$, we have

\[
  \|\Pi_\Theta|w_0\rangle\|^2 \le \Theta^2\widehat{W}_+ + 4\frac{\kappa}{W_-}
    = \frac{1-4\kappa}{2W_-} + \frac{4\kappa}{W_-} = \frac{1+4\kappa}{2W_-} =: q_1.
\]

We want to distinguish these two cases using $1/\Theta$ steps of phase estimation, and then estimating
the amplitude on having an estimate of 0 in the phase register to precision:

\[
  \Delta = \frac{q_0 - q_1}{2} = \frac{1-4\kappa}{4W_-}.
\]

This will allow us to distinguish between amplitude $\ge q_0$ and amplitude $\le q_1$. Since $\kappa < \frac{1}{4}$
is a constant, $\Delta = \Omega(1/W_-)$, and thus we use $O(1/\Delta) = O(W_-) = O(1)$ (recall that we are
assuming the span program has been scaled) calls to phase estimation, each of which requires
$O(1/\Theta) = O\bigl(\sqrt{\widehat{W}_+ W_-}\bigr) = O(C)$ controlled calls to $U$ (for more details, see the
nearly identical proof of [15, Lemma 3.2]). Since $U(P, x)$ can be implemented at the cost of one query,
the query complexity of this algorithm is $O(C)$.
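
The parameter choices above are simple enough to check with exact arithmetic. The following is a minimal sketch, not part of the paper: the function name `phase_estimation_parameters` and the sample values ($\kappa = 1/8$, $W_- = 1$, $\widehat{W}_+ = 16$) are assumptions chosen only for illustration.

```python
from fractions import Fraction

def phase_estimation_parameters(kappa, w_minus, w_plus_hat):
    """Return (Theta^2, q0, q1, Delta) for Theta^2 = (1 - 4k) / (2 * W+ * W-)."""
    kappa, w_minus, w_plus_hat = map(Fraction, (kappa, w_minus, w_plus_hat))
    theta_sq = (1 - 4 * kappa) / (2 * w_plus_hat * w_minus)
    q0 = 1 / w_minus                                   # lower bound when f(x) = 0
    q1 = theta_sq * w_plus_hat + 4 * kappa / w_minus   # upper bound when f(x) = 1
    delta = (q0 - q1) / 2                              # amplitude estimation precision
    return theta_sq, q0, q1, delta

# Hypothetical sample values: kappa = 1/8, scaled W_- = 1, hat{W}_+ = 16.
theta_sq, q0, q1, delta = phase_estimation_parameters(Fraction(1, 8), 1, 16)
print(theta_sq, q0, q1, delta)   # 1/64 1 3/4 1/8
```

With these values $q_1 = (1+4\kappa)/(2W_-) = 3/4$ and $\Delta = (1-4\kappa)/(4W_-) = 1/8$, matching the closed forms in the text.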
    The algorithm needs a single register of dimension $\dim H = K^{O(1)}$ to apply $U(P, x)$, $O(1)$
registers of dimension $1/\Theta$ to act as phase registers in phase estimation, and $O(1)$ registers
of dimension $O(1/\Delta)$ to act as phase registers in the amplitude estimation, for a total space
requirement of

\[
  \log\dim H + O\Bigl(\log\frac{1}{\Delta}\Bigr) + O\Bigl(\log\frac{1}{\Theta}\Bigr) = O(\log K) + O(\log C).
\]

To complete the proof, we note that the algorithm is unitary, since it consists of phase estimation,
composed unitarily with amplitude estimation. □

3.3   From quantum algorithms to span programs
In this section, we will show how to turn a unitary quantum algorithm into a span program,
proving Theorem 3.2, which implies Theorem 1.1. The construction we use to prove Theorem 3.2
is based on a construction of Reichardt for turning any one-sided error quantum algorithm
into a span program whose complexity matches the algorithm’s query complexity [27, arXiv
version]. We observe that a similar construction also works for two-sided error algorithms,9 but
the resulting span program only approximately decides 𝑓 .

The algorithm Fix a function $f : D \to \{0, 1\}$ for $D \subseteq \{0, 1\}^n$, and a unitary quantum algorithm
$\mathcal{A}$ such that on input $x \in f^{-1}(0)$, $\Pr[\mathcal{A}(x) = 1] \le \frac{1}{3}$, and on input
$x \in f^{-1}(1)$, $\Pr[\mathcal{A}(x) = 1] \ge 1 - \varepsilon$, for $\varepsilon \in \{0, \frac{1}{3}\}$, depending on
whether we want to consider a one-sided error or a bounded error algorithm. Let
$p_0(x) = \Pr[\mathcal{A}(x) = 0]$, so if $f(x) = 0$, $p_0(x) \ge 2/3$, and if $f(x) = 1$, $p_0(x) \le \varepsilon$.
  9A preliminary version of this result appeared in [16], but there was an error in the proof, which is fixed by our
new definition of approximate span programs.



     We can suppose $\mathcal{A}$ acts on three registers: a query register
$\mathrm{span}\{|j\rangle : j \in [n] \cup \{0\}\}$; a workspace register $\mathrm{span}\{|z\rangle : z \in \mathcal{Z}\}$
for some finite set of symbols $\mathcal{Z}$ that contains 0; and an answer register
$\mathrm{span}\{|a\rangle : a \in \{0, 1\}\}$. The query operator $\mathcal{O}_x$ acts on the query register as
$\mathcal{O}_x|j\rangle = (-1)^{x_j}|j\rangle$ if $j \ge 1$, and $\mathcal{O}_x|0\rangle = |0\rangle$. If $\mathcal{A}$
makes $T$ queries, the final state of $\mathcal{A}$ is:

\[
  |\Psi_{2T+1}(x)\rangle = U_{2T+1}\mathcal{O}_x U_{2T-1} \cdots U_3 \mathcal{O}_x U_1 |0, 0, 0\rangle
\]

for some unitaries $U_{2T+1}, \dots, U_1$. The output bit of the algorithm, $\mathcal{A}(x)$, is obtained by
measuring the answer register of $|\Psi_{2T+1}(x)\rangle$. We have given the input-independent unitaries odd
indices so that we may refer to the $t$-th query as $U_{2t}$ (that is, $U_{2t} = \mathcal{O}_x$).
    Let $|\Psi_0(x)\rangle = |\Psi_0\rangle = |0, 0, 0\rangle$ denote the starting state, and for
$t \in \{1, \dots, 2T+1\}$, let $|\Psi_t(x)\rangle = U_t \cdots U_1|\Psi_0\rangle$ denote the state after $t$ steps.

The span program We now define a span program 𝑃𝒜 from 𝒜. The space 𝐻 will represent
all three registers of the algorithm, with an additional time counter register, and an additional
register to represent a query value 𝑏.

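\[
  H = \mathrm{span}\{|t, b, j, z, a\rangle : t \in \{0, \dots, 2T+1\},\ b \in \{0, 1\},\ j \in [n] \cup \{0\},\ z \in \mathcal{Z},\ a \in \{0, 1\}\}.
\]

We define $V$ and $A$ as follows, where $c$ is some constant to be chosen later:

\[
  V = \mathrm{span}\{|t, j, z, a\rangle : t \in \{0, \dots, 2T+1\},\ j \in [n] \cup \{0\},\ z \in \mathcal{Z},\ a \in \{0, 1\}\}
\]

\[
  A|t, b, j, z, a\rangle =
  \begin{cases}
    |t, j, z, a\rangle - |t+1\rangle U_{t+1}|j, z, a\rangle & \text{if } t \in \{0, \dots, 2T\} \text{ is even} \\
    |t, j, z, a\rangle - (-1)^b|t+1, j, z, a\rangle & \text{if } t \in \{0, \dots, 2T\} \text{ is odd (i.e., } U_{t+1} = \mathcal{O}_x\text{)} \\
    |t, j, z, a\rangle & \text{if } t = 2T+1,\ a = 1, \text{ and } b = 0 \\
    \sqrt{cT}\,|t, j, z, a\rangle & \text{if } t = 2T+1,\ a = 0, \text{ and } b = 0 \\
    0 & \text{if } t = 2T+1 \text{ and } b = 1.
  \end{cases}
\]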
For $t \le 2T$, $A|t, b, j, z, a\rangle$ should be intuitively understood as applying $U_{t+1}$ to $|j, z, a\rangle$, and
incrementing the counter register from $|t\rangle$ to $|t+1\rangle$. When $t$ is even, this correspondence is
clear (in that case, the value of $b$ is ignored). When $t$ is odd, so $U_{t+1} = \mathcal{O}_x$, then as long as
$b = x_j$, $(-1)^b|t+1, j, z, a\rangle = |t+1\rangle U_{t+1}|j, z, a\rangle$. We thus define

\[
  H_{j,b} = \mathrm{span}\{|t, b, j, z, a\rangle : t \in \{0, \dots, 2T\} \text{ is odd},\ z \in \mathcal{Z},\ a \in \{0, 1\}\}.
\]

For even $t$, applying $U_{t+1}$ is independent of the input, so we make the corresponding states
available to every input; along with states where the query register is set to $j = 0$, meaning $\mathcal{O}_x$
acts input-independently; and accepting states, whose answer register is set to 1 at time $2T+1$:

\begin{align*}
  H_{\mathrm{true}} ={}& \mathrm{span}\{|t, b, j, z, a\rangle : t \in \{0, \dots, 2T\} \text{ is even},\ b \in \{0, 1\},\ j \in [n],\ z \in \mathcal{Z},\ a \in \{0, 1\}\} \\
  &\oplus \mathrm{span}\{|t, b, 0, z, a\rangle : t \in \{0, \dots, 2T\},\ b \in \{0, 1\},\ z \in \mathcal{Z},\ a \in \{0, 1\}\} \\
  &\oplus \mathrm{span}\{|2T+1, b, j, z, 1\rangle : b \in \{0, 1\},\ j \in [n] \cup \{0\},\ z \in \mathcal{Z}\}.
\end{align*}



The remaining part of $H$ will be assigned to $H_{\mathrm{false}}$:

\[
  H_{\mathrm{false}} = \mathrm{span}\{|2T+1, b, j, z, 0\rangle : b \in \{0, 1\},\ j \in [n] \cup \{0\},\ z \in \mathcal{Z}\}.
\]

Note that in defining $A$, we have put a large factor of $\sqrt{cT}$ in front of $A|2T+1, 0, j, z, 0\rangle$, making
the vectors in $H_{\mathrm{false}}$ very “cheap” to use. These vectors are never in $H(x)$, but will be used as
the error part of approximate positive witnesses, and the $\sqrt{cT}$ ensures they only contribute
relatively small error.
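
The subspaces $H_{j,b}$, $H_{\mathrm{true}}$ and $H_{\mathrm{false}}$ defined above partition the basis of $H$, which can be checked by enumeration. The following is a small sketch that follows the definitions exactly as written; the function names and the toy sizes ($T = 1$, $n = 2$, a one-symbol workspace) are assumptions made for illustration only.

```python
from collections import Counter
from itertools import product

def classify(t, b, j, z, a, T):
    """Name the subspace that contains the basis state |t, b, j, z, a>."""
    if t == 2 * T + 1:
        return "true" if a == 1 else "false"   # accepting / rejecting final states
    if j == 0 or t % 2 == 0:
        return "true"                           # input-independent steps
    return ("query", j, b)                      # H_{j,b}: odd t and j in [n]

def subspace_dimensions(T, n, workspace):
    """Count basis states of H per subspace for a T-query algorithm on n bits."""
    counts = Counter()
    for t, b, j, z, a in product(range(2 * T + 2), (0, 1), range(n + 1), workspace, (0, 1)):
        counts[classify(t, b, j, z, a, T)] += 1
    return counts

dims = subspace_dimensions(T=1, n=2, workspace=[0])
print(dims["true"], dims["false"], dims[("query", 1, 0)], sum(dims.values()))  # 34 6 2 48
```

The total, 48, equals $(2T+2) \cdot 2 \cdot (n+1) \cdot |\mathcal{Z}| \cdot 2$, the dimension of $H$ for these toy sizes.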
    Finally, we define:

\[
  |\tau\rangle = |0, 0, 0, 0\rangle = |0\rangle|\Psi_0\rangle.
\]

Intuitively, we can construct $|\tau\rangle$, the initial state, using a final state that has 1 in the answer
register, and using the transitions $|t, j, z, a\rangle - |t+1\rangle U_{t+1}|j, z, a\rangle$ to move from the final
state to the initial state. In the following analysis, we make this idea precise.


Analysis of 𝑃𝒜 We will first show that for every 𝑥 there is an approximate positive witness
with error depending on its probability of being rejected by 𝒜, 𝑝0 (𝑥).

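Lemma 3.14. For any $x \in \{0, 1\}^n$, there exists an approximate positive witness $|w\rangle$ for $x$ in
$P_{\mathcal{A}}$ such that:

\[
  \||w\rangle\|^2 \le 2T + 2, \quad\text{and}\quad \|\Pi_{H(x)^\perp}|w\rangle\|^2 \le \frac{p_0(x)}{cT}.
\]

In particular, if $f(x) = 1$,

\[
  \|\Pi_{H(x)^\perp}|w\rangle\|^2 \le \frac{\varepsilon}{cT}.
\]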

Proof. Let $Q_x$ be the linear isometry that acts as

\[
  Q_x|j, z, a\rangle = |x_j, j, z, a\rangle \qquad \forall j \in [n] \cup \{0\},\ z \in \mathcal{Z},\ a \in \{0, 1\},
\]

where we interpret $x_0$ as 0. Note that for all $|j, z, a\rangle$, and $t \in \{0, \dots, 2T\}$, we have

\[
  A(|t\rangle Q_x|j, z, a\rangle) = |t, j, z, a\rangle - |t+1\rangle U_{t+1}|j, z, a\rangle.
\]

Let $\Pi_a = \sum_{j \in [n] \cup \{0\},\, z \in \mathcal{Z}} |j, z, a\rangle\langle j, z, a|$ be the orthogonal projector onto
states of the algorithm with answer register set to $a$. We will construct a positive witness for $x$
from the states of the algorithm on input $x$, as follows:

\[
  |w\rangle = \sum_{t=0}^{2T} |t\rangle Q_x|\Psi_t(x)\rangle + |2T+1\rangle|0\rangle\Pi_1|\Psi_{2T+1}(x)\rangle
    + \frac{1}{\sqrt{cT}}|2T+1\rangle|0\rangle\Pi_0|\Psi_{2T+1}(x)\rangle.
\]


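To see that this is a positive witness, we compute $A|w\rangle$, using the fact that
$U_{t+1}|\Psi_t(x)\rangle = |\Psi_{t+1}(x)\rangle$.

\begin{align*}
  A|w\rangle &= \sum_{t=0}^{2T} \bigl(|t\rangle|\Psi_t(x)\rangle - |t+1\rangle U_{t+1}|\Psi_t(x)\rangle\bigr)
      + |2T+1\rangle\Pi_1|\Psi_{2T+1}(x)\rangle + |2T+1\rangle\Pi_0|\Psi_{2T+1}(x)\rangle \\
    &= \sum_{t=0}^{2T} |t\rangle|\Psi_t(x)\rangle - \sum_{t=0}^{2T} |t+1\rangle|\Psi_{t+1}(x)\rangle + |2T+1\rangle|\Psi_{2T+1}(x)\rangle \\
    &= \sum_{t=0}^{2T+1} |t\rangle|\Psi_t(x)\rangle - \sum_{t=1}^{2T+1} |t\rangle|\Psi_t(x)\rangle = |0\rangle|\Psi_0(x)\rangle = |\tau\rangle.
\end{align*}

    We next consider the error of $|w\rangle$ for $x$, given by $\|\Pi_{H(x)^\perp}|w\rangle\|^2$. Since
$Q_x|j, z, a\rangle \in H(x)$ for all $j, z, a$, and $|2T+1, 0\rangle\Pi_1|\Psi_{2T+1}(x)\rangle \in H_{\mathrm{true}} \subset H(x)$,
$\Pi_{H(x)^\perp}|w\rangle = \frac{1}{\sqrt{cT}}|2T+1\rangle|0\rangle\Pi_0|\Psi_{2T+1}(x)\rangle$, so

\[
  \|\Pi_{H(x)^\perp}|w\rangle\|^2 = \frac{1}{cT}\|\Pi_0|\Psi_{2T+1}(x)\rangle\|^2 = \frac{p_0(x)}{cT}.
\]

    Finally, we compute an upper bound on the positive witness complexity of $|w\rangle$.

\begin{align*}
  \||w\rangle\|^2 &= \sum_{t=0}^{2T} \|Q_x|\Psi_t(x)\rangle\|^2 + \|\Pi_1|\Psi_{2T+1}(x)\rangle\|^2
      + \frac{1}{cT}\|\Pi_0|\Psi_{2T+1}(x)\rangle\|^2 \\
    &\le \sum_{t=0}^{2T} \||\Psi_t(x)\rangle\|^2 + \||\Psi_{2T+1}(x)\rangle\|^2 = 2T + 2. \qquad\Box
\end{align*}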
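
The witness construction in this proof can be checked numerically on a toy instance. The sketch below is an illustration under stated assumptions, not the paper's code: it uses a hypothetical one-query algorithm on a single input bit ($n = 1$, $T = 1$, trivial workspace, so the $z$ register is dropped) that outputs $x_1$ exactly, with $U_1$ a Hadamard on the query register and $U_3$ a Hadamard followed by copying the query register into the answer register. All function names are invented for the example.

```python
import math
from collections import defaultdict

T, C = 1, 5                      # one query; c = 5 as in Corollary 3.16
S = 1 / math.sqrt(2)

def u1(state):
    """Hadamard on the query register; states are {(j, a): amplitude}."""
    out = defaultdict(float)
    for (j, a), amp in state.items():
        out[(0, a)] += S * amp
        out[(1, a)] += S * amp * (-1 if j == 1 else 1)
    return dict(out)

def u3(state):
    """Hadamard on the query register, then copy it into the answer register."""
    return {(j, a ^ j): amp for (j, a), amp in u1(state).items()}

def oracle(x, state):
    """O_x|j> = (-1)^{x_j}|j> for j >= 1, and O_x|0> = |0>."""
    return {(j, a): amp * (-1 if j >= 1 and x[j - 1] == 1 else 1)
            for (j, a), amp in state.items()}

UNITARIES = {1: u1, 3: u3}       # input-independent steps carry odd indices

def states(x):
    """The list |Psi_0>, ..., |Psi_{2T+1}(x)>."""
    psi = [{(0, 0): 1.0}]
    for t in range(1, 2 * T + 2):
        psi.append(UNITARIES[t](psi[-1]) if t % 2 else oracle(x, psi[-1]))
    return psi

def positive_witness(x):
    """The vector |w> from the proof, as {(t, b, j, a): amplitude}."""
    psi, w = states(x), {}
    for t in range(2 * T + 1):
        for (j, a), amp in psi[t].items():
            w[(t, x[j - 1] if j >= 1 else 0, j, a)] = amp   # Q_x, with x_0 = 0
    for (j, a), amp in psi[2 * T + 1].items():
        w[(2 * T + 1, 0, j, a)] = amp if a == 1 else amp / math.sqrt(C * T)
    return w

def apply_A(w):
    """Apply the span program map A to a vector of H; the result lives in V."""
    out = defaultdict(float)
    for (t, b, j, a), amp in w.items():
        if t <= 2 * T:
            out[(t, j, a)] += amp
            if t % 2 == 0:       # even t: subtract |t+1> U_{t+1} |j, a>
                for (j2, a2), u in UNITARIES[t + 1]({(j, a): 1.0}).items():
                    out[(t + 1, j2, a2)] -= amp * u
            else:                # odd t: the query is simulated by the bit b
                out[(t + 1, j, a)] -= (-1) ** b * amp
        elif b == 0:
            out[(t, j, a)] += amp * (1.0 if a == 1 else math.sqrt(C * T))
    return {k: v for k, v in out.items() if abs(v) > 1e-9}

for x in [(0,), (1,)]:
    w = positive_witness(x)
    norm_sq = sum(v * v for v in w.values())
    error = sum(v * v for (t, b, j, a), v in w.items() if t == 2 * T + 1 and a == 0)
    print(x, apply_A(w), round(norm_sq, 6), round(error, 6))
```

For both inputs the image $A|w\rangle$ should have a single nonzero entry, at $(t, j, a) = (0, 0, 0)$, which is $|\tau\rangle$ with the workspace register dropped. The squared norm stays at most $2T + 2 = 4$, and the error equals $p_0(x)/(cT)$, that is, $1/5$ for the rejected input $x = 0$ and $0$ for the accepted input $x = 1$.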

    Next, we compute an upper bound on $w_-(x)$ whenever $f(x) = 0$.
Lemma 3.15. For any $x$ that is rejected by $\mathcal{A}$ with probability $p_0(x) > 0$,

\[
  w_-(x) \le \frac{(c+4)T}{p_0(x)}.
\]

In particular, if $f(x) = 0$, $w_-(x) \le \frac{c+4}{2/3}T$, so $W_- \le \frac{c+4}{2/3}T$.

Proof. We will define a negative witness for $x$ as follows. First, define

\[
  |\Psi'_{2T+1}(x)\rangle = \Pi_0|\Psi_{2T+1}(x)\rangle,
\]

the rejecting part of the final state. This is non-zero whenever $p_0(x) > 0$. Then for
$t \in \{0, \dots, 2T\}$, define

\[
  |\Psi'_t(x)\rangle = U_{t+1}^\dagger \cdots U_{2T+1}^\dagger|\Psi'_{2T+1}(x)\rangle.
\]

From this we can define

\[
  \langle\omega| = \sum_{t=0}^{2T+1} \langle t|\langle\Psi'_t(x)|.
\]



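We first observe that

\[
  \langle\omega|\tau\rangle = \langle\Psi'_0(x)|0, 0, 0\rangle = \langle\Psi'_{2T+1}(x)|U_{2T+1} \cdots U_1|0, 0, 0\rangle
    = \langle\Psi'_{2T+1}(x)|\Psi_{2T+1}(x)\rangle = p_0(x).
\]

Thus

\[
  \langle\bar{\omega}| = \frac{1}{p_0(x)}\langle\omega|
\]

is a negative witness. Next, we show that $\langle\omega|A\Pi_{H(x)} = 0$. First, for
$|t, x_j, j, z, a\rangle \in H_{j,x_j}$ (so $t < 2T$ is odd), we have

\begin{align*}
  \langle\omega|A|t, x_j, j, z, a\rangle &= \langle\omega|\bigl(|t, j, z, a\rangle - (-1)^{x_j}|t+1\rangle|j, z, a\rangle\bigr) \\
    &= \langle\Psi'_t(x)|j, z, a\rangle - (-1)^{x_j}\langle\Psi'_{t+1}(x)|j, z, a\rangle \\
    &= \langle\Psi'_{t+1}(x)|U_{t+1}|j, z, a\rangle - (-1)^{x_j}\langle\Psi'_{t+1}(x)|j, z, a\rangle \\
    &= \langle\Psi'_{t+1}(x)|\mathcal{O}_x|j, z, a\rangle - (-1)^{x_j}\langle\Psi'_{t+1}(x)|j, z, a\rangle = 0.
\end{align*}

The same argument holds for $|t, 0, 0, z, a\rangle \in H_{\mathrm{true}}$. Similarly, for any
$|t, b, j, z, a\rangle \in H_{\mathrm{true}}$ with $t \le 2T$ even, we have

\begin{align*}
  \langle\omega|A|t, b, j, z, a\rangle &= \langle\omega|\bigl(|t, j, z, a\rangle - |t+1\rangle U_{t+1}|j, z, a\rangle\bigr) \\
    &= \langle\Psi'_t(x)|j, z, a\rangle - \langle\Psi'_{t+1}(x)|U_{t+1}|j, z, a\rangle = 0.
\end{align*}

Finally, for any $|2T+1, b, j, z, 1\rangle \in H_{\mathrm{true}}$, we have

\[
  \langle\omega|A|2T+1, b, j, z, 1\rangle = \langle\omega|2T+1, j, z, 1\rangle = \langle\Psi'_{2T+1}(x)|j, z, 1\rangle = 0.
\]

Thus $\langle\omega|A\Pi_{H(x)} = 0$ and so $\langle\bar{\omega}|A\Pi_{H(x)} = 0$, and $\langle\bar{\omega}|$ is a negative
witness for $x$ in $P$. To compute its witness complexity, first observe that
$\langle\omega|A = \langle\omega|A\Pi_{H(x)^\perp}$, and

\begin{align*}
  A\Pi_{H(x)^\perp} ={}& \sum_{s=1}^{T} \sum_{\substack{j \in [n] \cup \{0\},\\ z \in \mathcal{Z},\, a \in \{0,1\}}}
      \bigl(|2s-1, j, z, a\rangle + (-1)^{x_j}|2s, j, z, a\rangle\bigr)\langle 2s-1, \bar{x}_j, j, z, a| \\
    &+ \sum_{j \in [n] \cup \{0\},\, z \in \mathcal{Z}} \sqrt{cT}\,|2T+1, j, z, 0\rangle\langle 2T+1, 0, j, z, 0|
\end{align*}

so, using $\langle\Psi'_{2s-1}(x)|j, z, a\rangle = \langle\Psi'_{2s}(x)|U_{2s}|j, z, a\rangle = (-1)^{x_j}\langle\Psi'_{2s}(x)|j, z, a\rangle$,
we have:

\begin{align*}
  \langle\omega|A\Pi_{H(x)^\perp} ={}& \sum_{s=1}^{T} \sum_{\substack{j \in [n] \cup \{0\},\\ z \in \mathcal{Z},\, a \in \{0,1\}}}
      \bigl(\langle\Psi'_{2s-1}(x)|j, z, a\rangle + (-1)^{x_j}\langle\Psi'_{2s}(x)|j, z, a\rangle\bigr)\langle 2s-1, \bar{x}_j, j, z, a| \\
    &+ \sum_{j \in [n] \cup \{0\},\, z \in \mathcal{Z}} \sqrt{cT}\,\langle\Psi'_{2T+1}(x)|j, z, 0\rangle\langle 2T+1, 0, j, z, 0| \\
  ={}& \sum_{s=1}^{T} \sum_{\substack{j \in [n] \cup \{0\},\\ z \in \mathcal{Z},\, a \in \{0,1\}}}
      2(-1)^{x_j}\langle\Psi'_{2s}(x)|j, z, a\rangle\langle 2s-1, \bar{x}_j, j, z, a| \\
    &+ \sum_{j \in [n] \cup \{0\},\, z \in \mathcal{Z}} \sqrt{cT}\,\langle\Psi'_{2T+1}(x)|j, z, 0\rangle\langle 2T+1, 0, j, z, 0|.
\end{align*}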



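Thus, the complexity of $\langle\bar{\omega}|$ is:

\begin{align*}
  \|\langle\bar{\omega}|A\|^2 &= \frac{1}{p_0(x)^2}\|\langle\omega|A\Pi_{H(x)^\perp}\|^2 \\
    &= \frac{1}{p_0(x)^2} \sum_{s=1}^{T} \sum_{\substack{j \in [n] \cup \{0\},\\ z \in \mathcal{Z},\, a \in \{0,1\}}}
        4\,|\langle\Psi'_{2s}(x)|j, z, a\rangle|^2
      + \frac{1}{p_0(x)^2} \sum_{\substack{j \in [n] \cup \{0\},\\ z \in \mathcal{Z}}} cT\,|\langle\Psi'_{2T+1}(x)|j, z, 0\rangle|^2 \\
    &= \frac{4}{p_0(x)^2} \sum_{s=1}^{T} \||\Psi'_{2s}(x)\rangle\|^2 + \frac{cT}{p_0(x)^2}\||\Psi'_{2T+1}(x)\rangle\|^2.
\end{align*}

Because each $U_t$ is unitary, we have $\||\Psi'_{2s}(x)\rangle\|^2 = \||\Psi'_{2T+1}(x)\rangle\|^2 = p_0(x)$, thus:

\[
  \|\langle\bar{\omega}|A\|^2 = \frac{4T}{p_0(x)} + \frac{cT}{p_0(x)} \le \frac{4+c}{2/3}T \quad\text{when } f(x) = 0. \qquad\Box
\]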
    We conclude the proof of Theorem 3.2 with the following corollary, from which Theorem 3.2
follows immediately, by appealing to Claim 3.8 with $\kappa = \frac{9}{10}$ and $\kappa'$ any constant in $(0, 1)$.
Corollary 3.16. Let $c = 5$ in the definition of $P_{\mathcal{A}}$. Then:
   • $s(P_{\mathcal{A}}) = 2^{S+O(1)}$
   • If $\mathcal{A}$ decides $f$ with one-sided error, then $P_{\mathcal{A}}$ decides $f$ with complexity $C \le O(T)$.
   • If $\mathcal{A}$ decides $f$ with bounded error, then $P_{\mathcal{A}}$ $\frac{9}{10}$-approximates $f$ with complexity $C_\kappa \le O(T)$.
Proof. We first compute $s(P_{\mathcal{A}}) = \dim H$ using the fact that the algorithm uses space

\[
  S = \log\dim\mathrm{span}\{|j, z, a\rangle : j \in [n] \cup \{0\},\ z \in \mathcal{Z},\ a \in \{0, 1\}\} + \log T.
\]

We have:

\[
  \dim H = \bigl(\dim\mathrm{span}\{|t, b\rangle : t \in \{0, \dots, 2T+1\},\ b \in \{0, 1\}\}\bigr)\,2^{S - \log T} = 2^{S+O(1)}.
\]

   We prove the third statement, as the second is similar. By Lemma 3.15, using $c = 5$, we have

\[
  W_- \le \frac{5+4}{2/3}T = \frac{27}{2}T.
\]

By Lemma 3.14, we can see that for every $x$ such that $f(x) = 1$, there is an approximate positive
witness $|w\rangle$ for $x$ with error at most

\[
  \frac{\varepsilon}{cT} = \frac{1/3}{5T} \le \frac{1}{15T} \cdot \frac{\frac{27}{2}T}{W_-} = \frac{9}{10} \cdot \frac{1}{W_-}.
\]

Furthermore, $\||w\rangle\|^2 \le 2T + 2$, so $\widehat{W}_+ \le 2T + 2$. Observing
$C_\kappa = \sqrt{W_-\widehat{W}_+} \le \sqrt{27T(T+1)}$ completes the proof. □
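
The constants in this proof can be reproduced with exact rational arithmetic. The following is a minimal sketch; the function name `corollary_bounds` and the sample value $T = 10$ are assumptions made for illustration, and `kappa` below is computed at the upper bound $W_- = \frac{27}{2}T$.

```python
from fractions import Fraction

def corollary_bounds(T, c=5, eps=Fraction(1, 3)):
    """Bounds from Lemmas 3.14 and 3.15 for a T-query bounded-error algorithm."""
    w_minus = Fraction(c + 4) / Fraction(2, 3) * T   # W_- <= (c + 4) T / (2/3)
    w_plus_hat = 2 * T + 2                            # hat{W}_+ <= 2T + 2
    error = eps / (c * T)                             # positive witness error bound
    kappa = error * w_minus                           # error * (upper bound on W_-)
    return w_minus, w_plus_hat, error, kappa

T = 10
w_minus, w_plus_hat, error, kappa = corollary_bounds(T)
print(w_minus, w_plus_hat, error, kappa)              # 135 22 1/150 9/10
print(w_minus * w_plus_hat == 27 * T * (T + 1))       # True
```

The second line confirms that the product of the two bounds is $27T(T+1)$, whose square root is the bound on $C_\kappa$ stated above.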


4    Span programs and space complexity
Using the transformation from algorithms to span programs from Section 3.3, we immediately
have the following connections between span program size and space complexity.

Theorem 4.1. For any $f : D \to \{0, 1\}$ for $D \subseteq \{0, 1\}^n$, we have

\[
  \mathsf{S}_U(f) \ge \Omega\bigl(\log\widetilde{\mathsf{SP}}(f)\bigr) \quad\text{and}\quad
  \mathsf{S}_U^1(f) \ge \Omega\bigl(\log\mathsf{SP}(f)\bigr).
\]

Theorem 4.1 is a corollary of Theorem 3.2. Theorem 3.1 shows that the lower bound for $\mathsf{S}_U(f)$ in
Theorem 4.1 is part of a tight correspondence between space complexity and $\log s(P) + \log C(P)$.
   Theorem 2.9 of [5] gives a lower bound of $\mathsf{SP}(f) \ge \Omega(2^{n/3}/(n\log n)^{1/3})$ for almost all $n$-bit
Boolean functions. Combined with Theorem 4.1, we immediately have:

Theorem 4.2. For almost all Boolean functions $f : \{0, 1\}^n \to \{0, 1\}$, $\mathsf{S}_U^1(f) = \Omega(n)$.

    Ideally, we would like to use the lower bound in Theorem 4.1 to prove a non-trivial lower
bound for $\mathsf{S}_U(f)$ or $\mathsf{S}_U^1(f)$ for some explicit function $f$. Fortunately, there are somewhat nice
expressions lower bounding $\mathsf{SP}(f)$ [25, 11], which we extend to lower bounds of $\widetilde{\mathsf{SP}}(f)$ in the
remainder of this section. However, on the unfortunate side, there has already been significant
motivation to instantiate these expressions to non-trivial lower bounds for explicit $f$, with no
success. There has been some success in monotone versions of these lower bounds, which we
discuss more in Section 5.

    For a function $f : D \to \{0, 1\}$ for $D \subseteq \{0, 1\}^n$, and an index $j \in [n]$, we let
$\Delta_{f,j} \in \{0, 1\}^{f^{-1}(0) \times f^{-1}(1)}$ be defined by $\Delta_{f,j}[y, x] = 1$ if and only if $x_j \ne y_j$.
When $f$ is clear from context, we simply denote this by $\Delta_j$. The following tight characterization
of $\mathsf{SP}(f)$ may be found in, for example, [23].

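Lemma 4.3. For any $f : D \to \{0, 1\}$ for $D \subseteq \{0, 1\}^n$,

\begin{align*}
  \mathsf{SP}(f) = \text{minimize}\quad & \sum_{j \in [n]} \mathrm{rank}(\Lambda_j) \\
  \text{subject to}\quad & \forall j \in [n],\ \Lambda_j \in \mathbb{R}^{f^{-1}(0) \times f^{-1}(1)} \\
  & \sum_{j \in [n]} \Lambda_j \circ \Delta_j = J,
\end{align*}

where $J$ is the $f^{-1}(0) \times f^{-1}(1)$ all-ones matrix.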

    By Theorem 4.1, the logarithm of the above is a lower bound on $\mathsf{S}_U^1(f)$. We modify Lemma 4.3
to get the following approximate version, whose logarithm lower bounds $\mathsf{S}_U(f)$ when $\kappa = \frac{1}{4}$.




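Lemma 4.4. For any $\kappa \in [0, 1)$, and $f : D \to \{0, 1\}$ for $D \subseteq \{0, 1\}^n$,

\begin{align*}
  \widetilde{\mathsf{SP}}_\kappa(f) \ge \text{minimize}\quad & \sum_{j \in [n]} \mathrm{rank}(\Lambda_j) \tag{4.1} \\
  \text{subject to}\quad & \forall j \in [n],\ \Lambda_j \in \mathbb{R}^{f^{-1}(0) \times f^{-1}(1)} \\
  & \Bigl\|\sum_{j \in [n]} \Lambda_j \circ \Delta_j - J\Bigr\|_\infty \le \sqrt{\kappa}.
\end{align*}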

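Proof. Fix a span program $P$ that $\kappa$-approximates $f$ with $s(P) = \widetilde{\mathsf{SP}}_\kappa(f)$, and let
$\{\langle\omega_y| : y \in f^{-1}(0)\}$ be optimal negative witnesses, and $\{|w_x\rangle : x \in f^{-1}(1)\}$ be
approximate positive witnesses with $\|\Pi_{H(x)^\perp}|w_x\rangle\|^2 \le \frac{\kappa}{W_-}$. Letting $\Pi_{j,b}$ denote
the projector onto $H_{j,b}$, define

\[
  \Lambda_j = \Bigl(\sum_y |y\rangle\langle\omega_y|A\Pi_{j,\bar{y}_j}\Bigr)\Bigl(\sum_x \Pi_{j,x_j}|w_x\rangle\langle x|\Bigr),
\]

so $\Lambda_j$ has rank at most $\dim H_j$, and so $\sum_{j \in [n]} \mathrm{rank}(\Lambda_j) \le s(P) = \widetilde{\mathsf{SP}}_\kappa(f)$.
     We now show that $\{\Lambda_j\}_j$ is a feasible solution. Let $|\mathrm{err}(x)\rangle$ be the positive witness error of
$|w_x\rangle$, $|\mathrm{err}(x)\rangle = \Pi_{H(x)^\perp}|w_x\rangle = \sum_{j=1}^{n} \Pi_{j,\bar{x}_j}|w_x\rangle$. Then we have:

\begin{align*}
  \langle y|\sum_{j=1}^{n} \Lambda_j \circ \Delta_j|x\rangle
    &= \langle\omega_y|A \sum_{j : x_j \ne y_j} \Pi_{j,x_j}|w_x\rangle
     = \langle\omega_y|A\Bigl(|w_x\rangle - \sum_{j : x_j = y_j} \Pi_{j,x_j}|w_x\rangle - |\mathrm{err}(x)\rangle\Bigr) \\
    &= \langle\omega_y|\tau\rangle - \langle\omega_y|A \sum_{j : x_j = y_j} \Pi_{H(y)}\Pi_{j,x_j}|w_x\rangle - \langle\omega_y|A|\mathrm{err}(x)\rangle \\
    &= 1 - 0 - \langle\omega_y|A|\mathrm{err}(x)\rangle \\
  \Bigl|1 - \langle y|\sum_{j=1}^{n} \Lambda_j \circ \Delta_j|x\rangle\Bigr|
    &\le \|\langle\omega_y|A\|\,\||\mathrm{err}(x)\rangle\| = \sqrt{w_-(y)\frac{\kappa}{W_-}} \le \sqrt{\kappa}.
\end{align*}

Above we used the fact that $\langle\omega_y|A\Pi_{H(y)} = 0$. Thus, $\{\Lambda_j\}_j$ is a feasible solution with objective
value $\le \widetilde{\mathsf{SP}}_\kappa(f)$, so the result follows. □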
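
To make the optimization problems of Lemmas 4.3 and 4.4 concrete, the sketch below builds the matrices $\Delta_j$ for a small function and evaluates a candidate solution $\{\Lambda_j\}_j$: it reports the largest entry of $|\sum_j \Lambda_j \circ \Delta_j - J|$ (which must be $0$ for Lemma 4.3 and at most $\sqrt{\kappa}$ for Lemma 4.4) and the objective $\sum_j \mathrm{rank}(\Lambda_j)$. The helper names and the choice of 2-bit parity with $\Lambda_1 = \Lambda_2 = J$ are assumptions made for illustration; coordinates are 0-based in the code.

```python
from fractions import Fraction
from itertools import product

def delta_matrices(f, n):
    """Return (zeros, ones, deltas) with deltas[j][y][x] = 1 iff x_j != y_j."""
    inputs = list(product((0, 1), repeat=n))
    zeros = [y for y in inputs if f(y) == 0]
    ones = [x for x in inputs if f(x) == 1]
    deltas = [[[int(x[j] != y[j]) for x in ones] for y in zeros] for j in range(n)]
    return zeros, ones, deltas

def rank(matrix):
    """Rank over the rationals by Gaussian elimination."""
    rows = [[Fraction(v) for v in row] for row in matrix]
    r = 0
    for col in range(len(rows[0]) if rows else 0):
        pivot = next((i for i in range(r, len(rows)) if rows[i][col] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(len(rows)):
            if i != r and rows[i][col] != 0:
                factor = rows[i][col] / rows[r][col]
                rows[i] = [u - factor * v for u, v in zip(rows[i], rows[r])]
        r += 1
    return r

def max_deviation(lambdas, deltas):
    """Largest entry of |sum_j Lambda_j o Delta_j - J|."""
    n_rows, n_cols = len(deltas[0]), len(deltas[0][0])
    return max(abs(sum(Fraction(L[y][x]) * D[y][x] for L, D in zip(lambdas, deltas)) - 1)
               for y in range(n_rows) for x in range(n_cols))

def parity(x):
    return x[0] ^ x[1]

zeros, ones, deltas = delta_matrices(parity, 2)
lambdas = [[[1, 1], [1, 1]], [[1, 1], [1, 1]]]     # Lambda_1 = Lambda_2 = J
print(max_deviation(lambdas, deltas), sum(rank(L) for L in lambdas))   # 0 2
```

Here every 0-input of parity differs from every 1-input in exactly one coordinate, so $\Delta_1 + \Delta_2 = J$ and the all-ones choice is feasible with objective value 2. By Lemma 4.3 this certifies only that the minimum for this toy function is at most 2.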
    As a corollary of the above, and the connection between span program size and unitary
quantum space complexity stated in Theorem 4.1, the logarithm of the expression in (4.1) with
$\kappa = \frac{1}{4}$ is a lower bound on $\mathsf{S}_U(f)$, and with $\kappa = 0$, it is a lower bound on $\mathsf{S}_U^1(f)$. However, as
stated, it is difficult to use this expression to prove an explicit lower bound, because it is a
minimization problem. We will shortly give a lower bound in terms of a maximization problem,
making it possible to obtain explicit lower bounds by exhibiting a feasible solution.
    A partial matrix is a matrix $M \in (\mathbb{R} \cup \{\star\})^{f^{-1}(0) \times f^{-1}(1)}$. A completion of $M$ is any
$\overline{M} \in \mathbb{R}^{f^{-1}(0) \times f^{-1}(1)}$ such that $\overline{M}[y, x] = M[y, x]$ whenever $M[y, x] \ne \star$. For a
partial matrix $M$, define





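$\mathrm{rank}(M)$ to be the smallest rank of any completion of $M$, and $\varepsilon\text{-}\mathrm{rank}(M)$ to be the smallest rank
of any $\widetilde{M}$ such that $|M[y, x] - \widetilde{M}[y, x]| \le \varepsilon$ for all $y, x$ such that $M[y, x] \ne \star$. Let
$M \circ \Delta_i$ be the partial matrix defined by:

\[
  M \circ \Delta_i[y, x] =
  \begin{cases}
    M[y, x] & \text{if } \Delta_i[y, x] = 1 \\
    0 & \text{if } \Delta_i[y, x] = 0.
  \end{cases}
\]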
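
As an illustration of these definitions, the sketch below represents a partial matrix as nested lists with `None` standing in for the undefined entry ★, and forms $M \circ \Delta_i$ directly from the row and column labels. The helper names, the use of `None`, and the sample matrix are assumptions made for this example; coordinates are 0-based in the code.

```python
STAR = None   # stands for the undefined entry of a partial matrix

def mask_with_delta(M, zeros, ones, i):
    """Return the partial matrix M o Delta_i, where Delta_i[y, x] = 1 iff x_i != y_i."""
    return [[M[r][c] if ones[c][i] != zeros[r][i] else 0
             for c in range(len(ones))]
            for r in range(len(zeros))]

def is_completion(candidate, partial):
    """Check that candidate is fully defined and agrees with partial off the STAR entries."""
    return all(cand is not STAR and (part is STAR or cand == part)
               for cand_row, part_row in zip(candidate, partial)
               for cand, part in zip(cand_row, part_row))

zeros = [(0, 0), (1, 1)]          # rows: inputs y with f(y) = 0 (2-bit parity)
ones = [(0, 1), (1, 0)]           # columns: inputs x with f(x) = 1
M = [[1, STAR], [STAR, -1]]       # a hypothetical partial matrix with entries in [-1, 1]

print(mask_with_delta(M, zeros, ones, 0))   # [[0, None], [None, 0]]
print(mask_with_delta(M, zeros, ones, 1))   # [[1, 0], [0, -1]]
print(is_completion([[0, 0], [0, 0]], mask_with_delta(M, zeros, ones, 0)))   # True
```

In this toy case the first masked matrix has the all-zeros matrix as a completion, so its rank as a partial matrix is 0, while the second is fully defined and has rank 2.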

Then we have the following corollary of [11, Lemma 3.2, Theorem 3.4] and Theorem 4.1:

Lemma 4.5. For all Boolean functions $f : D \to \{0, 1\}$, with $D \subseteq \{0, 1\}^n$, and all partial matrices
$M \in (\mathbb{R} \cup \{\star\})^{f^{-1}(0) \times f^{-1}(1)}$ such that $\max\{|M[y, x]| : M[y, x] \ne \star\} \le 1$:

\[
  \mathsf{S}_U^1(f) \ge \Omega\left(\log\left(\frac{\mathrm{rank}(M)}{\max_{i \in [n]} \mathrm{rank}(M \circ \Delta_i)}\right)\right).
\]

    In [25], Razborov showed that the expression on the right-hand side in Lemma 4.5 is a lower
bound on the logarithm of the formula size of $f$ (Ref. [11] related this to $\mathsf{SP}(f)$). Later, in [26],
Razborov noted that when restricted to non-partial matrices, this can never give a better bound
than $n$. Thus, to prove a non-trivial lower bound on $\mathsf{S}_U^1(f)$ using this method, one would need
to use a partial matrix. We prove the following generalization to the approximate case.

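Lemma 4.6. For all Boolean functions $f : D \to \{0,1\}$, with $D \subseteq \{0,1\}^n$, and all partial matrices
$M \in (\mathbb{R} \cup \{\star\})^{f^{-1}(0)\times f^{-1}(1)}$ such that $\max\{|M[y,x]| : M[y,x] \neq \star\} \le 1$:

$$S_U(f) \ge \Omega\left(\log\left(\frac{\tfrac{1}{2}\text{-rank}(M)}{\max_{i\in[n]} \mathrm{rank}(M \circ \Delta_i)}\right)\right).$$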

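Proof. Let $\{\Lambda_j\}_j$ be an optimal feasible solution for the expression from Lemma 4.4, so

$$\widetilde{\mathrm{SP}}_\kappa(f) \ge \sum_{j\in[n]} \mathrm{rank}(\Lambda_j), \quad\text{and}\quad \Big\| \sum_{j\in[n]} \Lambda_j \circ \Delta_j - J \Big\|_\infty \le \sqrt{\kappa}.$$

Let $M_j$ be a completion of $M \circ \Delta_j$ with rank($M \circ \Delta_j$) = rank($M_j$). Then for any $x, y$ such that
$M[y,x] \neq \star$:

$$\begin{aligned}
\Big| \Big( \sum_{j\in[n]} M_j \circ \Lambda_j \Big)[y,x] - M[y,x] \Big|
&= \Big| \sum_{j\in[n]} M[y,x]\,\Delta_j[y,x]\,\Lambda_j[y,x] - M[y,x] \Big| \\
&\le |M[y,x]| \, \Big\| \sum_{j\in[n]} \Delta_j \circ \Lambda_j - J \Big\|_\infty \le \sqrt{\kappa}.
\end{aligned}$$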



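Thus

$$\sqrt{\kappa}\text{-rank}(M) \le \mathrm{rank}\Big( \sum_{j\in[n]} M_j \circ \Lambda_j \Big) \le \sum_{j\in[n]} \mathrm{rank}(M_j \circ \Lambda_j).$$

Using the fact that for any matrices $B$ and $C$, rank($B \circ C$) ≤ rank($B$)rank($C$), we have

$$\sqrt{\kappa}\text{-rank}(M) \le \sum_{j\in[n]} \mathrm{rank}(\Lambda_j)\,\mathrm{rank}(M_j) \le \widetilde{\mathrm{SP}}_\kappa(f) \max_{j\in[n]} \mathrm{rank}(M \circ \Delta_j).$$

Setting $\kappa = \frac{1}{4}$, and noting that by Theorem 4.1, $S_U(f) \ge \log \widetilde{\mathrm{SP}}(f) = \log \widetilde{\mathrm{SP}}_{1/4}(f)$ completes the
proof. □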
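The proof above relies on the submultiplicativity of rank under the entrywise (Hadamard) product. The following Python sketch checks this inequality on one small example; the matrices `B` and `C` and the helper names are hypothetical choices made for illustration, and a single example is of course not a proof.

```python
from fractions import Fraction


def matrix_rank(rows):
    """Exact rank over the rationals via Gaussian elimination."""
    mat = [[Fraction(v) for v in row] for row in rows]
    rank = 0
    n_cols = len(mat[0]) if mat else 0
    for col in range(n_cols):
        pivot = next((r for r in range(rank, len(mat)) if mat[r][col] != 0), None)
        if pivot is None:
            continue
        mat[rank], mat[pivot] = mat[pivot], mat[rank]
        for r in range(len(mat)):
            if r != rank and mat[r][col] != 0:
                factor = mat[r][col] / mat[rank][col]
                mat[r] = [a - factor * b for a, b in zip(mat[r], mat[rank])]
        rank += 1
    return rank


def hadamard(B, C):
    """Entrywise product of two matrices of the same shape."""
    return [[b * c for b, c in zip(row_b, row_c)] for row_b, row_c in zip(B, C)]


# Illustrative rank-2 matrices (row 2 of B is twice row 1; row 3 of C is row 1 + row 2).
B = [[1, 2, 3], [2, 4, 6], [1, 0, 1]]
C = [[1, 1, 0], [0, 1, 1], [1, 2, 1]]

rank_B, rank_C = matrix_rank(B), matrix_rank(C)
rank_BC = matrix_rank(hadamard(B, C))
print(rank_B, rank_C, rank_BC)
```

Here the Hadamard product has full rank 3, which exceeds both individual ranks but stays below their product 4.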

    Unfortunately, as far as we are aware, nobody has used this lower bound to successfully
prove any explicit, non-trivial formula size lower bound of $2^{\omega(\log n)}$, so it seems to be quite
difficult. However, there has been some success proving lower bounds in the monotone span
program case, even without resorting to partial matrices, which we discuss in the next section.


5   Monotone span programs and monotone algorithms
A monotone function is a Boolean function in which 𝑦 ≤ 𝑥 implies 𝑓 (𝑦) ≤ 𝑓 (𝑥), where 𝑦 ≤ 𝑥
should be interpreted bitwise. In other words, flipping 0s to 1s either keeps the function value
the same, or changes it from 0 to 1. A monotone span program is a span program in which
𝐻𝑖,0 = {0} for all 𝑖, so only 1-valued queries contribute to 𝐻(𝑥), hence 𝐻(𝑦) ⊆ 𝐻(𝑥) whenever
𝑦 ≤ 𝑥. A monotone span program can only decide or approximate a monotone function.
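The definition of a monotone function can be checked by brute force on small examples. The sketch below, in Python, tests a total Boolean function on $n$ bits by flipping single 0s to 1s, which suffices for total functions because any $y \le x$ can be reached by such flips; the function names `is_monotone`, `majority3` and `parity3` are hypothetical and only illustrative.

```python
from itertools import product


def is_monotone(f, n):
    """Check that flipping any single 0 to 1 never decreases f (total functions only)."""
    for y in product((0, 1), repeat=n):
        for i in range(n):
            if y[i] == 0:
                x = y[:i] + (1,) + y[i + 1:]
                if f(y) > f(x):
                    return False
    return True


majority3 = lambda z: int(sum(z) >= 2)  # monotone
parity3 = lambda z: sum(z) % 2          # not monotone

print(is_monotone(majority3, 3), is_monotone(parity3, 3))
```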

Definition 5.1. For a monotone function $f$, define the monotone span program size, denoted mSP($f$),
as the minimum $s(P)$ over (families of) monotone span programs $P$ such that $P$ decides $f$; and
the approximate monotone span program size, denoted $\widetilde{\mathrm{mSP}}_\kappa(f)$, as the minimum $s(P)$ over (families
of) monotone span programs $P$ such that $P$ $\kappa$-approximates $f$. We let $\widetilde{\mathrm{mSP}}(f) = \widetilde{\mathrm{mSP}}_{1/4}(f)$.

    In contrast to SP($f$), there are non-trivial lower bounds for mSP($f$) for explicit monotone
functions $f$. Such a bound does not necessarily give a lower bound on SP($f$), and in particular,
may not be a lower bound on the one-sided error quantum space complexity of $f$. However, lower
bounds on log mSP($f$) or $\log \widetilde{\mathrm{mSP}}(f)$ do give lower bounds on the space complexity of quantum
algorithms obtained from monotone span programs, and as we will soon see, log mSP($f$) and
$\log \widetilde{\mathrm{mSP}}(f)$ are lower bounds on the space complexity of monotone phase estimation algorithms,
described in Section 5.2. The strongest known lower bound on mSP($f$) is the following:

Theorem 5.2 ([24]). There is an explicit Boolean function 𝑓 : 𝐷 → {0, 1} for 𝐷 ⊆ {0, 1} 𝑛 such that

                                             log mSP( 𝑓 ) ≥ Ω(𝑛).


   We will adapt some of the techniques used in existing lower bounds on mSP to show a lower
bound on $\widetilde{\mathrm{mSP}}(f)$ for some explicit $f$:

Theorem 5.3. There is an explicit Boolean function $f : D \to \{0,1\}$ for $D \subseteq \{0,1\}^n$ such that for any
constant $\kappa$,
$$\log \widetilde{\mathrm{mSP}}_\kappa(f) \ge (\log n)^{2-o(1)}.$$

    In particular, this implies a lower bound of $2^{(\log n)^{2-o(1)}}$ on mSP($f$) for the function $f$ in
Theorem 5.3. We prove Theorem 5.3 in Section 5.1. Theorem 5.3 implies that any quantum
algorithm for $f$ obtained from a monotone span program must have space complexity $(\log n)^{2-o(1)}$,
which is slightly better than the trivial lower bound of $\Omega(\log n)$. In Section 5.2, we describe
a more natural class of algorithms called monotone phase estimation algorithms such that
$\log \widetilde{\mathrm{mSP}}(f)$ is a lower bound on the quantum space complexity of any such algorithm computing
$f$ with bounded error. Then for the specific function $f$ from Theorem 5.3, any monotone phase
estimation algorithm for $f$ must use space $(\log n)^{2-o(1)}$.

5.1   Monotone span program lower bounds
Our main tool in proving Theorem 5.3 will be the following.

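Theorem 5.4. For any Boolean function $f : D \to \{0,1\}$, $D \subseteq \{0,1\}^n$, and any constant $\kappa \in [0,1)$:

$$\widetilde{\mathrm{mSP}}_\kappa(f) \ge \max_{M \in \mathbb{R}^{f^{-1}(0)\times f^{-1}(1)} :\, \|M\|_\infty \le 1} \frac{\sqrt{\kappa}\text{-rank}(M)}{\max_{j\in[n]} \mathrm{rank}(M \circ \Delta_{j,1})},$$

where $\Delta_{j,1}[y,x] = 1$ if $y_j = 0$ and $x_j = 1$, and 0 else.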

    When $\kappa = 0$, the right-hand side of the inequality in Theorem 5.4 is the (monotone) rank
measure, defined in [25], and shown in [11] to lower bound monotone span program size. We
extend the proof for the $\kappa = 0$ case to get a lower bound on approximate span program size. We
could also allow for partial matrices $M$, as in the non-monotone case (Lemma 4.6), but unlike
the non-monotone case, it is not necessary to consider partial matrices to get non-trivial lower
bounds.

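Proof. Fix a monotone span program that $\kappa$-approximates $f$ with size $\widetilde{\mathrm{mSP}}_\kappa(f)$. Let $\{\langle\omega_y| : y \in f^{-1}(0)\}$
be optimal negative witnesses, and let $\{|w_x\rangle : x \in f^{-1}(1)\}$ be approximate positive
witnesses with $\big\|\Pi_{H(x)^\perp}|w_x\rangle\big\|^2 \le \frac{\kappa}{W_-}$. Letting $\Pi_{j,b}$ denote the projector onto $H_{j,b}$, define

$$\Lambda_j = \sum_{y\in f^{-1}(0)} |y\rangle\langle\omega_y| A \Pi_{j,\bar{y}_j} \sum_{x\in f^{-1}(1)} \Pi_{j,x_j} |w_x\rangle\langle x| = \sum_{\substack{y\in f^{-1}(0):\\ y_j=0}} |y\rangle\langle\omega_y| A \Pi_{j,1} \sum_{\substack{x\in f^{-1}(1):\\ x_j=1}} \Pi_{j,1} |w_x\rangle\langle x|,$$

so $\Lambda_j$ has rank at most $\dim H_j$, and so $\sum_{j\in[n]} \mathrm{rank}(\Lambda_j) \le s(P) = \widetilde{\mathrm{mSP}}_\kappa(f)$. Furthermore, $\Lambda_j$ is
only supported on $(y,x)$ such that $y_j = 0$ and $x_j = 1$, so $\Lambda_j \circ \Delta_{j,1} = \Lambda_j$. Denoting the error of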


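$|w_x\rangle$ as $|\mathrm{err}(x)\rangle = \Pi_{H(x)^\perp}|w_x\rangle = \sum_{j:x_j=0} \Pi_{j,1}|w_x\rangle$, we have

$$\begin{aligned}
\langle y| \sum_{j\in[n]} \Lambda_j |x\rangle
&= \sum_{j:\,y_j=0,\,x_j=1} \langle\omega_y| A \Pi_{j,1} |w_x\rangle
 = \langle\omega_y| A \Big(\sum_{j:y_j=0} \Pi_{j,1}\Big) \Big(\sum_{j:x_j=1} \Pi_{j,1}\Big) |w_x\rangle \\
&= \langle\omega_y| A \big(|w_x\rangle - |\mathrm{err}(x)\rangle\big) = \langle\omega_y|A|w_x\rangle - \langle\omega_y|A|\mathrm{err}(x)\rangle, \\
\Big| 1 - \langle y| \sum_{j\in[n]} \Lambda_j |x\rangle \Big|
&\le |1 - 1| + \big\|\langle\omega_y|A\big\| \, \big\||\mathrm{err}(x)\rangle\big\| \le \sqrt{W_-}\sqrt{\frac{\kappa}{W_-}} = \sqrt{\kappa}.
\end{aligned}$$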


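Then for any $M \in \mathbb{R}^{f^{-1}(0)\times f^{-1}(1)}$ with $\|M\|_\infty \le 1$, we have:

$$\Big\| M - M \circ \sum_{j\in[n]} \Lambda_j \Big\|_\infty \le \|M\|_\infty \Big\| J - \sum_{j\in[n]} \Lambda_j \Big\|_\infty \le \sqrt{\kappa}.$$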

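Thus

$$\begin{aligned}
\sqrt{\kappa}\text{-rank}(M) &\le \mathrm{rank}\Big( M \circ \sum_{j\in[n]} \Lambda_j \Big) \le \sum_{j\in[n]} \mathrm{rank}(M \circ \Lambda_j) \\
&= \sum_{j\in[n]} \mathrm{rank}(M \circ \Delta_{j,1} \circ \Lambda_j) \le \sum_{j\in[n]} \mathrm{rank}(M \circ \Delta_{j,1})\,\mathrm{rank}(\Lambda_j) \\
&\le \widetilde{\mathrm{mSP}}_\kappa(f) \max_{j\in[n]} \mathrm{rank}(M \circ \Delta_{j,1}). \qquad □
\end{aligned}$$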


    To show a lower bound on $\widetilde{\mathrm{mSP}}(f)$ for some explicit $f : \{0,1\}^n \to \{0,1\}$, it turns out to be
sufficient to find some high approximate rank matrix $M \in \mathbb{R}^{Y\times X}$ for finite sets $X$ and $Y$, and
a rectangle cover of $M$, $\Delta_1, \dots, \Delta_n$, where each $\Delta_i \circ M$ has low rank. Specifically, we have the
following lemma, which, with rank in place of approximate rank, has been used extensively in
previous monotone span program lower bounds.

Lemma 5.5. Let $M \in \mathbb{R}^{Y\times X}$ with $\|M\|_\infty \le 1$, for some finite sets $X$ and $Y$, and let $X_1, \dots, X_n \subseteq X$,
$Y_1, \dots, Y_n \subseteq Y$ be such that for all $(x,y) \in X \times Y$, there exists $j \in [n]$ such that $(x,y) \in X_j \times Y_j$.
Define $\Delta_j \in \{0,1\}^{Y\times X}$ by $\Delta_j[y,x] = 1$ if and only if $(y,x) \in Y_j \times X_j$. There exists a monotone function
$f : D \to \{0,1\}$ for $D \subseteq \{0,1\}^n$ such that for any constant $\kappa \in [0,1)$:

$$\widetilde{\mathrm{mSP}}_\kappa(f) \ge \frac{\sqrt{\kappa}\text{-rank}(M)}{\max_{j\in[n]} \mathrm{rank}(M \circ \Delta_j)}.$$

Proof. For each $y \in Y$, define $t^y \in \{0,1\}^n$ by:

$$t^y_j = \begin{cases} 0 & \text{if } y \in Y_j \\ 1 & \text{else.} \end{cases}$$



Similarly, for each $x \in X$, define $s^x \in \{0,1\}^n$ by

$$s^x_j = \begin{cases} 1 & \text{if } x \in X_j \\ 0 & \text{else.} \end{cases}$$

For every $(y,x) \in Y \times X$, there is some $j$ such that $y \in Y_j$ and $x \in X_j$, that is, $t^y_j = 0$ and $s^x_j = 1$, so it cannot be the case
that $s^x \le t^y$. Thus, we can define $f$ as the unique monotone function such that $f(s) = 1$ for
every $s \in \{0,1\}^n$ such that $s^x \le s$ for some $x \in X$, and $f(t) = 0$ for all $t \in \{0,1\}^n$ such that
$t \le t^y$ for some $y \in Y$. Then we can define a matrix $M' \in \mathbb{R}^{f^{-1}(0)\times f^{-1}(1)}$ by $M'[t^y, s^x] = M[y,x]$
for all $(y,x) \in Y \times X$, and 0 elsewhere. We have $\varepsilon$-rank($M'$) = $\varepsilon$-rank($M$) for all $\varepsilon$, and
rank($M' \circ \Delta_{j,1}$) = rank($M \circ \Delta_j$) for all $j$. The result then follows from Theorem 5.4. □
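The encoding of a rectangle cover into the strings $s^x$ and $t^y$ used in this proof can be sketched in Python as follows. The sets `X`, `Y` and the three rectangles are hypothetical toy data, and the helper names `encode_cover` and `leq` are illustrative; the sketch only checks the key property that $s^x \le t^y$ never holds.

```python
def encode_cover(X, Y, rectangles):
    """Map each x to s^x and each y to t^y for a cover given as (X_j, Y_j) pairs."""
    # Sanity check: every pair (x, y) must lie in at least one rectangle X_j x Y_j.
    for x in X:
        for y in Y:
            if not any(x in Xj and y in Yj for Xj, Yj in rectangles):
                raise ValueError(f"pair ({x}, {y}) is not covered")
    s = {x: tuple(1 if x in Xj else 0 for Xj, _ in rectangles) for x in X}
    t = {y: tuple(0 if y in Yj else 1 for _, Yj in rectangles) for y in Y}
    return s, t


def leq(a, b):
    """Bitwise comparison a <= b."""
    return all(ai <= bi for ai, bi in zip(a, b))


# Toy data (illustrative only): two columns, two rows, three rectangles.
X = ["x1", "x2"]
Y = ["y1", "y2"]
rectangles = [({"x1"}, {"y1", "y2"}), ({"x2"}, {"y1"}), ({"x2"}, {"y2"})]

s, t = encode_cover(X, Y, rectangles)
separated = not any(leq(s[x], t[y]) for x in X for y in Y)
print(s, t, separated)
```

Because no $s^x$ lies below any $t^y$, the up-set of the $s^x$ and the down-set of the $t^y$ are disjoint, which is what allows a monotone $f$ with the stated values to exist.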
    We will prove Theorem 5.3 by constructing an 𝑀 with high approximate rank, and a good
rectangle cover. Following [29] and [24], we will make use of a technique due to Sherstov for
proving communication lower bounds, called the pattern matrix method [30]. We begin with
some definitions.
Definition 5.6 (Fourier spectrum). For a real-valued function $p : \{0,1\}^m \to \mathbb{R}$, its Fourier
coefficients are defined, for each $S \subseteq [m]$:

$$\hat{p}(S) = \frac{1}{2^m} \sum_{z\in\{0,1\}^m} p(z)\chi_S(z),$$

where $\chi_S(z) = (-1)^{\sum_{i\in S} z_i}$. It is easily verified that $p = \sum_{S\subseteq[m]} \hat{p}(S)\chi_S$.

Definition 5.7 (Degree and approximate degree). The degree of a function $p : \{0,1\}^m \to \mathbb{R}$ is
defined $\deg(p) = \max\{|S| : \hat{p}(S) \neq 0\}$. For any $\varepsilon \ge 0$, $\widetilde{\deg}_\varepsilon(p) = \min\{\deg(\tilde{p}) : \|p - \tilde{p}\|_\infty \le \varepsilon\}$.
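Definitions 5.6 and 5.7 can be made concrete with a brute-force computation. The Python sketch below computes the Fourier coefficients and the (exact) degree of a small function and checks the inversion formula; the names `fourier_coefficients`, `degree` and `and_pm` are hypothetical, subsets are represented as tuples of 0-based indices, and the approximate degree is not computed since that would require solving a linear program.

```python
from fractions import Fraction
from itertools import combinations, product


def fourier_coefficients(p, m):
    """Return {S: p_hat(S)} for all S subseteq {0, ..., m-1}, as exact fractions."""
    points = list(product((0, 1), repeat=m))
    coeffs = {}
    for size in range(m + 1):
        for S in combinations(range(m), size):
            total = sum(p(z) * (-1) ** sum(z[i] for i in S) for z in points)
            coeffs[S] = Fraction(total, 2 ** m)
    return coeffs


def degree(coeffs):
    """Largest |S| with a nonzero Fourier coefficient (0 for the zero function)."""
    return max((len(S) for S, c in coeffs.items() if c != 0), default=0)


# AND on 2 bits with range {-1, +1}: -1 exactly on input (1, 1).
and_pm = lambda z: -1 if all(z) else 1
coeffs = fourier_coefficients(and_pm, 2)

# Inversion check: p(z) = sum_S p_hat(S) * chi_S(z).
inversion_ok = all(
    and_pm(z) == sum(c * (-1) ** sum(z[i] for i in S) for S, c in coeffs.items())
    for z in product((0, 1), repeat=2)
)
print(coeffs, degree(coeffs), inversion_ok)
```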
    Pattern matrices, defined by Sherstov in [30], are useful for proving lower bounds in
communication complexity, because their rank and approximate rank are relatively easy to
lower bound. In [29], Robere, Pitassi, Rossman and Cook first used this analysis to give lower
bounds on mSP( 𝑓 ) for some 𝑓 . We now state the definition, using the notation from [24], which
differs slightly from [30].
Definition 5.8 (Pattern matrix). For a real-valued function $p : \{0,1\}^m \to \mathbb{R}$, and a positive integer
$\lambda$, the $(m, \lambda, p)$-pattern matrix is defined as $F \in \mathbb{R}^{\{0,1\}^{\lambda m} \times ([\lambda]^m \times \{0,1\}^m)}$ where for $y \in \{0,1\}^{\lambda m}$,
$x \in [\lambda]^m$, and $w \in \{0,1\}^m$,
$$F[y, (x,w)] = p(y|_x \oplus w),$$
where by $y|_x$, we mean the $m$-bit string containing one bit from each $\lambda$-sized block of $y$ as
specified by the entries of $x$: $(y^{(1)}_{x_1}, y^{(2)}_{x_2}, \dots, y^{(m)}_{x_m})$, where $y^{(i)} \in \{0,1\}^\lambda$ is the $i$-th block of $y$.
     For comparison, what [30] calls an (𝑛, 𝑡, 𝑝)-pattern matrix would be a (𝑡, 𝑛/𝑡, 𝑝)-pattern
matrix in our notation. As previously mentioned, a pattern matrix has the nice property that
its rank (or even approximate rank) can be bounded from below in terms of properties of the
Fourier spectrum of 𝑝. In particular, the following is proven in [30]:


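Lemma 5.9. Let $F$ be the $(m, \lambda, p)$-pattern matrix for $p : \{0,1\}^m \to \{-1,+1\}$. Then for any $\varepsilon \in [0,1]$
and $\delta \in [0, \varepsilon]$, we have:

$$\mathrm{rank}(F) = \sum_{S\subseteq[m] :\, \hat{p}(S)\neq 0} \lambda^{|S|} \quad\text{and}\quad \delta\text{-rank}(F) \ge \lambda^{\widetilde{\deg}_\varepsilon(p)} \frac{(\varepsilon-\delta)^2}{(1+\delta)^2}.$$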

    This shows that we can use functions $p$ of high approximate degree to construct pattern
matrices $F \in \mathbb{R}^{\{0,1\}^{\lambda m} \times ([\lambda]^m \times \{0,1\}^m)}$ of high approximate rank. To apply Lemma 5.5, we also need
to find a good rectangle cover of some $F$.
    A $b$-certificate for a function $p$ on $\{0,1\}^m$ is an assignment $\alpha : S \to \{0,1\}$ for some $S \subseteq [m]$
such that for any $x \in \{0,1\}^m$ such that $x_j = \alpha(j)$ for all $j \in S$, $p(x) = b$. The size of a certificate is
$|S|$. The following shows how to use the certificates of $p$ to construct a rectangle cover of its
pattern matrix.
Lemma 5.10. Let $p : \{0,1\}^m \to \{-1,+1\}$, and suppose there is a set of $\ell$ certificates for $p$ of size at
most $C$ such that every input satisfies at least one certificate. Then for any positive integer $\lambda$, there exists
a function $f : \{0,1\}^n \to \{0,1\}$ for $n = \ell(2\lambda)^C$ such that for any $\kappa \in (0,1)$ and $\varepsilon \in [\sqrt{\kappa}, 1]$:

$$\widetilde{\mathrm{mSP}}_\kappa(f) \ge \Omega\left((\varepsilon - \sqrt{\kappa})^2 \lambda^{\widetilde{\deg}_\varepsilon(p)}\right).$$


Proof. For $i = 1, \dots, \ell$, let $\alpha_i : S_i \to \{0,1\}$ for $S_i \subset [m]$ of size $|S_i| \le C$ be one of the $\ell$ certificates.
That is, for each $i$, there is some $v_i \in \{-1,+1\}$ such that for any $x \in \{0,1\}^m$, if $x_j = \alpha_i(j)$ for all
$j \in S_i$, then $p(x) = v_i$ (so $\alpha_i$ is a $v_i$-certificate).
    We let $F$ be the $(m, \lambda, p)$-pattern matrix, which has $\|F\|_\infty = 1$ since $p$ has range $\{-1,+1\}$. We
will define a rectangle cover as follows. For every $i \in [\ell]$, $k \in [\lambda]^{S_i}$, and $b \in \{0,1\}^{S_i}$, define:

$$\begin{aligned}
X_{i,k,b} &= \{(x,w) \in [\lambda]^m \times \{0,1\}^m : \forall j \in S_i,\ w_j = b_j,\ x_j = k_j\} \\
Y_{i,k,b} &= \{y \in \{0,1\}^{\lambda m} : \forall j \in S_i,\ y^{(j)}_{k_j} = b_j \oplus \alpha_i(j)\}.
\end{aligned}$$


We first note that this is a rectangle cover. Fix any $y \in \{0,1\}^{\lambda m}$, $x \in [\lambda]^m$ and $w \in \{0,1\}^m$. First
note that for any $i$, if we let $b$ be the restriction of $w$ to $S_i$, and $k$ the restriction of $x$ to $S_i$, we
have $(x,w) \in X_{i,k,b}$. This holds in particular for $i$ such that $\alpha_i$ is a certificate for $y|_x \oplus w$, and by
assumption there is at least one such $i$. For such an $i$, we have $y^{(j)}_{x_j} \oplus w_j = \alpha_i(j)$ for all $j \in S_i$, so
$y \in Y_{i,k,b}$. Thus, we can apply Lemma 5.5.
    Note that if $(x,w) \in X_{i,k,b}$, and $y \in Y_{i,k,b}$, then $(y|_x \oplus w)[j] = y^{(j)}_{x_j} \oplus w_j = \alpha_i(j)$ for all $j \in S_i$,
so $p(y|_x \oplus w) = v_i$. Letting $\Delta_{i,k,b}[y,(x,w)] = 1$ if $y \in Y_{i,k,b}$ and $(x,w) \in X_{i,k,b}$, and 0 else, we
have that if $y \in Y_{i,k,b}$ and $(x,w) \in X_{i,k,b}$, $(F \circ \Delta_{i,k,b})[y,(x,w)] = p(y|_x \oplus w) = v_i$, and otherwise,
$(F \circ \Delta_{i,k,b})[y,(x,w)] = 0$. Thus rank($F \circ \Delta_{i,k,b}$) = rank($v_i \Delta_{i,k,b}$) = 1. Then by Lemma 5.5, there
exists $f : \{0,1\}^n \to \{0,1\}$ where $n = \sum_{i=1}^{\ell} (2\lambda)^{|S_i|} \le \ell(2\lambda)^C$ such that:
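$$\begin{aligned}
\widetilde{\mathrm{mSP}}_\kappa(f) &\ge \sqrt{\kappa}\text{-rank}(F) \\
&\ge \lambda^{\widetilde{\deg}_\varepsilon(p)} \frac{(\varepsilon - \sqrt{\kappa})^2}{(1 + \sqrt{\kappa})^2}, \quad\text{by Lemma 5.9.} \qquad □
\end{aligned}$$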
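The rectangle cover built from certificates in this proof can be verified exhaustively on a small instance. The Python sketch below enumerates the rectangles $X_{i,k,b} \times Y_{i,k,b}$ for a 2-bit AND with range $\{-1,+1\}$ and $\lambda = 2$, and checks that they cover every entry of the pattern matrix and that $p(y|_x \oplus w)$ is constant on each rectangle. The certificate list, the 0-based indexing, and the helper names are hypothetical choices for illustration.

```python
from itertools import product


def rectangles_from_certificates(certs, lam):
    """certs: list of dicts {position j: required bit alpha_i(j)}.

    Returns one triple (i, k, b) per rectangle, with k and b stored as dicts on S_i.
    """
    rects = []
    for i, alpha in enumerate(certs):
        S = sorted(alpha)
        for k in product(range(lam), repeat=len(S)):
            for b in product((0, 1), repeat=len(S)):
                rects.append((i, dict(zip(S, k)), dict(zip(S, b))))
    return rects


def in_rectangle(rect, certs, y, x, w, lam):
    """Test (x, w) in X_{i,k,b} and y in Y_{i,k,b}."""
    i, k, b = rect
    alpha = certs[i]
    in_X = all(w[j] == b[j] and x[j] == k[j] for j in alpha)
    in_Y = all(y[j * lam + k[j]] == b[j] ^ alpha[j] for j in alpha)
    return in_X and in_Y


m, lam = 2, 2
p = lambda z: -1 if all(z) else 1
# Certificates for AND: a single 0 certifies +1, both bits 1 certify -1.
certs = [{0: 0}, {1: 0}, {0: 1, 1: 1}]
rects = rectangles_from_certificates(certs, lam)

covered = True
values = [set() for _ in rects]
for y in product((0, 1), repeat=lam * m):
    for x in product(range(lam), repeat=m):
        for w in product((0, 1), repeat=m):
            z = tuple(y[j * lam + x[j]] ^ w[j] for j in range(m))
            hits = [r for r, rect in enumerate(rects)
                    if in_rectangle(rect, certs, y, x, w, lam)]
            covered = covered and bool(hits)
            for r in hits:
                values[r].add(p(z))

monochromatic = all(len(v) == 1 for v in values)
print(len(rects), covered, monochromatic)
```

The number of rectangles, here $4 + 4 + 16 = 24$, is the input length $n = \sum_i (2\lambda)^{|S_i|}$ of the resulting function $f$.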


    We now prove Theorem 5.3, restated below.

Theorem 5.3. There is an explicit Boolean function $f : D \to \{0,1\}$ for $D \subseteq \{0,1\}^n$ such that for
any constant $\kappa$,
$$\log \widetilde{\mathrm{mSP}}_\kappa(f) \ge \Omega((\log n)^{2-o(1)}).$$

Proof. By [8, Theorem 38], there is a function $p$ with $\widetilde{\deg}_{1/3}(p) \ge C(p)^{2-o(1)}$, which is, up to the
$o(1)$ in the exponent, the best possible separation between these two quantities. In particular, this
function has $\widetilde{\deg}_{1/3}(p) \ge M^{2-o(1)}$, and $C(p) \le M^{1+o(1)}$, where $C(p)$ is the certificate complexity
of $p$, for some parameter $M$ (see [8] equations (64) and (65), where $p$ is referred to as $F$), and $p$ is
a function on $M^{2+o(1)}$ variables (see [8], discussion above equation (64)). Thus, there are at most
$\binom{M^{2+o(1)}}{M^{1+o(1)}} 2^{M^{1+o(1)}}$ possible certificates of size $M^{1+o(1)}$ such that each input satisfies at least one of
them.
Then by Lemma 5.10 there exists a function $f : \{0,1\}^n \to \{0,1\}$ for $n \le \binom{M^{2+o(1)}}{M^{1+o(1)}} 2^{M^{1+o(1)}} (2\lambda)^{M^{1+o(1)}}$
such that for constant $\kappa < 1/36$ and constant $\lambda$,

$$\log \widetilde{\mathrm{mSP}}_\kappa(f) \ge \Omega\big(\widetilde{\deg}_{1/3}(p) \log \lambda\big) \ge M^{2-o(1)}.$$

Then we have

$$\log n \le \log\binom{M^{2+o(1)}}{M^{1+o(1)}} + \log 2^{M^{1+o(1)}} + M^{1+o(1)} \log(2\lambda) = O(M^{1+o(1)} \log M) = M^{1+o(1)}.$$

Thus, $\log \widetilde{\mathrm{mSP}}_\kappa(f) \ge (\log n)^{2-o(1)}$, and the result for any $\kappa$ follows using Corollary 3.9. □

    Since for all total functions $p$, $\widetilde{\deg}_{1/3}(p) \le C(p)^2$, where $C(p)$ is the certificate complexity
of $p$, Lemma 5.10 cannot prove a lower bound better than $\log \widetilde{\mathrm{mSP}}(f) \ge (\log n)^2$ for any $n$-bit
function $f$. We state a more general version of Lemma 5.10 that might have the potential to prove
a better bound, but we leave this for future work.

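Lemma 5.11. Fix $p : \{0,1\}^m \to \{-1,+1\}$. For $i = 1, \dots, \ell$, let $\alpha_i : S_i \to \{0,1\}$ for $S_i \subseteq [m]$ be a
partial assignment such that every $z \in \{0,1\}^m$ satisfies at least one of the assignments. Let $p_i$ denote the
restriction of $p$ to strings $z$ satisfying the assignment $\alpha_i$. Then for every positive integer $\lambda$, there exists a
function $f : \{0,1\}^n \to \{0,1\}$, where $n = \sum_{i=1}^{\ell} (2\lambda)^{|S_i|}$, such that for any $\kappa \in (0,1)$ and $\varepsilon \in [\sqrt{\kappa}, 1]$:

$$\widetilde{\mathrm{mSP}}_\kappa(f) \ge \Omega\left( \frac{(\varepsilon - \sqrt{\kappa})^2 \lambda^{\widetilde{\deg}_\varepsilon(p)}}{\max_{i\in[\ell]} \sum_{S\subseteq[m]\setminus S_i :\, \hat{p}_i(S)\neq 0} \lambda^{|S|}} \right).$$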

      To make use of this lemma, one needs a function $p$ of high approximate degree, such
that for every input, there is a small assignment that lowers the degree to something small.
This generalizes Lemma 5.10 because a certificate is an assignment that lowers the degree
of the remaining sub-function to constant. However, we note that a $p$ with these conditions
is necessary but may not be sufficient for proving a non-trivial lower bound, because while
$\sum_{S :\, \hat{p}_i(S)\neq 0} \lambda^{|S|} \ge \lambda^{\deg(p_i)}$, it may also be much larger if $p_i$ has a dense Fourier spectrum.



Proof. Let $F$ be the $(m, \lambda, p)$-pattern matrix. Let $\{X_{i,k,b} \times Y_{i,k,b}\}_{i,k,b}$ be the same rectangle cover
defined in the proof of Lemma 5.10, with the difference that since the $\alpha_i$ are no longer certificates,
the resulting submatrices of $F$ may not have constant rank.
    Let $\Delta_{i,k,b} = \sum_{y\in Y_{i,k,b}} |y\rangle \sum_{(x,w)\in X_{i,k,b}} \langle x,w|$. Then

$$F \circ \Delta_{i,k,b} = \sum_{y\in Y_{i,k,b},\,(x,w)\in X_{i,k,b}} p(y|_x \oplus w)\,|y\rangle\langle x,w|.$$

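Note that when $y \in Y_{i,k,b}$ and $(x,w) \in X_{i,k,b}$, $y|_x \oplus w$ satisfies $\alpha_i$, so $p(y|_x \oplus w) = p_i(y'|_{x'} \oplus w')$,
where $y'$, $x'$ and $w'$ are restrictions of $y \in (\{0,1\}^\lambda)^m$, $x \in [\lambda]^m$ and $w \in \{0,1\}^m$ to $[m] \setminus S_i$. Thus,
continuing from above, and rearranging registers, we have:

$$\begin{aligned}
F \circ \Delta_{i,k,b} &= \sum_{y' \in (\{0,1\}^\lambda)^{[m]\setminus S_i}} \ \sum_{\substack{x' \in [\lambda]^{[m]\setminus S_i},\\ w' \in \{0,1\}^{[m]\setminus S_i}}} p_i(y'|_{x'} \oplus w')\,|y'\rangle\langle x', w'| \ \otimes \sum_{\substack{\bar{y} \in (\{0,1\}^\lambda)^{S_i} :\\ \bar{y}|_k = b \oplus \alpha_i}} |\bar{y}\rangle\langle k, b| \\
&= F_i \otimes J_{2^{(\lambda-1)|S_i|},\,1}
\end{aligned}$$

where $F_i$ is the $(m, \lambda, p_i)$-pattern matrix, and $J_{a,b}$ is the all-ones matrix of dimension $a$ by $b$,
which always has rank 1 for $a, b > 0$. Thus

$$\mathrm{rank}(F \circ \Delta_{i,k,b}) = \mathrm{rank}(F_i)\,\mathrm{rank}(J_{2^{(\lambda-1)|S_i|},\,1}) = \mathrm{rank}(F_i) = \sum_{S\subseteq[m]\setminus S_i :\, \hat{p}_i(S)\neq 0} \lambda^{|S|},$$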

by [30]. This part of the proof follows [29, Lemma IV.6].
    Then by Lemma 5.5 and Lemma 5.9, we have:
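$$\widetilde{\mathrm{mSP}}_\kappa(f) \ge \Omega\left( \frac{\sqrt{\kappa}\text{-rank}(F)}{\max_{i,k,b} \mathrm{rank}(F \circ \Delta_{i,k,b})} \right) \ge \Omega\left( \frac{\left(\frac{\varepsilon - \sqrt{\kappa}}{1 + \sqrt{\kappa}}\right)^2 \lambda^{\widetilde{\deg}_\varepsilon(p)}}{\max_i \sum_{S\subseteq[m]\setminus S_i :\, \hat{p}_i(S)\neq 0} \lambda^{|S|}} \right). \qquad □$$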

5.2   Monotone algorithms
In Theorem 5.3, we showed a non-trivial lower bound on $\log \widetilde{\mathrm{mSP}}(f)$ for some explicit monotone
function $f$. Unlike lower bounds on $\log \widetilde{\mathrm{SP}}(f)$, this does not give us a lower bound on the
quantum space complexity of $f$. However, at the very least it gives us a lower bound on the
quantum space complexity of a certain type of quantum algorithm. Of course, this is naturally the
case, since a lower bound on $\widetilde{\mathrm{mSP}}(f)$ gives us a lower bound on the quantum space complexity
of any algorithm for $f$ that is obtained from a monotone span program. Still, this is not the
most satisfying characterization, as it is difficult to imagine what this class of algorithms looks
like.
    In this section, we will consider a more natural class of algorithms whose space complexity is
shown to be at least $\log \widetilde{\mathrm{mSP}}(f)$, and in some cases log mSP($f$). We will call a quantum query algorithm
a phase estimation algorithm if it works by estimating the amplitude on $|0\rangle$ in the phase register


after running phase estimation of a unitary that makes one query. We assume that the unitary
for which we perform phase estimation is of the form $U\mathcal{O}_x$. This is without loss of generality,
because the most general form is a unitary $U_2\mathcal{O}_xU_1$, but we have $(U_2\mathcal{O}_xU_1)^t|\psi_0\rangle = U_1^\dagger(U\mathcal{O}_x)^t|\psi_0'\rangle$
where $|\psi_0'\rangle = U_1|\psi_0\rangle$, and $U = U_1U_2$. The weight on a phase of $|0\rangle$ is not affected by this global
($t$-independent) $U_1^\dagger$. Thus, we define a phase estimation algorithm as follows:

Definition 5.12. A phase estimation algorithm $\mathcal{A} = (U, |\psi_0\rangle, \delta, T, M)$ for $f : D \to \{0,1\}$,
$D \subseteq \{0,1\}^n$, is defined by (families of):

   • a unitary $U$ acting on $\mathcal{H} = \mathrm{span}\{|j,z\rangle : j \in [n], z \in \mathcal{Z}\}$ for some finite set $\mathcal{Z}$;

   • an initial state $|\psi_0\rangle \in \mathcal{H}$;

   • a bound $\delta \in [0, 1/2)$;

   • positive integers $T$ and $M \le 1/\sqrt{\delta}$;

such that for any $M' \ge M$ and $T' \ge T$, the following procedure computes $f$ with bounded error:

   1. Let $\Phi(x)$ be the algorithm that runs phase estimation of $U\mathcal{O}_x$ on $|\psi_0\rangle$ for $T'$ steps, and then
      computes a bit $|b\rangle_A$ in a new register $A$, such that $b = 0$ if and only if the phase estimate
      is 0.

   2. Run $M'$ steps of amplitude estimation to estimate the amplitude on $|0\rangle_A$ after application
      of $\Phi(x)$. Output 0 if the estimated amplitude is $> \delta$, and output 1 otherwise.

The query complexity of the algorithm is $O(MT)$, and the space complexity of the algorithm is
$\log\dim\mathcal{H} + \log T + \log M + 1$.
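
The resource accounting in Definition 5.12 can be tabulated directly. The following Python sketch is only an illustration: the function names and the sample parameters are assumptions introduced here, not part of the paper, and logarithms are taken base 2. It computes the space complexity $\log\dim\mathcal{H} + \log T + \log M + 1$ and the product $MT$ that appears inside the $O(MT)$ query bound.

```python
import math


def space_complexity(dim_h: int, t: int, m: int) -> float:
    """Return log2(dim H) + log2(T) + log2(M) + 1, the space bound of Definition 5.12."""
    if min(dim_h, t, m) < 1:
        raise ValueError("dim_h, t and m must be positive integers")
    return math.log2(dim_h) + math.log2(t) + math.log2(m) + 1


def query_scale(t: int, m: int) -> int:
    """Return M * T, the quantity inside the O(MT) query bound (constants dropped)."""
    return m * t


# Hypothetical parameters, chosen only for illustration.
example_space = space_complexity(dim_h=64, t=16, m=4)
example_queries = query_scale(t=16, m=4)
```

With these sample values the space is $6 + 4 + 2 + 1 = 13$, and $2\dim\mathcal{H} = 128 \le 2^{13}$.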

    We insist that the algorithm work not only for 𝑀 and 𝑇 but for any larger integers as well,
to ensure that the algorithm succeeds because 𝑀 and 𝑇 are large enough,
and not because of some quirk of the particular chosen values. When 𝛿 = 0, the algorithm has one-sided
error (see Lemma 5.18).
    We remark on the generality of this form of algorithm. Any algorithm can be put into this
form by first converting it to a span program using the construction of Section 3.3 (Theorem 3.2),
and then compiling that into an algorithm using the construction of Section 3.2 (Theorem 3.1),
preserving both the time and space complexity, asymptotically. However, we will consider a
special case of this type of algorithm that is not fully general.

Definition 5.13. A monotone phase estimation algorithm is a phase estimation algorithm such
that if $\Pi_0(x)$ denotes the orthogonal projector onto the $(+1)$-eigenspace of $U\mathcal{O}_x$, then for any
$x \in \{0,1\}^n$, $\Pi_0(x)|\psi_0\rangle$ is in the $(+1)$-eigenspace of $\mathcal{O}_x$.

    Let us consider what is “monotone” about this definition. The algorithm outputs 0 if $|\psi_0\rangle$
has high overlap with the $(+1)$-eigenspace of $U\mathcal{O}_x$, i.e., $\Pi_0(x)|\psi_0\rangle$ has large norm. In a monotone
phase estimation algorithm, we know that the only contribution to $\Pi_0(x)|\psi_0\rangle$ is in the $(+1)$-eigenspace
of $\mathcal{O}_x$, which is exactly the span of $|j,z\rangle$ such that $x_j = 0$. Thus, only queries that return 0 can
contribute to the algorithm rejecting.
    As a simple example, Grover’s algorithm is a monotone phase estimation algorithm.
Specifically, let $|\psi_0\rangle = \frac{1}{\sqrt{n}}\sum_{j=1}^{n}|j\rangle$ and $U = 2|\psi_0\rangle\langle\psi_0| - I$. Then $U\mathcal{O}_x$ is the
standard Grover iterate, and $|\psi_0\rangle$ is in the span of $e^{i\theta}$-eigenvectors of $U\mathcal{O}_x$ with
$\sin|\theta| = \sqrt{|x|/n}$, so phase estimation can be used to distinguish the case $|x| = 0$ from $|x| \ge 1$.
So $\Pi_0(x)|\psi_0\rangle$ is either $0$, when $|x| \ne 0$, or $|\psi_0\rangle$, when $|x| = 0$. In both cases, it is in the
$(+1)$-eigenspace of $\mathcal{O}_x$.
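
The Grover example can be checked numerically on small inputs. The sketch below is an illustration only: the function name and the test inputs are assumptions made here. It assumes the standard fact that phase estimation of $U\mathcal{O}_x$ on $|\psi_0\rangle$ with $T'$ steps returns the estimate 0 with probability $\|\frac{1}{T'}\sum_{t=0}^{T'-1}(U\mathcal{O}_x)^t|\psi_0\rangle\|^2$, which agrees with the expression for $p_x$ used in Section 5.2.2. Since the Grover iterate is real, plain floating-point vectors suffice.

```python
import math


def grover_zero_phase_weight(x, steps):
    """Return || (1/steps) * sum_{t < steps} (U O_x)^t |psi0> ||^2 for the Grover iterate.

    x is a list of bits; O_x flips the sign of |j> when x[j] == 1, and
    U = 2|psi0><psi0| - I with |psi0> the uniform superposition.
    """
    n = len(x)
    amp = 1.0 / math.sqrt(n)
    state = [amp] * n          # holds (U O_x)^t |psi0>, starting at t = 0
    total = [0.0] * n          # running sum over t
    for _ in range(steps):
        total = [a + b for a, b in zip(total, state)]
        # Apply O_x: phase -1 on marked positions.
        state = [-s if bit else s for s, bit in zip(state, x)]
        # Apply U = 2|psi0><psi0| - I.
        overlap = sum(amp * s for s in state)
        state = [2.0 * overlap * amp - s for s in state]
    return sum((v / steps) ** 2 for v in total)


weight_unmarked = grover_zero_phase_weight([0] * 8, 64)
weight_marked = grover_zero_phase_weight([0, 0, 1, 0, 0, 0, 0, 0], 64)
```

For the all-zero input the weight stays at 1 up to rounding, reflecting $\Pi_0(x)|\psi_0\rangle = |\psi_0\rangle$; for an input with a marked element it decays as the number of steps grows, reflecting $\Pi_0(x)|\psi_0\rangle = 0$.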
    It is clear that a monotone phase estimation algorithm can only decide a monotone function.
However, while any quantum algorithm can be converted to a phase estimation algorithm, it is
not necessarily the case that any quantum algorithm for a monotone function can be turned into
a monotone phase estimation algorithm (see Remark 5.17). Thus lower bounds on the quantum
space complexity of any monotone phase estimation algorithm for a monotone 𝑓 do not imply
lower bounds on S𝑈 ( 𝑓 ). Nevertheless, if we let mS𝑈 ( 𝑓 ) represent the minimum quantum space
complexity of any monotone phase estimation algorithm for 𝑓 , then a lower bound on mS𝑈 ( 𝑓 )
at least tells us that if we want to compute 𝑓 with space less than said bound, we must use a
non-monotone phase estimation algorithm.
    Similarly, we let $\mathsf{mS}_U^1(f)$ denote the minimum quantum space complexity of any monotone
phase estimation algorithm with $\delta = 0$ that computes $f$ (with one-sided error).
    The main theorem of this section states that any monotone phase estimation algorithm for $f$
with space $S$ can be converted to a monotone span program of size $2^{\Theta(S)}$ that approximates $f$, so
that lower bounds on $\widetilde{\mathsf{mSP}}(f)$ imply lower bounds on $\mathsf{mS}_U(f)$; and that any monotone phase
estimation algorithm with $\delta = 0$ and space $S$ can be converted to a monotone span program
of size $2^{\Theta(S)}$ that decides $f$ (exactly), so that lower bounds on $\mathsf{mSP}(f)$ imply lower bounds on
$\mathsf{mS}_U^1(f)$. These conversions also preserve the query complexity. We now formally state this
main result.
Theorem 5.14. Let $\mathcal{A} = (U, |\psi_0\rangle, \delta, T, M)$ be a monotone phase estimation algorithm for $f$ with space
complexity $S = \log\dim\mathcal{H} + \log T + \log M + 1$ and query complexity $O(TM)$. Then there is a monotone
span program with complexity $O(TM)$ and size $2\dim\mathcal{H} \le 2^S$ that approximates $f$. If $\delta = 0$, then this
span program decides $f$ (exactly). Thus

\[ \mathsf{mS}_U(f) \ge \log \widetilde{\mathsf{mSP}}(f) \qquad\text{and}\qquad \mathsf{mS}_U^1(f) \ge \log \mathsf{mSP}(f). \]

   We prove this theorem in Section 5.2.1. As a corollary, lower bounds on $\mathsf{mSP}(f)$, such as the
one from [24], imply lower bounds on $\mathsf{mS}_U^1(f)$; and lower bounds on $\widetilde{\mathsf{mSP}}(f)$, such as the one in
Theorem 5.3, imply lower bounds on $\mathsf{mS}_U(f)$. In particular:
Corollary 5.15. Let $f : \{0,1\}^n \to \{0,1\}$ be the function described in Theorem 5.3. Then
$\mathsf{mS}_U(f) \ge (\log n)^{2-o(1)}$. Let $g : \{0,1\}^n \to \{0,1\}$ be the function described in Theorem 5.2. Then
$\mathsf{mS}_U^1(g) \ge \Omega(n)$.
    We emphasize that while this does not give a lower bound on the quantum space complexity
of $f$, or the one-sided quantum space complexity of $g$, it does show that any algorithm that uses
$(\log n)^c$ space to solve $f$ with bounded error, for $c < 2$, or $o(n)$ space to solve $g$ with one-sided
error, must be of a different form than that described in Definition 5.13.

    In a certain sense, monotone phase estimation algorithms completely characterize those
that can be derived from monotone span programs, because the algorithm we obtain from
compiling a monotone span program is a monotone phase estimation algorithm, as stated below
in Lemma 5.16. However, not all monotone phase estimation algorithms can be obtained by
compiling monotone span programs, and similarly, we might hope to show that an even larger
class of algorithms can be converted to monotone span programs, in order to give more strength
to lower bounds on mS𝑈 ( 𝑓 ).

Lemma 5.16. Let 𝑃 be an approximate monotone span program for 𝑓 with size 𝑆 and complexity 𝐶. Then
there is a monotone phase estimation algorithm for 𝑓 with query complexity 𝑂(𝐶) and space complexity
𝑂(log 𝑆 + log 𝐶).

Proof. Fix a monotone span program, and assume it has been appropriately scaled. Without
loss of generality, we can let $H_j = H_{j,1} = \mathrm{span}\{|j,z\rangle : z \in \mathcal{Z}_j\}$ for some finite set $\mathcal{Z}_j$. Then
$\mathcal{O}_x = I - 2\Pi_{H(x)}$, which is only true because the span program is monotone. Let $U = 2\Pi_{\mathrm{row}(A)} - I$.
Then $U\mathcal{O}_x = (2\Pi_{\ker(A)} - I)(2\Pi_{H(x)} - I)$ is the span program unitary, described in Section 3.2.
Then it is simple to verify that the algorithm described in [15, Lemma 3.6] (and referred to
in Section 3.2) is a phase estimation algorithm for $f$ with query complexity $O(C)$ and space
complexity $O(\log S + \log C)$.
    The algorithm is a monotone phase estimation algorithm because $U = 2\Pi_{\mathrm{row}(A)} - I$ is
a reflection, and $|\psi_0\rangle = |w_0\rangle = A^+|\tau\rangle$ is in the $(+1)$-eigenspace of $U$, $\mathrm{row}(A)$. Since $U$ is a
reflection, the $(+1)$-eigenspace of $U\mathcal{O}_x$ is exactly $(\ker(A) \cap H(x)) \oplus (\mathrm{row}(A) \cap H(x)^\perp)$, and so
$\Pi_0(x)|w_0\rangle \in \mathrm{row}(A) \cap H(x)^\perp \subset H(x)^\perp$.  □

Remark 5.17. We mention an example of a monotone function for which the best known quantum
algorithm, in terms of space complexity, is not a monotone phase estimation algorithm. Every
function can be expressed as a Boolean formula, and every monotone function can be expressed
as a monotone Boolean formula (a formula with no negation gates), but this might be much
larger than the smallest (non-monotone) formula for the function. For example, the function
XOR-SAT, defined in [13], can be computed by a circuit of depth $O((\log n)^2)$, which means it has
a formula of size $2^{O((\log n)^2)}$, but its monotone formula complexity is $2^{\Omega(n^{\varepsilon})}$ for some
constant $\varepsilon$.¹⁰

    For any Boolean formula of size $N$, there exists a quantum algorithm that can evaluate it using
$O(\sqrt{N})$ queries and $O(\log N)$ space [27, 18]. Since this algorithm is designed via span programs,
it is a phase estimation algorithm, and it is monotone if and only if the formula is monotone. For
a function for which there is a separation between the monotone and non-monotone formula
complexities, the smallest space quantum algorithm of this type will not be monotone. For
example, for XOR-SAT, we could use a quantum algorithm that evaluates a monotone formula
and has space complexity $n^{\varepsilon}$. This is a monotone phase estimation algorithm, but it is not
optimal. If we instead evaluate the optimal non-monotone formula, we get a quantum algorithm
(that is not monotone) with space complexity $(\log n)^2$. Of course, this does not rule out that
there could be some other space-optimal quantum algorithm for this problem that is a monotone
phase estimation algorithm.

   ¹⁰ If we pad XOR-SAT with 0s so that the input length goes from $n$ to $N = 2^{c(\log n)^2}$ for some
appropriate constant $c$, then the formula size becomes linear in $N$, while the monotone formula size is
still superpolynomial in $N$, scaling like $2^{2^{\varepsilon\sqrt{\log N}}}$. We thank Robert Robere for this observation.

5.2.1   Monotone algorithms to (approximate) monotone span programs
In this section, we prove Theorem 5.14. Throughout this section, we fix a phase estimation
algorithm $\mathcal{A} = (U, |\psi_0\rangle, \delta, T, M)$ that computes $f$, with $U$ acting on $\mathcal{H}$. For any $x \in \{0,1\}^n$ and
$\Theta \in [0, \pi]$, we let $\Pi_\Theta(x)$ denote the orthogonal projector onto the span of $e^{i\theta}$-eigenvectors of
$U\mathcal{O}_x$ for $|\theta| \le \Theta$. We will let $\Pi_x = \sum_{j \in [n], z \in \mathcal{Z} : x_j = 1} |j,z\rangle\langle j,z|$.
    We begin by drawing some conclusions about the necessary relationship between the
eigenspaces of $U\mathcal{O}_x$ and a function $f$ whenever a monotone phase estimation algorithm computes $f$.
The proofs are somewhat dry and are deferred to Section 5.2.2.

Lemma 5.18. Fix a phase estimation algorithm with $\delta = 0$ that solves $f$ with bounded error. Then if
$f(x) = 0$,
\[ \|\Pi_0(x)|\psi_0\rangle\|^2 \ge \frac{1}{M^2}, \]
and for any $d < \sqrt{8}/\pi$, if $f(x) = 1$, then
\[ \|\Pi_{d\pi/T}(x)|\psi_0\rangle\|^2 = 0, \]
and the algorithm always outputs 1, so it has one-sided error.

Lemma 5.19. Fix a phase estimation algorithm with $\delta \ne 0$ that solves $f$ with bounded error. Then there
is some constant $c > 0$ such that if $f(x) = 0$,
\[ \|\Pi_0(x)|\psi_0\rangle\|^2 \ge \max\{\delta(1 + c),\, 1/M^2\}, \]
and if $f(x) = 1$, for any $d < \sqrt{8}/\pi$,
\[ \|\Pi_{d\pi/T}(x)|\psi_0\rangle\|^2 \le \frac{\delta}{1 - \frac{d^2\pi^2}{8}}. \]



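   To prove Theorem 5.14, we will define a monotone span program $P_{\mathcal{A}}$ as follows:

\[
\begin{aligned}
H_{\mathrm{true}} &= \mathrm{span}\{|j,z\rangle : j \in [n], z \in \mathcal{Z}\} = \mathcal{H} \\
H_{j,1} = H_j &= \mathrm{span}\{|j,z,1\rangle : z \in \mathcal{Z}\} \\
A|j,z,1\rangle &= \tfrac{1}{2}\big(|j,z\rangle - (-1)^1|j,z\rangle\big) = |j,z\rangle \\
A|j,z\rangle &= (I - U^\dagger)|j,z\rangle \\
|\tau\rangle &= |\psi_0\rangle.
\end{aligned}
\tag{5.1}
\]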

   We first show that $\Pi_0(x)|\psi_0\rangle$ is (up to scaling) a negative witness for $x$, whenever it is
nonzero:

Lemma 5.20. For any $x \in \{0,1\}^n$, we have

\[ w_-(x) = \frac{1}{\|\Pi_0(x)|\psi_0\rangle\|^2}. \]

In particular, $\Pi_0(x)|\psi_0\rangle / \|\Pi_0(x)|\psi_0\rangle\|^2$ is an optimal negative witness for $x$ when $\Pi_0(x)|\psi_0\rangle \ne 0$.

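Proof. Suppose $\Pi_0(x)|\psi_0\rangle \ne 0$, and let $|\omega\rangle = \Pi_0(x)|\psi_0\rangle / \|\Pi_0(x)|\psi_0\rangle\|^2$. We will first show that
this is a negative witness, and then show that no negative witness can have better complexity.
First, we notice that
\[ \langle\omega|\tau\rangle = \langle\omega|\psi_0\rangle = \frac{\langle\psi_0|\Pi_0(x)|\psi_0\rangle}{\|\Pi_0(x)|\psi_0\rangle\|^2} = 1. \]
Next, we will see that $\langle\omega|A\Pi_{H(x)} = 0$. By the monotone phase estimation property,
$\mathcal{O}_x\Pi_0(x)|\psi_0\rangle = \Pi_0(x)|\psi_0\rangle$, and so $\mathcal{O}_x|\omega\rangle = |\omega\rangle$, and thus $\Pi_x|\omega\rangle = 0$, where $\Pi_x$ is the
projector onto $|j,z\rangle$ such that $x_j = 1$. Note that
$H(x) = \mathrm{span}\{|j,z,1\rangle : x_j = 1, z \in \mathcal{Z}\} \oplus \mathrm{span}\{|j,z\rangle : j \in [n], z \in \mathcal{Z}\}$. Thus
$\Pi_{H(x)} = \Pi_{H_{\mathrm{true}}} + \Pi_x \otimes |1\rangle\langle 1|$. We have:

\[ \langle\omega|A(\Pi_x \otimes |1\rangle\langle 1|) = \langle\omega|\Pi_x = 0. \]

Since $|\omega\rangle$ is in the $(+1)$-eigenspace of $U\mathcal{O}_x$, we have $U\mathcal{O}_x|\omega\rangle = |\omega\rangle$, so since $\mathcal{O}_x|\omega\rangle = |\omega\rangle$,
$U|\omega\rangle = |\omega\rangle$. Thus

\[ \langle\omega|A\Pi_{H_{\mathrm{true}}} = \langle\omega|(I - U^\dagger) \otimes \langle 1| = (\langle\omega| - \langle\omega|) \otimes \langle 1| = 0. \]

Thus $|\omega\rangle$ is a zero-error negative witness for $x$. Next, we argue that it is optimal.
   Suppose $|\omega\rangle$ is any optimal negative witness for $x$, with size $w_-(x)$. Then since
$\langle\omega|\Pi_x = \langle\omega|A(\Pi_x \otimes |1\rangle\langle 1|)$ must be 0, $\mathcal{O}_x|\omega\rangle = (I - 2\Pi_x)|\omega\rangle = |\omega\rangle$, and since
$\langle\omega|A\Pi_{H_{\mathrm{true}}} = \langle\omega|(I - U^\dagger)$ must be 0, $U|\omega\rangle = |\omega\rangle$. Thus $|\omega\rangle$ is a 1-eigenvector of $U\mathcal{O}_x$, so
\[ \|\Pi_0(x)|\psi_0\rangle\|^2 \ge \left\| \frac{|\omega\rangle\langle\omega|}{\||\omega\rangle\|^2}|\psi_0\rangle \right\|^2 = \frac{|\langle\omega|\psi_0\rangle|^2}{\||\omega\rangle\|^2} = \frac{1}{\||\omega\rangle\|^2}. \]

We complete the proof by noticing that since $\langle\omega|A\Pi_{H_{\mathrm{true}}} = 0$, we have $\langle\omega|A = \langle\omega|\langle 1|$, and
$w_-(x) = \|\langle\omega|A\|^2 = \||\omega\rangle\|^2$.  □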

   Next we find approximate positive witnesses.

Lemma 5.21. For any $\Theta \ge 0$, the span program $P_{\mathcal{A}}$ has approximate positive witnesses for any $x$ with
error at most $\|\Pi_\Theta(x)|\psi_0\rangle\|^2$ and complexity at most $\frac{5\pi^2}{4\Theta^2}$.

Proof. We first define a vector $|v\rangle$ by:

\[ |v\rangle = (I - (U\mathcal{O}_x)^\dagger)^+ (I - \Pi_\Theta(x))|\psi_0\rangle. \]

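Note that $I - (U\mathcal{O}_x)^\dagger$ is supported everywhere except the $(+1)$-eigenvectors of $(U\mathcal{O}_x)^\dagger$, which
are exactly the $(+1)$-eigenvectors of $U\mathcal{O}_x$. Thus, $(I - \Pi_\Theta(x))|\psi_0\rangle$ is contained in this support.
    Next we define
\[ |w\rangle = \big(|\psi_0\rangle - (I - U^\dagger)|v\rangle\big)|1\rangle + |v\rangle. \]

Then we have:

\[ A|w\rangle = |\psi_0\rangle - (I - U^\dagger)|v\rangle + (I - U^\dagger)|v\rangle = |\psi_0\rangle = |\tau\rangle. \]

So $|w\rangle$ is a positive witness, and we next compute its error for $x$:
\[
\begin{aligned}
\|\Pi_{H(x)^\perp}|w\rangle\|^2 &= \big\|\Pi_{\bar{x}}\big(|\psi_0\rangle - (I - U^\dagger)|v\rangle\big)\big\|^2 \\
&= \big\|\Pi_{\bar{x}}|\psi_0\rangle - \Pi_{\bar{x}}(I - U^\dagger)(I - (U\mathcal{O}_x)^\dagger)^+(I - \Pi_\Theta(x))|\psi_0\rangle\big\|^2.
\end{aligned}
\]

Above, $\Pi_{\bar{x}} = I - \Pi_x$. We now observe that
\[ \Pi_{\bar{x}}(I - \mathcal{O}_xU^\dagger) = \Pi_{\bar{x}}\big(\Pi_{\bar{x}} - (\Pi_{\bar{x}} - \Pi_x)U^\dagger\big) = \Pi_{\bar{x}}(I - U^\dagger). \]

Thus, continuing from above, we have:
\[
\begin{aligned}
\|\Pi_{H(x)^\perp}|w\rangle\|^2 &= \big\|\Pi_{\bar{x}}|\psi_0\rangle - \Pi_{\bar{x}}(I - \mathcal{O}_xU^\dagger)(I - \mathcal{O}_xU^\dagger)^+(I - \Pi_\Theta(x))|\psi_0\rangle\big\|^2 \\
&= \|\Pi_{\bar{x}}|\psi_0\rangle - \Pi_{\bar{x}}(I - \Pi_\Theta(x))|\psi_0\rangle\|^2 = \|\Pi_{\bar{x}}\Pi_\Theta(x)|\psi_0\rangle\|^2 \\
&\le \|\Pi_\Theta(x)|\psi_0\rangle\|^2.
\end{aligned}
\]

   Now we compute the complexity of $|w\rangle$. First, let $U\mathcal{O}_x = \sum_j e^{i\theta_j}|\lambda_j\rangle\langle\lambda_j|$ be the eigenvalue
decomposition of $U\mathcal{O}_x$. Then
\[
(I - (U\mathcal{O}_x)^\dagger)^+ = \sum_{j : \theta_j \ne 0} \frac{1}{1 - e^{-i\theta_j}}|\lambda_j\rangle\langle\lambda_j|
\qquad\text{and}\qquad
I - \Pi_\Theta(x) = \sum_{j : |\theta_j| > \Theta} |\lambda_j\rangle\langle\lambda_j|.
\]

   We can thus bound $\||v\rangle\|^2$:
\[
\begin{aligned}
\||v\rangle\|^2 &= \big\|(I - (U\mathcal{O}_x)^\dagger)^+(I - \Pi_\Theta(x))|\psi_0\rangle\big\|^2
= \Big\|\sum_{j : |\theta_j| > \Theta} \frac{1}{1 - e^{-i\theta_j}}\langle\lambda_j|\psi_0\rangle|\lambda_j\rangle\Big\|^2 \\
&= \sum_{j : |\theta_j| > \Theta} \frac{1}{4\sin^2\frac{\theta_j}{2}}|\langle\lambda_j|\psi_0\rangle|^2 \le \frac{\pi^2}{4\Theta^2}.
\end{aligned}
\]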



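Next, using $\mathcal{O}_x + 2\Pi_x = I - 2\Pi_x + 2\Pi_x = I$, we compute
\[
\begin{aligned}
\big\||\psi_0\rangle - (I - U^\dagger)|v\rangle\big\|^2
&= \big\||\psi_0\rangle - (I - \mathcal{O}_xU^\dagger - 2\Pi_xU^\dagger)(I - \mathcal{O}_xU^\dagger)^+(I - \Pi_\Theta(x))|\psi_0\rangle\big\|^2 \\
&= \big\||\psi_0\rangle - (I - \Pi_\Theta(x))|\psi_0\rangle + 2\Pi_xU^\dagger(I - (U\mathcal{O}_x)^\dagger)^+(I - \Pi_\Theta(x))|\psi_0\rangle\big\|^2 \\
&\le \Big(\|\Pi_\Theta(x)|\psi_0\rangle\| + 2\Big\|\Pi_xU^\dagger\sum_{j : |\theta_j| > \Theta}\frac{1}{1 - e^{-i\theta_j}}\langle\lambda_j|\psi_0\rangle|\lambda_j\rangle\Big\|\Big)^2 \\
&\le \Big(\|\Pi_\Theta(x)|\psi_0\rangle\| + 2\sqrt{\sum_{j : |\theta_j| > \Theta}\frac{1}{4\sin^2\frac{\theta_j}{2}}|\langle\lambda_j|\psi_0\rangle|^2}\Big)^2 \\
&\le \Big(\|\Pi_\Theta(x)|\psi_0\rangle\| + \frac{\pi}{\Theta}\|(I - \Pi_\Theta(x))|\psi_0\rangle\|\Big)^2 \le \frac{\pi^2}{\Theta^2}.
\end{aligned}
\]
Then we have the complexity of $|w\rangle$,
\[
\||w\rangle\|^2 = \big\||\psi_0\rangle - (I - U^\dagger)|v\rangle\big\|^2 + \||v\rangle\|^2
\le \frac{\pi^2}{\Theta^2} + \frac{\pi^2}{4\Theta^2} = \frac{5\pi^2}{4\Theta^2}.  \qquad □
\]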
   We conclude with the following two corollaries, whose combination gives Theorem 5.14.
Corollary 5.22. Let $\mathcal{A} = (U, |\psi_0\rangle, 0, T, M)$ be a monotone phase estimation algorithm for $f$ with space
complexity $S = \log\dim\mathcal{H} + \log T + \log M + 1$ and query complexity $O(TM)$. Then there is a monotone
span program that decides $f$ (exactly) whose size is $2\dim\mathcal{H} \le 2^S$ and whose complexity is $O(TM)$.

Proof. If $f(x) = 0$, then by Lemma 5.18, we have $\|\Pi_0(x)|\psi_0\rangle\|^2 \ge \frac{1}{M^2}$, so by Lemma 5.20,
$w_-(x) \le M^2$. Thus $W_- \le M^2$.
     If $f(x) = 1$, then by Lemma 5.18 (taking $d = 2/\pi < \sqrt{8}/\pi$), we have $\|\Pi_{2/T}(x)|\psi_0\rangle\|^2 = 0$, so by
Lemma 5.21, there is an exact positive witness for $x$ with complexity $O(T^2)$. Thus $W_+ \le O(T^2)$, and so
the span program $P_{\mathcal{A}}$ from (5.1) has complexity $O(TM)$. The size of the span program $P_{\mathcal{A}}$ is
$\dim H = 2\dim\mathcal{H}$.  □

Corollary 5.23. Let $\mathcal{A} = (U, |\psi_0\rangle, \delta, T, M)$ be a monotone phase estimation algorithm for $f$ with
space complexity $S = \log\dim\mathcal{H} + \log T + \log M + 1$ and query complexity $O(TM)$. Then there is a
constant $\kappa \in (0, 1)$ such that there exists a monotone span program that $\kappa$-approximates $f$ whose size is
$2\dim\mathcal{H} \le 2^S$ and whose complexity is $O(TM)$.

Proof. If $f(x) = 0$, then by Lemma 5.19, we have $\|\Pi_0(x)|\psi_0\rangle\|^2 > \delta(1 + c)$ for some constant $c > 0$.
Thus, by Lemma 5.20, $W_- \le \frac{1}{(1+c)\delta}$.
   If $f(x) = 1$, then by Lemma 5.21, setting $\Theta = d\pi/T$ for $d = \frac{2}{\pi}\sqrt{\frac{c}{1+c}}$ (where $c$ is the constant
from above), there is an approximate positive witness for $x$ with error at most
\[ e_x = \Big\|\Pi_{2\sqrt{\frac{c}{1+c}}/T}(x)|\psi_0\rangle\Big\|^2 \]
and complexity $O(T^2)$. By Lemma 5.19, we have
\[
e_x \le \frac{\delta}{1 - \frac{d^2\pi^2}{8}} = \frac{\delta}{1 - \frac{c}{2(1+c)}} = \frac{\delta(1 + c)}{1 + c - c/2} \le \frac{1}{1 + c/2}\cdot\frac{1}{W_-}.
\]

Thus, letting $\kappa = \frac{1}{1 + c/2} < 1$, we have that $P_{\mathcal{A}}$ $\kappa$-approximates $f$. Since the positive witness
complexity is $O(T^2)$, and by Lemma 5.19, we also have $W_- \le O(M^2)$, the complexity of $P_{\mathcal{A}}$ is
$O(TM)$. The size of $P_{\mathcal{A}}$ is $\dim H = 2\dim\mathcal{H}$.  □
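
The choice $d = \frac{2}{\pi}\sqrt{c/(1+c)}$ in the proof above is what makes the denominator $1 - d^2\pi^2/8$ collapse to $1 - \frac{c}{2(1+c)}$. The following Python sketch reproduces that arithmetic for a few sample values of $c$; the function name and the sample values are assumptions made here, since the proof only asserts that some constant $c > 0$ exists.

```python
import math


def approximation_constants(c: float):
    """Return (d, shrink, kappa) for the constant c > 0 from Lemma 5.19.

    d      = (2/pi) * sqrt(c / (1 + c)), the value used to set Theta = d*pi/T;
    shrink = 1 - d^2 pi^2 / 8, the denominator in the Lemma 5.19 bound;
    kappa  = 1 / (1 + c/2), the resulting approximation factor.
    """
    if c <= 0:
        raise ValueError("c must be positive")
    d = (2 / math.pi) * math.sqrt(c / (1 + c))
    shrink = 1 - d ** 2 * math.pi ** 2 / 8
    kappa = 1 / (1 + c / 2)
    return d, shrink, kappa


# Hypothetical sample values of c, for illustration only.
samples = {c: approximation_constants(c) for c in (0.1, 1.0, 5.0)}
```

For each sample, $d < \sqrt{8}/\pi$ as Lemma 5.19 requires, and $\frac{1}{(1+c)\cdot\mathrm{shrink}}$ agrees with $\kappa$, which is the chain of equalities used in the proof.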

5.2.2   Proofs of Lemma 5.18 and Lemma 5.19
We will prove the lemmas as a collection of claims. Fix $T' \ge T$ and $M' \ge M$ with which to
run the algorithm. Suppose $\Phi(x)$ outputs $|\psi(x)\rangle = \sqrt{p_x}|0\rangle_A|\Phi_0(x)\rangle + \sqrt{1 - p_x}|1\rangle_A|\Phi_1(x)\rangle$, and
let $\tilde{p}$ denote the estimate output by the algorithm. We will let
$U\mathcal{O}_x = \sum_j e^{i\sigma_j(x)}|\lambda_j^x\rangle\langle\lambda_j^x|$ be an eigenvalue decomposition.

Claim 5.24. If $f(x) = 0$ then $\|\Pi_0(x)|\psi_0\rangle\|^2 \ge \frac{1}{M^2}$.
Proof. Since the algorithm computes $f$ with bounded error, the probability of accepting $x$ is at
most 1/3, so $\tilde{p} \le \delta$ with probability at most 1/3.
   Amplitude estimation is just phase estimation of a unitary $W_\Phi$ such that $|\psi(x)\rangle$ is in the
span of $e^{\pm 2i\theta_x}$-eigenvectors of $W_\Phi$, where $p_x = \sin^2\theta_x$, $\theta_x \in [0, \pi/2)$ [7]. One can show that the
probability of outputting an estimate $\tilde{p} = 0$ is $\sin^2(M'\theta_x)/(M'^2\sin^2(\theta_x))$, so

\[ \frac{1}{3} \ge \frac{\sin^2(M'\theta_x)}{M'^2\sin^2(\theta_x)}. \]
If $M'\theta_x \le \frac{\pi}{2}$, then this would give:

\[ \frac{1}{3} \ge \frac{(2M'\theta_x/\pi)^2}{M'^2\theta_x^2} = \frac{4}{\pi^2}, \]

which is a contradiction. Thus, we have:
\[
M'\theta_x > \frac{\pi}{2} \;\Rightarrow\; \frac{2\theta_x}{\pi} > \frac{1}{M'} \;\Rightarrow\; \sin\theta_x > \frac{1}{M'} \;\Rightarrow\; \sqrt{p_x} > \frac{1}{M'}.
\]
    Since $\Phi(x)$ is the result of running phase estimation, we have
\[
p_x = \sum_j |\langle\lambda_j^x|\psi_0\rangle|^2 \frac{\sin^2(T'\sigma_j(x)/2)}{T'^2\sin^2(\sigma_j(x)/2)} \le \|\Pi_\Theta(x)|\psi_0\rangle\|^2 + \frac{\pi^2}{T'^2\Theta^2},
\]

for any $\Theta$. In particular, if $\Delta$ is less than the spectral gap of $U\mathcal{O}_x$, we have
$\|\Pi_\Delta(x)|\psi_0\rangle\| = \|\Pi_0(x)|\psi_0\rangle\|$, so
\[ \frac{1}{M'^2} < \|\Pi_0(x)|\psi_0\rangle\|^2 + \frac{\pi^2}{T'^2\Delta^2}. \]

This is true for any choices $T' \ge T$ and $M' \ge M$, so we must have:

\[ \frac{1}{M^2} \le \|\Pi_0(x)|\psi_0\rangle\|^2.  \qquad □ \]
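
The proof of Claim 5.24 uses the closed form $\sin^2(M'\theta)/(M'^2\sin^2\theta)$ for the probability that amplitude estimation returns 0, and the fact that this is at least $4/\pi^2$ whenever $M'\theta \le \pi/2$. The Python sketch below compares the closed form with the direct sum $|\frac{1}{M'}\sum_{t=0}^{M'-1} e^{2it\theta}|^2$; the function names and the sample values of $M'$ and $\theta$ are assumptions chosen here for illustration.

```python
import cmath
import math


def zero_outcome_probability(m_steps: int, theta: float) -> float:
    """Return |(1/M') * sum_{t < M'} e^{2 i t theta}|^2 computed term by term."""
    total = sum(cmath.exp(2j * t * theta) for t in range(m_steps))
    return abs(total / m_steps) ** 2


def closed_form(m_steps: int, theta: float) -> float:
    """Return sin^2(M' theta) / (M'^2 sin^2(theta)); requires sin(theta) != 0."""
    return math.sin(m_steps * theta) ** 2 / (m_steps ** 2 * math.sin(theta) ** 2)


# Hypothetical sample points with M' * theta <= pi / 2.
checks = [(10, 0.01), (10, 0.1), (10, math.pi / 20)]
probabilities = [zero_outcome_probability(m, th) for m, th in checks]
```

On these sample points both expressions agree up to rounding and stay above $4/\pi^2 \approx 0.405 > 1/3$, which is the contradiction the proof exploits.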
Claim 5.25. If $f(x) = 1$ and $\delta = 0$, then for any $d < \sqrt{8}/\pi$, $\|\Pi_{d\pi/T}(x)|\psi_0\rangle\|^2 = 0$.

Proof. Suppose towards a contradiction that $\|\Pi_{d\pi/T}(x)|\psi_0\rangle\|^2 > 0$. Then $p_x > 0$, and some
sufficiently large $M' \ge M$ would detect this and cause the algorithm to output 0, so we must
actually have $\|\Pi_{d\pi/T}(x)|\psi_0\rangle\|^2 = 0$. In fact, in order to be sure that no large enough value $M'$ detects
amplitude $> 0$ on $|0\rangle_A$, we must have $p_x = 0$ whenever $f(x) = 1$. That means that when $f(x) = 1$,
the algorithm never outputs 0, so the algorithm has one-sided error.  □

Claim 5.26. There is some constant 𝑐 such that if 𝑓 (𝑥) = 0 and 𝛿 > 0 then kΠ0 (𝑥)|𝜓0 ik 2 > 𝛿(1 + 𝑐).

Proof. Recall that 𝑝˜ ∈ {sin2 (𝜋𝑚/𝑀 0) : 𝑚 = 0, . . . , 𝑀 0 − 1}. We will restrict our attention to
choices 𝑀 0 such that for some integer 𝑑,

                                                    𝑑𝜋            (𝑑 + 1/3)𝜋
                                             sin2      ≤ 𝛿 < sin2            .
                                                    𝑀0                𝑀0

To see that such a choice exists, let 𝜏 be such that 𝛿 = sin2 𝜏, and note that the condition holds as
long as 𝑑 ≤ 𝜏𝑀
                  0                                                              3𝜏𝑀 0
                 𝜋 < 𝑑 + 1/3 for some 𝑑, which is equivalent to saying that b 𝜋 c = 0 mod 3. If
         𝜋
𝐾 = b 21 3𝜏 c, then for any 𝑀 0 ≥ 𝑀, and ℓ ≥ 0, define:

                                                             𝑀ℓ = 𝑀 0 + ℓ 𝐾.

Then for any ℓ > 0,
                                                                                               
                                       3𝜏      3𝜏         3𝜏    1 3𝜏 1
                                          𝑀ℓ −    𝑀ℓ −1 =    𝐾∈   −  ,  ,
                                       𝜋       𝜋          𝜋     2   𝜋 2

so there must be one ℓ ∈ {0, . . . , 6} such that b 3𝜏
                                                    𝜋 𝑀ℓ e = 0 mod 3. In particular, there is some
choice 𝑀ℓ satisfying the condition such that (using some 𝑀 0 ≤ √1 ):
                                                                                                𝛿

                            √                 √          𝜋      𝜋 sin 𝜏
                                                                       
                                                     1
                                  𝛿𝑀ℓ ≤           𝛿 √ +6    =1+         ≤ 1 + 𝜋.                                      (5.2)
                                                      𝛿  6𝜏        𝜏

We will use this value as our 𝑀 0 for the remainder of this proof.
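   Let $p_x = \sin^2\theta_x$ for $\theta_x \in [0, \pi/2]$. Let $z$ be an integer such that $\Delta = \theta_x - \pi z/M'$ has $|\Delta| \le \frac{\pi}{2M'}$.
Then the outcome $\tilde{p} = \sin^2\frac{\pi z}{M'}$ has probability:

\[ \frac{1}{M'^2}\left|\sum_{t=0}^{M'-1} e^{i2t(\theta_x - \pi z/M')}\right|^2 = \frac{1}{M'^2}\left|\sum_{t=0}^{M'-1} e^{i2t\Delta}\right|^2 = \frac{\sin^2(M'\Delta)}{M'^2\sin^2\Delta} \ge \frac{4}{\pi^2}, \]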


since $|M'\Delta| \le \frac{\pi}{2}$. Thus, by correctness, we must have $\sin^2(\pi z/M') > \delta \ge \sin^2\frac{d\pi}{M'}$. Thus $z > d$, so

\[ \frac{(d+1)\pi}{M'} \le \frac{z\pi}{M'} = \theta_x - \Delta \le \theta_x + \frac{\pi}{2M'}. \]

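Thus:

\begin{align*}
\frac{(d+1/3)\pi}{M'} + \frac{2\pi}{3M'} &\le \theta_x + \frac{\pi}{2M'} \\
\sin\left(\frac{(d+1/3)\pi}{M'} + \frac{\pi}{6M'}\right) &\le \sin\theta_x \\
\sin\frac{(d+1/3)\pi}{M'}\cos\frac{\pi}{6M'} + \cos\frac{(d+1/3)\pi}{M'}\sin\frac{\pi}{6M'} &\le \sqrt{p_x} \\
\sqrt{\delta}\sqrt{1 - \sin^2\frac{\pi}{6M'}} + \sqrt{1-\delta}\,\sin\frac{\pi}{6M'} &\le \sqrt{p_x}.
\end{align*}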


When $\sin^2\frac{\pi}{6M'} \le 1 - \delta$, which we can assume, the above expression is minimized when $\sin^2\frac{\pi}{6M'}$
is as small as possible. We have, using $M' \le \frac{1+\pi}{\sqrt{\delta}}$, from (5.2):

\[ \sin^2\frac{\pi}{6M'} \ge \frac{4}{36M'^2} \ge \frac{\delta}{9(1+\pi)^2}. \]

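Thus, continuing from above, letting $k = \frac{1}{9(1+\pi)^2}$, we have:

\begin{align*}
\sqrt{\delta}\sqrt{1 - k\delta} + \sqrt{1-\delta}\sqrt{k\delta} &\le \sqrt{p_x} \\
\delta(1 - k\delta) + (1-\delta)k\delta + 2\delta\sqrt{k(1-\delta)(1-k\delta)} &\le p_x.
\end{align*}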



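Next, notice that $(1 - k\delta)(1 - \delta)$ is minimized when $\delta = \frac{1+k}{2k}$, but $\delta \le \frac{1}{2} < \frac{1+k}{2k}$, so we have, using
$k < 1$ and $\delta \le 1/2$:

\begin{align*}
\delta\left(1 + k(1 - 2\delta) + 2\sqrt{k}\sqrt{(1 - k/2)(1 - 1/2)}\right) &\le p_x \\
\delta(1 + 0 + \sqrt{k}) &\le p_x.
\end{align*}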
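
The simplification from $\sqrt{\delta}\sqrt{1-k\delta} + \sqrt{1-\delta}\sqrt{k\delta} \le \sqrt{p_x}$ to $\delta(1+\sqrt{k}) \le p_x$ can be checked numerically over the range $\delta \in (0, 1/2]$. The sketch below is only an illustration under stated assumptions: the function names and the constant name `K_CONST` are hypothetical, and the grid of sample values of $\delta$ is arbitrary. It compares the square of the left-hand side with the simplified bound.

```python
import math

# k = 1 / (9 * (1 + pi)^2), as defined in the proof (the name K_CONST is illustrative).
K_CONST = 1 / (9 * (1 + math.pi) ** 2)

def lower_bound_on_p(delta, k=K_CONST):
    """Square of sqrt(delta)*sqrt(1 - k*delta) + sqrt(1 - delta)*sqrt(k*delta)."""
    root = (math.sqrt(delta) * math.sqrt(1 - k * delta)
            + math.sqrt(1 - delta) * math.sqrt(k * delta))
    return root ** 2

def simplified_bound(delta, k=K_CONST):
    """The simplified lower bound delta * (1 + sqrt(k))."""
    return delta * (1 + math.sqrt(k))

def check_grid(steps=500):
    """Return True if the simplified bound is dominated on delta = i/(2*steps), i = 1..steps."""
    return all(
        lower_bound_on_p(i / (2 * steps)) >= simplified_bound(i / (2 * steps))
        for i in range(1, steps + 1)
    )
```

Calling `check_grid()` evaluates both expressions on an evenly spaced grid in $(0, 1/2]$; it is a consistency check of the algebra, not a substitute for the argument in the proof.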

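   Since $\Phi(x)$ is the result of running phase estimation of $U\mathcal{O}_x$ for $T' \ge T$ steps, we have:

\[ p_x = \sum_j |\langle\lambda_j^x|\psi_0\rangle|^2 \, \frac{\sin^2\left(\frac{T'\sigma_j(x)}{2}\right)}{(T')^2\sin^2\left(\frac{\sigma_j(x)}{2}\right)}, \]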



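so in particular, for any $\Theta \in [0, \pi)$, we have

\begin{align*}
p_x &\le \|\Pi_\Theta(x)|\psi_0\rangle\|^2 + \sum_{j : |\sigma_j(x)| > \Theta} |\langle\lambda_j^x|\psi_0\rangle|^2 \, \frac{1}{(T')^2\sin^2\left(\frac{\Theta}{2}\right)} \\
&\le \|\Pi_\Theta(x)|\psi_0\rangle\|^2 + \|(I - \Pi_\Theta(x))|\psi_0\rangle\|^2 \, \frac{\pi^2}{(T')^2\Theta^2}.
\end{align*}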
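
The tail bound above rests on the fact that each eigenphase $\sigma$ with $\Theta < |\sigma| \le \pi$ contributes weight at most $\pi^2/((T')^2\Theta^2)$. A minimal Python sketch of this check follows, under the assumption that eigenphases are taken in $(-\pi, \pi]$; the function names `pe_weight` and `tail_bound` are hypothetical and introduced only for illustration.

```python
import math

def pe_weight(T, sigma):
    """sin^2(T*sigma/2) / (T^2 * sin^2(sigma/2)): weight of eigenphase sigma in p_x.

    The value at sigma = 0 is taken to be the limit, 1.
    """
    if sigma == 0:
        return 1.0
    return math.sin(T * sigma / 2) ** 2 / (T ** 2 * math.sin(sigma / 2) ** 2)

def tail_bound(T, theta_cut):
    """pi^2 / (T^2 * Theta^2), the bound applied to eigenphases with |sigma| > Theta."""
    return math.pi ** 2 / (T ** 2 * theta_cut ** 2)
```

For any number of steps `T` and cutoff `theta_cut`, one can compare `pe_weight(T, sigma)` against `tail_bound(T, theta_cut)` for sample values of `sigma` with `theta_cut < abs(sigma) <= math.pi`.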

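In particular, for any $\Theta < \Delta$ where $\Delta$ is the spectral gap of $U\mathcal{O}_x$, we have $\|\Pi_\Theta(x)|\psi_0\rangle\| =
\|\Pi_0(x)|\psi_0\rangle\|$, so for any $T' \ge T$, we have

\[ \|\Pi_0(x)|\psi_0\rangle\|^2 + \frac{\pi^2}{(T')^2\Delta^2} \ge p_x \ge \delta(1 + \sqrt{k}). \]

Since this holds for any $T' \ge T$, we get

\[ \|\Pi_0(x)|\psi_0\rangle\|^2 \ge \delta(1 + \sqrt{k}). \]

The proof is completed by letting $c = \sqrt{k}$. $\square$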

Claim 5.27. If $f(x) = 1$ and $\delta > 0$ then $\|\Pi_{d\pi/T}(x)|\psi_0\rangle\|^2 (1 - d^2\pi^2/8) \le \delta$.


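Proof. If $|\lambda\rangle$ is an $e^{i\theta}$-eigenvector of $U\mathcal{O}_x$ for some $|\theta| \le d\pi/T < \sqrt{8}/T$, then the probability of
measuring 0 in the phase register upon performing $T$ steps of phase estimation is:

\[ p_x(\theta) := \frac{1}{T^2}\left|\sum_{t=0}^{T-1} e^{it\theta}\right|^2 = \frac{\sin^2\frac{T\theta}{2}}{T^2\sin^2\frac{\theta}{2}}. \]

Let $\varepsilon(x) = 1 - \frac{\sin^2 x}{x^2}$ for any $x$. It is simple to verify that $\varepsilon(x) \le x^2/2$ for any $x$, and $\varepsilon(x) \in [0, 1]$
for any $x$. So we have:

\[ p_x(\theta) \ge \frac{(T\theta/2)^2(1 - \varepsilon(T\theta/2))}{T^2(\theta/2)^2(1 - \varepsilon(\theta/2))} \ge 1 - \varepsilon(T\theta/2) \ge 1 - \frac{T^2\theta^2}{8}. \]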
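
The lower bound $p_x(\theta) \ge 1 - T^2\theta^2/8$ can be checked directly from the defining sum. The following Python sketch is an illustration only: the function names are hypothetical, and the probability is computed by summing the phases explicitly rather than through the closed form, so that $\theta = 0$ needs no special case.

```python
import cmath
import math

def prob_zero_phase(T, theta):
    """|(1/T) * sum_{t<T} exp(i*t*theta)|^2: probability of measuring 0 in the phase register."""
    total = sum(cmath.exp(1j * t * theta) for t in range(T))
    return abs(total) ** 2 / T ** 2

def quadratic_lower_bound(T, theta):
    """The lower bound 1 - T^2 * theta^2 / 8 obtained at the end of the chain of inequalities."""
    return 1 - (T * theta) ** 2 / 8
```

Evaluating both functions for sample values with $|\theta| < \sqrt{8}/T$, the range considered in the proof, compares the exact probability with the quadratic bound.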

Thus, we conclude that

\[ p_x \ge \|\Pi_{d\pi/T}(x)|\psi_0\rangle\|^2\left(1 - \frac{T^2 d^2\pi^2}{8T^2}\right) = \|\Pi_{d\pi/T}(x)|\psi_0\rangle\|^2\left(1 - \frac{d^2\pi^2}{8}\right). \]

If this is $> \delta$, then with some sufficiently large $M' \ge M$, amplitude estimation would detect this
and cause the algorithm to output 0 with high probability. $\square$


Acknowledgements
I am grateful to Tsuyoshi Ito for discussions that led to the construction of approximate span
programs from two-sided error quantum algorithms presented in Section 3.3, and to Alex
B. Grilo and Mario Szegedy for insightful comments. I am grateful to Robin Kothari for pointing
out the improved separation between certificate complexity and approximate degree in [8],
which led to an improvement from $(\log n)^{7/6}$ (using [1]) to $(\log n)^{2-o(1)}$ in Theorem 5.3. I thank
Robert Robere for pointing me to a separation between formula size and monotone formula
size for XOR-SAT. Finally, I thank the anonymous reviewers, whose feedback has improved the
presentation of these results.


References
 [1] Scott Aaronson, Shalev Ben-David, and Robin Kothari: Separations in query
     complexity using cheat sheets. In Proc. 48th STOC, pp. 863–876. ACM Press, 2016.
     [doi:10.1145/2897518.2897644, arXiv:1511.01937, ECCC:TR15-175] 6, 46

 [2] Noga Alon, Troy Lee, Adi Schraibman, and Santosh Vempala: The approximate rank of a
     matrix and its algorithmic applications. In Proc. 45th STOC, pp. 675–684. ACM Press, 2013.
     [doi:10.1145/2488608.2488694, ECCC:TR12-169] 8

 [3] Andris Ambainis: Quantum lower bounds by quantum arguments. J. Comput. System
     Sci., 64(4):750–767, 2002. Preliminary version in STOC’00. [doi:10.1006/jcss.2002.1826,
     arXiv:quant-ph/0002066] 11

 [4] Sanjeev Arora and Boaz Barak: Computational Complexity: A Modern Approach. Cambridge
     Univ. Press, 2009. Book. 4

 [5] László Babai, Anna Gál, and Avi Wigderson: Superpolynomial lower bounds for monotone
     span programs. Combinatorica, 19(3):301–319, 1999. [doi:10.1007/s004930050058] 4, 5, 25

 [6] Howard Barnum, Michael E. Saks, and Mario Szegedy: Quantum query complexity and
     semi-definite programming. In Proc. 18th IEEE Conf. on Comput. Complexity (CCC’03), pp.
     179–193. IEEE Comp. Soc., 2003. [doi:10.1109/CCC.2003.1214419] 11

 [7] Gilles Brassard, Peter Høyer, Michele Mosca, and Alain Tapp: Quantum amplitude
     amplification and estimation. In Samuel J. Lomonaco and Howard E. Brandt, editors,
     Quantum Computation and Quantum Information: A Millennium Volume, pp. 53–74. Amer.
     Math. Soc., 2002. [doi:10.1090/conm/305, arXiv:quant-ph/0005055] 10, 42

 [8] Mark Bun and Justin Thaler: A nearly optimal lower bound on the approximate
     degree of $AC^0$. SIAM J. Comput., 49(4):59–96, 2020. Preliminary version in FOCS’17.
     [doi:10.1137/17M1161737, arXiv:1703.05784, ECCC:TR17-051] 6, 33, 46


 [9] Bill Fefferman and Cedric Yen-Yu Lin: A complete characterization of unitary
     quantum space. In Proc. 9th Innovations in Theoret. Comp. Sci. Conf. (ITCS’18), pp. 4:1–21.
     Schloss Dagstuhl–Leibniz-Zentrum fuer Informatik, 2018. [doi:10.4230/LIPIcs.ITCS.2018.4,
     arXiv:1604.01384] 7

[10] Bill Fefferman and Zachary Remscrim: Eliminating intermediate measurements in space-
     bounded quantum computation. In Proc. 53rd STOC, pp. 1343–1356. ACM Press, 2021.
     [doi:10.1145/3406325.3451051, arXiv:2006.03530, ECCC:TR20-088] 2, 9

[11] Anna Gál: A characterization of span program size and improved lower bounds for
     monotone span programs. Comput. Complexity, 10(4):277–296, 2001. Preliminary version in
     STOC’98. [doi:10.1007/s000370100001] 4, 5, 25, 27, 29

[12] Uma Girish, Ran Raz, and Wei Zhan: Quantum logspace algorithm for powering matrices
     with bounded norm. In Proc. 48th Internat. Colloq. on Automata, Languages, and
     Programming (ICALP’21), pp. 73:1–20. Schloss Dagstuhl–Leibniz-Zentrum fuer Informatik, 2021.
     [doi:10.4230/LIPIcs.ICALP.2021.73, arXiv:2006.04880, ECCC:TR20-087] 2, 9

[13] Mika Göös, Pritish Kamath, Robert Robere, and Dmitry Sokolov: Adventures in
     monotone complexity and TFNP. In Proc. 10th Innovations in Theoret. Comp. Sci.
     Conf. (ITCS’19), pp. 38:1–19. Schloss Dagstuhl–Leibniz-Zentrum fuer Informatik, 2019.
     [doi:10.4230/LIPIcs.ITCS.2019.38, ECCC:TR18-163] 37

[14] Peter Høyer, Troy Lee, and Robert Špalek: Negative weights make adversaries stronger.
     In Proc. 39th STOC, pp. 526–535. ACM Press, 2007. [doi:10.1145/1250790.1250867] 11

[15] Tsuyoshi Ito and Stacey Jeffery: Approximate span programs. Algorithmica, 81(6):2158–2195,
     2019. Preliminary version in ICALP’16. [doi:10.1007/s00453-018-0527-1, arXiv:1507.00432]
     2, 9, 10, 12, 13, 14, 17, 18, 19, 37

[16] Stacey Jeffery: Frameworks for Quantum Algorithms. Ph. D. thesis, University of Waterloo,
     2014. Available at http://uwspace.uwaterloo.ca/handle/10012/8710. 19

[17] Stacey Jeffery: Span programs and quantum space complexity. In Proc. 11th Innovations
     in Theoret. Comp. Sci. Conf. (ITCS’20), pp. 4:1–37. Schloss Dagstuhl–Leibniz-Zentrum fuer
     Informatik, 2020. [doi:10.4230/LIPIcs.ITCS.2020.4, arXiv:1908.04232] 1

[18] Stacey Jeffery and Shelby Kimmel: Quantum algorithms for graph connectivity and formula
     evaluation. Quantum, 1:26:1–40, 2017. [doi:10.22331/q-2017-08-17-26, arXiv:1704.00765] 37

[19] Richard Jozsa, Barbara Kraus, Akimasa Miyake, and John Watrous: Matchgate and space-
     bounded quantum computations are equivalent. Proc. Royal Soc. A, 466(2115):809–830, 2010.
     [doi:10.1098/rspa.2009.0433] 7

[20] Mauricio Karchmer and Avi Wigderson: On span programs. In Proc. 8th IEEE
     Conf. Structure in Complexity Theory (SCT’93), pp. 102–111. IEEE Comp. Soc., 1993.
     [doi:10.1109/SCT.1993.336536] 2, 11, 12


[21] Alexei Y. Kitaev: Quantum measurements and the Abelian stabilizer problem. Electron.
     Colloq. Comput. Complexity, TR96-003, 1996. [ECCC, arXiv:quant-ph/9511026] 9

[22] Troy Lee, Rajat Mittal, Ben W. Reichardt, Robert Špalek, and Mario Szegedy: Quantum
     query complexity of state conversion. In Proc. 52nd FOCS, pp. 344–353. IEEE Comp. Soc.,
     2011. [doi:10.1109/FOCS.2011.75] 8

[23] Satyanarayana V. Lokam: Complexity lower bounds using linear algebra. Found. Trends
     Theor. Comp. Sci., 4(1–2):1–155, 2009. [doi:10.1561/0400000011] 25

[24] Toniann Pitassi and Robert Robere: Strongly exponential lower bounds for monotone
     computation. In Proc. 49th STOC, pp. 1246–1255. ACM Press, 2017. [doi:10.1145/3055399.3055478,
     ECCC:TR16-188] 5, 28, 31, 36

[25] Alexander A. Razborov: Applications of matrix methods to the theory of lower bounds in
     computational complexity. Combinatorica, 10(1):81–93, 1990. [doi:10.1007/BF02122698] 4,
     25, 27, 29

[26] Alexander A. Razborov: On submodular complexity measures. In Proc. London Math. Soc.
     Symposium on Boolean Function Complexity, pp. 76–83, 1992. Author’s website. 27

[27] Ben W. Reichardt: Span programs and quantum query complexity: The general adversary
     bound is nearly tight for every Boolean function. In Proc. 50th FOCS, pp. 544–551. IEEE
     Comp. Soc., 2009. [doi:10.1109/FOCS.2009.55, arXiv:0904.2759] 2, 3, 10, 11, 12, 19, 37

[28] Ben W. Reichardt and Robert Špalek: Span-program-based quantum algorithm for
     evaluating formulas. Theory of Computing, 8(13):291–319, 2012. Preliminary version in
     STOC’08. [doi:10.4086/toc.2012.v008a013] 2, 11

[29] Robert Robere, Toniann Pitassi, Benjamin Rossman, and Stephen A. Cook: Exponential
     lower bounds for monotone span programs. In Proc. 57th FOCS, pp. 406–415. IEEE Comp.
     Soc., 2016. [doi:10.1109/FOCS.2016.51, ECCC:TR16-064] 5, 6, 31, 34

[30] Alexander A. Sherstov: The pattern matrix method. SIAM J. Comput., 40(6):1969–2000,
     2011. [doi:10.1137/080733644, arXiv:0906.4291] 6, 31, 34

[31] John Watrous: Space-bounded quantum complexity. J. Comput. System Sci., 59(2):281–326,
     1999. [doi:10.1006/jcss.1999.1655] 6





AUTHOR

    Stacey Jeffery
    Senior researcher
    CWI & QuSoft
    Amsterdam
    The Netherlands
    jeffery cwi nl
    https://homepages.cwi.nl/~jeffery/


ABOUT THE AUTHOR

    Stacey Jeffery started her academic career as an undergraduate philosophy student
       at McMaster University, before reading the book Gödel, Escher, Bach: An Eternal
       Golden Braid, after which she transferred to a Computer Science program at
       the University of Waterloo. She got her Ph. D. in Computer Science from the
       University of Waterloo under the supervision of Michele Mosca in 2014, before
       spending two and a half years as a postdoc at Caltech at the Institute for Quantum
       Information and Matter. She now lives in Amsterdam with her husband and
       two-year-old daughter, who enjoys bedtime stories read from Theory of Computing.



