The Need for Structure in Quantum Speedups

Authors Scott Aaronson, Andris Ambainis,
Plaintext
                         T HEORY OF C OMPUTING, Volume 10 (6), 2014, pp. 133–166
                                       www.theoryofcomputing.org




The Need for Structure in Quantum Speedups
                              Scott Aaronson∗                       Andris Ambainis†
                  Received October 22, 2012; Revised July 10, 2014; Published August 12, 2014



       Abstract: Is there a general theorem that tells us when we can hope for exponential
       speedups from quantum algorithms, and when we cannot? In this paper, we make two
       advances toward such a theorem, in the black-box model where most quantum algorithms
       operate.
           First, we show that for any problem that is invariant under permuting inputs and outputs
       and that has sufficiently many outputs (like the collision and element distinctness problems),
       the quantum query complexity is at least the 7th root of the classical randomized query
       complexity. (An earlier version of this paper (ICS 2011) gave the 9th root.) This resolves a
       conjecture of Watrous from 2002.
           Second, inspired by work of O’Donnell et al. (2005) and Dinur et al. (2006), we conjecture
       that every bounded low-degree polynomial has a “highly influential” variable. (A multivariate
       polynomial p is said to be bounded if 0 ≤ p(x) ≤ 1 for all x in the Boolean cube.) Assuming
       this conjecture, we show that every T -query quantum algorithm can be simulated on most
       inputs by a T O(1) -query classical algorithm, and that one essentially cannot hope to prove
       P 6= BQP relative to a random oracle.
ACM Classification: F.1.2, F.1.3
AMS Classification: 81P68, 68Q12, 68Q17
Key words and phrases: decision trees, adversary method, collision problem, Fourier analysis, influ-
ences, quantum computing, query complexity
      A preliminary version of this paper appeared in the Proc. 2nd “Innovations in Computer Science” Conference (ICS 2011).
See Sec. 1.3 for a comparison with the present paper.
    ∗ MIT. Email: aaronson@csail.mit.edu. This material is based upon work supported by the National Science Foundation

under Grant No. 0844626, by a TIBCO Career Development Chair, and by an Alan T. Waterman award.
    † University of Latvia and IAS, Princeton. Email: ambainis@lu.lv. Supported by University of Latvia Research Grant

ZP01-100, FP7 Marie Curie International Reintegration Grant (PIRG02-GA-2007-224886), FP7 FET-Open project QCS, FP7
FET-Proactive project QALGO and ERC Advanced Grant MQC (at the University of Latvia) and the National Science Foundation
under agreement No. DMS-1128155 (at IAS, Princeton). Any opinions, findings and conclusions or recommendations expressed
in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.


© 2014 Scott Aaronson and Andris Ambainis
c b Licensed under a Creative Commons Attribution License (CC-BY)                             DOI: 10.4086/toc.2014.v010a006
                                    S COTT A ARONSON AND A NDRIS A MBAINIS

1     Introduction
Perhaps the central lesson gleaned from fifteen years of quantum algorithms research is this:

       Quantum computers can offer superpolynomial speedups over classical computers, but only
       for certain “structured” problems.

    The key question, of course, is what we mean by “structured.” In the context of most existing
quantum algorithms, “structured” basically means that we are trying to determine some global property
of an extremely long sequence of numbers, assuming that the sequence satisfies some global regularity.
As a canonical example, consider P ERIOD -F INDING, the core of Shor’s algorithms for factoring and
computing discrete logarithms [30]. Here we are given black-box access to an exponentially-long
sequence of integers X = (x1 , . . . , xN ); that is, we can compute xi for a given i. We are asked to find
the period of X—that is, the smallest k > 0 such that xi = xi−k for all i > k—under the promise that
X is indeed periodic, with period k  N (and also that the xi values are approximately distinct within
each period). The requirement of periodicity is crucial here: it is what lets us use the Quantum Fourier
Transform to extract the information we want from a superposition of the form

                                                      1 N
                                                     √ ∑ |ii |xi i .
                                                       N i=1

For other known quantum algorithms, X needs to be (for example) a cyclic shift of quadratic residues [17],
or constant on the cosets of a hidden subgroup.
    By contrast, the canonical example of an “unstructured” problem is the Grover search problem. Here
we are given black-box access to an N-bit string (x1 , . . . , xN ) ∈ {0, 1}N , and are asked whether there
                                                                                                       √
exists1 an i such that xi = 1. Grover [21] gave a quantum algorithm to solve this problem using O( N)
queries [21], as compared to the Ω (N) needed classically. However, this quadratic speedup is optimal,
as shown by Bennett, Bernstein, Brassard, and Vazirani [11]. For other “unstructured” problems—such
as computing the PARITY or M AJORITY of an N-bit string—quantum computers offer no asymptotic
speedup at all over classical computers (see Beals et al. [9]).
    Unfortunately, this “need for structure” has essentially limited the prospects for superpolynomial
quantum speedups to those areas of mathematics that are liable to produce things like periodic sequences
or sequences of quadratic residues.2 This is the fundamental reason why the greatest successes of quantum
algorithms research have been in cryptography, and specifically in number-theoretic cryptography. It
helps to explain why we do not have a fast quantum algorithm to solve NP-complete problems (for
example), or to break arbitrary one-way functions.
    Given this history, the following problem takes on considerable importance:

Problem 1.1 (Informal). For every “unstructured” problem f , are the quantum query complexity Q( f )
and the classical randomized query complexity R( f ) polynomially related?
    1 A variant asks us to find an i such that x = 1, under the mild promise that such an i exists.
                                                i
    2 Here we exclude BQP-complete problems, such as simulating quantum physics (the “original” application of quantum

computers), approximating the Jones polynomial [5], and estimating a linear functional of the solution of a well-conditioned
linear system [22].


                        T HEORY OF C OMPUTING, Volume 10 (6), 2014, pp. 133–166                                        134
                                  T HE N EED FOR S TRUCTURE IN Q UANTUM S PEEDUPS

   Despite its apparent vagueness, Problem 1.1 can be formalized in several natural and convincing
ways—and under these formalizations, the problem has remained open for about a decade.

1.1     Formalizing the problem
Let S ⊆ [M]N be a collection of inputs, and let f : S → {0, 1} be a function that we are trying to compute.
In this paper, we assume for simplicity that the range of f is {0, 1}; in other words, that we are trying to
solve a decision problem. It will also be convenient to think of f as a function from [M]N to {0, 1, ∗},
where ∗ means ‘undefined’ (that is, that a given input X ∈ [M]N is not in f ’s domain S).
    We will work in the well-studied decision-tree model. In this model, given an input X = (x1 , . . . , xN ),
an algorithm can at any time choose an i and receive xi . We count only the number of queries the
algorithm makes to the xi , ignoring other computational steps. Then the deterministic query complexity
of f , or D( f ), is the number of queries made by an optimal deterministic algorithm on a worst-case input
X ∈ S. The (bounded-error) randomized query complexity R( f ) is the expected number of queries made
by an optimal randomized algorithm that, for every X ∈ S, computes f (X) with probability at least 2/3.
The (bounded-error) quantum query complexity Q( f ) is the same as R( f ), except that we allow quantum
algorithms. Clearly Q( f ) ≤ R( f ) ≤ D( f ) ≤ N for all f . See Buhrman and de Wolf [15] for detailed
definitions as well as a survey of these measures.
    If S = [M]N , then we say f is total, and if M = 2, then we say f is Boolean. The case of total f is
relatively well-understood. Already in 1998, Beals et al. [9] showed the following:
Theorem 1.2 (Beals et al.). D ( f ) = O(Q( f )6 ) for all total Boolean functions f : {0, 1}N → {0, 1}.
      Furthermore, it is easy to generalize Theorem 1.2 to show that D ( f ) = O(Q( f )6 ) for all total functions
 f : [M]N → {0, 1}, not necessarily Boolean.3 In other words, for total functions, the quantum query
complexity is always at least the 6th root of the classical query complexity. The largest known gap
between D( f ) and Q( f ) for a total function is quadratic, and is achieved by the OR function (because of
Grover’s algorithm).
      On the other hand, as soon as we allow non-total functions, we can get enormous gaps. Aaronson [2]
                                                                  Ω(1) , yet Q( f ) = O (1).4 Other examples,
gave a Boolean function  √ f : S → {0, 1} for which R( f ) = N
for which R( f ) = Ω( N) and Q( f ) = O(log N log log N), follow easily from Simon’s algorithm [31]
and Shor’s algorithm [30]. Intuitively, these functions f achieve such large separations by being highly
structured: that is, their domain S includes only inputs that satisfy a stringent promise, such as encoding a
periodic function, or (in the case of [2]) encoding two Boolean functions, one of which is correlated with
the Fourier transform of the other one.
      By contrast with these highly-structured problems, consider the collision problem: that of deciding
whether a sequence of numbers (x1 , . . . , xN ) ∈ [M]N is one-to-one (each number appears once) or two-to-
one (each number appears twice). Let C OL(X) = 0 if X is one-to-one and C OL(X) = 1 if X is two-to-one,
    3 Theorem 1.2 is proved by combining three ingredients: D( f ) = O (C( f ) bs( f )), C( f ) = O(bs( f )2 ), and bs( f ) = O(Q( f )2 )

(where C( f ) is the certificate complexity of f and bs( f ) is the block sensitivity). And all three ingredients go through with no
essential change if we set M > 2, and define suitable M-ary generalizations of C( f ) and bs( f ). (We could also convert the
non-Boolean function f : [M]N → {0, 1} to a Boolean one, but then we would lose a factor of log M.)
    4 Previously, de Beaudrap, Cleve, and Watrous [10] had stated a similar randomized versus quantum separation. However,

their separation applied not to the standard quantum black-box model, but to a different model in which the black box permutes
the answer register |yi in some unknown way (rather than simply mapping |yi to |y ⊕ f (x)i).


                          T HEORY OF C OMPUTING, Volume 10 (6), 2014, pp. 133–166                                                   135
                                 S COTT A ARONSON AND A NDRIS A MBAINIS

under the promise that one of these is the case. Then C OL(X) is not a total function, since its definition
involves a promise on X. Intuitively, however, the collision problem seems much less “structured” than
Simon’s and Shor’s problems. One way to formalize this intuition is as follows.
Definition 1.3. Call a partial function f : [M]N → {0, 1, ∗} permutation-invariant if

                                  f (x1 , . . . , xN ) = f (τ(xσ (1) ), . . . , τ(xσ (N) ))

for all inputs X ∈ [M]N and all permutations σ ∈ SN and τ ∈ SM .
    Then C OL(X) is permutation-invariant: we can permute a one-to-one sequence and relabel its elements
however we like, but it is still a one-to-one sequence, and likewise for a two-to-one sequence. Because
of this symmetry, attempts to solve the collision problem using (for example) the Quantum Fourier
Transform seem unlikely to succeed. And indeed, in 2002 Aaronson [1] proved that Q (C OL) = Ω(N 1/5 ):
that is, the quantum query complexity  √ of the collision problem is at most polynomially better than its
randomized query complexity of Θ( N). The quantum lower bound was later improved to Ω(N ) by      1/3

Aaronson and Shi [4], matching an upper bound of Brassard, Høyer, and Tapp [14].
    Generalizing boldly from this example, John Watrous (personal communication) conjectured that the
randomized and quantum query complexities are polynomially related for every permutation-invariant
problem:
Conjecture 1.4 (Watrous 2002). R( f ) ≤ Q( f )O(1) for every partial function f : [M]N → {0, 1, ∗} that is
permutation-invariant.
    Let us make two remarks about Conjecture 1.4. First, the conjecture talks about randomized versus
quantum query complexity, since in this setting, it is easy to find functions f for which R( f ) and Q( f ) are
both tiny but D( f ) is huge. As an example, consider the Deutsch-Jozsa problem [18]: given a Boolean
input (x1 , . . . , xN ), decide whether the xi are all equal or whether half of them are 1 and the other half are
0, under the promise that one of these is the case.
    Second, if M = 2 (that is, f is Boolean), then Conjecture 1.4 follows relatively easily from known
results: indeed, we prove in Appendix 5 that R( f ) = O(Q( f )2 ) in that case. So the interesting case is
when M  2, as it is for the collision problem.
    Conjecture 1.4 provides one natural way to formalize the idea that classical and quantum query
complexities should be polynomially related for all “unstructured” problems. A different way is provided
by the following conjecture, which we were aware of since about 1999:
Conjecture 1.5 (folklore). Let Q be a quantum algorithm that makes T queries to a Boolean input
X = (x1 , . . . , xN ), and let ε > 0. Then there exists a deterministic classical algorithm that makes
poly(T, 1/ε, 1/δ ) queries to the xi , and that approximates Q’s acceptance probability to within an
additive error ε on a 1 − δ fraction of inputs.
    But what exactly does Conjecture 1.5 have to do with “the need for structure in quantum speedups”?
With Conjecture 1.4, the connection to this paper’s theme was more-or-less obvious, but with Conjec-
ture 1.5, some additional explanation is probably needed.
    Intuitively, we want to say the following: in order to achieve a superpolynomial speedup in the
black-box model, a quantum computer needs not merely a promise problem, but a “severely constrained”

                      T HEORY OF C OMPUTING, Volume 10 (6), 2014, pp. 133–166                                 136
                                T HE N EED FOR S TRUCTURE IN Q UANTUM S PEEDUPS

promise problem. In other words, only a minuscule fraction of the 2N oracle strings X = (x1 , . . . , xN )
ought to satisfy the promise—precisely like what happens in Simon’s and Shor’s problems, where the
promise asserts that X encodes a periodic function. If the promise is too “mild”—if, say, it holds for all X
in some set S ⊆ {0, 1}N with |S| = Ω(2N )—then we should be back in the situation studied by Beals et
al. [9], where the oracle X lacked enough “structure” for a Shor-like algorithm to exploit, and as a result,
the best one could hope for was a polynomial quantum speedup like that of Grover’s algorithm.
     Yet, if we interpret the above intuition too naïvely, then it is easy to find counterexamples. To
illustrate, let S1 consist of all strings X ∈ {0, 1}N that encode valid inputs to Simon’s problem, let
S0 consist of all Y ∈ {0, 1}N that have Hamming distance at least N/10 from every X ∈ S1 , and let
S = S0 ∪ S1 . Then define a Boolean function fSimon : S → {0, 1} by fSimon (X) = 1 for all X ∈ S1 ,
and fSimon (X) = 0 for all X ∈ S0 . As observed by Buhrman et al. [16] (see also Ambainis and de
Wolf [7] and Hemaspaandra, Hemaspaandra, and Zimand [23]), this “property-testing version of Simon’s
problem” achievesp an exponential separation between randomized and quantum query complexities:
R( fSimon ) = Ω( N/ log N) while Q( fSimon ) = O(log N). But the promise is certainly “mild”: indeed
|S| ≥ 2N − 2cN for some constant c < 1.
     On the other hand, examining this counterexample more closely suggests a way to salvage our original
intuition. For notice that there exists a fast, deterministic classical algorithm that correctly evaluates
 fSimon (X) on almost all inputs X ∈ S: namely, the algorithm that always outputs 0! This algorithm errs
only on the minuscule fraction of inputs X ∈ S that happen to belong to S1 . Thus, we might conjecture
that this points to a general phenomenon: namely, whenever        there exists a fast quantum algorithm to
                                                               N
compute a Boolean function f : S → {0, 1} with |S| = Ω 2 , there also exists a fast classical algorithm
to compute f (X) on most inputs X ∈ S. In Appendix 7, we will prove that Conjecture 1.5 is equivalent to
this conjecture.
     Indeed, Conjecture 1.5 readily implies a far-reaching generalization of the result of Beals et al. [9]
stating that D( f ) = O(Q( f )6 ) for all total Boolean functions f . In particular, define the ε-approximate
query complexity of a Boolean function f : {0, 1}N → {0, 1}, or Dε ( f ), to be the minimum number of
queries made by a deterministic algorithm that evaluates f correctly on at least a 1 − ε fraction of inputs
X. Likewise, let Qε ( f ) be the minimum number of queries made by a quantum algorithm that evaluates
 f correctly on at least a 1 − ε fraction of inputs. Then Conjecture 1.5 implies that Dε ( f ) and Qδ ( f )
are polynomially related for all Boolean functions f and all constants ε > δ > 0 independent of N.5
This would provide a quantum counterpart to a beautiful 2002 result of Smyth [32], who solved an old
open problem of Steven Rudich by showing that Dε ( f ) = O(Cε 3 /30 ( f )2 /ε 3 ) for all ε > 0 (where Cδ ( f )
denotes the “δ -approximate certificate complexity” of f ).
     More dramatically, if Conjecture 1.5 holds, then we basically cannot hope to prove P 6= BQP relative
to a random oracle. This would answer a question raised by Fortnow and Rogers [20] in 1998, and
would contrast sharply with the situation for non-random oracles: we have had oracles relative to which
P 6= BQP, and indeed BQP 6⊂ MA, since the work of Bernstein and Vazirani [12] in the early 1990s. More
precisely, under some suitable complexity assumption (such as P = P#P ), we would get BQPA ⊂ AvgPA
with probability 1 for a random oracle A. Here AvgP is the class of languages for which there exists a
polynomial-time algorithm that solves a 1 − o (1) fraction of instances of size n. In other words, separating
BQP from AvgP relative to a random oracle would be as hard as separating complexity classes in the
   5 More generally, as we will show in Corollary 3.4, the relation we obtain is D                         O(1)
                                                                                  ε+δ ( f ) ≤ (Qε ( f )/δ )     for all ε, δ > 0.


                         T HEORY OF C OMPUTING, Volume 10 (6), 2014, pp. 133–166                                                137
                                        S COTT A ARONSON AND A NDRIS A MBAINIS

unrelativized world. This would provide a quantum counterpart to a theorem of Impagliazzo and Rudich
(credited in [24]), who used the powerful results of Kahn, Saks, and Smyth [24] to show that if P = NP,
then NPA ∩ coNPA ⊂ ioAvgPA with probability 1 for a random oracle A.6

1.2     Our results
Our main contribution in this paper is essentially to prove Watrous’s conjecture (Conjecture 1.4), that
randomized and quantum query complexities are polynomially related for every symmetric problem.
Theorem 1.6. R( f ) = O(Q( f )7 polylog Q( f )) for every partial function f : [M]N → {0, 1, ∗} that is
permutation-invariant.
     We conjecture that R( f ) and Q( f ) are polynomially related even for functions f satisfying one of
the two symmetries: namely, f (x1 , . . . , xN ) = f (xσ (1) , . . . , xσ (N) ) for all σ ∈ SN . We also conjecture that
the exponent of 7 can be improved to 2: in other words, that Grover’s algorithm once again provides the
optimal separation between the quantum and classical models.
     While the proof of Theorem 1.6 is somewhat involved, it can be entirely understood by those
unfamiliar with quantum computing: the difficulties lie in getting the problem into a form where existing
quantum lower bound technology can be applied to it. Let us stress that it was not at all obvious a priori
that existing quantum lower bounds would suffice here; that they did came as a surprise to us.
     We first define and analyze a simple randomized algorithm, which tries to compute f (X) for a given
X = (x1 , . . . , xN ) by estimating the multiplicity of each element xi . Next, by considering where this
randomized algorithm breaks down, we show that one can identify a “hard core” within f : roughly
speaking, two input types A∗ and B∗ , such that the difficulty of distinguishing A∗ from B∗ accounts
for a polynomial fraction of the entire difficulty of computing f . The rest of the proof consists of
lower-bounding the quantum query complexity of distinguishing A∗ from B∗ . We do so using a hybrid
argument: we develop a “chopping procedure” that gradually deforms A∗ to make it more similar to
B∗ , creating a sequence of intermediate input types A0 = A∗ , A1 , A2 , . . . , A2L = B∗ . We then show that,
for every ` ∈ [L], distinguishing A` from A`−1 requires many quantum queries, either by a reduction
from Zhandry’s recent Ω(N 1/3 ) quantum lower bound for the S ET E QUALITY problem [34] (which is a
nontrivial generalization of Aaronson and Shi’s collision lower bound [4]), or else by an application of
Ambainis’s general quantum adversary theorem [6].
     Note that, prior to Zhandry’s Ω(N 1/3 ) quantum lower bound for S ET E QUALITY, Midrijanis [25] had
proved a lower bound of Ω((N/ log N)1/5 ); the latter was the first quantum lower bound for S ET E QUAL -
ITY , and the only one for nearly a decade. An earlier version of this paper [3] used Midrijanis’s lower
bound to show that R( f ) = O(Q( f )9 polylog Q( f )) for all permutation-symmetric f . The improvement
to R( f ) = O(Q( f )7 polylog Q( f )) in the current version comes entirely from Zhandry’s improvement of
the S ET E QUALITY lower bound to the optimal Ω(N 1/3 ).
     Carrying out the hybrid argument in the “obvious” way produces a bound of the form R( f ) ≤
Q( f )O(1) polylog N, which fails to imply a polynomial relationship between R( f ) and Q( f ) when Q( f ) ≤
(log N)o(1) . However, a more sophisticated hybrid argument eliminates the polylog N factor.
   6 Here ioAvgP means “average-case P for infinitely many input lengths n.” The reason Impagliazzo and Rudich only get a

simulation in ioAvgP, rather than AvgP, has to do with the fact that Smyth’s result [32] only relates Dε ( f ) to Cε 3 /30 ( f ), rather
than Dε+δ ( f ) to Cε ( f ) for all δ > 0.


                          T HEORY OF C OMPUTING, Volume 10 (6), 2014, pp. 133–166                                                  138
                             T HE N EED FOR S TRUCTURE IN Q UANTUM S PEEDUPS

    Our second contribution is more exploratory, something we put forward in the hope of inspiring
followup work. We study Conjecture 1.5, which states that every T -query quantum algorithm can be
simulated on most inputs using T O(1) classical queries. We relate this conjecture to a fundamental open
problem in Fourier analysis and approximation theory. Given a real polynomial p : RN → R, let

                                                               (p(X) − p(X i ))2
                                                                                
                                     Infi [p] :=      E
                                                   X∈{0,1}N


be the influence of the ith variable, where X i means X with the ith bit flipped. Then we conjecture that
every bounded low-degree polynomial has a “highly influential” variable. More precisely:

Conjecture 1.7 (Bounded Polynomials Have Influential Variables). Let p : RN → R be a polynomial of
degree d. Suppose that 0 ≤ p(X) ≤ 1 for all X ∈ {0, 1}N , and

                                                    (p(X) − E [p])2 ≥ ε .
                                                                  
                                            E
                                        X∈{0,1}N


Then there exists an i such that Infi [p] ≥ (ε/d)O(1) .

    We show the following:

Theorem 1.8. Assume Conjecture 1.7. Then

   (i) Conjecture 1.5 holds.

  (ii) Dε+δ ( f ) ≤ (Qε ( f )/δ )O(1) for all Boolean functions f : {0, 1}N → {0, 1} and all ε, δ > 0.

 (iii) If P = P#P , then BQPA ⊂ AvgPA with probability 1 for a random oracle A.

   The main evidence for Conjecture 1.7—besides the fact that all the Fourier analysis experts we asked
were confident of it!—is that extremely similar statements have recently been proved. Firstly, O’Donnell,
Saks, Schramm, and Servedio [27] proved an analogue of Conjecture 1.7 for decision trees, which are a
special case of bounded real polynomials:

Theorem 1.9 (O’Donnell et al. 2005). Let f : {0, 1}N → {0, 1} be a Boolean function, and suppose
Pr [ f = 1] Pr [ f = 0] ≥ ε. Then there exists an i such that Infi [ f ] ≥ 4ε/ D( f ), where D( f ) is the decision-
tree complexity of f .

     Unfortunately, Theorem 1.9 does not directly imply anything about our problem, even though Beals et
al. [9] showed that D( f ) and Q( f ) are polynomially related for all total Boolean functions f . The reason
is that the acceptance probability of a quantum algorithm need not approximate a total Boolean function.
     The second piece of evidence for Conjecture 1.7 comes from a powerful result of Dinur, Friedgut,
Kindler, and O’Donnell [19], which implies our conjecture, except with Infi [p] ≥ ε 3 /2O(d) instead of
Infi [p] ≥ (ε/d)O(1) . Let us state the special case of their result that is relevant for us:

                      T HEORY OF C OMPUTING, Volume 10 (6), 2014, pp. 133–166                                  139
                                      S COTT A ARONSON AND A NDRIS A MBAINIS

Theorem 1.10 (Dinur et al. 2006). Let ε > 0, and let p : RN → R be a degree-d polynomial such that
0 ≤ p(X) ≤ 1 for all X ∈ {0, 1}N . Then there exists a 2O(d) /ε 2 -junta pe : RN → R (that is, a polynomial
depending on at most 2O(d) /ε 2 variables) such that
                                             h                 i
                                       E      ( pe(X) − p(X))2 ≤ ε .
                                             X∈{0,1}N

    Even though Theorem 1.10 has an exponential rather than polynomial dependence on 1/d, we observe
that it already has a nontrivial consequence for quantum computation. Namely, it implies that any T -query
quantum algorithm can be simulated on most inputs using 2O(T ) classical queries.7 Recall that the gaps
between classical and quantum query complexities can be superexponential (and even N Ω(1) versus O (1),
as in the example of Aaronson [2]), so even an exponential upper bound is far from obvious.

1.3    Subsequent work
Since the first version of this paper appeared on arXiv [3] was circulated, there have been at least
three interesting developments (not counting the Ω(N 1/3 ) quantum lower bound of Zhandry [34] for
S ET E QUALITY, which we incorporate here).
     First, Yuen [33] adapted the hybrid argument that we used to prove Theorem 1.6, in order to show
that distinguishing a random function X : [N] → [N] from a random permutation requires Ω(N 1/5 / log N)
quantum queries. (Subsequently, however, Zhandry [34] proved a tight lower bound of Ω(N 1/3 ) for the
random function versus random permutation problem, using different ideas.)
     Second, Montanaro [26] used a hypercontractive inequality to prove Conjecture 1.7, in the special
case where p is a multilinear polynomial all of whose coefficients (when written in the Fourier basis)
have the same absolute value. Currently, it remains open to generalize Montanaro’s technique to arbitrary
multilinear polynomials, let alone arbitrary polynomials.
     Third, Bačkurs and Bavarian [8] solved a technical problem that arose from an earlier version of this
paper [3]. In that version, we stated Conjecture 1.7 in terms of L1 -influences rather than L2 -influences, and
we also used the L1 -norm in proving the consequences of Conjecture 1.7 for quantum query complexity.
Subsequently, Bačkurs (personal communication) found an error in our proof. Fortunately, however, we
noticed that (a) our proof could be fixed by simply switching from L1 -norm to L2 -norm throughout, and
(b) the L2 version of Conjecture 1.7 was, in any case, provably equivalent to our original L1 version.
So we switched to the L2 -norm. At the same time, though, we remained curious whether our original
L1 -based argument could have worked. The question boiled down to the following: given a degree-d real
polynomial p : RN → R, let
                                    Inf1i [p] := E        p(X) − p(X i ) .
                                                                         
                                                        X∈{0,1}N

Then do we have ∑Ni=1 Inf1i [p] ≤ d O(1) , whenever p(X) ∈ [0, 1] for all X ∈ {0, 1}N ? Bačkurs and Bavar-
ian [8] show that the answer is yes: indeed, the sum of the L1 -influences is upper-bounded by O(d 3 log d).
Using their result, one can salvage our original L1 -based argument.
    For simplicity, though, in this version of the paper we stick with L2 -influences. There, the analogue
of Bačkurs and Bavarian’s result is much easier, and states that ∑Ni=1 Infi [p] ≤ d (we provide the folklore
   7 Indeed, in this case the classical queries are nonadaptive.



                         T HEORY OF C OMPUTING, Volume 10 (6), 2014, pp. 133–166                          140
                            T HE N EED FOR S TRUCTURE IN Q UANTUM S PEEDUPS

proof in Lemma 3.1). For completeness, in Appendix 6 we prove the equivalence of the L1 and L2
versions of Conjecture 1.7.


2     Quantum lower bound for all symmetric problems
In this section we prove Theorem 1.6: that R( f ) = O(Q( f )7 polylog Q( f )) for all permutation-symmetric
f.
    We start with a simple observation that is essential to everything that follows. Since f is symmetric,
we can group the inputs X = (x1 , . . . , xN ) into equivalence classes that we call types.
Definition 2.1. Given an input X = (x1 , . . . , xN ) ∈ [M]N , the type of X is a list of positive integers
A = (a1 , . . . , au ), which records the multiplicities of the integers occurring in X from most to least
frequent. So in particular, a1 ≥ · · · ≥ au and a1 + · · · + au = N. For convenience, we adopt the convention
that ai = 0 for all i > u.
    In other words, a type is just a partition (or Young diagram) that records the multiplicities of the
input elements. For example, a one-to-one input has type a1 = · · · = aN = 1, while a two-to-one input
has type a1 = · · · = aN/2 = 2. We write X ∈ A if X is of type A. Clearly f (X) depends only on the
type of X. Furthermore, given a quantum query algorithm Q, we can assume without loss of generality
that Pr [Q accepts X] depends only on the type of X—since we can “symmetrize” Q (that is, randomly
permute X’s inputs and outputs) prior to running Q.

2.1     Randomized upper bound
Let X = (x1 , . . . , xN ) be an input. For each j ∈ [M], let κ j be the number of subscripts i such that xi = j.
Then the first step is to give a classical randomized algorithm that estimates the κ j . This algorithm, ST ,
is an extremely straightforward sampling procedure. (Indeed, there is essentially nothing else that a
randomized algorithm can do here.) ST will make O(T 1+c log T ) queries, where T is a parameter and
c ∈ (0, 1] is a constant that we will choose later to optimize the final bound.

          Set U := 21T 1+c ln T
          Choose U indices i1 , . . . , iU ∈ [N] uniformly at random with replacement
          Query xi1 , . . . , xiU
          For each j ∈ [M]:
              Let z j be the number of occurrences of j in (xi1 , . . . , xiU )
                                  z
              Output κe j := Uj · N as the estimate for κ j

      We now analyze how well ST works.
Lemma 2.2. With probability 1 − O (1/T ), we have
                                                             N κj
                                              κe j − κ j ≤    +
                                                             T Tc
for all j ∈ [M].

                     T HEORY OF C OMPUTING, Volume 10 (6), 2014, pp. 133–166                                141
                               S COTT A ARONSON AND A NDRIS A MBAINIS

Proof. For each j ∈ [M], we consider four cases. First suppose κ j ≥ N/T 1−c . Notice that z j is a sum of
U independent Boolean variables, and that
                                                     U           U
                                        E [z j ] =     E[κe j ] = κ j .
                                                     N           N
Thus
                                                                       
                               h           κj i           U        Uκ j
                            Pr κe j − κ j > c = Pr z j − κ j >
                                           T              N        NT c
                                                                 
                                                          Uκ j /N
                                                < 2 exp −
                                                           3T 2c
                                                                
                                                            U
                                                < 2 exp − 1+c
                                                          3T
                                                      = 2T −7 ,

where the second line follows from a Chernoff bound and the third from κ j ≥ N/T 1−c .
   Second, suppose N/T ≤ κ j < N/T 1−c . Then
                                                                 
                                          N             U        U
                          Pr κe j − κ j >     = Pr z j − κ j >
                                          T             N        T
                                                                        !
                                                                    N 2
                                                                 
                                                        Uκ j /N
                                              < 2 exp −
                                                           3       Tκj
                                                              
                                                          U
                                              < 2 exp − 1+c
                                                        3T
                                                 = 2T −7

where the second line follows from a Chernoff bound (which is valid because N/(T κ j ) ≤ 1) and the third
from κ j < N/T 1−c .
   Third, suppose N/T 6 ≤ κ j < N/T . Then
                                                                  
                                       N              U           U
                       Pr κe j − κ j >     = Pr z j − κ j >
                                       T               N           T
                                                                             !Uκ j /N
                                                        eN/(T κ j )
                                           <
                                               (1 + N/ (T κ j ))1+N/(T κ j )
                                                                  
                                                      N Uκ j
                                           ≤ exp −        ·
                                                     Tκj N
                                                       
                                                     U
                                           = exp −
                                                     T
                                                
                                                   1
                                           =O         ,
                                                  T7


                    T HEORY OF C OMPUTING, Volume 10 (6), 2014, pp. 133–166                           142
                            T HE N EED FOR S TRUCTURE IN Q UANTUM S PEEDUPS

where the second line follows from a Chernoff bound, the third line follows from N/(T κ j ) > 1, and the
last follows from U = 21T 1+c ln T .
     Fourth, suppose κ j < N/T 6 . Then
                                                                    
                                                N                U   U
                                Pr κe j − κ j >     = Pr z j − κ j >
                                                T                N   T
                                                    ≤ Pr [z j ≥ 2]
                                                         2
                                                        U       κj
                                                    ≤
                                                        2       N
                                                        2
                                                      U κj 
                                                           
                                                    ≤ 6
                                                      T       N
                                                       κj
                                                    ≤
                                                      TN

for all sufficiently large T , where the second line follows from κ j < N/T 6 , the third from the union
bound, the fourth from κ j < N/T 6 (again), and the fifth from U ≤ 21T 2 ln T .
    Notice that there are at most T 6 values of j such that κ j ≥ N/T 6 . Hence, putting all four cases
together,
                                                        
                                            N κj     6     1                κj
                       Pr ∃ j : κe j − κ j > + c ≤ T · O    7
                                                              + ∑
                                            T T           T    j:κ j <N/T 6
                                                                            TN
                                                       
                                                       1
                                                   =O     .
                                                       T

    Now call A a 1-type if f (X) = 1 for all X ∈ A, or a 0-type if f (X) = 0 for all X ∈ A. Consider the
following randomized algorithm RT to compute f (X):

         Run ST to find an estimate κei for each κi
         Sort the κei ’s in descending order, so that κe1 ≥ · · · ≥ κeM
         If there exists a 1-type A = (a1 , a2 , . . .) such that |κei − ai | ≤ NT + Taic
             for all i, then output f (X) = 1
         Otherwise output f (X) = 0

    Clearly RT makes O(T 1+c log T ) queries, just as ST does. We now give a sufficient condition for RT
to succeed.

Lemma 2.3. Suppose that for all 1-types A = (a1 , a2 , . . .) and 0-types B = (b1 , b2 , . . .), there exists an i
such that
                                                  2N ai + bi
                                     |ai − bi | >     +            .
                                                   T          Tc
Then RT computes f with bounded probability of error, and hence R ( f ) = O(T 1+c log T ).

                      T HEORY OF C OMPUTING, Volume 10 (6), 2014, pp. 133–166                                 143
                                 S COTT A ARONSON AND A NDRIS A MBAINIS

Proof. First suppose X ∈ A where A = (a1 , a2 , . . .) is a 1-type. Then by Lemma 2.2, with probability
1 − O (1/T ) we have
                                                                 N ai
                                                  |κei − ai | ≤ + c
                                                                 T T
for all i. (It is easy to see that sorting the κei can only decrease the maximum difference.) Provided this
occurs, RT finds some 1-type close to (κe1 , κe2 , . . .) (possibly A itself) and outputs f (X) = 1.
    Second, suppose X ∈ B where B = (b1 , b2 , . . .) is a 0-type. Then with probability 1 − O (1/T ) we
have
                                                                 N bi
                                                  |κei − bi | ≤ + c
                                                                 T T
for all i. Provided this occurs, by the triangle inequality, for every 1-type A = (a1 , a2 , . . .) there exists an
i such that
                                                                            N ai
                                    |κei − ai | ≥ |ai − bi | − |κei − bi | > + c .
                                                                            T T
Hence RT does not find a 1-type close to (κ1 , κ2 , . . .), and it outputs f (X) = 0.
                                                   e   e

    In particular, suppose we keep decreasing T until there exists a 1-type A∗ = (a1 , a2 , . . .) and a 0-type
B∗ = (b1 , b2 , . . .) such that
                                                      2N ai + bi
                                         |ai − bi | ≤    +                                                (2.1)
                                                       T      Tc
for all i, stopping as soon as that happens. Then Lemma 2.3 implies that we will still have R ( f ) =
O(T 1+c log T ). For the rest of the proof, we will fix that “almost as small as possible” value of T for
which (2.1) holds, as well as the 1-type A∗ and the 0-type B∗ that RT “just barely distinguishes” from
one another.

2.2   The chopping procedure
Given two sets of inputs A and B with A ∩ B = ∅, let Q(A, B) be the minimum number of queries made
by any quantum algorithm that accepts every X ∈ A with probability at least 2/3, and accepts every Y ∈ B
with probability at most 1/3. Also, let Qε (A, B) be the minimum number of queries made by any quantum
algorithm that accepts every X ∈ A with at least some probability p, and that accepts every Y ∈ B with
probability at most p − ε. Then we have the following basic relation:

Proposition 2.4. Q(A, B) = O( ε1 Qε (A, B)) for all A, B and all ε > 0.

Proof. This follows from standard amplitude estimation techniques (see Brassard et al. [13] for example).


    The rest of the proof consists of lower-bounding Q(A∗ , B∗ ), the quantum query complexity of
distinguishing inputs of type A∗ from inputs of type B∗ . We do this via a hybrid argument. Let
L := dlog2 Ne + 1. At a high level, we will construct a sequence of types A0 , . . . , A2L such that

   (i) A0 = A∗ ,

  (ii) A2L = B∗ , and

                      T HEORY OF C OMPUTING, Volume 10 (6), 2014, pp. 133–166                                  144
                           T HE N EED FOR S TRUCTURE IN Q UANTUM S PEEDUPS

 (iii) Q(A` , A`−1 ) is large for every ` ∈ [2L].

    Provided we can do this, it is not hard to see that we get the desired lower bound on Q(A∗ , B∗ ).
Suppose a quantum algorithm distinguishes A0 = A∗ from A2L = B∗ with constant bias. Then by
the triangle inequality, it must also distinguish some A` from A`+1 with reasonably large bias (say
Ω (1/ log N)). By Proposition 2.4, any quantum algorithm that succeeds with bias ε can be amplified,
with O (1/ε) overhead, to an algorithm that succeeds with constant bias.
    Incidentally, the need, in this hybrid argument, to amplify the distinguishing bias ε = ε` from
Ω (1/ log N) to Ω (1) is exactly what could produce an undesired 1/ log N factor in our final lower bound
on Q( f ), if we were not careful. (We mentioned this issue in Section 1.2.) The way we will solve this
problem, roughly speaking, is to design the A` in such a way that our lower bounds on Q(A` , A`−1 )
increase quickly as functions of `. That way, we can take the biases ε` to decrease quadratically with `
(thus summing to a constant), yet still have Q(A` , A`−1 ) increasing quickly enough that

                                    Qε` (A` , A`−1 ) = Ω(ε` Q(A` , A`−1 ))

remain “uniformly large,” with 1/ log T factors but no 1/ log N factor.
    We now describe the procedure for creating the intermediate types A` . Intuitively, we want to form
A` from A`−1 by making its Young diagram more similar to that of B∗ , by decreasing the rows of A`−1
which are larger than the corresponding rows of B∗ and increasing the rows of A`−1 which are smaller
than the corresponding rows of B∗ .
    More precisely, we construct the intermediate types A1 , A2 , . . . via the following procedure. In this
procedure, (a1 , a2 , . . .) is an input type that is initialized to A∗ , and B∗ = (b1 , b2 , . . .).

         let P be the first power of 2 greater than or equal to N
         for ` := 1 to L
             let SA be the set of i such that ai − bi ≥ P/2`
             let SB be the set of i such that bi − ai ≥ P/2`
             let m := min(|SA | , |SB |)
             choose m elements i from SA , set ai := ai − P/2` and remove them from SA
             choose m elements i from SB , set ai := ai + P/2` and remove them from SB
             let A2`−1 := type(a1 , a2 , . . .)
             if |SA | > 0
                  let ai := ai − P/2` for all i ∈ SA
                  choose |SA | elements i such that ai < bi and set ai := ai + P/2`
             if |SB | > 0
                  let ai := ai + P/2` for all i ∈ SB
                  choose |SB | elements i such that ai > bi and set ai := ai − P/2`
             let A2` := type(a1 , a2 , . . .)
         next `

    The procedure is illustrated pictorially in Figure 1. Intuitively, the main goal we want to achieve is
that, after ` stages (that is, for A2` ), we have |ai − bi | < P/2` for all i. To achieve that, we increase all

                     T HEORY OF C OMPUTING, Volume 10 (6), 2014, pp. 133–166                               145
                                  S COTT A ARONSON AND A NDRIS A MBAINIS




             Figure 1: Chopping a row of A` ’s Young diagram to make it more similar to B` .


ai for i such that ai ≤ bi − P/2` by P/2` , and decrease all ai for i such that ai ≥ bi + P/2` by P/2` . If
the number of i such that ai ≤ bi − P/2` does not equal the number of i such that ai ≥ bi + P/2` , we no
longer have a1 + a2 + . . . = N. To deal with that, we increase or decrease the correct number of other ai
by P/2` , and choose those ai so that, after the increase or decrease, we have |ai − bi | < P/2` .
     We start with some simple observations. First, by construction, this procedure halts after L = O (log N)
iterations.
     By induction, after the `th iteration, we have |ai − bi | < P/2` for all i. (Let a0i be the value of ai
before the `th iteration. Because of the inductive assumption, we must have |a0i − bi | < P/2`−1 . If
|a0i − bi | ∈ [P/2` , P/2`−1 − 1], then ai is increased (decreased by P/2` ), resulting in
                                                                               
                                            P     P P        P            P
                              |ai − bi | ∈ ` − ` , `−1 − ` − 1 = 0, ` − 1 .
                                            2    2 2        2             2

If |a0i − bi | < P/2` , ai is increased only if a0i < bi and ai is decreased only if a0i > bi . Therefore, after the
change, the sign of the difference ai − bi flips and |ai − bi | < P/2` .)
     In particular, this means that, for ` = L, |ai − bi | < P/2` ≤ 1. That is, we have ai = bi for all i and
A2L = B∗ .
     We define
                                                           1 N
                                            kA − Bk := ∑ |ai − bi | .
                                                           2 i=1
                                                             0
Notice that kA2`−2 − A2`−1 k + kA2`−1 − A2` k = rP/2` where r is the number of rows that get increased
(or decreased) in the `th iteration. We now prove an upper bound on kA` − A`−1 k when ` is small, which
will be useful later.

Lemma 2.5. If ` ≤ (log2 T ) − 2, then

                                                                            4N
                                  kA2`−2 − A2`−1 k + kA2`−1 − A2` k ≤          .
                                                                            Tc
Proof. Let m := max(|SA | , |SB |). Then

                                                                      P
                                 kA2`−2 − A2`−1 k + kA2`−1 − A2` k = m ` .
                                                                      2

                      T HEORY OF C OMPUTING, Volume 10 (6), 2014, pp. 133–166                                   146
                              T HE N EED FOR S TRUCTURE IN Q UANTUM S PEEDUPS

Without loss of generality, we assume that m = |SA |. To show the lemma, it suffices to prove that
                                                             4N/T c
                                                   |SA | ≤          .
                                                              P/2`
     We consider the sum ∑ j∈R a j − b j where R is the set of all j such that a j − b j ≥ P/2` , with
(a1 , a2 , . . .) evolving from A0 to A2`−2 and B∗ = (b1 , b2 , . . .) fixed. Initially (when (a1 , a2 , . . .) = A0 ), we
have
                                          P                 2N a j + b j
                                             ≤ aj −bj ≤            +
                                          2`                 T            Tc
for each j ∈ R. Since ` ≤ (log2 T ) − 2, the left inequality implies
                                                                    4N
                                                    aj −bj ≥           ,
                                                                     T
which combined with the right inequality yields
                                                    aj +bj   2N
                                                        c
                                                           ≥    .                                                   (2.2)
                                                      T       T
Therefore
                                                                               
                                                                   2N ai + bi
                                         ∑ |ai − bi | ≤ ∑           T
                                                                      +
                                                                        Tc
                                         i∈R             i∈R
                                                           ai + bi
                                                      ≤ 2∑     c
                                                        i∈R T
                                                       4N
                                                      ≤ c,
                                                       T
where the third line uses (2.2).
   The sum ∑i∈R |ai − bi | is not increased by any step of the algorithm that generates A0 , . . . , A2`−2 .
Therefore, at the beginning of the `th iteration, we still have ∑i∈R |ai − bi | ≤ 4N/T c . This means that
                                                             4N/T c
                                                   |SA | ≤          .
                                                              P/2`

2.3    Quantum lower bounds
Recall that we listed four properties that we needed the chopping procedure to satisfy. We have already
seen that it satisfies properties (i)-(ii), so the remaining step is to show that it satisfies property (iii). That
is, we need to lower-bound Q(A` , A`−1 ), the bounded-error quantum query complexity of distinguishing
inputs of type A` from inputs of type A`−1 . To do this, it will be convenient to consider two cases: first,
that forming A` involved chopping few elements of A`−1 , and second, that it involved chopping many
elements. We will show that we “win either way,” by a different quantum lower bound in each case.
     First consider the case that few elements were chopped. Here we prove a lower bound using
Ambainis’s quantum adversary method [6], in its “general” form (the one used, for example, to lower-
bound the quantum query complexity of inverting a permutation). For completeness, we now state
Ambainis’s adversary theorem in the form we will need.

                       T HEORY OF C OMPUTING, Volume 10 (6), 2014, pp. 133–166                                       147
                                   S COTT A ARONSON AND A NDRIS A MBAINIS

Theorem 2.6 (Ambainis [6]). Let A, B ⊆ [M]N be two sets of inputs with A ∩ B = ∅. Let R ⊆ A × B be a
relation on input pairs, such that for every X ∈ A there exists at least one Y ∈ B with (X,Y ) ∈ R and vice
versa. Given inputs X = (x1 , . . . , xN ) in A and Y = (y1 , . . . , yN ) in B, let

                                         qX,i = Pr [xi 6= yi | (X,Y ) ∈ R] ,
                                                 Y ∈B
                                          qY,i = Pr [xi 6= yi | (X,Y ) ∈ R] .
                                                 X∈A


    √ that qX,i qy,i ≤ α for every (X,Y ) ∈ R and every i ∈ [N] such that xi 6= yi . Then Q(A, B) =
Suppose
Ω(1/ α).
    Using Theorem 2.6, we can prove the following lower bound on Q (A` , A`−1 ).
Lemma 2.7. Let d = kA` − A`−1 k, and assume d ≤ N/2. Then Q(A` , A`−1 ) = Ω( N/d).
                                                                            p

Proof. Let A`−1 = (a1 , a2 , . . .), and let `0 = d`/2e. Then in the transition from A`−1 to A` , we augment
                                        0
or chop various rows by P/2` elements each. Let i (1) , . . . , i (r) be the r rows in A`−1 that get chopped
and let i0 (1) , . . . , i0 (r) be the r rows in A`−1 that get augmented.
     Fix distinct h1 , . . . , hr ∈ [M] and h01 , . . . , h0r ∈ [M]. Also, let us restrict ourselves to inputs such that
for each j ∈ [r], there are exactly ai( j) indices i ∈ [N] satisfying xi = h j and exactly ai0 ( j) indices i ∈ [N]
satisfying xi = h0j . Given inputs X = (x1 , . . . , xN ) in A`−1 and Y = (y1 , . . . , yN ) in A` , we set (X,Y ) ∈ R if
and only if it is possible to transform X to Y in the following way:
                                                  0                                                                 0
  (1) For each j ∈ [r], change exactly P/2` of the xi that are equal to h j to value h0j . (Let d := P/2` be
      the number of changed elements.)

  (2) Swap the d elements of X that were changed in step (1) with any other d elements xi of X, subject
      to the following constraints:
                                                                       ai( j)     N −d
         (a) we do not use xi such that xi = h j for some j and             ` 0 <      ;
                                                                       P/2         3d
                                                                                     0
                                                                       ai( j) − P/2`   N −d
        (b) we do not use xi such that xi = h0j for some j and                  `0   <      .
                                                                            P/2         3d

   The procedure is illustrated pictorially in Figure 2. Note that we can reverse the procedure in a natural
way to go from Y back to X:
                                                  0                                                                 0
  (1) For each j ∈ [r], change exactly P/2` of the xi that are equal to h0j to value h j . (Let d := P/2` .)

  (2) Swap the d elements of X that were changed in step (1) with any d elements xi of X, subject to the
      same constraints as in the step (2) of the X → Y conversion.

    Fix any (X,Y ) ∈ R, and let i ∈ [N] be any index such that xi 6= yi . Then we claim that the parameters
of Theorem 2.6 satisfy either
                                               6d                            6d
                                     qX,i ≤               or       qY,i ≤        .
                                              N −d                          N −d

                       T HEORY OF C OMPUTING, Volume 10 (6), 2014, pp. 133–166                                      148
                             T HE N EED FOR S TRUCTURE IN Q UANTUM S PEEDUPS




Figure 2: In this example, N = 11, r = 2, P/2` = 2, and a1 = a2 = 3. So we transform X to Y by choosing
h1 = 1 and h2 = 2, changing any two elements equal to h1 and any two elements equal to h2 , and then
swapping the four elements that we changed with four unchanged elements.


To see this, let us write qX,i = q0X,i + q00X,i , where q0X,i is the probability that xi is changed in step (1) of the
X → Y conversion and q00X,i is the probability that xi is not changed in step (1), but is swapped with some
changed element in step (2). We also express qY,i in a similar way, with respect to the Y → X conversion.
    We consider two cases. The first case is that xi is one of the “other d elements” with which we
swap the changed elements in step (2) of the X → Y conversion. In this case, q0X,i 6= 0 only if xi = h j
for some j. Then because of the constraint (a), we have q0X,i ≤ 3d/(N − 2d). (The probability that this
                                                                         0
particular xi = h j is chosen for being changed is equal to (P/2` )/ai( j) and, because of (a), this is at most
3d/(N − 2d).)
    We also have
                                                                           d            3d
                         q00X,i = 0Pr xi 6= y0i | X,Y 0 ∈ R ≤
                                                                 
                                                                                   =          ,
                                 Y ∈A`                                 (N − d)/3 N − d
because each of the constraints (a) and (b) eliminates at most (N − d)/3 of the N − d variables xi that are
available for swapping in step (2). Therefore, qX,i = q0X,i + q00X,i ≤ 6d/(N − d).
    The second case is that xi is one of the elements that are changed in step (1) of the X → Y conversion.
Then yi is one of the “other d elements” in step (2) of the Y → X conversion. Similarly to the previous
case, we can show that qY,i ≤ 6d/(N − d).
    Since qX,i ≤ 1 and qY,i ≤ 1, it follows that

                                                               6d
                                                qX,i qY,i ≤        .
                                                              N −d
Thus, by Theorem 2.6,
                                                               r            !      r !
                                                 1                     N −d           N
                      Q(A` , A`−1 ) = Ω √                  =Ω                     =Ω     .
                                         qX,i qY,i                       d            d

    We now consider the case that many elements are chopped. Here we prove a lower bound by reduction
from S ET E QUALITY. Given two sequences of integers Y ∈ [M]N and Z ∈ [M]N , neither with any repeats,
the S ET E QUALITY problem is to decide whether Y and Z are equal as sets or disjoint as sets, under
the promise that one of these is the case. S ET E QUALITY is similar to the collision problem studied by
Aaronson and Shi [4], but it lacks permutation symmetry, making it harder to prove a lower bound by

                      T HEORY OF C OMPUTING, Volume 10 (6), 2014, pp. 133–166                                    149
                                   S COTT A ARONSON AND A NDRIS A MBAINIS

the polynomial method. By combining the collision lower bound with Ambainis’s adversary method,
Midrijanis [25] was nevertheless able to show that
                                                            1/5 !
                                                         N
                              Q (S ET E QUALITY) = Ω                .
                                                       log N

Very recently, and using different ideas, Zhandry [34] managed to improve Midrijanis’s lower bound to
the following:
Theorem 2.8 (Zhandry [34]). Q (S ET E QUALITY) = Ω(N 1/3 ).
    Theorem 2.8 is known to be tight, by the upper bound of Brassard, Høyer, and Tapp [14] mentioned
in Section 1.1.
    We will consider a modification of the S ET E QUALITY problem, which we call 3S ET E QUALITY.
Here we are given three sequences of integers Y, Z,W ∈ [M]N , none of which has any repeats. We are
promised that Y and W are disjoint as sets, and that Z is equal either to Y or to W as a set. The task is to
distinguish between those two cases.
Theorem 2.9. Q(3S ET E QUALITY) = Ω(N 1/3 ).
Proof. The theorem follows from Theorem 2.8 together with the following claim: if 3S ET E QUALITY is
solvable by a quantum algorithm A that uses T queries, then S ET E QUALITY is solvable by a quantum
algorithm that uses O(T ) queries.
    To show this, let Y,W be an instance of S ET E QUALITY. We produce an instance of 3S ET E QUALITY
by choosing Z to be either a randomly permuted version of Y or a randomly permuted version of W . We
then run the algorithm for 3S ET E QUALITY on that instance. If Y and W are disjoint, then the promise of
3S ET E QUALITY is satisfied and the algorithm will find whether we used Y or W to generate Z. If Y = W ,
then using Y and using W results in the same probability distribution for Z; hence no algorithm will be
able to guess whether we used Y or W with probability greater than 1/2.

    We now use Theorem 2.9 to prove another lower bound on Q(A` , A`−1 ).
Lemma 2.10. Suppose A` was formed from A`−1 by chopping r rows. Then Q (A` , A`−1 ) = Ω r1/3 .
                                                                                            

Proof. We will show how to embed a 3S ET E QUALITY instance of size r into the A` versus A`−1 problem.
    Let A`−1 = (a1 , . . . , au ). Also, let i (1) , . . . , i (r) ∈ [u] be the r rows that are chopped in going from
A`−1 to A` , let i0 (1) , . . . , i0 (r) ∈ [u] be the r rows that are augmented, and let j (1) , . . . , j (u − 2r) ∈ [u]
be the u − 2r rows that are left unchanged. Recall that, in going from A`−1 to A` , each row i (k) (or i0 (k))
                                            0
is chopped or augmented by P/2` elements, where `0 = d`/2e.
    Now let Y = (y1 , . . . , yr ), Z = (z1 , . . . , zr ), W = (z1 , . . . , zr ) be an instance of 3S ET E QUALITY. Then
                                                                                                       0
we construct an input X ∈ [M]N as follows. First, for each k ∈ [r], set ai(k) − P/2` of the xi equal to yk ,
          0
set P/2` of the xi equal to zk and set a0i(k) of the the xi equal to wk . Next, let w1 , w2 , . . . ∈ [M] be a list of
numbers that are guaranteed not to be in Y ∪ Z. Then for each k ∈ [u − 2r], set a j(k) of the xi equal to wk .
    It is easy to see that, if Y and Z are equal as sets, then X will have type A`−1 , while if Z and W are
equal as sets, then X will have type A` . So in deciding whether X belongs to A` or A`−1 , we also decide
whether Y = Z or Z = W . The lemma now follows from Theorem 2.9.

                       T HEORY OF C OMPUTING, Volume 10 (6), 2014, pp. 133–166                                       150
                           T HE N EED FOR S TRUCTURE IN Q UANTUM S PEEDUPS

2.4   Putting everything together
Let C be a quantum query algorithm that distinguishes A0 = A∗ from A2L = B∗ , and assume C is optimal:
that is, it makes Q(A∗ , B∗ ) ≤ Q( f ) queries. As mentioned earlier, we can assume that Pr [C accepts X]
depends only on the type of X. Thus, let
                                       p` := Pr [C accepts X ∈ A` ] .
Then by assumption, |p0 − p2L | ≥ 1/3. Now let β` := 1/(10`2 ), and observe that ∑∞      `=1 β` < 1/6. By the
triangle inequality, it follows that there exists an ` ∈ [2L] such that |p` − p`−1 | ≥ β` . In other words, we
get a Q( f )-query quantum algorithm that distinguishes A` from A`−1 with bias β` . By Proposition 2.4,
this immediately implies
                                                                                 Q(A` , A`−1 )
                                                                                              
                                  Q( f )
             Q(A` , A`−1 ) = O                 or equivalently      Q( f ) = Ω                     .
                                   β`                                                  `2
Now let d = kA` − A`−1 k, and suppose A` was produced from A`−1 by chopping r rows. Then d =
     0        0
rP/2` ≤ 2rN/2` where `0 = d`/2e. Combining Lemmas 2.7 and 2.10, we find that
                                                      (r        )!
                                                          N 1/3
                             Q(A` , A`−1 ) = Ω max          ,r
                                                          d
                                               r 0            !
                                                   2`     1/3
                                           =Ω          +r
                                                    r
                                               0 
                                           = Ω 2` /5 ,
                                                             0
since the minimum occurs when r is asymptotically 23` /5 . If `0 ≤ (log2 T ) − 2, then combining Lem-
mas 2.7 and 2.5, we also have the lower bound
                                               s         !
                                                   N              √
                             Q(A` , A`−1 ) = Ω              =  Ω(   T c) .
                                                 4N/T c
Hence                                     √ 
                                         Ω T2 c  if `0 ≤ (log2 T ) − 2 ,
                                Q( f ) =    `0 
                                         Ω 2` /5 if `0 > (log2 T ) − 2 .
Let us now make the choice c = 2/5, so that we get a lower bound of
                                                            !
                                                      T 1/5
                                        Q( f ) = Ω
                                                     log2 T
in either case. Hence T = O(Q( f )5 log10 Q( f )). By Lemma 2.3:
                                       R( f ) = O(T 1+c log T )
                                             = O(T 7/5 log T )
                                             = O(Q( f )7 log15 Q( f )) .
This completes the proof of Theorem 1.6.

                     T HEORY OF C OMPUTING, Volume 10 (6), 2014, pp. 133–166                              151
                                     S COTT A ARONSON AND A NDRIS A MBAINIS

3     Quantum lower bounds under the uniform distribution
                                                            ?
In this section, we consider the problems of P = BQP relative to a random oracle, and of simulating a
T -query quantum algorithm on most inputs using T O(1) classical queries. We show that these problems
are connected to a fundamental conjecture about influences in low-degree polynomials.
    Recall Conjecture 1.7, which said that bounded polynomials have influential variables: that is, for
every degree-d polynomial p : RN → R such that 0 ≤ p(X) ≤ 1 for all X ∈ {0, 1}N , there exists an i ∈ [N]
such that Infi [p] ≥ (Var [p] /d)O(1) , where
                                                    (p(X) − p(X i ))2 ,
                                                                    
                                   Infi [p] := E
                                                       X∈{0,1}N

                                                                   (p(X) − E [p])2 .
                                                                                 
                                         Var [p] :=       E
                                                       X∈{0,1}N

We will show that Conjecture 1.7 has several powerful consequences for quantum complexity theory.
   As a first step, let
                                                                    N
                                                      Inf [p] := ∑ Infi [p]
                                                                  i=1
be the total influence of p. Then we have the following bound, versions of which have long been known
in the analysis of Boolean functions community,8 but which we prove for completeness.
Lemma 3.1 (folklore). Let p : RN → R be a degree-d real polynomial such that 0 ≤ p(X) ≤ 1 for all
X ∈ {0, 1}N . Then Inf [p] ≤ d.
Proof. Let q be the analogue of p in the Fourier representation:
                                                                              
                                                           1 − x1       1 − xN
                           q(x1 , . . . , xN ) := 1 − 2p          ,...,          .
                                                             2             2
Clearly deg(q) = deg(p) = d and −1 ≤ q(X) ≤ 1 for all X ∈ {1, −1}N . Also, defining X i to be X ∈
{1, −1}N with xi negated, and
                                                      1
                                                                   (q(X) − q(X i ))2 ,
                                                                                   
                                        Infi [q] :=        E
                                                      4 X∈{1,−1}N
we have Infi [q] = Infi [p].
   Note that we can express q as
                                               q(X) =           ∑        αS χS (X) ,
                                                         S⊆[N] : |S|≤d

where αS ∈ R and χS (X) := ∏i∈S xi is the Fourier character corresponding to the set S. Furthermore, by
Parseval’s identity,
                                             1
                                  ∑ αS2 = 2N ∑ N q(X)2 ≤ 1 .
                                 |S|≤d                        X∈{1,−1}

    8 For example, Shi [29] proved the bound for the special case of Boolean functions, and generalizing his proof to arbitrary

bounded functions is straightforward.


                         T HEORY OF C OMPUTING, Volume 10 (6), 2014, pp. 133–166                                          152
                            T HE N EED FOR S TRUCTURE IN Q UANTUM S PEEDUPS

Now, in the Fourier representation, it is known that

                                              Infi [q] =           ∑          αS2 .
                                                                |S|≤d : i∈S

Hence
              Inf [p] = Inf [q] = ∑       ∑             αS2 =    ∑ ∑ αS2 = ∑ |S| αS2 ≤ d ∑ αS2 ≤ d
                                 i∈[N] |S|≤d : i∈S              |S|≤d i∈S             |S|≤d   |S|≤d

as claimed.

      We also need the following lemma of Beals et al. [9].

Lemma 3.2 (Beals et al.). Suppose a quantum algorithm Q makes T queries to a Boolean input X ∈
{0, 1}N . Then Q’s acceptance probability is a real multilinear polynomial p(X), of degree at most 2T .

3.1     Consequences of our influence conjecture
We now prove our first consequence of Conjecture 1.7: namely, that it implies the folklore Conjecture 1.5.

Theorem 3.3. Suppose Conjecture 1.7 holds, and let ε, δ > 0. Then given any quantum algorithm
Q that makes T queries to a Boolean input X, there exists a deterministic classical algorithm that
makes poly(T, 1/ε, 1/δ ) queries, and that approximates Q’s acceptance probability to within an additive
constant ε on a 1 − δ fraction of inputs.

Proof. Let p(X) be the probability that Q accepts input X = (x1 , . . . , xN ). Then Lemma 3.2 says that p
is a real polynomial of degree at most 2T . Assume Conjecture 1.7. Then for every such p, there exists
an index i satisfying Infi [p] ≥ w(Var [p] /T ), for some fixed polynomial w. Under that assumption, we
give a classical algorithm C that makes poly(T, 1/ε, 1/δ ) queries to the xi and that approximates p(X)
on most inputs X. In what follows, assume X ∈ {0, 1}N is uniformly random.

           set p0 := p
           for j := 0, 1, 2, . . .:
               if Var [p j ] ≤ ε 2 δ /2
                    output EY ∈{0,1}N− j [p j (Y )] as approximation for p(X) and halt
               else
                    find an i ∈ [N − j] such that Infi [p j ] > w(ε 2 δ /2T )
                    query xi , and let p j+1 : RN− j → R be the polynomial
                           induced by the answer

      When C halts, by assumption Var [p j ] ≤ ε 2 δ /2. By Markov’s inequality, this implies

                                                                              δ
                                        Pr              p j (X) − E [p j ] > ε <
                                    X∈{0,1}  N− j                                2

meaning that when C halts, it succeeds with probability at least 1 − δ /2.

                      T HEORY OF C OMPUTING, Volume 10 (6), 2014, pp. 133–166                         153
                                     S COTT A ARONSON AND A NDRIS A MBAINIS

   On the other hand, suppose Var [p j ] > ε 2 δ /2. Then by Conjecture 1.7, there exists an index i∗ ∈ [N]
such that                                                        2 
                                                  Var [p j ]       ε δ
                              Infi [p j ] ≥ w
                                  ∗                            ≥w       .
                                                     T             2T
Thus, suppose we query xi∗ . Since X is uniformly random, xi∗ will be 0 or 1 with equal probability, even
conditioned on the results of all previous queries. So after the query, our new polynomial p j+1 will satisfy
                                                                             1
                                  Pr p j+1 = p j|xi∗ =0 = Pr p j+1 = p j|xi∗ =1 = ,
                                                                                 2
where p j|xi∗ =0 and p j|xi∗ =1 are the polynomials on N − j − 1 variables obtained from p j by restricting xi∗
to 0 or 1 respectively. Therefore
                                                     1                              
                        E         [Inf [p j+1 ]] =     Inf p j|xi∗ =0 + Inf p j|xi∗ =1
                     xi∗ ∈{0,1}                      2
                                                                                                    !
                                                 1                                              
                                               =             ∑ Infi p j|x =0 + ∑ Infi p j|x =1
                                                                         i∗                   i∗
                                                 2           i6=i∗                i6=i∗

                                               = ∑ Infi [p j ]
                                                     i6=i∗

                                               = Inf [p j ] − Infi∗ [p j ]
                                                                 2 
                                                                   ε δ
                                               ≤ Inf [p j ] − w            .
                                                                    2T
By linearity of expectation, this implies that for all j,
                                                                         2 
                                                                         ε δ
                                        E [Inf [p j ]] ≤ Inf [p0 ] − jw       .
                                     X∈{0,1}N                            2T
But recall from Lemma 3.1 that
                                               Inf [p0 ] ≤ deg (p0 ) ≤ 2T .
It follows that C halts after an expected number of iterations that is at most
                                                Inf [p0 ]     2T
                                                  2
                                                          ≤  2
                                                                      .
                                              w(ε δ /2T ) w(ε δ /2T )
Thus, by Markov’s inequality, the probability (over X) that C has not halted after
                                                               4T
                                                         δ · w(ε 2 δ /2T )
iterations is at most δ /2. Hence by the union bound, the probability over X that C fails is at most
δ /2 + δ /2 = δ . Since each iteration queries exactly one variable and
                                               4T
                                                           = poly(T, 1/ε, 1/δ ) ,
                                         δ · w(ε 2 δ /2T )
this completes the proof.

                     T HEORY OF C OMPUTING, Volume 10 (6), 2014, pp. 133–166                               154
                            T HE N EED FOR S TRUCTURE IN Q UANTUM S PEEDUPS

    An immediate corollary is the following:

Corollary 3.4. Suppose Conjecture 1.7 holds. ThenDε+δ ( f ) ≤ (Qε ( f )/δ )O(1) for all Boolean functions
f and all ε, δ > 0.

Proof. Let Q be a quantum algorithm that evaluates f (X), with bounded error, on a 1 − ε fraction of
inputs X ∈ {0, 1}N . Let p(X) := Pr [Q accepts X]. Now run the classical simulation algorithm C from
Theorem 3.3, to obtain an estimate pe(X) of p(X) such that
                                                             
                                                           1
                                  Pr     | pe(X) − p(X)| ≤      ≥ 1−δ .
                               X∈{0,1}N                    10

Output f (X) = 1 if pe(X) ≥ 1/2 and f (X) = 0 otherwise. By the theorem, this requires poly(T, 1/δ )
queries to X, and by the union bound it successfully computes f (X) on at least a 1 − ε − δ fraction of
inputs X.

    We also get the following complexity-theoretic consequence:

Theorem 3.5. Suppose Conjecture 1.7 holds. Then P = P#P implies BQPA ⊂ AvgPA with probability 1
for a random oracle A.

Proof. Let Q be a polynomial-time quantum Turing machine that queries an oracle A, and assume
Q decides some language   L ∈ BQPA with bounded error. Given an input x ∈ {0, 1}n , let px (A) :=
Pr Q (x) accepts . Then clearly px (A) depends only on some finite prefix B of A, of size N = 2poly(n) .
     A
                 

Furthermore, Lemma 3.2 implies that px is a polynomial in the bits of B, of degree at most poly(n).
    Assume Conjecture 1.7 as well as P = P#P . Then we claim that there exists a deterministic polynomial-
time algorithm C such that for all Q and x ∈ {0, 1}n ,
                                                               
                                                             1      1
                                    Pr | pex (A) − px (A)| >      < 3,                               (3.1)
                                    A                        10    n

where pex (A) is the output of C given input x and oracle A. This C is essentially just the algorithm from
Theorem 3.3. The key point is that we can implement C using not only poly(n) queries to A, but also
poly(n) computation steps.
    To prove the claim, let M be any of the 2poly(n) monomials in the polynomial p j from Theorem 3.3,
and let αM be the coefficient of M. Then notice that αM can be computed to poly(n) bits of precision in
P#P , by the same techniques used to show BQP ⊆ P#P [12]. Therefore the expectation
                                                                     αM
                                              E         [p j (Y )] = ∑|M|
                                         Y ∈{0,1}N− j              M 2

can be computed in P#P as well. The other two quantities that arise in the algorithm—Var [p j ] and
Infi [p j ]—can also be computed in P#P , since they are simply sums of squares of differences of the values
p j (X). This means that finding an i such that Infi [p j ] > w(ε 2 δ /T ) is in NP#P . But under the assumption
that P = P#P , we have P = NP#P as well. Therefore all of the computations needed to implement C take
polynomial time.

                     T HEORY OF C OMPUTING, Volume 10 (6), 2014, pp. 133–166                                155
                                 S COTT A ARONSON AND A NDRIS A MBAINIS

    Now let δn (A) be the fraction of inputs x ∈ {0, 1}n such that | pex (A) − px (A)| > 1/10. Then by (3.1)
together with Markov’s inequality,                      
                                                       1      1
                                         Pr δn (A) >       < 2.
                                          A            n     n
Since ∑∞        2
        n=1 1/n converges, it follows that δn (A) ≤ 1/n for all but finitely many values of n, with
probability 1 over A. Assuming this occurs, we can simply hardwire the behavior of Q on the remaining
values n into our classical simulation procedure C. Hence L ∈ AvgPA .
    Since the number of BQPA languages is countable, the above implies that L ∈ AvgPA for every
L ∈ BQPA simultaneously (that is, BQPA ⊂ AvgPA ) with probability 1 over A.

   As a side note, suppose we had an extremely strong variant of Conjecture 1.7, one that implied
something like                                          
                                                      1        1
                             Pr | pex (A) − px (A)| >      <        .
                              A                       10     exp(n)
in place of (3.1). Then we could eliminate the need for AvgP in Theorem 3.5, and show that P = P#P
implies PA = BQPA with probability 1 for a random oracle A.

3.2   Unconditional results
We conclude this section with some unconditional results. These results will use Theorem 1.10 of Dinur
et al. [19]: that for every degree-d polynomial p : RN → R such that 0 ≤ p(X) ≤ 1 for all X ∈ {0, 1}N ,
there exists a polynomial pe depending on at most 2O(d) /ε 2 variables such that k pe − pk22 ≤ ε, where
                                        kpk22 := E        p(X)2 .
                                                               
                                                     X∈{0,1}N

Theorem 1.10 has the following simple corollary.
Corollary 3.6. Suppose a quantum algorithm Q makes T queries to a Boolean input X ∈ {0, 1}N . Then
for all α, δ > 0, we can approximate Q’s acceptance probability to within an additive constant α, on
a 1 − δ fraction of inputs, by making 2O(T ) /(α 4 δ 4 ) deterministic classical queries to X. (Indeed, the
classical queries are nonadaptive.)
Proof. Let p(X) := Pr [Q accepts X]. Then p is a degree-2T real polynomial by Lemma 3.2. Hence, by
Theorem 1.10, there exists a polynomial pe, depending on K = 2O(T ) /(α 4 δ 4 ) variables xi1 , . . . , xiK , such
that
                                           ( pe(X) − p(X))2 ≤ α 2 δ 2 .
                                                          
                                    E
                                     X∈{0,1}N

By the Cauchy-Schwarz inequality, then,
                                           E      [| pe(X) − p(X)|] ≤ αδ ,
                                       X∈{0,1}N

so by Markov’s inequality
                                         Pr      [| pe(X) − p(X)| > α] < δ .
                                      X∈{0,1}N

Thus, our algorithm is simply to query xi1 , . . . , xiK , and then output pe(X) as our estimate for p(X).

                      T HEORY OF C OMPUTING, Volume 10 (6), 2014, pp. 133–166                                 156
                            T HE N EED FOR S TRUCTURE IN Q UANTUM S PEEDUPS

    Likewise:
Corollary 3.7. Dε+δ ( f ) ≤ 2O(Qε ( f )) /δ 4 for all Boolean functions f and all ε, δ > 0.
Proof. Set α to any constant less than 1/6, then use the algorithm of Corollary 3.6 to simulate the
ε-approximate quantum algorithm for f . Output f (X) = 1 if pe(X) ≥ 1/2 and f (X) = 0 otherwise.

    Given an oracle A, let BQPA[log] be the class of languages decidable by a BQP machine able to make
O (log n) queries to A. Also, let AvgPA|| be the class of languages decidable, with probability 1 − o (1)
over x ∈ {0, 1}n , by a P machine able to make poly(n) parallel (nonadaptive) queries to A. Then we get
the following unconditional variant of Theorem 3.5.
Theorem 3.8. Suppose P = P#P . Then BQPA[log] ⊂ AvgPA|| with probability 1 for a random oracle A.
Proof. The proof is essentially the same as that of Theorem 3.5, except that we use Corollary 3.6 in place
of Conjecture 1.7. In the proof of Corollary 3.6, observe that the condition

                                           E       [| pe(X) − p(X)|] ≤ αδ
                                        X∈{0,1}N

implies                                                           
                                          E           pµ (X) − p(X) ≤ αδ                                    (3.2)
                                       X∈{0,1}N

as well, where pµ (X) equals the mean of p(Y ) over all inputs Y that agree with X on xi1 , . . . , xiK . Thus,
given a quantum algorithm that makes T queries to an oracle string, the computational problem that we
need to solve boils down to finding a subset of the oracle bits xi1 , . . . , xiK such that K = 2O(T ) /(α 4 δ 4 )
and (3.2) holds. Just like in Theorem 3.5, this problem is solvable in the counting hierarchy CH =
          #P
P#P ∪ P#P ∪ · · · . So if we assume P = P#P , then it is also solvable in P.
    In Theorem 3.5, the conclusion we got was BQPA ⊂ AvgPA with probability 1 for a random oracle A.
In our case, the number of classical queries K is exponential (rather than polynomial) in the number of
quantum queries T , so we only get BQPA[log] ⊂ AvgPA . On the other hand, since the classical queries are
nonadaptive, we can strengthen the conclusion to BQPA[log] ⊂ AvgPA|| .


4    Open problems
It would be nice to improve the R( f ) = O(Q( f )7 polylog Q( f )) bound for all symmetric problems. As
mentioned earlier, we conjecture that the right answer is R( f ) = O(Q( f )2 ). In trying to improve our
lower bound, it seems best to avoid the use of S ET E QUALITY. After all, it is a curious feature of our
proof that, to get a lower bound for symmetric problems, we need to reduce from the non-symmetric
S ET E QUALITY problem!
    Another problem is to remove the assumption M ≥ N in our lower bound for symmetric problems.
Experience with related problems strongly suggests that this can be done, but one might need to replace
our chopping procedure by something different.
    We also conjecture that R( f ) ≤ Q( f )O(1) for all partial functions f that are symmetric only under
permuting the inputs (and not necessarily the outputs). Proving this seems to require a new approach.

                      T HEORY OF C OMPUTING, Volume 10 (6), 2014, pp. 133–166                                157
                                   S COTT A ARONSON AND A NDRIS A MBAINIS

Another problem, in a similar spirit, is whether R( f ) ≤ Q( f )O(1) for all partial functions f : S → {0, 1}
such that S (i. e., the promise on inputs) is symmetric, but f itself need not be symmetric.
    It would be interesting to reprove the R( f ) ≤ Q( f )O(1) bound using only the polynomial method,
and not the adversary method. Or, to rephrase this as a purely classical question: for all X = (x1 , . . . , xN )
in [M]N , let BX be the N × M matrix whose (i, j)th entry is 1 if xi = j and 0 otherwise. Then given a
set S ⊆ [M]N and a function f : S → {0, 1}, let deg(
                                                   f f ) be the minimum degree of a real polynomial
     MN
p : R → R such that
    (i) 0 ≤ p(BX ) ≤ 1 for all X ∈ [M]N , and
                              1
    (ii) |p(BX ) − f (X)| ≤     for all X ∈ S.
                              3
                                  f f )O(1) for all permutation-invariant functions f ?
Then is it the case that R( f ) ≤ deg(
    On the random oracle side, the obvious problem is to prove Conjecture 1.7—thereby establishing that
Dε ( f ) and Qδ ( f ) are polynomially related, and all the other consequences shown in Section 3. Alterna-
tively, one could look for some technique that was tailored to polynomials p that arise as the acceptance
probabilities of quantum algorithms. In this way, one could conceivably solve Dε ( f ) versus Qδ ( f ) and
the other quantum problems, without settling the general conjecture about bounded polynomials.


5      Appendix: The Boolean case
Given a partial Boolean function f : {0, 1}N → {0, 1, ∗}, call f symmetric if f (X) depends only on the
Hamming weight |X| := x1 + · · · + xN . For completeness, in this appendix we prove the following basic
fact:
Theorem 5.1. R( f ) = O(Q( f )2 ) for every partial symmetric Boolean function f .
    For total symmetric Boolean functions, Theorem 5.1 was already shown by Beals et al. [9], using an
approximation theory result of Paturi [28]. Indeed, in the total case one even has D( f ) = O(Q( f )2 ). So
the new twist is just that f can be partial.
    Abusing notation, let f (k) ∈ {0, 1, ∗} be the value of f on all inputs of Hamming weight k (where as
usual, ∗ means ‘undefined’). Then we have the following quantum lower bound:
Lemma 5.2.            f (a) = 0 and f (b) = 1 or vice versa, where a < b and a ≤ N/2. Then
          √ Suppose that
Q( f ) = Ω bN/(b − a) .
Proof. This follows from a straightforward application of Ambainis’s adversary theorem (Theorem 2.6).
Specifically, let A, B ⊆ {0, 1}N be the sets of all strings of Hamming weights a and b respectively, and for
all X ∈ A and Y ∈ B, put (X,Y ) ∈ R if and only if X  Y (that is, xi ≤ yi for all i ∈ [N]). Then
                                            r                  !      √ !
                                               N −a        b            bN
                               Q( f ) = Ω               ·        =Ω          .
                                                b−a b−a               b−a

Alternatively, this lemma can be proved using the approximation theory result of Paturi [28], following
Beals et al. [9].

                      T HEORY OF C OMPUTING, Volume 10 (6), 2014, pp. 133–166                               158
                           T HE N EED FOR S TRUCTURE IN Q UANTUM S PEEDUPS
                                                                         p
   In particular, if we set β := b/N and ε := (b − a)/N, then Q( f ) = Ω( β /ε). On the other hand,
we also have the following randomized upper bound, which follows from a Chernoff bound (similar to
Lemma 2.2):

Lemma 5.3. Assume β > ε > 0. By making O(β /ε 2 ) queries to an N-bit string X, a classical sampling
algorithm can estimate the fraction β := |X| /N of 1 bits to within an additive error ±ε/3, with success
probability at least 2/3.

    Thus, assume the function f is non-constant, and let
                                                                √
                                                                  bN
                                          γ :=      max               .                                (5.1)
                                               f (a)=0, f (b)=1 b − a

Assume without loss of generality that the maximum of (5.1) is achieved when a < b and a ≤ N/2, if
necessary by applying the transformations f (X) → 1 − f (X) and f (X) → f (N − X). Now consider the
following randomized algorithm to evaluate f , which makes T := O(γ 2 ) queries:

         Choose indices i1 , . . . , iT ∈ [N] uniformly at random with replacement
         Query xi1 , . . . , xiT
         Set k := NT (xi1 + · · · + xiT )
                                                                                           √
                                                                                             bN
         If there exists a b ∈ {0, . . . , N} such that f (b) = 1 and |k − b| ≤             3γ
             output f (X) = 1
         Otherwise output f (X) = 0

    By Lemma 5.3, the above algorithm succeeds with probability at least 2/3, provided we choose T
suitably large. Hence R( f ) = O(γ 2 ). On the other hand, Lemma 5.2 implies that Q( f ) = Ω(γ). Hence
R( f ) = O(Q( f )2 ), completing the proof of Theorem 5.1.


6    Appendix: 1-norm versus 2-norm
As mentioned in Section 1.3, in the original version of this paper we stated Conjecture 1.7, and all our
results assuming it, in terms of L1 -influences rather than L2 -influences. Subsequently, Arturs Bačkurs
discovered a gap in our L1 -based argument. In recent work, Bačkurs and Bavarian [8] managed to fill
the gap, allowing our L1 -based argument to proceed. Still, the simplest fix for the problem Bačkurs
uncovered is just to switch from L1 -influences to L2 -influences, so that is what we did in Section 3 (and
in our current statement of Conjecture 1.7).
     Fortunately, it turns out that the L1 and L2 versions of Conjecture 1.7 are equivalent, so making this
change does not even involve changing our conjecture. For completeness, in this appendix we prove the
equivalence of the L1 and L2 versions of Conjecture 1.7.
     As usual, let p : {0, 1}N → [0, 1] be a real polynomial, let X ∈ {0, 1}N , and let X i denote X with the
ith bit flipped. Then the L1 -variance Vr [p] of p and the L1 -influence Infi [p] of the ith variable xi are

                     T HEORY OF C OMPUTING, Volume 10 (6), 2014, pp. 133–166                             159
                                     S COTT A ARONSON AND A NDRIS A MBAINIS

defined as follows:

                                          Vr [p] :=        E         [|p(X) − E [p]|] ,
                                                        X∈{0,1}N

                                         Inf1i [p] :=                    p(X) − p(X i ) .
                                                                                      
                                                           E
                                                                 N
                                                        X∈{0,1}

The L1 analogue of Conjecture 1.7 simply replaces Var [p] by Vr [p] and Infi [p] by Inf1i [p]:
Conjecture 6.1 (Bounded Polynomials Have Influential Variables, L1 Version). Let p : RN → R be a
degree-d real polynomial such that 0 ≤ p(X) ≤ 1 for all X ∈ {0, 1}N . Then there exists an i ∈ [N] such
that Inf1i [p] ≥ (Vr [p] /d)O(1) .
    We now prove the equivalence:
Proposition 6.2. Conjectures 1.7 and 6.1 are equivalent.
Proof. First assume Conjecture 1.7. By the Cauchy-Schwarz inequality,
                                                                                                         !2
                                    (p(X) − p(X i ))2 ≥                               p(X) − p(X i )          = Inf1i [p]2 .
                                                                                                     
           Infi [p] =      E                                              E
                        X∈{0,1}N                                     X∈{0,1}N

Also, since p(X) ∈ [0, 1],

                                                                                   (p(X) − E [p])2 = Var [p] .
                                                                                                 
                Vr [p] =         E       [|p(X) − E [p]|] ≥              E
                                     N                                        N
                            X∈{0,1}                               X∈{0,1}

Hence there exists an i ∈ [N] such that
                                                                        O(1)                      O(1)
                                                               Vr [p]                      Var [p]
                               Infi [p] ≥ Inf1i [p]2 ≥                             ≥
                                                                d                            d
and Conjecture 6.1 holds.
    Likewise, assume Conjecture 6.1. Then we have Inf1i [p] ≥ Infi [p] since p(X) ∈ [0, 1], and Var [p] ≥
Vr [p]2 by the Cauchy-Schwarz inequality. Hence there exists an i ∈ [N] such that
                                                                        O(1)                     O(1)
                                                              Var [p]                      Vr [p]
                               Inf1i [p] ≥ Infi [p] ≥                              ≥
                                                                d                           d
and Conjecture 1.7 holds.


7    Appendix: Equivalent form of Conjecture 1.5
Recall Conjecture 1.5, which said (informally) that any quantum algorithm that makes T queries to
X ∈ {0, 1}N can be simulated to within ±ε additive error on a 1 − δ fraction of the inputs X by a classical
algorithm that makes poly(T, 1/ε, 1/δ ) queries. In Section 1.1, we claimed that Conjecture 1.5 was
equivalent to an alternative conjecture, which we now state more formally:

                        T HEORY OF C OMPUTING, Volume 10 (6), 2014, pp. 133–166                                                160
                               T HE N EED FOR S TRUCTURE IN Q UANTUM S PEEDUPS

Conjecture 7.1. Let S ⊆ {0, 1}N with |S| ≥ c2N , and let f : S → {0, 1}. Then there exists a deterministic
classical algorithm that makes poly(Q( f ), 1/α, 1/c) queries, and that computes f (X) on at least a 1 − α
fraction of X ∈ S.
    In this appendix, we justify the equivalence claim. We first need a simple combinatorial lemma.
Lemma 7.2. Suppose we are trying to learn an unknown real p ∈ [0, 1]. There are k “hint bits” h1 , . . . , hk ,
where each hi is 0 if (i − 1) /k ≤ p or 1 if i/k ≥ p (and can otherwise be arbitrary). However, at most
b < k/2 of the hi are then corrupted by an adversary, producing the new string h01 , . . . , h0k . Using h01 , . . . , h0k ,
one can still determine p to within additive error ±(b + 1)/k.
Proof. Given the string h0 = h01 , . . . , h0k , we apply the following correction procedure: we repeatedly
                                              

search for pairs i < j such that h0i = 1 and h0j = 0, and “delete” those pairs (that is, we set h0i = h0j = ∗,
where ∗ means “unknown”). We continue for t steps, until no more such pairs exist. Next, we delete the
rightmost b − t zeroes in h0 (replacing them with ∗s), and likewise delete the leftmost b − t ones. Finally,
as our estimate for p, we output
                                                      i∗ + j ∗ − 1
                                                 q :=              ,
                                                          2k
where i∗ is the index of the rightmost 0 remaining in h0 (or i∗ = 0 if no 0’s 0s remain), and j∗ is the index
of the leftmost 1 remaining (or j∗ = k + 1 if no 1s remain).
    To show correctness: every time we find an i < j pair such that h0i = 1 and h0j = 0, at least one of
hi and h0j must have been corrupted by the adversary. It follows that t ≤ b, where t is the number of
  0

deleted pairs. Furthermore, after the first stage finishes, every 1 is to the right of every 0, at most b − t of
the remaining bits are corrupted, and the bits that are corrupted must be among the rightmost zeroes or
the leftmost ones (or both). Hence, after the second stage finishes, every h0i = 0 reliably indicates that
p ≥ (i − 1) /k, and every h0j = 1 reliably indicates that p ≤ j/k. Moreover, since only 2b bits are deleted
in total, we must have j∗ − i∗ ≤ 2b + 1, where i∗ and j∗ are as defined above. It follows that
                                            j∗ − i∗ + 1 b + 1
                                           |p − q| ≤   ≤      .
                                                2k        k
Theorem 7.3. Conjectures 1.5 and 7.1 are equivalent.
Proof. We start with the easy direction, that Conjecture 1.5 implies Conjecture 7.1. Given f : S → {0, 1}
with |S| ≥ c2N , let Q be a quantum algorithm that evaluates f with error probability at most 1/3 using
T queries. Let p(X) be Q’s acceptance probability on a given input X ∈ {0, 1}N (not necessarily in S).
Then by Conjecture 1.5, there exists a deterministic classical algorithm that approximates p(X) to within
additive error ±ε on a 1 − δ fraction of X ∈ {0, 1}N using poly(T, 1/ε, 1/δ ) queries. If we set (say)
ε := 1/7 and δ := αc, then such an approximation lets us decide whether f (X) = 0 or f (X) = 1 for a
1 − α fraction of X ∈ S, using poly(T, 1/α, 1/c) queries.
    We now show the other direction, that Conjecture 7.1 implies Conjecture 1.5. Let Q be a T -
query quantum algorithm, let p(X) be Q’s acceptance probability on input X, and suppose we want to
approximate p(X) to within error ±ε on at least a 1 − δ fraction of X ∈ {0, 1}N . Let ε := ε/3. Assume for
simplicity that ε has the form 1/k for some positive integer k; this will have no effect on the asymptotics.
For each j ∈ [k], let                                                   
                                                     j−1               j
                                S j := X : p(X) ≤          or p(X) ≥       ,
                                                      k                k

                        T HEORY OF C OMPUTING, Volume 10 (6), 2014, pp. 133–166                                        161
                                S COTT A ARONSON AND A NDRIS A MBAINIS

and define the function f j : S j → {0, 1} by
                                               (
                                                0 if p(X) ≤ ( j − 1) /k ,
                                    f j (X) :=
                                                1 if p(X) ≥ j/k .

By Proposition 2.4, we have Q( f j ) = O(kT ) for all j ∈ [k]. Also, note that
                                                             
                                                          1 n
                                          E Sj         ≥ 1−     2 .
                                           j                k

By Markov’s inequality, this implies that there can be at most one j ∈ [k] (call it j∗ ) such that S j < 2n−2 .
Likewise, note that for every X ∈ {0, 1}N , there is at most one j ∈ [k] such that X ∈  / S j.
      Together with Conjecture 7.1, the above facts imply that, for all j 6= j∗ and α > 0, there exists a
deterministic classical algorithm A j,α , making poly(T, 1/α) queries, that computes f j (X) on at least
a 1 − α fraction of all X ∈ S j . Suppose we run A j,α for all j 6= j∗ . Then by the union bound, for at
least a 1 − kα fraction of X ∈ {0, 1}N , there can be at most two j ∈ [k] such that A j,α fails to compute
 f j (X): namely, j∗ , and the unique j (call it j0 ) such that X ∈
                                                                  / S j0 . Thus, suppose A j,α succeeds for
all j ∈/ { j∗ , j0 }. By Lemma 7.2, this implies that p(X) has been determined up to an additive error of
±3ε = ±ε. Hence, we simply need to set α := δ /k, in order to get a classical algorithm that makes
k · poly(T, k/δ ) = poly(T, 1/ε, 1/δ ) queries, and that approximates p(X) up to additive error ±ε for at
least a 1 − δ fraction of X ∈ {0, 1}N .


8    Acknowledgments
We thank Aleksandrs Belovs, Andy Drucker, Ryan O’Donnell, and Ronald de Wolf for helpful discussions;
Mark Zhandry for taking up our challenge to improve the lower bound on Q (S ET E QUALITY) to the
optimal Ω(N 1/3 ); and Dana Moshkovitz for suggesting a proof of Lemma 7.2. We especially thank Artūrs
Bačkurs, Jānis Iraids, the attendees of the quantum computing reading group at the University of Latvia,
and the anonymous reviewers for their feedback, and for catching some errors in earlier versions of this
paper.


References
 [1] S COTT A ARONSON: Quantum lower bound for the collision problem. In Proc. 34th STOC, pp.
     635–642. ACM Press, 2002. [doi:10.1145/509907.509999] 136, 163

 [2] S COTT A ARONSON: BQP and the polynomial hierarchy. In Proc. 42nd STOC, pp. 141–150. ACM
     Press, 2010. See expanded version in ECCC 2009. [doi:10.1145/1806689.1806711] 135, 140

 [3] S COTT A ARONSON AND A NDRIS A MBAINIS: The need for structure in quantum speedups. In
     Proc. 2nd Innovations in Computer Science Conference (ICS’11), pp. 338–352, 2011. (Tsinghua).
     [arXiv:0911.0996] 138, 140

                     T HEORY OF C OMPUTING, Volume 10 (6), 2014, pp. 133–166                              162
                         T HE N EED FOR S TRUCTURE IN Q UANTUM S PEEDUPS

 [4] S COTT A ARONSON AND YAOYUN S HI: Quantum lower bounds for the collision and the element
     distinctness problems. J. ACM, 51(4):595–605, 2004. Preliminary versions by Aaronson in STOC’02
     (item [1]) and by Shi in FOCS’02. [doi:10.1145/1008731.1008735] 136, 138, 149

 [5] D ORIT A HARONOV, VAUGHAN J ONES , AND Z EPH L ANDAU: A polynomial quantum algorithm
     for approximating the jones polynomial. Algorithmica, 55(3):395–421, 2009. Preliminary version
     in STOC’06. [doi:10.1007/s00453-008-9168-0] 134

 [6] A NDRIS A MBAINIS: Quantum lower bounds by quantum arguments. J. Comput. Syst. Sci.,
     64(4):750–767, 2002. Preliminary version in STOC’00. [doi:10.1006/jcss.2002.1826] 138, 147,
     148

 [7] A NDRIS A MBAINIS AND RONALD DE W OLF: Average-case quantum query complexity. Journal
     of Physics A: Mathematical and General, 34(35):6741, 2001. Preliminary version in STACS’00.
     [doi:10.1088/0305-4470/34/35/302, arXiv:quant-ph/9904079] 137

 [8] A RT ŪRS BAČKURS AND M OHAMMAD BAVARIAN: On the sum of L1 influences. In Proc. 29th
     IEEE Conf. on Computational Complexity (CCC’14). IEEE Comp. Soc. Press, 2014. ECCC 2014.
     [doi:10.1109/CCC.2014.21, arXiv:1302.4625] 140, 159

 [9] ROBERT B EALS , H ARRY B UHRMAN , R ICHARD C LEVE , M ICHELE M OSCA , AND RONALD
     DE W OLF : Quantum lower bounds by polynomials. J. ACM, 48(4):778–797, 2001. Preliminary
     version in FOCS’98. [doi:10.1145/502090.502097] 134, 135, 137, 139, 153, 158

[10] J ONATHAN N IEL DE B EAUDRAP, R ICHARD C LEVE , AND J OHN WATROUS: Sharp quantum versus
     classical query complexity separations. Algorithmica, 34(4):449–461, 2002. [doi:10.1007/s00453-
     002-0978-1] 135

[11] C HARLES H. B ENNETT, E THAN B ERNSTEIN , G ILLES B RASSARD , AND U MESH V. VAZIRANI:
     Strengths and weaknesses of quantum computing. SIAM J. Comput., 26(5):1510–1523, 1997.
     [doi:10.1137/S0097539796300933] 134

[12] E THAN B ERNSTEIN AND U MESH V. VAZIRANI: Quantum complexity theory. SIAM J. Comput.,
     26(5):1411–1473, 1997. Preliminary version in STOC’93. [doi:10.1137/S0097539796300921] 137,
     155

[13] G ILLES B RASSARD , P ETER H ØYER , M ICHELE M OSCA , AND A LAIN TAPP: Quantum amplitude
     amplification and estimation. In S. J. L OMONACO AND H. E. B RANDT, editors, Quantum
     Computation and Information, volume 305 of Contemporary Mathematics Series. Amer. Math. Soc.,
     2002. [doi:10.1090/conm/305, arXiv:quant-ph/0005055] 144

[14] G ILLES B RASSARD , P ETER H ØYER , AND A LAIN TAPP: Quantum cryptanalysis of hash and claw
     free functions. ACM SIGACT News, 28(2):14–19, 1997. [doi:10.1145/261342.261346, arXiv:quant-
     ph/9705002] 136, 150

[15] H ARRY B UHRMAN AND RONALD DE W OLF: Complexity measures and decision tree complexity:
     a survey. Theor. Comput. Sci., 288(1):21–43, 2002. [doi:10.1016/S0304-3975(01)00144-X] 135

                   T HEORY OF C OMPUTING, Volume 10 (6), 2014, pp. 133–166                      163
                             S COTT A ARONSON AND A NDRIS A MBAINIS

[16] H ARRY B UHRMAN , L ANCE F ORTNOW, I LAN N EWMAN , AND H EIN R ÖHRIG: Quantum prop-
     erty testing. SIAM J. Comput., 37(5):1387–1400, 2008. Preliminary version in SODA’03.
     [doi:10.1137/S0097539704442416] 137

[17] W IM VAN DAM , S EAN H ALLGREN , AND L AWRENCE I P: Quantum algorithms for some hid-
     den shift problems. SIAM J. Comput., 36(3):763–778, 2006. Preliminary version in SODA’03.
     [doi:10.1137/S009753970343141X] 134

[18] DAVID D EUTSCH AND R ICHARD J OZSA: Rapid solution of problems by quantum computation.
     Proc. Roy. Soc. A, 439(1907):553–558, 1992. [doi:10.1098/rspa.1992.0167] 136

[19] I RIT D INUR , E HUD F RIEDGUT, G UY K INDLER , AND RYAN O’D ONNELL: On the Fourier tails
     of bounded functions over the discrete cube. Israel J. Math., 160(1):389–412, 2007. Preliminary
     version in STOC’06. [doi:10.1007/s11856-007-0068-9] 139, 156

[20] L ANCE F ORTNOW AND J OHN D. ROGERS: Complexity limitations on quantum compu-
     tation.   J. Comput. Sys. Sci., 59(2):240–252, 1999. Preliminary version in CCC’98.
     [doi:10.1006/jcss.1999.1651, arXiv:cs/9811023] 137

[21] L OV K. G ROVER: A fast quantum mechanical algorithm for database search. In Proc. 28th STOC,
     pp. 212–219. ACM Press, 1996. [doi:10.1145/237814.237866, arXiv:quant-ph/9605043] 134

[22] A RAM H ARROW, AVINATAN H ASSIDIM , AND S ETH L LOYD: Quantum algorithm for solving
     linear systems of equations. Phys. Rev. Lett., 15, 2009. [doi:10.1103/PhysRevLett.103.150502,
     arXiv:0811.3171] 134

[23] E DITH H EMASPAANDRA , L ANE A. H EMASPAANDRA , AND M ARIUS Z IMAND: Almost-
     everywhere superiority for quantum polynomial time. Inform. and Comput., 175(2):171–181,
     2002. [doi:10.1006/inco.2001.3110, arXiv:quant-ph/9910033] 137

[24] J EFF K AHN , M ICHAEL E. S AKS , AND C LIFFORD D. S MYTH: A dual version of Reimer’s
     inequality and a proof of Rudich’s conjecture. In Proc. 15th IEEE Conf. on Computational
     Complexity (CCC’00), pp. 98–103. IEEE Comp. Soc. Press, 2000. [doi:10.1109/CCC.2000.856739]
     138

[25] G ATIS M IDRIJ ĀNIS: A polynomial quantum query lower bound for the set equality problem. In
     Proc. 31th Internat. Colloq. on Automata, Languages and Programming (ICALP’04), pp. 996–1005.
     Springer, 2004. [doi:10.1007/978-3-540-27836-8_83] 138, 150

[26] A SHLEY M ONTANARO: Some applications of hypercontractive inequalities in quantum information
     theory. J. Math. Phys., 53(12), 2012. [doi:10.1063/1.4769269, arXiv:1208.0161] 140

[27] RYAN O’D ONNELL , M ICHAEL E. S AKS , O DED S CHRAMM , AND ROCCO A. S ERVEDIO: Every
     decision tree has an influential variable. In Proc. 46th FOCS, pp. 31–39. IEEE Comp. Soc. Press,
     2005. [doi:10.1109/SFCS.2005.34, arXiv:cs/0508071] 139

                   T HEORY OF C OMPUTING, Volume 10 (6), 2014, pp. 133–166                       164
                         T HE N EED FOR S TRUCTURE IN Q UANTUM S PEEDUPS

[28] RYAN O’D ONNELL AND ROCCO A. S ERVEDIO: New degree bounds for polynomial threshold func-
     tions. Combinatorica, 30(3):327–358, 2010. Preliminary version in STOC’03. [doi:10.1007/s00493-
     010-2173-3] 158

[29] YAOYUN S HI: Lower bounds of quantum black-box complexity and degree of approximat-
     ing polynomials by influence of Boolean variables. Inform. Proc. Lett., 75(1-2):79–83, 2000.
     [doi:10.1016/S0020-0190(00)00069-7, arXiv:quant-ph/9904107] 152

[30] P ETER W. S HOR: Polynomial-time algorithms for prime factorization and discrete logarithms on a
     quantum computer. SIAM J. Comput., 26(5):1484–1509, 1997. Preliminary version in FOCS’94.
     [doi:10.1137/S0097539795293172] 134, 135

[31] DANIEL R. S IMON: On the power of quantum computation. SIAM J. Comput., 26(5):1474–1483,
     1997. Preliminary version at FOCS’94. [doi:10.1137/S0097539796298637] 135

[32] C LIFFORD D. S MYTH: Reimer’s inequality and Tardos’ conjecture. In Proc. 34th STOC, pp.
     218–221. ACM Press, 2002. [doi:10.1145/509907.509942] 137, 138

[33] H ENRY Y UEN: A quantum lower bound for distinguishing random functions from random permuta-
     tions. Quantum Information & Computation, 14(13 & 14), 2014. QIC-online. [arXiv:1310.2885]
     140

[34] M ARK Z HANDRY: A note on the quantum collision and set equality problems.                2013.
     [arXiv:1312.1027] 138, 140, 150


AUTHORS

      Scott Aaronson
      associate professor
      Massachusetts Institute of Technology
      Cambridge, MA
      aaronson csail mit edu
      http://www.scottaaronson.com


      Andris Ambainis
      professor
      University of Latvia and
      IAS, Princeton
      ambainis lu lv




                   T HEORY OF C OMPUTING, Volume 10 (6), 2014, pp. 133–166                       165
                           S COTT A ARONSON AND A NDRIS A MBAINIS

ABOUT THE AUTHORS

    S COTT A ARONSON received his bachelor’s degree from Cornell University and his Ph. D.
       from UC Berkeley. He is known for his blog and for founding the Complexity Zoo. He
       publishes often in Theory of Computing.


    A NDRIS A MBAINIS received his Ph. D. from UC Berkeley and is now Professor at the
       University of Latvia. His research interests include many areas of quantum computing
       and quantum information theory (quantum algorithms, quantum complexity theory,
       quantum cryptography, pseudorandom quantum states, etc.), as well as the classical
       theory of computation.




                 T HEORY OF C OMPUTING, Volume 10 (6), 2014, pp. 133–166                      166