                           Theory of Computing, Volume 19 (11), 2023, pp. 1–14
                                        www.theoryofcomputing.org




               A Strong XOR Lemma for Randomized Query Complexity

         Joshua Brody      Jae Tak Kim      Peem Lerdputtipongporn      Hariharan Srinivasulu
              Received August 3, 2020; Revised December 30, 2023; Published December 31, 2023




       Abstract. We give a strong direct sum theorem for computing XORπ‘˜ β—¦ 𝑔, the XOR
       of π‘˜ instances of the partial Boolean function 𝑔. Specifically, we show that for every
       𝑔 and every π‘˜ β‰₯ 2, the randomized query complexity of computing the XOR of π‘˜
       instances of 𝑔 satisfies RΜ„πœ€(XORπ‘˜ β—¦ 𝑔) = Θ(π‘˜ Β· RΜ„πœ€/π‘˜(𝑔)), where RΜ„πœ€(𝑓) denotes the expected
       number of queries made by the most efficient randomized algorithm computing 𝑓
       with error πœ€. This matches the naive success amplification upper bound and answers
       a conjecture of Blais and Brody (CCC’19).
           As a consequence of our strong direct sum theorem, we give a total function
       𝑔 for which R(XORπ‘˜ β—¦ 𝑔) = Θ(π‘˜ log(π‘˜) Β· R(𝑔)), where R(𝑓) is the number of queries
       made by the most efficient randomized algorithm computing 𝑓 with error 1/3. This
       answers a question from Ben-David et al. (RANDOM’20).


1     Introduction
We show that XOR admits a strong direct sum theorem for randomized query complexity.
Generally, the direct sum problem asks how the cost of computing a partial function 𝑔 scales
with the number π‘˜ of instances of the function that we need to compute simultaneously

ACM Classification: F.1.1, F.1.3
AMS Classification: 68Q09, 68Q10, 68Q17
Key words and phrases: lower bounds, query complexity, direct sum


Β© 2023 Joshua Brody, Jae Tak Kim, Peem Lerdputtipongporn, and Hariharan Srinivasulu
Licensed under a Creative Commons Attribution License (CC-BY)          DOI: 10.4086/toc.2023.v019a011

This is a foundational computational problem that has received considerable
attention [9, 2, 13, 14, 10, 6, 8, 7, 3, 4, 5], including a recent paper by Blais and Brody [7],
which showed that expected query complexity obeys a direct sum theorem in a strong senseβ€”
computing π‘˜ copies of a partial function 𝑔 with overall error πœ€ requires π‘˜ times the cost of
computing 𝑔 on one input with very low (πœ€/π‘˜) error. This matches the naive success amplification
algorithm, which runs an πœ€/π‘˜-error algorithm for 𝑔 once on each of the π‘˜ inputs and applies a union
bound to get an overall error guarantee of πœ€.
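
To make the amplification argument concrete, here is a minimal Python sketch of the naive strategy (our illustration, not code from the paper); g_solver is a hypothetical subroutine that returns a guess for 𝑔 on a single input and is wrong with probability at most the requested error.

    def xor_by_amplification(inputs, g_solver, eps):
        # Naive success amplification for XOR_k ∘ g: solve each of the k
        # instances with error eps/k, then XOR the answers. By a union bound,
        # all answers are simultaneously correct with probability >= 1 - eps,
        # so the XOR below is correct with probability >= 1 - eps.
        k = len(inputs)
        result = 0
        for x in inputs:
            result ^= g_solver(x, eps / k)  # each call errs w.p. <= eps/k
        return result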
     What happens if we do not need to compute 𝑔 on all instances, but only a function 𝑓 β—¦ 𝑔 of
those instances? Clearly the same success amplification trick (compute 𝑔 on each input with low
error, then apply 𝑓 to the answers) works for computing 𝑓 β—¦ 𝑔; however, in principle, computing
 𝑓 β—¦ 𝑔 can be easier than computing each instance of 𝑔 individually. When a function 𝑓 β—¦ 𝑔
requires success amplification for all 𝑔, we say that 𝑓 admits a strong direct sum theorem. Our main
result shows that XOR admits a strong direct sum theorem.

1.1   Query complexity
A query algorithm, also known as a decision tree, computing 𝑓 is an algorithm π’œ that takes an
input π‘₯ to 𝑓, examines (or queries) bits of π‘₯, and outputs an answer for 𝑓(π‘₯). A leaf of π’œ is a bit
string π‘ž ∈ {0, 1}βˆ— representing the answers to the queries made by π’œ on input π‘₯. Let leaf(π’œ, π‘₯)
denote the leaf of π’œ reached on input π‘₯. Naturally, our general goal is to minimize the length of
π‘ž, i. e., minimize the number of queries needed to compute 𝑓.
     A randomized algorithm π’œ computes a function 𝑓 : {0, 1} 𝑛 β†’ {0, 1} with error πœ– β‰₯ 0 if for
every input π‘₯ ∈ {0, 1} 𝑛 , the algorithm outputs the value 𝑓 (π‘₯) with probability at least 1 βˆ’ πœ–. The
query cost of π’œ is the maximum number of bits of π‘₯ that it queries, with the maximum taken
over both the choice of input π‘₯ and the internal randomness of π’œ. The πœ–-error randomized query
complexity of 𝑓 (also known as the randomized decision tree complexity of 𝑓 ) is the minimum query
cost of an algorithm π’œ that computes 𝑓 with error at most πœ–. We denote this complexity by
Rπœ– ( 𝑓 ), and we write R( 𝑓 ) := R 1 ( 𝑓 ) to denote the 13 -error randomized query complexity of 𝑓 .
                                   3
     Another natural measure for the query cost of a randomized algorithm π’œ is the expected
number of coordinates of an input π‘₯ that it queries. Taking the maximum expected number
of coordinates queried by π’œ over all inputs yields the expected query cost of π’œ. The minimum
expected query cost of an algorithm π’œ that computes a function 𝑓 with error at most πœ– is the
πœ–-error expected query complexity of 𝑓 , which we denote by Rπœ– ( 𝑓 ). We again write R( 𝑓 ) := R 1 ( 𝑓 ).
                                                                                                   3
Note that R0 ( 𝑓 ) corresponds to the standard notion of zero-error randomized query complexity of 𝑓 .
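
The gap between the two measures is visible already in a toy experiment. The following Python sketch (our illustration; the promise problem and parameters are arbitrary) counts the queries of a zero-error strategy that scans input bits in random order until it finds a 1: its worst-case query cost is 𝑛, while its expected query cost is small when the input contains many 1s.

    import random

    class CountingOracle:
        # Wraps an input string and counts how many bits the algorithm queries.
        def __init__(self, x):
            self.x = x
            self.queries = 0
        def query(self, i):
            self.queries += 1
            return self.x[i]

    def find_a_one(oracle, n):
        # Zero-error: query positions in random order until a 1 is found
        # (we assume the promise that the input contains at least one 1).
        order = list(range(n))
        random.shuffle(order)
        for i in order:
            if oracle.query(i) == 1:
                return i

    n = 64
    for x in ([0] * (n - 1) + [1], [1] * n):
        costs = []
        for _ in range(10000):
            oracle = CountingOracle(x)
            find_a_one(oracle, n)
            costs.append(oracle.queries)
        # max(costs) is at most n; the mean is about (n + 1)/2 on the first
        # input but 1 on the second.
        print(max(costs), sum(costs) / len(costs))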

1.2   Our results
Our main result is a strong direct sum theorem for XOR.
Theorem 1.1. For every partial function 𝑔 : {0, 1}^𝑛 β†’ {0, 1} and all πœ€ > 0, we have RΜ„πœ€(XORπ‘˜ β—¦ 𝑔) =
Ξ©(π‘˜ Β· RΜ„πœ€/π‘˜(𝑔)).
   This answers Conjecture 1 of Blais and Brody [7] in the affirmative. We prove Theorem 1.1 by
proving an analogous result in distributional query complexity. We also allow our algorithms to

abort with a given probability. Let πœ‡ be a distribution on valid inputs for 𝑓. Let D^πœ‡_{𝛿,πœ€}(𝑓) denote
the minimal query cost of a deterministic query algorithm that aborts with probability at most 𝛿
and errs with probability at most πœ€, where the probability is taken over inputs 𝑋 ∼ πœ‡. Similarly,
let R𝛿,πœ€ ( 𝑓 ) denote the minimal query cost of a randomized algorithm that computes 𝑓 with abort
probability at most 𝛿 and error probability at most πœ€ for each valid input. (Here the probabilities
are taken over the internal randomness of the algorithm.)
    Our main technical result is the following strong direct sum result for XORπ‘˜ β—¦ 𝑔 for
distributional algorithms.
Lemma 1.2 (Main Technical Lemma, informally stated). For every partial function 𝑔 : {0, 1}^𝑛 β†’
{0, 1}, every distribution πœ‡ on the set of valid inputs, and every sufficiently small 𝛿, πœ€ > 0, we have

                                  D^{πœ‡^π‘˜}_{𝛿,πœ€}(XORπ‘˜ β—¦ 𝑔) = Ξ©(π‘˜ Β· D^πœ‡_{𝛿′,πœ€β€²}(𝑔)) ,

for 𝛿′ = Θ(1) and πœ€β€² = Θ(πœ€/π‘˜).
    In [7], Blais and Brody also gave a total function 𝑔 : {0, 1}^𝑛 β†’ {0, 1} whose πœ€-error expected
query complexity satisfies RΜ„πœ€(𝑔) = Ξ©(R(𝑔) Β· log(1/πœ€)). We use our strong XOR Lemma together
with this function to show the following.
Corollary 1.3. There exists a total function 𝑔 : {0, 1}^𝑛 β†’ {0, 1} such that

                                 R(XORπ‘˜ β—¦ 𝑔) = Ξ©(π‘˜ log(π‘˜) Β· R(𝑔)) .

Proof. Let 𝑔 : {0, 1}^𝑛 β†’ {0, 1} be the function guaranteed by [7]. Then, we have

   R(XORπ‘˜ β—¦ 𝑔) β‰₯ RΜ„(XORπ‘˜ β—¦ 𝑔) β‰₯ Ξ©(π‘˜ Β· RΜ„1/3π‘˜(𝑔)) β‰₯ Ξ©(π‘˜ Β· R(𝑔) Β· log(3π‘˜)) = Ξ©(π‘˜ log(π‘˜) Β· R(𝑔)) ,

where the first inequality holds because worst-case query cost dominates expected query cost,
the second inequality is by Theorem 1.1, and the third inequality is from the query complexity
guarantee of 𝑔.                                                                                  β–‘
      This answers Open Question 1 from a recent paper by Ben-David et al. [5].

1.3     Previous and related work
Jain et al. [10] gave direct sum theorems for deterministic and randomized query complexity.
In particular, Jain et al. show Rπœ€(𝑓^π‘˜) β‰₯ 𝛿 Β· π‘˜ Β· Rπœ€+𝛿(𝑓). While their direct sum result holds
for randomized query complexity, the lower bound is in terms of the query complexity of
computing 𝑓 with an increased error of πœ€ + 𝛿. This weakens the right-hand side of their inequality.
Shaltiel [14] gave a function 𝑓 such that D^{πœ‡^π‘˜}_{0,πœ€}(𝑓^π‘˜) β‰ͺ π‘˜ Β· D^πœ‡_{0,πœ€}(𝑓), thus showing that a similar direct
sum theorem fails to hold for distributional complexity.
    Drucker [8] gave a direct product theorem for randomized query complexity, showing that
any algorithm computing 𝑔^π‘˜ using π›Όπ‘˜ R(𝑔) queries for a constant 𝛼 < 1 has success probability
exponentially small in π‘˜. Drucker also gave the following XOR Lemma, showing that any
algorithm for XORπ‘˜ β—¦ 𝑔 that makes β‰ͺ π‘˜ R(𝑔) queries has success probability exponentially close
to 1/2 [8, Theorem 1.3].


Theorem 1.4 (Drucker). Suppose any randomized 𝑇-query algorithm has success probability ≀ 1 βˆ’ πœ€0 in
computing the Boolean function 𝑔 on input π‘₯ ∼ πœ‡ for some input distribution πœ‡. Then, for all 0 < 𝛼 < 1,
any randomized algorithm making π›Όπœ€0π‘‡π‘˜ queries to compute XORπ‘˜ β—¦ 𝑔 on input distribution πœ‡^π‘˜ (π‘˜
inputs drawn independently from πœ‡) has success probability at most (1/2)(1 + [1 βˆ’ 2πœ€0 + 6𝛼 ln(2/𝛼)πœ€0]^π‘˜).

    Drucker’s XOR Lemma applies to worst-case randomized query complexity R(XORπ‘˜ β—¦ 𝑔), while
ours applies to expected randomized query complexity RΜ„(XORπ‘˜ β—¦ 𝑔).
    Note the πœ€0 factor in the query complexity in Drucker’s theorem. When πœ€0 is a constant close
to 1/2, Drucker’s lower bound is stronger than ours by a large constant factor. However, when
πœ€0 = π‘œ(1), his bound degrades significantly. Couched in our notation, Drucker’s XOR Lemma
yields Rπœ€(XORπ‘˜ β—¦ 𝑔) = Ξ©(πœ€0 π‘˜ Rπœ€0(𝑔)), for some πœ€0 = 𝑂(πœ€/π‘˜). This simplifies to Rπœ€(XORπ‘˜ β—¦ 𝑔) =
Ξ©(πœ€ Rπœ€/π‘˜(𝑔)), a loss of a factor of π‘˜ compared to our bound.
    As far as we know, it remains open whether this πœ€0 factor is needed in the query complexity
lower bound of Drucker’s XOR Lemma. However, Shaltiel’s counterexample [14] shows that the
πœ€0 factor is required for distributional query complexity. This rules out the most direct approach
for proving a tighter XOR Lemma for R(XOR π‘˜ β—¦ 𝑔).
    Our paper is most closely related to that of Blais and Brody [7], who give a strong direct sum
theorem for the expected query complexity of computing π‘˜ copies of 𝑓 in parallel, for any partial
function 𝑓 , and explicitly conjecture that XOR admits a strong direct sum theorem. Both [7] and
our paper use techniques similar to work of Molinaro et al. [11, 12] who give strong direct sum
theorems for communication complexity.
    Our strong direct sum theorem for XOR is an example of a composition theoremβ€”a lower bound
on the query complexity of functions of the form 𝑓 β—¦ 𝑔. Several recent articles study composition
theorems in query complexity. Bassilakis et al. [1] show that R( 𝑓 β—¦ 𝑔) = Ξ©(fbs( 𝑓 ) R(𝑔)), where
fbs( 𝑓 ) is the fractional block sensitivity of 𝑓 . Ben-David and Blais [3, 4] give a tight lower bound
on R( 𝑓 β—¦ 𝑔) as a product of R(𝑔) and a new measure they define called noisyR( 𝑓 ), which
measures the complexity of computing 𝑓 on noisy inputs. They also characterize noisyR( 𝑓 ) in
terms of the gap-majority function. Ben-David et al. [5] explicitly consider strong direct sum
theorems for composed functions in randomized query complexity, asking whether the naive
success amplification algorithm is necessary to compute 𝑓 β—¦ 𝑔. They give a partial strong direct
sum theorem, showing that there exists a partial function 𝑔 such that computing XORπ‘˜ β—¦ 𝑔
requires success amplification, even in a model where the abort probability may be arbitrarily
close to 1.ΒΉ Ben-David et al. explicitly ask whether there exists a total function 𝑔 such that
R(XORπ‘˜ β—¦ 𝑔) = Ξ©(π‘˜ log(π‘˜) R(𝑔)).

    ΒΉIn this query complexity model, called PostBPP, the query algorithm is allowed to abort with any probability
strictly less than 1. When it does not abort, it must output 𝑓(π‘₯) with probability at least 1 βˆ’ πœ€.

1.4    Our technique
Our technique most closely follows the strong direct sum theorem of Blais and Brody. We start
with a query algorithm that computes XORπ‘˜ β—¦ 𝑔 and use it to build a query algorithm for computing
𝑔 with low error. To do this, we will take an input for 𝑔 and embed it into an input for XORπ‘˜ β—¦ 𝑔.
Given π‘₯ ∈ {0, 1}^𝑛, 𝑖 ∈ [π‘˜], and 𝑦 ∈ {0, 1}^{π‘›Γ—π‘˜}, let 𝑦^(𝑖←π‘₯) := (𝑦^(1), . . . , 𝑦^(π‘–βˆ’1), π‘₯, 𝑦^(𝑖+1), . . . , 𝑦^(π‘˜)) denote
the input obtained from 𝑦 by replacing the 𝑖-th coordinate 𝑦^(𝑖) with π‘₯. Note that if π‘₯ ∼ πœ‡ and
𝑦 ∼ πœ‡^π‘˜,Β² then 𝑦^(𝑖←π‘₯) ∼ πœ‡^π‘˜ for all 𝑖 ∈ [π‘˜].
   We require the following observation [8, Lemma 3.2].

Lemma 1.5 (Drucker). Let 𝑦 ∼ πœ‡ π‘˜ be an input for a query algorithm π’œ, and consider any execution of
queries by π’œ. The distribution of coordinates of 𝑦, conditioned on the queries made by π’œ, remains a
product distribution.
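
Lemma 1.5 can be checked directly on a small example. The following Python snippet (a toy illustration of ours, not code from [8]) conditions a two-coordinate product distribution on a query answer and verifies that the coordinates remain independent.

    # Each coordinate is a 2-bit string drawn from the same distribution mu.
    mu = {(0, 0): 0.1, (0, 1): 0.2, (1, 0): 0.3, (1, 1): 0.4}
    joint = {(u, v): mu[u] * mu[v] for u in mu for v in mu}
    # Condition on a query answer: "the first bit of coordinate 1 equals 0".
    cond = {yv: p for yv, p in joint.items() if yv[0][0] == 0}
    total = sum(cond.values())
    cond = {yv: p / total for yv, p in cond.items()}
    # Marginals of the two coordinates under the conditioned distribution.
    m1, m2 = {}, {}
    for (u, v), p in cond.items():
        m1[u] = m1.get(u, 0.0) + p
        m2[v] = m2.get(v, 0.0) + p
    for (u, v), p in cond.items():
        assert abs(p - m1[u] * m2[v]) < 1e-12  # still a product distribution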

     In particular, the values 𝑔(𝑦^(𝑖)) remain independent bits conditioned on any set of queries
made by the query algorithm. Our first observation is that in order to compute XORπ‘˜ β—¦ 𝑔(𝑦)
with high probability, we must be able to compute 𝑔(𝑦^(𝑖)) with very high probability for many
𝑖’s. The intuition behind this observation is captured by the following simple fact about the
XOR of independent random bits.
     Define the bias of a random bit 𝑋 ∈ {0, 1} as π‘Ÿ(𝑋) := max_{π‘βˆˆ{0,1}} Pr[𝑋 = 𝑏]. Define the
advantage of 𝑋 as adv(𝑋) := 2π‘Ÿ(𝑋) βˆ’ 1. Note that when adv(𝑋) = 𝛿, then π‘Ÿ(𝑋) = (1/2)(1 + 𝛿).

Fact 1.6. Let 𝑋1, . . . , π‘‹π‘˜ be independent random bits, and let π‘Žπ‘– be the advantage of 𝑋𝑖. Then,

                                  adv(𝑋1 βŠ• Β· Β· Β· βŠ• π‘‹π‘˜) = ∏_{𝑖=1}^{π‘˜} adv(𝑋𝑖) .

Proof. For each 𝑖, let 𝑏𝑖 := argmax_{π‘βˆˆ{0,1}} Pr[𝑋𝑖 = 𝑏] and 𝛿𝑖 := adv(𝑋𝑖). Then Pr[𝑋𝑖 = 𝑏𝑖] =
(1/2)(1 + 𝛿𝑖). We prove Fact 1.6 by induction on π‘˜. When π‘˜ = 1, there is nothing to prove. For
π‘˜ = 2, note that

               Pr[𝑋1 βŠ• 𝑋2 = 𝑏1 βŠ• 𝑏2] = (1/2)(1 + 𝛿1) Β· (1/2)(1 + 𝛿2) + (1/2)(1 βˆ’ 𝛿1) Β· (1/2)(1 βˆ’ 𝛿2)
                                      = (1/4)(1 + 𝛿1 + 𝛿2 + 𝛿1𝛿2) + (1/4)(1 βˆ’ 𝛿1 βˆ’ 𝛿2 + 𝛿1𝛿2)
                                      = (1/2)(1 + 𝛿1𝛿2) .

Hence 𝑋1 βŠ• 𝑋2 has advantage 𝛿1𝛿2 and the claim holds for π‘˜ = 2. For an induction hypothesis,
suppose that the claim holds for 𝑋1 βŠ• Β· Β· Β· βŠ• π‘‹π‘˜βˆ’1. Then, setting π‘Œ := 𝑋1 βŠ• Β· Β· Β· βŠ• π‘‹π‘˜βˆ’1, by the
induction hypothesis we have adv(π‘Œ) = ∏_{𝑖=1}^{π‘˜βˆ’1} adv(𝑋𝑖). Moreover, 𝑋1 βŠ• Β· Β· Β· βŠ• π‘‹π‘˜ = π‘Œ βŠ• π‘‹π‘˜, and

               adv(𝑋1 βŠ• Β· Β· Β· βŠ• π‘‹π‘˜) = adv(π‘Œ βŠ• π‘‹π‘˜) = adv(π‘Œ) adv(π‘‹π‘˜) = ∏_{𝑖=1}^{π‘˜} adv(𝑋𝑖) .            β–‘
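
Fact 1.6 is also easy to verify by brute-force enumeration; the following Python check (our illustration) compares the exact advantage of the XOR against the product of the individual advantages for random biases.

    import itertools, random

    def adv(p):
        # Advantage of a bit equal to 1 with prob. p: 2 max(p, 1-p) - 1 = |2p - 1|.
        return abs(2 * p - 1)

    k = 4
    probs = [random.random() for _ in range(k)]  # Pr[X_i = 1]
    # Exact probability that X_1 xor ... xor X_k = 1, by enumerating outcomes.
    p_xor = 0.0
    for bits in itertools.product((0, 1), repeat=k):
        pr = 1.0
        for b, p in zip(bits, probs):
            pr *= p if b else 1 - p
        if sum(bits) % 2 == 1:
            p_xor += pr
    rhs = 1.0
    for p in probs:
        rhs *= adv(p)
    assert abs(adv(p_xor) - rhs) < 1e-9  # matches Fact 1.6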

    Given an algorithm for XORπ‘˜ β—¦ 𝑔 that has error πœ€, it follows that for typical leaves the
advantage of computing XORπ‘˜ β—¦ 𝑔 is ≳ 1 βˆ’ 2πœ€. Fact 1.6 shows that for such leaves, the advantage
of computing 𝑔(𝑦^(𝑖)) for most coordinates 𝑖 is ≳ (1 βˆ’ 2πœ€)^{1/π‘˜} = 1 βˆ’ Θ(πœ€/π‘˜). Thus, conditioned on
reaching this leaf of the query algorithm, we could compute 𝑔(𝑦^(𝑖)) with very high probability.
We would like to fix a coordinate 𝑖* such that for most leaves, our advantage in computing 𝑔
on coordinate 𝑖* is 1 βˆ’ 𝑂(πœ€/π‘˜). There are other complications, namely that (i) our construction
needs to handle aborts gracefully and (ii) our construction must ensure that the algorithm for
XORπ‘˜ β—¦ 𝑔 does not query the 𝑖*-th coordinate too many times. Our construction identifies a
coordinate 𝑖* and a string 𝑧 ∈ {0, 1}^{π‘›Γ—π‘˜}, and on input π‘₯ ∈ {0, 1}^𝑛 it emulates a query algorithm
for XORπ‘˜ β—¦ 𝑔 on input 𝑧^(𝑖*←π‘₯) and outputs our best guess for 𝑔(π‘₯) (which is now 𝑔 evaluated on
coordinate 𝑖* of 𝑧^(𝑖*←π‘₯)), aborting when needed, e. g., when the algorithm for XORπ‘˜ β—¦ 𝑔 aborts or
when it queries too many bits of π‘₯. We defer full details of the proof to Section 2.

    Β²We use πœ‡^π‘˜ to denote the distribution on π‘˜-tuples of {0, 1}^𝑛 obtained by sampling each coordinate
independently according to πœ‡.

1.5    Preliminaries and notation
A partial Boolean function on the domain {0, 1} 𝑛 is a function 𝑓 : 𝑆 β†’ {0, 1} for some subset
𝑆 βŠ† {0, 1} 𝑛 . Call 𝑆 the set of valid inputs for 𝑓 . Let 𝑓 be a partial Boolean function on {0, 1} 𝑛
and πœ‡ a distribution whose support is a subset of the valid inputs. We use [𝑛] to denote the
set {1, . . . , 𝑛} and 𝑋 βˆˆπ‘… 𝑆 to denote an element 𝑋 sampled uniformly from a set 𝑆. Let πœ‡^π‘˜
denote the distribution on π‘˜-tuples of {0, 1}^𝑛 obtained by sampling each coordinate
independently according to πœ‡.
    An algorithm π’œ is a [π‘ž, 𝛿, πœ€, πœ‡]-distributional query algorithm for 𝑓 if π’œ is a deterministic
algorithm with query cost π‘ž that computes 𝑓 with error probability at most πœ€ and abort
probability at most 𝛿 when the input π‘₯ is drawn from πœ‡.Β³
    Our main theorem is a direct sum result for XOR π‘˜ β—¦ 𝑔 for expected randomized query
complexity; however, Lemma 1.2 uses distributional query complexity with aborts. To translate
between the two, we need two results from Blais and Brody [7] that connect the query complexities
in the randomized, expected randomized, and distributional query models.

Fact 1.7 ([7], Proposition 14). For every partial function 𝑓 : {0, 1}^𝑛 β†’ {0, 1}, every 0 ≀ πœ– < 1/2,
and every 0 < 𝛿 < 1,

                       𝛿 Β· R𝛿,πœ–(𝑓) ≀ RΜ„πœ–(𝑓) ≀ (1/(1 βˆ’ 𝛿)) Β· R𝛿,(1βˆ’π›Ώ)πœ–(𝑓) .

   Fact 1.7 shows that when 𝛿 = Ξ©(1), to achieve a lower bound for RΜ„πœ€(𝑓), it suffices to lower
bound R𝛿,πœ€(𝑓). Next, we need the following generalization of Yao’s minimax lemma, which
connects randomized and distributional query complexity in the presence of aborts.

Fact 1.8 ([7], Lemma 15). For any 𝛼, 𝛽 > 0 such that 𝛼 + 𝛽 ≀ 1, we have

                        max_πœ‡ D^πœ‡_{𝛿/𝛼,πœ€/𝛽}(𝑓) ≀ R𝛿,πœ€(𝑓) ≀ max_πœ‡ D^πœ‡_{𝛼𝛿,π›½πœ€}(𝑓) .

   For simplicity, it might be helpful to consider the simplest case where 𝛼 = 𝛽 = 1/2. In this case,
we recover max_πœ‡ D^πœ‡_{2𝛿,2πœ€}(𝑓) ≀ R𝛿,πœ€(𝑓) ≀ max_πœ‡ D^πœ‡_{𝛿/2,πœ€/2}(𝑓). Fact 1.8 shows that to prove a lower
bound on R𝛿,πœ–(𝑓), it suffices to prove a lower bound on distributional complexity (albeit with a
constant factor increase in abort and error probabilities).

    Β³Note: in the literature, the error probability is sometimes defined as being conditioned on not aborting (e. g., [5]).
We define the error probability without conditioning to match the article [7] most closely related to our work.
   We will also use the following convenient facts about expected value.

Fact 1.9 (Law of Conditional Expectations). Let 𝑋 and π‘Œ be random variables. Then, we have

                                             E[𝑋] = E[E[𝑋 |π‘Œ]] .

Fact 1.10 (Markov Inequality for Bounded Variables). Let 𝑋 be a real-valued random variable with
0 ≀ 𝑋 ≀ 1. Suppose that E[𝑋] β‰₯ 1 βˆ’ πœ€. Then, for any 𝑇 > 1 it holds that

                                            Pr[𝑋 < 1 βˆ’ π‘‡πœ€] < 1/𝑇 .

Proof. Let π‘Œ := 1 βˆ’ 𝑋. Then, E[π‘Œ] ≀ πœ€. By Markov’s Inequality we have

                                                                       1
                                 Pr[𝑋 < 1 βˆ’ π‘‡πœ€] = Pr[π‘Œ > π‘‡πœ€] ≀           .                          
                                                                       𝑇
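
A quick empirical sanity check of Fact 1.10 in Python (our illustration; the distribution and the choices of πœ€ and 𝑇 are arbitrary):

    import random

    eps, T, trials = 0.01, 5.0, 200000
    below = 0
    for _ in range(trials):
        # X = 1 with probability 1 - eps and X = 0 otherwise, so E[X] = 1 - eps.
        x = 1.0 if random.random() < 1 - eps else 0.0
        if x < 1 - T * eps:
            below += 1
    print(below / trials, "vs. bound", 1 / T)  # about 0.01, well below 0.2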

2   Strong XOR lemma
In this section, we prove our main result.

Lemma 2.1 (Formal Restatement of Lemma 1.2). For every partial function 𝑔 : {0, 1}^𝑛 β†’ {0, 1},
every distribution πœ‡ on {0, 1}^𝑛, every 0 ≀ 𝛿 ≀ 1/5, and every 0 < πœ€ ≀ 1/200, we have

                                 D^{πœ‡^π‘˜}_{𝛿,πœ€}(XORπ‘˜ β—¦ 𝑔) β‰₯ (π‘˜/25) Β· D^πœ‡_{𝛿′,πœ€β€²}(𝑔) ,

where 𝛿′ = 0.36 + 3𝛿 and πœ€β€² = 15000πœ€/π‘˜.

                  πœ‡π‘˜
Proof. Let π‘ž := D𝛿,πœ€ (XOR π‘˜ ◦𝑔), and suppose that π’œ is a [π‘ž, 𝛿, πœ€, πœ‡ π‘˜ ]-distributional query algorithm
for XOR π‘˜ β—¦ 𝑔. Our goal is to construct an [𝑂(π‘ž/π‘˜), 𝛿0 , πœ€0 , πœ‡]-distributional query algorithm for 𝑔.
Towards that end, for each leaf β„“ of π’œ define

                        𝑏ℓ := argmax_{π‘βˆˆ{0,1}} Pr_{π‘₯βˆΌπœ‡^π‘˜}[XORπ‘˜ β—¦ 𝑔(π‘₯) = 𝑏 | leaf(π’œ, π‘₯) = β„“]
                        π‘Ÿβ„“ := Pr_{π‘₯βˆΌπœ‡^π‘˜}[XORπ‘˜ β—¦ 𝑔(π‘₯) = 𝑏ℓ | leaf(π’œ, π‘₯) = β„“]
                        π‘Žβ„“ := 2π‘Ÿβ„“ βˆ’ 1 .

Call π‘Žβ„“ the advantage of π’œ on leaf β„“ .
    The purpose of π’œ is to compute XOR π‘˜ β—¦ 𝑔; however, we will show that π’œ must additionally
be able to compute 𝑔 reasonably well on many coordinates of π‘₯. For any 𝑖 ∈ [π‘˜] and any leaf β„“ ,

                       T HEORY OF C OMPUTING, Volume 19 (11), 2023, pp. 1–14                         7
      J OSHUA B RODY, JAE TAK K IM , P EEM L ERDPUTTIPONGPORN , AND H ARIHARAN S RINIVASULU

define

                           𝑏 𝑖,β„“ := argmax Pr [𝑏 = 𝑔(π‘₯ (𝑖) )| leaf(π’œ, π‘₯) = β„“ ]
                                                   π‘˜
                                       π‘βˆˆ{0,1} π‘₯βˆΌπœ‡

                           π‘Ÿ 𝑖,β„“ := Pr [𝑏 𝑖,β„“ = 𝑔(π‘₯ (𝑖) )| leaf(π’œ, π‘₯) = β„“ ]
                                      π‘₯βˆΌπœ‡ π‘˜

                           π‘Ž 𝑖,β„“ := 2π‘Ÿ 𝑖,β„“ βˆ’ 1 .

    If π’œ reaches leaf β„“ on input 𝑦, then write π’œ(𝑦)𝑖 := 𝑏 𝑖,β„“ . π’œ(𝑦)𝑖 represents π’œβ€™s best guess for
𝑔(𝑦 (𝑖) ).
    Next, we define some structural characteristics of leaves that we will need to complete the
proof.
Definition 2.2 (Good leaves, good coordinates).

   β€’ Call a leaf β„“ good if π‘Ÿβ„“ β‰₯ 1 βˆ’ 50πœ€. Otherwise, call β„“ bad.

   β€’ Call a leaf β„“ good for 𝑖 if π‘Žπ‘–,β„“ β‰₯ 1 βˆ’ 5000πœ€/π‘˜. Otherwise, call β„“ bad for 𝑖.

    When a leaf is good for 𝑖, then π’œ, conditioned on reaching this leaf, computes 𝑔(π‘₯^(𝑖)) with
very high probability. Before presenting the main reduction, we give a few simple claims to
help our proof. Our first claim shows that we reach a good leaf with high probability.
Claim 2.3. Pr_{π‘₯βˆΌπœ‡^π‘˜}[leaf(π’œ, π‘₯) is bad | π’œ(π‘₯) doesn’t abort] ≀ 1/25.

Proof. Conditioned on π’œ not aborting, it outputs the correct value of XOR π‘˜ β—¦ 𝑔 with probability
              πœ€
at least 1 βˆ’ 1βˆ’π›Ώ β‰₯ 1 βˆ’ 2πœ€. We analyze this error probability by conditioning on which leaf is
reached. Let 𝜈 be the distribution on leaf(π’œ, π‘₯) when π‘₯ ∼ πœ‡ π‘˜ , conditioned on π’œ not aborting.
Let 𝐿 ∼ 𝜈. Then, we have:

                     1 βˆ’ 2πœ€ ≀ Pr [π’œ(π‘₯) = XOR π‘˜ β—¦ 𝑔(π‘₯)|π’œ doesn’t abort]
                                 π‘₯βˆΌπœ‡ π‘˜
                                 Γ•
                             =            Pr [𝐿 = β„“ ] Β· Pr[π’œ(π‘₯) = XOR π‘˜ β—¦ 𝑔(π‘₯)|𝐿 = β„“ ]
                                          𝐿∼𝜈
                                 leaf β„“
                                 Γ•
                             =         Pr[𝐿 = β„“ ] Β· π‘Ÿβ„“
                                  β„“
                             = E[π‘ŸπΏ ] .
                                 𝐿

   Thus, E[π‘ŸπΏ ] β‰₯ 1 βˆ’ 2πœ€. Recalling that β„“ is good if π‘Ÿβ„“ β‰₯ 1 βˆ’ 50πœ€ and using Fact 1.10, 𝐿 is bad
                          1
with probability at most 25 .                                                                  
   Next, we claim that each good leaf is good for many 𝑖.
Claim 2.4. Let β„“ be any good leaf, and let 𝐼 be uniform on [π‘˜]. Then, we have:

                                          Pr_𝐼[β„“ is bad for 𝐼] ≀ 1/25 .


Proof. Fix a good leaf β„“, and let 𝛽ℓ := Pr_𝐼[β„“ is bad for 𝐼]. Recall that if β„“ is good, then π‘Ÿβ„“ β‰₯ 1 βˆ’ 50πœ€.
Therefore, π‘Žβ„“ β‰₯ 1 βˆ’ 100πœ€. Using 1 + π‘₯ ≀ 𝑒^π‘₯ and 𝑒^{βˆ’2π‘₯} ≀ 1 βˆ’ π‘₯ (which holds for all 0 ≀ π‘₯ ≀ 1/2),
we have for any good leaf β„“

               1 βˆ’ 100πœ€ ≀ π‘Žβ„“ = ∏_{𝑖=1}^{π‘˜} π‘Žπ‘–,β„“ ≀ (1 βˆ’ 5000πœ€/π‘˜)^{π‘˜π›½β„“} ≀ 𝑒^{βˆ’5000πœ€Β·π›½β„“} ≀ 1 βˆ’ 2500πœ€π›½β„“ .

Rearranging terms, we see that 𝛽ℓ ≀ 1/25.                                                       β–‘
   Next, we describe a randomized algorithm π’œβ€² for 𝑔 whose expected query cost, abort
probability, and error probability match the guarantees we want to provide when the input
π‘₯ ∼ πœ‡. We will complete the proof of Lemma 2.1 by fixing the randomness used in π’œβ€². Our
algorithm works by independently sampling 𝑧 ∼ πœ‡^π‘˜ and 𝑖 uniformly from [π‘˜], embedding π‘₯ in
the 𝑖-th coordinate of 𝑧, and emulating π’œ on the resulting string.

Algorithm 1 π’œ 0(π‘₯)
 1: Independently sample 𝐼 uniformly from [π‘˜] and 𝑧 ∼ πœ‡ π‘˜ .
 2: 𝑦 ← 𝑧 (𝐼←π‘₯)
 3: Emulate algorithm π’œ on input 𝑦.
 4: Abort

      (i) if π’œ aborts,
     (ii) if π’œ reaches a bad leaf, or
    (iii) if π’œ reaches a leaf that is bad for 𝐼.

                                         π‘˜ bits of π‘₯,
                                        25π‘ž
    (iv) if π’œ queries more than
 5: Otherwise, output π’œ(𝑦).
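
The reduction is mechanical enough to sketch in Python (our illustration, not the authors’ code). Here A stands in for the decision tree π’œ as a callable that consumes a bit oracle and reports whether it aborted and the leaf it reached, while is_bad, is_bad_for, and guess_for are hypothetical helpers encoding the leaf classification from Definition 2.2.

    import random

    def a_prime(x, A, sample_mu_k, k, q, is_bad, is_bad_for, guess_for):
        # Algorithm 1: use an algorithm A for XOR_k ∘ g to compute g(x).
        I = random.randrange(k)      # coordinate in which to hide x
        y = sample_mu_k()            # z ~ mu^k, as a list of k n-bit strings
        y[I] = x                     # y = z^(I←x); y ~ mu^k when x ~ mu
        budget = 25 * q // k         # query budget for the hidden coordinate
        count = {"x": 0}

        def oracle(i, j):
            # A asks for bit j of coordinate i; only coordinate I touches x.
            if i == I:
                count["x"] += 1
            return y[i][j]

        aborted, leaf = A(oracle)
        # (A faithful implementation would abort the moment the budget is
        # exceeded; for brevity we check after the emulation finishes.)
        if aborted or is_bad(leaf) or is_bad_for(leaf, I) or count["x"] > budget:
            return None              # abort
        return guess_for(leaf, I)    # b_{I,leaf}: the best guess for g(x)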


    Note that the emulation is possible since whenever π’œ queries the 𝑗-th bit of 𝑦^(𝐼), we can
query π‘₯𝑗, and we can emulate π’œ querying a bit of 𝑦^(𝑖) for 𝑖 β‰  𝐼 directly since 𝑧 is fixed. We claim
that (i) π’œβ€² makes at most 25π‘ž/π‘˜ queries, (ii) π’œβ€² aborts with probability at most 𝛿 + 0.12, and
(iii) π’œβ€² errs with probability at most 5000πœ€/π‘˜.
    First, note that π’œβ€² makes at most 25π‘ž/π‘˜ queries, since it aborts instead of making more queries.

    Second, consider the abort probability of π’œβ€². Our algorithm aborts if π’œ aborts, if we reach
a bad leaf, if the leaf we reach is bad for 𝐼, or if π’œ queries more than 25π‘ž/π‘˜ bits of 𝑦^(𝐼). Let β„°1
be the event that π’œ aborts on input 𝑦. Similarly, let β„°2, β„°3, β„°4 be the events that π’œ reaches a
bad leaf, π’œ reaches a leaf that is bad for 𝐼, and π’œ queries more than 25π‘ž/π‘˜ bits of π‘₯, respectively.
Since π‘₯ ∼ πœ‡, 𝑧 ∼ πœ‡^π‘˜, and 𝐼 is uniform on [π‘˜], it follows that 𝑦 ∼ πœ‡^π‘˜. By the abort guarantees
of π’œ, we have Pr[β„°1] ≀ 𝛿. By Claim 2.3 we have Pr[β„°2 | Β¬β„°1] ≀ 1/25, and by Claim 2.4 we have
Pr[β„°3 | Β¬β„°1, Β¬β„°2] ≀ 1/25. Thus, we have Pr[β„°1 ∨ β„°2 ∨ β„°3] ≀ 𝛿 + 2/25.
    Next, for each 𝑖 ∈ [π‘˜], let π‘ž 𝑖 (𝑦) denote the number of queries that π’œ makes to 𝑦 (𝑖) on
input 𝑦. The query cost of π’œ guarantees that for each input 𝑦, 1β‰€π‘–β‰€π‘˜ π‘ž 𝑖 (𝑦) ≀ π‘ž. Therefore,
                                                                     Í


                         T HEORY OF C OMPUTING, Volume 19 (11), 2023, pp. 1–14                           9
      J OSHUA B RODY, JAE TAK K IM , P EEM L ERDPUTTIPONGPORN , AND H ARIHARAN S RINIVASULU

                      π‘˜
for any 𝑦, at most 25   indices 𝑖 ∈ [π‘˜] satisfy π‘ž 𝑖 (𝑦) β‰₯ π‘˜ . Hence, for 𝐼 βˆˆπ‘… [π‘˜], π‘₯ ∼ πœ‡, and
                                                                                      25π‘ž

        π‘˜
𝑧 ∼ πœ‡ , and recalling that 𝑦 = 𝑧 (𝐼←π‘₯) , we have: Pr[β„°4 ] ≀ 25    1
                                                                    . By a union bound, we have
Pr𝐼,𝑧,π‘₯ [π’œ aborts on input 𝑦] = Pr[β„°1 ∨ β„°2 ∨ β„°3 ∨ β„°4 ] ≀ 𝛿 + 25 = 𝛿 + 0.12.
          0                                                     3

    Third, we analyze the error probability of π’œβ€². This algorithm errs only when it reaches a leaf
that is good for 𝐼. For such a leaf, π‘ŸπΌ,β„“ = (1 + π‘ŽπΌ,β„“)/2 β‰₯ 1 βˆ’ 5000πœ€/π‘˜ by Definition 2.2, so we are
correct with probability at least 1 βˆ’ 5000πœ€/π‘˜. Thus, we have Pr[π’œβ€² errs] ≀ 5000πœ€/π‘˜.
    Letting 𝑋 be the indicator variable for the event that π’œβ€² aborts and π‘Œ = (𝐼, 𝑧), Fact 1.9 gives

           Pr[π’œβ€² aborts] = E[𝑋] = E_{𝐼,𝑧}[E[𝑋 | 𝐼, 𝑧]] = E_{𝐼,𝑧}[Pr_π‘₯[π’œβ€² aborts]] .

Thus algortihm π’œ 0 is a randomized algorithm that, when given an input π‘₯ ∼ πœ‡, makes at most
25π‘ž
 π‘˜ queries and has the following guarantees:


                                 E [Pr[π’œ 0 aborts]] = Pr [π’œ 0 aborts] ≀ 𝛿 + 0.12, and
                                 𝐼,𝑧 π‘₯                                    𝐼,π‘₯,𝑧
                                                                                                      5000πœ€
                  E [Pr[π’œ 0(𝑦)(𝐼) β‰  𝑔(π‘₯)]] = Pr [π’œ 0(𝑦)(𝐼) β‰  𝑔(π‘₯)] ≀                                        .
                  𝐼,𝑧 π‘₯                                       𝐼,π‘₯,𝑧                                     π‘˜
By Markov’s inequality and a union bound, there must be a setting of (𝑖*, 𝑧*) such that
Pr_π‘₯[π’œβ€² aborts] ≀ 3𝛿 + 0.36 and Pr_π‘₯[π’œ(𝑦)𝑖* β‰  𝑔(π‘₯)] ≀ 15000πœ€/π‘˜. Let π’œβ€³ be a deterministic
algorithm that takes an input π‘₯ ∼ πœ‡ and emulates algorithm π’œβ€² with 𝑖* and 𝑧* in place of the
randomly sampled 𝐼, 𝑧. This algorithm queries at most 25π‘ž/π‘˜ bits, aborts with probability at most
3𝛿 + 0.36, and errs with probability at most 15000πœ€/π‘˜. Thus, it is a [𝑂(π‘ž/π‘˜), 3𝛿 + 0.36, 15000πœ€/π‘˜, πœ‡]-
distributional algorithm for 𝑔, as required.                                                      β–‘

2.1   Proof of Theorem 1.1
Proof of Theorem 1.1. Define πœ€β€² := 30000πœ€. Let πœ‡ be the input distribution for 𝑔 achieving
max_πœ‡ D^πœ‡_{1/2,πœ€β€²/π‘˜}(𝑔), and let πœ‡^π‘˜ be the π‘˜-fold product distribution of πœ‡. By the first inequality of
Fact 1.7 and the first inequality of Fact 1.8, we have

                   RΜ„πœ€(XORπ‘˜ β—¦ 𝑔) β‰₯ (1/50) Β· R1/50,πœ€(XORπ‘˜ β—¦ 𝑔) β‰₯ (1/50) Β· D^{πœ‡^π‘˜}_{1/25,2πœ€}(XORπ‘˜ β—¦ 𝑔) .

Additionally, by Lemma 2.1 and the second inequalities of Fact 1.7 and Fact 1.8, we have

      D^{πœ‡^π‘˜}_{1/25,2πœ€}(XORπ‘˜ β—¦ 𝑔) β‰₯ (π‘˜/120) Β· D^πœ‡_{1/2,πœ€β€²/π‘˜}(𝑔) β‰₯ (π‘˜/120) Β· R2/3,4πœ€β€²/π‘˜(𝑔) β‰₯ (π‘˜/360) Β· RΜ„12πœ€β€²/π‘˜(𝑔) .

Thus, we have RΜ„πœ€(XORπ‘˜ β—¦ 𝑔) = Ξ©(D^{πœ‡^π‘˜}_{1/25,2πœ€}(XORπ‘˜ β—¦ 𝑔)) and D^{πœ‡^π‘˜}_{1/25,2πœ€}(XORπ‘˜ β—¦ 𝑔) = Ξ©(π‘˜ Β· RΜ„12πœ€β€²/π‘˜(𝑔)). By
standard success amplification, RΜ„12πœ€β€²/π‘˜(𝑔) = Θ(RΜ„πœ€/π‘˜(𝑔)). Putting these together yields

      RΜ„πœ€(XORπ‘˜ β—¦ 𝑔) = Ξ©(D^{πœ‡^π‘˜}_{1/25,2πœ€}(XORπ‘˜ β—¦ 𝑔)) = Ξ©(π‘˜ Β· RΜ„12πœ€β€²/π‘˜(𝑔)) = Ξ©(π‘˜ Β· RΜ„πœ€/π‘˜(𝑔)) ,




                                 
hence Rπœ€ (XOR π‘˜ β—¦ 𝑔) = Ξ© π‘˜ R πœ€π‘˜ (𝑔) completing the proof.                                       



References
 [1] Andrew Bassilakis, Andrew Drucker, Mika GΓΆΓΆs, Lunjia Hu, Weiyun Ma, and Li-Yang Tan:
     The power of many samples in query complexity. In Proc. 47th Internat. Colloq. on Automata,
     Languages, and Programming (ICALP’20), pp. 9:1–18. Schloss Dagstuhl–Leibniz-Zentrum fuer
     Informatik, 2020. [doi:10.4230/LIPIcs.ICALP.2020.9, arXiv:2002.10654, ECCC:TR20-027] 4

 [2] Yosi Ben-Asher and Ilan Newman: Decision trees with boolean threshold queries. J. Comput.
     System Sci., 51(3):495–502, 1995. Preliminary version in CCC’95. [doi:10.1006/jcss.1995.1085]
     2

 [3] Shalev Ben-David and Eric Blais: A new minimax theorem for randomized algorithms. In
     Proc. 61st FOCS, pp. 403–411. IEEE Comp. Soc., 2020. [doi:10.1109/FOCS46700.2020.00045]
     2, 4

 [4] Shalev Ben-David and Eric Blais: A tight composition theorem for the randomized query
     complexity of partial functions. In Proc. 61st FOCS, pp. 240–246. IEEE Comp. Soc., 2020.
     [doi:10.1109/FOCS46700.2020.00031, arXiv:2002.10809] 2, 4

 [5] Shalev Ben-David, Mika GΓΆΓΆs, Robin Kothari, and Thomas Watson: When is amplification
     necessary for composition in randomized query complexity? In Proc. 24th Internat.
     Conf. on Randomization and Computation (RANDOM’20), pp. 28:1–16. Schloss Dagstuhl–
     Leibniz-Zentrum fuer Informatik, 2020. [doi:10.4230/LIPICS.APPROX/RANDOM.2020.28,
     arXiv:2006.10957] 2, 3, 4, 6

 [6] Shalev Ben-David and Robin Kothari: Randomized query complexity of sabotaged and
     composed functions. Theory of Computing, 14(5):1–27, 2018. Preliminary version in ICALP’16.
     [doi:10.4086/toc.2018.v014a005, arXiv:1605.09071, ECCC:TR16-087] 2

 [7] Eric Blais and Joshua Brody: Optimal separation and strong direct sum for random-
     ized query complexity. In Proc. 34th Comput. Complexity Conf. (CCC’19), pp. 29:1–17.
     Schloss Dagstuhl–Leibniz-Zentrum fuer Informatik, 2019. [doi:10.4230/LIPIcs.CCC.2019.29,
     arXiv:1908.01020] 2, 3, 4, 6

 [8] Andrew Drucker: Improved direct product theorems for randomized query com-
     plexity. Comput. Complexity, 21(2):197–244, 2012. Preliminary version in CCC’11.
     [doi:10.1007/s00037-012-0043-7, arXiv:1005.0644, ECCC:TR10-080] 2, 3, 5

 [9] Russell Impagliazzo, Ran Raz, and Avi Wigderson: A direct product theorem. In Proc.
     9th IEEE Conf. Structure in Complexity Theory (SCT’94), pp. 88–96. IEEE Comp. Soc., 1994.
     [doi:10.1109/SCT.1994.315814] 2


[10] Rahul Jain, Hartmut Klauck, and Miklos Santha: Optimal direct sum results for
     deterministic and randomized decision tree complexity. Inform. Process. Lett., 110(20):893–
     897, 2010. [doi:10.1016/j.ipl.2010.07.020] 2, 3

[11] Marco Molinaro, David P. Woodruff, and Grigory Yaroslavtsev: Beating the direct
     sum theorem in communication complexity with implications for sketching. In Proc. 24th
     Ann. ACM–SIAM Symp. on Discrete Algorithms (SODA’13), pp. 1738–1756. SIAM, 2013.
     [doi:10.1137/1.9781611973105.125] 4

[12] Marco Molinaro, David P. Woodruff, and Grigory Yaroslavtsev: Amplification of
     one-way information complexity via codes and noise sensitivity. In Proc. 42nd Internat.
     Colloq. on Automata, Languages, and Programming (ICALP’15), pp. 960–972. Springer, 2015.
     [doi:10.1007/978-3-662-47672-7_78, ECCC:TR15-031] 4

[13] Noam Nisan, Steven Rudich, and Michael E. Saks: Products and help bits in deci-
     sion trees. SIAM J. Comput., 28(3):1035–1050, 1998. Preliminary version in FOCS’94.
     [doi:10.1137/S0097539795282444] 2

[14] Ronen Shaltiel: Towards proving strong direct product theorems. Comput. Complex-
     ity, 12(1):1–22, 2003. Preliminary version in CCC’01. [doi:10.1007/s00037-003-0175-x,
     ECCC:TR01-009] 2, 3, 4


AUTHORS

     Joshua Brody
     Associate Professor
     Department of Computer Science
     Swarthmore College
     Swarthmore, PA, USA
     brody cs swarthmore edu
     http://cs.swarthmore.edu/~brody


     Jae Tak Kim
     Google
     Mountain View, CA, USA
     jkim17 swarthmore edu
     https://linkedin.com/in/jae-tak-kim





    Peem Lerdputtipongporn
    Ph. D. student
    Statistics Department
    Carnegie Mellon University
    Pittsburgh, PA, USA
    plerdput andrew cmu edu
    https://www.cmu.edu/dietrich/statistics-datascience/
        people/phd/peem-lerdputtipongporn.html


    Hariharan Srinivasulu
    Palantir Technologies
    New York, NY, USA
    srinivasulu hari gmail com
    https://www.linkedin.com/in/hari-srinivasulu/


ABOUT THE AUTHORS

     Joshua Brody graduated from Dartmouth College in 2010; his advisor was Amit
       Chakrabarti. The subject of his thesis was communication complexity. Since
       graduating, his research interests have broadened to streaming algorithms,
       property testing, and data structure lower bounds. He was introduced to
       computer science by his father, who taught him to count on his fingers in binary
       when he was four years old. He spends most of his spare time with his children,
       who are good at math but don’t seem to share his interest in counting in binary.


     Jae Tak Kim graduated from Swarthmore College in 2022 with a B. A. in computer
        science. During his undergraduate studies, he worked on research areas in query
        complexity theory and static program analysis. In his free time, he enjoys
       climbing, reading, and listening to music. Jae Tak Kim currently works at Google,
       where he is a software engineer.


    Peem Lerdputtipongporn is a Ph. D. candidate in Statistics and Public Policy at
       Carnegie Mellon University. His advisors are David Choi and Nynke Niezink.
       His parents named him β€œPeem,” a short Thai word that means β€œcommanding
       respect,” to offset the fact that he was born underweight. He received a B. A.
       in Mathematics and Computer Science from Swarthmore College in 2021. At
        Swarthmore, he worked with Professor Joshua Brody on Query Complexity
       and Professor Steve Wang on Statistical Paleontology. His current interests are
       algorithmic fairness and statistical machine learning.




Hariharan Srinivasulu is a 2022 graduate of Swarthmore College with a B. A. in
  Physics and Computer Science. He was named after a Hindu deity, although
  he suspects his name was inspired by one of his dad’s favorite musicians. He
  was born and brought up in Chennai, India, and moved to the US for his
  undergraduate studies. He was mentored by Mike Brown and Joshua Brody
  at Swarthmore, and has done research in the areas of Computational Plasma
  Physics and Query Complexity. He hopes to enter a career in research in the
  future and is interested in areas such as quantum computing and distributed
  systems. Hariharan currently works for Palantir Technologies as a software
  engineer.



