Pinning Down the Strong Wilber-1 Bound for Binary Search Trees

Authors Parinya Chalermsook, Julia Chuzhoy, Thatchaphol Saranurak,
Plaintext
                            T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71
                                         www.theoryofcomputing.org

                           S PECIAL ISSUE : APPROX-RANDOM 2020



Pinning Down the Strong Wilber-1 Bound
        for Binary Search Trees
   Parinya Chalermsook‡                           Julia Chuzhoy§        Thatchaphol Saranurak
                 Received June 13, 2021; Revised August 10, 2022; Published December 19, 2023




       Abstract. Dynamic Optimality Conjecture, postulating the existence of an 𝑂(1)-
       competitive online algorithm for binary search trees (BSTs), is among the most
       fundamental open problems in dynamic data structures. The conjecture remains
       wide open, despite extensive work and some notable progress, including, for
       example, the 𝑂(log log 𝑛)-competitive Tango Trees, which is the best currently
       known competitive ratio. One of the main hurdles towards settling the conjecture is
       that we currently do not have polynomial-time approximation algorithms achieving
       better than an 𝑂(log log 𝑛)-approximation, even in the offline setting. All known
       non-trivial algorithms for BSTs rely on comparing the algorithm’s cost with the
       so-called Wilber-1 bound (WB-1). Therefore, establishing the worst-case relationship
     An extended abstract of this paper appeared in the Proceedings of the 23rd Internat. Conf. on Approximation
Algorithms and Combinatorial Optimization (APPROX’20)
   ‡ Supported by the European Research Council (ERC) under the European Union’s Horizon 2020 research and
innovation programme (grant agreement No. 759557) and by the Academy of Finland Research Fellows program,
under grant No. 310415.
   § Supported in part by NSF grant CCF-1616584. Part of the work was done while the second author was a Weston
visiting professor at the Department of Computer Science and Applied Mathematics, Weizmann Institute of Science.


ACM Classification: Theory of computation → Data structure design and analysis
AMS Classification: 68Q25, 68W25
Key words and phrases: binary search trees, dynamic optimality, data structures


© 2023 Parinya Chalermsook, Julia Chuzhoy, and Thatchaphol Saranurak
c b Licensed under a Creative Commons Attribution License (CC-BY)                    DOI: 10.4086/toc.2023.v019a008
               PARINYA C HALERMSOOK , J ULIA C HUZHOY, AND T HATCHAPHOL S ARANURAK

       between this bound and the optimal solution cost appears crucial for further progress,
       and it is an interesting open question in its own right.
          Our contribution is twofold. First, we show that the gap between WB-1 and
       the optimal solution value can be as large as Ω(log log 𝑛/log log log 𝑛); in fact, we
       show that the gap holds even for several stronger variants of the bound.1 Second,
       we show,
             given an integer
                                 𝐷 > 0, a 𝐷-approximation algorithm that runs in time
                      Ω(𝐷)
       exp 𝑂 𝑛 1/2           log 𝑛   . In particular, this yields a constant-factor approximation
       algorithm with subexponential running time.2 Moreover, we obtain a simpler and
       cleaner efficient 𝑂(log log 𝑛)-approximation algorithm that can be used in an online
       setting. Finally, we suggest a new bound, that we call the Guillotine Bound, that is
       stronger than WB-1, while maintaining its algorithm-friendly nature, that we hope
       will lead to better algorithms. All our results use the geometric interpretation of the
       problem, leading to cleaner and simpler analysis.


1     Introduction

1.1    Binary search trees

Binary search trees (BST’s) are a fundamental data structure that has been extensively studied
for many decades. Informally, suppose we are given as input an online access sequence
𝑋 = {𝑥 1 , . . . , 𝑥 𝑚 } of keys from {1, . . . , 𝑛}, and our goal is to maintain a binary search tree 𝑇
over the set {1, . . . , 𝑛} of keys. The algorithm is allowed to modify the tree 𝑇 after each access;
the tree obtained after the 𝑖th access is denoted by 𝑇𝑖+1 . Each such modification involves a
sequence of rotation operations that transform the current tree 𝑇𝑖 into a new tree 𝑇𝑖+1 . The cost
of the transformation is the total number of rotations performed plus the depth of the key 𝑥 𝑖
in the tree 𝑇𝑖 . The total cost of the algorithm is the total cost of all transformations performed
as the sequence 𝑋 is processed. We denote by OPT(𝑋) the smallest cost of any algorithm for
maintaining a BST for the access sequence 𝑋, when the whole sequence 𝑋 is known to the
algorithm in advance.
Several algorithms for BST’s, whose costs are guaranteed to be 𝑂(𝑚 log 𝑛) for any access
sequence, such as AVL-trees [1] and red-black trees [2], are known since the 60’s (see [10],
Chapters 12 and 13). Moreover, it is well known that there are length-𝑚 access sequences 𝑋
on 𝑛 keys, for which OPT(𝑋) = Ω(𝑚 log 𝑛). However, such optimal worst-case guarantees are
often unsatisfactory from both practical and theoretical perspectives, as one can often obtain
better results for “structured” inputs. Arguably, a better notion of the algorithm’s performance
to consider is instance optimality, where the algorithm’s performance is compared to the optimal
cost OPT(𝑋) for the specific input access sequence 𝑋. This notion is naturally captured by the
algorithm’s competitive ratio: we say that an algorithm for BST’s is 𝛼-competitive, if, for every
    1A recent independent paper by Lecomte and Weinstein (ESA’20) shows an even stronger, Ω(log log 𝑛), separation.
    2The term “subexponential time” in this paper refers to the running time 2𝑜(𝑛) .


                          T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                                   2
              P INNING D OWN THE S TRONG W ILBER -1 B OUND FOR B INARY S EARCH T REES

online input access sequence 𝑋, the cost of the algorithm’s execution on 𝑋 is at most 𝛼 · OPT(𝑋).
Since for every length-𝑚 access sequence 𝑋, OPT(𝑋) ≥ 𝑚, the above-mentioned algorithms
that provide worst-case 𝑂(𝑚 log 𝑛)-cost guarantees are also 𝑂(log 𝑛)-competitive. However,
there are many known important special cases, in which the value of the optimal solution is
𝑂(𝑚), and for which the existence of an 𝑂(1)-competitive algorithm would lead to a much
better performance, including some interesting applications, such as, for example, adaptive
sorting [27, 7, 23, 26, 15, 24, 13, 9, 8, 3, 6, 5, 19].


1.1.1   The Dynamic Optimality Conjecture

A striking conjecture of Sleator and Tarjan [25] from 1985, called the dynamic optimality conjecture,
asserts that the Splay Trees provide an 𝑂(1)-competitive algorithm for BST’s. This conjecture has
sparked a long line of research, but despite the continuing effort, and the seeming simplicity
of BST’s, it remains widely open. In a breakthrough result, Demaine et al. [12] proposed the
Tango Trees algorithm, that achieves an 𝑂(log log 𝑛)-competitive ratio, and has remained the
best known algorithm for the problem, for over 15 years. A natural avenue for overcoming this
barrier is to first consider the “easier” task of designing (offline) approximation algorithms,
whose approximation factor is below 𝑂(log log 𝑛). Designing better approximation algorithms
is often a precursor to obtaining better online algorithms, and it is a natural stepping stone
towards this goal.


1.1.2   The Wilber bounds

The main obstacle towards designing better algorithms, both in the online and the offline settings,
is obtaining tight lower bounds on the value OPT(𝑋), that can be used in algorithm design. In
order to improve upon the trivial 𝑂(log 𝑛) approximation, the lower bound OPT(𝑋) ≥ 𝑚 is not
sufficient1. Wilber [29] proposed two new bounds, that we refer to as the Wilber-1 Bound, or
Wilber’s first bound, (WB-1) and the Wilber-2 Bound (WB-2). He proved that, for every input
sequence 𝑋, the values of both these bounds on 𝑋 are at most OPT(𝑋). The breakthrough
result of Demaine et al. [12], that gives an 𝑂(log log 𝑛)-competitive online algorithm, relies
on the WB-1 bound. In particular, they show that the cost of the solution produced by their
algorithm is within an 𝑂(log log 𝑛)-factor from the WB-1 bound on the given input sequence 𝑋,
and hence from OPT(𝑋). This in turn implies that, for every input sequence 𝑋, the value of the
WB-1 bound is within an 𝑂(log log 𝑛) factor from OPT(𝑋). Follow-up work [28, 16] improved
several aspects of Tango Trees, but it did not improve the approximation factor. Additional
lower bounds on OPT, that subsume both the WB-1 and the WB-2 bounds, were suggested in
[11, 14, 17], but unfortunately it is not clear how to exploit them in algorithm design. To this day,
the only method we have for designing non-trivial online or offline approximation algorithms
for BST’s is by relying on the WB-1 bound, and this seems to be the most promising approach
for obtaining better algorithms. In order to make further progress on both online and offline
   1This is due to the existence of sequences 𝑋 with OPT(𝑋) = Ω(𝑚 log 𝑛).


                        T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                       3
             PARINYA C HALERMSOOK , J ULIA C HUZHOY, AND T HATCHAPHOL S ARANURAK

approximation algorithms for BST’s, it therefore appears crucial that we better understand the
relationship between the WB-1 bound and the optimal solution cost.
Informally, the WB-1 bound relies on recursive partitioning of the input key sequence, that can be
represented by a partitioning tree. The standard WB-1 bound (that we refer to as the weak WB-1
bound) only considers a single such partitioning tree. It is well-known (see, e.g., [12, 28, 18]),
that the gap between OPT(𝑋) and the weak WB-1 bound for an access sequence 𝑋 may be as
large as Ω(log log 𝑛). However, the “bad” access sequence 𝑋 used to obtain this gap is highly
dependent on the fixed partitioning tree 𝑇. It is then natural to consider a stronger variant of
WB-1, that we refer to as strong WB-1 bound and denote by WB(𝑋), that maximizes the weak
WB-1 bound over all such partitioning trees. As suggested by Iacono [18], and by Kozma [20],
this gives a promising approach for improving the 𝑂(log log 𝑛)-approximation factor.


1.1.3   Our results

In this paper, we show that, even for this strong variant of WB-1, the gap between OPT(𝑋) and
WB(𝑋) may be as large as Ω(log log 𝑛/log log log 𝑛). This negative result extends to an even
stronger variant of WB-1 that we discuss below.
Our second set of results is algorithmic. We show, for any positive integer
                                                                            𝐷, an (offline) 𝐷-
                                                                  Ω(𝐷)
approximation algorithm that runs in time poly(𝑚) · exp 𝑂(𝑛 1/2          log 𝑛) . When 𝐷 is constant,
we obtain an 𝑂(1)-approximation in subexponential time. When 𝐷 is Θ(log log 𝑛), our algorithm
matches current polynomial-time approximation ratio, which is 𝑂(log log 𝑛). In the latter case,
we can also adapt the algorithm to the online setting, obtaining an 𝑂(log log 𝑛)-competitive
online algorithm.
All our results use the geometric interpretation of the problem, introduced by Demaine et al. [11],
leading to clean divide-and-conquer-style arguments that avoid, for example, the notion of
pointers and rotations. We feel that this approach, in addition to providing a cleaner and simpler
view of the problem, is more natural to work with in the context of approximation algorithms,
and should be more amenable to the powerful geometric techniques in the field.


1.2     Independent work

Independently from our work, Lecomte and Weinstein [21] showed that second Wilber bound
(also called funnel bound) dominates WB-1, and moreover, they show an access sequence 𝑋 for
which the two bounds have a gap of Ω(log log 𝑛). In particular, their result implies that the gap
between WB(𝑋) and OPT(𝑋) is Ω(log log 𝑛) for that access sequence. Their result subsumes our
Theorem 1.1 entirely (but not the extension discussed in Section 1.3.3).
We note that the access sequence 𝑋 used in our negative results provides a gap of

                                    Ω(log log 𝑛/log log log 𝑛)

                      T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                         4
              P INNING D OWN THE S TRONG W ILBER -1 B OUND FOR B INARY S EARCH T REES

between the WB-2 and the WB-1 bounds, although we only realized this after hearing the
statement of the results of [21]. Additionally, Lecomte and Weinstein show that WB-2 is invariant
under rotations, and use this to show that, when the WB-2 is constant, then the Independent
Rectangle bound of [11] is linear.




1.3     Statements of our results

We now formally state our results.




1.3.1    Geometric representation

We use the geometric interpretation of the problem, introduced by Demaine et al. [11], that we
refer to as the Min-Sat problem. Let 𝑃 be any set of points in the plane. We say that two points
𝑝, 𝑞 ∈ 𝑃 are aligned iff either their 𝑥-coordinates are equal, or their 𝑦-coordinates are equal. If 𝑝
and 𝑞 are not aligned, then let 𝑝,𝑞 be the smallest closed axis-aligned rectangle containing both
𝑝 and 𝑞; notice that 𝑝 and 𝑞 must be diagonally opposite corners of this rectangle. We say that
the pair (𝑝, 𝑞) of points is satisfied in 𝑃 iff there is some additional point 𝑟 ≠ 𝑝, 𝑞 in 𝑃 that lies in
𝑝,𝑞 (the point may lie on the boundary of the rectangle). Lastly, we say that the set 𝑃 of points
is satisfied iff for every pair 𝑝, 𝑞 ∈ 𝑃 of distinct points, either 𝑝 and 𝑞 are aligned, or they are
satisfied in 𝑃.
In the Min-Sat problem, the input is a set 𝑃 of points in the plane with integral 𝑥- and 𝑦-
coordinates; we assume that all 𝑥-coordinates are between 1 and 𝑛, and all 𝑦-coordinates
are between 1 and 𝑚 and distinct from each other, and that |𝑃| = 𝑚. The goal is to find a
minimum-cardinality set 𝑌 of points, such that the set 𝑃 ∪ 𝑌 of points is satisfied.
An access sequence 𝑋 over keys {1, . . . , 𝑛} can be represented by a set 𝑃 of points in the plane
as follows: if a key 𝑥 is accessed at time 𝑦, then add the point (𝑥, 𝑦) to 𝑃. Demaine et al. [11]
showed that, for every access sequence 𝑋, if we denote by 𝑃 the corresponding set of points in
the plane, then the value of the optimal solution to the Min-Sat problem on 𝑃 is Θ(OPT(𝑋)). They
also showed that, in order to obtain an 𝑂(𝛼)-approximation algorithm for BST’s, it is sufficient
to obtain an 𝛼-approximation algorithm for the Min-Sat problem. In the online version of the
Min-Sat problem, at every time step 𝑡, we discover the unique input point whose 𝑦-coordinate is
𝑡, and we need to make an irrevocable decision on which points with 𝑦-coordinate 𝑡 to add to
the solution. Demaine et al. [11] also showed that an 𝛼-competitive online algorithm for Min-Sat
implies an 𝑂(𝛼)-competitive online algorithm for BST’s. For convenience, we do not distinguish
between the input access sequence 𝑋 and the corresponding set of points in the plane, that we
also denote by 𝑋.

                       T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                            5
             PARINYA C HALERMSOOK , J ULIA C HUZHOY, AND T HATCHAPHOL S ARANURAK

1.3.2   Negative results for WB-1

We say that an input access sequence 𝑋 is a permutation if each key in {1, . . . , 𝑛} is accessed
exactly once. Equivalently, in the geometric view, every column with an integral 𝑥-coordinate
contains exactly one input point.
Informally, the WB-1 bound for an input sequence 𝑋 is defined as follows. Let 𝐵 be the bounding
box containing all points of 𝑋, and consider any vertical line 𝐿 drawn across 𝐵, that partitions
it into two vertical strips, separating the points of 𝑋 into two subsets 𝑋1 and 𝑋2 . Assume
that the points of 𝑋 are ordered by their 𝑦-coordinates from smallest to largest. We say that
a pair (𝑥, 𝑥 0) ∈ 𝑋 of points cross the line 𝐿, iff 𝑥 and 𝑥 0 are consecutive points of 𝑋, and they
lie on different sides of 𝐿. Let 𝐶(𝐿) be the number of all pairs of points in 𝑋 that cross 𝐿.
We then continue this process recursively with 𝑋1 and 𝑋2 , with the final value of the WB-1
bound being the sum of the two resulting bounds obtained for 𝑋1 and 𝑋2 , and 𝐶(𝐿). This
recursive partitioning process can be represented by a binary tree 𝑇 that we call a partitioning
tree (we note that the partitioning tree is not related to the BST tree that the BST algorithm
maintains). Every vertex 𝑣 of the partitioning tree is associated with a vertical strip 𝑆(𝑣),
where for the root vertex 𝑟, 𝑆(𝑟) = 𝐵. If the partitioning algorithm uses a vertical line 𝐿 to
partition the strip 𝑆(𝑣) into two sub-strips 𝑆1 and 𝑆2 , then vertex 𝑣 has two children, whose
corresponding strips are 𝑆1 and 𝑆2 . Note that every sequence of vertical lines used in the
recursive partitioning procedure corresponds to a unique partitioning tree and vice versa. Given
a set 𝑋 of points and a partitioning tree 𝑇, we denote by WB𝑇 (𝑋) the WB-1 bound obtained for
𝑋 while following the partitioning scheme defined by 𝑇. Wilber [29] showed that, for every
partitioning tree 𝑇, OPT(𝑋) ≥ Ω(WB𝑇 (𝑋)) holds. Moreover, Demaine et al. [12] showed that, if
𝑇 is a balanced tree, then OPT(𝑋) ≤ 𝑂(log log 𝑛) · WB𝑇 (𝑋). These two bounds are used to obtain
the 𝑂(log log 𝑛)-competitive algorithm of [12]. We call this variant of WB-1, that is defined with
respect to a fixed tree 𝑇, the weak WB-1 bound.
Unfortunately, it is well-known (see, e.g., [12, 28, 18]), that the gap between OPT(𝑋) and
the weak WB-1 bound on an input 𝑋 may be as large as Ω(log log 𝑛). In other words, for
any fixed partitioning tree 𝑇, there exists an input 𝑋 (that depends on 𝑇), with WB𝑇 (𝑋) ≤
𝑂(OPT(𝑋)/log log 𝑛). However, the construction of this “bad” input 𝑋 depends on the fixed
partitioning tree 𝑇.
We consider a stronger variant of WB-1, that we refer to as strong WB-1 bound and denote
by WB(𝑋), that maximizes the weak WB-1 bound over all such partitioning trees, that is,
WB(𝑋) = max𝑇 {WB𝑇 (𝑋)}.
Using this stronger bound as an alternative to weak WB-1 in order to obtain better approximation
algorithms was suggested by Iacono [18], and by Kozma [20].
Our first result rules out this approach: we show that, even for the strong WB-1 bound, the gap
between WB(𝑋) and OPT(𝑋) may be as large as Ω(log log 𝑛/log log log 𝑛), even if the input 𝑋 is
a permutation.

Theorem 1.1. For infinitely many integer 𝑛, there exists an access sequence 𝑋 on 𝑛 keys with |𝑋 | = 𝑛,

                      T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                          6
             P INNING D OWN THE S TRONG W ILBER -1 B OUND FOR B INARY S EARCH T REES

such that 𝑋 is a permutation, OPT(𝑋) ≥ Ω(𝑛 log log 𝑛), but WB(𝑋) ≤ 𝑂(𝑛 log log log 𝑛).

                                                                 log log 𝑛
                                                                              
In particular, for every partitioning tree 𝑇, OPT (𝑋)
                                              WB (𝑋)
                                                      ≥Ω       log log log 𝑛       for infinitely many sequences
                                               𝑇
𝑋. We note that it is well known (see, e.g., [6]), that any 𝑐-approximation algorithm for
permutation input can be turned into an 𝑂(𝑐)-approximation algorithm for any input sequence.
However, the known instances that achieve an Ω(log log 𝑛)-gap between the weak WB-1 bound
and OPT are not permutations. Therefore, our result is the first to provide a super-constant gap
between WB-1 and OPT for permutations, even for the case of weak WB-1.


1.3.3   Extension of WB-1

We consider natural generalizations of the WB-1 bound that allow partitioning the plane both
horizontally and vertically. We call the new bounds the consistent Guillotine Bound and the
Guillotine Bound. Our negative result extends to the consistent Guillotine Bound but not to the
Guillotine Bound. The Guillotine Bound seems to maintain the algorithm-friendly nature of
WB-1, and in particular it naturally fits into the algorithmic framework that we propose. We
hope that this bound can lead to improved algorithms, both in the offline and the online settings


1.3.4   Separating the two Wilber bounds

The sequence 𝑋 given by Theorem 1.1 not only provides a separation between WB-1 and OPT,
but it also provides a separation between WB-1 and WB-2. The latter can be defined in the
geometric view as follows. Recall that, for a pair of points 𝑥, 𝑦 ∈ 𝑋, 𝑥,𝑦 is the smallest closed
rectangle containing both 𝑥 and 𝑦. For a point 𝑥 in the access sequence 𝑋, the funnel of 𝑥 is
the set of all points 𝑦 ∈ 𝑋, for which 𝑥,𝑦 does not contain any point of 𝑋 \ {𝑥, 𝑦}, and alt(𝑥)
is the number of alterations between the left of 𝑥 and the right of 𝑥 in the funnel of 𝑥. The
second Wilber Bound for sequence 𝑋 is then defined as: WB(2) (𝑋) = |𝑋 | + 𝑥∈𝑋 alt(𝑥). We
                                                                                  Í
show that, for the sequence 𝑋 given by Theorem 1.1, WB(2) (𝑋) ≥ Ω(𝑛 log log 𝑛) holds, and
therefore WB(2) (𝑋)/WB(𝑋) ≥ Ω(log log 𝑛/log log log 𝑛) for that sequence, implying that the gap
between WB(𝑋) and WB(2) (𝑋) may be as large as Ω(log log 𝑛/log log log 𝑛). We note that we only
realized that our results provide this stronger separation between the two Wilber bounds after
hearing the statements of the results from the independent work of Lecomte and Weinstein [21]
mentioned above.


1.3.5   Algorithmic results

We provide new simple approximation algorithms for the problem, that rely on its geometric
interpretation, namely the Min-Sat problem.
Theorem 1.2. There is an offline algorithm for Min-Sat, that, given any integer 𝐷 ≥ 1, and an access
sequence 𝑋 of length m to n keys, produces a solution of cost at most 𝐷 · OPT(𝑋) and has running time

                      T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                                    7
              PARINYA C HALERMSOOK , J ULIA C HUZHOY, AND T HATCHAPHOL S ARANURAK
                      Ω(𝐷)
                                       
poly(𝑚) · exp 𝑂 𝑛 1/2          log 𝑛        . For 𝐷 = Θ(log log 𝑛), the algorithm’s running time is polynomial
in 𝑛 and 𝑚, and it can be adapted to the online setting, achieving an 𝑂(log log 𝑛)-competitive ratio.

While the 𝑂(log log 𝑛)-approximation factor achieved by our algorithm in time poly(𝑚𝑛) is
similar to that achieved by other known algorithms [12, 16, 28], this is the first algorithm that
relies solely on the geometric formulation of the problem, which is arguably cleaner, simpler,
and better suited for exploiting the rich toolkit of algorithmic techniques developed in the areas
of online and approximation algorithms.


1.3.6   Erratum

Some erroneous complexity theoretic inferences, made in the paragraph following the statement
of Theorem 2 (page 33:6) in the conference version of this paper [4], are retracted in the
present article. In particular, at this time we are unable to rule out, under any plausible
complexity-theoretic assumption, the possibility that constant-factor approximation of Min-Sat
is NP-hard.
The main results are not affected and are identical in the two versions.


2     Preliminaries

All our results only use the geometric interpretation of the problem, that we refer to as the
Min-Sat problem. We include the formal definition of algorithms for BST’s and formally state
their equivalence.


2.1     The Min-Sat problem

For a point 𝑝 ∈ ℝ2 in the plane, we denote by 𝑝.𝑥 and 𝑝.𝑦 its 𝑥- and 𝑦-coordinates, respectively.
Given any pair 𝑝, 𝑝 0 of points, we say that they are aligned if 𝑝.𝑥 = 𝑝 0 .𝑥 or 𝑝.𝑦 = 𝑝 0 .𝑦. If 𝑝 and 𝑝 0
are not aligned, then we let 𝑝,𝑝0 be the smallest closed axis-aligned rectangle containing both 𝑝
and 𝑝 0; note that 𝑝 and 𝑝 0 must be diagonally opposite corners of the rectangle.
Definition 2.1. We say that a non-aligned pair 𝑝, 𝑝 0 of points is satisfied by a point 𝑝 00 if 𝑝 00 is
distinct from 𝑝 and 𝑝 0 and 𝑝 00 ∈ 𝑝,𝑝0 (where 𝑝 00 may lie on the boundary of the rectangle). We
say that a set 𝑆 of points is satisfied if for every non-aligned pair 𝑝, 𝑝 0 ∈ 𝑆 of points, there is some
point 𝑝 00 ∈ 𝑆 that satisfies this pair.

We refer to horizontal and vertical lines as rows and columns respectively. For a collection of
points 𝑋, the active rows of 𝑋 are the rows that contain at least one point in 𝑋. We define the
notion of active columns analogously. We denote by 𝑟(𝑋) and 𝑐(𝑋) the number of active rows and
active columns of the point set 𝑋, respectively. We say that a point set 𝑋 is a semi-permutation

                       T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                                 8
               P INNING D OWN THE S TRONG W ILBER -1 B OUND FOR B INARY S EARCH T REES

if every active row contains exactly one point of 𝑋. Note that, if 𝑋 is a semi-permutation,
then 𝑐(𝑋) ≤ 𝑟(𝑋). We say that 𝑋 is a permutation if it is a semi-permutation, and additionally,
every active column contains exactly one point of 𝑋. Clearly, if 𝑋 is a permutation, then
𝑐(𝑋) = 𝑟(𝑋) = |𝑋 |. We denote by 𝐵 the smallest closed rectangle containing all points of 𝑋, and
call 𝐵 the bounding box.
We are now ready to define the Min-Sat problem. The input to the problem is a set 𝑋 of points
that is a semi-permutation, and the goal is to compute a minimum-cardinality set 𝑌 of points,
such that 𝑋 ∪ 𝑌 is satisfied. We say that a set 𝑌 of points is a feasible solution for 𝑋 if 𝑋 ∪ 𝑌 is
satisfied. We denote by OPT(𝑋) the minimum value |𝑌| of any feasible solution 𝑌 for 𝑋.2
In the online version of the Min-Sat problem, at every time step 𝑡, we discover the unique input
point from 𝑋 whose 𝑦-coordinate is 𝑡, and we need to make an irrevocable decision on which
points with 𝑦-coordinate 𝑡 to add to the solution 𝑌. As shown by Demaine et al. [11], the Min-Sat
problem is equivalent to the BST problem, in the following sense:

Theorem 2.2 (Demaine et al.). Any efficient 𝛼-approximation algorithm for Min-Sat can be transformed
into an efficient 𝑂(𝛼)-approximation algorithm for BST’s, and similarly any online 𝛼-competitive
algorithm for Min-Sat can be transformed into an online 𝑂(𝛼)-competitive algorithm for BST’s.


2.2     Basic geometric properties

The following observation is well known (see, e.g., Observation 2.1 from [11]).

Observation 2.3. Let 𝑍 be any satisfied point set. Then for every pair 𝑝, 𝑞 ∈ 𝑍 of distinct points,
there is a point 𝑟 ∈ 𝑝,𝑞 \ {𝑝, 𝑞} such that 𝑟.𝑥 = 𝑝.𝑥 or 𝑟.𝑦 = 𝑝.𝑦.

Proof. Since the set 𝑍 is satisfied, rectangle 𝑝,𝑞 must contain at least one point of 𝑍 that is
distinct from 𝑝 and 𝑞. Among all such points, let 𝑟 be the one with smallest ℓ1 -distance to 𝑝.
We claim that either 𝑝.𝑥 = 𝑟.𝑥, or 𝑝.𝑦 = 𝑟.𝑦. Indeed, assume otherwise. Then 𝑝 and 𝑟 are not
aligned, but no point of 𝑍 lies in 𝑝,𝑟 \ {𝑝, 𝑟}, contradicting the fact that 𝑍 is a satisfied point
set.                                                                                              


2.2.1    Collapsing sets of columns or rows

Assume that we are given any set 𝑋 of points, and any collection 𝒞 of consecutive active columns
for 𝑋. In order to collapse the set 𝒞 of columns, we replace 𝒞 with a single representative
column 𝐶 (for concreteness, we use the column of 𝒞 with minimum 𝑥-coordinate). For every
point 𝑝 ∈ 𝑋 that lies on a column of 𝒞, we replace 𝑝 with a new point, lying on the column 𝐶,
whose 𝑦-coordinate remains the same. Formally, we replace point 𝑝 with point (𝑥, 𝑝.𝑦), where
   2In the original paper that introduced this problem [11], the value of the solution is defined as |𝑋 ∪ 𝑌|, while
our solution value is |𝑌|. For the purpose of showing the results in this paper, the two definitions are equivalent to
within a factor of 2.


                          T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                                      9
              PARINYA C HALERMSOOK , J ULIA C HUZHOY, AND T HATCHAPHOL S ARANURAK

𝑥 is the 𝑥-coordinate of the column 𝐶. We denote by 𝑋|𝒞 the resulting new set of points. We can
similarly define collapsing set of rows. The following useful observation is easy to verify.

Observation 2.4. Let 𝑆 be any set of points, and let 𝒞 be any collection of consecutive active
columns (or rows) with respect to 𝑆. If 𝑆 is a satisfied set of points, then so is 𝑆 |𝒞 .

Proof. It is sufficient to prove the observation for the case where 𝒞 contains two consecutive
active columns, that we denote by 𝐶 and 𝐶 0, which are collapsed into the column 𝐶. We can
then apply this argument iteratively to collapse any number of columns.
Assume for contradiction that the set 𝑆 |𝒞 of points is not satisfied, and let 𝑝, 𝑞 ∈ 𝑆 |𝒞 be a pair
of points that are not satisfied. Note that, if 𝑝 and 𝑞 cannot both lie on the column 𝐶 in 𝑆 |𝒞 .
Moreover, if both 𝑝 and 𝑞 lie to the right, or to the left of the column 𝐶, then they continue to be
satisfied by the same point 𝑟 ∈ 𝑆 that satisfied them in set 𝑆. We now consider two cases.
Assume first that 𝑝 lies to the left of the column 𝐶, and 𝑞 lies to the right of the column 𝐶 in
point set 𝑆 |𝒞 . Let 𝑟 be the point that satisfied the pair (𝑝, 𝑞) in point set 𝑆. If 𝑟 lied on column 𝐶
in 𝑆, then it remains on column 𝐶 in 𝑆 |𝒞 . If 𝑟 lied on column 𝐶 0 in 𝑆, then a copy of 𝑟 lies on
column 𝐶 in 𝑆 |𝒞 , and this copy continues to satisfy the pair (𝑝, 𝑞). Otherwise, point 𝑟 belongs to
set 𝑆 |𝒞 , and it continues to satisfy the pair (𝑝, 𝑞).
It now remains to consider the case when exactly one of the two points (say 𝑝) lies on the column
𝐶 in 𝑆 |𝒞 . Assume w.l.o.g. that 𝑞 lies to the right of 𝑝 and below it in 𝑆 |𝒞 . Then either 𝑝 belongs
to 𝑆 (in which case we denote 𝑝 0 = 𝑝), or 𝑝 is a copy of some point 𝑝 0 that lies on column 𝐶 0 in
𝑆. Let 𝑟 be the point that satisfies the pair (𝑝 0 , 𝑞) of points in 𝑆. Using the same reasoning as
before, it is easy to see that either 𝑟 belongs to 𝑆 |𝒞 , where it continues to satisfy the pair (𝑝, 𝑞) of
points, or a copy of 𝑟 belongs to 𝑆 |𝒞 , and it also continues to satisfy the pair (𝑝, 𝑞).
It is easy to verify that an analogue of Observation 2.4 holds for collapsing rows as well.             


2.2.2   Canonical solutions

We say that a solution 𝑌 for input 𝑋 is canonical iff every point 𝑝 ∈ 𝑌 lies on an active row and
an active column of 𝑋. It is easy to see that any solution can be transformed into a canonical
solution, without increasing its cost.

Observation 2.5. There is an efficient algorithm, that, given an instance 𝑋 of Min-Sat and any
feasible solution 𝑌 for 𝑋, computes a feasible canonical solution 𝑌ˆ for 𝑋 with |𝑌|
                                                                                 ˆ ≤ |𝑌|.

Proof. Let 𝐶 and 𝐶 0 be any pair of consecutive active columns for 𝑋, such that some point of
𝑌 lies strictly between 𝐶 and 𝐶 0. Let 𝒞 be the set of all columns that lie between 𝐶 and 𝐶 0,
including 𝐶 but excluding 𝐶 0, that contain points of 𝑋 ∪ 𝑌. We collapse the columns in 𝒞 into
the column 𝐶, obtaining a new feasible solution for instance 𝑋 (we use Observation 2.4). We
continue this process until every point of the resulting solution 𝑌 lies on an active column, and
we perform the same procedure for the rows.                                                     

                       T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                            10
              P INNING D OWN THE S TRONG W ILBER -1 B OUND FOR B INARY S EARCH T REES

2.3   Partitioning trees

We now turn to define partitioning trees, that are central to both defining the WB-1 bound and
to describing our algorithm.
Let 𝑋 be the a set of points that is a semi-permutation. We can assume without loss of generality
that every column with an integral 𝑥-coordinate between 1 and 𝑐(𝑋) inclusive contains at least
one point of 𝑋. Let 𝐵 be the bounding box of 𝑋. Assume that the set of active columns is
{𝐶1 , . . . , 𝐶 𝑎 }, where 𝑎 = 𝑐(𝑋), and that for all 1 ≤ 𝑖 ≤ 𝑎, the 𝑥-coordinate of column 𝐶 𝑖 is 𝑖. Let
ℒ be the set of all vertical lines with half-integral 𝑥-coordinates between 1 + 1/2 and 𝑎 − 1/2
(inclusive). Throughout, we refer to the vertical lines in ℒ as auxiliary columns. Let 𝜎 be an
arbitrary ordering of the lines of ℒ and denote 𝜎 = (𝐿1 , 𝐿2 , . . . , 𝐿 𝑎−1 ). We define a hierarchical
partition of the bounding box 𝐵 into vertical strips using 𝜎, as follows. We perform 𝑎 − 1
iterations. In the first iteration, we partition the bounding box 𝐵, using the line 𝐿1 , into two
vertical strips, 𝑆𝐿 and 𝑆𝐵 . For 1 < 𝑖 ≤ 𝑎 − 1, in iteration 𝑖 we consider the line 𝐿 𝑖 , and we let 𝑆 be
the unique vertical strip in the current partition that contains the line 𝐿 𝑖 . We then partition 𝑆
into two vertical sub-strips by the line 𝐿 𝑖 . When the partitioning algorithm terminates, every
vertical strip contains exactly one active column.




Figure 1: An Illustration of a partitioning tree and the corresponding sequence 𝜎 = (𝐿1 , . . . , 𝐿7 ).
Strip 𝑆(𝑣) corresponds to node 𝑣 that owns line 𝐿6 .

This partitioning process can be naturally described by a binary tree 𝑇 = 𝑇(𝜎), that we call a
partitioning tree associated with the ordering 𝜎 (see Figure 1). Each node 𝑣 ∈ 𝑉(𝑇) is associated
with a vertical strip 𝑆(𝑣) of the bounding box 𝐵. The strip 𝑆(𝑟) of the root vertex 𝑟 of 𝑇 is the
bounding box 𝐵. For every inner vertex 𝑣 ∈ 𝑉(𝑇), if 𝑆 = 𝑆(𝑣) is the vertical strip associated with
𝑣, and if 𝐿 ∈ ℒ is the first line in 𝜎 that lies strictly in 𝑆, then line 𝐿 partitions 𝑆 into two sub-strips,
𝑆(𝑣1 ) and 𝑆(𝑣2 ), corresponding to the two children 𝑣 1 and 𝑣 2 of 𝑣 in the partitioning tree. We
say that 𝑣 owns the line 𝐿, and we denote 𝐿 = 𝐿(𝑣). For each leaf node 𝑣, the corresponding strip
𝑆(𝑣) contains exactly one active column of 𝑋, and 𝑣 does not own any line of ℒ. For each vertex
𝑣 ∈ 𝑉(𝑇), let 𝑁(𝑣) = |𝑋 ∩ 𝑆(𝑣)| be the number of points from 𝑋 that lie in 𝑆(𝑣), and let width(𝑣)
be the width of the strip 𝑆(𝑣). Given a partition tree 𝑇 for point set 𝑋, we refer to the vertical
strips in {𝑆(𝑣)} 𝑣∈𝑇 as 𝑇-strips.

                       T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                               11
              PARINYA C HALERMSOOK , J ULIA C HUZHOY, AND T HATCHAPHOL S ARANURAK

2.4   The WB-1 bound

The WB-1 bound3 is defined with respect to an ordering (or a permutation) 𝜎 of the auxiliary
columns, or, equivalently, with respect to the partitioning tree 𝑇(𝜎). It will be helpful to keep
both these views in mind. In this paper, we will make a clear distinction between a weak variant
of the WB-1 bound, as defined by Wilber himself in [29] and a strong variant, as mentioned in
[18].
Let 𝑋 be a semi-permutation, and let ℒ be the corresponding set of auxiliary columns. Consider
an arbitrary fixed ordering 𝜎 of columns in ℒ and its corresponding partition tree 𝑇 = 𝑇(𝜎). For
each inner node 𝑣 ∈ 𝑉(𝑇), consider the set 𝑋 0 = 𝑋 ∩ 𝑆(𝑣) of input points that lie in the strip 𝑆(𝑣),
and let 𝐿(𝑣) ∈ ℒ be the line that 𝑣 owns. We denote 𝑋 0 = {𝑝 1 , 𝑝2 , . . . , 𝑝 𝑘 }, where the points
are indexed in the increasing order of their 𝑦-coordinates; since 𝑋 is a semi-permutation, no
two points of 𝑋 may have the same 𝑦-coordinate. For 1 ≤ 𝑗 < 𝑘, we say that the ordered pair
(𝑝 𝑗 , 𝑝 𝑗+1 ) of points form a crossing of 𝐿(𝑣) iff 𝑝 𝑗 , 𝑝 𝑗+1 lie on the opposite sides of the line 𝐿(𝑣). We
let cost(𝑣) be the total number of crossings of 𝐿(𝑣) by the points of 𝑋 ∩ 𝑆(𝑣). When 𝐿 = 𝐿(𝑣), we
also write cost(𝐿) to denote cost(𝑣). If 𝑣 is a leaf vertex, then its cost is set to 0.
Definition 2.6 (WB-1 bound). For any semi-permutation 𝑋, an ordering 𝜎 of the auxiliary
columns in ℒ, and the corresponding partitioning tree 𝑇 = 𝑇(𝜎), the (weak) WB-1 bound of
𝑋 with respect to 𝜎 is: WB𝜎 (𝑋) = WB𝑇 (𝑋) = 𝑣∈𝑉(𝑇) cost(𝑣). The strong WB-1 bound of 𝑋 is
                                           Í
WB(𝑋) = max𝜎 WB𝜎 (𝑋), where the maximum is taken over all permutations 𝜎 of the lines in ℒ.

It is well known that the WB-1 bound is a lower bound on the optimal solution cost:
Claim 2.7. For any semi-permutation 𝑋, WB(𝑋) ≤ 2 · OPT(𝑋).

The original proof of this fact is due to Wilber [29], which was later presented in the geometric
view by Demaine et al. [11], via the notion of independent rectangles. In Section 8, we include a
direct geometric proof of this fact.
We note a simple observation, that the cost can be bounded by the number of points on the
smaller side.
Observation 2.8. Let 𝑋 be a semi-permutation, 𝜎 an ordering of the auxiliary columns in ℒ, and
let 𝑇 = 𝑇(𝜎) be the corresponding partitioning tree. Let 𝑣 ∈ 𝑉(𝑇) be any inner vertex of the tree,
whose two child vertices are denoted by 𝑣 1 and 𝑣2 . Then cost(𝑣) ≤ 2 min{|𝑋 ∩ 𝑆(𝑣 1 )|, |𝑋 ∩ 𝑆(𝑣 2 )|}.

Proof. For simplicity, we denote 𝑋 0 = 𝑋 ∩ 𝑆(𝑣1 ) and 𝑋 00 = 𝑋 ∩ 𝑆(𝑣 2 ). Assume w.l.o.g. that
|𝑋 0 | ≤ |𝑋 00 |. Notice that, if the pair (𝑝 𝑖 , 𝑝 𝑖+1 ) of points in 𝑆(𝑣) define a crossing of 𝐿(𝑣), then one
of 𝑝 𝑖 , 𝑝 𝑖+1 must lie in 𝑋 0. Every point 𝑝 𝑗 ∈ 𝑋 0 may participate in at most two pairs of points that
define crossings: the pairs (𝑝 𝑗−1 , 𝑝 𝑗 ) and (𝑝 𝑗 , 𝑝 𝑗+1 ). Therefore, the total number of crossings of
𝐿(𝑣) is at most 2|𝑋 0 |.                                                                                      
   3Also called Interleaving bound [12], the first Wilber bound, “interleave lower bound” [29], or alternation
bound [18]


                        T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                                12
                   P INNING D OWN THE S TRONG W ILBER -1 B OUND FOR B INARY S EARCH T REES




                         Figure 2: An illustration of a split of 𝑋 by ℒ 0 = {𝐿1 , 𝐿2 , 𝐿3 }.


3     Geometric decomposition theorems

In this section, we develop several technical tools that will allow us to decompose a given
instance into a number of sub-instances. We then analyze the optimal solution costs and the
Wilber bound values for the resulting subinstances.


3.1       Split instances

Consider a semi-permutation 𝑋. We define the split instances with respect to any subset ℒ 0 ⊆ ℒ
of the auxiliary columns for 𝑋: Notice that the lines in ℒ 0 partition the bounding box 𝐵 into a
collection of internally disjoint strips, that we denote by {𝑆1 , . . . , 𝑆 𝑘 } where 𝑘 = |ℒ 0 | + 1. We can
then define the strip instances 𝑋𝑖 ⊆ 𝑋 as containing all vertices of 𝑋 ∩ 𝑆 𝑖 for all 1 ≤ 𝑖 ≤ 𝑘, and
the compressed instance 𝑋,       ˜ that is obtained by collapsing, for each 1 ≤ 𝑖 ≤ 𝑘, all active columns
                                                                                            𝑘
that lie in strip 𝑆 𝑖 , into a single column. We call these resulting instances (𝑋˜ , {𝑋𝑖 } 𝑖=1  ) a split of 𝑋
by ℒ 0. See Figure 2.
                                                                      𝑘
Observation 3.1. Let ℒ 0 ⊆ ℒ be a collection of lines and (𝑋˜ , {𝑋𝑖 } 𝑖=1 ) be a split of 𝑋 by ℒ 0. Then

      •       𝑖 𝑟(𝑋 𝑖 ) = 𝑟(𝑋)
          Í

      •       𝑖 𝑐(𝑋𝑖 )) = 𝑐(𝑋)
          Í

          ˜ ≤𝑘
      • 𝑐(𝑋)

The first property holds since 𝑋 is a semi-permutation.
Consider an arbitrary ordering 𝜎 of the lines in ℒ, such that the lines of ℒ 0 appear at the
beginning of 𝜎. The lines in ℒ 0 split 𝜎 naturally into 𝑘 + 1 orderings. Let 𝑆1 , . . . , 𝑆 𝑘 be the strips
obtained from partitioning box 𝐵 by ℒ 0, and for each 𝑖 ∈ [𝑘], ℒ 𝑖 is a collection of lines in strip 𝑆 𝑖 .
Now, 𝜎𝑖 can be defined by naturally inducing 𝜎 to the lines in ℒ 𝑖 , and 𝜎˜ is the ordering of lines

                             T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                           13
              PARINYA C HALERMSOOK , J ULIA C HUZHOY, AND T HATCHAPHOL S ARANURAK

in ℒ 0 induced by 𝜎. We say that 𝜎, ˜ 𝜎1 , . . . , 𝜎 𝑘 are the split orderings of 𝜎 by ℒ 0. Similarly, the
lines ℒ also split the partitioning tree 𝑇 = 𝑇(𝜎) into 𝑇
         0                                                     e = 𝑇(𝜎)˜ and 𝑇1 , . . . , 𝑇𝑘 where 𝑇𝑖 = 𝑇(𝜎𝑖 )
for all 𝑖 ∈ [𝑘]. These partitioning trees are called the split partitioning trees of 𝑇 by ℒ 0.
                                                             𝑘
Observation 3.2. For ℒ 0 ⊆ ℒ and ordering 𝜎, let (𝑋˜ , {𝑋𝑖 } 𝑖=1 ) and ( 𝜎,
                                                                         ˜ {𝜎𝑖 }) be the split instances
and split orderings respectively. Then,

                                   𝑘
                                   Õ
                                                            ˜ = WB𝜎 (𝑋)
                                         WB𝜎𝑖 (𝑋𝑖 ) + WB𝜎˜ (𝑋)
                                   𝑖=1

This property can be viewed as a “perfect” decomposition property of the weak WB-1 bound
with respect to the split operation.


3.2   Decomposition theorem for the optimal solution

We prove the following recurrence about the “subadditivity” property of OPT under the
decomposition into split instances.
                                                                  𝑘
Theorem 3.3. Let ℒ 0 ⊆ ℒ be a collection of lines and (𝑋˜ , {𝑋𝑖 } 𝑖=1 ) be a split instance of 𝑋 by ℒ 0. Then

                                   𝑘
                                   Õ
                                                        ˜ ≤ OPT(𝑋).
                                         OPT(𝑋𝑖 ) + OPT(𝑋)
                                   𝑖=1

                  𝑘
Proof. Let {𝑆 𝑖 } 𝑖=1   be the strips partitioned by ℒ 0. Let 𝑌 be an optimal canonical solution for
𝑋, so that every point of 𝑌 lies on an active row and an active column for 𝑋. For each 𝑖, let 𝑌𝑖
denote the set of points of 𝑌 that lie in the strip 𝑆 𝑖 . Since these points lie in the interior of the
          Ð𝑘
strip, 𝑌 = 𝑖=1     𝑌𝑖 .
For each 𝑖, let ℛ 𝑖 denote the set of all rows 𝑅, such that: (i) 𝑅 contains a point of 𝑋; (ii) 𝑅 contains
no point of 𝑋𝑖 ; and (iii) at least one point of 𝑌𝑖 lies on 𝑅. Let 𝑚 𝑖 = |ℛ 𝑖 |. We need the following
claim.
Claim 3.4. There is a feasible solution 𝑌ˆ to instance 𝑋,
                                                       ˜ containing at most         𝑖 𝑚 𝑖 points.
                                                                                Í


Proof. We construct the solution 𝑌ˆ for 𝑋˜ as follows. Consider 𝑖 ∈ [𝑘]. Let 𝐶 𝑖 be the unique
column into which the columns lying in the strip 𝑆 𝑖 were collapsed. For every point 𝑝 ∈ 𝑌𝑖 that
lies on a row 𝑅 ∈ ℛ 𝑖 , we add a new point 𝜑(𝑝) on the intersection of row 𝑅 and column 𝐶 𝑖 to
the solution 𝑌. ˆ Once we process all strips 𝑖 ∈ [𝑘], we obtain a final set of points 𝑌. ˆ It is easy to
verify that |𝑌|ˆ = 𝑖 𝑚 𝑖 . In order to see that 𝑌ˆ is a feasible solution to instance 𝑋,
                   Í                                                                  ˜ it is enough to
show that the set 𝑋˜ ∪ 𝑌ˆ of points is satisfied. Notice that set 𝑋 ∪ 𝑌 of points is satisfied, and
set 𝑋˜ ∪ 𝑌ˆ is obtained from 𝑋 ∪ 𝑌 by collapsing sets of active columns lying in each strip 𝑆 𝑖 for
𝑖 ∈ [𝑘]. From Observation 2.4, the point set 𝑋˜ ∪ 𝑌ˆ is satisfied.                                     

                        T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                               14
              P INNING D OWN THE S TRONG W ILBER -1 B OUND FOR B INARY S EARCH T REES

We now consider the strip instances {𝑋𝑖 } 𝑖∈[𝑘] and prove the following claim, that will complete
the proof of the lemma.
Claim 3.5. For each 𝑖 ∈ [𝑘], OPT(𝑋𝑖 ) ≤ |𝑌𝑖 | − 𝑚 𝑖 .

Proof. Notice first that the point set 𝑋𝑖 ∪ 𝑌𝑖 must be satisfied. We will modify point set 𝑌𝑖 , to
obtain another set 𝑌𝑖0, so that 𝑌𝑖0 remains a feasible solution for 𝑋𝑖 , and |𝑌𝑖0 | ≤ |𝑌𝑖 | − 𝑚 𝑖 .
In order to do so, we perform 𝑚 𝑖 iterations. In each iteration, we will decrease the size of 𝑌𝑖
by at least one, while also decreasing the cardinality of the set ℛ 𝑖 of rows by exactly 1, and
maintaining the feasibility of the solution 𝑌𝑖 for 𝑋𝑖 .
In every iteration, we select two arbitrary rows 𝑅 and 𝑅0, such that: (i) 𝑅 ∈ ℛ 𝑖 ; (ii) 𝑅0 is an active
row for instance 𝑋𝑖 , and (iii) no point of 𝑌𝑖 ∪ 𝑋𝑖 lies strictly between rows 𝑅 and 𝑅0. We collapse
the rows 𝑅 and 𝑅0 into the row 𝑅0. From Observation 2.4, the resulting new set 𝑌𝑖 of points
remains a feasible solution for instance 𝑋𝑖 . We claim that |𝑌𝑖 | decreases by at least 1. In order
to show this, it is enough to show that there are two points 𝑝, 𝑝 0 ∈ 𝑋𝑖 ∪ 𝑌𝑖 , with 𝑝 ∈ 𝑅, 𝑝 0 ∈ 𝑅0,
such that the 𝑥-coordinates of 𝑝 and 𝑝 0 are the same; in this case, after we collapse the rows, 𝑥
and 𝑥 0 are mapped to the same point. Assume for contradiction that no such two points exist.
Let 𝑝 ∈ 𝑅 ∩ (𝑋𝑖 ∪ 𝑌𝑖 ), 𝑝 0 ∈ 𝑅0 ∩ 𝑌𝑖 be a pair of points with smallest horizontal distance. Such
points must exist since 𝑅 contains a point of 𝑋𝑖 and 𝑅0 contains a point of 𝑌𝑖 . But then no other
point of 𝑋𝑖 ∪ 𝑌𝑖 lies in 𝑝,𝑝0 , so the pair (𝑝, 𝑝 0) is not satisfied in 𝑋𝑖 ∪ 𝑌𝑖 , a contradiction.   

                                                                                                           


3.3   Decomposition theorem for the strong WB-1 bound.

We prove the following recurrence about the strong WB-1 bound bound.
                                                                  𝑘
Theorem 3.6. Let ℒ 0 ⊆ ℒ be a collection of lines and (𝑋˜ , {𝑋𝑖 } 𝑖=1 ) be a split instance of 𝑋 by ℒ 0. Then
                                                        𝑘
                                                        Õ
                                         ˜ +8
                             WB(𝑋) ≤ 4WB(𝑋)                   WB(𝑋𝑖 ) + 𝑂(|𝑋 |).
                                                        𝑖=1


We find this result somewhat surprising. One can think of the expression WB(𝑋)
                                                                                             Í
                                                                                  e + 𝑖∈[𝑘] WB(𝑋𝑖 )
as a WB-1 bound obtained by first cutting along the lines that serve as boundaries of the strips
𝑆 𝑖 for 𝑖 ∈ [𝑘], and then starting to cut inside the individual strips afterwards. However, WB(𝑋)
is the maximum of WB𝜎 (𝑋) obtained over all possible orderings 𝜎, including those that do not
obey this cutting order.
The remainder of this section is dedicated to the proof of Theorem 3.6. For each 1 ≤ 𝑖 ≤ 𝑘, we
denote by ℬ𝑖 be the set of consecutive active columns containing the points of 𝑋𝑖 , and we refer
to it as a block. For brevity, we also say “Wilber bound” to mean the strong WB-1 bound in this
section.

                       T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                               15
             PARINYA C HALERMSOOK , J ULIA C HUZHOY, AND T HATCHAPHOL S ARANURAK

3.3.1   Forbidden points

For the sake of the proof, we need the notion of forbidden points. Let 𝑋ˆ be some semi-
permutation and ℒ̂ be the set of auxiliary columns for 𝑋.ˆ Let 𝐹 ⊆ 𝑋ˆ be a set of points that we
refer to as forbidden points. We now define the strong WB-1 bound with respect to the forbidden
points, WB𝐹 (𝑋).ˆ

Consider any permutation 𝜎ˆ of the lines in ℒ̂. Intuitively, WB𝐹𝜎ˆ (𝑋)
                                                                    ˆ counts all the crossings
                      ˆ but excludes all crossing pairs (𝑝, 𝑝 0) where at least one of 𝑝, 𝑝 0 lie
contributed to WB𝜎ˆ (𝑋)
in 𝐹. Similar to WB(𝑋), we define WB𝐹 (𝑋)ˆ = max𝜎ˆ WB𝐹 (𝑋),
                                                       𝜎ˆ
                                                          ˆ where the maximum is over all
permutations 𝜎ˆ of the lines in ℒ̂.
Next, we define WB𝐹𝜎ˆ (𝑋) ˆ more formally. Let 𝑇 = 𝑇( 𝜎)   ˆ be the partitioning tree associated with 𝜎. ˆ
For each vertex 𝑣 ∈ 𝑉(𝑇), let 𝐿 = 𝐿(𝑣) be the line that belongs to 𝑣, and let Cr𝜎ˆ (𝐿) be the set of
all crossings (𝑝, 𝑝 0) that contribute to cost(𝐿); that is, 𝑝 and 𝑝 0 are two points that lie in the strip
𝑆(𝑣) on two opposite sides of 𝐿, and no other point of 𝑋ˆ ∩ 𝑆(𝑣) lies between the row of 𝑝 and
the row of 𝑝 0. Let Cr𝜎ˆ = 𝐿∈ℒ̂ Cr𝜎ˆ (𝐿). Observe that WB𝜎ˆ (𝑋) = | Cr𝜎ˆ | by definition. We say that
                             Ð
a crossing (𝑝, 𝑝 0) ∈ Cr𝜎ˆ (𝐿) is forbidden iff at least one of 𝑝, 𝑝 0 lie in 𝐹; otherwise the crossing is
allowed. We let Cr𝐹𝜎ˆ (𝐿) be the set of crossings obtained from Cr𝜎ˆ (𝐿) by discarding all forbidden
crossings. We then let Cr𝐹𝜎ˆ = 𝐿∈ℒ̂ Cr𝐹𝜎ˆ (𝐿), and WB𝐹𝜎ˆ (𝑋) ˆ = | Cr𝐹 |.
                                  Ð
                                                                        𝜎ˆ

We emphasize that WB𝐹 (𝑋)ˆ is not necessarily the same as WB(𝑋ˆ \ 𝐹), as some crossings of the
         ˆ                                                          ˆ
instance 𝑋 \ 𝐹 may not correspond to allowed crossings of instance 𝑋.


3.3.2   Proof overview and notation

Consider first the compressed instance 𝑋,       ˜ that is a semi-permutation. We denote its set of
active columns by 𝒞˜ = {𝐶1 , . . . , 𝐶 𝑘 }, where the columns are indexed in their natural left-to-right
order. Therefore, 𝐶 𝑖 is the column that was obtained by collapsing all active columns in strip
𝑆 𝑖 . It would be convenient for us to slightly modify the instance 𝑋˜ by simply multiplying
all 𝑥-coordinates of the points in 𝑋˜ and of the columns in 𝒞˜ by factor 2. Note that this does
not affect the value of the optimal solution or of the Wilber bound, but it ensures that every
consecutive pair of columns in 𝒞˜ is separated by a column with an integral 𝑥-coordinate. We let
ℒ̃ be the set of all vertical lines with half-integral coordinates in the resulting instance 𝑋.  ˜

Similarly, we modify the original instance 𝑋, by inserting, for every consecutive pair ℬ𝑖 , ℬ𝑖+1 of
blocks, a new column with an integral coordinate that lies between the columns of ℬ𝑖 and the
columns of ℬ𝑖+1 . This transformation does not affect the optimal solution cost or the value of
the Wilber bound. For all 1 ≤ 𝑖 ≤ 𝑁, we denote 𝑞 𝑖 = |ℬ𝑖 |. We denote by ℒ the set of all vertical
lines with half-integral coordinates in the resulting instance 𝑋.
                                                                    𝑞 +1
                                                n                          o
Consider any block ℬ𝑖 . We denote by ℒ 𝑖 =          𝐿1𝑖 , . . . , 𝐿 𝑖 𝑖        the set of 𝑞 𝑖 + 1 consecutive vertical

                       T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                                        16
               P INNING D OWN THE S TRONG W ILBER -1 B OUND FOR B INARY S EARCH T REES

                                                                                                     𝑞 +1
lines in ℒ, where 𝐿1𝑖 appears immediately before the first column of ℬ𝑖 , and 𝐿 𝑖 𝑖                         appears
immediately after the last column of ℬ𝑖 . Notice that ℒ = 𝑁
                                                         Ð
                                                           𝑖=1 ℒ 𝑖 .

Recall that our goal is to show that WB(𝑋) ≤ 4WB(𝑋)  ˜ + 8 Í 𝑘 WB(𝑋𝑖 ) + 𝑂(|𝑋 |). In order to do so,
                                                             𝑖=1
we fix a permutation 𝜎 of ℒ that maximizes WB𝜎 (𝑋), so that WB(𝑋) = WB𝜎 (𝑋). We then gradually
                                                          ˜ ≥ WB𝜎 (𝑋)/4 − 2 Í 𝑘 WB(𝑋𝑖 ) − 𝑂(|𝑋 |).
transform it into a permutation 𝜎˜ of ℒ̃, such that WB𝜎˜ (𝑋)                  𝑖=1
This will prove that WB(𝑋) ≤ 4WB(𝑋)    ˜ + 8 Í 𝑘 WB(𝑋𝑖 ) + 𝑂(|𝑋 |).
                                                𝑖=1
In order to perform this transformation, we will process every block ℬ𝑖 one-by-one. When
block ℬ𝑖 is processed, we will “consolidate” all lines of ℒ 𝑖 , so that they will appear almost
consecutively in the permutation 𝜎, and we will show that this process does not increase
the Wilber bound by too much. The final permutation that we obtain after processing every
block ℬ𝑖 can then be naturally transformed into a permutation 𝜎˜ of ℒ̃, whose Wilber bound
cost is similar. The main challenge is to analyze the increase in the Wilber bound in every
iteration. In order to facilitate the analysis, we will work with the Wilber bound with respect
to forbidden points. Specifically, we will define a set 𝐹 ⊆ 𝑋 of forbidden points, such that
                           Í𝑘
WB𝐹𝜎 (𝑋) ≥ WB𝜎 (𝑋)/4 − 𝑖=1      WB(𝑋𝑖 ). For every block ℬ𝑖 , we will also define a bit 𝑏 𝑖 ∈ {0, 1},
that will eventually guide the way in which the lines of ℒ 𝑖 are consolidated. As the algorithm
progresses, we will modify the set 𝐹 of forbidden points by discarding some points from it,
and we will show that the increase in the Wilber bound with respect to the new set 𝐹 is small
relatively to the original Wilber bound with respect to the old set 𝐹. We start by defining the set
𝐹 of forbidden points, and the bits 𝑏 𝑖 for the blocks ℬ𝑖 . We then show how to use these bits in
order to transform permutation 𝜎 of ℒ into a new permutation 𝜎0 of ℒ, which will in turn be
transformed into a permutation 𝜎˜ of ℒ̃.
From now on we assume that the permutation 𝜎 of the lines in ℒ is fixed.


3.3.3   Defining the set of forbidden points

Consider any block ℬ𝑖 , for 1 ≤ 𝑖 ≤ 𝑘. We denote by 𝐿∗𝑖 ∈ ℒ 𝑖 the vertical line that appears first in
the permutation 𝜎 among all lines of ℒ 𝑖 , and we denote by 𝐿∗∗𝑖
                                                                 ∈ ℒ 𝑖 the line that appears last in
𝜎 among all lines of ℒ 𝑖 .
We perform 𝑘 iteration. In iteration 𝑖, for 1 ≤ 𝑖 ≤ 𝑘, we consider the block ℬ𝑖 . We let 𝑏 𝑖 ∈ {0, 1}
be a bit chosen uniformly at random, independently from all other random bits. If 𝑏 𝑖 = 0, then
all points of 𝑋𝑖 that lie to the left of 𝐿∗𝑖 are added to the set 𝐹 of forbidden points; otherwise, all
points of 𝑋𝑖 that lie to the right of 𝐿∗𝑖 are added to the set 𝐹 of forbidden points. We show that
the expected number of the remaining crossings is large.
                                                                                                     Í𝑘
Claim 3.7. The expectation, over the choice of the bits 𝑏 𝑖 , of | Cr𝐹𝜎 | is at least | WB(𝑋)|/4 −     𝑖=1 WB(𝑋 𝑖 ).


Proof. Consider any crossing (𝑝, 𝑝 0) ∈ Cr𝜎 . We consider two cases. Assume first that there is
some index 𝑖, such that both 𝑝 and 𝑝 0 belong to 𝑋𝑖 , and they lie on opposite sides of 𝐿∗𝑖 . In
this case, (𝑝, 𝑝 0) becomes a forbidden crossing with probability 1. However, the total number

                        T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                                     17
                  PARINYA C HALERMSOOK , J ULIA C HUZHOY, AND T HATCHAPHOL S ARANURAK

of all such crossings is bounded by WB(𝑋𝑖 ). Indeed, if we denote by ℒ̂ 𝑖 the set of all vertical
lines with half-integral coordinates for instance 𝑋𝑖 , then permutation 𝜎 of ℒ naturally induces
permutation 𝜎𝑖 of ℒ̂ 𝑖 . Moreover, any crossing (𝑝, 𝑝 0) ∈ Cr𝜎 with 𝑝, 𝑝 0 ∈ 𝑋𝑖 must also contribute
to the cost of 𝜎𝑖 in instance 𝑋𝑖 . Since the cost of 𝜎𝑖 is bounded by 𝑊 𝐵(𝑋𝑖 ), the number of
crossings (𝑝, 𝑝 0) ∈ Cr𝜎 with 𝑝, 𝑝 0 ∈ 𝑋𝑖 is bounded by 𝑊 𝐵(𝑋𝑖 ).
Consider now any crossing (𝑝, 𝑝 0) ∈ Cr𝜎 , and assume that there is no index 𝑖, such that
both 𝑝 and 𝑝 0 belong to 𝑋𝑖 , and they lie on opposite sides of 𝐿∗𝑖 . Then with probability
at least 1/4, this crossing remains allowed. Therefore, the expectation of | Cr𝐹𝜎 | is at least
           Í𝑘                           Í𝑘                         Í𝑘
| Cr𝜎 |/4 − 𝑖=1 WB(𝑋𝑖 ) = | WB𝜎 (𝑋)|/4 − 𝑖=1 WB(𝑋𝑖 ) = | WB(𝑋)|/4 − 𝑖=1 WB(𝑋𝑖 ).             

From the above claim, there is a choice of the bits 𝑏1 , . . . , 𝑏 𝑘 , such that, if we define the set 𝐹
                                                                                           Í𝑘
of forbidden points with respect to these bits as before, then | Cr𝐹𝜎 | ≥ WB(𝑋)/4 − 𝑖=1         WB(𝑋𝑖 ).
From now on we assume that the values of the bits 𝑏1 , . . . , 𝑏 𝑘 are fixed, and that the resulting set
                                                           Í𝑘
𝐹 of forbidden points satisfies that | Cr𝐹𝜎 | ≥ WB(𝑋)/4 − 𝑖=1      WB(𝑋𝑖 ).


3.3.4    Transforming 𝜎 into 𝜎0

We now show how to transform the original permutation 𝜎 of ℒ into a new permutation 𝜎0 of
ℒ, which we will later transform into a permutation 𝜎˜ of ℒ̃. We perform 𝑘 iterations. The input
to the 𝑖th iteration is a permutation 𝜎𝑖 of ℒ and a subset 𝐹𝑖 ⊆ 𝐹 of forbidden points. The output
of the iteration is a new permutation 𝜎𝑖+1 of ℒ, and a set 𝐹𝑖+1 ⊆ 𝐹𝑖 of forbidden points. The final
permutation is 𝜎0 = 𝜎 𝑘+1 , and the final set 𝐹 𝑘+1 of forbidden points will be empty. The input to
the first iteration is 𝜎1 = 𝜎 and 𝐹1 = 𝐹. We now fix some 1 ≤ 𝑖 ≤ 𝑘, and show how to execute the
𝑖th iteration. Intuitively, in the 𝑖th iteration, we consolidate the lines of ℒ 𝑖 . Recall that we have
denoted by 𝐿∗𝑖 , 𝐿∗∗
                   𝑖
                      ∈ ℒ 𝑖 the first and the last lines of ℒ 𝑖 , respectively, in the permutation 𝜎. We
only move the lines of ℒ 𝑖 in iteration 𝑖, so this ensures that, in permutation 𝜎𝑖 , the first line of
ℒ 𝑖 that appears in the permutation is 𝐿∗𝑖 , and the last line is 𝐿∗∗  𝑖
                                                                         .
We now describe the 𝑖th iteration. Recall that we are given as input a permutation 𝜎𝑖 of the lines
of ℒ, and a subset 𝐹𝑖 ⊆ 𝐹 of forbidden points. We consider the block ℬ𝑖 and the corresponding
bit 𝑏 𝑖 .
Assume first that 𝑏 𝑖 = 0; recall that in this case, all points of 𝑋 that lie on the columns of ℬ𝑖 to
the left of 𝐿∗𝑖 are forbidden (see Figure 3). We start by switching the locations of 𝐿∗𝑖 and 𝐿1𝑖 in the
permutation 𝜎𝑖 (recall that 𝐿1𝑖 is the leftmost line in ℒ 𝑖 ). Therefore, 𝐿1𝑖 becomes the first line of
ℒ 𝑖 in the resulting permutation. Next, we consider the location of line 𝐿∗∗  𝑖
                                                                                in 𝜎𝑖 , and we place the
         𝑞 +1                  𝑞
lines 𝐿 𝑖 𝑖     , 𝐿2𝑖 , 𝐿3𝑖 , . . . , 𝐿 𝑖 𝑖 in that location, in this order. This defines the new permutation 𝜎𝑖+1 .
Assume now that 𝑏 𝑖 = 1; recall that in this case, all points of 𝑋 that lie on the columns of ℬ𝑖 to
                                                                                                𝑞 +1
the right of 𝐿∗𝑖 are forbidden (see Figure 3). We start by switching the locations of 𝐿∗𝑖 and 𝐿 𝑖 𝑖
                                              𝑞 +1                                                  𝑞 +1
in the permutation 𝜎𝑖 (recall that 𝐿 𝑖 𝑖 is the rightmost line in ℒ 𝑖 ). Therefore, 𝐿 𝑖 𝑖 becomes
the first line of ℒ 𝑖 in the resulting permutation. Next, we consider the location of line 𝐿∗∗𝑖
                                                                                                in

                            T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                                 18
              P INNING D OWN THE S TRONG W ILBER -1 B OUND FOR B INARY S EARCH T REES




                                                         𝑏𝑖 = 0

                                       ℬ𝑖                                            ℬ𝑖




                                                                                           𝐿4𝑖
                                                                                     𝐿3𝑖
                                                                               𝐿2𝑖
                                             𝐿4𝑖                                                 𝐿5𝑖
                           𝐿1𝑖
                                 𝐿2𝑖
                                                   𝐿5𝑖                   𝐿1𝑖
                                       𝐿3𝑖
                      𝜎𝑖                                          𝜎𝑖+1


                                                         𝑏𝑖 = 1

                                       ℬ𝑖                                            ℬ𝑖




                                                                                           𝐿4𝑖
                                                                                     𝐿3𝑖
                                                                               𝐿2𝑖
                                             𝐿4𝑖                     𝐿1𝑖
                           𝐿1𝑖
                                 𝐿2𝑖
                                                   𝐿5𝑖
                                       𝐿3𝑖                                                       𝐿5𝑖
                      𝜎𝑖                                          𝜎𝑖+1



Figure 3: Modification from 𝜎𝑖 to 𝜎𝑖+1 . In the figure, ℒ 𝑖 = {𝐿1𝑖 , . . . , 𝐿5𝑖 }, 𝐿∗𝑖 = 𝐿3𝑖 and 𝐿∗∗
                                                                                                   𝑖
                                                                                                      = 𝐿4𝑖 .
Points with horizontal strips are forbidden.




                       T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                               19
              PARINYA C HALERMSOOK , J ULIA C HUZHOY, AND T HATCHAPHOL S ARANURAK

                                                    𝑞
𝜎𝑖 , and we place the lines 𝐿1𝑖 , 𝐿2𝑖 , 𝐿3𝑖 , . . . , 𝐿 𝑖 𝑖 in that location, in this order. This defines the new
permutation 𝜎𝑖+1 .
Lastly, we discard from 𝐹𝑖 all points that lie on the columns of ℬ𝑖 , obtaining the new set 𝐹𝑖+1 of
forbidden points.
Once every block ℬ𝑖 is processed, we obtain a final permutation 𝜎 𝑘+1 that we denote by 𝜎0, and
the final set 𝐹 𝑘+1 = ∅ of forbidden lines. The following lemma is central to our analysis. It shows
that the Wilber bound does not decrease by much after every iteration. The Wilber bound is
defined with respect to the appropriate sets of forbidden points.

Lemma 3.8. For all 1 ≤ 𝑖 ≤ 𝑘, WB𝐹𝜎𝑖+1
                                  𝑖+1
                                      (𝑋) ≥ WB𝐹𝜎𝑖𝑖 (𝑋) − WB(𝑋𝑖 ) − 𝑂(|𝑋𝑖 |).

Assume first that the lemma is correct. Recall that we have ensured that WB𝐹𝜎11 (𝑋) = WB𝐹𝜎 (𝑋) ≥
           Í𝑘
WB(𝑋)/4 − 𝑖=1   WB(𝑋𝑖 ). Since 𝐹 𝑘+1 = ∅, this will ensure that:

                                   Õ                                          Õ
         WB𝜎0 (𝑋) ≥ WB𝐹𝜎 (𝑋) −           WB(𝑋𝑖 ) − 𝑂(|𝑋 |) ≥ WB(𝑋)/4 − 2            WB(𝑋𝑖 ) − 𝑂(|𝑋 |).
                                     𝑖                                          𝑖

We now focus on the proof of the lemma.

Proof. In order to simplify the notation, we denote 𝜎𝑖 by 𝜎,                                   ˆ
                                                          ˆ 𝜎𝑖+1 by 𝜎ˆ 0. We also denote 𝐹𝑖 by 𝐹,
and 𝐹𝑖+1 by 𝐹ˆ 0.
Consider a line 𝐿 ∈ ℒ. Recall that Cr𝜎ˆ (𝐿) is the set of all crossings that are charged to the line
                                           ˆ
𝐿 in permutation 𝜎.ˆ Recall that Cr𝐹𝜎ˆ (𝐿) ⊆ Cr𝜎ˆ (𝐿) is obtained from the set Cr𝜎ˆ (𝐿) of crossings,
                                                                                 ˆ0
by discarding all crossings (𝑝, 𝑝 0) where 𝑝 ∈ 𝐹ˆ or 𝑝 0 ∈ 𝐹ˆ holds. The set Cr𝐹𝜎ˆ 0 (𝐿) of crossings is
defined similarly.
We start by showing that for every line 𝐿 ∈ ℒ that does not lie in ℒ 𝑖 , the number of crossings
                                                    ˆ0        ˆ
charged to it does not decrease, that is, Cr𝐹𝜎ˆ 0 (𝐿) ≥ Cr𝐹𝜎ˆ (𝐿).
                                               ˆ0         ˆ
Claim 3.9. For every line 𝐿 ∈ ℒ \ ℒ 𝑖 , Cr𝐹𝜎ˆ 0 (𝐿) ≥ Cr𝐹𝜎ˆ (𝐿).

Proof. Consider any line 𝐿 ∈ ℒ \ ℒ 𝑖 . Let 𝑣 ∈ 𝑉(𝑇( 𝜎))ˆ be the vertex of the partitioning tree 𝑇( 𝜎)     ˆ
corresponding to 𝜎 to which 𝐿 belongs, and let 𝑆 = 𝑆(𝑣) be the corresponding strip. Similarly,
                   ˆ
we define 𝑣 0 ∈ 𝑉(𝑇(𝜎ˆ 0)) and 𝑆0 = 𝑆(𝑣 0) with respect to 𝜎ˆ 0. Recall that 𝐿∗𝑖 is the first line of ℒ 𝑖 to
appear in the permutation 𝜎, and 𝐿∗∗  𝑖
                                         is the last such line. We now consider five cases.

    • Case 1. The first case happens if 𝐿 appears before line 𝐿∗𝑖 in the permutation 𝜎.      ˆ Notice
      that the prefixes of the permutations 𝜎ˆ and 𝜎ˆ are identical up to the location in which 𝐿∗𝑖
                                                     0

                  ˆ Therefore, 𝑆 = 𝑆0, and Cr𝜎ˆ (𝐿) = Cr𝜎ˆ 0 (𝐿). Since 𝐹ˆ 0 ⊆ 𝐹,
      appears in 𝜎.                                                            ˆ every crossing that is
                                                              ˆ          ˆ0              ˆ0          ˆ
                                              ˆ So Cr𝐹𝜎ˆ (𝐿) ⊆ Cr𝐹𝜎ˆ 0 (𝐿), and Cr𝐹𝜎ˆ 0 (𝐿) ≥ Cr𝐹𝜎ˆ (𝐿).
      forbidden in 𝜎ˆ 0 was also forbidden in 𝜎.

                        T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                                  20
                P INNING D OWN THE S TRONG W ILBER -1 B OUND FOR B INARY S EARCH T REES

                                                                               𝑏𝑖 = 0

                                           ℬ𝑖                                                               ℬ𝑖

                                                             𝑝1                                                                   𝑝1
                                𝑎                                                                𝑎


                                                                          𝑝2                                                               𝑝2

                                                                                                                  𝐿4𝑖
                                                                                                            𝐿3𝑖
                                                 𝐿4𝑖
                                                                                                     𝐿2𝑖
                          𝐿1𝑖                                                                                           𝐿5𝑖
                                     𝐿2𝑖
                                                       𝐿5𝑖

                                                                                                                              ′
                                                                      𝐿                        𝐿1𝑖                                     𝐿
                                           𝐿∗𝑖
                     𝜎𝑖                                                                 𝜎𝑖+1


                                                                                                                  ˆ                             ˆ0
Figure 4: Illustration of the injective mapping of each crossing in Cr𝐹𝜎ˆ (𝐿) to a crossing in Cr𝐹𝜎ˆ 0 (𝐿).
Points with horizontal strips are forbidden points from 𝐹. ˆ


   • Case 2. The second case happens if 𝐿 appears after 𝐿∗∗   𝑖
                                                                in 𝜎.
                                                                   ˆ Notice that, if we denote by
     ℒ ⊆ ℒ the set of all lines of ℒ that lie before 𝐿 in 𝜎, and define ℒ 00 similarly for 𝜎ˆ 0, then
      0                                                   ˆ
                                                                                                                                           ˆ0   ˆ
      ℒ 0 = ℒ 00. Therefore, 𝑆 = 𝑆0 holds. Using the same reasoning as in Case 1, Cr𝐹𝜎ˆ 0 (𝐿) ≥ Cr𝐹𝜎ˆ (𝐿).

   • Case 3. The third case is when 𝐿 appears between 𝐿∗𝑖 and 𝐿∗∗   𝑖
                                                                      in 𝜎,
                                                                         ˆ but neither boundary of
     the strip 𝑆 belongs to ℒ 𝑖 . If we denote by ℒ 0 ⊆ ℒ \ ℒ 𝑖 the set of all lines of ℒ \ ℒ 𝑖 that lie
     before 𝐿 in 𝜎,
                  ˆ and define ℒ 00 ⊆ ℒ \ ℒ 𝑖 similarly for 𝜎ˆ 0, then ℒ 0 = ℒ 00. Therefore, 𝑆 = 𝑆0
                                                                                                           ˆ0           ˆ
      holds. Using the same reasoning as in Cases 1 and 2, Cr𝐹𝜎ˆ 0 (𝐿) ≥ Cr𝐹𝜎ˆ (𝐿).


      Case 4. The fourth case is when 𝐿 appears between 𝐿∗𝑖 and 𝐿∗∗      𝑖
                                                                           in the permutation 𝜎,
                                                                                               ˆ
      and the left boundary of 𝑆 belongs to ℒ 𝑖 . Notice that the left boundary of 𝑆 must either
      coincide with 𝐿∗𝑖 , or appear to the right of it.
      Assume first that 𝑏 𝑖 = 0, so we have replaced 𝐿∗𝑖 with the line 𝐿1𝑖 , that lies to the left of 𝐿∗𝑖 .
      Since no other lines of ℒ 𝑖 appear in 𝜎ˆ 0 until the original location of line 𝐿∗∗𝑖
                                                                                          , it is easy to
      verify that the right boundary of 𝑆0 is the same as the right boundary of 𝑆, and its left
      boundary is the line 𝐿1𝑖 , that is, we have pushed the left boundary to the left. In order to
                                ˆ0               ˆ                                                                            ˆ
      prove that | Cr𝐹𝜎ˆ 0 (𝐿)| ≥ | Cr𝐹𝜎ˆ (𝐿)|, we map every crossing (𝑝 1 , 𝑝2 ) ∈ Cr𝐹𝜎ˆ (𝐿) to some crossing
                      ˆ0                                                                  ˆ
      (𝑝 10 , 𝑝20 ) ∈ Cr𝐹𝜎ˆ 0 (𝐿), so that no two crossings of Cr𝐹𝜎ˆ (𝐿) are mapped to the same crossing of
          ˆ0
      Cr𝐹𝜎ˆ 0 (𝐿).
                                                                  ˆ
      Consider any crossing (𝑝 1 , 𝑝2 ) ∈ Cr𝐹𝜎ˆ (𝐿) (see Figure 4). We know that 𝑝 1 , 𝑝2 ∈ 𝑆, and they
      lie on opposite sides of 𝐿. We assume w.l.o.g. that 𝑝 1 lies to the left of 𝐿. Moreover, no

                                T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                                                                 21
                 PARINYA C HALERMSOOK , J ULIA C HUZHOY, AND T HATCHAPHOL S ARANURAK

       point of 𝑋 ∩ 𝑆 lies between the row of 𝑝 1 and the row of 𝑝 2 . It is however possible that
       (𝑝 1 , 𝑝2 ) is not a crossing of Cr𝜎ˆ 0 (𝐿), since by moving the left boundary of 𝑆 to the left, we
       add more points to the strip, some of which may lie between the row of 𝑝 1 and the row
       of 𝑝 2 . Let 𝐴 be the set of all points that lie between the row of 𝑝 1 and the row of 𝑝2 in 𝑆0.
       Notice that the points of 𝐴 are not forbidden in 𝐹ˆ 0. Let 𝑎 ∈ 𝐴 be the point of 𝑎 whose row
       is closest to the row of 𝑝 2 ; if 𝐴 = ∅, then we set 𝑎 = 𝑝1 . Then (𝑎, 𝑝2 ) defines a crossing in
                                                                       ˆ0
       Cr𝜎ˆ 0 (𝐿), and, since neither point lies in 𝐹ˆ 0, (𝑎, 𝑝2 ) ∈ Cr𝐹0 (𝐿). In this way, we map every
                                                                         𝜎ˆ
                                      𝐹ˆ                                    ˆ0
       crossing (𝑝 1 , 𝑝2 ) ∈ Cr𝜎ˆ (𝐿) to some crossing (𝑝1 , 𝑝2 ) ∈ Cr𝐹𝜎ˆ 0 (𝐿). It is easy to verify that no
                                                              0  0
                                    ˆ                                                ˆ0
       two crossings of Cr𝐹𝜎ˆ (𝐿) are mapped to the same crossing of Cr𝐹𝜎ˆ 0 (𝐿). We conclude that
              ˆ0              ˆ
       | Cr𝐹𝜎ˆ 0 (𝐿)| ≥ | Cr𝐹𝜎ˆ (𝐿)|.
       Lastly, assume that 𝑏 𝑖 = 1. Recall that the set of all points of 𝑋 lying between 𝐿∗𝑖 and
         𝑞 +1                                                                                         𝑞 +1
       𝐿 𝑖 𝑖 is forbidden in 𝐹ˆ but not in 𝐹ˆ 0, and that we have replaced 𝐿∗𝑖 with the line 𝐿 𝑖 𝑖 , that
       lies to the right of 𝐿∗𝑖 . Therefore, the right boundary of 𝑆 remains the same, and the left
                                                                                 ˆ0            ˆ
       boundary is pushed to the right. In order to prove that | Cr𝐹𝜎ˆ 0 (𝐿)| ≥ | Cr𝐹𝜎ˆ (𝐿)|, we show
                                               ˆ                    ˆ0
       that every crossing (𝑝 1 , 𝑝2 ) ∈ Cr𝐹𝜎ˆ (𝐿) belongs to Cr𝐹𝜎ˆ 0 (𝐿). Indeed, consider any crossing
                          ˆ
       (𝑝 1 , 𝑝2 ) ∈ Cr𝐹𝜎ˆ (𝐿). We know that 𝑝 1 , 𝑝2 ∈ 𝑆, and they lie on opposite sides of 𝐿. We assume
       w.l.o.g. that 𝑝 1 lies to the left of 𝐿. Since 𝑝 1 cannot be a forbidden point, it must lie to the
                  𝑞 +1
       right of 𝐿 𝑖 𝑖 . Moreover, no point of 𝑋 ∩ 𝑆 lies between the row of 𝑝1 and the row of 𝑝 2 . It
                                                                                    ˆ0
       is now easy to verify that (𝑝 1 , 𝑝2 ) is also a crossing in Cr𝐹𝜎ˆ 0 (𝐿).


    • Case 5. The fifth case happens when 𝐿 appears between 𝐿∗𝑖 and 𝐿∗∗   𝑖
                                                                             in the permutation 𝜎,
                                                                                                 ˆ
      and the right boundary of 𝑆 belongs to ℒ 𝑖 . This case is symmetric to the fourth case and is
      analyzed similarly.

                                                                                                                        

It now remains to analyze the crossings of the lines in ℒ 𝑖 . We do so in the following two
                                                               𝑞 +1
claims. The first claim shows that switching 𝐿∗𝑖 with 𝐿1𝑖 or 𝐿 𝑖 𝑖 does not decrease the number of
crossings.
                                       ˆ0                ˆ                               ˆ0   𝑞 +1           ˆ
Claim 3.10. If 𝑏 = 0, then | Cr𝐹𝜎ˆ 0 (𝐿1𝑖 )| ≥ | Cr𝐹𝜎ˆ (𝐿∗𝑖 )|; if 𝑏 = 1, then | Cr𝐹𝜎ˆ 0 (𝐿 𝑖 𝑖 )| ≥ | Cr𝐹𝜎ˆ (𝐿∗𝑖 )|.


Proof. Assume first that 𝑏 = 0, so we have replaced 𝐿∗𝑖 with 𝐿1𝑖 in the permutation. As before,
we let 𝑣 ∈ 𝑉(𝑇( 𝜎))ˆ be the vertex to which 𝐿∗𝑖 belongs, and we let 𝑆 = 𝑆(𝑣) be the corresponding
strip. Similarly, we define 𝑣 0 ∈ 𝑉(𝑇( 𝜎ˆ 0)) and 𝑆0 = 𝑆(𝑣 0) with respect to line 𝐿1𝑖 and permutation
𝜎ˆ 0. Notice that, until the appearance of 𝐿∗𝑖 in 𝜎,
                                                   ˆ the two permutations are identical. Therefore,
𝑆 = 𝑆 must hold. Recall also that all points of 𝑋 that lie between 𝐿∗𝑖 and 𝐿1𝑖 are forbidden in
       0



                            T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                                        22
                     P INNING D OWN THE S TRONG W ILBER -1 B OUND FOR B INARY S EARCH T REES

ˆ but not in 𝐹ˆ 0. In order to show that | Cr𝐹ˆ 00 (𝐿1 )| ≥ | Cr𝐹ˆ (𝐿∗ )|, it is enough to show that every
𝐹,                                           𝜎ˆ      𝑖          𝜎ˆ 𝑖
                                    ˆ                      ˆ0
crossing (𝑝 1 , 𝑝2 ) ∈ Cr𝐹𝜎ˆ (𝐿∗𝑖 ) also lies in Cr𝐹𝜎ˆ 0 (𝐿1𝑖 ).
                                                                ˆ
Consider now some crossing (𝑝1 , 𝑝2 ) ∈ Cr𝐹𝜎ˆ (𝐿∗𝑖 ). Recall that one of 𝑝1 , 𝑝2 must lie to the left of 𝐿∗𝑖
and the other to the right of it, with both points lying in 𝑆. Assume w.l.o.g. that 𝑝 1 lies to the left
                    ˆ it must lie to the left of 𝐿1 . Moreover, no point of 𝑋 ∩ 𝑆 may lie between the
of 𝐿∗𝑖 . Since 𝑝1 ∉ 𝐹,                            𝑖
                                                                                                                              ˆ0
row of 𝑝 1 and the row of 𝑝 2 . It is then easy to verify that (𝑝1 , 𝑝2 ) is also a crossing in Cr𝐹𝜎ˆ 0 (𝐿1𝑖 ),
                ˆ0                      ˆ
and so | Cr𝐹𝜎ˆ 0 (𝐿1𝑖 )| ≥ | Cr𝐹𝜎ˆ (𝐿∗𝑖 )|.
The second case, when 𝑏 = 1, is symmetric.                                                                                         

                                                                                                          ˆ
Lastly, we show that for all lines 𝐿 ∈ ℒ 𝑖 \ 𝐿∗𝑖 , their total contribution to Cr𝐹𝜎ˆ is small.
                                                            

                                              𝐹ˆ
                         𝐿∈ℒ 𝑖 \ { 𝐿∗𝑖 } | Cr 𝜎ˆ (𝐿)| ≤ WB(𝑋𝑖 ) + 𝑂(|𝑋𝑖 |).
                     Í
Claim 3.11.

Assume first that  the claimˆ is correct. We have shown so far that the total contribution of
all lines in ℒ 𝑖 \ 𝐿∗𝑖 to Cr𝐹𝜎ˆ is at most WB(𝑋𝑖 ) + 𝑂(|𝑋𝑖 |); that the contribution of one of the
               𝑞 +1            ˆ0                                                                 ˆ
lines 𝐿1𝑖 , 𝐿 𝑖 𝑖        to Cr𝐹𝜎ˆ 0 is at least as large as the contribution of 𝐿∗𝑖 to Cr𝐹𝜎ˆ ; and that for every line
                                            ˆ0                                                        ˆ
𝐿 ∉ ℒ 𝑖 , its contribution to Cr𝐹𝜎ˆ 0 is at least as large as its contribution to Cr𝐹𝜎ˆ . It then follows that
| Cr𝐹𝜎ˆ 0 | ≥ | Cr𝐹𝜎ˆ | − WB(𝑋𝑖 ) + 𝑂(|𝑋𝑖 |), and so WB𝐹𝜎𝑖+1 (𝑋) ≥ WB𝐹𝜎𝑖𝑖 (𝑋) − WB(𝑋𝑖 ) + 𝑂(|𝑋𝑖 |). Therefore,
     ˆ0              ˆ
                                                         𝑖+1
in order to prove Lemma 3.8, it is now enough to prove Claim 3.11.

Proof. (Of Claim 3.11) Consider some line 𝐿 ∈ ℒ 𝑖 \ 𝐿∗𝑖 , and let 𝑣 ∈ 𝑉(𝑇( 𝜎))
                                                                              
                                                                                  ˆ be the vertex to
which 𝐿 belongs. Notice that 𝐿 appears in 𝜎ˆ after 𝐿∗𝑖 . Therefore, if 𝑆 = 𝑆(𝑣) is the strip that 𝐿
partitioned, then at least one of the boundaries of 𝑆 lies in ℒ 𝑖 . If exactly one boundary of 𝑆
lies in ℒ 𝑖 , then we say that 𝑆 is an external strip; otherwise, we say that 𝑆 is an internal strip.
Consider now some crossing (𝑝, 𝑝 0) ∈ Cr𝜎ˆ (𝑆). Since 𝐿 ∈ ℒ 𝑖 , and at least one boundary of 𝑆 lies
in ℒ 𝑖 , at least one of the points 𝑝, 𝑝 0 must belong to 𝑋𝑖 . If exactly one of 𝑝, 𝑝 0 lies in 𝑋𝑖 , then
we say that (𝑝, 𝑝 0) is a type-1 crossing; otherwise it is a type-2 crossing. Notice that, if 𝑆 is an
internal strip, then only type-2 crossings of 𝐿 are possible. We now bound the total number of
type-1 and type-2 crossings separately, in the following two observations.
                                                                                  Ð
Observation 3.12. The total number of type-2 crossings in                             𝐿∈ℒ 𝑖 \ { 𝐿∗𝑖 } Cr 𝜎ˆ (𝐿) is at most WB(𝑋𝑖 ).


Proof. Permutation 𝜎ˆ of the lines in ℒ naturally induces a permutation 𝜎ˆ 𝑖 of the lines in ℒ 𝑖 . The
number of type-2 crossings charged to all lines in ℒ 𝑖 is then at most WB𝜎ˆ 𝑖 (𝑋𝑖 ) ≤ WB(𝑋𝑖 ).       

                                                                                      𝐿∈ℒ 𝑖 \ { 𝐿∗𝑖 } Cr 𝜎ˆ (𝐿) ≤ 𝑂(|𝑋𝑖 |).
                                                                                  Ð
Observation 3.13. The total number of type-1 crossings in


Proof. Consider a line 𝐿 ∈ ℒ 𝑖 \ 𝐿∗𝑖 , and let 𝑆 be the strip that it splits. Recall that, if there are
                                                 
any type-1 crossings in Cr𝜎ˆ (𝐿), then 𝑆 must be an external strip. Line 𝐿 partitions 𝑆 into two new

                                 T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                                              23
              PARINYA C HALERMSOOK , J ULIA C HUZHOY, AND T HATCHAPHOL S ARANURAK

strips, that we denote by 𝑆0 and 𝑆00. Notice that exactly one of these strips (say 𝑆0) is an internal
strip, and the other strip is external. Therefore, the points of 𝑋𝑖 ∩ 𝑆0 will never participate in
type-1 crossings again. Recall that, from Observation 2.8, the total number of crossings in Cr𝜎ˆ (𝐿)
is bounded by 2|𝑆0 ∩ 𝑋𝑖 |. We say that the points of 𝑆0 ∩ 𝑋𝑖 pay for these crossings. Since every
point of 𝑋𝑖 will pay for a type-1 crossing at most once, we conclude that the total number of
                    Ð
type-1 crossings in 𝐿∈ℒ 𝑖 \{ 𝐿∗ } Cr𝜎ˆ (𝐿) is bounded by 2|𝑋𝑖 |.                                   
                                 𝑖



                                                             Ð
We conclude that the total number of all crossings in            𝐿∈ℒ 𝑖 \ { 𝐿∗𝑖 } Cr𝜎ˆ (𝐿) is at most WB(𝑋𝑖 )+𝑂(|𝑋 𝑖 |).
                             ˆ                                                        𝐹ˆ
Since, for every line 𝐿, Cr𝐹𝜎ˆ (𝐿) ⊆ Cr𝜎ˆ (𝐿), we get that       𝐿∈ℒ 𝑖 \ { 𝐿∗ } | Cr 𝜎ˆ (𝐿)| ≤ WB(𝑋𝑖 ) + 𝑂(|𝑋𝑖 |). 
                                                             Í
                                                                         𝑖




                                                                                                                    


To summarize, we have transformed a permutation 𝜎 of ℒ into a permutation 𝜎0 of ℒ, and we
                                      Í𝑘
have shown that WB𝜎0 (𝑋) ≥ WB(𝑋)/4 − 2 𝑖=1 WB(𝑋𝑖 ) − 𝑂(|𝑋 |).



3.3.5   Transforming 𝜎0 into 𝜎˜

In this final step, we transform the permutation 𝜎0 of ℒ into a permutation 𝜎˜ of ℒ̃, and we will
show that WB𝜎˜ (𝑋) ˜ ≥ WB𝜎0 (𝑋) − |𝑋 |.

The transformation is straightforward. Consider some block ℬ𝑖 , and the corresponding set
                                                                           𝑞 +1
ℒ 𝑖 ⊆ ℒ of lines. Recall that the lines in ℒ 𝑖 are indexed 𝐿1𝑖 , . . . , 𝐿 𝑖 𝑖 in this left-to-right order,
                                                                          𝑞 +1
where 𝐿1𝑖 appears to the left of the first column of ℬ𝑖 , and 𝐿 𝑖 𝑖 appears to the right of the last
column of ℬ𝑖 . Recall also that, in the current permutation 𝜎0, one of the following happens:
                                                                  𝑞 +1                  𝑞
either (i) line 𝐿1𝑖 appears in the permutation first, and lines 𝐿 𝑖 𝑖 , 𝐿2𝑖 , . . . , 𝐿 𝑖 appear at some later
                                                      𝑞 +1
point consecutively in this order; or (ii) line 𝐿 𝑖 𝑖 appears in the permutation first, and lines
                      𝑞
𝐿1𝑖 , 𝐿2𝑖 , . . . , 𝐿 𝑖 appear somewhere later in the permutation consecutively in this order. Therefore,
                         𝑗                                                       𝑗−1                              𝑞 +1
for all 2 ≤ 𝑗 ≤ 𝑞, line 𝐿 𝑖 separates a strip whose left boundary is 𝐿 𝑖               and right boundary is 𝐿 𝑖 𝑖 .
                                                     𝑗
It is easy to see that the cost of each such line 𝐿 𝑖 in permutation 𝜎0 is bounded by the number of
                                                                                             𝑗−1         𝑗
points of 𝑋 that lie on the unique active column that appears between 𝐿 𝑖                          and 𝐿 𝑖 . The total
cost of all such lines is then bounded by |𝑋𝑖 |.
                                                                                                                     𝑞
Let 𝜎˜ ∗ be a sequence of lines obtained from 𝜎0 by deleting, for all 1 ≤ 𝑖 ≤ 𝑘, all lines 𝐿2𝑖 , . . . , 𝐿 𝑖
from it. Then 𝜎˜ ∗ naturally defines a permutation 𝜎˜ of the set ℒ̃ of vertical lines. Moreover,
from the above discussion, the total contribution of all deleted lines to WB𝜎0 (𝑋) is at most |𝑋 |,
so WB𝜎˜ (𝑋)˜ ≥ WB𝜎0 (𝑋) − |𝑋 | ≥ WB(𝑋)/4 − 2 Í𝑖 WB(𝑋𝑖 ) − 𝑂(|𝑋 |). We conclude that WB(𝑋)            ˜ ≥
        ˜                                                            ˜
WB𝜎˜ (𝑋) ≥ WB(𝑋)/4 − 2 𝑖 WB(𝑋𝑖 ) − 𝑂(|𝑋 |), and WB(𝑋) ≤ 4WB(𝑋) + 8 𝑖 WB(𝑋𝑖 ) + 𝑂(|𝑋 |).
                          Í                                                 Í


                        T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                                       24
               P INNING D OWN THE S TRONG W ILBER -1 B OUND FOR B INARY S EARCH T REES

4     Separation results for the strong Wilber bound

In this section we present our negative results, proving Theorem 1.1, and extend it to obtain a
separation result between the first and second Wilber bounds.


4.1     Basic tools

Our construction combines known input sequences and their properties, some of which have
been proved in the standard tree view of binary search trees. We discuss these facts in the
geometric context.


4.1.1    Monotonically increasing sequence

We say that an input set 𝑋 of points is monotonically increasing iff 𝑋 is a permutation, and
moreover for every pair 𝑝, 𝑝 0 ∈ 𝑋 of points, if 𝑝.𝑥 < 𝑝 0 .𝑥, then 𝑝.𝑦 < 𝑝 0 .𝑦 must hold. It is well
known that the value of the optimal solution of monotonically increasing sequences is low, and
we exploit this fact in our negative results.
Observation 4.1. If 𝑋 is a monotonically increasing set of points, then OPT(𝑋) ≤ |𝑋 | − 1.

Proof. We order points in 𝑋 based on their 𝑥-coordinates as 𝑋 = {𝑝 1 , . . . , 𝑝 𝑚 } such that
𝑝 1 .𝑥 < 𝑝 2 .𝑥 < . . . < 𝑝 𝑚 .𝑥. For each 𝑖 = 1, . . . , 𝑚 − 1 we define 𝑞 𝑖 = ((𝑝 𝑖 ).𝑥, (𝑝 𝑖+1 ).𝑦) and the set
𝑌 = {𝑞1 , . . . , 𝑞 𝑚−1 }. It is easy to verify that 𝑌 is a feasible solution for 𝑋.                            


4.1.2    Bit reversal sequence (BRS)

The bit-reversal sequence, first described by Wilber [29], is a family of explicit input sequences
whose optimal value is asymptotically largest possible, that is, OPT(𝑋) = Ω(|𝑋 | log |𝑋 |). The
original sequence was described in the language of binary representation of strings. Here we
use the geometric variant of BRS, which is more convenient for our analysis.
Let 𝑖 ≥ 0 be an integer and ℛ ⊆ ℕ and 𝒞 ⊆ ℕ be subsets of active rows and columns such that
|ℛ| = |𝒞| = 2𝑖 . The level-𝑖 bit-reversal instance BRS(𝑖, ℛ, 𝒞) contains 2𝑖 points whose sets of
active rows and columns are exactly ℛ and 𝒞 respectively. The instances are defined inductively.
The level-0 instance BRS(0, {𝐶}, {𝑅}), containing a single point at the intersection of row 𝑅
and column 𝐶. Assume now that we have defined, for all 1 ≤ 𝑖 0 ≤ 𝑖, and any sets ℛ 0 , 𝒞 0 of 2𝑖
                                                                                                    0


integers, the corresponding instance BRS(𝑖 0 , ℛ 0 , 𝒞 0). We define instance BRS(𝑖 + 1, ℛ, 𝒞), where
|ℛ| = |𝒞| = 2𝑖+1 , as follows.
Consider the columns in 𝒞 in their natural left-to-right order, and define 𝒞𝑙𝑒 𝑓 𝑡 to be the
first 2𝑖 columns and 𝒞𝑟𝑖𝑔 ℎ𝑡 = 𝒞 \ 𝒞𝑙𝑒 𝑓 𝑡 . Denote ℛ = {𝑅 1 , . . . , 𝑅 2𝑖+1 }, where the rows are
indexed in their natural bottom to top order, and let ℛ 𝑒𝑣𝑒𝑛 = {𝑅2 , 𝑅4 , . . . , 𝑅 2𝑖+1 } and ℛ 𝑜𝑑𝑑 =

                        T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                                   25
              PARINYA C HALERMSOOK , J ULIA C HUZHOY, AND T HATCHAPHOL S ARANURAK

{𝑅1 , 𝑅3 , . . . , 𝑅 2𝑖+1 −1 } be the sets of all even-indexed and all odd-indexed rows, respectively.
Notice that |𝒞𝑙𝑒 𝑓 𝑡 | = |𝒞𝑟𝑖𝑔 ℎ𝑡 | = |ℛ 𝑒𝑣𝑒𝑛 | = |ℛ 𝑜𝑑𝑑 | = 2𝑖 . The instance BRS(𝑖 + 1, ℛ, 𝒞) is defined to
be BRS(𝑖, ℛ 𝑜𝑑𝑑 , 𝒞𝑙𝑒 𝑓 𝑡 ) ∪ BRS(𝑖, ℛ 𝑒𝑣𝑒𝑛 , 𝒞𝑟𝑖𝑔 ℎ𝑡 ). See Figure 5 for an illustration.
It is well-known [29] that, if 𝑋 is a bit-reversal sequence on 𝑛 points, then OPT(𝑋) ≥ Ω(𝑛 log 𝑛).

Claim 4.2. Let 𝑋 = BRS(𝑖, 𝒞, ℛ), for any 𝑖 ≥ 0 and any sets 𝒞 and ℛ of columns and rows, respectively,
with |ℛ| = |𝒞| = 2𝑖 . Then |𝑋 | = 2𝑖 , and OPT(𝑋) ≥ WB2(𝑋) ≥ Ω(|𝑋 | log |𝑋 |).

Next, we present two additional technical tools that we use in our construction.


4.1.3   Exponentially spaced columns

Recall that we defined the bit reversal instance BRS(ℓ , ℛ, 𝒞), where ℛ and 𝒞 are sets of 2ℓ
rows and columns, respectively, that serve in the resulting instance as the sets of active rows
and columns; the instance contains 𝑛 = 2ℓ points. In the Exponentially-Spaced BRS instance
ES-BRS(ℓ , ℛ), we are still given a set ℛ of 2ℓ rows that will serve as active rows in the resulting
instance, but we define the set 𝒞 of columns in a specific way. For an integer 𝑖, let 𝐶 𝑖 be the
column whose 𝑥-coordinate is 𝑖 and 𝒞 contain, for each 0 ≤ 𝑖 < 2ℓ , the column 𝐶2𝑖 . Denoting
            ℓ
𝑁 = 2𝑛 = 22 , the 𝑥-coordinates of the columns in 𝒞 are {1, 2, 4, 8, . . . , 𝑁/2}. The instance is
then defined to be BRS(ℓ , ℛ, 𝒞) for this specific set 𝒞 of columns. Notice that the instance
contains 𝑛 = log 𝑁 = 2ℓ input points.
It is easy to see that any point set 𝑋 = ES-BRS(ℓ , ℛ) satisfies OPT(𝑋) = Ω(𝑛 log 𝑛). We remark
that this idea of exponentially spaced columns is inspired by the instance used by Iacono [18]
to prove a gap between the weak WB-1 bound and OPT(𝑋). However, Iacono’s instance is
tailored to specific partitioning tree 𝑇, and it is clear that there is another partitioning tree 𝑇 0
with OPT(𝑋) = Θ(𝑊 𝐵𝑇 0 (𝑋)). Therefore, this instance does not give a separation result for the
strong WB-1 bound, and in fact it does not provide negative results for the weak WB-1 bound
when the input point set is a permutation.


4.1.4   Cyclic shift of columns

Suppose we are given a point set 𝑋, and let 𝒞 = {𝐶0 , . . . , 𝐶 𝑁−1 } be any set of columns indexed in
their natural left-to-right order, such that all points of 𝑋 lie on columns of 𝒞 (but some columns
may contain no points of 𝑋). Let 0 ≤ 𝑠 < 𝑁 be any integer. We denote by 𝑋 𝑠 a cyclic shift of 𝑋
by 𝑠 units with respect to 𝒞, obtained as follows. For every point 𝑝 ∈ 𝑋 on column 𝐶 𝑗 , we add a
new point 𝑝 𝑠 to 𝑋 𝑠 , that lies on the same row as 𝑝 and on column 𝐶(𝑗+𝑠) mod 𝑁 . In other words,
we shift the point 𝑝 by 𝑠 steps to the right (with respect to 𝒞) in a circular manner. Equivalently,
we move the last 𝑠 columns of 𝒞 to the beginning of the instance. The following claim shows
that the value of the optimal solution does not decrease significantly in the shifted instance.


                       T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                               26
             P INNING D OWN THE S TRONG W ILBER -1 B OUND FOR B INARY S EARCH T REES

Claim 4.3. Let 𝑋 be any point set that is a semi-permutation. Let 0 ≤ 𝑠 < 𝑁 be a shift value, and let
𝑋 0 = 𝑋 𝑠 be the instance obtained from 𝑋 by a cyclic shift of its points by 𝑠 units to the right. Then
OPT(𝑋 0) ≥ OPT(𝑋) − |𝑋 |.


Proof. Let 𝑌 0 be an optimal canonical solution to instance 𝑋 0. We partition 𝑌 0 into two subsets:
set 𝑌10 consists of all points lying on the first 𝑠 columns with integral coordinates, and set 𝑌20
consists of all points lying on the remaining columns. We also partition the points of 𝑋 0 into two
subsets 𝑋10 and 𝑋20 similarly. Notice that 𝑋10 ∪ 𝑌10 must be a satisfied set of points, and similarly,
𝑋20 ∪ 𝑌20 is a satisfied set of points. Our goal is to use these sets to construct a feasible solution
for 𝑋 of size |𝑋 | + |𝑌10 | + |𝑌20 | = |𝑋 | + OPT(𝑋 0).
Next, we partition the set 𝑋 of points into two subsets: set 𝑋1 contains all points lying on the
last 𝑠 columns with integral coordinates, and set 𝑋2 contains all points lying on the remaining
columns. Since 𝑋1 and 𝑋2 are simply horizontal shifts of the sets 𝑋10 and 𝑋20 of points, we can
define a set 𝑌1 of |𝑌10 | points such that 𝑌1 is a canonical feasible solution for 𝑋1 , and we can define
a set 𝑌2 for 𝑋2 similarly. Let 𝐶 be a column with a half-integral 𝑥-coordinate that separates 𝑋1
from 𝑋2 (that is, all points of 𝑋1 lie to the right of 𝐶 while all points of 𝑋2 lie to its left.) We
construct a new set 𝑍 of points, of cardinality |𝑋 |, such that 𝑌1 ∪ 𝑌2 ∪ 𝑍 is a feasible solution to
instance 𝑋. In order to construct the point set 𝑍, for each point 𝑝 ∈ 𝑋, we add a point 𝑝 0 with
the same 𝑦-coordinate, that lies on column 𝐶, to 𝑍. Notice that |𝑍| = |𝑋 |.
We claim that 𝑍 ∪ (𝑌1 ∪ 𝑌2 ) is a feasible solution for 𝑋, and this will complete the proof. Consider
any two points 𝑝, 𝑞 ∈ 𝑍 ∪ (𝑌1 ∪ 𝑌2 ) ∪ 𝑋 that are not aligned. Let 𝐵1 and 𝐵2 be the strips obtained
from the bounding box 𝐵 by partitioning it with column 𝐶, so that 𝑋1 ⊆ 𝐵1 and 𝑋2 ⊆ 𝐵2 . If
both 𝑝 and 𝑞 lie in the interior of the same strip, say 𝐵1 , we are done since set 𝑋1 ∪ 𝑌1 of points
is satisfied. So, assume that one of the points (say 𝑝) lies in the interior of one of the strips
(say 𝐵1 ), while the other point either lies on 𝐶, or in the interior of 𝐵2 . Then 𝑝 ∈ 𝑋1 ∪ 𝑌1 must
hold. Moreover, since 𝑌1 is a canonical solution for 𝑋1 , point 𝑝 lies on a row that is active for 𝑋1 .
Therefore, some point 𝑝 0 ∈ 𝑋1 lies on the same row (where possibly 𝑝 0 = 𝑝). But then a copy of
𝑝 0 that was added to the set 𝑍 and lies on the column 𝐶 satisfies the pair (𝑝, 𝑞).                 


4.1.5   Partial costs of WB-1 bound

We use simple facts about the Wilber bound. The following is a property of any balanced binary
search trees.

Lemma 4.4. For any semi-permutation 𝑋, WB(𝑋) ≤ 2OPT(𝑋) ≤ 𝑂(𝑟(𝑋) log 𝑐(𝑋)).

Our analysis also uses partial costs of the WB-1 bound restricted to a subtree and a path.

Claim 4.5. Consider a set 𝑋 of points that is a semi-permutation, an ordering 𝜎 of the auxiliary columns
in ℒ and the corresponding partitioning tree 𝑇 = 𝑇(𝜎). Let 𝑣 ∈ 𝑉(𝑇) be any vertex of the tree. Then the
following hold:

                      T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                            27
                PARINYA C HALERMSOOK , J ULIA C HUZHOY, AND T HATCHAPHOL S ARANURAK

      • Let 𝑇𝑣 be the subtree of 𝑇 rooted at 𝑣. Then
                             Õ
                                       cost(𝑢) = WB𝑇𝑣 (𝑋 ∩ 𝑆(𝑣)) ≤ 𝑂(𝑁(𝑣) log(𝑁(𝑣)))
                            𝑢∈𝑉(𝑇𝑣 )


      • Let 𝑢 be any descendant vertex of 𝑣, and let 𝑃 be the unique path in 𝑇 connecting 𝑢 to 𝑣. Then
        Í
          𝑧∈𝑉(𝑃) cost(𝑧) ≤ 2𝑁(𝑣).


Proof. The first assertion follows from the definition of the weak WB-1 bound and Lemma 4.4.
We now prove the second assertion. Denote 𝑃 = (𝑣 = 𝑣 1 , 𝑣2 , . . . , 𝑣 𝑘 = 𝑢). For all 1 < 𝑖 ≤ 𝑘, we
let 𝑣 0𝑖 be the unique sibling of the vertex 𝑣 𝑖 in the tree 𝑇. We also let 𝑋𝑖 be the set of points of 𝑋
that lie in the strip 𝑆(𝑣 0𝑖 ), and we let 𝑋 0 be the set of all points of 𝑋 that lie in the strip 𝑆(𝑣 𝑘 ). It is
                                                                                                  𝑞
easy to verify that 𝑋2 , . . . , 𝑋 𝑘 , 𝑋 0 are all mutually disjoint (since the strips {𝑆(𝑣 0𝑖 )} 𝑖=2 and 𝑆(𝑣 𝑘 )
                                                                             Í𝑞
are disjoint), and that they are contained in 𝑋 ∩ 𝑆(𝑣). Therefore, 𝑖=2 |𝑋𝑖 | + |𝑋 0 | ≤ 𝑁(𝑣).
From Observation 2.8, for all 1 ≤ 𝑖 < 𝑘, cost(𝑣 𝑖 ) ≤ 2𝑁(𝑣 0𝑖 ) = 2|𝑋𝑖 |, and cost(𝑣 𝑘 ) ≤ 2𝑁(𝑣 𝑘 ) = 2|𝑋 0 |.
                               Í𝑞
Therefore, 𝑧∈𝑉(𝑃) cost(𝑧) ≤ 2 𝑖=2 |𝑋𝑖 | + 2|𝑋 0 | ≤ 2𝑁(𝑣).
          Í
                                                                                                          


4.2     Construction of the bad instance

We construct two instances: instance 𝑋ˆ on 𝑁 ∗ points, that is a semi-permutation (but is
somewhat easier to analyze), and instance 𝑋 ∗ in 𝑁 ∗ points, which is a permutation; the analysis
of instance 𝑋 ∗ heavily relies on the analysis of instance 𝑋. ˆ We will show that the optimal
solution value of both instances is Ω(𝑁 log log 𝑁 ), but the cost of the Wilber Bound is at most
                                         ∗          ∗

𝑂(𝑁 ∗ log log log 𝑁 ∗ ). Our construction uses the following three parameters. We let ℓ ≥ 1 be an
integer, and we set 𝑛 = 2ℓ and 𝑁 = 2𝑛 .


4.2.1    First instance

                                     ˆ which is a semi-permutation containing 𝑁 columns.
We now construct our first instance 𝑋,
Intuitively, we create 𝑁 instances 𝑋 , 𝑋 1 , . . . , 𝑋 𝑁−1 , where instance 𝑋 𝑠 is an exponentially-
                                    0

spaced BRS instance that is shifted by 𝑠 units. We then stack these instances on top of one
another in this order.
Formally, for all 0 ≤ 𝑗 ≤ 𝑁 − 1, we define a set ℛ 𝑗 of 𝑛 consecutive rows with integral coordinates,
such that the rows of ℛ 0 , ℛ 1 , . . . , ℛ 𝑁−1 appear in this bottom-to-top order. Specifically, set ℛ 𝑗
contains all rows whose 𝑦-coordinates are in {𝑗𝑛 + 1, 𝑗𝑛 + 2, . . . , (𝑗 + 1)𝑛}.
For every integer 0 ≤ 𝑠 ≤ 𝑁 − 1, we define a set of points 𝑋 𝑠 , which is a cyclic shift of instance
ES-BRS(ℓ , ℛ 𝑠 ) by 𝑠 units. Recall that |𝑋 𝑠 | = 2ℓ = 𝑛 and that the points in 𝑋 𝑠 appear on the rows
in ℛ 𝑠 and a set 𝒞𝑠 of columns, whose 𝑥-coordinates are in { 2 𝑗 + 𝑠 mod 𝑁 : 0 ≤ 𝑗 < 𝑛}. We
                                                                          

then let our final instance be 𝑋ˆ = 𝑁−1          𝑠                                      ˆ
                                          𝑠=0 𝑋 . From now on, we denote 𝑁 = | 𝑋 |. Recall that
                                                                                   ∗
                                       Ð
|𝑁 | = 𝑁 · 𝑛 = 𝑁 log 𝑁.
   ∗



                         T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                                 28
             P INNING D OWN THE S TRONG W ILBER -1 B OUND FOR B INARY S EARCH T REES

Observe that the number of active columns in 𝑋ˆ is 𝑁. Since the instance is symmetric and
contains 𝑁 ∗ = 𝑁 log 𝑁 points, every column contains exactly log 𝑁 points. Each row contains
exactly one point, so 𝑋ˆ is a semi-permutation. (See Figure 5 for an illustration).




Figure 5: An illustration of our construction. The figure on the left shows the instance
BRS(2, [4], [4]). The figure on the right combines three copies 𝑋 0 , 𝑋 1 , 𝑋 2 of the corresponding
exponentially-spaced instance, with horizontal shifts of 0, 1, and 2, respectively. The red points
are shifted copies of the same point in different sub-instances.

                                                                                     ˆ
Lastly, we need the following bound on the value of the optimal solution of instance 𝑋.
                     ˆ = Ω(𝑁 ∗ log log 𝑁 ∗ )
Observation 4.6. OPT(𝑋)

Proof. From Claims 4.2 and 4.3, for each 0 ≤ 𝑠 ≤ 𝑁 − 1, each sub-instance 𝑋 𝑠 has OPT(𝑋 𝑠 ) ≥
Ω(𝑛 log 𝑛) = Ω(log 𝑁 log log 𝑁). Therefore, OPT(𝑋)   ˆ ≥ Í𝑁−1 OPT(𝑋 𝑠 ) = Ω(𝑁 log 𝑁 log log 𝑁) =
                                                            𝑠=0
Ω(𝑁 ∗ log log 𝑁 ∗ ) (we have used the fact that 𝑁 ∗ = 𝑁 log 𝑁).                               


4.2.2   Second instance

We now construct our second and final instance, 𝑋 ∗ , that is a permutation. In order to do so,
we start with the instance 𝑋,ˆ and, for every active column 𝐶 of 𝑋,     ˆ we create 𝑛 = log 𝑁 new
columns (that we view as copies of 𝐶), 𝐶 , . . . , 𝐶
                                          1          log 𝑁 , which replace the column 𝐶. We denote
this set of columns by ℬ(𝐶), and we refer it as the block of columns representing 𝐶. Recall that
the original column 𝐶 contains log 𝑁 input points of 𝑋.      ˆ We place each such input point on a
distinct column of ℬ(𝐶), so that the points form a monotonically increasing sequence (see the
definition in Section 4.1). This completes the definition of the final instance 𝑋 ∗ . We obtain the
following immediate bound on the optimal solution cost of instance 𝑋 ∗ .
                           ˆ = Ω(𝑁 ∗ log log 𝑁 ∗ ).
Claim 4.7. OPT(𝑋 ∗ ) ≥ OPT(𝑋)

Next, we proceed to prove the following theorem.
                ˆ ≤ 𝑂(𝑁 ∗ log log log 𝑁 ∗ ).
Theorem 4.8. WB(𝑋)

                     T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                        29
              PARINYA C HALERMSOOK , J ULIA C HUZHOY, AND T HATCHAPHOL S ARANURAK

                          ˆ Recall that it consists of 𝑁 instances 𝑋 0 , 𝑋 1 , . . . , 𝑋 𝑁−1 that are stacked
Recall again the instance 𝑋.
on top of each other vertically in this order. We rename these instances as 𝑋1 , 𝑋2 , . . . , 𝑋𝑁 , so 𝑋 𝑗 is
exactly ES-BRS(log 𝑁), that is shifted by (𝑗 − 1) units to the right. Recall that | 𝑋ˆ | = 𝑁 ∗ = 𝑁 log 𝑁,
and each instance 𝑋𝑠 contains exactly log 𝑁 points. We denote by 𝒞 the set of 𝑁 columns,
whose 𝑥-coordinates are 1, 2, . . . , 𝑁. All points of 𝑋ˆ lie on the columns of 𝒞. For convenience,
for 1 ≤ 𝑗 ≤ 𝑁, we denote by 𝐶 𝑗 the column of 𝒞 whose 𝑥-coordinate is 𝑗.
Let 𝜎 be any ordering of the auxiliary columns in ℒ, and let 𝑇 = 𝑇(𝜎) be the corresponding
                                                                                         ˆ is
partitioning tree. It is enough to show that, for any such ordering 𝜎, the value of WB𝜎 (𝑋)
bounded by 𝑂(𝑁 log log log 𝑁 ).
                  ∗             ∗


The total costs of the bound is divided into two parts as follows. Recall that 𝑊 𝐵 𝜎 (𝑋)         ˆ is the
sum, over all vertices 𝑣 ∈ 𝑉(𝑇), of cost(𝑣). If 𝑣 is a leaf vertex, then cost(𝑣) = 0. Otherwise,
let 𝐿 = 𝐿(𝑣) be the line of ℒ that 𝑣 owns. Index the points in 𝑋 ∩ 𝑆(𝑣) by 𝑞1 , . . . , 𝑞 𝑧 in their
bottom-to-top order. A consecutive pair (𝑞 𝑗 , 𝑞 𝑗+1 ) of points is a crossing iff they lie on different
sides of 𝐿(𝑣). We distinguish between the two types of crossings that contribute towards cost(𝑣).
We say that the crossing (𝑞 𝑗 , 𝑞 𝑗+1 ) is of type-1 if both 𝑞 𝑗 and 𝑞 𝑗+1 belong to the same shifted
instance 𝑋𝑠 for some 0 ≤ 𝑠 ≤ 𝑁 − 1. Otherwise, they are of type-2. Note that, if (𝑞 𝑗 , 𝑞 𝑗+1 ) is a
crossing of type 2, with 𝑞 𝑗 ∈ 𝑋𝑠 and 𝑞 𝑗+1 ∈ 𝑋𝑠 0 , then 𝑠, 𝑠 0 are not necessarily consecutive integers,
as it is possible that for some indices 𝑠 00, 𝑋𝑠 00 has no points that lie in the strip 𝑆(𝑣). We now let
cost1 (𝑣) be the total number of type-1 crossings of 𝐿(𝑣), and cost2 (𝑣) the total number of type-2
                                                                                     Í
crossings. Note that cost(𝑣) = cost1 (𝑣) + cost2 (𝑣). We also define cost1 (𝜎) = 𝑣∈𝑉(𝑇) cost1 (𝑣) and
cost2 (𝜎) = 𝑣∈𝑉(𝑇) cost2 (𝑣). Clearly, WB𝜎 (𝑋)
            Í                                  ˆ = cost1 (𝜎) + cost2 (𝜎). We prove the following two
theorems.
Theorem 4.9. For every ordering 𝜎 of the auxiliary columns in ℒ, cost1 (𝜎) ≤ 𝑂(𝑁 ∗ log log log 𝑁 ∗ ).
Theorem 4.10. For every vertex 𝑣 ∈ 𝑉(𝑇), cost2 (𝑣) ≤ 𝑂(log 𝑁) + 𝑂(cost1 (𝑣)).

We prove these theorems in Section 4.3 and 4.4. The latter implies that cost2 (𝜎) ≤ 𝑂(cost1 (𝜎)) +
𝑂(|𝑉(𝑇)| · log 𝑁) = 𝑂(𝑁 ∗ log log log 𝑁 ∗ ) + 𝑂(𝑁 log 𝑁) = 𝑂(𝑁 ∗ log log log 𝑁 ∗ ). Combining the
two theorems together completes the proof of Theorem 4.8.


4.2.3   Upper bounding WB(𝑋 ∗ )

We argue that WB(𝑋 ∗ ) = 𝑂(𝑁 ∗ log log log 𝑁 ∗ ), completing the proof of Theorem 1.1. Recall that
instance 𝑋 ∗ is obtained from instance 𝑋ˆ by replacing every active column 𝐶 of 𝑋 ∗ with a block
ℬ(𝐶) of columns, and then placing the points of 𝐶 on the columns of ℬ(𝐶) so that they form a
monotone increasing sequence, while preserving their 𝑦-coordinates. The resulting collection
of all blocks ℬ(𝐶) partitions the set of all active columns of 𝑋 ∗ . We denote this set of blocks by
ℬ1 , . . . , ℬ𝑁 . The idea is to use Theorem 3.6 in order to bound WB(𝑋 ∗ ).
Consider a set of lines ℒ 0 (with half-integral 𝑥-coordinates) that partition the bounding box
𝐵 into 𝑁 strips, where the 𝑖th strip contains the block ℬ𝑖 of columns, so |ℒ 0 | = (𝑁 − 1). We

                       T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                               30
              P INNING D OWN THE S TRONG W ILBER -1 B OUND FOR B INARY S EARCH T REES

consider a split of instance 𝑋 ∗ by ℒ 0: This gives us a collection of strip instances 𝑋𝑖∗ 1≤𝑖≤𝑁 and
                                                                                        

the compressed instance 𝑋  e∗ . Notice that the compressed instance is precisely 𝑋,  ˆ and each strip
instance 𝑋𝑖 is a monotone increasing point set.
           ∗

Since each strip instance 𝑋𝑖∗ is monotonously increasing, from Observation 4.1 and Claim 2.7,
for all 𝑖, WB(𝑋𝑖∗ ) ≤ 𝑂(OPT(𝑋𝑖∗ )) ≤ 𝑂(|𝑋𝑖∗ |). From Theorem 3.6, we then get that: WB(𝑋 ∗ ) ≤
4WB(𝑋) ˆ + 8 Í𝑖 WB(𝑋 ∗ ) + 𝑂(|𝑋 ∗ |) ≤ 4WB(𝑋)
                                            ˆ + 𝑂(|𝑋 ∗ |) ≤ 𝑂(𝑁 ∗ log log log 𝑁 ∗ ).
                      𝑖



4.3   Bounding type-1 crossings

The goal of this subsection is to prove Theorem 4.9.
We prove this theorem by a probabilistic argument. Consider the following experiment. Fix the
permutation 𝜎 of ℒ. Pick an integer 𝑠 ∈ {0, . . . , 𝑁 − 1} uniformly at random, and let 𝑋 be the
resulting instance 𝑋𝑠 . This random process generates an input 𝑋 containing 𝑛 = log 𝑁 points.
Equivalently, let 𝑝 1 , 𝑝2 , . . . , 𝑝 log 𝑁 be the points in BRS(ℓ ) ordered from left to right. Once we
choose a random shift 𝑠, we move these points to columns in 𝒞𝑠 = {2 𝑗 + 𝑠 mod 𝑁 }, where
point 𝑝 𝑗 would be moved to 𝑥-coordinate 2 𝑗 + 𝑠 mod 𝑁. Therefore, in the analysis, we view the
location of points 𝑝 1 , . . . , 𝑝 log 𝑁 as random variables.
We denote by 𝜇(𝜎) the expected value of WB𝜎 (𝑋), over the choices of the shift 𝑠. The following
observation is immediate, and follows from the fact that the final instance 𝑋ˆ contains every
instance 𝑋𝑠 for all shifts 𝑠 ∈ {0, . . . , 𝑁 − 1}.

Observation 4.11. cost1 (𝜎) = 𝑁 · 𝜇(𝜎)

Therefore, in order to prove Theorem 4.9, it is sufficient to show that, for every fixed permutation
𝜎 of ℒ, 𝜇(𝜎) ≤ 𝑂(log 𝑁 log log log 𝑁) (recall that 𝑁 ∗ = 𝑁 log 𝑁).
We assume from now on that the permutation 𝜎 (and the corresponding partitioning tree 𝑇) is
fixed, and we analyze the expectation 𝜇(𝜎). Let 𝑣 ∈ 𝑉(𝑇). We say that 𝑆(𝑣) is a seam strip iff point
𝑝 1 lies in the strip 𝑆(𝑣). We say that 𝑆(𝑣) is a bad strip (or that 𝑣 is a bad node) if the following
two conditions hold: (i) 𝑆(𝑣) is not a seam strip; and (ii) 𝑆(𝑣) contains at least 100 log log 𝑁
points of 𝑋. Let ℰ(𝑣) be the bad event that 𝑆(𝑣) is a bad strip.

Claim 4.12. For every vertex 𝑣 ∈ 𝑉(𝑇), Pr [ℰ(𝑣)] ≤ 8 width(𝑆(𝑣)) .
                                                        𝑁 log
                                                         100
                                                                𝑁


Proof. Fix a vertex 𝑣 ∈ 𝑉(𝑇). For convenience, we denote 𝑆(𝑣) by 𝑆. Let 𝑠 be the random integer
chosen by the algorithm and let 𝑋𝑠 = 𝑋 be the resulting point set. Assume that 𝑆 is a bad strip,
and let 𝐿 be the vertical line that serves as the left boundary of 𝑆. Let 𝑝 𝑗 be the point of 𝑋𝑠 that
lies to the left of 𝐿, and among all such points, we take the one closest to 𝐿. Recall that for each
1 ≤ 𝑗 < log 𝑁, there are 2 𝑗 − 1 columns of 𝒞 that lie between the column of 𝑝 𝑗 and the column of
𝑝 𝑗+1 .

                      T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                            31
              PARINYA C HALERMSOOK , J ULIA C HUZHOY, AND T HATCHAPHOL S ARANURAK

If 𝑆 is a bad strip, then it must contain points 𝑝 𝑗+1 , 𝑝 𝑗+2 , . . . , 𝑝 𝑗+𝑞 , where 𝑞 = 100 log log 𝑁.
Therefore, the number of columns of 𝒞 in strip 𝑆 is at least 2 𝑗+𝑞−2 , or, equivalently, width(𝑆) ≥
2 𝑗+𝑞 /4 ≥ (2 𝑗 log100 𝑁)/4. In particular, 2 𝑗 ≤ 4 width(𝑆)/log100 𝑁.
Therefore, in order for 𝑆 to be a bad strip, the shift 𝑠 must be chosen in such a way that the point
𝑝 𝑗 , that is the rightmost point of 𝑋𝑠 lying to the left of 𝐿, has 2 𝑗 ≤ 4 width(𝑆)/log100 𝑁. It is easy
to verify that the total number of all such shifts 𝑠 is bounded by 8 width(𝑆)/log100 𝑁.
In order to see this, consider an equivalent experiment, in which we keep the instance 𝑋1
fixed, and instead choose a random shift 𝑠 ∈ {0, . . . , 𝑁 − 1} for the line 𝐿. For the bad event
ℰ(𝑣) to happen, the line 𝐿 must fall in the interval between 𝑥-coordinate 0 and 𝑥-coordinate
8 width(𝑆)/log100 𝑁. Since every integral shift 𝑠 is chosen with the same probability 1/𝑁, the
probability that ℰ(𝑣) happens is at most 8 width(𝑆) .                                          
                                                𝑁 log
                                              100
                                                        𝑁


Consider now the partitioning tree 𝑇. We partition the vertices of 𝑇 into log 𝑁 + 1 classes
𝑄 1 , . . . , 𝑄 log 𝑁+1 . A vertex 𝑣 ∈ 𝑉(𝑇) lies in class 𝑄 𝑖 iff 2𝑖 ≤ width(𝑆(𝑣)) < 2𝑖+1 . Therefore, every
vertex of 𝑇 belongs to exactly one class.
Consider now some vertex 𝑣 ∈ 𝑉(𝑇), and assume that it lies in class 𝑄 𝑖 . We say that 𝑣 is an
important vertex for class 𝑄 𝑖 iff no ancestor of 𝑣 in the tree 𝑇 belongs to class 𝑄 𝑖 . Notice that, if 𝑢
is an ancestor of 𝑣, and 𝑢 ∈ 𝑄 𝑗 , then 𝑗 ≥ 𝑖 must hold.
For each 1 ≤ 𝑖 ≤ log 𝑁 + 1, let 𝑈 𝑖 be the set of all important vertices of class 𝑄 𝑖 .
Observation 4.13. For each 1 ≤ 𝑖 ≤ log 𝑁 + 1, |𝑈 𝑖 | ≤ 𝑁/2𝑖 .

Proof. Since no vertex of 𝑈 𝑖 may be an ancestor of another vertex, the strips in {𝑆(𝑣) | 𝑣 ∈ 𝑈 𝑖 }
are mutually disjoint, except for possibly sharing their boundaries. Since each strip has width at
least 2𝑖 , and we have exactly 𝑁 columns, the number of such strips is bounded by 𝑁/2𝑖 .        

Let ℰ be the bad event that there is some index 1 ≤ 𝑖 ≤ log 𝑁 + 1, and some important vertex
𝑣 ∈ 𝑈 𝑖 of class 𝑄 𝑖 , for which the event ℰ(𝑣) happens. Applying the Union Bound to all strips in
{𝑆(𝑣) | 𝑣 ∈ 𝑈 𝑖 } and all indices 1 ≤ 𝑖 ≤ log 𝑁, we obtain the following corollary of Claim 4.12.
Corollary 4.14. Pr [ℰ] ≤       32
                                    .
                            log99 𝑁


Proof. Fix some index 1 ≤ 𝑖 ≤ log 𝑁. Recall that for every important vertex 𝑣 ∈ 𝑈 𝑖 , the
                                                                      𝑖+1          𝑖
probability that the event ℰ(𝑣) happens is at most 8 width(𝑆(𝑣)) ≤ 8·2 100 = 16·2100 . From
                                                         100
                                                              𝑁 log   𝑁     𝑁 log   𝑁     𝑁 log   𝑁
Observation 4.13, |𝑈 𝑖 | ≤ 𝑁/2𝑖 . From the union bound, the probability that event ℰ(𝑣) happens
for any 𝑣 ∈ 𝑈 𝑖 is bounded by 16        . Using the union bound over all 1 ≤ 𝑖 ≤ log 𝑁 + 1, we
                                    100
                                      log   𝑁
conclude that Pr [ℰ] ≤       32
                                  .                                                                       
                          log99 𝑁


Lastly, we show that, if event ℰ does not happen, then the cost of the Wilber Bound is sufficiently
small.

                       T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                              32
              P INNING D OWN THE S TRONG W ILBER -1 B OUND FOR B INARY S EARCH T REES

Lemma 4.15. Let 1 ≤ 𝑠 ≤ 𝑁 be a shift for which ℰ does not happen. Then:


                                      𝑊 𝐵 𝜎 (𝑋𝑠 ) ≤ 𝑂(log 𝑁 log log log 𝑁).

Proof. Consider the partitioning tree 𝑇 = 𝑇(𝜎). We say that a vertex 𝑣 ∈ 𝑉(𝑇) is a seam vertex iff
𝑆(𝑣) is a seam strip, that is, the point 𝑝 1 in instance 𝑋𝑠 lies in 𝑆(𝑣). Clearly, the root of 𝑇 is a
seam vertex, and for every seam vertex 𝑣, exactly one of its children is a seam vertex. Therefore,
there is a root-to-leaf path 𝑃 that only consists of seam vertices, and every seam vertex lies on
𝑃. We denote the vertices of 𝑃 by 𝑣 1 , 𝑣2 , . . . , 𝑣 𝑞 , where 𝑣 1 is the root of 𝑇, and 𝑣 𝑞 is a leaf. For
1 < 𝑖 ≤ 𝑞, we denote by 𝑣 0𝑖 the sibling of the vertex 𝑣 𝑖 . Note that all strips 𝑆(𝑣 20 ), . . . , 𝑆(𝑣 0𝑞 ) are
                                                                               Í𝑞
mutually disjoint, except for possibly sharing boundaries, and so 𝑖=2 𝑁(𝑣 0𝑖 ) ≤ |𝑋𝑠 | = log 𝑁.
                             Í𝑞
Moreover, from Claim 4.5, 𝑖=1 cost(𝑣 𝑖 ) ≤ 2|𝑋𝑠 | = 2 log 𝑁.
For each 1 < 𝑖 ≤ 𝑞, let 𝑇𝑖 be the subtree of 𝑇 rooted at the vertex 𝑣 0𝑖 . We prove the following
claim:
Claim 4.16. For all 1 < 𝑖 ≤ 𝑞, the total cost of all vertices in 𝑇𝑖 is at most 𝑂(𝑁(𝑣 0𝑖 ) log log log 𝑁).

Assume first that the above claim is correct. Notice that every vertex of 𝑇 that does not lie on
the path 𝑃 must belong to one of the trees 𝑇𝑖 . The total cost of all vertices lying in all trees 𝑇𝑖
                      Í𝑞
is then bounded by 𝑖=2 𝑂(𝑁(𝑣 0𝑖 ) log log log 𝑁) ≤ 𝑂(log 𝑁 log log log 𝑁). Since the total cost
of all vertices on the path 𝑃 is bounded by 2 log 𝑁, overall, the total cost of all vertices in 𝑇 is
bounded by 𝑂(log 𝑁 log log log 𝑁).
In order to complete the proof of Lemma 4.15, it now remains to prove Claim 4.16.

Proof. Claim 4.16 We fix some index 1 < 𝑖 ≤ 𝑞, and consider the vertex 𝑣 0𝑖 . If the parent 𝑣 𝑖−1 of 𝑣 0𝑖
belongs to a different class than 𝑣 0𝑖 , then 𝑣 0𝑖 must be an important vertex in its class. In this case,
since we have assumed that Event ℰ does not happen, 𝑁(𝑣 0𝑖 ) ≤ 𝑂(log log 𝑁). From Claim 4.5,
the total cost of all vertices in 𝑇𝑖 is bounded by
                     Õ
                               cost(𝑣) ≤ 𝑂(𝑁(𝑣 0𝑖 ) log(𝑁(𝑣 0𝑖 ))) ≤ 𝑂(𝑁(𝑣 0𝑖 ) log log log 𝑁)
                    𝑣∈𝑉(𝑇𝑖 )

Therefore, we can assume from now on that 𝑣 𝑖−1 and 𝑣 0𝑖 both belong to the same class, that we
denote by 𝑄 𝑗 . Notice that, if a vertex 𝑣 belongs to class 𝑄 𝑗 , then at most one of its children may
belong to class 𝑄 𝑗 ; the other child must belong to some class 𝑄 𝑗0 for 𝑗 0 < 𝑗, and it must be an
important vertex in its class.
We now construct a path 𝑃𝑖 in tree 𝑇𝑖 iteratively, as follows. The first vertex on the path is 𝑣 0𝑖 .
We then iteratively add vertices at to the end of path 𝑃𝑖 one-by-one, so that every added vertex
belongs to class 𝑄 𝑗 . In order to do so, let 𝑣 be the last vertex on the current path 𝑃𝑖 . If some
child 𝑢 of 𝑣 also lies in class 𝑄 𝑖 , then we add 𝑢 to the end of 𝑃𝑖 and continue to the next iteration.
Otherwise, we terminate the construction of the path 𝑃𝑖 .

                         T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                                33
             PARINYA C HALERMSOOK , J ULIA C HUZHOY, AND T HATCHAPHOL S ARANURAK

Denote the sequence of vertices on the final path 𝑃𝑖 by (𝑣 0𝑖 = 𝑢1 , 𝑢2 , . . . , 𝑢𝑧 ); recall that every
vertex on 𝑃𝑖 belongs to class 𝑄 𝑗 , and that path 𝑃𝑖 is a sub-path of some path connecting 𝑣 0𝑖
to a leaf of 𝑇𝑖 . Let 𝑍 be a set of vertices containing, for all 1 < 𝑧 0 ≤ 𝑧 a sibling of the vertex
𝑢𝑧0 , and additionally the two children of 𝑢𝑧 (if they exist). Note that every vertex 𝑥 ∈ 𝑍 is
an important vertex in its class, and, since we have assumed that Event ℰ did not happen,
𝑁(𝑥) ≤ 𝑂(log log 𝑁). For every vertex 𝑥 ∈ 𝑍, we denote by 𝑇𝑥0 the subtree of 𝑇 rooted at 𝑥. From
Claim 4.5, cost(𝑇𝑥0 ) ≤ 𝑂(𝑁(𝑥) log(𝑁(𝑥))) = 𝑂(𝑁(𝑥) log log log 𝑁).
Notice that all strips in {𝑆(𝑥) | 𝑥 ∈ 𝑍} are disjoint from each other, except for possibly sharing a
boundary. It is then easy to see that 𝑥∈𝑍 𝑁(𝑥) ≤ 𝑁(𝑣 0𝑖 ). Therefore, altogether 𝑥∈𝑍 cost(𝑇𝑥0 ) ≤
                                       Í                                            Í
𝑂(𝑁(𝑣 0𝑖 ) log log log 𝑁)).
Lastly, notice that every vertex of 𝑉(𝑇𝑖 ) either lies on 𝑃𝑖 , or belongs to one of the trees 𝑇𝑥0 for
𝑥 ∈ 𝑍. Since, from Claim 4.5, the total cost of all vertices on 𝑃𝑖 is bounded by 𝑁(𝑣 0𝑖 ), altogether,
the total cost of all vertices in 𝑇𝑖 is bounded by 𝑂(𝑁(𝑣 0𝑖 ) log log log 𝑁)).                      

                                                                                                       

To summarize, if the shift 𝑠 is chosen such that Event ℰ does not happen, 𝑊 𝐵 𝜎 (𝑋𝑠 ) ≤
𝑂(log 𝑁 log log log 𝑁). Assume now that the shift 𝑠 is chosen such that Event ℰ happens.
From Corollary 4.14, the probability of this is at most Pr [ℰ] ≤ 32  . Since |𝑋𝑠 | = log 𝑁,
                                                                  99
                                                                          log   𝑁
from Corollary 4.4, WB𝜎 (𝑋𝑠 ) ≤ |𝑋𝑠 | log(|𝑋𝑠 |) ≤ log 𝑁 log log 𝑁. Therefore, altogether, we
get that 𝜇(𝜎) ≤ 𝑂(log 𝑁 log log log 𝑁), and cost1 (𝜎) = 𝑁 · 𝜇(𝜎) = 𝑂(𝑁 log 𝑁 log log log 𝑁) =
𝑂(𝑁 ∗ log log log 𝑁 ∗ ), as 𝑁 ∗ = 𝑁 log 𝑁.


4.4   Bounding type-2 crossings

This subsection is dedicated to the proof of Theorem 4.10. We fix a vertex 𝑣 ∈ 𝑉(𝑇), and we
denote 𝑆 = 𝑆(𝑣). We also let 𝐿 = 𝐿(𝑣) be the vertical line that 𝑣 owns. Our goal is to show that
the number of type-2 crossings of 𝐿 is bounded by 𝑂(cost1 (𝑣)) + 𝑂(log 𝑁).
Recall that instances 𝑋1 , . . . , 𝑋𝑁 are stacked on top of each other, so that the first log 𝑁 rows
with integral coordinates belong to 𝑋1 , the next log 𝑁 rows belong to 𝑋2 , and so on. If we have
a crossing (𝑝, 𝑝 0), where 𝑝 ∈ 𝑋𝑠 and 𝑝 0 ∈ 𝑋𝑠 0 , then we say that the instances 𝑋𝑠 and 𝑋𝑠 0 are
responsible for this crossings. Recall that 𝑝, 𝑝 0 may only define a crossing if they lie on opposite
sides of the line 𝐿, and if no point of 𝑋ˆ lies in the strip 𝑆 between the row of 𝑝 and the row of 𝑝 0.
It is then clear that every instance 𝑋𝑠 may be responsible for at most two type-2 crossings of 𝐿:
one in which the second instance 𝑋𝑠 0 responsible for the crossing has 𝑠 0 < 𝑠, and one in which
𝑠 0 > 𝑠.
We further partition the type-2 crossings into two sub-types. Consider a crossing (𝑝, 𝑝 0), and let
𝑋𝑠 , 𝑋𝑠 0 be the two instances that are responsible for it. If either of 𝑋𝑠 , 𝑋𝑠 0 contributes a type-1
crossing to the cost of 𝐿, then we say that (𝑝, 𝑝 0) is a type-(2a) crossing; otherwise it is a type-(2b)

                      T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                            34
              P INNING D OWN THE S TRONG W ILBER -1 B OUND FOR B INARY S EARCH T REES

crossing. Clearly, the total number of type-(2a) crossings is bounded by 𝑂(cost1 (𝑣)). It is now
sufficient to show that the total number of all type-(2b) crossings is bounded by 𝑂(log 𝑁).
Consider now some type-(2b) crossing (𝑝, 𝑝 0), and let 𝑋𝑠 and 𝑋𝑠 0 be the two instances that are
responsible for it, with 𝑝 ∈ 𝑋𝑠 . We assume that 𝑠 < 𝑠 0. Since neither instance contributes a
crossing to cost1 (𝑣), it must be the case that all points of 𝑋𝑠 ∩ 𝑆 lie to the left of 𝐿 and all points
of 𝑋𝑠 0 ∩ 𝑆 lie to the right of 𝐿 or vice versa. Moreover, if 𝑠 0 > 𝑠 + 1, then for all 𝑠 < 𝑠 00 < 𝑠 0,
𝑋𝑠 00 ∩ 𝑆 = ∅.
It would be convenient for us to collapse each of the instances 𝑋1 , . . . , 𝑋𝑁 into a single row. In
order to do so, for each 1 ≤ 𝑠 ≤ 𝑁, we replace all rows on which the points of 𝑋𝑠 lie with a single
row 𝑅 𝑠 . If some point of 𝑋𝑠 lies on some column 𝐶, then we add a point at the intersection of 𝑅 𝑠
and 𝐶.
We say that a row 𝑅 𝑠 is empty if there are no input points in 𝑅 𝑠 ∩ 𝑆. We say that it is a neutral
row, if there are points in 𝑅 𝑠 ∩ 𝑆 both to the left of 𝐿 and to the right of 𝐿. We say that it is a left
row if 𝑅 𝑠 ∩ 𝑆 only contains points lying to the left of 𝐿, and we say that it is a right row if 𝑅 𝑠 ∩ 𝑆
only contains points lying to the right of 𝐿.
If we now consider any type-(2b) crossing (𝑝, 𝑝 0), and the instances 𝑋𝑠 , 𝑋𝑠 0 that are responsible
for it, with 𝑠 < 𝑠 0, then it must be the case that exactly one of the rows 𝑅 𝑠 , 𝑅 𝑠 0 is a left row, and
the other is a right row. Moreover, if 𝑠 0 > 𝑠 + 1, then every row lying between 𝑅 𝑠 and 𝑅 𝑠 0 is an
empty row.
Let us denote the points in 𝑋1 by 𝑝 1 , . . . , 𝑝 log 𝑁 , where for each 1 ≤ 𝑖 ≤ log 𝑁, point 𝑝 𝑖 lies in
column 𝐶2𝑖 . In each subsequent instance 𝑋2 , 𝑋3 , . . ., the point is shifted by one unit to the right,
so that in instance 𝑋𝑠 it lies in column 𝐶2𝑖 +𝑠−1 ; every column in 𝒞 must contain exactly one copy
of point 𝑝 𝑖 .
Consider now all copies of the point 𝑝 𝑖 that lie in the strip 𝑆. Let ℛ 𝑖 be the set of rows containing
these copies. Then two cases are possible: either (i) ℛ 𝑖 is a contiguous set of rows, and the
copies of 𝑝 𝑖 appear on ℛ 𝑖 diagonally as an increasing sequence (the 𝑗th row of ℛ 𝑖 contains a
copy of 𝑝 𝑖 that lies in the 𝑗th column of 𝒞 in the strip 𝑆); or ℛ 𝑖 consists of two consecutive sets of
rows; the first set, that we denote by ℛ 0𝑖 , contains 𝑅1 , and the second set, that we denote by ℛ 00𝑖 ,
contains the last row 𝑅 𝑁 . The copies of the point 𝑝 𝑖 also appear diagonally in ℛ 0𝑖 and in ℛ 00𝑖 ; in
ℛ 00𝑖 the first copy lies on the first column of 𝒞 in 𝑆; in ℛ 0𝑖 the last copy lies on the last column of
𝒞 in 𝑆 (see Figure 6).
We show that for each 1 ≤ 𝑖 ≤ log 𝑁, there are at most four type-(2b) crossings of the line 𝐿 in
which a copy of 𝑝 𝑖 participates. Indeed, consider any type-(2b) crossing (𝑝, 𝑝 0) in which a copy
of 𝑝 𝑖 participates. We assume that the row of 𝑝 lies below the row of 𝑝 0. Assume first that both
𝑝 and 𝑝 0 lie on rows of ℛ 𝑖 , and let 𝑅, 𝑅0 be these two rows, with 𝑝 ∈ 𝑅, 𝑝 0 ∈ 𝑅0. Recall that, in
order for (𝑝, 𝑝 0) to define a crossing, all input points that lie on 𝑅 ∩ 𝑆 must lie to the left of 𝐿,
and all point points that lie on 𝑅0 ∩ 𝑆 must lie to the right of 𝐿, or the other way around. It is
easy to verify (see Figure 6) that one of two cases must happen: either 𝑅 contains a copy of 𝑝 𝑖
lying closest to 𝐿 on its left, and 𝑅0 contains a copy of 𝑝 𝑖 lying closest to 𝐿 on its right; or 𝑅 is the


                       T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                            35
               PARINYA C HALERMSOOK , J ULIA C HUZHOY, AND T HATCHAPHOL S ARANURAK


                              L                                                     L




                                                            (b) The two consecutive sets of rows on
      (a) The consecutive set of rows on which              which the copies of 𝑝 𝑖 appear are ℛ 0𝑖 (on
      the copies of 𝑝 𝑖 appear is denoted by ℛ 𝑖 .          the bottom) and ℛ 00𝑖 (on the top).

                Figure 6: Two patterns in which copies of 𝑝 𝑖 may appear on strip 𝑆.


last row of ℛ 0𝑖 , and 𝑅0 is the first row of ℛ 00𝑖 . Therefore, only two such crossing, with 𝑅, 𝑅0 ∈ ℛ 𝑖
are possible.
Assume now that 𝑅 ∈ ℛ 𝑖 and 𝑅0 ∉ ℛ 𝑖 ; recall that we assume that 𝑅0 lies above 𝑅. Then all rows
that lie between 𝑅 and 𝑅0 must be empty, so it is easy to verify that 𝑅 must be the last row of ℛ 𝑖
(or it must be the last row of ℛ 0𝑖 ). In either case, at most one such crossing is possible.
Lastly, we assume that 𝑅 ∉ ℛ 𝑖 and 𝑅0 ∈ ℛ 𝑖 . The analysis is symmetric; it is easy to see that at
most one such crossing is possible.
We conclude that for each 1 ≤ 𝑖 ≤ log 𝑁, at most four type-(2b) crossings of the line 𝐿 may
involve copies of 𝑝 𝑖 , and so the total number of type-(2b) crossings of 𝐿 is bounded by 𝑂(log 𝑁).
To summarize, we have shown that for every ordering 𝜎 of the auxiliary columns in ℒ,
                                                            ˆ = 𝑂(𝑁 ∗ log log log 𝑁 ∗ ). Since OPT(𝑋)
cost1 (𝜎), cost2 (𝜎) ≤ 𝑂(𝑁 ∗ log log log 𝑁 ∗ ), and so WB𝜎 (𝑋)                                     ˆ =
     ∗            ∗                                     ∗           ∗                    ˆ
Ω(𝑁 log log 𝑁 ), we obtain a gap of Ω(log log 𝑁 /log log log 𝑁 ) between OPT(𝑋) and WB(𝑋).          ˆ


4.5    Separating WB(2) and WB
                                      log log 𝑛
In this section, we extend our Ω( log log log 𝑛 )-factor separation between WB and OPT to a separation
between WB and the second Wilber bound (denoted by WB(2) ), which is defined below.4
   4Wilber originally defined this bound based on the tree view. We use an equivalent geometric definition as
discussed in [11, 18].


                        T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                              36
              P INNING D OWN THE S TRONG W ILBER -1 B OUND FOR B INARY S EARCH T REES

Let 𝑋 be a set of 𝑚 points that is a semi-permutation. Consider any point 𝑝 ∈ 𝑋. The funnel of
𝑝, denoted by funnel(𝑋 , 𝑝) is the set of all points 𝑞 ∈ 𝑋, such that 𝑞, 𝑦 < 𝑝.𝑦, and 𝑝,𝑞 contains
no point of 𝑋 \ {𝑝, 𝑞}. Denote funnel(𝑋 , 𝑝) = {𝑎 1 , 𝑎2 , . . . , 𝑎 𝑟 }, where the points are indexed in
the increasing order of their 𝑦-coordinates. Let alt(𝑋 , 𝑝) be the number of indices 1 ≤ 𝑖 < 𝑟,
such that 𝑎 𝑖 lies strictly to the left of 𝑝 and 𝑎 𝑖+1 lies strictly to the right of 𝑝, or the other way
around. The second Wilber bound is:
                                                         Õ
                                      WB(2) (𝑋) = 𝑚 +          alt(𝑋 , 𝑝).
                                                         𝑝∈𝑋


The goal of this section is to prove the following:

Theorem 4.17. For infinitely many integer 𝑛, there exists a point set 𝑋 that is a permutation with
|𝑋 | = 𝑛, such that WB(2) (𝑋) ≥ Ω(𝑛 log log 𝑛) but WB(𝑋) ≤ 𝑂(𝑛 log log log 𝑛).

As it is known that OPT(𝑋) ≥ WB(2) (𝑋) for any point set 𝑋 [29], Theorem 4.17 is a stronger
statement than Theorem 1.1. To prove Theorem 4.17, we use exactly the same permutation
sequence 𝑋 ∗ of size 𝑁 ∗ that is constructed in Section 4.2. Since we already showed that
WB(𝑋 ∗ ) ≤ 𝑂(𝑁 ∗ log log log 𝑁 ∗ ), it remains to show that WB(2) (𝑋 ∗ ) ≥ Ω(𝑁 ∗ log log 𝑁 ∗ ).
We use the following claim of Wilber [29].

Claim 4.18 ([29]). WB(2) (BRS(𝑛)) = Ω(𝑛 log 𝑛) for any 𝑛.

We extend this bound to the cyclically shifted BRS in the following lemma.

Lemma 4.19. For integers 𝑛 > 0, 𝑠 with 0 ≤ 𝑠 < 𝑛, let 𝑋 be the sequence obtained by performing a
cyclic shift to BRS(𝑛) by 𝑠 units. Then WB(2) (𝑋) = Ω(𝑛 log 𝑛).

Proof. Observe that, for any choice of 𝑠, there must exists a subsequence 𝑋 0 of 𝑋 such that 𝑋 0 is
a copy of BRS(𝑛 − 1). It is shown in Lemma 6.2 of [22] that for any pair of sequences 𝑍, 𝑍0 with
𝑍0 ⊆ 𝑍, WB(2) (𝑍0) ≤ WB(2) (𝑍) holds. Therefore, we conclude that

                             WB(2) (𝑋) ≥ WB(2) (BRS(𝑛 − 1)) ≥ Ω(𝑛 log 𝑛).

                                                                                                            

Now, we are ready to bound WB(2) (𝑋 ∗ ).

Lemma 4.20. WB(2) (𝑋 ∗ ) = Ω(𝑁 ∗ log log 𝑁 ∗ ).

Proof. Recall that 𝑋ˆ is the union of the sets 𝑋 0 , 𝑋 1 , . . . , 𝑋 𝑁−1 of points, where for all 0 ≤ 𝑠 ≤ 𝑁 −1,
set 𝑋 𝑠 is an exponentially-spaced BRS instance that is shifted by 𝑠 units. From the definition
                                          ˆ ≥ Í𝑁−1 WB(2) (𝑋 𝑠 ). This is since, for all 0 ≤ 𝑠 ≤ 𝑁 − 1,
of WB(2) , it is easy to see that WB(2) (𝑋)        𝑠=0


                        T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                                37
              PARINYA C HALERMSOOK , J ULIA C HUZHOY, AND T HATCHAPHOL S ARANURAK

for ever point 𝑝 ∈ 𝑋 𝑠 , funnel(𝑋 𝑠 , 𝑝) ⊆ funnel(𝑋ˆ , 𝑝), and moreover alt(𝑋ˆ , 𝑝) ≥ alt(𝑋 𝑠 , 𝑝). From
Lemma 4.19, we get that WB(2) (𝑋 𝑠 ) = Ω(𝑛 log 𝑛), where 𝑛 = |𝑋 𝑠 | = log 𝑁. Therefore,

                                 ˆ ≥ 𝑁 · Ω(𝑛 log 𝑛) = Ω(𝑁 ∗ log log 𝑁 ∗ ).
                          WB(2) (𝑋)

Finally, recall that the sequence 𝑋 ∗ is obtained from 𝑋ˆ by replacing each column 𝐶 of 𝑋ˆ with
a block ℬ(𝐶) of columns, and placing all points of 𝑋ˆ lying in 𝐶 on the columns of ℬ(𝐶) so
that they form a monotonically increasing sequence of length 𝑛. It is not hard to see that
                      ˆ which concludes the proof.
WB(2) (𝑋 ∗ ) ≥ WB(2) (𝑋)                                                                      


5     Guillotine bounds

In this section we define two extensions of the strong Wilber bound and extend our negative
results to one of these bounds. In subSection 5.1, we provide formal definitions of these bounds,
and we present our negative result in subsequent subsections.


5.1     Definitions

Assume that we are given an input set 𝑋 of 𝑛 points, that is a permutation. Let ℒ 𝑉 be the set of
all vertical lines with half-integral 𝑥-coordinates between 1/2 and 𝑛 − 1/2, and let ℒ 𝐻 be the
set of all horizontal lines with half-integral 𝑦-coordinates between 1/2 and 𝑛 − 1/2. Recall that
for every permutation 𝜎 of ℒ 𝑉 , we have defined a bound WB𝜎 (𝑋). We can similarly define a
bound WB0𝜎0 (𝑋) for every permutation 𝜎0 of ℒ 𝐻 . We also let WB0(𝑋) be the maximum, over
all permutations 𝜎0 of ℒ 𝐻 , of WB0𝜎0 (𝑋). Equivalently, let 𝑋 0 be an instance obtained from 𝑋 by
rotating it by 90 degrees clockwise. Then WB0(𝑋) = WB(𝑋 0). We denote by 𝐵 a bounding box
that contains all points of 𝑋.


5.1.1    Consistent Guillotine Bound

In this section we define the consistent Guillotine Bound, cGB(𝑋). Let 𝜎 be any permutations of
all lines in ℒ 𝑉 ∪ ℒ 𝐻 . We start from a bounding box 𝐵 containing all points of 𝑋 and maintain a
partition 𝒫 of the plane into rectangular regions, where initially 𝒫 = {𝐵}. We process the lines
in ℒ 𝑉 ∪ ℒ 𝐻 according to their ordering in 𝜎. Consider an iteration when a line 𝐿 is processed.
Let 𝑃1 , . . . , 𝑃𝑘 be all rectangular regions in 𝒫 that intersect the line 𝐿. For each such region 𝑃 𝑗 ,
let 𝑃 0𝑗 and 𝑃 00𝑗 be the two rectangular regions into which the line 𝐿 splits 𝑃 𝑗 . We update 𝒫 by
replacing each region 𝑃 𝑗 , for 1 ≤ 𝑗 ≤ 𝑘, with the regions 𝑃 0𝑗 and 𝑃 00𝑗 . Once all lines in ℒ 𝑉 ∪ ℒ 𝐻
are processed, we terminate the process.
This recursive partitioning procedure can be naturally associated with a partitioning tree 𝑇 = 𝑇𝜎
that is defined as follows:

                      T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                            38
              P INNING D OWN THE S TRONG W ILBER -1 B OUND FOR B INARY S EARCH T REES

   • Each vertex 𝑣 ∈ 𝑉(𝑇) is associated with a rectangular region 𝑆(𝑣) of the plane. If 𝑟 is the
     root of 𝑇, then 𝑆(𝑟) = 𝐵.

   • Each non-leaf vertex 𝑣 is associated with a line 𝐿(𝑣) ∈ ℒ 𝐻 ∪ ℒ 𝑉 that was used to partition
     𝑆(𝑣) into two sub-regions, 𝑆0 and 𝑆00. Vertex 𝑣 has two children 𝑣 1 , 𝑣2 in 𝑇, with 𝑆(𝑣 1 ) = 𝑆0
     and 𝑆(𝑣 2 ) = 𝑆00.

   • For each leaf node 𝑣, the region 𝑆(𝑣) contains at most one point of 𝑋.

We now define the cost cost(𝑣) of each node 𝑣 ∈ 𝑉(𝑇). If the region 𝑆(𝑣) contains no points
of 𝑋, or it contains a single point of 𝑋, then cost(𝑣) = 0. Otherwise, we define cost(𝑣) in the
same manner as before. Assume first that the line 𝐿(𝑣) is vertical. Let 𝑝 1 , . . . , 𝑝 𝑘 be all points in
𝑋 ∩ 𝑆(𝑣), indexed in the increasing order of their 𝑦-coordinates. A pair (𝑝 𝑗 , 𝑝 𝑗+1 ) of consecutive
points forms a crossing of 𝐿(𝑣) for 𝑆(𝑣), if they lie on the opposite sides of 𝐿(𝑣). We then let
cost(𝑣) be the number of such crossings.
When 𝐿(𝑣) is a horizontal line, cost(𝑣) is defined analogously: we index the points of 𝑋 ∩ 𝑆(𝑣) in
the increasing order of their 𝑥-coordinates. We then say that a consecutive pair of such points is
a crossing iff they lie on opposite sides of 𝐿(𝑣). We let cost(𝑣) be the number of such crossings.
For a fixed ordering 𝜎 of the lines in ℒ 𝑉 ∪ ℒ 𝐻 , and the corresponding partition tree 𝑇 = 𝑇(𝜎),
                      Í
we define cGB𝜎 (𝑋) = 𝑣∈𝑉(𝑇) cost(𝑣).
Lastly, we define the consistent Guillotine Bound for a point set 𝑋 that is a permutation to be
the maximum, over all orderings 𝜎 of the lines in ℒ 𝑉 ∪ ℒ 𝐻 , of cGB𝜎 (𝑋).
In the following subsection we define an even stronger bound, that we call the Guillotine bound,
and we show that for every point set 𝑋 that is a permutation, cGB(𝑋) ≤ GB(𝑋), and moreover
that GB(𝑋) ≤ 𝑂(OPT(𝑋)). It then follows that for every point set 𝑋 that is a permutation,
cGB(𝑋) ≤ 𝑂(OPT(𝑋)).

Theorem 5.1. For every integer 𝑛 0, there is an integer 𝑛 ≥ 𝑛 0, and a set 𝑋 of points that is a permutation
with |𝑋 | = 𝑛, such that OPT(𝑋) ≥ Ω(𝑛 log log 𝑛) but cGB(𝑋) ≤ 𝑂(𝑛 log log log 𝑛).

The following lemma will be helpful in the proof of Theorem 5.1; recall that WB0(𝑋) is the basic
Wilber Bound, where we cut via horizontal lines only.

Lemma 5.2. For every instance 𝑋 that is a permutation, cGB(𝑋) ≤ WB(𝑋) + WB0(𝑋).

Proof. Let 𝜎 be a permutation of ℒ 𝑉 ∪ ℒ 𝐻 , such that cGB(𝑋) = cGB𝜎 (𝑋). Notice that 𝜎
naturally induces a permutation 𝜎0 of ℒ 𝑉 and a permutation 𝜎00 of ℒ 𝐻 . We show that
cGB𝜎 (𝑋) ≤ WB𝜎0 (𝑋) + WB0𝜎00 (𝑋). In order to do so, it is enough to show that for every vertical
line 𝐿 ∈ ℒ 𝑉 , the cost that is charged to 𝐿 in the bound cGB𝜎 (𝑋) is less than or equal to the cost
that is charged to 𝐿 in the bound WB𝜎0 (𝑋), and similarly, for every horizontal line 𝐿0 ∈ ℒ 𝐻 , the
cost that is charged to 𝐿0 in the bound cGB𝜎 (𝑋) is less than or equal to the cost that is charged to
𝐿0 in the bound WB0𝜎00 (𝑋). We show the former; the proof of the latter is similar.

                       T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                              39
             PARINYA C HALERMSOOK , J ULIA C HUZHOY, AND T HATCHAPHOL S ARANURAK

Consider any line 𝐿 ∈ ℒ 𝑉 . We let 𝑇 be the partitioning tree associated with cGB𝜎 (𝑋), just before
line 𝐿 is processed, and we let 𝑇 0 be defined similarly for WB𝜎0 (𝑋). Let 𝑣 ∈ 𝑉(𝑇 0) be the leaf
vertex with 𝐿 ⊆ 𝑆(𝑣), and let 𝑈 be the set of all leaf vertices 𝑢 of the tree 𝑇 with 𝑆(𝑢) ∩ 𝐿 ≠ ∅.
Observe that the set of vertical lines that appear before 𝐿 in 𝜎 and 𝜎0 is identical. Therefore,
𝑆(𝑣) = 𝑢∈𝑈 𝑆(𝑢). It is easy to verify that, for every vertex 𝑢 ∈ 𝑈, every crossing that contributes
        Ð
to cost(𝑢) is also a crossing that is charged to the line 𝐿 in the strip 𝑆(𝑣). Therefore, the total
number of crossings of line 𝐿 in tree 𝑇 0 that contribute to WB𝜎0 (𝑋) is greater than or equal to the
number of crossings of the line 𝐿 that contribute to cGB𝜎 (𝑋).
To conclude, we get that cGB(𝑋) = cGB𝜎 (𝑋) ≤ WB𝜎0 (𝑋) + WB0𝜎00 (𝑋) ≤ WB(𝑋) + WB0(𝑋).                




5.1.2   The Guillotine Bound

In this section, we define a second extension of Wilber Bound, that we call Guillotine Bound,
and denote by GB. The bound is more convenient to define using a partitioning tree instead of a
sequence of lines. Let 𝑋 be a point set which is a permutation.
We define a guillotine partition of a point set 𝑋, together with the corresponding partitioning
tree 𝑇. As before, every node 𝑣 ∈ 𝑉(𝑇) of the partitioning tree 𝑇 is associated with a rectangular
region 𝑆(𝑣) of the plane. At the beginning, we add the root vertex 𝑟 to the tree 𝑇, and we let
𝑆(𝑟) = 𝐵, where 𝐵 is the bounding box containing all points of 𝑋. We then iterate, as long as
some leaf vertex 𝑣 of 𝑇 has 𝑆(𝑣) ∩ 𝑋 containing more than one point. In each iteration, we
select any such leaf vertex 𝑣, and we select an arbitrary vertical or horizontal line 𝐿(𝑣), that is
contained in 𝑆(𝑣), and partitions 𝑆(𝑣) into two rectangular regions, that we denote by 𝑆0 and 𝑆00,
such that 𝑋 ∩ 𝑆0 , 𝑋 ∩ 𝑆00 ≠ ∅. We then add two child vertices 𝑣 1 , 𝑣2 to 𝑣, and set 𝑆(𝑣 1 ) = 𝑆0 and
𝑆(𝑣2 ) = 𝑆00. Once every leaf vertex 𝑣 has |𝑆(𝑣) ∩ 𝑋 | = 1, we terminate the process and obtain the
final partitioning tree 𝑇.
The cost cost(𝑣) of every vertex 𝑣 ∈ 𝑉(𝑇) is calculated exactly as before. We then let GB𝑇 (𝑋) =
  𝑣∈𝑉(𝑇) cost(𝑣), and we let GB(𝑋) be the maximum, over all partitioning trees 𝑇, of GB𝑇 (𝑋).
Í

We note that the main difference between cGB(𝑋) and GB(𝑋) is that in cGB bound, the
partitioning lines must be chosen consistently across all regions: that is, we choose a vertical or
a horizontal line 𝐿 that crosses the entire bounding box 𝐵, and then we partition every region
that intersects 𝐿 by this line 𝐿. In contrast, in the GB bound, we can partition each region 𝑆(𝑣)
individually, and choose different partitioning lines for different regions. It is then easy to see
that GB is more general than cGB, and, in particular, for every point set 𝑋 that is a permutation,
cGB(𝑋) ≤ GB(𝑋).
Lastly, we show that GB is a lower bound on the optimal solution cost, in the following lemma.


Lemma 5.3. For any point set 𝑋 that is a permutation, GB(𝑋) ≤ 2OPT(𝑋).

                      T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                         40
              P INNING D OWN THE S TRONG W ILBER -1 B OUND FOR B INARY S EARCH T REES

5.2     Negative results for the Consistent Guillotine Bound

In this section we prove Theorem 5.1.
We use three main parameters. Let ℓ ≥ 1 be an integer, and let 𝑛 = 2ℓ and 𝑁 = 2𝑛 . As before, we
will first construct point set 𝑋ˆ that is not a permutation (in fact, it is not even a semi-permutation),
and then we will turn it into our final instance 𝑋 ∗ which is a permutation.


5.2.1    2D exponentially spaced bit reversal

We define the instance 2D-ES-BRS(ℓ ) to be a bit-reversal sequence BRS(ℓ , ℛ, 𝒞), where the sets
ℛ and 𝒞 of activerows and columns are defined as follows. Set 𝒞 contains all columns with
𝑥-coordinates in 2 𝑗 | 1 ≤ 𝑗 ≤ 𝑛 , and similarly set ℛ contains all rows with 𝑦-coordinates in
 𝑗
 2 | 1 ≤ 𝑗 ≤ 𝑛 . Note that set 𝑋 contains 𝑛 points, whose 𝑥- and 𝑦-coordinates are integers
between 1 and 𝑁.


5.2.2    2D cyclic shifts

Next, we define the shifted and exponentially spaced instance, but this time we shift both
vertically and horizontally. We assume that we are given a horizontal shift 0 ≤ 𝑠 < 𝑁 and a
vertical shift 0 ≤ 𝑠 0 < 𝑁. In order to construct the instance 𝑋 𝑠,𝑠 , we start with the instance
                                                                     0


𝑋 = 2D-ES-BRS(ℓ ), and then perform the following two operations. First, we perform a
horizontal shift by 𝑠 units as before, by moving the last 𝑠 columns with integral 𝑥-coordinates to
the beginning of the instance. Next, we perform a vertical shift, by moving the last 𝑠 0 rows with
integral 𝑦-coordinates to the bottom of the instance. We let 𝑋 𝑠,𝑠 denote the resulting instance.
                                                                   0


By applying Claim 4.3 twice, once for the horizontal shift, and once for the vertical shift, we get
that OPT(𝑋 𝑠,𝑠 ) ≥ OPT(𝑋) − 2|𝑋 | ≥ Ω(log 𝑁 log log 𝑁), since |𝑋 | = log 𝑁.
               0




5.2.3    Instance 𝑋ˆ

Next, we construct an instance 𝑋,       ˆ by combining the instances 𝑋 𝑠,𝑠 0 for 0 ≤ 𝑠, 𝑠 0 < 𝑁. In order
to do so, let 𝒞ˆ be a set of 𝑁 2 columns, with integral 𝑥-coordinates 1, . . . , 𝑁 2 . We partition 𝒞ˆ
into subsets 𝒞1 , 𝒞2 , . . . , 𝒞𝑁 , each of which contains 𝑁 consecutive columns, they appear in this
left-to-right order. We call each such set 𝒞𝑖 a super-column. We denote by 𝑆𝑉     𝑖
                                                                                    the smallest vertical
strip containing all columns of 𝒞𝑖 .
Similarly, we let ℛ̂ be a set of 𝑁 2 rows, with integral 𝑦-coordinates 1, . . . , 𝑁 2 . We partition
ℛ̂ into subsets ℛ 1 , . . . , ℛ 𝑁 , each of which contains 𝑁 consecutive rows, such that ℛ 1 , . . . , ℛ 𝑁
appear in this bottom-to-top order. We call each such set ℛ 𝑖 a super-row. We denote by 𝑆 𝑖𝐻 the
smallest horizontal strip containing all rows of ℛ 𝑖 . For all 1 ≤ 𝑖, 𝑗 ≤ 𝑁, we let 𝐵(𝑖, 𝑗) be the
intersection of the horizontal strip 𝑆 𝑖𝐻 and the vertical strip 𝑆𝑉
                                                                  𝑖
                                                                    . We plant the instance 𝑋 (𝑖−1),(𝑗−1)

                       T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                            41
              PARINYA C HALERMSOOK , J ULIA C HUZHOY, AND T HATCHAPHOL S ARANURAK

                                                                  ˆ Let 𝑁 ∗ = | 𝑋ˆ | = 𝑁 2 log 𝑁
into the box 𝐵(𝑖, 𝑗). This completes the construction of instance 𝑋.
(recall that each instance 𝑋 𝑠,𝑠 0
                                   contains log 𝑁 points.)
Observe that, for each vertical strip 𝑆𝑉  𝑖
                                            , all instances planted into 𝑆𝑉
                                                                          𝑖
                                                                            have the same vertical
shift - (𝑖 − 1); the horizontal shift 𝑠 of each instance increases from 0 to 𝑁 − 1 as we traverse
                                       0

𝑆𝑉𝑖
    from bottom to top. In particular, the instance planted into 𝑆1𝑉 is precisely the instance 𝑋ˆ
from Section 4.2 (if we ignore inactive rows). For each 𝑖 > 1, the instance planted into 𝑆𝑉
                                                                                          𝑖
                                                                                            is very
similar to the instance 𝑋ˆ from Section 4.2, except that each of its corresponding sub-instances is
shifted vertically by exactly (𝑖 − 1) rows.
Similarly, for each horizontal strip 𝑆 𝐻
                                       𝑗
                                         , all instances planted into 𝑆 𝐻
                                                                        𝑗
                                                                          have the same horizontal
shift - (𝑗 − 1); the vertical shift 𝑠 0 of each instance increases from 0 to 𝑁 − 1 as we traverse 𝑆 𝐻
                                                                                                    𝑗
from left to right.
Since, for every instance 𝑋 𝑠,𝑠 , OPT(𝑋 𝑠,𝑠 ) = Ω(log 𝑁 log log 𝑁), we obtain the following bound.
                              0           0


                     ˆ = Ω(𝑁 2 log 𝑁 log log 𝑁) = Ω(𝑁 ∗ log log 𝑁 ∗ ).
Observation 5.4. OPT(𝑋)

Since instance 𝑋ˆ is symmetric, and every point lies on one of the 𝑁 2 rows of ℛ̂ and on one of
                 ˆ we obtain the following.
the 𝑁 2 rows of 𝒞,
                                                                  ˆ Similarly, every column of
Observation 5.5. Every row in ℛ̂ contains exactly log 𝑁 points of 𝑋.
𝒞ˆ contains exactly log 𝑁 points of 𝑋.
                                    ˆ


5.2.4    Final instance

Lastly, in order to turn 𝑋ˆ into a permutation 𝑋 ∗ , we perform a similar transformation to that in
Section 4.2: for every column 𝐶 ∈ 𝒞, we replace 𝐶 with a collection ℬ(𝐶) of log 𝑁 consecutive
columns, and we place all points that lie on 𝐶 on the columns of ℬ(𝐶), so that they form an
increasing sequence, while preserving their 𝑦-coordinates. We replace every row 𝑅 ∈ ℛ by a
collection ℬ(𝑅) of log 𝑁 rows similarly. The resulting final instance 𝑋 ∗ is now guaranteed to be
a permutation, and it contains 𝑁 ∗ = 𝑁 2 log 𝑁 points. Using the same reasoning as in Section 4.2,
                                          ˆ ≥ Ω(𝑁 ∗ log log 𝑁 ∗ ). In the remainder of this section,
it is easy to verify that OPT(𝑋 ∗ ) ≥ OPT(𝑋)
we will show that cGB(𝑋 ∗ ) = 𝑂(𝑁 ∗ log log log 𝑁 ∗ ).
Abusing the notation, for all 1 ≤ 𝑖 ≤ 𝑁 2 , we denote by 𝑆𝑉      𝑖
                                                                     the vertical strip obtained by
taking the union of all blocks ℬ(𝐶) of columns, where 𝐶 belonged to the original strip 𝑆𝑉      𝑖
                                                                                                 . We
                                𝐻
define the horizontal strips 𝑆 𝑖 similarly. Note that, from Lemma 5.2, it is enough to prove
that WB(𝑋 ∗ ) = 𝑂(𝑁 ∗ log log log 𝑁 ∗ ) and that WB0(𝑋 ∗ ) = 𝑂(𝑁 ∗ log log log 𝑁 ∗ ). We do so in the
following two subsections.


5.3     Handling vertical cuts

The goal of this section is to prove the following theorem:

                      T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                        42
              P INNING D OWN THE S TRONG W ILBER -1 B OUND FOR B INARY S EARCH T REES

Theorem 5.6. WB(𝑋 ∗ ) ≤ 𝑂(𝑁 ∗ log log log 𝑁 ∗ ).

For all 1 ≤ 𝑖 ≤ 𝑁, we denote by ℬ𝑖 the set of active columns that lie in the vertical strip 𝑆𝑉     𝑖
                                                                                                     ,
so that ℬ1 , . . . , ℬ𝑁 partition the set of active columns of 𝑋 . Let ℒ be a collection of lines at
                                                                 ∗          0

half-integral coordinates that partitions the bounding box 𝐵 into 𝑁 strips where each strip
contains exactly the block ℬ𝑖 of columns. We consider the split of 𝑋 ∗ by the lines ℒ 0: This is a
collection of 𝑁 strip instances (that we will denote by 𝑋1∗ , . . . , 𝑋𝑁
                                                                       ∗
                                                                         ) and a compressed instance,
                        ˜
that we denote by 𝑋. In order to prove Theorem 5.6, we bound WB(𝑋𝑖∗ ) for every strip instance
𝑋𝑖∗ , and WB(𝑋) ˜ for the compressed instance 𝑋,    ˜ and then combine them using Theorem 3.6 in
order to obtain the final bound on WB(𝑋 ).     ∗



5.3.1   Bounding Wilber bound for strip instances

In this subsection, we prove the following lemma.
Lemma 5.7. For all 1 ≤ 𝑖 ≤ 𝑁, WB(𝑋𝑖∗ ) ≤ 𝑂(𝑁 log 𝑁 log log log 𝑁).

From now on we fix an index 𝑖, and consider the instance 𝑋𝑖∗ . Recall that in order to construct
instance 𝑋𝑖∗ , we started with the instances 𝑋 0,𝑖 , 𝑋 1,𝑖 , . . . , 𝑋 𝑁−1,𝑖 , each of which has the same
vertical shift (shift 𝑖), and horizontal shifts ranging from 0 to 𝑁 − 1. Let 𝑋ˆ 𝑖 be the instance
obtained by stacking these instances one on top of the other, similarly to the construction of
instance 𝑋ˆ in Section 4.2. As before, instance 𝑋ˆ 𝑖 is a semi-permutation, so every row contains at
most one point. Every column of 𝑋ˆ 𝑖 contains exactly log 𝑁 points of 𝑋ˆ 𝑖 . Let 𝒞 denote the set
of all active columns of instance 𝑋ˆ 𝑖 . For every column 𝐶 ∈ 𝒞, we replace 𝐶 with a block ℬ(𝐶)
of log 𝑁 columns, and place all points of 𝑋ˆ 𝑖 ∩ 𝐶 on the columns of ℬ(𝐶), so that they form an
increasing sequence, while preserving their 𝑦-coordinates. The resulting instance is equivalent
to 𝑋𝑖∗ (to obtain instance 𝑋𝑖∗ we also need to replace every active row 𝑅 with a block ℬ(𝑅) of
log 𝑁 rows; but since every row contains at most one point of 𝑋ˆ 𝑖 , this amounts to inserting
empty rows into the instance).
The analysis of WB(𝑋𝑖∗ ) is very similar to the analysis of WB(𝑋 ∗ ) for instance 𝑋 ∗ constructed in
Section 4.2. Notice that, as before, it is sufficient to show that WB(𝑋ˆ 𝑖 ) ≤ 𝑂(𝑁 log 𝑁 log log log 𝑁).
Indeed, consider the partition {ℬ(𝐶)} 𝐶∈𝒞 of the columns of 𝑋𝑖∗ . Then 𝑋ˆ 𝑖 can be viewed as the
compressed instance for 𝑋𝑖∗ with the respect to this partition. Each resulting strip instance
(defined by the block ℬ(𝐶) of columns) is an increasing sequence of log 𝑁 points, so the Wilber
Bound value for such an instance is 𝑂(log 𝑁). Altogether, the total Wilber Bound of all such
strip instances is 𝑂(𝑁 log 𝑁). Therefore, from Theorem 3.6, in order to prove Lemma 5.7, it is
now sufficient to show that WB(𝑋ˆ 𝑖 ) ≤ 𝑂(𝑁 log 𝑁 log log log 𝑁).
Let ℒ be the set of all vertical lines with half-integral coordinates for the instance 𝑋ˆ 𝑖 , and let 𝜎 be
any permutation of these lines. Our goal is to prove that WB𝜎 (𝑋ˆ 𝑖 ) ≤ 𝑂(𝑁 log 𝑁 log log log 𝑁).
Let 𝑇 = 𝑇(𝜎) be the partitioning tree associated with 𝜎. Consider some vertex 𝑣 ∈ 𝑉(𝑇) and the

                       T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                            43
              PARINYA C HALERMSOOK , J ULIA C HUZHOY, AND T HATCHAPHOL S ARANURAK

line 𝐿 that 𝑣 owns. As before, we classify crossings that are charged to 𝐿 into several types. A
crossing (𝑝, 𝑝 0) is a type-1 crossing, if 𝑝 and 𝑝 0 both lie in the same instance 𝑋 𝑗,𝑖 . We say that
instance 𝑋 𝑗,𝑖 is bad for 𝐿, if it contributes at least one type-1 crossing to the cost of 𝐿. If 𝑝 ∈ 𝑋 𝑗,𝑖
and 𝑝 0 ∈ 𝑋 𝑗 ,𝑖 for 𝑗 ≠ 𝑗 0, then we say that (𝑝, 𝑝 0) is a type-2 crossing. If either instance 𝑋 𝑗,𝑖 or
               0


𝑋 𝑗 ,𝑖 is a bad instance for 𝐿, then the crossing is of type (2a); otherwise it is of type (2b).
   0



We now bound the total number of crossings of each of these types separately.

   • Type-1 Crossings. We bound the total number of all type-1 crossings exactly like in
     Section 4.3. We note that the proof does not use the vertical locations of the points in the
     sub-instances 𝑋 𝑗,𝑖 , and only relies on two properties of instance 𝑋:
                                                                          ˆ (i) the points in the
     first instance 𝑋0 (corresponding to instance 𝑋 0,𝑖 ) are exponentially spaced horizontally,
     so the 𝑥-coordinates of the points are integral powers of 2, and they are all distinct; and
     (ii) each subsequent instance 𝑋𝑠 (corresponding to instance 𝑋 𝑠,𝑖 ) is a copy of 𝑋0 that is
     shifted horizontally by 𝑠 units. Therefore, the same analysis applies, and the total number
     of type-1 crossings in 𝑋ˆ 𝑖 can be bounded by 𝑂(𝑁 log 𝑁 log log log 𝑁) as before.

   • Type-(2a) Crossings. As before, we charge each type-(2a) crossing to one of the correspond-
     ing bad instances, to conclude that the total number of type-(2a) crossings is bounded by
     the total number of type-1 crossings, which is in turn bounded by 𝑂(𝑁 log 𝑁 log log log 𝑁).

   • Type-(2b) Crossings. Recall that in order to bound the number of type-(2b) crossings,
     we have collapsed, for every instance 𝑋𝑠 , all rows of 𝑋𝑠 into a single row. If we similarly
     collapse, for every instance 𝑋 𝑠,𝑖 , all rows of this instance into a single row, we will obtain
     an identical set of points. This is because the only difference between instances 𝑋𝑠 and
     𝑋 𝑠,𝑖 is vertical position of their points. Therefore, the total number of type-(2b) crossings
     in 𝑋ˆ 𝑖 is bounded by 𝑂(𝑁 log 𝑁) as before.
        This finishes the proof of Lemma 5.7. We conclude that

                    𝑁
                    Õ
                           WB(𝑋𝑖∗ ) ≤ 𝑂(𝑁 2 log 𝑁 log log log 𝑁) = 𝑂(𝑁 ∗ log log log 𝑁 ∗ ) .        (5.1)
                     𝑖=1



5.3.2    Bounding Wilber bound for the compressed instance

In this subsection, we prove the following lemma.
              ˜ ≤ 𝑂(𝑁 ∗ ).
Lemma 5.8. WB(𝑋)

We denote the active columns of 𝑋˜ by 𝐶1 , . . . , 𝐶 𝑁 . Recall that each column 𝐶 𝑖 contains exactly
𝑁 log 𝑁 input points. Let ℛ be the set of all rows with integral coordinates, so |ℛ| = 𝑁 2 log 𝑁.
Let ℬ1 , ℬ2 , . . . , ℬ𝑁 2 be a partition of the rows in ℛ into blocks containing log 𝑁 consecutive rows
each, where the blocks are indexed in their natural bottom-to-top order. Recall that each such

                       T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                            44
                P INNING D OWN THE S TRONG W ILBER -1 B OUND FOR B INARY S EARCH T REES

                                                 ˆ and the points of 𝑋˜ that lie on the rows of ℬ𝑖
block ℬ𝑖 represents some active row of instance 𝑋,
form an increasing sequence. We also partition the rows of ℛ into super-blocks, ℬ̂1 , . . . , ℬ̂𝑁 ,
where each superblock is the union of exactly 𝑁 consecutive blocks. For each subinstance 𝑋 𝑠,𝑠 ,
                                                                                                 0


the points of the subinstance lie on rows that belong to a single super-block.
                                                                   ˜ so |ℒ| ≤ 𝑁. We fix any
Let ℒ be the set of all columns with half-integral coordinates for 𝑋,
                                         ˜ ≤ 𝑂(𝑁 ). Let 𝑇 be the partitioning tree associated
permutation 𝜎 of ℒ, and prove that WB𝜎 (𝑋)          ∗

with the permutation 𝜎.
Consider any vertex 𝑣 ∈ 𝑉(𝑇), its corresponding vertical strip 𝑆 = 𝑆(𝑣), and the vertical line
𝐿 = 𝐿(𝑣) that 𝑣 owns. Let (𝑝, 𝑝 0) be a crossing of 𝐿, so 𝑝 and 𝑝 0 both lie in 𝑆 on opposite sides of
𝐿, and no point of 𝑋˜ ∩ 𝑆 lies between the row of 𝑝 and the row of 𝑝 0. Assume that the row of 𝑝
is below the row of 𝑝 0. We say that the crossing is left-to-right if 𝑝 is to the left of 𝐿, and we say
that it is right-to-left otherwise. In order to bound the number of crossings, we use the following
two claims.

Claim 5.9. Assume that (𝑝, 𝑝 0) is a left-to-right crossing, and assume that 𝑝 lies on a row of ℬ𝑖 and 𝑝 0
lies on a row of ℬ 𝑗 , with 𝑖 ≤ 𝑗. Then either 𝑗 ≤ 𝑖 + 1 (so the two blocks are either identical or consecutive),
or block ℬ𝑖 is the last block in its super-block.

Proof. Assume that 𝑝 lies on column 𝐶 𝑠 0 and on a row of super-block ℬ̂𝑠 , so this point originally
belonged to instance 𝑋 𝑠,𝑠 . Recall that instance 𝑋 𝑠,𝑠 +1 (that lies immediately to the right of 𝑋 𝑠,𝑠 )
                             0                         0                                               0


is obtained by circularly shifting all points in instance 𝑋 𝑠,𝑠 by one unit up. In particular, a copy
                                                               0


𝑝 𝑐 of 𝑝 in 𝑋 𝑠,𝑠 +1 should lie one row above the copy of 𝑝 in 𝑋 𝑠,𝑠 , unless 𝑝 lies on the last row of
                 0                                                    0


𝑋 𝑠,𝑠 . In the latter case, block ℬ𝑖 must be the last block of its superblock ℬ̂𝑠 . In the former case,
     0


since point 𝑝 𝑐 does not lie between the row of 𝑝 and the row of 𝑝 0, and it lies on column 𝐶 𝑠+1 ,
the block of rows in which point 𝑝 0 lies must be either ℬ𝑖 or ℬ𝑖+1 , that is, 𝑗 ≤ 𝑖 + 1.              

Claim 5.10. Assume that (𝑝, 𝑝 0) is a right-to-left crossing, and assume that 𝑝 lies on a row of ℬ𝑖 and 𝑝 0
lies on a row of ℬ 𝑗 , with 𝑖 ≤ 𝑗. Then either (i) 𝑗 ≤ 𝑖 + 1 (so the two blocks are identical or consecutive); or
(ii) block ℬ𝑖 is the last block in its super-block; or (iii) block ℬ 𝑗 is the first block it its super-block; or (iv) 𝑝
lies on the last active column in strip 𝑆, and 𝑝 0 lies on the first active column in strip 𝑆.

Proof. Assume that 𝑝 lies on column 𝐶 𝑠 0 and on a row of superblock ℬ̂𝑠 , so this point originally
belonged to instance 𝑋 𝑠,𝑠 . Assume for that 𝐶 𝑠 0 is not the last active column of 𝑆, so 𝐶 𝑠 0+1 also
                          0


lies in 𝑆.
Recall that instance 𝑋 𝑠,𝑠 +1 is obtained by circularly shifting all points in instance 𝑋 𝑠,𝑠 by one
                              0                                                                             0


unit up. In particular, a copy 𝑝 𝑐 of 𝑝 in 𝑋 𝑠,𝑠 +1 should lie one row above the copy of 𝑝 in 𝑋 𝑠,𝑠 ,
                                                0                                                  0


unless 𝑝 lies on the last row of 𝑋 𝑠,𝑠 . In the latter case, block ℬ𝑖 must be the last block of its
                                       0


superblock. In the former case, since point 𝑝 𝑐 does not lie between the row of 𝑝 and the row of
𝑝 0, the block of rows in which point 𝑝 0 lies is either ℬ𝑖 or ℬ𝑖+1 , that is, 𝑗 ≤ 𝑖 + 1.
Using a symmetric argument, if 𝑝 0 does not lie on the first active column of 𝑆, then either
𝑗 ≤ 𝑖 + 1, or ℬ 𝑗 is the first block in its super-block.                                  

                          T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                                      45
               PARINYA C HALERMSOOK , J ULIA C HUZHOY, AND T HATCHAPHOL S ARANURAK

We can now categorize all crossings charged to the line 𝐿 into types as follows. Let (𝑝, 𝑝 0) be a
crossing, and assume that 𝑝 lies on a row of ℬ𝑖 , 𝑝 0 lies on a row of ℬ 𝑗 , and 𝑖 ≤ 𝑗. We say that
(𝑝, 𝑝 0) is a crossing of type 1, if 𝑗 ≤ 𝑖 + 1. We say that it is a crossing of type 2 if either ℬ𝑖 or ℬ 𝑗
are the first or the last blocks in their superblock. We say that it is of type 3 if 𝑝 lies on the last
active column of 𝑆 and 𝑝 0 lies on the first active column of 𝑆.
We now bound the total number of all such crossings separately.

      • Type-1 crossings Consider any pair ℬ𝑖 , ℬ𝑖+1 of consecutive blocks, and let 𝑋˜ 𝑖0 be the set
        of all points lying on the rows of these blocks. Recall that all points lying on the rows
        of ℬ𝑖 form an increasing sequence of length log 𝑁, and the same is true for all points
        lying on the rows of ℬ𝑖+1 . It is then easy to see that OPT(𝑋˜ 𝑖0) ≤ 𝑂(log 𝑁), and so the total
        contribution of crossings between the points of 𝑋˜ 𝑖0 to WB𝜎 (𝑋)   ˜ is bounded by 𝑂(log 𝑁).
        Since the total number of blocks ℬ𝑖 is bounded by 𝑁 , the total number of type-1 crossings
                                                                2

        is at most 𝑂(𝑁 2 log 𝑁).

      • Type-2 crossings In order to bound the number of type-2 crossings, observe that |ℒ| ≤ 𝑁.
        If 𝐿 ∈ ℒ is a vertical line, and 𝑆 is a strip that 𝐿 splits, then there are 𝑁 superblocks of rows
        that can contribute type-2 crossings to cost(𝐿), and each such superblock may contribute
        at most one crossing. Therefore, the total number of type-2 crossings charged to 𝐿 is at
        most 𝑁, and the total number of all type-2 crossings is 𝑂(𝑁 2 ).

      • Type-3 crossings In order to bound the number of type-3 crossings, observe that every
        column contains 𝑁 log 𝑁 points. Therefore, if 𝐿 ∈ ℒ is a vertical line, then the number
        of type-3 crossings charged to it is at most 2𝑁 log 𝑁. As |ℒ| ≤ 𝑁, we get that the total
        number of type-3 crossings is 𝑂(𝑁 2 log 𝑁).

To conclude, we have shown that WB𝜎 (𝑋)    ˜ = 𝑂(𝑁 2 log 𝑁) = 𝑂(𝑁 ∗ ), proving Lemma 5.8. By
combining Lemmas 5.7 and 5.8, together with Theorem 3.6, we conclude that WB(𝑋 ∗ ) =
𝑂(𝑁 ∗ log log log 𝑁 ∗ ), proving Theorem 5.6.


5.4     Handling horizontal cuts

We show the following analogue of Theorem 5.6.

Theorem 5.11. WB0(𝑋 ∗ ) ≤ 𝑂(𝑁 ∗ log log log 𝑁 ∗ ).

The proof of the theorem is virtually identical to the proof of Theorem 5.6. In fact, consider
the instance 𝑋 ∗∗ , that is obtained from 𝑋 ∗ , by rotating it by 90 degrees clockwise. Consider the
sequence BRS0(ℓ , ℛ, 𝒞) that is obtained by rotating the point set BRS(ℓ , ℛ, 𝒞) by 90 degrees.
Consider now the following process. Our starting point is the rotated Bit Reversal Sequence. We
then follow exactly the same steps as in the construction of the instance 𝑋 ∗ . Then the resulting
instance is precisely (a mirror reflection of) instance 𝑋 ∗∗ . Notice that the only place where our

                        T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                           46
              P INNING D OWN THE S TRONG W ILBER -1 B OUND FOR B INARY S EARCH T REES

proof uses the fact that we start with the Bit Reversal Sequence is in order to show that OPT(𝑋 ∗ )
is sufficiently large. In fact we could replace the Bit Reversal Sequence with any other point
set that is a permutation, and whose optimal solution cost is as large, and in particular the Bit
Reversal Sequence that is rotated by 90 degrees would work just as well. The analysis of the
Wilber Bound works exactly as before, and Theorem 5.11 follows.




6     Algorithmic results

6.1   Overview

We provide the high level intuition for the proof of Theorem 1.2. Both the polynomial time and
the subexponential time algorithms follow the same framework. We start with a high-level
overview of this framework. For simplicity, assume that the number of active columns in the
input instance 𝑋 is an integral power of 2. The key idea is to decompose the input instance into
smaller sub-instances, using the split instances defined in Section 3.1. We solve the resulting
instances recursively and then combine the resulting solutions.
Suppose we are given an input point set 𝑋 that is a semi-permutation, with |𝑋 | = 𝑚, such that
the number of active columns is 𝑛. We consider a balanced partitioning tree 𝑇, where for every
vertex 𝑣 ∈ 𝑉(𝑇), the line 𝐿(𝑣) that 𝑣 owns splits the strip 𝑆(𝑣) in the middle, with respect to the
active columns that are contained in 𝑆(𝑣). Therefore, the height of the partitioning tree is log 𝑛.
Consider now the set 𝑈 of vertices√ of 𝑇 that lie in the middle layer of 𝑇. Let ℒ 0 be their strip
boundaries, so we have |ℒ | = Θ( 𝑛). Consider the split of 𝑋 by ℒ , obtaining a new collection
                             0                                      0
                                       √
              e , {𝑋𝑖 }) 𝑘 where 𝑘 = Θ( 𝑛). Note that each resulting strip instance 𝑋𝑖 contains
of instances (𝑋
   √                     𝑖=1
Θ( 𝑛) active columns, and so does the compressed instance 𝑋.  e
We recursively solve each such instance and then combine the resulting solutions. The key to
the algorithm and its analysis is to show that there is a collection 𝑍 of 𝑂(|𝑋 |) points, such that,
if we are given
               any solution
                            𝑌
                             b to instance 𝑋,
                                           e and, for all 1 ≤ 𝑖 ≤ 𝑘, any solution 𝑌𝑖 to instance 𝑋𝑖 ,
               Ð𝑘
then 𝑍 ∪ 𝑌
         b∪
                 𝑖=1 𝑌𝑖   is a feasible solution to instance 𝑋. We also show that the total number of
input points that appear in all instances that participate in the same recursive level is bounded
by 𝑂(OPT(𝑋)). This ensures that in every recursive level we add at most 𝑂(OPT(𝑋)) points to
the solution, and the total solution cost is at most 𝑂(OPT(𝑋)) times the number of the recursive
levels, which is bounded by 𝑂(log log 𝑛).
In order to obtain the subexponential time algorithm, we restrict the recursion to 𝐷 levels, and
then solve each resulting instance 𝑋 0 directly in time 𝑟(𝑋 0)𝑐(𝑋 0)𝑂(𝑐(𝑋 )) . This approach gives
                                                                         0

                                                                                        Ω(𝐷)
                                                                                               
an 𝑂(𝐷)-approximation algorithm with running time at most poly(𝑚) · exp 𝑛 1/2                  log 𝑛 as
desired.

                      T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                           47
               PARINYA C HALERMSOOK , J ULIA C HUZHOY, AND T HATCHAPHOL S ARANURAK




                (a) Canonical Solution                                    (b) Special Solution

Figure 7: Canonical and 𝑇-special solutions of 𝑋. The input points are shown as circles; the
points that belong to the solution 𝑌 are shown as squares.


6.2   Special solutions and reduced sets

Our algorithm will produce feasible solutions of a special form, that we call special solutions.
Recall that, given a semi-permutation point set 𝑋, the auxiliary columns for 𝑋 are a set ℒ of
vertical lines with half-integral coordinates. We say that a solution 𝑌 for 𝑋 is special iff every
point of 𝑌 lies on an row that is active for 𝑋, and on a column of ℒ. In particular, special
solutions are by definition non-canonical (see Figure 7 for an illustration).
If 𝜎0 is any ordering of the auxiliary columns in ℒ 0 ⊆ ℒ, and 𝑇 0 = 𝑇(𝜎0) is the corresponding
partitioning tree, then any point set 𝑌 that is a special solution for 𝑋 is also called a 𝑇 0-special
solution: The solutions use only auxiliary columns in ℒ 0 (equivalently, strip boundaries of 𝑆(𝑣)
for 𝑣 ∈ 𝑉(𝑇 0)).
Consider a semi-permutation 𝑋, that we think of as a potential input to the Min-Sat problem.
We denote 𝑋 = {𝑝 1 , . . . , 𝑝 𝑚 }, where the points are indexed in their natural bottom-to-top order,
so (𝑝 1 ).𝑦 < (𝑝 2 ).𝑦 < . . . < (𝑝 𝑚 ).𝑦. A point 𝑝 𝑖 is said to be redundant, if and only if the points
immediately above and below are on its column, that is, for some 𝑖, (𝑝 𝑖 ).𝑥 = (𝑝 𝑖+1 ).𝑥 = (𝑝 𝑖−1 ).𝑥.
We say that a semi-permutation 𝑋 is in the reduced form if there are no redundant points in 𝑋.
The following lemma relates the optimal solutions of any instance and its reduced form.

Lemma 6.1. Let 𝑋 be a semi-permutation, and let 𝑋 0 ⊆ 𝑋 be any point set, that is obtained from 𝑋 by
repeatedly removing redundant points. Then OPT(𝑋 0) ≤ OPT(𝑋). Moreover, if 𝑌 is a feasible solution
for 𝑋 0 such that every point of 𝑌 lies on a row that is active for 𝑋 0, then 𝑌 is also a feasible solution for 𝑋.


Proof. For the first claim, it is sufficient to show that, if 𝑋 00 is a set of points obtained from 𝑋 by
deleting a single redundant point 𝑝 𝑖 , then OPT(𝑋 00) ≤ OPT(𝑋). Let 𝑅 denote the row on which
point 𝑝 𝑖 lies, and let 𝑅0 be the row containing 𝑝 𝑖−1 . Let 𝑌 be the optimal solution to instance
𝑋. We assume w.l.o.g. that 𝑌 is a canonical solution. Consider the set 𝑍 = 𝑋 ∪ 𝑌 of point, and
let 𝑍0 be obtained from 𝑍 by collapsing the rows 𝑅, 𝑅0 into the row 𝑅0 (since 𝑌 is a canonical
solution for 𝑋, no points of 𝑋 ∪ 𝑌 lie strictly between the rows 𝑅, 𝑅0). From Observation 2.4, 𝑍0

                        T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                                   48
              P INNING D OWN THE S TRONG W ILBER -1 B OUND FOR B INARY S EARCH T REES

remains a satisfied point set. Setting 𝑌 0 = 𝑍0 \ 𝑋 00, it is easy to verify that 𝑌 0 is a feasible solution
to instance 𝑋 00, and moreover, |𝑌 0 | ≤ |𝑌|. Therefore, OPT(𝑋 00) ≤ |𝑌 0 | ≤ |𝑌| ≤ OPT(𝑋).
As for the second claim, it is sufficient to show that, if 𝑋 00 is a set of points obtained from 𝑋 by
deleting a single redundant point 𝑝 𝑖 , and 𝑌 is any canonical solution for 𝑋 00, then 𝑌 is also a
feasible solution for 𝑋. We can then apply this argument iteratively, until we obtain a set of
points that is in a reduced form.
Consider any feasible canonical solution 𝑌 to instance 𝑋 00. We claim that 𝑋 ∪ 𝑌 is a feasible set
of points. Indeed, consider any two points 𝑝, 𝑞 ∈ 𝑋 ∪ 𝑌 that are not aligned. If both points are
distinct from the point 𝑝 𝑖 , then they must be satisfied in 𝑋 ∪ 𝑌, since both these points lie in
𝑋 00 ∪ 𝑌. Therefore, we can assume that 𝑝 = 𝑝 𝑖 . Notice that 𝑞 ≠ 𝑝 𝑖−1 and 𝑞 ≠ 𝑝 𝑖+1 , since otherwise
𝑝 and 𝑞 must be aligned. Moreover, 𝑞 cannot lie strictly between the row of 𝑝 𝑖−1 and the row of
𝑝 𝑖+1 , as we have assumed that every point of 𝑌 lies on a row that is active for 𝑋 0 𝑋 00. But then
it is easy to verify that either point 𝑝 𝑖−1 lies in 𝑝,𝑞 (if 𝑞 is below 𝑝), or point 𝑝 𝑖+1 lies in 𝑝,𝑞
(otherwise). In either case, the pair (𝑝, 𝑞) is satisfied in 𝑋 ∪ 𝑌.                                   


From Lemma 6.1, whenever we need to solve the Min-Sat problem on an instance 𝑋, it is sufficient
to solve it on a sub-instance, obtained by iteratively removing redundant points from 𝑋. We
obtain the following immediate corollary of Lemma 6.1.

Corollary 6.2. Let 𝑋 be a semi-permutation, and let 𝑋 0 ⊆ 𝑋 be any point set, that is obtained from 𝑋
by repeatedly removing redundant points. Let 𝑌 be any special feasible solution for 𝑋 0. Then 𝑌 is also a
special feasible solution for 𝑋.


Lastly, we need the following lemma, which is a simple application of the Wilber bound.

Lemma 6.3. Let 𝑋 be a point set that is a semi-permutation in reduced form. Then OPT(𝑋) ≥ |𝑋 |/4 − 1.


Proof. Since 𝑋 is a semi-permutation, every point of 𝑋 lies on a distinct row; we denote |𝑋 | = 𝑛.
Let 𝑋 = {𝑝 1 , . . . , 𝑝 𝑛 }, where the points are indexed in the increasing order of their 𝑦-coordinates.
Let Π = {(𝑝 𝑖 , 𝑝 𝑖+1 ) | 1 ≤ 𝑖 < 𝑛} be the collection of all consecutive pairs of points in 𝑋. We say
that the pair (𝑝 𝑖 , 𝑝 𝑖+1 ) is bad iff both 𝑝 𝑖 and 𝑝 𝑖+1 lie on the same column. From the definition of
the reduced form, if (𝑝 𝑖 , 𝑝 𝑖+1 ) is a bad pair, then both (𝑝 𝑖−1 , 𝑝 𝑖 ) and (𝑝 𝑖+1 , 𝑝 𝑖+2 ) are good pairs.
Let Π0 ⊆ Π be the subset containing all good pairs. Then |Π0 | ≥ (|Π| − 1)/2 ≥ 𝑛/2 − 1. Next, we
select a subset Π00 ⊆ Π0 of pairs, such that |Π00 | ≥ |Π0 |/2 ≥ 𝑛/4 − 1, and every point in 𝑋 belongs
to at most one pair in Π00. Since every point in 𝑋 belongs to at most two pairs in Π0, it is easy to
see that such a set exists. Let 𝑌 be an optimal solution to instance 𝑋.
Consider now any pair (𝑝 𝑖 , 𝑝 𝑖+1 ) of points in Π00. Then there must be a point 𝑦 𝑖 ∈ 𝑌 that lies in
the rectangle 𝑝 𝑖 ,𝑝 𝑖+1 . Moreover, since all points of 𝑋 lie on distinct rows, and each such point
belongs to at most one pair in Π00, for 𝑖 ≠ 𝑗, 𝑦 𝑖 ≠ 𝑦 𝑗 . Therefore, |𝑌| ≥ |Π00 | ≥ 𝑛/4 − 1.        

                        T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                                 49
              PARINYA C HALERMSOOK , J ULIA C HUZHOY, AND T HATCHAPHOL S ARANURAK

6.3   Our algorithm



Suppose we are given an input set 𝑋 of points that is a semi-permutation. Let 𝑇 be any
partitioning tree for 𝑋. We say that 𝑇 is a balanced partitioning tree for 𝑋 iff for every non-leaf
vertex 𝑣 ∈ 𝑉(𝑇) and its children 𝑣 1 , 𝑣2 , the number of active columns inside 𝑆(𝑣 1 ) and 𝑆(𝑣 2 ) are
roughly the same, that is, 𝑐(𝑋 ∩ 𝑆(𝑣 𝑖 )) ≤ d𝑐(𝑋 ∩ 𝑆(𝑣))/2e for each 𝑖 = 1, 2.
Given a partitioning tree 𝑇, we denote by Λ𝑖 the set of all vertices of 𝑇 that lie in the 𝑖th layer
of 𝑇 – that is, the vertices whose distance from the root of 𝑇 is 𝑖 (so the root belongs to Λ0 ).
The height of the tree 𝑇, denoted by height(𝑇), is the largest index 𝑖 such that Λ𝑖 ≠ ∅. If the
height of the tree 𝑇 is ℎ, then we call the set Λ dℎ/2e of vertices the middle layer of 𝑇. Notice that,
if 𝑇 is a balanced partitioning tree for input 𝑋, then its height is at most 2 log 𝑐(𝑋). The strips
{𝑆(𝑣)} 𝑣∈Λ dℎ/2e are called the middle-layer strips.
Our algorithm takes as input a set 𝑋 of points that is a semi-permutation, a balanced partition
tree 𝑇 for 𝑋, and an integral parameter 𝜌 > 0.
Intuitively, the algorithm uses the splitting operation to partition the instance 𝑋 into subinstances
that are then solved recursively, until it obtains a collection of instances whose corresponding
partitioning trees have height at most 𝜌. We then employ dynamic programming. The algorithm
returns a special feasible solution for the instance. Recall that the height of the tree 𝑇 is bounded
by 2 log 𝑐(𝑋) ≤ 2 log 𝑛. The following theorem (whose proof appears in Section 6.6) is used as
a recursion basis.




Theorem 6.4. There is an algorithm called LeafBST that, given a semi-permutation instance 𝑋 of
Min-Sat in reduced form, and a partitioning tree 𝑇 for it, produces a feasible 𝑇-special solution for 𝑋 of
cost at most 2|𝑋 | + 2OPT(𝑋), in time |𝑋 | 𝑂(1) · 𝑐(𝑋)𝑂(𝑐(𝑋)) .




We now provide a schematic description of our algorithm.

                       T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                            50
               P INNING D OWN THE S TRONG W ILBER -1 B OUND FOR B INARY S EARCH T REES

 RecursiveBST(𝑋 , 𝑇, 𝜌)

      1. Keep removing redundant points from 𝑋 until 𝑋 is in reduced form.

      2. If 𝑇 has height at most 𝜌,

      3.    return LeafBST(𝑋 , 𝑇)

      4. Let ℒ 0 ⊆ ℒ be the strip boundaries of the middle-layer strips {𝑆1 , . . . , 𝑆 𝑘 }
         of 𝑇.

      5. Compute the split (𝑋
                            e , {𝑋 𝑗 } 𝑗∈[𝑘] ) of 𝑋 by ℒ 0.

      6. Compute the corresponding split subtrees (𝑇,
                                                   e {𝑇𝑗 } 𝑗∈[𝑘] ) of 𝑇 by ℒ 0.

      7. For 𝑗 ∈ [𝑘], call to RecursiveBST with input (𝑋 𝑗 , 𝑇𝑗 , 𝜌), and let 𝑌𝑗 be the
         solution returned by it.

      8. Call RecursiveBST with input (𝑋   e 𝜌), and let 𝑌ˆ be the solution returned
                                       e , 𝑇,
         by it.

      9. Let 𝑍 be a point set containing, for each 𝑗 ∈ [𝑘], for each point 𝑝 ∈ 𝑋 𝑗 ,
         two copies 𝑝 0 and 𝑝 00 of 𝑝 with 𝑝 0 .𝑦 = 𝑝 00 .𝑦 = 𝑝.𝑦, where 𝑝 0 lies on the left
         boundary of 𝑆 𝑗 , and 𝑝 00 lies on the right boundary of 𝑆 𝑗 .

   10. return 𝑌 ∗ = 𝑍 ∪ 𝑌ˆ ∪ ( 𝑗∈[𝑘] 𝑌𝑗 )
                              Ð


The cost and feasibility analyses appear in the next two subsections.


6.4    Cost analysis

In order to analyze the solution cost, consider the final solution 𝑌 ∗ to the input instance 𝑋. We
distinguish between two types of points in 𝑌 ∗ : a point 𝑝 ∈ 𝑌 ∗ is said to be of type 2 if it was
added to the solution by Algorithm LeafBST, and otherwise we say that it is of type 1. We start
by bounding the number of points of type 1 in 𝑌 ∗ .

Claim 6.5. The number of points of type 1 in the solution 𝑌 ∗ to the original instance 𝑋 is at most
𝑂(log(height(𝑇)/𝜌)) · OPT(𝑋).

Proof. Observe that the number of recursive levels is bounded by 𝜆 = 𝑂(log(height(𝑇)/𝜌)). This
is since, in every recursive level, the heights of all trees decrease by a constant factor, and we
terminate the algorithm once the tree heights are bounded by 𝜌. For each 1 ≤ 𝑖 ≤ 𝜆, let 𝒳𝑖 be the
collection of all instances in the 𝑖th recursive level, where the instances are in the reduced form.
Notice that the only points that are added to the solution by Algorithm RecursiveBST directly
are the points in the sets 𝑍. The number of such points added at recursive level 𝑖 is bounded by

                        T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                     51
               PARINYA C HALERMSOOK , J ULIA C HUZHOY, AND T HATCHAPHOL S ARANURAK


    𝑋 0 ∈𝒳𝑖 2|𝑋 |. It is now sufficient to show that for all 1 ≤ 𝑖 ≤ 𝜆,   𝑋 0 ∈𝒳𝑖 |𝑋 | ≤ 𝑂(OPT(𝑋)). We do
Í              0                                                        Í           0

so using the following observation.
Observation 6.6. For all 1 ≤ 𝑖 ≤ 𝜆,                      0
                                        Í
                                            𝑋 0 ∈𝒳𝑖 OPT(𝑋 ) ≤ OPT(𝑋).

Assume first that the observation is correct. From Lemma 6.3, |𝑋 0 | ≤ 𝑂(OPT(𝑋 0)). Therefore,
the number of type-1 points added to the solution at recursive level 𝑖 is bounded by 𝑂(OPT(𝑋)).
We now turn to prove Observation 6.6.

Proof of Observation 6.6. The proof is by induction on the recursive level 𝑖. It is easy to see that
the claim holds for 𝑖 = 1, since, from Lemma 6.1, removing redundant points from 𝑋 to turn it
into reduced form cannot increase OPT(𝑋).
Assume   now that the claim holds for level 𝑖 − 1, and consider some level-𝑖 instance 𝑋 0 ∈ 𝒳𝑖 . Let
(𝑋 , 𝑋 𝑗 𝑗∈[𝑘] ) be the split of 𝑋 0 that we computed. Then, from Theorem 3.3, 𝑗∈[𝑘] OPT(𝑋 𝑗 ) +
                                                                                Í
 e
OPT(𝑋)
    e ≤ OPT(𝑋 0). Since, from Lemma 6.1, removing redundant points from an instance does
not increase its optimal solution cost, the observation follows.                                            

                                                                                                            

In order to obtain an efficient 𝑂(log log 𝑛)-approximation algorithm, we set 𝜌 to be a constant (it
can even be set to 1), and we use algorithm LeafBST whenever the algorithm calls to subroutine
LeafBST. Observe that the depth of the recursion is now bounded by 𝑂(log log 𝑛), and so the
total number of type-1 points in the solution is bounded by 𝑂(log log 𝑛) · OPT(𝑋). Let ℐ denote
the set of all instances to which Algorithm LeafBST is applied. Using the same arguments as
in Claim 6.5, 𝑋 0 ∈ℐ |𝑋 0 | = 𝑂(OPT(𝑋)). The number of type-2 points that Algorithm LeafBST
                Í
adds to the solution for each instance 𝑋 0 ∈ ℐ is bounded by 𝑂(OPT(𝑋 0) + |𝑋 0 |) = 𝑂(|𝑋 0 |).
Therefore, the total number of type-2 points in the solution is bounded by 𝑂(OPT(𝑋)). Overall,
we obtain a solution of cost at most 𝑂(log log 𝑛) · OPT(𝑋), and the running time of the algorithm
is polynomial in |𝑋 |.
Finally, in order to obtain the subexponential time algorithm, we set the parameter 𝜌 to be such
that the recursion depth is bounded by 𝐷. Since the number of active columns in instance
𝑋 is 𝑐(𝑋), and the height of the partitioning tree 𝑇 is bounded by 2 log 𝑐(𝑋), whilethe depth
                                                                                      log 𝑐(𝑋)       log 𝑐(𝑋)
of the recursion is at most 2 log(height(𝑇)/𝜌), it is easy to verify that 𝜌 = 𝑂         2𝐷/2
                                                                                                 =     2Ω(𝐷)
                                                                                                              .
As before, let ℐ be the set of all instances to which Algorithm LeafBST is applied. Using
the same arguments as in Claim 6.5, 𝑋 0 ∈ℐ (|𝑋 0 | + OPT(𝑋 0)) = 𝑂(OPT(𝑋)). For each such
                                        Í
instance 𝑋 0, Algorithm LeafBST produces a solution of cost 𝑂(|𝑋 0 | + OPT(𝑋 0)). Therefore,
the total number of type-2 points in the final solution is bounded by 𝑂(OPT(𝑋)). The total
number of type-1 points in the solution is therefore bounded by 𝑂(𝐷) · OPT(𝑋) as before.
Therefore, the algorithm produces a factor-𝑂(𝐷)-approximate solution. Finally, in order to
analyze the running time of the algorithm, we first bound the running time of all calls to
procedure LeafBST. The number of such calls is bounded by |𝑋 |. Consider now some instance

                        T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                               52
                P INNING D OWN THE S TRONG W ILBER -1 B OUND FOR B INARY S EARCH T REES

𝑋 0 ∈ ℐ, and its corresponding partitioning tree 𝑇 0. Since the height of 𝑇 0 is bounded by 𝜌,
we get that 𝑐(𝑋 0) ≤ 2𝜌 ≤ 2log 𝑐(𝑋)/2
                                     Ω(𝐷)               Ω(𝐷)
                                          ≤ (𝑐(𝑋))1/2 . Therefore, the running time of LeafBST
on instance 𝑋  0                      0 𝑂(1) · (𝑐(𝑋 0 ))𝑂(𝑐(𝑋 0 )) ≤ |𝑋 0 | 𝑂(1) · exp 𝑂(𝑐(𝑋 0 ) log 𝑐(𝑋 0 ) ≤
                                                                                                            
              is bounded by |𝑋 |
|𝑋 0 | 𝑂(1) · exp 𝑐(𝑋)1/2
                               Ω(𝐷)
                                      · log 𝑐(𝑋) .

The running time of the remainder of the algorithm, excluding the calls to LeafBST, is bounded
by poly(|𝑋 |). We conclude that the total running time of the algorithm is bounded by
                                                                                                       
                        𝑂(1)                 1/2Ω(𝐷)                                  1/2Ω(𝐷)
                 |𝑋 |          · exp 𝑐(𝑋)              · log 𝑐(𝑋) ≤ poly(𝑚) · exp 𝑛             · log 𝑛


6.5     Feasibility

We start by showing that the solution that the algorithm returns is 𝑇-special.

Observation 6.7. Assuming that LeafBST(𝑋 , 𝑇) returns a 𝑇-special solution, the solution 𝑌 ∗
returned by Algorithm RecursiveBST(𝑋 , 𝑇, 𝜌) is a 𝑇-special solution.

Proof. The proof is by induction on the recursion depth. The base of the induction is the calls to
Procedure LeafBST(𝑋 , 𝑇), which return 𝑇-special solutions by our assumption. Consider now
some call to Algorithm RecursiveBST(𝑋 , 𝑇, 𝜌). From the induction hypothesis, the resulting
solution 𝑌ˆ for instance 𝑋 e is 𝑇-special,
                                e          and, for every strip 𝑗 ∈ [𝑘], the resulting solution 𝑌𝑗 for
instance 𝑋 𝑗 is 𝑇𝑗 -special. Since both 𝑇
                                        e and every tree 𝑇𝑗          are subtrees of 𝑇, and since the
                                                           
                                                               𝑗∈[𝑘]
points of 𝑍 lie on boundaries of strips in 𝑆 𝑗 𝑗∈[𝑘] , the final solution 𝑌 ∗ is 𝑇-special.
                                                          
                                                                                                              

We next turn to prove that the solution 𝑌 ∗ computed by Algorithm RecursiveBST(𝑋 , 𝑇, 𝜌) is
feasible. In order to do so, we will use the following immediate observation.

Observation 6.8. Let 𝑌 ∗ be the solution returned by Algorithm RecursiveBST(𝑋 , 𝑇, 𝜌), and let
𝑗 ∈ [𝑘] be any strip index. Then:

      • Any point 𝑦 ∈ 𝑌 ∗ that lies in the interior of 𝑆 𝑗 must lie on an active row of instance 𝑋 𝑗 .

      • Any point 𝑦 ∈ 𝑌 ∗ that lies on the boundary of 𝑆 𝑗 must belong to in 𝑌ˆ ∪ 𝑍. Moreover, the
        points of 𝑌ˆ ∪ 𝑍 may not lie in the interior of 𝑆 𝑗 .

      • If 𝑅 is an active row for instance 𝑋 𝑗 , then set 𝑍 contains two points, lying on the intersection
        of 𝑅 with the left and the right boundaries of 𝑆 𝑗 , respectively.

We are now ready to prove that the algorithm returns feasible solutions. In the following proof,
when we say that a row 𝑅 is an active row of strip 𝑆 𝑗 (or equivalently of instance 𝑋 𝑗 ), we mean
that some (input) point of instance 𝑋 𝑗 lies on row 𝑅.

                               T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                           53
              PARINYA C HALERMSOOK , J ULIA C HUZHOY, AND T HATCHAPHOL S ARANURAK

Theorem 6.9. Assume that the recursive calls to Algorithm RecursiveBST return a feasible special
solution 𝑌ˆ for instance 𝑋,
                         e and for each 𝑖 ∈ [𝑘], a feasible special solution 𝑌𝑗 for the strip instance 𝑋 𝑗 .
Then the point set 𝑌 = 𝑍 ∪ 𝑌ˆ ∪ ( 𝑗∈[𝑘] 𝑌𝑗 ) is a feasible solution for instance 𝑋.
                     ∗           Ð


Proof. It would be convenient for us to consider the set of all points in 𝑋
                                                                          e ∪ 𝑋 ∪ 𝑌 ∗ simultaneously.
In order to do so, we start with the set 𝑋 ∪ 𝑌 of points. For every 𝑗 ∈ [𝑘], we select an arbitrary
                                               ∗

active column 𝐶 𝑗 in strip 𝑆 𝑗 , and we then add a copy of every point 𝑝 ∈ 𝑋 𝑗 to column 𝐶 𝑗 . The
resulting set of points, obtained after processing all strips 𝑆 𝑗 is identical to the set 𝑋
                                                                                          e of points
(except possibly for horizontal spacing between active columns), and we do not distinguish
between them.
Consider any pair of points 𝑝, 𝑞 that lie in 𝑌 ∗ ∪ 𝑋, which are not aligned. Our goal is to prove
that some point 𝑟 ≠ 𝑝, 𝑞 with 𝑟 ∈ 𝑋 ∪ 𝑌 ∗ lies in 𝑝,𝑞 . We assume w.l.o.g. that 𝑝 lies to the left of
𝑞. We also assume that 𝑝.𝑦 < 𝑞.𝑦 (that is, point 𝑝 is below point 𝑞); the other case is symmetric.
Assume first that at least one of the two points (say 𝑝) lies in the interior of a strip 𝑆 𝑗 , for some
𝑗 ∈ [𝑘]. We then consider two cases. First, if 𝑞 also lies in the interior of the same strip, then
𝑝, 𝑞 ∈ 𝑋 𝑗 ∪ 𝑌𝑗 , and, since we have assumed that 𝑌𝑗 is a feasible solution for instance 𝑋 𝑗 , the two
points are satisfied in 𝑋 𝑗 ∪ 𝑌𝑗 , and hence in 𝑋 ∪ 𝑌 ∗ . Otherwise, 𝑞 does not lie in the interior of
strip 𝑆 𝑗 . Then, from Observation 6.8, if 𝑅 is the row on which point 𝑝 lies, then 𝑅 is an active row
for instance 𝑋 𝑗 , and the point that lies on the intersection of the row 𝑅 and the right boundary
of strip 𝑆 𝑗 was added to 𝑍. This point satisfies the pair (𝑝, 𝑞).
Therefore, we can now assume that both 𝑝 and 𝑞 lie on boundaries of strips 𝑆 𝑗 | 𝑗 ∈ [𝑘] . Since
                                                                                     
every pair of consecutive strips share a boundary, and since, from the above assumptions, 𝑝 lies
to the left of 𝑞, we can choose the strips 𝑆 𝑗 and 𝑆 𝑙 , such that 𝑝 lies on the left boundary of 𝑆 𝑗 ,
and 𝑞 lies on the right boundary of 𝑆 𝑙 . Notice that it is possible that 𝑆 𝑗 = 𝑆 𝑙 .
Notice that, if 𝑝 lies on a row that is active for strip 𝑆 𝑗 , then a point that lies on the same row
and belongs to the right boundary of the strip 𝑆 𝑗 has been added to 𝑍; this point satisfies the
pair (𝑝, 𝑞). Similarly, if 𝑞 lies on a row that is active for strip 𝑆 𝑙 , the pair (𝑝, 𝑞) is satisfied by a
point of 𝑍 that lies on the same row and belongs to the left boundary of 𝑆 𝑙 .
Therefore, it remains to consider the case where point 𝑝 lies on a row that is inactive for 𝑆 𝑗 , and
point 𝑞 lies on a row that is inactive for 𝑆 𝑙 . From Observation 6.8, both 𝑝 and 𝑞 belong to 𝑌ˆ ∪ 𝑍.
The following observation will be useful for us. Recall that the points of 𝑋
                                                                           e are not included in
𝑋 ∪𝑌 .∗


Observation 6.10. Assume that there is some point 𝑟 ≠ 𝑝, 𝑞, such that 𝑟 ∈ 𝑝,𝑞 , and 𝑟 ∈ 𝑋.
                                                                                         e Then
the pair (𝑝, 𝑞) is satisfied in set 𝑋 ∪ 𝑌 .
                                         ∗



Proof. Since 𝑟 ∈ 𝑋,
                 e and 𝑟 ∈ 𝑝,𝑞 , there must be some strip 𝑆 𝑖 , that lies between strips 𝑆 𝑗 and 𝑆 𝑙
(where 𝑗 ≤ 𝑖 ≤ 𝑙), such that point 𝑟 lies on the column 𝐶 𝑖 (recall that this is the unique active
column of 𝑆 𝑖 that may contain points of 𝑋).
                                           e But then the row 𝑅 to which point 𝑟 belongs is

                       T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                              54
              P INNING D OWN THE S TRONG W ILBER -1 B OUND FOR B INARY S EARCH T REES

an active row for strip 𝑆 𝑖 . Therefore, two points, lying on the intersection of 𝑅 with the two
boundaries of strip 𝑆 𝑖 were added to 𝑍, and at least one of these points must lie in 𝑝,𝑞 . Since
𝑍 ⊆ 𝑌 ∗ , the observation follows.                                                               

From the above observation, it is sufficient to show that some point 𝑟 ∈ 𝑌ˆ ∪ 𝑍 ∪ 𝑋
                                                                                  e that is distinct
from 𝑝 and 𝑞, lies in 𝑝,𝑞 . We distinguish between three cases.
The first case happens when 𝑝, 𝑞 ∈ 𝑌.  ˆ Since set 𝑌ˆ is a feasible solution for instance 𝑋,
                                                                                          e there is
                                                                  ˆ
some point 𝑟 ∈ 𝑝,𝑞 that is distinct from 𝑝 and 𝑞, and lies in 𝑌 ∪ 𝑋. e

The second case happens when neither 𝑝 nor 𝑞 lie in 𝑌,     ˆ so both 𝑝, 𝑞 ∈ 𝑍. Consider strip 𝑆 𝑗−1
lying immediately to the left of strip 𝑆 𝑗 . Since 𝑝 lies on a row that is inactive for strip 𝑆 𝑗 , but
𝑝 ∈ 𝑍, such a strip must exist, and moreover, the row 𝑅 to which 𝑝 belongs must be active for
strip 𝑆 𝑗−1 . Therefore, the point lying on the intersection of the column 𝐶 𝑗−1 (the unique active
column of 𝑆 𝑗−1 containing points of 𝑋) e and 𝑅 belongs to 𝑋;e we denote this point by 𝑝 0.
Similarly, consider strip 𝑆 𝑙+1 lying immediately to the right of 𝑆 𝑙 . Since 𝑞 lies on a row that is
inactive for strip 𝑆 𝑙 , but 𝑞 ∈ 𝑍, such a strip must exist, and moreover, the row 𝑅0 to which 𝑞
belongs must be active for strip 𝑆 𝑙+1 . Therefore, the point lying on the intersection of 𝐶 𝑙+1 and
𝑅0 belongs to 𝑋;
               e we denote this point by 𝑞 0.

Since the set 𝑋   e ∪ 𝑌ˆ of points is satisfied, some point 𝑟 ∈ 𝑋
                                                                e ∪ 𝑌ˆ that is distinct from 𝑝 0 and 𝑞 0,
lies in 𝑝0 ,𝑞0 . Moreover, from Observation 2.3, we can choose this point so that it lies on the
boundary of the rectangle 𝑝0 ,𝑞0 . Assume first that 𝑟 lies on the left boundary of this rectangle.
Then, since 𝑌ˆ is a special solution for instance 𝑋,  e 𝑟∈𝑋  e must hold. If 𝑅00 denotes the row on
which 𝑟 lies, then 𝑅 is an active row for strip 𝑆 𝑗−1 , and so a point that lies on the intersection of
                        00

row 𝑅00 and the right boundary of 𝑆 𝑗−1 belongs to 𝑍. That point satisfies 𝑝,𝑞 . The case where 𝑟
lies on the right boundary of 𝑝0 ,𝑞0 is treated similarly.
Assume now that 𝑟 lies on the top or the bottom boundary of 𝑝0 ,𝑞0 , but not on one of its corners.
Then, since solution 𝑌ˆ is special for 𝑋,
                                       e point 𝑟 must lie in the rectangle 𝑝,𝑞 . Moreover, since we
have assumed that neither 𝑝 nor 𝑞 lie in 𝑌, ˆ 𝑟 ≠ 𝑝, 𝑞. But then 𝑟 ∈ 𝑋e ∪ 𝑌ˆ lies in 𝑝,𝑞 \ {𝑝, 𝑞} and
by Observation 6.10, pair (𝑝, 𝑞) is satisfied in 𝑋 ∪ 𝑌 .
                                                       ∗


The third case happens when exactly one of the two points (say 𝑝) lies in 𝑌,        ˆ and the other point
                    ˆ
does not lie in 𝑌, so 𝑞 ∈ 𝑍 must hold. We define the point 𝑞 ∈ 𝑋 exactly as in the second
                                                                       0    e
case. Since 𝑝, 𝑞 0 ∈ 𝑋   e ∪ 𝑌,ˆ there must be a point 𝑟 ∈ 𝑝,𝑞0 \ {𝑝, 𝑞 0 } that lies in 𝑋  e ∪ 𝑌.ˆ From
Observation 6.8, we can choose the point 𝑟, so that it lies on the left or on the bottom boundary
of 𝑝,𝑞0 (that is, its 𝑥- or its 𝑦-coordinate is aligned with the point 𝑝). If 𝑟 lies on the left boundary
of 𝑝,𝑞0 , then it also lies in 𝑝,𝑞 , and from Observation 6.10, pair (𝑝, 𝑞) is satisfied in 𝑋 ∪ 𝑌 ∗ . If it
lies on the bottom boundary of 𝑝,𝑞0 , but it is not the bottom right corner of the rectangle, then,
using the same reasoning as in Case 2, it must lie in 𝑝,𝑞 , and it is easy to see that 𝑟 ≠ 𝑞. Lastly,
if 𝑟 is the bottom right corner of 𝑝,𝑞0 , then 𝑟 ∈ 𝑋.e As before, there is a copy of 𝑟, that lies on the
left boundary of strip 𝑆 𝑙+1 and belongs to 𝑍, that satisfies the pair (𝑝, 𝑞).                             

                       T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                               55
              PARINYA C HALERMSOOK , J ULIA C HUZHOY, AND T HATCHAPHOL S ARANURAK

6.6     Leaf instances (proof of Theorem 6.4)

The goal of this subsection is to prove Theorem 6.4. For convenience, given an input instance
𝑋 of Min-Sat, we denote 𝑟(𝑋) = 𝑚 and 𝑐(𝑋) = 𝑛. Our goal is to compute an optimal canonical
solution 𝑌 for 𝑋 in time poly(𝑚) · 𝑛 𝑂(𝑛) . The solution can then be turned into a special one using
the following observation.

Observation 6.11. There is an algorithm, that, given a set 𝑋 of points that is a semi-permutation,
and a canonical solution 𝑌 for 𝑋, computes a special solution 𝑌 0 for 𝑋, such that |𝑌 0 | ≤ 2|𝑋 | +2|𝑌|.

Proof. We construct 𝑌 0 as follows: For each point 𝑝 ∈ 𝑋 ∪ 𝑌, we add two points, 𝑝 0 and 𝑝 00 to 𝑌 0,
whose 𝑦-coordinate is the same as that of 𝑝, such that 𝑝 0 and 𝑝 00 lie on the lines of ℒ appearing
immediately to the left and immediately to the right of 𝑝, respectively. It is easy to verify that
|𝑌 0 | = 2|𝑋 | + 2|𝑌| and that 𝑌 0 is a special solution.
We now verify that 𝑋 ∪ 𝑌 0 is a satisfied set of points. Consider two points 𝑝, 𝑞 in 𝑋 ∪ 𝑌 0, such that
𝑝.𝑥 < 𝑞.𝑥. Notice that 𝑝 is either an original point in 𝑋 or it is a copy of some point 𝑝ˆ ∈ 𝑋 ∪ 𝑌.
If 𝑝 ∈ 𝑋 or 𝑝 = 𝑝ˆ 0 for 𝑝ˆ ∈ 𝑋 ∪ 𝑌, then the point 𝑝ˆ 00 lies in the rectangle 𝑝,𝑞 . Therefore, we can
assume that 𝑝 = 𝑝ˆ 00 for some point 𝑝ˆ ∈ 𝑋 ∪ 𝑌. By a similar reasoning, we can assume that 𝑞 = 𝑞ˆ 0
for some point 𝑞ˆ ∈ 𝑋 ∪ 𝑌. Since 𝑋 ∪ 𝑌 is a satisfied point set, there must be a point 𝑟 ∈ 𝑋 ∪ 𝑌
that lies in the rectangle 𝑝, ˆ 𝑞ˆ . From Observation 2.3, we can choose 𝑟 such that either 𝑟.𝑥 = 𝑝.𝑥   ˆ
or 𝑟.𝑦 = 𝑝.𝑦.
           ˆ    In either case, point 𝑟 also lies in 𝑝ˆ 00 , 𝑞ˆ 0 = 𝑝,𝑞 , so (𝑝, 𝑞) is satisfied by 𝑋 ∪ 𝑌 0.
                                           00

Therefore, 𝑋 ∪ 𝑌 0 is a satisfied set of points.                                                           

This observation allows us to turn the optimal canonical solution into a special solution of cost at
most 2|𝑋 | + 2|𝑌| ≤ 2|𝑋 | + 2OPT(𝑋), in time poly(|𝑋 |). We start by providing several definitions
and structural observations that will be helpful in designing the algorithm.


6.6.1    Conflicting sets

Our algorithm uses the notion of conflicting point sets, defined as follows.

Definition 6.12 (Conflicting Sets). Let 𝑍 and 𝑍0 be sets of points. We say that 𝑍 and 𝑍0 are
conflicting if 𝑍 ∪ 𝑍0 is not satisfied.

The following definition is central to our algorithm.

Definition 6.13 (Top representation). Let 𝑍 be any set of points. A top representation of 𝑍, that
we denote by top(𝑍), is a subset 𝑍0 ⊆ 𝑍 of points, obtained as follows: for every column 𝐶 that
contains points from 𝑍, we add the topmost point of 𝑍 that lies on 𝐶 to 𝑍0.

Observation 6.14. Let top(𝑍) ⊆ 𝑍 be the top representation of 𝑍, and let 𝑅 be a row lying strictly
above all points in 𝑍. Let 𝑌 be any set of points lying on row 𝑅. Then 𝑡𝑜𝑝(𝑍) is conflicting with
𝑌 if and only if 𝑍 is conflicting with 𝑌.

                        T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                               56
              P INNING D OWN THE S TRONG W ILBER -1 B OUND FOR B INARY S EARCH T REES

Proof. Assume that 𝑌 is conflicting with 𝑍, and let 𝑝 ∈ 𝑌, 𝑞 ∈ 𝑍 be a pair of points, such that no
point of 𝑌 ∪ 𝑍 lies in 𝑝,𝑞 \ {𝑝, 𝑞}. But then 𝑞 ∈ top(𝑍) must hold, and no point of top(𝑍) ∪ 𝑌 lies
in 𝑝,𝑞 \ {𝑝, 𝑞}, so top(𝑍) and 𝑌 are conflicting. Assume now that top(𝑍) and 𝑌 are conflicting,
and let 𝑝 ∈ 𝑌, 𝑞 ∈ top(𝑍) be a pair of points, such that no point of 𝑌 ∪ top(𝑍) lies in 𝑝,𝑞 \ {𝑝, 𝑞}.
But then no point of 𝑌 ∪ 𝑍 lies in 𝑝,𝑞 \ {𝑝, 𝑞}, and, since top(𝑍) ⊆ 𝑍, sets top(𝑍) and 𝑌 are
conflicting.                                                                                        


6.6.2   The setup

Let 𝑋 be the input point set, that is a semi-permutation. We denote by ℛ = {𝑅 1 , . . . , 𝑅 𝑚 } the set
of all active rows for 𝑋, and we assume that they are indexed in their natural bottom-to-top
order. We denote by 𝒞 = {𝐶1 , . . . , 𝐶 𝑛 } the set of all active columns of 𝑋, and we assume that
they are indexed in their natural left-to-right order. We also denote 𝑋 = {𝑝 1 , . . . , 𝑝 𝑚 }, where for
all 1 ≤ 𝑖 ≤ 𝑚, 𝑝 𝑖 is the unique point of 𝑋 lying on row 𝑅 𝑖 . For an index 1 ≤ 𝑡 ≤ 𝑚, we denote by
ℛ ≤𝑡 = {𝑅 1 , . . . , 𝑅 𝑡 }, and we denote by 𝑋≤𝑡 = {𝑝 1 , . . . , 𝑝 𝑡 }.
Note that, if 𝑌 is a feasible solution to instance 𝑋, then for all 1 ≤ 𝑡 ≤ 𝑚, the set 𝑋≤𝑡 ∪ 𝑌≤𝑡
of points must be satisfied (here 𝑌≤𝑡 is the set of all points of 𝑌 lying on rows of ℛ ≤𝑡 .) Our
dynamic programming-based algorithm constructs the optimal solution row-by-row, using this
observation. We use height profiles, that we define next, as the “states” of the dynamic program.
A height profile 𝜋 assigns, to every column 𝐶 𝑖 ∈ 𝒞, a value 𝜋(𝐶 𝑖 ) ∈ {1, . . . , 𝑛, ∞}. Let Π be the set
of all possible height profiles, so |Π| ≤ 𝑛 𝑂(𝑛) . For a profile 𝜋 ∈ Π, we denote by 𝑀(𝜋) the largest
value of 𝜋(𝐶 𝑖 ) for any column 𝐶 𝑖 ∈ 𝒞 that is not ∞. Given a height profile 𝜋, let 𝒞(𝜋) ⊆ 𝒞 be the
set of columns 𝐶 𝑖 with 𝜋(𝐶 𝑖 ) < ∞, and let 𝒞 0(𝜋) = 𝒞 \ 𝒞(𝜋). We can then naturally associate an
ordering 𝜌𝜋 of the columns in 𝒞(𝜋) with 𝜋 as follows: for columns 𝐶 𝑖 , 𝐶 𝑗 ∈ 𝒞(𝜋), 𝐶 𝑖 appears
before 𝐶 𝑗 in 𝜋 iff either (i) 𝜋(𝐶 𝑖 ) < 𝜋(𝐶 𝑗 ); or (ii) 𝜋(𝐶 𝑖 ) = 𝜋(𝐶 𝑗 ) and 𝑖 < 𝑗.
Consider now any point set 𝑍, where every point lies on a column of 𝒞. Let 𝑍0 = top(𝑍), and
let 𝜎(𝑍) denote the ordering of the points in 𝑍0, such that 𝑝 appears before 𝑝 0 in 𝜎 iff either (i)
𝑝.𝑦 < 𝑝 0 .𝑦; or (ii) 𝑝.𝑦 = 𝑝 0 .𝑦 and 𝑝.𝑥 < 𝑝 0 .𝑥. Consider now any profile 𝜋 ∈ Π. We say that point
set 𝑍 is consistent with profile 𝜋 (see Figure 8 for an illustration) iff the following hold:

   • For every column 𝐶 𝑖 ∈ 𝒞, if 𝜋(𝐶 𝑖 ) = ∞, then no point of 𝑍 lies on 𝐶 𝑖 ; and

   • For all 1 ≤ 𝑖 ≤ |𝑍0 |, the 𝑖th point in 𝜎(𝑍) lies on the 𝑖th column of 𝜌𝜋 .



6.6.3   Our DP

Consider some integer 𝑡 : 1 ≤ 𝑡 ≤ 𝑚 (viewed as a row index) and some height profile 𝜋; we
define 𝑀(𝜋) as the largest value of 𝜋(𝐶 𝑗 ) for 𝐶 𝑗 ∈ 𝒞 that is not ∞. We say that 𝜋 is a legal profile
for time 𝑡 iff 𝑀(𝜋) ≤ 𝑡, and, if 𝐶 𝑖 is the column containing the input point 𝑝 𝑡 , then 𝜋(𝐶 𝑖 ) = 𝑀(𝜋)

                       T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                            57
              PARINYA C HALERMSOOK , J ULIA C HUZHOY, AND T HATCHAPHOL S ARANURAK




Figure 8: An illustration of height profile that is consistent with a set 𝑍 of points. The set
top(𝑍) is shown by dark points. The height profile is 𝜋(𝐶1 ) = 1, 𝜋(𝐶2 ) = 𝜋(𝐶5 ) = 2 and
𝜋(𝐶3 ) = ℎ 𝑍 (𝐶4 ) = 3.


(that is, column 𝐶 𝑖 has the largest value 𝜋(𝐶 𝑖 ) that is not ∞; we note that it is possible that other
columns 𝐶 𝑗 ≠ 𝐶 𝑖 also have 𝜋(𝐶 𝑗 ) = 𝑀(𝜋)).
For every integer 1 ≤ 𝑡 ≤ 𝑚, and every height profile 𝜋 that is legal for 𝑡, there is an entry 𝑇[𝑡, 𝜋]
in the dynamic programming table. The entry is supposed to store the minimum-cardinality set
𝑍 of points with the following properties:

   • 𝑋≤𝑡 ⊆ 𝑍;

   • all points of 𝑍 lie on rows 𝑅 1 , . . . , 𝑅 𝑡 ;

   • 𝑍 is a satisfied point set; and

   • 𝑍 is consistent with 𝜋.

Clearly, there number of entries in the dynamic programming table is bounded by 𝑚𝑛 𝑂(𝑛) . We
fill out the entries in the increasing order of the index 𝑡.
We start with 𝑡 = 1. Consider any profile 𝜋 that is legal for time 𝑡. Recall that for every column
𝐶 𝑗 ∈ 𝒞, 𝜋(𝐶 𝑗 ) ∈ {1, ∞} must hold, and moreover, if 𝐶 𝑖 is the unique column containing the
point 𝑝 1 ∈ 𝑋, then 𝜋(𝐶 𝑖 ) = 1 must hold. We let 𝑇[1, 𝜋] contain the following set of points: for
every column 𝐶 𝑗 ∈ 𝒞 with 𝜋(𝐶 𝑗 ) = 1, we add the point lying in the intersection of column 𝐶 𝑗
and row 𝑅 1 to the point set stored in 𝑇[𝑡, 𝜋]. It is immediate to verify that the resulting point set
is consistent with 𝜋, it is satisfied, it contains 𝑋≤1 = {𝑝 1 }, and it is the smallest-cardinality point
set with these properties.
We now assume that for some 𝑡 ≥ 1, we have computed correctly all entries 𝑇[𝑡 0 , 𝜋0] for all
1 ≤ 𝑡 0 ≤ 𝑡, and for all profiles 𝜋0 legal for 𝑡 0. We now fix some profile 𝜋 that is legal for 𝑡 + 1, and

                        T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                           58
              P INNING D OWN THE S TRONG W ILBER -1 B OUND FOR B INARY S EARCH T REES

show how to compute entry 𝑇[𝑡 + 1, 𝜋].
Let 𝑟 = 𝑀(𝜋) (recall that this is the largest value of 𝜋(𝐶 𝑗 ) that is not ∞), and let 𝒞1 ⊆ 𝒞 be the
set of all columns 𝐶 𝑗 with 𝜋(𝐶 𝑗 ) = 𝑟. Recall that, if 𝐶 𝑗 is the column containing the input point
𝑝 𝑡+1 , then 𝐶 𝑗 ∈ 𝒞1 must hold. Let 𝑃 be the set of |𝒞1 | points that lie on the intersection of the
row 𝑅 𝑡+1 and the columns in 𝒞1 .
Consider now any profile 𝜋0 that is legal for 𝑡, and let 𝑍ˆ = 𝑇[𝑡, 𝜋0]. Denote 𝑍ˆ 0 = top(𝑍).ˆ We say
that profile 𝜋 is a candidate profile if (i) 𝜋 is legal for 𝑡; (ii) the point sets 𝑃, 𝑍ˆ do not conflict;
                0                              0                                        0

(iii) 𝒞 0(𝜋0) ⊆ 𝒞 0(𝜋) ∪ 𝒞1 ; and (iv) if we discard from 𝜌𝜋 and from 𝜌𝜋0 the columns of 𝒞1 , then
the two orderings are defined over the same set of columns and they are identical. We select a
candidate profile 𝜋0 that minimizes |𝑇[𝑡, 𝜋0]|, and let 𝑍 = 𝑍ˆ ∪ 𝑃, where 𝑍ˆ is the point set stored
in 𝑇[𝑡, 𝜋0]. We then set 𝑇[𝑡 + 1, 𝜋] = 𝑍.
We now verify that set 𝑍 has all required properties. If the entry 𝑇[𝑡, 𝜋0] was computed correctly,
then point set 𝑍ˆ is satisfied. Since top(𝑍)
                                           ˆ and 𝑃 are not conflicting, from Observation 6.14
neither are 𝑍ˆ and 𝑃. Therefore, set 𝑍 is satisfied. If the entry 𝑇[𝑡, 𝜋0] was computed correctly,
then 𝑋≤𝑡 ⊆ 𝑍.ˆ Since 𝑝 𝑡+1 ∈ 𝑃, we get that 𝑋≤(𝑡+1) ⊆ 𝑍.

Next, we show that 𝑍 is consistent with the profile 𝜋. Let 𝑍0 = top(𝑍), and consider the ordering
𝜎(𝑍) of the points in 𝑍0. It is easy to verify that the last |𝑃| points in this ordering are precisely
the points of 𝑃. The remaining points in this ordering can be obtained from the point set 𝑍ˆ 0, by
first ordering them according to ordering 𝜎(𝑍), ˆ and then deleting all points lying on columns of
𝒞1 . From the definition of candidate profiles, it is easy to verify that for all 1 ≤ 𝑖 ≤ |𝑍0 |, the 𝑖th
point in 𝜎(𝑍) lies on the 𝑖th column of 𝜌𝜋 . Therefore, 𝑍 is consistent with profile 𝜋.
Lastly, it remains to show that the cardinality of 𝑍 is minimized among all sets with the above
properties. Let 𝑍 ∗ be the point set that contains 𝑋≤𝑡+1 , is satisfied, is consistent with profile 𝜋,
with every point of 𝑍 ∗ lying on rows 𝑅 1 , . . . , 𝑅 𝑡+1 , such that |𝑍 ∗ | is minimized among all such
sets. It is easy to verify that the set of points of 𝑍 ∗ lying on row 𝑅 𝑡+1 must be precisely 𝑃. This
is since point 𝑝 𝑡+1 must belong to 𝑍 ∗ , and so every column 𝐶 𝑖 with 𝜋(𝐶 𝑖 ) = 𝑀(𝜋) must have a
point lying on the intersection of 𝐶 𝑖 and 𝑅 𝑡+1 in 𝑍 ∗ . Let 𝑍ˆ = 𝑍 ∗ \ 𝑃. Clearly, 𝑍ˆ is satisfied, all
points of 𝑍ˆ lie on rows 𝑅 1 , . . . , 𝑅 𝑡 , and 𝑋≤𝑡 ⊆ 𝑍.
                                                       ˆ Moreover, there is no conflict between top(𝑍)ˆ
and 𝑃.
We define a new height profile 𝜋0 as follows: for every column 𝐶 𝑖 ∈ 𝒞, if no point of 𝑍ˆ lies on
𝐶 𝑖 , then 𝜋0(𝐶 𝑖 ) = ∞. Otherwise, let 𝑝 be the unique point in 𝑍ˆ 0 that lies on 𝐶 𝑖 , and assume that
it lies on row 𝑅 𝑡 0 . Then we set 𝜋0(𝐶 𝑖 ) = 𝑡 0. Notice that 𝒞 0(𝜋0) ⊆ 𝒞 0(𝜋) ∪ 𝒞1 , and, if we discard
from 𝜌𝜋 and from 𝜌𝜋0 the columns of 𝒞1 , then the two orderings are defined over the same set of
columns and they are identical.
It is easy to verify that the resulting profile 𝜋0 must be legal for 𝑡. Therefore, profile 𝜋0 was
considered by the algorithm. Since 𝑍ˆ 0 and 𝑃 are not conflicting, it is easy to verify that for any
other set 𝑍˜ of points that lie on rows 𝑅 1 , . . . , 𝑅 𝑡 and are consistent with profile 𝜋0, set top(𝑍)
                                                                                                      ˜
does not conflict with 𝑃. Therefore, 𝜋 must be a candidate profile for 𝜋. We conclude that
                                         0




                       T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                           59
              PARINYA C HALERMSOOK , J ULIA C HUZHOY, AND T HATCHAPHOL S ARANURAK

               ˆ and |𝑇[𝑡, 𝜋]| ≤ |𝑇[𝑡, 𝜋0]| + |𝑃| = |𝑍 ∗ |, so |𝑍| ≤ |𝑍 ∗ | must hold.
|𝑇[𝑡, 𝜋0]| ≤ | 𝑍|,
The output of the algorithm is a set 𝑇[𝑚, 𝜋] \ 𝑋 of smallest cardinality among all profiles 𝜋 ∈ Π
that are consistent with 𝑚. It is immediate to verify that this is a feasible and optimal solution
for 𝑋.
As observed before, the number of entries in the dynamic programming table is 𝑚 · 𝑛 𝑂(𝑛) , and
computing each entry takes time 𝑛 𝑂(𝑛) . Therefore, the total running time of the algorithm is
bounded by 𝑚 · 𝑛 𝑂(𝑛) .



7     An 𝑂(log log 𝑛)-competitive online algorithm

In this section we extend the 𝑂(log log 𝑛)-approximation algorithm from the previous section
to the online setting, completing the proof of Theorem 1.2. To this end, the recursive algorithm
is not quite convenient to work with. In Subsection 7.1, we present an equivalent iterative
description of our algorithm. In Subsection 7.2, we slightly modify the solution 𝑌 that the
algorithm returns to obtain another solution 𝑌ˆ that is more friendly for the online setting, before
presenting the final online algorithm in Subsection 7.3.


7.1   Unfolding the recursion

Let 𝑋 be an input set of points that is semi-permutation, with |𝑋 | = 𝑚, and 𝑐(𝑟) = 𝑛. Let 𝑇 be a
balanced partitioning tree of height 𝐻 = 𝑂(log 𝑛) for 𝑋.
We now construct another tree 𝑅, that is called a recursion tree, and which is unrelated to the
tree 𝑇. Every vertex 𝑞 of the tree 𝑅 is associated with an instance ℐ(𝑞) of Min-Sat that arose
during the recursive execution of Algorithm RecursiveBST(𝑋 , 𝑇, 𝜌), with 𝜌 = 1. Specifically,
for the root 𝑟 of the tree 𝑅, we let ℐ(𝑟) = 𝑋. For every vertex 𝑞 of the tree 𝑅, if Algorithm
RecursiveBST(𝑋 , 𝑇, 𝜌), when called for instance ℐ(𝑞), constructed instances ℐ1 , . . . , ℐ𝑧 (recall
that one of these instances is a compressed instance, and the remaining instances are strip
instances), then vertex 𝑞 has 𝑧 children in tree 𝑅, each of which is associated with one of these
instances.
For a vertex 𝑞 ∈ 𝑉(𝑅), let 𝑛(𝑞) be the number active columns in the instance ℐ(𝑞). Recall that
instance ℐ(𝑞) corresponds to some subtree of the partitioning tree 𝑇, that we denote by 𝑇𝑞 . For
all 𝑖 ≥ 0, let Λ0𝑖 be the set of all vertices of the tree 𝑅 that lie at distance exactly 𝑖 from the root of
𝑅. We say that the vertices of Λ0𝑖 belong to the 𝑖th layer of 𝑅. Notice that, if vertex 𝑞 lies in the
𝑖th layer of 𝑅, then the height of the corresponding tree 𝑇𝑞 is bounded by 𝐻 · (2/3)𝑖 (the constant
                                                                𝑇 is split in its middle layer, each of
2/3 is somewhat arbitrary and is used because,         when a tree     0
                                                            0
                                                 
the resulting subtrees has height at most height(𝑇 )/2 + 1 ≤ 2𝐻/3). Recall that the recursion
terminates once we obtain instances whose corresponding partitioning trees have height 1. It is
then easy to verify that the height of the recursion tree 𝑅 is bounded by 𝑂(log log 𝑛).

                       T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                             60
               P INNING D OWN THE S TRONG W ILBER -1 B OUND FOR B INARY S EARCH T REES




Figure 9: An illustration of the families 𝒯𝑖 . Notice that 𝒯1 contains 5 trees, each having height 2.


Consider now some layer Λ𝑖 of the recursion tree 𝑅. We let 𝒯𝑖 be the collection of all subtrees of
the partitioning tree 𝑇 corresponding to the vertices of Λ𝑖 , so 𝒯𝑖 = 𝑇𝑞 | 𝑞 ∈ Λ𝑖 . Recall that all
trees in 𝒯𝑖 have height at most 𝐻 · (2/3)𝑖 .
Notice that 𝒯0 = {𝑇}. Let 𝑈 = {𝑣 1 , . . . , 𝑣 𝑘 } be the middle layer of 𝑇. Then 𝒯1 = 𝑇𝑣1 , . . . , 𝑇𝑣 𝑘 , 𝑇𝑟 .
                                                                                            
Set 𝒯2 is similarly obtained by subdividing every tree in 𝒯1 , and so on. (See Figure 9 for an
illustration. ) The construction of the tree sets 𝒯𝑖 can be described using the following process:

   • Start from 𝒯0 = {𝑇}.

   • For 𝑖 = 1, . . . , 𝐷, if some tree in 𝒯𝑖−1 has height greater than 1, then construct the set 𝒯𝑖 of
                             n everyotree 𝑇 ∈ 𝒯𝑖 whose height is greater than 1, consider the split
     trees as follows: For                  0

                                  e by the (boundaries of) middle-layer strips. Notice that 𝑇
        partitioning trees {𝑇𝑗 }, 𝑇                                                         e is
        rooted at the root of 𝑇 0 and each 𝑇𝑗 at a leaf of 𝑇.
                                                           e These subtrees are added into 𝒯𝑖 .

The following observation is immediate.
Observation 7.1. For each 1 ≤ 𝑖 ≤ 𝐷, 𝑇 0 ∈𝒯𝑖 𝑉(𝑇 0) = 𝑉(𝑇), and for each pair 𝑇 0 , 𝑇 00 ∈ 𝒯𝑖 of trees,
                                            Ð
either 𝑉(𝑇 0) ∩ 𝑉(𝑇 00) = ∅ or the root of one of these trees is a leaf of the other tree.


7.1.1    Boxes

Fix an index 1 ≤ 𝑖 ≤ 𝐷, and consider any tree 𝑇 0 ∈ 𝒯𝑖 . Denote the number of leaves by 𝑘,
and let 𝑣 1 , . . . , 𝑣 𝑘 be the leaves of 𝑇 0. If 𝑣 is the root vertex of the tree 𝑇 0, then we can view 𝑇 0
as defining a hierarchical partitioning scheme of the strip 𝑆(𝑣), until we obtain the partition

                        T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                                 61
               PARINYA C HALERMSOOK , J ULIA C HUZHOY, AND T HATCHAPHOL S ARANURAK

(𝑆(𝑣 1 ), 𝑆(𝑣 2 ), . . . , 𝑆(𝑣 𝑘 )) of 𝑆(𝑣). We define a collection of 𝑇 0-boxes as follows. Let 𝑋 0 = 𝑋 ∩ 𝑆(𝑣)
be the set of all points lying in strip 𝑆(𝑣). Assume that 𝑋 0 = {𝑝 1 , 𝑝2 , . . . , 𝑝 𝑚0 }, where the points
are indexed in the increasing order of their 𝑦-coordinates.
We now iteratively partition the point set 𝑋 0 into boxes, where each box is a consecutive set of
points of 𝑋 0. Let 𝑣 𝑖 be the leaf vertex of 𝑇 0, such that 𝑝 1 ∈ 𝑆(𝑣 𝑖 ), and let 𝑚1 be the largest index,
such that all points 𝑝 1 , . . . , 𝑝 𝑚1 lie in 𝑆(𝑣 𝑖 ). We then then define a box 𝐵1 = {𝑝 1 , 𝑝2 , . . . , 𝑝 𝑚1 }.
We discard the points 𝑝 1 , . . . , 𝑝 𝑚1 , and continue this process to define 𝐵2 (starting from point
𝑝 𝑚1 + 1), and so on, until every point of 𝑋 0 belongs to some box.
We let ℬ(𝑇 0) = 𝐵1 , 𝐵2 , . . . , 𝐵 𝑧(𝑇 0) be the resulting partition of the points in 𝑋 0, where we refer
                  
to each set 𝐵 𝑖 as a 𝑇 0-box, or just a box. For each box 𝐵0 ∈ ℬ(𝑇 0), the lowest and the highest rows
containing points of 𝐵0 are denoted by first(𝐵0) and last(𝐵0) respectively.


7.1.2   Projections of points

Recall that the solution that our recursive algorithm returns is 𝑇-special, that is, the points that
participate in the solution lie on the active rows of 𝑋 and on 𝑇-auxiliary columns. Let 𝑌 be a
feasible solution obtained by our algorithm RecursiveBST(𝑋, 𝑇, 𝜌), where 𝜌 = 1. Notice the
every point in 𝑌 is obtained by “projecting” some input point 𝑝 ∈ 𝑋 to the boundary of some
strip 𝑆(𝑣) for 𝑣 ∈ 𝑉(𝑇), where strip 𝑆(𝑣) contains 𝑝. For each point 𝑝 ∈ 𝑋 and node 𝑣 ∈ 𝑉(𝑇) of
the partitioning tree, such that 𝑝 ∈ 𝑆(𝑣), we define the set proj(𝑝, 𝑣) to contain two points on
the same row as 𝑝, that lie on the left and right boundaries of 𝑆(𝑣), respectively. We denote
by 𝑌𝑝,𝑣 the set that contain the two points proj(𝑝, 𝑣) if our algorithm adds these two points
to the solution. Since all points of 𝑋 lie on distinct rows, for any two points 𝑝 ≠ 𝑝 0, for any
pair 𝑣, 𝑣 0 ∈ 𝑉(𝑇) of vertices, proj(𝑝, 𝑣) ∩ proj(𝑝 0 , 𝑣 0) = ∅. We can now write the solution 𝑌 as
𝑌 = 𝑝∈𝑋 𝑣∈𝑉(𝑇) 𝑌𝑝,𝑣 . The following lemma characterizes the points of 𝑌 in terms of boxes.
     Ð      Ð

Lemma 7.2. For every point 𝑝 ∈ 𝑋 and every node 𝑣 ∈ 𝑉(𝑇) of the partitioning tree, |𝑌𝑝,𝑣 | = 2 if and
only if there is an index 1 ≤ 𝑖 ≤ 𝐷, and a subtree 𝑇 0 ∈ 𝒯𝑖 , such that (i) 𝑝 is the first or the last point of its
𝑇 0-box; (ii) 𝑣 lies in the middle layer of 𝑇 0; and (iii) 𝑆(𝑣) contains 𝑝.

Proof. Let 1 ≤ 𝑖 ≤ 𝐷 be an index, and let 𝑇 0 ∈ 𝒯𝑖 be the corresponding partitioning tree.
We denote the corresponding point set by 𝑋 0. Let 𝑣 1 , . . . , 𝑣 𝑘 be the leaf vertices of 𝑇 0, and
let 𝑢1 , . . . , 𝑢𝑟 be the vertices lying in the middle layer of 𝑇 0. The only points added to the
solution when instance 𝑋 0 is processed are the following: for every 1 ≤ 𝑗 ≤ 𝑟, for every point
𝑝 ∈ 𝑋 0 ∩ 𝑆(𝑢 𝑗 ), we add the two copies of 𝑝 to the boundaries of 𝑆(𝑢 𝑗 ). In subsequent, or in
previous iterations, we may add points to boundaries of 𝑆(𝑢 𝑗 ), but we will never add two copies
of the same input point to both boundaries of 𝑆(𝑢 𝑗 ). Therefore, if |𝑌𝑝,𝑣 | = 2, then there must
be an index 𝑖, and a tree 𝑇 0 ∈ 𝒯𝑖 , such that 𝑣 lies in the middle layer of 𝑇 0, and 𝑆(𝑣) contains 𝑝.
Observe that for every 𝑇 0-box 𝐵, instance 𝑋 0 only contains the first and the last point of 𝐵; all
remaining points are redundant for 𝑋 0, and such points are not projected to the boundaries of
their strips. Therefore, 𝑝 must be the first or the last point of its 𝑇 0-box.                      

                         T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                                   62
              P INNING D OWN THE S TRONG W ILBER -1 B OUND FOR B INARY S EARCH T REES

7.1.3    An equivalent view of our algorithm

We can think of our algorithm as follows. First, we compute all families 𝒯0 , 𝒯1 , . . . , 𝒯𝐷 of trees,
and for each tree 𝑇 0 in each family 𝒯𝑖 , all 𝑇 0-boxes. For each point 𝑝 ∈ 𝑋, for each vertex 𝑣 ∈ 𝑉(𝑇)
of the partitioning tree 𝑇, such that 𝑝 ∈ 𝑆(𝑣), we add the projection points proj(𝑝, 𝑣) to the
solution, depending on whether there exists an index 𝑖 ∈ {0, 1, . . . , 𝐷} and a tree 𝑇 0 ∈ 𝒯𝑖 , such
that 𝑣 lies in the middle layer 𝑈𝑇 0 of 𝑇 0, and whether 𝑝 is the first or last point of its 𝑇 0-box.
Notice that in the online setting, when the point 𝑝 arrives, we need to be able to immediately
decide which copies of 𝑝 to add to the solution. Since the trees in the families 𝒯0 , 𝒯1 , . . . , 𝒯𝐷 are
known in advance, and we discover, for every vertex 𝑣 ∈ 𝑉(𝑇) whether 𝑝 ∈ 𝑆(𝑣) immediately
when 𝑝 arrives, the only missing information, for every relevant tree 𝑇 0, is whether vertex 𝑝 is
the first or the last vertex in its 𝑇 0-box. In fact it is easy to check whether it is the first vertex
in its 𝑇 0-box, but we will not discover whether it is the last vertex in its 𝑇 0-box until the next
iteration, at which point it is too late to add points to the solution that lie in the row of 𝑝.
In order to avoid this difficulty, we slightly modify the instance and the solution.


7.2     Online-friendly solutions

In this section we slightly modify both the input set of points and the solutions produced by our
algorithm, in order to adapt them to the online setting.


7.2.1    Modifying the instance

Let 𝑋 = {𝑝 1 , . . . , 𝑝 𝑚 } be the input setof points, where for all 1 ≤ 𝑖 ≤ 𝑚, the 𝑦-coordinate of 𝑝 𝑖 is
𝑖. We produce a new instance 𝑋 0 = 𝑝 0𝑖 , 𝑝 00𝑖 | 1 ≤ 𝑖 ≤ 𝑚 , as follows: for all 1 ≤ 𝑖 ≤ 𝑚, we let 𝑝 0𝑖
and 𝑝 00𝑖 be points whose 𝑦-coordinates are (2𝑖 − 1) and 2𝑖, respectively, and whose 𝑥-coordinate
is the same as that of 𝑝 𝑖 . We refer to 𝑝 0𝑖 and to 𝑝 00𝑖 as copies of 𝑝 𝑖 .
Clearly, |𝑋 0 | = 2|𝑋 |, and it is easy to verify that OPT(𝑋 0) ≤ 2OPT(𝑋). Indeed, if we let 𝑌 be a
solution for 𝑋, we can construct a solution 𝑌 0 for 𝑋 0, by creating, for every point 𝑞 ∈ 𝑌, two
copies 𝑞 0 and 𝑞 00, that are added to rows with 𝑦-coordinates 2𝑞.𝑦 − 1 and 2𝑞.𝑦 respectively.
For convenience, we denote 𝒯 = 𝐷       𝑖=0 𝒯𝑖 . Notice that, for every tree 𝑇 ∈ 𝒯 , for every point 𝑝 𝑖 of
                                                                             0
                                     Ð
the original input 𝑋, copy 𝑝 0𝑖 of 𝑝 𝑖 may never serve as the last point of a 𝑇 0-box, and copy 𝑝 00𝑖 of
𝑝 𝑖 may never serve as the first point of a 𝑇 0-box.


7.2.2    Modifying the solution

Let 𝑌 be the solution that our 𝑂(log log 𝑛)-approximation algorithm produces for the new
instance 𝑋 0. For convenience, for all 1 ≤ 𝑖 ≤ 2𝑚, we denote by 𝑅 𝑖 the row with 𝑦-coordinate 𝑖.
Notice that all points of 𝑌 lying on a row 𝑅2𝑖−1 are projections of the point 𝑝 0𝑖 . Since this point

                       T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                              63
              PARINYA C HALERMSOOK , J ULIA C HUZHOY, AND T HATCHAPHOL S ARANURAK

may only serve as the first point of a 𝑇 0-box for every tree 𝑇 0 ∈ 𝒯 , when point 𝑝 0𝑖 arrives online,
we can immediately compute all projections of 𝑝 0𝑖 that need to be added to the solution. All
points of 𝑌 lying on row 𝑅 2𝑖 are projections of the point 𝑝 00𝑖 . This point may only serve as the
last point of a 𝑇 0-box in a tree 𝑇 0 ∈ 𝒯 . But we cannot know whether 𝑝 00𝑖 is the last point in its
𝑇 0-box until we see the next input point. Motivated by these observations, we now modify the
solution 𝑌 as follows.
We perform 𝑚 iterations. In iteration 𝑖, we consider the row 𝑅 2𝑖 . If no point of 𝑌 lies on row 𝑅2𝑖 ,
then we continue to the next iteration. Otherwise, we move every point of 𝑌 that lies on row 𝑅2𝑖
to row 𝑅2𝑖+1 (that is, one row up). Additionally, we add another copy 𝑝 000𝑖
                                                                              of point 𝑝 𝑖 to row 𝑅 2𝑖 ,
while preserving its 𝑥-coordinate.
In order to show that the resulting solution is a feasible solution to instance 𝑋 0, it is sufficient to
show that the solution remains feasible after every iteration. Let 𝑌𝑖 be the solution 𝑌 obtained
before the 𝑖th iteration, and let 𝑆 𝑖 = 𝑋 0 ∪ 𝑌𝑖 . We can obtain the new solution 𝑌𝑖+1 equivalently
as follows. First, we collapse the rows 𝑅 2𝑖+1 and 𝑅 2𝑖 for the set 𝑆 𝑖 of points into the row 𝑅2𝑖+1 ,
obtaining a new set 𝑆0𝑖 of points that is guaranteed to be satisfied. Notice that now both row
𝑅 2𝑖+1 and row 𝑅2𝑖−1 contain a point at 𝑥-coordinate (𝑝 𝑖 ).𝑥, while row 𝑅 2𝑖 contains no points.
Therefore, if we add to 𝑆0𝑖 a point with 𝑥-coordinate (𝑝 𝑖 ).𝑥, that lies at row 𝑅 2𝑖 , then the resulting
set of points, that we denote by 𝑆 𝑖+1 remains satisfied. But it is easy to verify that 𝑆 𝑖+1 = 𝑋 0 ∪ 𝑌𝑖+1 ,
where 𝑌𝑖+1 is the solution obtained after iteration 𝑖. We denote by 𝑌 0 the final solution obtained
by this transformation of 𝑌. It is easy to see that |𝑌 0 | ≤ 2|𝑌| ≤ 𝑂(log log 𝑛)OPT(𝑋).


7.3   The final online algorithm

Let 𝑋 = {𝑝 1 , . . . , 𝑝 𝑚 } be an input set of points. We will describe the online algorithm that
produces a feasible solution to instance 𝑋. This algorithm would mimic the behavior of the
point set 𝑌 0 as described earlier.
Initially, the algorithm computes the collection 𝒯 of trees before any input arrives. Now, in
iteration 𝑖, when input 𝑝 𝑖 arrives, we compute 𝐹𝐼𝑅𝑆𝑇(𝑖):
                                   Ø                                             Ø
         𝐹𝐼𝑅𝑆𝑇(𝑖) =                                                                                           proj(𝑝 𝑖 , 𝑣)®
                                                          ©                                                                ª
                                                          
                       𝑇 0 ∈𝒯 :𝑝 𝑖 is first of a 𝑇 0 -box «𝑣:𝑣 is middle layer of 𝑇 0 , and 𝑆(𝑣) contains 𝑝 𝑖              ¬
We also compute 𝐿𝐴𝑆𝑇(𝑖 − 1):
                                   Ø                                             Ø
      𝐿𝐴𝑆𝑇(𝑖 − 1) =                                                                                           proj(𝑝 𝑖−1 , 𝑣)®
                                                          ©                                                                  ª
                                                          
                      𝑇 0 ∈𝒯 :𝑝 𝑖−1 is last of a 𝑇 0 -box «𝑣:𝑣 is middle layer of 𝑇 0 and 𝑆(𝑣) contains 𝑝 𝑖−1                ¬
We add copies of 𝐹𝐼𝑅𝑆𝑇(𝑖) and 𝐿𝐴𝑆𝑇(𝑖 − 1), and a copy of 𝑝 𝑖−1 points on row 𝑖. It is easy to
verify that the resulting solution is precisely the solution obtained after collapsing every two
consecutive rows in 𝑌 0 (as discussed previously). Therefore the solution is feasible and has cost
at most |𝑌 0 | ≤ 𝑂(log log 𝑛)OPT.

                       T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                                                      64
              P INNING D OWN THE S TRONG W ILBER -1 B OUND FOR B INARY S EARCH T REES

8     Wilber and Guillotine bounds

In this section, we provide an alternative, geometric proof of the fact that WB(𝑋) ≤ 2OPT(𝑋),
which can be extended to the Guillotine bound. The original proof of Wilber [29] is done in the
tree view.


8.1   Wilber bound

It is sufficient to prove that, if 𝑋 is a semi-permutation, and 𝑇 is any partitioning tree for 𝑋, then
WB𝑇 (𝑋) ≤ 2OPT(𝑋).
We prove this claim by induction on the height of 𝑇. The base case, when the height of 𝑇 is 0, is
obvious: there is only one active column, and so WB𝑇 (𝑋) = OPT(𝑋) = 0.
We now consider the inductive step. Let 𝑋 be any point set that is a semi-permutation, and let 𝑇
be any partitioning tree for 𝑋, such that the height of 𝑇 is at least 1. Let 𝑣 be the root vertex of 𝑇,
𝐿 = 𝐿(𝑣) the line that 𝑣 owns, and let 𝑣 𝐿 , 𝑣 𝑅 be the two children of 𝑣. We assume w.l.o.g. that
the strip 𝑆(𝑣 𝐿 ) lies to the left of 𝑆(𝑣 𝑅 ). We denote 𝑆(𝑣) = 𝐵 – the bounding box of the instance,
𝑆(𝑣 𝐿 ) = 𝑆 𝐿 , 𝑆(𝑣 𝑅 ) = 𝑆 𝑅 , and we also denote 𝑋 𝐿 = 𝑋 ∩ 𝑆 𝐿 , 𝑋 𝑅 = 𝑋 ∩ 𝑆 𝑅 . Lastly, we let 𝑇 𝐿 and 𝑇 𝑅
be the subtrees of 𝑇 rooted at 𝑣 𝐿 and 𝑣 𝑅 , respectively. We prove the following claim.
Claim 8.1.
                             OPT(𝑋) ≥ OPT(𝑋 𝐿 ) + OPT(𝑋 𝑅 ) + cost(𝑣)/2.

Notice that, if the claim is correct, then we can use the induction hypothesis on 𝑋 𝐿 and 𝑋 𝑅 with
the trees 𝑇 𝐿 and 𝑇 𝑅 respectively, to conclude that:

                                1                                        1
                    OPT(𝑋) ≥      (WB𝑇 𝐿 (𝑋𝐿 ) + WB𝑇 𝑅 (𝑋𝑅 ) + cost(𝑣)) = WB𝑇 (𝑋).
                                2                                        2
Therefore, in order to complete the proof of Claim 2.7, it is enough to prove Claim 8.1.

Proof. Claim 8.1 Let 𝑌 be an optimal solution to instance 𝑋. We can assume w.l.o.g. that 𝑌 is a
canonical solution, so no point of 𝑌 lies on the line 𝐿. Let 𝑌 𝐿 , 𝑌 𝑅 be the subsets of points of 𝑌
that lie to the left and to the right of the line 𝐿, respectively.
Let ℛ 𝐿 be the set of all rows 𝑅, such that (i) no point of 𝑋 𝐿 lies on 𝑅, and (ii) some point of 𝑌 𝐿
lies on 𝑅. We define a set ℛ 𝑅 of rows similarly for instance 𝑋 𝑅 . The crux of the proof is the
following observation.
Observation 8.2.
                                        |ℛ 𝐿 | + |ℛ 𝑅 | ≥ cost(𝑣)/2.

Before we prove Observation 8.2, we show that Claim 8.1 follows from it. In order to do so, we
will define a new feasible solution 𝑌ˆ 𝐿 for instance 𝑋 𝐿 , containing at most |𝑌 𝐿 | − |ℛ 𝐿 | points,

                       T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                               65
              PARINYA C HALERMSOOK , J ULIA C HUZHOY, AND T HATCHAPHOL S ARANURAK

and similarly, we will define a new feasible solution 𝑌ˆ 𝑅 for instance 𝑋 𝑅 , containing at most
|𝑌 𝑅 | − |ℛ 𝑅 | points. This will prove that OPT(𝑋 𝐿 ) ≤ |𝑌 𝐿 | − |ℛ 𝐿 | and OPT(𝑋 𝑅 ) ≤ |𝑌 𝑅 | − |ℛ 𝐿 |,
so altogether, OPT(𝑋 𝐿 ) + OPT(𝑋 𝑅 ) ≤ |𝑌| − |ℛ 𝐿 | − |ℛ 𝑅 | ≤ OPT(𝑋) − cost(𝑣)/2, thus proving
Claim 8.1.
We now show how to construct the solution 𝑌ˆ 𝐿 for instance 𝑋 𝐿 . The solution 𝑌ˆ 𝑅 for instance 𝑋 𝑅
is constructed similarly.
In order to construct the solution 𝑌ˆ 𝐿 , we start with the solution 𝑌 𝐿 , and then gradually modify
it over the course of |ℛ 𝐿 | iterations, where in each iteration we reduce the number of points in
the solution 𝑌 𝐿 by at least 1, and we eliminate at most one row from ℛ 𝐿 . In order to execute an
iteration, we select two rows 𝑅, 𝑅0, with the following properties:

   • Row 𝑅 contains a point of 𝑋 𝐿 ;

   • Row 𝑅0 contains a point of 𝑌 𝐿 and it contains no points of 𝑋 𝐿 ; and

   • No point of 𝑋 𝐿 ∪ 𝑌 𝐿 lies strictly between rows 𝑅 and 𝑅0.

Note that, if ℛ 𝐿 ≠ ∅, such a pair of rows must exist. We then collapse the row 𝑅0 into the row 𝑅,
obtaining a new modified solution to instance 𝑋 𝐿 (we use Observation 2.4). We claim that the
number of points in the new solution decreases by at least 1. In order to show this, it is sufficient
to show that there must be two points 𝑝 ∈ 𝑅, 𝑝 0 ∈ 𝑅0 with the same 𝑥-coordinates; after the two
rows are collapsed, these two points are mapped to the same point. Assume for contradiction
that no such two points exist. Let 𝑝 ∈ 𝑅, 𝑝 0 ∈ 𝑅0 be two points with smallest horizontal distance.
Then it is easy to see that no point of 𝑋 𝐿 ∪ 𝑌 𝐿 lies in the rectangle 𝑝,𝑝0 , contradicting the fact
that 𝑌 𝐿 is a feasible solution for 𝑋 𝐿 .
In order to complete the proof of Claim 8.1, it is now enough to prove Observation 8.2.

Proof. Observation 8.2 We denote 𝑋 = {𝑝 1 , . . . , 𝑝 𝑚 }, where the points are indexed in the
increasing order of their 𝑦-coordinates. Recall that a pair (𝑝 𝑖 , 𝑝 𝑖+1 ) of points is a crossing, if the
two points lie on opposite sides of the line 𝐿. We say that it is a left-to-right crossing if 𝑝 𝑖 lies to
the left of 𝐿, and we say that it is a right-to-left crossing otherwise. Clearly, either at least half
the crossings of 𝐿 are left-to-right crossings, or at least half the crossings of 𝐿 are right-to-left
crossings. We assume w.l.o.g. that it is the former. Let Π denote the set of all left-to-right
crossings of 𝐿, so |Π| ≥ cost(𝑣)/2. Notice that every point of 𝑋 participates in at most one
crossing in Π. We will associate, to each crossing (𝑝 𝑖 , 𝑝 𝑖+1 ) ∈ Π, a unique row in ℛ 𝐿 ∪ ℛ 𝑅 . This
will prove that |ℛ 𝐿 | + |ℛ 𝑅 | ≥ |Π| ≥ cost(𝑣)/2.
Consider now some crossing (𝑝 𝑖 , 𝑝 𝑖+1 ). Assume that 𝑝 𝑖 lies in row 𝑅, and that 𝑝 𝑖+1 lies in row
𝑅0. Let ℛ 𝑖 be a set of all rows lying between 𝑅 and 𝑅0, including these two rows. We will show
that at least one row of ℛ 𝑖 lies in ℛ 𝐿 ∪ ℛ 𝑅 . In order to do so, let 𝐻 be the closed horizontal strip
whose bottom and top boundaries are 𝑅 and 𝑅0, respectively. Let 𝐻 𝐿 be the area of 𝐻 that lies to
the left of the line 𝐿, and that excludes the row 𝑅 – the row containing the point 𝑝 𝑖 , that also

                       T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                            66
              P INNING D OWN THE S TRONG W ILBER -1 B OUND FOR B INARY S EARCH T REES

lies to the left of 𝐿. Similarly, let 𝐻 𝑅 be the area of 𝐻 that lies to the right of the line 𝐿, and that
excludes the row 𝑅0. Notice that, if any point 𝑦 ∈ 𝑌 𝐿 lies in 𝐻 𝐿 , that the row containing 𝑦 must
belong to ℛ 𝐿 . Similarly, if any point 𝑦 0 ∈ 𝑌 𝑅 lies in 𝐻 𝑅 , then the row containing 𝑦 0 belongs to
ℛ 𝑅 . Therefore, it is now sufficient to show that either 𝐻 𝐿 contains a point of 𝑌 𝐿 , or 𝐻 𝑅 contains
a point of 𝑌 𝑅 . Assume for contradiction that this is false. Let 𝑝 ∈ 𝑋 𝐿 ∪ 𝑌 𝐿 be the point lying on
the row 𝑅 furthest to the right (such a point must exist because we can choose 𝑝 = 𝑝 𝑖 ). Similarly,
let 𝑝 0 ∈ 𝑋 𝑅 ∪ 𝑌 𝑅 be the point lying on the row 𝑅0 furthest to the left (again, such a point must
exist because we can choose 𝑝 0 = 𝑝 𝑖+1 .) But if 𝐻 𝐿 contains no points of 𝑌 𝐿 , and 𝐻 𝑅 contains no
points of 𝑌 𝑅 , then no points of 𝑋 ∪ 𝑌 lie in the rectangle 𝑝,𝑝0 , and so the pair (𝑝, 𝑝 0) of points is
not satisfied in 𝑋 ∪ 𝑌, a contradiction.                                                                


                                                                                                       


8.2   Guillotine bound

In this section we prove Lemma 5.3, by showing that for any point set 𝑋 that is a permutation,
GB(𝑋) ≤ 2OPT(𝑋). In order to do so, it is enough to prove that, for any point set 𝑋 that is a
permutation, for any partitioning tree 𝑇 for 𝑋, GB𝑇 (𝑋) ≤ 2OPT(𝑋).
The proof is by induction on the height of 𝑇, and it is almost identical to the proof of Claim 2.7
for the standard Wilber Bound. When the height of the tree 𝑇 is 1, then |𝑋 | = 1, so GB(𝑋) = 0
and OPT(𝑋) = 0.
Consider now a partitioning tree 𝑇 whose height is greater than 1. Let 𝑇1 , 𝑇2 be the two subtrees
of 𝑇, obtained by deleting the root vertex 𝑟 from 𝑇. Let (𝑋1 , 𝑋2 ) be the partition of 𝑋 into two
subsets given by the line 𝐿(𝑟), such that 𝑇1 is a partitioning tree for 𝑋1 and 𝑇2 is a partitioning
tree for 𝑋2 . Notice that, from the definition of the GB bound:


                              GB𝑇 (𝑋) = GB𝑇1 (𝑋1 ) + GB𝑇2 (𝑋2 ) + cost(𝑟).


Moreover, from the induction hypothesis, GB𝑇1 (𝑋1 ) ≤ 2OPT(𝑋1 ) and GB𝑇2 (𝑋2 ) ≤ 2OPT(𝑋2 ).
Using Claim 8.1 (that can be easily adapted to horizontal partitioning lines), we get that:


                             OPT(𝑋) ≥ OPT(𝑋1 ) + OPT(𝑋2 ) + cost(𝑟)/2.


Therefore, altogether we get that:


                      GB𝑇 (𝑋) ≤ 2OPT(𝑋1 ) + 2OPT(𝑋2 ) + cost(𝑟) ≤ 2OPT(𝑋).



                       T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                            67
            PARINYA C HALERMSOOK , J ULIA C HUZHOY, AND T HATCHAPHOL S ARANURAK

References

 [1] Georgii Maksimovich Adel’son-Vel’skii and Evgenii Mikhailovich Landis: An algorithm
     for organization of information. Dokl. Akad. Nauk SSSR (Russian), 146(2):263–266, 1962.
     Math-Net.ru. 2

 [2] Rudolf Bayer: Symmetric binary B-trees: Data structure and maintenance algorithms. Acta
     Informatica, 1(4):290–306, 1972. [doi:10.1007/BF00289509] 2

 [3] Prosenjit Bose, Karim Douïeb, John Iacono, and Stefan Langerman: The power and
     limitations of static binary search trees with lazy finger. Algorithmica, 76:1264–1275, 2016.
     Preliminary version in ISAAC’14. [doi:10.1007/s00453-016-0224-x] 3

 [4] Parinya Chalermsook, Julia Chuzhoy, and Thatchaphol Saranurak: Pinning down the
     strong Wilber 1 bound for binary search trees. In Proc. 23rd Internat. Conf. on Approximation
     Algorithms for Combinat. Opt. Probl. (APPROX’20), pp. 33:1–21. Schloss Dagstuhl–Leibniz-
     Zentrum fuer Informatik, 2020. [doi:10.4230/LIPIcs.APPROX/RANDOM.2020.33] 8

 [5] Parinya Chalermsook, Mayank Goswami, László Kozma, Kurt Mehlhorn, and
     Thatchaphol Saranurak: Greedy is an almost optimal deque. In Proc. 17th Symp. on
     Algorithms and Data Structures (WADS’15), pp. 152–165. Springer, 2015. [doi:10.1007/978-3-
     319-21840-3_13] 3

 [6] Parinya Chalermsook, Mayank Goswami, László Kozma, Kurt Mehlhorn, and
     Thatchaphol Saranurak: Pattern-avoiding access in binary search trees. In Proc. 56th
     FOCS, pp. 410–423. IEEE Comp. Soc., 2015. [doi:10.1109/FOCS.2015.32, arXiv:1507.06953]
     3, 7

 [7] Ranjan Chaudhuri and Hartmut F. Höft: Splaying a search tree in preorder takes linear
     time. SIGACT News, 24(2):88–93, 1993. [doi:10.1145/156063.156067] 3

 [8] Richard Cole: On the Dynamic Finger Conjecture for splay trees. Part II: The proof. SIAM
     J. Comput., 30(1):44–85, 2000. [doi:10.1137/S009753979732699X] 3

 [9] Richard Cole, Bud Mishra, Jeanette Schmidt, and Alan Siegel: On the Dynamic Finger
     Conjecture for splay trees. Part I: Splay sorting log 𝑛-block sequences. SIAM J. Comput.,
     30(1):1–43, 2000. [doi:10.1137/S0097539797326988] 3

[10] Thomas H. Cormen, Charles E. Leiserson, Ronald L. Rivest, and Clifford Stein: Introduction
     to Algorithms. MIT press, 2022. MIT Press. 2

[11] Erik D. Demaine, Dion Harmon, John Iacono, Daniel M. Kane, and Mihai Pǎtraşcu: The
     geometry of binary search trees. In Proc. 20th Ann. ACM–SIAM Symp. on Discrete Algorithms
     (SODA’09), pp. 496–505. SIAM, 2009. ACM DL. 3, 4, 5, 9, 12, 36

                     T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                      68
             P INNING D OWN THE S TRONG W ILBER -1 B OUND FOR B INARY S EARCH T REES

[12] Erik D. Demaine, Dion Harmon, John Iacono, and Mihai Pǎtraşcu: Dynamic optimality–
     almost. SIAM J. Comput., 37(1):240–251, 2007. Preliminary version in FOCS’04.
     [doi:10.1137/S0097539705447347] 3, 4, 6, 8, 12

[13] Jonathan C. Derryberry and Daniel Dominic Sleator: Skip-splay: Toward achieving the
     unified bound in the BST model. In Proc. 11th Symp. on Algorithms and Data Structures
     (WADS’09), pp. 194–205. Springer, 2009. [doi:10.1007/978-3-642-03367-4_18] 3

[14] Jonathan C. Derryberry, Daniel Dominic Sleator, and Chengwen Chris Wang: A lower
     bound framework for binary search trees with rotations. Technical report, CMU-CS-05-187,
     2005. Available on author’s website. 3

[15] Amr Elmasry: On the sequential access theorem and deque conjecture for splay trees.
     Theoret. Comput. Sci., 314(3):459–466, 2004. [doi:10.1016/j.tcs.2004.01.019] 3

[16] George F. Georgakopoulos: Chain-splay trees, or, how to achieve and prove
     log log 𝑁-competitiveness by splaying. Inform. Process. Lett., 106(1):37–43, 2008.
     [doi:10.1016/j.ipl.2007.10.001] 3, 8

[17] Dion Harmon: New Bounds on Optimal Binary Search Trees. Ph. D. thesis, MIT, 2006. MIT. 3

[18] John Iacono: In pursuit of the dynamic optimality conjecture. In A. Brodnik, A. López-Ortiz,
     V. Raman, and A. Viola, editors, Space-Efficient Data Structures, Streams, and Algorithms,
     volume 8066 of LNCS, pp. 236–250. Springer, 2013. [doi:10.1007/978-3-642-40273-9_16] 4, 6,
     12, 26, 36

[19] John Iacono and Stefan Langerman: Weighted dynamic finger in binary search trees. In
     Proc. 27th Ann. ACM–SIAM Symp. on Discrete Algorithms (SODA’16), pp. 672–691. SIAM,
     2016. [doi:10.1137/1.9781611974331.ch49] 3

[20] László Kozma: Binary search trees, rectangles and patterns. Ph. D. thesis, Saarland University,
     Saarbrücken, Germany, 2016. Saarland U. 4, 6

[21] Victor Lecomte and Omri Weinstein: Settling the relationship between Wilber’s bounds
     for dynamic optimality. In Proc. 28th Eur. Symp. Algorithms (ESA’20), pp. 68:1–21.
     Schloss Dagstuhl–Leibniz-Zentrum fuer Informatik, 2020. [doi:10.4230/LIPIcs.ESA.2020.68,
     arXiv:1912.02858] 4, 5, 7

[22] Caleb C. Levy and Robert E. Tarjan: A new path from splay to dynamic optimality. In
     Proc. 30th Ann. ACM–SIAM Symp. on Discrete Algorithms (SODA’19), pp. 1311–1330. SIAM,
     2019. [doi:10.1137/1.9781611975482.80] 37

[23] Joan M. Lucas: On the competitiveness of splay trees: Relations to the union-find problem.
     In Lyle A. McGeoch and Daniel D. Sleator, editors, On-line Algorithms, volume 7 of
     DIMACS Ser. in Discrete Math. and Theor. Comp. Sci., pp. 95–124. Amer. Math. Soc., 1992.
     [doi:10.1090/dimacs/007] 3

                     T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                        69
            PARINYA C HALERMSOOK , J ULIA C HUZHOY, AND T HATCHAPHOL S ARANURAK

[24] Seth Pettie: Splay trees, Davenport-Schinzel sequences, and the Deque Conjecture. In Proc.
     19th Ann. ACM–SIAM Symp. on Discrete Algorithms (SODA’08), pp. 1115–1124. SIAM, 2008.
     [doi:10.5555/1347082.1347204, arXiv:0707.2160] 3

[25] Daniel Dominic Sleator and Robert Endre Tarjan: Self-adjusting binary search trees. J.
     ACM, 32(3):652–686, 1985. Preliminary version in STOC’83. [doi:10.1145/3828.3835] 3

[26] Rajamani Sundar: On the Deque Conjecture for the splay algorithm. Combinatorica,
     12(1):95–124, 1992. [doi:10.1007/BF01191208] 3

[27] Robert Endre Tarjan: Sequential access in splay trees takes linear time. Combinatorica,
     5(4):367–378, 1985. [doi:10.1007/BF02579253] 3

[28] Chengwen C. Wang, Jonathan C. Derryberry, and Daniel Dominic Sleator: O(log log 𝑁)-
     competitive dynamic binary search trees. In Proc. 17th Ann. ACM–SIAM Symp. on Discrete
     Algorithms (SODA’06), pp. 374–383. SIAM, 2006. ACM DL. 3, 4, 6, 8

[29] Robert E. Wilber: Lower bounds for accessing binary search trees with rotations. SIAM J.
     Comput., 18(1):56–67, 1989. Preliminary version in FOCS’86. [doi:10.1137/0218004] 3, 6, 12,
     25, 26, 37, 65


AUTHORS

     Parinya Chalermsook
     Associate professor
     Department of Computer Science
     Aalto University
     Espoo, Finland
     chalermsook gmail com
     https://sites.google.com/site/parinyachalermsook/


     Julia Chuzhoy
     Professor
     Toyota Technological Institute at Chicago
     Chicago, IL, USA
     cjulia ttic edu
     https://home.ttic.edu/~cjulia/




                    T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                     70
           P INNING D OWN THE S TRONG W ILBER -1 B OUND FOR B INARY S EARCH T REES

    Thatchaphol Saranurak
    Assistant professor
    University of Michigan
    Ann Arbor, MI, USA
    thsa umich edu
    https://sites.google.com/site/thsaranurak/


ABOUT THE AUTHORS

    Parinya Chalermsook is a faculty member at the Department of Computer Science,
       Aalto University (Finland). He completed his Ph. D. at The University of Chicago
       under the supervision of Julia Chuzhoy and Janos Simon. He was at Max Planck
       Institute for Informatics as a postdoc and a senior research scientist from 2013 to
       2016. Parinya is broadly interested in algorithms and extremal combinatorics.
       When he is not doing mathematics, he enjoys reading about political philosophy.


    Julia Chuzhoy is a Professor at the Toyota Technological Institute at Chicago. She
       completed her Ph. D. in Technion, Israel, and spent three years as a postdoctoral
       scholar at MIT, the University of Pennsylvania and the Institute for Advanced
       Study in Princeton. She mainly works on algorithms for graph problems. In her
       spare time she likes to read books, learns to play piano and studies French.


    Thatchaphol Saranurak is a faculty member of the Electrical Engineering and
      Computer Science Department at the University of Michigan. Prior to this, he
      spent two years as a research assistant professor at the Toyota Technological
      Institute at Chicago. Thatchaphol received his Ph. D. from KTH Royal Institute
      of Technology, Stockholm, in 2018 under the supervision of Danupon Nanongkai.
      His main research interest is in graph algorithms with a current focus on dynamic,
      local, and distributed algorithms. He likes sushi and Japanese manga.




                   T HEORY OF C OMPUTING, Volume 19 (8), 2023, pp. 1–71                      71