## 1 Lower Bound

In this section we give several lower bounds for algorithms where all logical READs use $\leq \log _{2}(k)$ physical reads.

- Theorem 1. For any wait-free algorithm $A$ that simulates a $k$-valued safe register using binary atomic registers, if the READ algorithm uses at most $\log _{2}(k)$ reads in the worst case, then the Write algorithm uses at least $\log _{2}(k)$ writes in the worst case.

Proof. Consider the following set $S$ of executions of $A: \sigma_{i}=\operatorname{Write}\left(v_{i}\right) \alpha_{i} \operatorname{Ack} \operatorname{ReaD}_{1} \beta_{i}$ $\operatorname{Return}\left(w_{i}\right) \mid$ only the $W P$ takes steps in $\alpha_{i}$ and only $R P_{1}$ takes steps in $\beta_{i}, 0 \leq i<k$. Note that $w_{i}$ must equal $v_{i}$ to satisfy safety.

Now construct a decision tree of $R P_{1}$ 's behavior in the executions in $S$. The root of the decision tree corresponds to the first register that $R P_{1}$ reads, this register is the same for all executions in $S$ as $R P_{1}$ always begins in the same state. Now depending on the first value read, $R P_{1}$ does some amount of non-read actions then does another read of some register. Because the root only can store two values there are only two registers $R P_{1}$ can decide to read from causing the root to have two children. Continue to build the tree like this. Add leaves to indicate which value $R P_{1}$ will Return.

- Lemma 2. The decision tree is a complete binary tree with exactly $k$ leaves in which every leaf is at depth $\log _{2}(k)$ and the path from the root to the leaf labeled $v_{i}$ corresponds to $\beta_{i} \in \sigma_{i}$.

Proof. Note that there must be at least k leaves in the decision tree, as each value in V is returned in one of the executions in $S$. Since the decision tree is binary, basic facts from graph theory imply that each leaf must be at depth $\log _{2}(k)$, allowing exactly k leaves and each root-to-leaf path must correspond to a different execution in S.

- Lemma 3. None of the READs in an execution in $S$ reads the same register more than once.

Proof. Suppose in contradiction the READ in $\sigma_{i}$ reads some number of registers more than once, for some $i$. Let $x$ be the first register that $R P_{1}$ reads twice, say the $a$-th read and the $b$-th read, with $\log _{2}(k) \geq b>a$. Consider the node in the decision tree for the b-th read in the path for $\sigma_{i}$. This node must have two children since the decision tree is complete. Consider a root-to-leaf path $\pi$ in the decision tree that forks off from the path corresponding to $\sigma_{i}$ at this node. By Lemma 1, there is a one-to-one correspondence between root-to-leaf paths in the decision tree and the set of executions and so $\pi$ corresponds to $\sigma_{j}$ in $S$, for some $j \neq i$.

However, it is not possible for $\sigma_{i}$ and $\sigma_{j}$ to read different values from x at the b-th read since the only process that is taking steps during the READs is $R P_{1}$ : even if $R P_{1}$ writes to $x$ during the READ, it will write the same value in $\sigma_{i}$ as in $\sigma_{j}$, as nothing differs in those two executions until reaching the $b$-th read.

Now we can finish the proof of the theorem. Let $x_{1}$ be the first register read in the READ of each execution in $S$. Note that in half of the elements of $S$, the value read from $x_{1}$ in the READ must be different from the initial value of $x_{1}$, in order to be able to reach all the leaves on that half of the decision tree. In other words, the WRITE writes to $x_{1}$ in half the executions in $S$. Let $S_{1}$ be the subset of $S$ consisting of the executions in which the Write writes $x_{1}$; note that $\left|S_{1}\right|=\frac{k}{2}$. Now let $x_{2}$ be the register corresponding to the root of subtree in which $x_{1}$ was written. A similar argument shows that in half the executions
in $S_{1}$, the value read from $x_{2}$ in the READ must be different from the initial value of $x_{2}$. Let $S_{2}$ be the subset of $S_{1}$ consisting of the executions in which the Write writes to $x_{2}$; note that $\left|S_{2}\right|=\frac{k}{2^{2}}$. Continuing this way we obtain $S_{\text {log }_{2} k}$, which is of size 1, in which the Write writes to $x_{1}, x_{2}, \ldots, x_{\log _{2} x}$. By Lemma 2, each of these registers is distinct and thus the WRITE writes to at least $\log _{2}(k)$ registers.

- Theorem 4. For any wait-free algorithm $A$ that simulates a $k$-valued regular register using binary atomic registers, if the READ algorithm uses at most $\log _{2}(k)$ reads in the worst case, $A$ requires at least $k-1$ registers.

Proof. Consider the following executions which can be created by shifting $R P_{1}$ :

- $\operatorname{Write}(v) \alpha_{v} \operatorname{Ack} \operatorname{Read}_{1} \beta_{v} \operatorname{Return}(v) \operatorname{Write}(w) \alpha_{w_{1}} \alpha_{w_{2}} \operatorname{Ack}$
- $\operatorname{Write}(v) \alpha_{v} \operatorname{Ack} \operatorname{Write}(w) \alpha_{w_{1}} \operatorname{Read}_{1} \beta_{q} \operatorname{Return}(q) \alpha_{w_{2}} \operatorname{Ack}$
- $\operatorname{Write}(v) \alpha_{v} \operatorname{Ack} \operatorname{Write}(w) \alpha_{w_{1}} \alpha_{w_{2}} \operatorname{Ack}_{\operatorname{Read}_{1} \beta_{w} \operatorname{Return}(w)}^{(w)}$

Because these executions can be created by shifting $R P_{1}, R P_{1}$ can be in the same state each time it initiates its read, we will assume this is the case for these three executions. Due to $R P_{1}$ starting in the same state for each execution we can then construct a decision tree for $R P_{1}$ in the same manor as what was done in Theorem 1 , this tree will be identical for all three executions. Now because $R P_{1}$ uses the same algorithm and starts in the same state in each execution there exists a well defined function $f$ which maps the sequence of physical returns in each Read to the value Returned by $R P_{1}$, where $f$ is the same in each execution. From Lemma 2 each sequence of physical returns which is mapped by $f$ has a length of $\log _{2}(k)$, and because $f$ must map to $k$ distinct values, $f$ must have a $1-1$ mapping of sequences to values. Let $\gamma_{v}, \gamma_{q}$, and $\gamma_{w}$ represent the sequence of physical returns in $\beta_{v}$, $\beta_{q}$, and $\beta_{w}$ respectively. Note that to satisfy regularity $f\left(\gamma_{q}\right)=q$ must equal either $v$ or $w$. Now let v and w be the values such that the first $j$ elements of $\gamma_{v}$ and $\gamma_{w}$ are equal where $0 \leq j<\log _{2}(k)-1$. Additionally the $(j+1)$-th element of $\gamma_{v}$ and $\gamma_{w}$ are 0 and 1 respectively This occurs once more, let the $J$-th element, where $j<J \leq \log _{2}(k)$, of $\gamma_{v}$ and $\gamma_{w}$ equal 0 and 1 respectively as well. Let $t$ be the node in the decision tree of $R P_{1}$ which corresponds to the $(j+1)$-th physical read, note that $t$ must read the same physical register, $T$, for all three executions. Now let $l$ be the decedent of $t$ in the decision tree which corresponds to the $J$-th read in the first execution, and let $r$ be the decedent of $t$ in the decision tree which corresponds to the $J$-th read in the third execution. Note that $l$ and $r$ are on the same level of the decision tree. Finally say that the last physical write in $\alpha_{w_{1}}$ is the first and only physical write in $\alpha_{w_{1}}$ which writes to either $T$ or $R$.

Assume for contradiction that the physical register read by $l$ and $r$ are the same physical register, $R$.

It is easy to see that the first and third executions can easily be fulfilled even with our previous assumption. However, the second execution is not so simple. As stated previously, in order to uphold regularity we have with two scenarios for the second execution, one where $p=v$ and another where $p=w$.

For the first scenario it is clear that $T$ cannot be written to in $\alpha_{w_{1}}$ otherwise $\gamma_{v}$, the only sequence which $f$ maps to $v$, would immediately not equal $\gamma_{q}$. Following this means that $R$ must be the last physical register written to in $\alpha_{w_{1}}$. Now in order for $p$ to equal $v$, the $J$-th physical read in the second execution must equal what was read in the first execution. However as a result from both executions using the same decision tree, the first and second execution will both read the register $R$, but in the first execution this read will return 0 and in the second this read will return 1 meaning $p \neq v$ causing a contradiction.

This leaves us with the second possibility, and it is clear that $R$ cannot be written to in $\alpha_{w_{1}}$ otherwise $\gamma_{w}$, the only sequence which $f$ maps to $w$, would not equal $\gamma_{q}$. It then follows that $T$ must be the register written to in $\alpha_{w_{1}}$. Now because the second and third executions use the same decision tree both of their $J$-th reads will read the same physical register, the register $R$. However because $R$ was changed from 0 to 1 in the third execution and remained at 0 in the second execution $\gamma_{p} \neq \gamma_{w}$ which implies that $q \neq w$.

This means that for all $l$ and $r$, if $l$ and $r$ read from the same register there is no way to uphold regularity. In other words no two nodes on the same level of the decision tree which have a common ancestor can read from the same register. Combining this with Lemma 3 implies that no nodes in a decision tree may read from the same physical register. Because the decision tree has exactly $k-1$ nodes, at least $k-1$ registers must be used to implement $A$.

