TCS: Randomized Computation

Randomized Computation

Randomized Algorithm

A randomized algorithm outputs the correct value with good probability on every possible input.

Matrix multiplication

Input: matrices $A, B, C$; decide whether $C = AB$.

There is an obvious deterministic polynomial-time algorithm: compute $AB$ directly (in $O(n^{3})$ time, or faster with fast matrix multiplication) and compare it with $C$.

A randomized algorithm: Freivalds' algorithm

  1. Repeat the following $k$ times.
    1. Choose $v\in \{0, 1\}^{n}$ uniformly at random
    2. Compute $d = A(Bv) - Cv$
    3. Reject if $d\neq 0$
  2. Accept

This algorithm runs in $O(kn^{2})$ time, since each iteration needs only three matrix-vector products, and its probability of failure is $\leq 2^{-k}$.
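A minimal sketch of Freivalds' algorithm in Python, representing matrices as plain lists of rows (the function name and representation are illustrative, not from the source):

```python
import random

def freivalds(A, B, C, k=20):
    """Decide whether C == AB with failure probability <= 2^-k.

    A, B, C are n x n matrices given as lists of rows.
    Each iteration costs three O(n^2) matrix-vector products.
    """
    n = len(A)

    def matvec(M, v):
        # O(n^2) matrix-vector product
        return [sum(M[i][j] * v[j] for j in range(n)) for i in range(n)]

    for _ in range(k):
        v = [random.randint(0, 1) for _ in range(n)]  # v uniform in {0,1}^n
        d = [a - c for a, c in zip(matvec(A, matvec(B, v)), matvec(C, v))]
        if any(x != 0 for x in d):  # d = A(Bv) - Cv
            return False  # a witness was found: certainly C != AB
    return True  # C == AB with probability >= 1 - 2^-k
```

Note that a `False` answer is always correct; only `True` can be wrong, with probability at most $2^{-k}$.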

Proof:

If $AB \neq C$, we prove that $P(d = 0) \leq \frac{1}{2}$ in each iteration.
Since $D = AB - C \neq 0$, pick an entry $D_{ij} \neq 0$.
The $i$-th entry of $d$ satisfies:

d_{i} = \sum\limits_{k} D_{ik}v_{k} = D_{ij}v_{j} + \sum\limits_{k\neq j}D_{ik}v_{k}

Let $s = \sum\limits_{k\neq j}D_{ik}v_{k}$, so:

\begin{align*}P(d_{i} = 0) &= P(d_{i} = 0 \,|\, s = 0)P(s = 0) \\&\quad +P(d_{i} = 0 \,|\, s\neq 0)P(s\neq 0) \\&\leq P(v_{j} = 0)P(s = 0) + P(v_{j} = 1)P(s\neq 0)\\&= \frac{1}{2}\bigl(P(s = 0) + P(s\neq 0)\bigr) = \frac{1}{2}\end{align*}

Here we use that $v_{j}$ is independent of $s$: given $s = 0$, $d_{i} = 0$ forces $D_{ij}v_{j} = 0$, i.e. $v_{j} = 0$; given $s \neq 0$, $d_{i} = 0$ forces $v_{j} = 1$; and each of these events has probability $\frac{1}{2}$.

So $P(d = 0^{n}) \leq P(d_{i} = 0) \leq \frac{1}{2}$.

MAX-CUT Approximation

The decision version of MAX-CUT is NP-complete, so instead we look for a cut $C$ whose size is not far from that of an optimal cut $C^{*}$.
If $\text{size}_C \ge \alpha\,\text{size}_{C^*}$, we call $C$ an $\alpha$-approximation. There is an easy randomized $\frac{1}{2}$-approximation: assign each vertex independently and uniformly at random to side $0$ or side $1$.

\begin{equation*} \mathbb{E}(\text{size}_C) = \mathbb{E} \sum_{\{u, v\} \in E} 1_{x_u \ne x_v} = \frac{1}{2} |E| \ge \frac{1}{2} \text{size}_{C^*}. \end{equation*}

This only bounds the cut size in expectation, but the method of conditional expectations yields a cut that is always at least this large, provided we can compute the following quantity efficiently:

\begin{equation*} \mathbb{E}(\text{size}_C(x_1, \ldots, x_i, X_{i+1}, \ldots, X_{n})), \end{equation*}

Having fixed $x_1, \ldots, x_i$, we choose $x_{i+1}$ to maximize this conditional expectation, and repeat until all vertices are assigned.
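The method of conditional expectations can be sketched as follows. When deciding vertex $v$, edges whose other endpoint is still unassigned contribute $\frac{1}{2}$ to the conditional expectation under either choice, so only already-assigned neighbours matter (names here are illustrative):

```python
def greedy_maxcut(n, edges):
    """Derandomized 1/2-approximation for MAX-CUT via conditional
    expectations. Vertices are 0..n-1; edges is a list of pairs."""
    side = [None] * n
    for v in range(n):
        # For each possible side of v, count the edges to already
        # placed neighbours that would be cut; unplaced neighbours
        # contribute 1/2 either way, so they cancel in the comparison.
        gain = [0, 0]
        for u, w in edges:
            other = w if u == v else (u if w == v else None)
            if other is not None and side[other] is not None:
                gain[1 - side[other]] += 1
        side[v] = 0 if gain[0] >= gain[1] else 1
    cut = sum(1 for u, w in edges if side[u] != side[w])
    return side, cut  # cut >= |edges| / 2 is guaranteed
```

On a triangle, for example, this greedy assignment cuts 2 of the 3 edges, meeting the $\frac{|E|}{2}$ bound.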

Derandomize

The algorithm above uses $n$ random bits, covering $2^{n}$ possibilities. If we can reduce the randomness to polynomially many possibilities, we can derandomize the algorithm by enumerating all of them. Universal hash functions provide such a reduction:

Consider a family of hash functions $\mathcal{H} = \{ h : U \to R \}$. A universal family has random-like properties while being small, so a short seed suffices to choose a function from it.

Pairwise independent hash functions.

A family $\mathcal{H} = \{h : U \to R\}$ is called pairwise independent if for any distinct $x_{1}, x_{2}\in U$ and any $y_{1}, y_{2}\in R$, we have:

\begin{equation*}P_{h \in \mathcal{H}} \bigl( h(x_1) = y_1 \text{ and } h(x_2) = y_2 \bigr) = \frac{1}{|R|^2}.\end{equation*}

A pairwise independent family of hash functions mapping $\{0, 1\}^{k}$ to $\{0, 1\}$:

$\mathcal{H} = \{ h_{a,b}(x) = (\langle a, x\rangle + b) \bmod 2 \,|\, a \in \{0, 1\}^{k},\ b\in\{0, 1\} \}$, where $\langle a, x\rangle = \sum_{k} a_{k}x_{k}$ is the inner product.

This family has size $|\mathcal{H}| = 2^{k+1}$. Taking $k = \lceil \log n\rceil$, each vertex of $G$ can be encoded as an element of $U = \{0,1\}^{k}$, and $|\mathcal{H}| \leq 4n$. Since the expectation argument above only involves pairwise events $1_{x_u \neq x_v}$, pairwise independence suffices, so we can try every hash function in $\mathcal{H}$ and output the largest cut found.
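A sketch of this derandomization: each vertex index is viewed as a $k$-bit string and every $h_{a,b}$ in the family is tried (function names are illustrative):

```python
from itertools import product

def maxcut_via_hash_family(n, edges):
    """1/2-approximation for MAX-CUT by enumerating the pairwise
    independent family h_{a,b}(x) = (<a, x> + b) mod 2 over
    {0,1}^k with k = ceil(log2 n); only O(n) functions to try."""
    k = max(1, (n - 1).bit_length())  # k = ceil(log2 n) for n >= 2

    def bits(v):
        # encode vertex index v as a k-bit vector
        return [(v >> i) & 1 for i in range(k)]

    best_cut, best_side = -1, None
    for a in product([0, 1], repeat=k):  # 2^k choices of a
        for b in (0, 1):                 # 2 choices of b
            side = [(sum(ai * xi for ai, xi in zip(a, bits(v))) + b) % 2
                    for v in range(n)]
            cut = sum(1 for u, w in edges if side[u] != side[w])
            if cut > best_cut:
                best_cut, best_side = cut, side
    # Pairwise independence makes the average cut over the family
    # exactly |edges|/2, so the maximum is at least that.
    return best_side, best_cut
```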

BPP

Define a probabilistic Turing machine as follows:

A probabilistic Turing machine is a type of NTM in which each nondeterministic step is called a coin-flip step and has two legal next moves. We assign a probability $2^{-k}$ to each branch of the machine's computation, where $k$ is the number of coin-flip steps occurring on that branch.

The probability of the machine accepting the input is defined as

\begin{equation*}P(M \text{ accepts } w) = \sum_{b:b \text{ is accepting}} P(b).\end{equation*}

Equivalently, at each coin-flip step the machine takes each of the two legal next moves with probability $\frac{1}{2}$.
Define the error probability ε\varepsilon:

  1. If $w \in A$, then $P(M(w) = 1) \geq 1 - \varepsilon$
  2. If $w\notin A$, then $P(M(w) = 1) \leq \varepsilon$

Then we can define $\text{BPP}$ with error probability:

$\text{BPP}$ is the class of languages decided by probabilistic polynomial-time Turing machines with an error probability of $\frac{1}{3}$.
Actually, the $\frac{1}{3}$ can be replaced by any constant strictly less than $\frac{1}{2}$.

$\text{BPP}$ can also be defined with a verifier:

A decision problem $A$ is in $\text{BPP}$ if and only if there is a polynomial-time verifier $V$ such that for all $x$:

\begin{equation*}x \in A \implies P_{r} \bigl(V(x, r) = 1 \bigr) \ge \frac{2}{3}, \qquad x \notin A \implies P_{r} \bigl(V(x, r) = 1 \bigr) \le \frac{1}{3}.\end{equation*}

Error Reduction

Any decision problem $A\in\text{BPP}$ has a polynomial-time randomized algorithm whose error probability is $2^{-p(n)}$, where $p$ is a polynomial and $n$ is the input size.

This can be proved by running the algorithm polynomially many times, taking the majority vote, and applying a Chernoff bound (or the sampling theorem).
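The amplification step can be sketched as follows: run the algorithm $t$ times independently and output the majority answer, so that by a Chernoff bound the error drops exponentially in $t$ (the decider passed in is a hypothetical callable):

```python
import random
from collections import Counter

def amplify(decider, x, t):
    """Majority vote over t independent runs of a two-sided-error
    decider; the error probability drops as exp(-Omega(t))."""
    votes = Counter(decider(x) for _ in range(t))
    return votes.most_common(1)[0][0]

# A toy decider that answers correctly with probability 2/3
# (hypothetical, for illustration only):
def noisy_is_even(x):
    correct = (x % 2 == 0)
    return correct if random.random() < 2 / 3 else not correct
```

With $t$ odd there is never a tie, and the majority answer is wrong only if more than half of the runs err, an event of probability $2^{-\Omega(t)}$.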

Circuits vs. BPP

Define $\text{SIZE}_{n}(s)$:

For a finite function $g: \{0, 1\}^{n}\rightarrow\{0, 1\}$, $g \in \text{SIZE}_{n}(s)$ if there is a circuit of at most $s$ NAND gates computing $g$.

And we define the restricted function:

\begin{equation*} F_{\restriction n} (x) = F(x) \text{ for } x\in \{0,1\}^n. \end{equation*}

Then $F$ is non-uniformly computable in size $T(n)$, written $F\in\text{SIZE}(T)$, if there is a sequence $C_{0}, C_{1}, \dots$ of NAND circuits such that:

  1. $C_{n}$ computes $F_{\restriction n}$
  2. $C_{n}$ has at most $T(n)$ gates when $n$ is sufficiently large

So the non-uniform analog of $\text{P}$ is:

$\text{P}/\text{poly} = \bigcup\limits_{c\in\mathbb{N}}\text{SIZE}(n^{c})$

Obviously $\text{P}\subsetneq\text{P}/\text{poly}$, and $\text{BPP}\subseteq\text{P}/\text{poly}$ can be proved as follows:

Due to error reduction, any $A\in \text{BPP}$ has a polynomial-time randomized algorithm whose error probability is less than $2^{-n}$, which means there is a verifier $V$ such that

$\forall x \,\, P_{y}(V(x, y) \neq A(x)) < \frac{1}{2^{n}}$

So by the union bound, over all $2^{n}$ inputs $x$ of length $n$:

$P_{y}(\exists x\,\, V(x, y)\neq A(x)) \leq \sum\limits_{x}P_{y}(V(x, y)\neq A(x)) < 1$

Since this probability is less than $1$, there must exist some $y^{*}$ for which $\forall x\,\, V(x, y^{*}) = A(x)$.
Thus there exists a circuit with $\text{poly}(n)$ gates computing $A$ on inputs of length $n$: $y^{*}$ has polynomial length and $V$ runs in polynomial time, so $V(\cdot, y^{*})$ can be hardwired into a polynomial-size circuit.

P = NP implies P = BPP

Sipser–Gács theorem: $\text{BPP} \subseteq \Sigma^{P}_{2} \cap \Pi_{2}^{P}$, where $\Sigma_{i}^{P}$ and $\Pi_{i}^{P}$ are defined as:

\begin{align*} \Sigma_{i}^{P} &= \underbrace{\exists\forall\exists\cdots}_{i \text{ quantifiers}}\, \text{P} \\ \Pi_{i}^{P} &= \underbrace{\forall\exists\forall\cdots}_{i \text{ quantifiers}}\, \text{P} \end{align*}

And we have the following theorem:

$\text{P} = \text{NP}$ implies $\text{P} = \text{BPP}$

The proof is difficult and uses the probabilistic method.

There are also results relating $\text{BPP}$ to $\text{P}$, $\text{NP}$, and $\text{EXP}$:

Relations with P NP EXP

We know $\text{P} \subsetneq \text{EXP}$ and $\text{BPP} \subseteq \text{EXP}$.

  • Expected: $\text{P} = \text{BPP} \subsetneq \text{NP} \subseteq \text{EXP}$
  • One extreme: $\text{P} \subsetneq \text{NP} \subseteq \text{BPP} = \text{EXP}$
  • Another extreme: $\text{P} = \text{BPP} = \text{NP} \subsetneq \text{EXP}$