1 Introduction

It is widely believed that many important problems in NP such as satisfiability, clique, and discrete logarithm are exponentially hard to solve. The existence of such intractable problems has a bright side: research has shown that we can use this kind of intractability to our advantage to gain a better understanding of computational complexity, for derandomizing probabilistic computations, and for designing computationally-secure cryptographic primitives. For example, if there is a problem in EXP (such as any of the aforementioned problems) that has \(2^{n^{\Omega(1)}}\)-size worst-case circuit complexity (i.e., for all sufficiently large n, no subexponential-size circuit solves the problem correctly on all instances of size n), then it can be used to construct pseudorandom generators [9, 32]. Using these pseudorandom generators, BPP problems can be solved in deterministic quasipolynomial time. Similar average-case hardness assumptions on the discrete logarithm and factoring problems have important ramifications in cryptography. While these hardness assumptions have been widely used in cryptography and derandomization, more recently Agrawal [3] and Agrawal and Watanabe [5] showed that they are also useful for improving our understanding of NP-completeness. In this paper, we provide further applications of such hardness assumptions.

1.1 Length-Increasing Reductions

A language in NP is NP-complete if every language in NP is reducible to it. While there are several ways to define the notion of reduction, the most common definition uses polynomial-time computable many-one functions. Many natural problems that arise in practice have been shown to be NP-complete using polynomial-time computable many-one reductions. However, it has been observed that all known NP-completeness results hold when we restrict the notion of reduction. For example, SAT is complete under polynomial-time reductions that are one-to-one and length-increasing. In fact, all known many-one complete problems for NP are complete under this type of reduction [11]. This raises the following question: are there languages that are complete under polynomial-time many-one reductions but not complete under polynomial-time, one-to-one, length-increasing reductions? Berman [7] showed that every many-one complete set for E is complete under one-to-one, length-increasing reductions. Thus for E, these two completeness notions coincide. A weaker result is known for NE. Ganesan and Homer [19] showed that all NE-complete sets are complete via one-to-one reductions that are exponentially honest.

For NP, until recently there had not been any progress on this question. Agrawal [3] showed that if one-way permutations exist, then all NP-complete sets are complete via one-to-one, length-increasing reductions that are computable by polynomial-size circuits. Hitchcock and Pavan [23] showed that NP-complete sets are complete under length-increasing P/poly reductions under the measure hypothesis on NP [30]. Recently Buhrman et al. improved the latter result to show that if the measure hypothesis holds, then all NP-complete sets are complete via length-increasing, polynomial-time computable functions that require O(log log n) bits of advice [13]. More recently, Agrawal and Watanabe [5] showed that if regular one-way functions exist, then all NP-complete sets are complete via one-to-one, length-increasing, P/poly-computable reductions. All the hypotheses used in these works require the existence of an almost-everywhere hard language or an average-case hard language in NP.

In the first part of this paper, we consider hypotheses that only concern the worst-case hardness of languages in NP. Our first hypothesis concerns the deterministic time complexity of languages in NP. We show that if there is a language in NP for which every correct algorithm spends more than \(2^{n^{\epsilon}}\) time at almost all lengths, then NP-complete languages are complete via length-increasing P/poly-computable reductions. The second hypothesis concerns nondeterministic circuit complexity of languages in co-NP. We show that if there is a language in co-NP that cannot be solved by nondeterministic polynomial-size circuits, then all NP-complete sets are complete via length-increasing P/poly-computable reductions. For more formal statements of the hypotheses, we refer the reader to Sect. 3. We stress that these hypotheses require only worst-case hardness. The worst-case hardness is of course required at every length, a technical condition that is necessary in order to build a reduction that works at every length rather than just infinitely often.

1.2 Turing Reductions versus Many-One Reductions

In the second part of the paper we study the completeness notion obtained by allowing a more general type of reduction—Turing reduction. Informally, with Turing reductions an instance of a problem can be solved by asking polynomially many (adaptive) queries about instances of the other problem. A language in NP is Turing complete if there is a polynomial-time Turing reduction to it from every other language in NP. Though many-one completeness is the most commonly used completeness notion, Turing completeness also plays an important role in complexity theory. Several properties of Turing complete sets are closely tied to the separation of complexity classes. For example, Turing complete sets for EXP are sparse if and only if EXP has polynomial-size circuits. Moreover, to capture our intuition that if a complete problem is easy, then the entire class is easy, Turing reductions seem to be the “correct” reductions to define completeness. In fact, the seminal paper of Cook [15] used Turing reductions to define completeness, though Levin [28] used many-one reductions.

This raises the question of whether there is a Turing complete language for NP that is not many-one complete. Ladner, Lynch and Selman [29] posed this question in 1975, thus making it one of the oldest problems in complexity theory. This question is completely resolved for exponential time classes such as EXP and NEXP [14, 37]. We know that for both these classes many-one completeness differs from Turing completeness. However, progress on the NP side has been very slow. Lutz and Mayordomo [31] were the first to provide evidence that Turing completeness differs from many-one completeness. They showed that if the measure hypothesis holds, then the completeness notions differ. Since then a few other weaker hypotheses have been used to achieve the separation of Turing completeness from many-one completeness [4, 24, 33–35].

All the hypotheses used in the above works are considered “strong” hypotheses as they require the existence of an almost everywhere hard language in NP. That is, there is a language L in NP such that every algorithm that decides L takes exponential time on all but finitely many strings. A drawback of these hypotheses is that we do not have any candidate languages in NP that are believed to be almost everywhere hard.

It has been open whether we can achieve the separation using more believable hypotheses that involve average-case hardness or worst-case hardness. None of the proof techniques used earlier seem to achieve this, as they crucially depend on the almost everywhere hardness.

In this paper, for the first time, we achieve the separation between Turing completeness and many-one completeness using average-case and worst-case hardness hypotheses. We consider two hypotheses. The first hypothesis states that there exist \(2^{n^{\epsilon}}\)-secure one-way permutations, and the second hypothesis states that there is a language in NEEE∩coNEEE that cannot be solved in triple exponential time with logarithmic advice, i.e., \(\mathrm{NEEE}\cap\mathrm{coNEEE}\not\subseteq\mathrm {EEE}/\log\). We show that if both of these hypotheses are true, then there is a Turing complete language in NP that is not many-one complete.

The first hypothesis is an average-case hardness hypothesis and has been studied extensively in the past. The second hypothesis is a worst-case hardness hypothesis. At first glance, this hypothesis may look a little esoteric; however, it is only used to obtain hard tally languages in NP∩co-NP that are sufficiently sparse. Similar hypotheses involving double and triple exponential-time classes have been used earlier in the literature [8, 16, 17, 22].

We use length-increasing reductions as a tool to achieve the separation of Turing completeness from many-one completeness. We first show that if one-way permutations exist then NP-complete sets are complete via length-increasing, polynomially-bounded, quasipolynomial-time computable reductions. We then show that if the second hypothesis holds, then there is a Turing complete language for NP that is not many-one complete via length-increasing, polynomially-bounded, quasipolynomial-time computable reductions. Combining these two results we obtain our separation result.

2 Preliminaries

In the paper, we use the binary alphabet Σ={0,1}. Given a language A, \(A_n\) denotes the characteristic sequence of A at length n. We also view \(A_n\) as a boolean function from \(\Sigma^n\) to Σ. We use \(A^{\leq n}\) to denote the set of strings in A whose length is at most n, and use \(A^{=n}\) to denote the set of strings in A whose length is exactly n.

For languages A and B, we say that \(A =_{\mathrm{io}} B\) if \(A_n = B_n\) for infinitely many n. For a complexity class \(\mathcal{C}\), we say that \(A \in\operatorname{io}\mathcal{C}\) if there is a language \(B \in {\mathcal{C}}\) such that \(A =_{\mathrm{io}} B\).

For a boolean function \(f:\Sigma^n \rightarrow\Sigma\), CC(f) is the smallest number s such that there is a circuit of size s that computes f. A function f is quasipolynomial-time computable (QP-computable) if it can be computed deterministically in time \(O(2^{\log^{O(1)} n})\). We will use the triple exponential time class \(\mathrm{EEE}=\mathrm{DTIME}(2^{2^{2^{O(n)}}})\), and its nondeterministic counterpart NEEE.

A language A is in NP/poly if there is a polynomial-size circuit family \(C=(C_1, C_2, \ldots)\) and a polynomial p such that for every x, x is in A if and only if there is a y of length p(|x|) such that \(C_{|x|}(x,y)=1\). Let f:ℕ→ℕ be a function such that f(n)<n.

Definition

A language L is in EEE/f(n) if there is a function \(g:0^* \rightarrow\Sigma^*\) such that \(|g(0^n)| \leq f(n)\) and a triple exponential time algorithm M such that x is in L if and only if M accepts \(\langle x, g(0^{|x|})\rangle\).

We define EEE/O(log n) to be \(\bigcup_{c>0}\mathrm{EEE}/c\log n\).

Definition

Given a language A, let fold(A) be the language such that for every n, \(\mathit{fold}(A)_n = A_0 A_1 \cdots A_{n-1} 0\).

Our proofs make use of a variety of results from approximable sets, instance compression, derandomization, and hardness amplification. We mention the results that we need.

Definition 1

A language A is t(n)-time 2-approximable [6] if there is a function f computable in time t(n) such that for all strings x and y, f(x,y)≠A(x)A(y).

A language A is io-lengthwise t(n)-time 2-approximable if there is a function f computable in time t(n) such that for infinitely many n, for every pair of n-bit strings x and y, f(x,y)≠A(x)A(y). We say that f is the approximator of A.

Let A be an io-lengthwise t(n)-time 2-approximable language and let f be its approximator. Given a natural number m>0, we say that A is 2-approximable at length m if for every pair of m-bit strings x and y, f(x,y)≠A(x)A(y).
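As a toy illustration of this definition, the condition f(x,y)≠A(x)A(y) can be checked mechanically over all pairs at a given length. The language `A` (parity of ones) and the approximator `f` below are hypothetical examples of ours, not objects from the paper:

```python
def A(x: str) -> int:
    """Hypothetical toy language: strings with an even number of 1s."""
    return 1 if x.count("1") % 2 == 0 else 0

def f(x: str, y: str) -> str:
    """A trivial approximator: since this toy A is decidable, f can rule
    out the true value by flipping both bits of A(x)A(y)."""
    return f"{1 - A(x)}{1 - A(y)}"

def two_approximable_at(f, A, m: int) -> bool:
    """Check that f(x, y) != A(x)A(y) for every pair of m-bit strings."""
    strings = [format(i, f"0{m}b") for i in range(2 ** m)]
    return all(f(x, y) != f"{A(x)}{A(y)}" for x in strings for y in strings)

assert two_approximable_at(f, A, 3)
```

Any function that always avoids the true value pair witnesses 2-approximability at that length; a function that ever hits it does not.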

Amir, Beigel and Gasarch [2] proved that every polynomial-time 2-approximable set is in P/poly. Their proof also implies the following extension.

Theorem 2.1

[2]

If A is io-lengthwise t(n)-time 2-approximable, then for infinitely many n, \(\mathit{CC}(A_n) \leq t^2(n)\).

If a language is disjunctively self-reducible and is 2-approximable, then it is in P. See the survey paper by Buhrman, Fortnow and Thierauf [10] for a proof. Though this proof carries over to other time bounds, it does not carry over to the infinitely-often setting. This is because the proof makes use of the fact that the approximator has the ability to compare strings at different lengths. However, if we use the folded language fold(A), then the proof carries through.

Theorem 2.2

Let A be a disjunctively self-reducible language. If fold(A) is io-lengthwise t(n)-time 2-approximable, then there is an algorithm M that decides A and M runs in time \(t^2(n)\) for infinitely many n. In addition, if fold(A) is 2-approximable at length n, then M runs in \(t^2(n)\) time on all strings of length n−1.

Given a language H′ in co-NP, let H be \(\{\langle x_{1}, \ldots,x_{n}\rangle \mid |x_{1}| = \cdots= |x_{n}| = n,\allowbreak {x_{i} \in H'}, 1 \leq i \leq n\}\). Observe that an n-tuple consisting of strings of length n can be encoded by a string of length \(n^2\). From now on we view a string of length \(n^2\) as an n-tuple of strings of length n.
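One concrete way to realize this encoding convention is simple concatenation; the helper names below are ours, introduced only for illustration:

```python
def encode_tuple(xs):
    """Encode an n-tuple of n-bit strings as one string of length n^2
    by concatenation."""
    n = len(xs)
    assert all(len(x) == n for x in xs), "need n strings of length n"
    return "".join(xs)

def decode_tuple(s):
    """Split a string of length n^2 back into an n-tuple of n-bit strings."""
    n = round(len(s) ** 0.5)
    assert n * n == len(s), "length must be a perfect square"
    return [s[i * n:(i + 1) * n] for i in range(n)]

t = ["010", "111", "001"]
assert len(encode_tuple(t)) == 9
assert decode_tuple(encode_tuple(t)) == t
```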

Theorem 2.3

[12, 18]

Let H and H′ be defined as above. Suppose there is a language L, a polynomial-size circuit family \(\{C_m\}\), and a polynomial p such that for infinitely many n, for every \(x \in\Sigma^{n^{2}}\), x is in H if and only if there is a string y of length p(n) such that \(C_{n^2}(x,y)\) is in \(L^{\leq n}\). Then H′ is in io NP/poly.

The proof of Theorem 2.3 is similar to the proofs in [12, 18]. The difference is that rather than having a polynomial-time many-one reduction, here we have an NP/poly many-one reduction which works infinitely often. The nondeterminism and advice in the reduction can be absorbed into the final NP/poly decision algorithm. The NP/poly decision algorithm works correctly at infinitely many lengths, corresponding to the lengths at which the NP/poly reduction works.

Definition

A function \(f:\{0,1\}^n\rightarrow\{0,1\}^m\) is s-secure if for every δ<1, every \(t \leq\delta s\), and every circuit \(C:\{0,1\}^n\rightarrow\{0,1\}^m\) of size t, \(\Pr[C(x)=f(x)]\leq 2^{-m}+\delta\). A function \(f:\{0,1\}^*\rightarrow\{0,1\}^*\) is s(n)-secure if it is s(n)-secure at all but finitely many lengths n.

Definition

An s(n)-secure one-way permutation is a polynomial-time computable bijection \(\pi:\{0,1\}^*\rightarrow\{0,1\}^*\) such that |π(x)|=|x| for all x and \(\pi^{-1}\) is s(n)-secure.

Under widely believed average-case hardness assumptions about the hardness of the RSA cryptosystem or the discrete logarithm problem, there is a secure one-way permutation [21].

Definition

A pseudorandom generator (PRG) family is a collection of functions \(G=\{G_n:\{0,1\}^{m(n)}\rightarrow\{0,1\}^n\}\) such that \(G_n\) is uniformly computable in time \(2^{O(m(n))}\) and for every circuit C of size n,

$$\Bigl|\Pr_{x \in\{0,1\}^n}[C(x)=1] - \Pr_{y \in\{0, 1\}^{m(n)}}[C(G_n(y))=1]\Bigr| \leq\frac{1}{n}.$$

There are many results showing that the existence of hard functions in exponential time implies that PRGs exist. We will use the following.

Theorem 2.4

[26, 32]

If there is a language A in E such that \(\mathit{CC}(A_{n}) \geq 2^{n^{\epsilon}}\) for all sufficiently large n, then there exist a constant k and a PRG family \(G = \{G_{n}:\{0,1\}^{\log^{k} n} \rightarrow\{0,1\}^{n}\}\).

3 Length-Increasing Reductions

In this section we provide evidence that many-one complete sets for NP are complete via length-increasing reductions. We use the following hypotheses.

Hypothesis 1

There is a language L in NP and a constant ϵ>0 such that L is not in \(\operatorname{io}\mathrm {DTIME}(2^{n^{\epsilon}})\).

Informally, this means that every algorithm that decides L takes more than \(2^{n^{\epsilon}}\)-time on at least one string at every length.

Hypothesis 2

There is a language L in co-NP such that L is not in io NP/poly.

This means that every nondeterministic polynomial size circuit family that attempts to solve L is wrong on at least one string at each length.

We will first consider the following variant of Hypothesis 1.

Hypothesis 3

There is a language L in NP and a constant ϵ>0 such that for all but finitely many n, \(\mathit {CC}(L_{n}) >2^{n^{\epsilon}}\).

We will first show that if Hypothesis 3 holds, then NP-complete sets are complete via length-increasing, P/poly reductions. Then we describe how to modify the proof to derive the same consequence under Hypothesis 1. We do this because the proof is much cleaner with Hypothesis 3 and contains all essential ideas. To use Hypothesis 1 we have to fix encodings of boolean formulas and work with folded languages.

3.1 If NP has Subexponentially Hard Languages

Theorem 3.1

If there is a language L in NP and an ϵ>0 such that for all but finitely many n, \(\mathit{CC}(L_{n}) > 2^{n^{\epsilon}}\), then all NP-complete sets are complete via length-increasing, P/poly reductions.

Proof

Let A be an NP-complete set that is decidable in time \(2^{n^{k}}\). Let L be a language in NP that requires \(2^{n^{\epsilon}}\)-size circuits at every length. Since SAT is complete via polynomial-time, length-increasing reductions, it suffices to exhibit a length-increasing P/poly-reduction from SAT to A.

Let \(\delta= \frac{\epsilon}{2k}\). Consider the following intermediate language

$$S = \{ \langle x, y, z\rangle \mid |x| = |z|= \lceil|y|^{1/\delta }\rceil,\mbox{\texttt{MAJ}}[L(x), \mathrm{SAT}(y), L(z)] = 1 \} ,$$

where MAJ is the majority function.
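A minimal sketch of the membership condition for S, assuming hypothetical 0/1 oracles `L_oracle` and `SAT_oracle` standing in for the NP verifications (the paper of course provides no such oracles; they only mark where those checks would sit):

```python
from math import ceil

def in_S(x: str, y: str, z: str, L_oracle, SAT_oracle, delta: float) -> int:
    """<x, y, z> is in S iff |x| = |z| = ceil(|y|^(1/delta)) and the
    majority of L(x), SAT(y), L(z) equals 1."""
    if len(x) != len(z) or len(x) != ceil(len(y) ** (1 / delta)):
        return 0
    votes = L_oracle(x) + SAT_oracle(y) + L_oracle(z)
    return 1 if votes >= 2 else 0
```

The majority structure is what lets the proof flip the roles of x and z later: whenever L(x)≠L(z), membership of the triple is decided entirely by SAT(y).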

Clearly S is in NP. Since A is NP-complete, there is a many-one reduction f from S to A. We will first show that at every length n there exist strings on which the reduction f must be length-increasing. Let

$$T_n = \{ \langle x, z\rangle \mid |x| = |z| = \lceil n^{1/\delta }\rceil , L(x) \neq L(z), \ \forall y\in\{0,1\}^{n} |f(\langle x, y, z\rangle)| >n \} .$$


Lemma 3.2

For all but finitely many n, T n ≠∅.

Assuming that the above lemma holds, we complete the proof of the theorem. Given a length n, fix an (arbitrary) ordering of tuples from \(T_n\). Let \(\langle x_{n}, z_{n}\rangle\) be the first tuple from \(T_n\). Consider the following reduction from SAT to A: given a string y of length n, the reduction outputs \(f(\langle x_{n}, y, z_{n}\rangle)\). Given \(x_n\) and \(z_n\) as advice, this reduction can be computed in polynomial time. Since \(|x_n|\) and \(|z_n|\) are polynomial in n, this is a P/poly reduction.

By the definition of T n , L(x n )≠L(z n ). Thus y∈SAT if and only if \(\langle x_{n}, y, z_{n}\rangle\in S\), and so y is in SAT if and only if \(f(\langle x_{n}, y, z_{n}\rangle)\) is in A. Again, by the definition of T n , for every y of length n, the length of \(f(\langle x_{n}, y, z_{n}\rangle)\) is bigger than n. Thus there is a P/poly-computable, length-increasing reduction from SAT to A. This, together with the proof of Lemma 3.2 we provide next, completes the proof of Theorem 3.1.  □

Proof of Lemma 3.2

Suppose \(T_n=\emptyset\) for infinitely many n. We will show that this yields a lengthwise 2-approximator for L at infinitely many lengths, which enables us to contradict the hardness of L. Consider the following algorithm:

  1. Input x, z with |x|=|z|=m.

  2. Find n such that \(\lceil n^{1/\delta}\rceil = m\). If no such n exists, then output 10.

  3. Find a y of length n such that \(|f(\langle x, y, z\rangle)| \leq n\).

  4. If no such y is found, output 10.

  5. If y is found, then decide the membership of \(f(\langle x, y, z\rangle)\) in A. If \(f(\langle x, y, z\rangle) \in A\), then output 00, else output 11.
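The five steps can be sketched as follows. The reduction `f` and the decision procedure `decide_A` are hypothetical callables passed in as parameters, and the tuple ⟨x, y, z⟩ is flattened into three arguments; this is an illustrative sketch of the proof's algorithm, not an implementation from the paper:

```python
from math import ceil

def approximator(x: str, z: str, f, decide_A, delta: float) -> str:
    """Output a two-bit string; when T_n is empty, the output differs
    from L(x)L(z) (this is the point of the proof of Lemma 3.2)."""
    m = len(x)
    # Step 2: find n with ceil(n^(1/delta)) = m (n <= m since 1/delta >= 1).
    n = next((k for k in range(1, m + 1)
              if ceil(k ** (1 / delta)) == m), None)
    if n is None:
        return "10"
    # Step 3: search all y of length n for one with a short image under f.
    for i in range(2 ** n):
        y = format(i, f"0{n}b")
        if len(f(x, y, z)) <= n:
            # Step 5: decide membership of the short image in A.
            return "00" if decide_A(f(x, y, z)) else "11"
    # Step 4: no such y exists.
    return "10"
```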

We first bound the running time of the algorithm. Step 3 takes \(O(2^{m^{\delta}})\) time. In Step 5, we decide the membership of \(f(\langle x, y, z\rangle)\) in A. This step is reached only if the length of \(f(\langle x, y, z\rangle)\) is at most n. Thus the time taken for this step is at most \((2^{n})^{k} \leq2^{m^{\epsilon/2}}\). Thus the total time taken by the algorithm is bounded by \(2^{m^{\epsilon}/2}\).

Consider a length n at which \(T_n=\emptyset\). Let x and z be any two strings of length \(m=\lceil n^{1/\delta}\rceil\). Suppose that for every y of length n, the length of \(f(\langle x, y, z\rangle)\) is greater than n. Then it must be the case that L(x)=L(z), since otherwise the tuple \(\langle x, z\rangle\) would belong to \(T_n\). Thus if the above algorithm fails to find y in Step 3, then L(x)L(z)≠10.

Suppose the algorithm succeeds in finding a y in Step 3. If \(f(\langle x, y, z \rangle) \in A\), then at least one of x or z must belong to L. Thus L(x)L(z)≠00. Similarly, if \(f(\langle x, y, z\rangle) \notin A\), then at least one of x or z does not belong to L, and so L(x)L(z)≠11.

Thus L is 2-approximable at length \(m=\lceil n^{1/\delta}\rceil\). If there exist infinitely many lengths n at which \(T_n\) is empty, then L is infinitely-often, lengthwise, \(2^{m^{\epsilon}/2}\)-time approximable. By Theorem 2.1, L has circuits of size \(2^{m^{\epsilon}}\) at infinitely many lengths, contradicting the hardness of L. □

We will now describe how to modify the proof if we assume that Hypothesis 1 holds. Let L be the hard language guaranteed by the hypothesis. This means that there exists an ϵ>0 such that L is not in \(\operatorname{io}\mathrm{DTIME}(2^{n^{\epsilon}})\). Fix an encoding of 3CNF formulas such that formulas with the same number of variables are encoded as strings of the same length. We will work with fold(3-SAT).

Trivially, fold(3-SAT) is also NP-complete, and there are a constant r≥1 and a reduction f from L to fold(3-SAT) such that all strings of length n are mapped to strings of length \(n^r\). Let

$$H = \mathit{fold}(3\mbox{-}\mathrm{SAT}) \cap\bigcup_n \Sigma^{n^r}.$$

The following observation is easy to prove.

Observation 3.3

If there is an algorithm \(\mathcal{A}\) that decides H such that for infinitely many n the algorithm \(\mathcal{A}\) runs in \(2^{n^{\epsilon}}\) time on all strings of length \(n^r\), then L is in \(\operatorname{io}\mathrm{DTIME}(2^{n^{\epsilon}})\).

Now H plays the role of L from the previous theorem and the proof proceeds as before, except that we use H instead of L. Our intermediate language will be

$$S = \{ \langle x, y, z\rangle \mid |x| = |z| = (\lceil|y|^{1/\delta }\rceil)^r,\mbox{ and }\mbox{\texttt{MAJ}}[H(x), \mathrm{SAT}(y), H(z)] = 1 \} .$$

Clearly S is in NP and since A is NP-complete there is a reduction g from S to A. Consider the set T n as before.

$$T_n = \{ \langle x, z\rangle \mid H(x) \neq H(z), |x| = |z| = (\lceil n^{1/\delta}\rceil)^r, \forall y \in\Sigma^n |g(\langle x, y,z\rangle)|>n \} .$$

We can show the following.

Lemma 3.4

For all but finitely many n, T n ≠∅.

As before, we can show that if \(T_n\) is empty for infinitely many n, then H is infinitely-often, lengthwise, \(2^{n^{\epsilon/2}}\)-time approximable. More precisely, we can show that for every length n at which \(T_n\) is empty, H is 2-approximable in time \(2^{n^{\epsilon/2}}\) at length \(n^r\). Since 3-SAT is disjunctively self-reducible and H coincides with fold(3-SAT) at lengths of the form \(n^r\), it follows from Theorem 2.2 that H can be decided in \(2^{n^{\epsilon}}\) time at infinitely many lengths of the form \(n^r\). By Observation 3.3 this implies that L is in \(\operatorname{io}\mathrm{DTIME}(2^{n^{\epsilon}})\), a contradiction.

If T n is not empty at all but finitely many lengths, then there is a P/poly, length-increasing reduction from SAT to A. Thus we have the following theorem.

Theorem 3.5

If there is a language in NP that is not in \(\operatorname {io}\mathrm{DTIME}(2^{n^{\epsilon}})\), then all NP-complete sets are complete via length-increasing P/poly reductions.

3.2 If co-NP is Hard for Nondeterministic Circuits

In this subsection we show that Hypothesis 2 also implies that all NP-complete sets are complete via length-increasing reductions.

Theorem 3.6

If there is a language in co-NP that is not in io NP/poly, then NP-complete sets are complete via P/poly-computable, length-increasing reductions.

Proof

We find it convenient to work with co-NP rather than NP. We will show that all co-NP-complete languages are complete via P/poly, length-increasing reductions.

Let H′ be a language in co-NP that is not in io NP/poly. Let H be

$$\{\langle x_1, \ldots, x_n\rangle\mid\forall i~1 \leq i \leq n, [x_i\in H'\mbox{ and } |x_i| = n]\}.$$

Note that every n-tuple that may potentially belong to H can be encoded by a string of length \(n^2\). From now on we view a string of length \(n^2\) as an n-tuple of n-bit strings.

Let \(S = 0H' \cup1\overline{\mathrm{SAT}}\). Observe that S is in co-NP and S is not in io NP/poly. Since \(1\overline {\mathrm {SAT}}\) is many-one complete for co-NP via length-increasing reductions, S is also co-NP-complete via length-increasing reductions. Let A be any co-NP-complete language. It suffices to exhibit a length-increasing reduction from S to A.

Consider the following intermediate language:

$$L = \{\langle x, y, z\rangle~|~|x| = |z| = |y|^2, \mbox{\texttt{MAJ}}[x \in H, y \in S, z \in H] = 1\}.$$

Clearly the above language is in co-NP. Let f be a many-one reduction from L to A. As before, we will first show that at every length n there exist strings x and z such that for every \(y \in S^{=n}\), the length of \(f(\langle x, y, z\rangle)\) is greater than n.

Lemma 3.7

For all but finitely many n, there exist two strings \(x_n\) and \(z_n\) of length \(n^2\) with \(H(x_n) \neq H(z_n)\) such that for every \(y \in S^{=n}\), \(|f(\langle x_{n}, y, z_{n}\rangle)| > n\).

Proof

Suppose not. Then there exist infinitely many lengths n at which for every pair of strings x and z (of length \(n^2\)) with H(x)≠H(z), there exists a y of length n such that y∈S and \(|f(\langle x,y,z\rangle)|\leq n\).

From this we obtain an NP/poly reduction from H to A such that for infinitely many n, on every x of length \(n^2\), the reduction outputs strings of length at most n. By Theorem 2.3, this implies that H′ is in io NP/poly. We now describe the reduction. Given n, let \(z_n\) be a string (of length \(n^2\)) that is not in H.

  1. Input x, \(|x| = n^2\). Advice: \(z_n\).

  2. Guess a string y of length n.

  3. If \(|f(\langle x, y, z_{n}\rangle)| > n\), then output ⊥.

  4. Output \(f(\langle x, y, z_{n}\rangle)\).
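On toy inputs, this nondeterministic reduction can be simulated by enumerating every guess y as a separate computation path. In the sketch below, the many-one reduction `f` is a hypothetical callable (with the triple flattened into three arguments), `None` stands for ⊥, and x is a length-n² encoding of an n-tuple as above:

```python
def reduction_paths(x: str, z_n: str, f):
    """Return the output of every computation path: f(<x, y, z_n>) when
    that image has length at most n, and None (standing for ⊥) otherwise."""
    n = round(len(x) ** 0.5)          # |x| = n^2 encodes an n-tuple
    outputs = []
    for i in range(2 ** n):           # one path per nondeterministic guess y
        y = format(i, f"0{n}b")
        w = f(x, y, z_n)
        outputs.append(w if len(w) <= n else None)
    return outputs
```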

Suppose x∈H. Since \(z_n \notin H\), we have \(H(x) \neq H(z_n)\), so there exists a string y of length n such that y∈S and \(|f(\langle x, y, z_{n}\rangle )| \leq n\). Consider a path that correctly guesses such a y. Since x∈H and y∈S, \(\langle x, y, z_{n}\rangle\in L\). Thus \(f(\langle x, y, z_{n}\rangle) \in A^{\leq n}\). Thus there exists at least one path on which the reduction outputs a string from \(A^{\leq n}\). Now consider the case x∉H. On any path, the reduction either outputs ⊥ or outputs \(f(\langle x, y, z_{n}\rangle)\). Since both \(z_n\) and x are not in H, \(\langle x, y, z_{n}\rangle \notin L\). Thus \(f(\langle x, y, z_{n}\rangle) \notin A\) for any y.

Thus there is an NP/poly many-one reduction from H to A such that for infinitely many n, the output of the reduction on strings of length \(n^2\), on any path, has length at most n. By Theorem 2.3, this places H′ in io NP/poly, a contradiction.

Thus for all but finitely many lengths n, there exist strings \(x_n\) and \(z_n\) of length \(n^2\) with \(H(x_n) \neq H(z_n)\) such that for every \(y \in S^{=n}\), the length of \(f(\langle x_{n}, y, z_{n}\rangle)\) is greater than n. □

This suggests the following reduction h from S to A. The reduction will have \(x_n\) and \(z_n\) as advice. Given a string y of length n, the reduction outputs \(f(\langle x_{n}, y, z_{n}\rangle)\). This reduction is clearly P/poly-computable and is length-increasing on every string from S. Thus we have the following lemma.

Lemma 3.8

For the above reduction h from S to A, |h(y)|>|y| for all y∈S.

Now we show how to obtain a length-increasing reduction on all strings. We make the following crucial observation.

Observation 3.9

For all but finitely many n, there is a string \(y_n\) of length n such that \(y_n \notin S\) and \(|f(\langle x_{n}, y_{n}, z_{n}\rangle)| > n\).

Proof

Suppose not. This means that for infinitely many n, for every y from \(\overline{S} \cap\Sigma^{n}\), the length of \(f(\langle x_{n}, y,z_{n}\rangle)\) is at most n. Now consider the following algorithm that solves S. Given a string y of length n, compute \(f(\langle x_{n}, y, z_{n}\rangle )\). If the length of \(f(\langle x_{n}, y, z_{n}\rangle)\) is greater than n, then accept y; else reject y.

The above algorithm can be implemented in P/poly given \(x_n\) and \(z_n\) as advice. If y∈S, then we know that the length of \(f(\langle x_{n}, y,z_{n}\rangle)\) is greater than n, and so the above algorithm accepts. If y∉S, then by our assumption, the length of \(f(\langle x_{n}, y,z_{n}\rangle)\) is at most n. In this case the algorithm rejects y. This shows that S is in io P/poly, which in turn implies that H′ is in io P/poly. This is a contradiction. □

Now we are ready to describe our length-increasing reduction from S to A. At length n, this reduction will have \(x_n\), \(y_n\), and \(z_n\) as advice. Given a string y of length n, the reduction outputs \(f(\langle x_{n}, y,z_{n}\rangle)\) if the length of \(f(\langle x_{n}, y, z_{n}\rangle)\) is more than n. Else, the reduction outputs \(f(\langle x_{n}, y_{n}, z_{n}\rangle)\).

Since \(H(x_n) \neq H(z_n)\), y∈S if and only if \(f(\langle x_{n}, y,z_{n}\rangle) \in A\). Thus the reduction is correct when it outputs \(f(\langle x_{n}, y, z_{n}\rangle)\). The reduction outputs \(f(\langle x_{n}, y_{n},z_{n}\rangle)\) only when the length of \(f(\langle x_{n}, y, z_{n}\rangle)\) is at most n. We know that in this case y∉S. Since \(y_n \notin S\), \(f(\langle x_{n},y_{n}, z_{n}\rangle) \notin A\).
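The final reduction is short enough to sketch directly; `f` is again a hypothetical callable standing in for the many-one reduction from L to A, and the advice strings are passed in explicitly:

```python
def li_reduction(y: str, x_n: str, y_n: str, z_n: str, f):
    """Output f(<x_n, y, z_n>) when that image is longer than |y|;
    otherwise fall back to the fixed long image f(<x_n, y_n, z_n>),
    which is a non-member of A because y_n is not in S."""
    n = len(y)
    w = f(x_n, y, z_n)
    return w if len(w) > n else f(x_n, y_n, z_n)
```

Both branches output a string longer than |y|, so the combined reduction is length-increasing on all inputs, not just on members of S.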

Thus we have a P/poly-computable, length-increasing reduction from S to A. Thus all co-NP-complete languages are complete via P/poly, length-increasing reductions. This immediately implies that all NP-complete languages are complete via P/poly-computable, length-increasing reductions.  □

4 Separation of Completeness Notions

In this section we consider the question of whether Turing completeness differs from many-one completeness for NP under two plausible complexity-theoretic hypotheses:

  1. There exists a \(2^{n^{\epsilon}}\)-secure one-way permutation.

  2. \(\mathrm{NEEE}\cap\mathrm{coNEEE}\not\subseteq\mathrm {EEE}/\log\).

It turns out that the first hypothesis implies that every many-one complete language for NP is complete under a particular kind of length-increasing reduction, while the second hypothesis provides us with a specific Turing complete language that is not complete under the same kind of length-increasing reductions. Therefore, the two hypotheses together separate the notions of many-one and Turing completeness for NP.

Theorem 4.1

If both of the above hypotheses are true, there is a language that is polynomial-time Turing complete for NP but not polynomial-time many-one complete for NP.

Theorem 4.1 is immediate from Lemma 4.2 and Lemma 4.3 below.

Lemma 4.2

Suppose \(2^{n^{\epsilon}}\)-secure one-way permutations exist. Then for every NP-complete language A and every B∈NP, there is a quasipolynomial-time computable, polynomially-bounded, length-increasing reduction from B to A.

A function f is polynomially-bounded if there is a polynomial p such that the length of f(x) is at most p(|x|) for every x.

Lemma 4.3

If NEEE∩coNEEE⊈EEE/log, then there is a polynomial-time Turing complete set for NP that is not many-one complete via quasipolynomial-time computable, polynomially-bounded, length-increasing reductions.

To show Lemma 4.2 we use results and ideas from Agrawal [3].

Let γ>0 and let S⊆{0,1}n. A function g is γ-sparsely many-one on S if for every xS,

$$|g^{-1}(g(x))\cap\{0,1\}^n|\leq\frac{2^n}{2^{n^\gamma}}.$$
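On a toy domain, this sparsity condition can be verified by brute force. The checker below is our own illustration (not from the paper), with `g` any function on n-bit strings:

```python
def gamma_sparse_on(g, n: int, gamma: float, S) -> bool:
    """Check: every x in S has at most 2^n / 2^(n^gamma) strings u of
    length n with g(u) = g(x)."""
    universe = [format(i, f"0{n}b") for i in range(2 ** n)]
    bound = 2 ** n / 2 ** (n ** gamma)
    return all(sum(1 for u in universe if g(u) == g(x)) <= bound
               for x in S)

# An injective g is gamma-sparse whenever the bound is at least 1;
# a constant g is not (for nonempty S and bound below 2^n).
assert gamma_sparse_on(lambda u: u, 4, 0.5, ["0000", "1111"])
assert not gamma_sparse_on(lambda u: "c", 4, 0.5, ["0000"])
```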

Lemma 4.4

(Agrawal [3])

Suppose \(2^{n^{\epsilon}}\)-secure one-way permutations exist. For every NP-complete language L and every set S in NP, there is a reduction f from S to L that is \(\frac{\epsilon}{2}\)-sparsely many-one on {0,1}n for all n∈ℕ.

Proof of Lemma 4.2

Let δ=ϵ/3. Consider the following set

$$S = \{ \langle x,y \rangle \mid x\in B\text{ and }|y| = |x|^{1/\delta } \} .$$

For |x|=n, let \(m = |\langle x,y \rangle|\). By Lemma 4.4, there is a reduction f from S to A that is \(\frac{\epsilon}{2}\)-sparsely many-one on Σm for every m. For any x of length n, let \(S_{x} = \{y \in\Sigma^{n^{1/\delta}}~|~ |f(\langle x, y\rangle)|\leq n\}\). It follows that the size of S x is at most \(\frac{2^{n+m+1}}{2^{m^{\epsilon/2}}}\). Thus for at least \(1-2^{-2n^{3/2}}\) fraction of strings y from \(\{0,1\}^{n^{1/\delta}}\), \(|f(\langle x,y \rangle)|>n\). If we randomly pick a \(y \in \{0,1\}^{n^{1/\delta}}\), then with very high probability \(|f(\langle x,y \rangle)|>n\).

We can derandomize the above process. If \(2^{n^{\epsilon}}\)-secure one-way permutations exist, then EXP does not have \(2^{n^{\epsilon}}\)-size circuits. Thus by Theorem 2.4, there exists a pseudorandom generator family \(\{G_n\}\) that maps strings of length \(\log^k n\) to strings of length n. These pseudorandom generators are computable in \(O(2^{\log^{d} n})\) time for some constant d.

Now the length-increasing reduction from B to A works as follows. Given an input x of length n, let \(t=n^{1/\delta}\). Cycle through all seeds s of length \(\log^{k} t\) until the length of \(f(\langle x,G_{t}(s)\rangle)\) is bigger than n, and output \(f(\langle x,G_{t}(s) \rangle)\). Since \(G_{t}\) is a pseudorandom generator, there exists at least one seed s for which the length of \(f(\langle x,G_{t}(s) \rangle)\) is bigger than n.

Clearly, the above reduction can be computed in time quasipolynomial in n. The length of the output of the reduction is bounded by the length of \(f(\langle x,y \rangle)\), where \(|y|=n^{1/\delta}\). Since f is polynomial-time computable, the reduction is polynomially bounded. □
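For intuition, the seed-cycling loop in this proof can be sketched in code. Everything below is a toy stand-in: `f` plays the role of the sparsely many-one reduction of Lemma 4.4 (here an artificial function whose output length depends on y) and `G` plays the role of the pseudorandom generator \(G_{t}\) (here mere seed repetition); neither has the properties the proof actually requires.

```python
def G(seed, t):
    # Toy "generator": repeat the seed out to length t.  A real construction
    # would use the pseudorandom generators of Theorem 2.4.
    return (seed * (t // len(seed) + 1))[:t]

def f(x, y):
    # Toy stand-in for the reduction of Lemma 4.4: the output length is
    # determined by the first bits of y, so most y give a long output.
    out_len = int(y[:8], 2) % (2 * len(x) + 2)
    return "1" * out_len

def reduction(x, t, seed_len=8):
    """Cycle through all seeds s of length seed_len until the image of
    <x, G(s)> under f is longer than x, and output that image."""
    n = len(x)
    for s in range(2 ** seed_len):
        y = G(format(s, "0%db" % seed_len), t)
        z = f(x, y)
        if len(z) > n:          # found a seed witnessing length increase
            return z
    return None                 # cannot happen when G is a real PRG
```

For example, `reduction("1011", 16)` tries the seeds 00000000, 00000001, … in order and outputs the first image longer than 4 bits.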

The remainder of this section is devoted to proving Lemma 4.3. For this we first prove two auxiliary results. The first result establishes a connection between worst-case and average-case complexities for languages in NEEE∩coNEEE. The second result extends a well known equivalence between languages and their tally-encodings to the average-case world.

Several results are known that establish worst-case to average-case connections for classes such as EXP and PSPACE [9, 25, 26, 36, 38]. The following lemma establishes a similar connection for triple-exponential time classes. This lemma can be proved using classical hardness amplification results [9, 20, 25, 38] or using list-decoding techniques from error-correcting codes. (See [1] for a good review.) The technique is now fairly standard in the area of hardness amplification, and the situation is only made easier since we are concerned with deterministic and nondeterministic computation in time \(O(2^{2^{2^{O(n)}}})\). For the sake of completeness, we include a proof in the Appendix.

Lemma 4.5

If \(\mathrm{NEEE}\cap\mathrm{coNEEE}\not\subseteq\mathrm {EEE}/\log\), then there is a language L in NEEE∩coNEEE such that at infinitely many lengths n, no EEE/log algorithm can decide L on more than a \(\frac{1}{2}+\frac{1}{n}\) fraction of the strings from \(\{0,1\}^{n}\).

It is well known that any set A over Σ can be encoded as a tally set \(T_{A}\) such that A is worst-case hard for exponential time if and only if \(T_{A}\) is worst-case hard for polynomial time. For our purposes, we need an average-case version of this equivalence. Below we describe a particular encoding of languages by tally sets that is helpful for us, and prove the average-case equivalence.

Let \(t_{0}=2\) and \(t_{i+1} = t_{i}^{2}\) for all i∈ℕ. Observe that \(t_{i}= 2^{2^{i}}\). Let \(\mathcal{T}=\{ 0^{t_{i}} \mid i\in\mathbb{N} \} \). For each l∈ℕ, let \(\mathcal{T}_{l} = \{ 0^{t_{i}} \mid 2^{l}-1\leq i\leq 2^{l+1}-2 \} \). Observe that \(\mathcal{T}=\bigcup_{l=0}^{\infty}\mathcal{T}_{l}\). Given a set \(A\subseteq\{0,1\}^{*}\), let \(T_{A} = \{0^{2^{2^{r_{x}}}} \mid x\in A \} \), where \(r_{x}\) is the rank of x in the standard enumeration of \(\{0,1\}^{*}\). It is easy to verify that for all l∈ℕ and every x,

$$x\in A \cap\{0,1\}^l \quad{\iff} \quad0^{t_{r_x}}\in T_A\cap \mathcal{T}_l. $$
(4.1)
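The encoding can be made concrete. The following sketch is illustrative only (the tally strings \(0^{t_{i}}\) themselves are astronomically long, so we manipulate only indices): it computes \(t_{i}\), the rank \(r_{x}\), and checks that the strings of length l occupy exactly the block of indices belonging to \(\mathcal{T}_{l}\).

```python
def t(i):
    # t_0 = 2 and t_{i+1} = t_i^2, i.e. t_i = 2^(2^i)
    return 2 ** (2 ** i)

def rank(x):
    # Rank of the bit-string x in the standard (length-lexicographic)
    # enumeration of {0,1}*: rank("") = 0, rank("0") = 1, rank("1") = 2, ...
    return (1 << len(x)) - 1 + (int(x, 2) if x else 0)

def block_indices(l):
    # Indices i with 0^{t_i} in T_l:  2^l - 1 <= i <= 2^{l+1} - 2
    return list(range(2 ** l - 1, 2 ** (l + 1) - 1))
```

Thus x∈A∩{0,1}^l corresponds to the index rank(x), which lands in block_indices(l), matching (4.1).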

Lemma 4.6

Let A and \(T_{A}\) be as above. Suppose there is a quasipolynomial-time algorithm \(\mathcal{A}\) such that for every l, on an ϵ fraction of the strings from \(\mathcal{T}_{l}\), this algorithm correctly decides membership in \(T_{A}\), and on the rest of the strings the algorithm outputs “I do not know”. Then there is a \(2^{2^{2^{k(l+1)}}}\)-time algorithm \(\mathcal{A}'\), for some constant k, that takes one bit of advice and correctly decides membership in A on a \(\frac{1}{2}+\frac{\epsilon}{2}\) fraction of the strings at every length l.

Proof

Consider the following algorithm \(\mathcal{A}''\) for A: For each x, run the algorithm \(\mathcal{A}\) on \(0^{t_{r_{x}}}\) and output the same answer.

If the length of x is l, then \(|0^{t_{r_{x}}}|\leq2^{2^{2^{l+1}-2}}\). Thus the running time of \(\mathcal{A}''\) is at most \(2^{2^{2^{k(l+1)}}}\) for some constant k. Algorithm \(\mathcal{A}\) decides \(T_{A}\) correctly on an ϵ fraction of the strings from \(\mathcal{T}_{l}\). Thus by (4.1), the algorithm \(\mathcal{A}''\) correctly decides A on an ϵ fraction of the strings from \(\{0,1\}^{l}\), and says “I do not know” on the rest.

For algorithm \(\mathcal{A}'\), let \(S_{l}\) be the set of strings of length l on which the algorithm \(\mathcal{A}''\) says “I do not know”. If \(|S_{l}\cap A|>\frac{1}{2} |S_{l}|\), then set the advice bit to 1; otherwise set it to 0. The algorithm \(\mathcal{A}'\) simulates the algorithm \(\mathcal{A}''\), and outputs the advice bit whenever \(\mathcal{A}''\) says “I do not know”. The fraction of strings of length l on which \(\mathcal{A}'\) is correct is at least \(\frac{1-\epsilon}{2} + \epsilon=\frac{1}{2}+\frac{\epsilon}{2}\). □
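The advice-bit argument can be checked numerically. The sketch below is a toy simulation (the language and the “known” set are arbitrary choices, not the objects in the lemma): \(\mathcal{A}''\) answers correctly on an ϵ fraction of the strings of length l and says “I do not know” on the rest; \(\mathcal{A}'\) answers with the majority advice bit in the latter case.

```python
from itertools import product

l = 10
strings = ["".join(bits) for bits in product("01", repeat=l)]
A = {s for s in strings if s.count("1") % 3 == 0}   # toy language
known = set(strings[: len(strings) // 4])           # A'' is correct here, eps = 1/4
eps = len(known) / len(strings)

# Advice bit: does A occupy more than half of the "I do not know" strings?
dont_know = [s for s in strings if s not in known]
advice = sum(s in A for s in dont_know) > len(dont_know) / 2

def a_prime(s):
    # A': simulate A'' on the known strings, else output the advice bit.
    return (s in A) if s in known else advice

accuracy = sum(a_prime(s) == (s in A) for s in strings) / len(strings)
```

By construction the advice bit is correct on at least half of `dont_know`, so `accuracy >= 0.5 + eps / 2` always holds in this simulation, matching the bound in the lemma.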

Now we are ready to prove Lemma 4.3.

Proof of Lemma 4.3

By Lemma 4.5, there is a language L∈(NEEE∩coNEEE)−EEE/log such that no EEE/log algorithm can decide L correctly on more than a \(\frac{1}{2}+\frac{1}{n}\) fraction of the inputs for infinitely many lengths n.

Without loss of generality, we can assume that \(L\in\mathrm{NTIME}(2^{2^{2^{n}}})\cap \mathrm{co}\mathrm {NTIME}(2^{2^{2^{n}}})\). Let

$$T_L=\bigl\{0^{2^{2^{r_x}}}\big| x\in L\bigr\}.$$

Clearly, \(T_{L}\)∈NP∩coNP.

Recall that \(t_{0}=2\) and \(t_{i} = 2^{2^{i}}\). Define τ:ℕ→ℕ by \(\tau(n)=\max\{i \mid t_{i}\leq n\}\). Observe that τ(n) is at most log log n; thus for every n, \(t_{\tau(n)}\leq n\). Now we will define our Turing-complete language. Let

$$\mathrm{SAT}_0 = \bigl\{ 0x \mid x\in \mathrm{SAT}\ \text{and}\ 0^{t_{\tau(|x|)}}\notin T_L \bigr\} \quad\text{and}\quad \mathrm{SAT}_1 = \bigl\{ 1x \mid x\in \mathrm{SAT}\ \text{and}\ 0^{t_{\tau(|x|)}}\in T_L \bigr\}.$$

Let \(A=\mathrm{SAT}_0\cup\mathrm{SAT}_1\). Since \(T_{L}\) is in NP∩coNP and \(t_{\tau(|x|)}\) is at most |x|, A is in NP. The following is a Turing reduction from SAT to A: given a formula x, ask the queries 0x and 1x, and accept if and only if exactly one of them is in A. Thus A is polynomial-time 2-tt complete for NP.

Suppose A is complete via length-increasing, polynomially-bounded, quasipolynomial-time reductions. Then there is such a reduction f from \(\{0\}^{*}\) to A. Since f is polynomially bounded, there is a constant d such that for every x, the length of f(x) is less than \(|x|^{d}\).

The following observation is easy to prove.

Observation 4.7

Let \(y\in\{0,1\}^{*}\) and b∈{0,1} be such that \(f(0^{t_{i}}) = by\). Then \(0^{t_{\tau(|y|)}}\in T_{L}\) if and only if b=1.

Proof

Fix a length l. We will describe a quasipolynomial-time algorithm that decides membership in \(T_{L}\) on at least a \(\frac {1}{\log d}\) fraction of the strings from \(\mathcal{T}_{l}\), and says “I do not know” on the other strings. By Lemma 4.6, this implies that there is an EEE/1 algorithm that decides L on more than a \(\frac{1}{2} +\frac{1}{2\log d}\) fraction of the strings from \(\{0,1\}^{l}\). This contradicts the hardness of L and completes the proof.

Let \(s=2^{l}-1\) and \(r=2^{l+1}-2\). Recall that \(\mathcal{T}_{l} = \{ 0^{t_{i}} \mid s \leq i \leq r \} \). Partition \(\mathcal{T}_{l}\) into sets \(T_{0},T_{1},\ldots,T_{m}\), where

$$T_k = \{ 0^{t_i} \mid s+k\log d \leq i <s+(k+1)\log d\ \mbox{and}\ i \leq r \} .$$

This gives at least \(\frac{2^{l}}{\log d}\) sets. We make the following observation.

Observation 4.8

Fix k. If \(f(0^{t_{s+k\log d}}) = by\), then \(0^{t_{\tau(|y|)}} \in T_{k}\).

Proof

Recall that the length of \(f(0^{t_{s+k\log d}})\) is less than \(t_{s+k\log d}^{d}\). Since \(t_{i} = 2^{2^{i}}\), we have \(t_{i}^{d} = 2^{d\cdot 2^{i}} = 2^{2^{i+\log d}} = t_{i+\log d}\), and so \(t_{s+k\log d}^{d} \leq t_{s +(k+1)\log d}\). Since f is length-increasing, we have the following bounds on the length of y.

$$t_{s+k\log d} \leq|y| < t_{s+(k+1)\log d}.$$

By definition of τ, we have

$$s+k\log d \leq\tau(|y|) < s+(k+1) \log d.$$

By the definition of T k , we have \(0^{t_{\tau(|y|)}} \in T_{k}\). □

This suggests the following algorithm for \(T_{L}\): Let \(0^{t_{j}}\) be the input, and say it lies in \(T_{k}\). Compute \(f(0^{t_{s+k\log d}}) = by\). If \(t_{\tau(|y|)}\neq t_{j}\), then output “I do not know”. Otherwise, accept \(0^{t_{j}}\) if and only if b=1. By Observation 4.7 this algorithm never errs. Since f is computable in quasipolynomial time, this algorithm runs in quasipolynomial time.

By Observation 4.8, \(0^{t_{\tau(|y|)}}\) belongs to \(T_{k}\), and on this input the algorithm does not output “I do not know”. Thus there is at least one string in \(T_{k}\) on which this algorithm does not output “I do not know”.

Thus for every k, 0≤k≤m, there is at least one string from \(T_{k}\) on which the above algorithm correctly decides \(T_{L}\). Hence the above algorithm correctly decides \(T_{L}\) on at least a \(\frac{1}{\log d}\) fraction of the strings from \(\mathcal{T}_{l}\), and never errs. □
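Finally, the algorithm for \(T_{L}\) can be simulated on toy data. Since the strings \(0^{t_{i}}\) are astronomically long, the sketch below works only with their indices: the hypothetical function `g(i)` stands in for computing \(f(0^{t_{i}}) = by\) and reporting the pair \((b,\tau(|y|))\), arranged to be consistent with Observations 4.7 and 4.8, and the set `TL` is an arbitrary toy choice of \(T_{L}\).

```python
l, logd = 4, 2                        # block T_l, parts of size log d
s, r = 2 ** l - 1, 2 ** (l + 1) - 2   # T_l = { 0^{t_i} : s <= i <= r }

TL = {i for i in range(s, r + 1) if i % 3 == 0}   # toy choice of T_L (as indices)

def g(i):
    # Stand-in for computing f(0^{t_i}) = by: returns (b, tau(|y|)).
    # Consistent with Observation 4.8 (tau(|y|) lies in [i, i + log d))
    # and Observation 4.7 (b = 1 iff 0^{t_{tau(|y|)}} is in T_L).
    j = i + (i % logd)                # arbitrary toy target inside the window
    return (1 if j in TL else 0, j)

def decide(j):
    # The algorithm: a correct 0/1 answer for "0^{t_j} in T_L?", or "unknown".
    k = (j - s) // logd               # the part T_k containing 0^{t_j}
    b, jp = g(s + k * logd)
    return b if jp == j else "unknown"
```

In this simulation `decide` never errs, and every part \(T_{k}\) contains at least one index with a definite answer, giving the \(\frac{1}{\log d}\) fraction claimed above.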