The consequences of P = NP would be “potentially stunning”, according to Stephen Cook, in his Millennium Prize overview of the P versus NP problem.
Is this really true?
Richard Lipton has argued eloquently that we should consider the possibility that P may equal NP. In this spirit, I want to briefly discuss what P = NP would imply.
As is usually noted in a discussion of P = NP consequences, one cannot simply say “if P = NP then wondrous things happen”. What is needed before we can conclude anything “potentially stunning” is an additional hypothesis along the lines of “there is a practical algorithm for solving an NP-complete problem”, by which Cook appears to mean “we can solve some NP-complete problem in time for smallish
“. For instance, an algorithm that somehow exploited the monster group to solve an NP-complete problem, in
time, would probably not have practical applications. So the additional condition is important.
Yet, even a low-degree polynomial bound for an algorithm for an NP-complete problem would not be enough for any special implications for most problems in P. If an algorithm existed to decide TSP in time , then it might be possible to improve the way courier companies allocated parcels to drivers. At the same time, factoring might require
time, but an algorithm achieving this bound would mean little for cryptography.
Many popular accounts of P versus NP seem to be based on a belief that a fast algorithm for some NP-complete problem would let us solve any problem in NP quickly. But this is not true.
Denote by some problem in P which is in
but not in
. The Time Hierarchy Theorem guarantees the existence of
for each positive
This is true regardless of the status of P versus NP.
So there are problems in P that are not “easy”, even if SAT could be solved in linear time with a small constant. If a problem is not in then for any deterministic Turing machine deciding the problem, and any constant
infinitely many instances of the problem require the machine to take more than
steps. I think it is reasonable to say that in this case, the problem is not “easy”.
So far the consequences of P = NP do not seem all that impressive, but there is one more thing worth noting.
Many of the known reductions from SAT to NP-complete problems are quite efficient. This means that if one of these NP-complete problems could be solved by an algorithm that requires at most time, then most NP-complete problems would also have time complexity “close” to
. So if P = NP then the asymptotically hardest problems in P (and NP) would be the
problems. These are not necessarily NP-complete, and they certainly would not look like most of the usual NP-complete problems.
So we need to add another condition to the list: if P = NP, and there is a fast algorithm for some NP-complete problem, and the hard problems in P are of no practical interest, then some potentially stunning things would be true.
I think this third hypothesis is quite interesting.
Reductions to NP-complete problems are all via SAT. The universal reduction in the Cook-Levin Theorem turns an instance of a problem in NP into an instance of SAT. If the instance of
is of size
, and the polynomial upper bound on the time required for a multi-tape non-deterministic Turing machine to decide
is
, then the size of the SAT instance is
(see Stephen Cook’s proof of this).
This means that the key issue (in a P = NP world) is how efficiently a language can be recognized by an NDTM. Most of the commonly studied problems in NP have certificates that can be verified in linear time. So these problems are close to SAT, and are not going to be the languages.
The important question is then:
Question: In NP, which are the languages with hard-to-verify certificates?
If P is not equal to NP, then there are problems in NP that are asymptotically harder than any problem in P. So the efficiency of reductions to SAT is not as important in this case. But even independently of P versus NP, it would be worth looking for natural candidates for problems for each
, as well as problems
that might require
time to solve on an NDTM. Proving non-trivial lower bounds is hard, but at least we could begin with some candidate problems!


Rossman’s clique result
Tags: papers
This is about three of the major results Benjamin Rossman has published to date. The first one was showing that existential positive sentences are preserved by homomorphisms, over finite structures. The second was work with Dawar and Richerby, showing that Choiceless Polynomial Time is enough to separate P from the class expressible by IFP with counting.
I here discuss the third result, from STOC 2008. This gives a superpolynomial lower bound on the size of constant-depth circuits that decide CLIQUE.
Let
be a constant. Rossman’s paper shows that for every
, no constant-depth circuits of size
distinguish between the graphs that contain an
-clique and those that contain no
-clique. This gives an
lower bound on the size of constant-depth circuits for deciding
-CLIQUE on
-vertex graphs.
The corollary is that the bounded variable fragments of first-order logic over finite ordered graphs form a strict hierarchy. It was previously not known if even
was less expressive than
, on finite ordered graphs.
A logic over ordered structures has the built-in relation symbol
between elements; the paper works with such logics.
Finite ordered graphs are important because for actual computation, we typically order the vertices, then compute over the resulting ordered structure. So there are many possible realisations of the same underlying unordered graph, one for each permutation of the vertices. The ordering of the vertices carries additional information, and this often does have a big impact in implementations. It then seems a priori not out the question that the 3-variable fragment of
over finite ordered graphs could be as expressive as
.
Note that the class of graphs can be turned into a set by taking each canonical
-vertex graph in the equivalence class up to isomorphism as the one defined over vertices
.
Now for the technical ingredient.
By “size-
graph property” I mean a function from the set of
-vertex graphs to
. Examples of size-
graph properties are “is a connected graph”, “contains a
-clique”, and “can be turned into a Hamiltonian cycle by adding or deleting at most
edges”.
Let
be a sequence of graph properties which can be represented as
for constant
. Let
(a positive integer) and
(positive but less than
) be constants. For every
, let
be an Erdős-Rényi random graph with
vertices and edge probability
, let
be a uniform random set of
vertices of
(i.e. chosen uniformly from the
-element subsets of
), and let
be the graph formed by adding all edges between the vertices in
to
.
constant depth circuits of size
Then
asymptotically almost surely.
My paraphrase: for almost all graphs from the “right” range of edge densities, constant depth circuits of at most polynomial size cannot distinguish between general instances and those guaranteed to have a
-clique.
Note that Lemma 2.1 ensures that when the edge density is in the “right” range, then there will tend to be few cliques. Hence forcing a
-clique will in the limit be a non-null operation.
Note that “has a
-clique” is a monadic second-order property, but zero-one laws do not hold for many quite weak fragments of MSO. Also, as noted by Matula in 1972, the size of the largest clique in
is almost surely
if
is not fixed but instead varies so that
is constant. As
grows, while
is fixed,
is almost surely going to have a
-clique with this model. Hence the choice of the random graph model is quite important.
The remark just before Lemma 2.1 mentions that the expectation of the
-cliques in
is greater than 1. So in many cases there will already be a
-clique in
before another one is added. Much of the paper, including the machinery on
-bounded clique-sensitive
-clique should not necessarily be expected to distinguish between a graph that quite likely already has a
-clique, from a graph that has had one added on.
number of
cores (which is explained further in a note), then seems devoted to addressing this difficulty. I have not taken the time to fully grasp how the technical lemmas work together to resolve the niggling doubt one may have at this point, that a circuit designed for deciding the presence of a
In the meantime, I look forward to a journal version with (hopefully) a simpler proof.
Quite recently Rossman has written a paper which extends this result from constant-depth to monotone circuits. This is the first average-case lower bound result for this framework. An upper bound is also presented, based on Amano’s upper bound for constant depth circuits (submitted version).