Approximately counting independent sets in bipartite graphs via graph containers

Matthew Jenssen School of Mathematics
University of Birmingham , Will Perkins Department of Mathematics, Statistics, and Computer Science
University of Illinois at Chicago and Aditya Potukuchi m.jenssen@bham.ac.uk
math@willperkins.org
adityap@uic.edu

Abstract.

By implementing algorithmic versions of Sapozhenko’s graph container methods, we give new algorithms for approximating the number of independent sets in bipartite graphs. Our first algorithm applies to $d$ -regular, bipartite graphs satisfying a weak expansion condition: when $d$ is constant, and the graph is a bipartite $\Omega(\log^{2}d/d)$ -expander, we obtain an FPTAS for the number of independent sets. Previously such a result for $d>5$ was known only for graphs satisfying the much stronger expansion conditions of random bipartite graphs. The algorithm also applies to weighted independent sets: for a $d$ -regular, bipartite $\alpha$ -expander, with $\alpha>0$ fixed, we give an FPTAS for the hard-core model partition function at fugacity $\lambda=\Omega(\log d/d^{1/4})$ . Finally we present an algorithm that applies to all $d$ -regular, bipartite graphs, runs in time $\exp\left(O\left(n\cdot\frac{\log^{3}d}{d}\right)\right)$ , and outputs a $(1+o(1))$ -approximation to the number of independent sets.

1. Introduction

Let $\mathcal{I}(G)$ denote the set of independent sets of a graph $G$ and $i(G)=|\mathcal{I}(G)|$ denote the number of independent sets of $G$ . Computing $i(G)$ is a #P-hard problem, even when restricted to bounded degree, bipartite graphs [47]. Even approximating $i(G)$ (to a constant or even subexponential factor) remains NP-hard, even when restricted to $d$ -regular graphs with $d\geq 6$ [53].

Intuitively, one might expect the problem of approximating $i(G)$ to be easier on the class of bipartite graphs; for one, there is a polynomial-time algorithm to find a maximum-size independent set in a bipartite graph while the corresponding problem is NP-hard for general graphs. Dyer, Goldberg, Greenhill, and Jerrum [11] defined the counting problem #BIS (bipartite independent set) and showed that several natural combinatorial counting problems are as hard to approximate as #BIS. These problems include counting stable matchings, approximating the ferromagnetic Potts model partition function ( $q\geq 3$ ) [24, 15], counting the number of $q$ -colorings in a bipartite graph ( $q\geq 3$ ), and approximating the ferromagnetic Ising model partition with non-uniform external fields [23].

The search for approximation algorithms for $i(G)$ that exploit bipartite structure generally falls into two categories. The first approach finds classes of graphs on which polynomial-time approximation algorithms exist. Liu and Lu [41] gave the first such algorithm, providing an FPTAS for the number of independent sets in bipartite graphs in which degrees on one side are bounded by $5$ , while degrees on the other side are unrestricted. Another line of work in this direction includes [31] in which approximation algorithms are given for the hard-core partition function $Z_{G}(\lambda)$ (counting weighted independent sets) in bounded-degree, bipartite expander graphs, based on two tools from statistical physics: polymer models and the cluster expansion (following [28]). This work was followed by several improvements, extensions, and generalizations including [40, 5, 7, 13, 12, 14, 8]. All of these algorithms are ‘low-temperature’ algorithms: they exploit the fact that on a bipartite graph with sufficient expansion, most (weighted) independent sets have few vertices from one side of the bipartition; that is, they are close to one of the two ground states consisting of all subsets of one side. In contrast, the algorithms of Weitz [55] (an FPTAS for $i(G)$ in graphs of maximum degree at most $5$ ) and Liu and Lu [41] are ‘high-temperature’ algorithms, exploiting correlation decay properties of the uniform distribution on $\mathcal{I}(G)$ (or more generally the hard-core model of weighted independent sets) for small vertex degrees.

The second category of bipartite approximation algorithms are those that apply to all bipartite graphs, with running times better than what is known for general graphs. The main result in this direction is the algorithm of Goldberg, Lapinskas, and Richerby [25] which provides an $\epsilon$ -relative approximation to $i(G)$ for bipartite graphs, running in time $O(2^{.2372n}(1/\epsilon)^{O(1)})$ , beating the best known running time for general graphs of $O(2^{.268n}(1/\epsilon)^{O(1)})$ from the same paper (in comparison, the best known running time for exact counting algorithms for general graphs is $O(2^{.3022n})$ [21]).

1.1. Main results

In this paper we use tools from combinatorics, namely the graph container method of Sapozhenko to give new approximate counting algorithms for independent sets in bipartite graphs. Our first result is an FTPAS for $i(G)$ for weakly expanding, $d$ -regular bipartite graphs, for constant $d$ .

For $z>0$ , we say that $\hat{z}$ is an $\epsilon$ -relative approximation to $z$ if $(1-\epsilon)\leq z/\hat{z}\leq(1+\epsilon)$ . A fully polynomial-time approximation scheme (FPTAS) is an algorithm that for every $\epsilon>0$ outputs an $\epsilon$ -relative approximation to $i(G)$ and runs in time polynomial in $|V(G)|$ and $1/\epsilon$ . We let $\mu_{G}$ denote the uniform distribution on $\mathcal{I}(G)$ . A polynomial-time sampling scheme for $\mu_{G}$ runs in time polynomial in $|V(G)|$ and $1/\epsilon$ and outputs an independent set with distribution $\hat{\mu}$ within $\epsilon$ total variation distance of $\mu_{G}$ .

For $\alpha>0$ , we say that a $d$ -regular bipartite graph $G$ with bipartition $X, Y$ is a bipartite $\alpha$ -expander if for every $A\subseteq X$ and $A\subseteq Y$ of size at most $|X|/2$ , we have

(1)

\displaystyle|N(A)|\geq(1+\alpha)|A|.

Theorem 1.

There exists a constant $C_{1}>0$ so that for $d$ fixed and sufficiently large, and $\alpha=\frac{C_{1}\log^{2}d}{d}$ , there is an FPTAS for $i(G)$ and a polynomial-time sampling scheme for $\mu_{G}$ for the class of $d$ -regular, bipartite $\alpha$ -expander graphs.

Previous results on expander graphs include an FPTAS for $i(G)$ in the case that $G$ is a typical $d$ -regular, random bipartite graph [31, 40, 8]. These algorithms exploit the very strong expansion conditions satisfied by a random graph: sets of size $\tilde{O}(n/d)$ on each side of the bipartition expand by a factor $\tilde{\Omega}(d)$ .

A natural and powerful generalization of the notion of counting independent sets is to consider weighted independent sets in the form of the hard-core model partition function (also known as the independence polynomial)

Z_{G}(\lambda)=\sum_{I\in\mathcal{I}(G)}\lambda^{|I|}\,.

The corresponding probability measure on independent sets, known as the hard-core model, is given by

\mu_{G,\lambda}(I)=\frac{\lambda^{|I|}}{Z_{G}(\lambda)}\,.

Taking $\lambda=1$ gives $i(G)$ and $\mu_{G}$ . In previous works, an FPTAS for $Z_{G}(\lambda)$ for bounded-degree, bipartite $\alpha$ -expanders was obtained for $\lambda$ much larger than $1$ , specifically $\lambda\geq K\Delta^{c/\alpha}$ for constants $c,K>1$ [31, 7, 12]. In particular, under the expansion conditions of Theorem 1, these algorithms require $\lambda\geq\Delta^{\tilde{\Omega}(\Delta)}$ .

Our next result adapts the algorithm of Theorem 1 to work for bipartite $\alpha$ -expanders for $\lambda$ much smaller than $1$ .

Theorem 2.

For every $\alpha>0$ there exists a constant $C_{2}>0$ so that for $d\geq 3$ and $\lambda>\frac{C_{2}\log d}{d^{1/4}}$ there is an FPTAS for $Z_{G}(\lambda)$ and a polynomial-time sampling scheme for $\mu_{G,\lambda}$ for the class of $d$ -regular, bipartite $\alpha$ -expanders.

In fact in proving Theorems 1 and 2 we can interpolate between the two cases, letting the lower bound on $\lambda$ shrink as the expansion condition gets stronger. The more general condition obtained is essentially the same as the condition for slow mixing of Glauber dynamics given by Galvin and Tetali [20].

Our next result is an approximation algorithm for $i(G)$ for all (not necessarily expanding) $d$ -regular bipartite graphs $G$ , where $d$ may either be constant or growing with the size of the graph, $n$ . When $d\to\infty$ as $n\to\infty$ , the algorithm runs in subexponential time. This algorithm estimates $i(G)$ by separating the contribution from non-expanding sets and expanding sets on each side of the bipartition and uses the expander algorithm of Theorem 1 as a subroutine.

Theorem 3.

For every $c>0$ , there is a randomized algorithm that given a $d$ -regular, $n$ -vertex bipartite graph $G$ outputs an $n^{-c}$ -relative approximation to $i(G)$ with probability at least $2/3$ and runs in time

\exp\left(O\left(\frac{n\log^{3}d}{d}\right)\right)\,.

Note that while the algorithm of Theorem 3 applies more generally than that of Theorem 1, the algorithmic guarantees are weaker in several senses (in addition to the slower running time): the algorithm uses randomness and the accuracy is limited to being polynomially small in $n$ . Moreover we do not provide a corresponding sampling algorithm for Theorem 3; the problem is not self-reducible for regular graphs, and so there is not a direct reduction of approximate sampling to counting. We can overcome this in Theorem 1, however, using the self-reducibility of polymer models.

1.2. Background

The study of independent sets plays a central role in combinatorics since a broad range of problems can be phrased in terms of independent sets in graphs (and more generally hypergraphs). The container method is one of the most powerful combinatorial tools for studying independent sets. At a high level, the container method exploits a clustering phenomenon exhibited by independent sets which can often be used to deduce useful structural information for typical independent sets in a given graph or hypergraph. For graphs, the method was developed in the early 1980’s by Kleitman and Winston [35, 36] and was independently discovered by Sapozhenko who used the method to enumerate independent sets in regular graphs [49, 50]. See the survey of Samotij [48] for background and examples. The full potential of the container method only recently became apparent with the powerful generalization of the method to the context of hypergraphs developed by Saxton and Thomason [52] and Balogh, Morris and Samotij [4]. These developments have made the container method one of the most influential tools in modern combinatorics.

In this paper, we will only need ideas from the theory of graph containers, and our treatment is most closely related to that of Sapozhenko [49]. A canonical application of Sapozhenko’s version of the container method is his proof that the number of independent sets in the $d$ -dimensional hypercube, $Q_{d}$ , is asymptotically equal to $2\sqrt{e}\cdot 2^{2^{d-1}}$ (a result originally proved by Korshunov and Sapozhenko [38]). See also Galvin’s exposition of Sapozhenko’s proof [18].

Recently, Jenssen and Perkins [32] used Sapozhenko’s graph containers for $Q_{d}$ (and Galvin’s extension to weighted independent sets [17]) along with the theory of polymer models and the cluster expansion to deduce refined counting estimates and detailed probabilistic information for independent sets in $Q_{d}$ . The polymer models they consider closely resemble those used by Jenssen, Perkins and Keevash [31] to design approximate counting algorithms in bipartite expander graphs. However, the hypercube $Q_{d}$ is a far weaker expander than the graphs considered in [31], ruling out a direct application of the the cluster expansion method. To overcome this obstacle, they showed that the container method arises naturally as a tool for proving cluster expansion convergence. This synthesis of the cluster expansion and the container method has now seen a number of applications to enumeration problems in combinatorics [30, 33, 9, 3].

Graph containers have in fact been used previously in a number of results in theoretical computer science, including [20, 16, 19]. These results use Sapozhenko’s techniques to prove a type of negative algorithmic result: that the Glauber dynamics (a local Markov chain) for sampling independent sets (or $q$ -colorings) in bipartite graphs with sufficient expansion exhibit torpid mixing; that is, the mixing time is exponentially large in the size of the graph.

In this paper, we return to the algorithmic context to prove positive results, showing that the container method can be algorithmically implemented to design efficient approximate counting and sampling algorithms for a broad class of bipartite graphs. There is in fact a connection been the torpid mixing results above and the current algorithmic results: a key step in using polymer models and the cluster expansion as algorithms to approximately count independent sets in bipartite graphs is showing that most (weighted) independent sets can be accounted for by one of two polymer models that capture small deviations from independent sets that are fully contained in one side of the bipartition. This step (Lemmas 16 below) amounts to proving a kind of torpid mixing result, closely related to the result of Galvin and Tetali who proved torpid mixing for $d$ -regular bipartite $\alpha$ -expanders, for $\alpha=\Omega(\log^{3}d/d)$ [20, Corollary 1.3], similar to the class of graphs to which Theorem 1 applies.

While Theorem 1 gives efficient approximate counting and sampling algorithms for a larger class of bipartite expander graphs than previous approaches, it does not say much about the tractability of #BIS, beyond ruling out a class of graphs as hard examples. In another prominent approximation problem of undetermined complexity, the Unique Games problem, efficient approximation algorithms for expander graphs [2, 37, 44] were later leveraged via graph decompositions into expanding pieces to find subexponential-time algorithms for all instances [1]. This suggests a very natural goal of finding much faster algorithms for approximating #BIS.

Question 1.

Is there a subexponential-time approximation algorithm for #BIS?

Theorem 3 makes a small step in this direction, giving subexponential-time algorithms for regular graphs of growing degree, using an expander algorithm as a subroutine to account for the contribution to $i(G)$ from expanding sets. Non-expanding sets are accounted for separately, also using ideas from graph containers. It is tempting to think that algorithmic graph decomposition results (e.g. [45, 6]) could be used in conjunction with expander algorithms for #BIS, but it is not clear to us how to use results that bound the number of edges between expanding pieces to obtain improved approximation algorithms for #BIS, and so this remains an interesting direction for future research.

1.3. Outline

In Section 2 we give a brief warm-up to show how ideas from graph containers can be used for approximately counting independent sets in graphs. In Section 3 we present the graph container results we will need for the main algorithmic results. In Section 4 we define a polymer model which we will use to approximate the number of independent sets in expander graphs. In Section 5 we prove Theorem 1. In Section 6 we prove Theorem 3. In Section 7 we extend the graph container results of Section 3 and the polymer model of Section 4 to the case of weighted independent sets to prove Theorem 2.

2. Warm-up: algorithms from containers

In this section, we demonstrate how ideas from graph containers can be used to design faster algorithms to approximately count independent sets in (not necessarily bipartite) $d$ -regular graphs. This section is intended as a warm-up which introduces the interplay between the container method and counting algorithms, and is not needed for the proofs of Theorems 1, 2 and 3.

Let us assume that we have an algorithm $A$ that runs in time $a(n)$ on a general (not necessarily $d$ -regular) graph $G$ on $n$ vertices and outputs an $\epsilon$ -relative approximation to $i(G)$ . We will show that if $G$ is $d$ -regular for $d=\omega(1)$ , one can obtain an algorithm running in time $a(n/2)\cdot 2^{o(n)}$ .

Let $T:=n\ln d/d$ . We note that throughout the paper, we let $\ln$ denote the natural logarithm and let $\log$ denote $\log_{2}$ . We may then write

i(G)=i_{<T}(G)+i_{\geq T}(G)

where $i_{<T}(G)$ is the number of independent sets of $G$ of size less than $T$ , and $i_{\geq T}(G)$ is the the number of independent sets of size at least $T$ . One can compute $i_{<T}(G)$ by brute-force in time $\binom{n}{T}\cdot\operatorname{poly}(n)$ . To compute $i_{\geq T}(G)$ , we use a subtle idea of Sapozhenko [51] (also see [29], [34]) with a tighter analysis. First, we fix an ordering $\prec$ of the vertices of the graph $G$ . Given a subset $S\subseteq V(G)$ and vertex $v\in V(G)$ , we let $d_{S}(v)$ denote the number of neighbor of $v$ in $S$ . The following algorithm takes an independent set $I$ of size at least $T$ as input and returns a “certificate” $\xi$ :

•

$t,i\leftarrow 0$ , $\xi\leftarrow(0)^{n}$ , $V_{0}\leftarrow V(G)$
•
while $t\leq T$ do
- –
  
  $v\leftarrow\operatorname{argmax}_{v\in V_{i}}d_{V_{i}}(v)$ with ties broken using $\prec$
- –
  if $v\in I$
  - *
    
    $V_{i+1}\leftarrow V_{i}\setminus(\{v\}\cup N(v))$
  - *
    
    $t\leftarrow t+1$ , $i\leftarrow i+1$ , $\xi_{i}\leftarrow 1$ .
- –
  if $v\not\in I$
  - *
    
    $V_{i+1}\leftarrow V_{i}\setminus\{v\}$
  - *
    
    $i\leftarrow i+1$ .
•

return $\xi$

It is worth noting that $\xi$ is not the indicator vector of any subset of $V(G)$ , but rather an indicator vector describing the steps at which a vertex $v\in I$ is removed from $V_{i}$ . Assume that the algorithm runs for $k=k(\xi)$ steps (i.e., $k$ is that final value that $i$ takes in the execution of the algorithm) when the output is $\xi$ and let $V_{\xi}=V_{k}$ . A key property of $\xi$ is that it determines both $V_{\xi}$ , and $I\cap(V(G)\setminus V_{\xi})$ . We may therefore group independent sets according to their certificate $\xi$ and write

(2)

i_{\geq T}(G)=\sum_{\begin{subarray}{c}\xi\in\{0,1\}^{n},\\ |\xi|=T\end{subarray}}i(G[V_{\xi}]).

The advantage of this expression is that the number of possible certificates $\binom{n}{T}$ is relatively small and, as we show next, the size of each $V_{\xi}$ is close to $n/2$ . We have therefore exploited the clustering phenomenon of independent sets characteristic of the container method to effectively halve the size of the input graph to our algorithm (albeit at the expense of losing regularity of the graph).

We now give the argument to bound the size of each $V_{\xi}$ . For any subset $S\subseteq V(G)$ , let us denote $e_{S}$ to be the number of edges in the induced subgraph $G[S]$ . A straightforward extension of the Hoffman bound (see e.g. [10, 43, 22, 27]), and using the fact that the smallest eigenvalue of a $d$ -regular graph is at least $-d$ , gives us

2e_{S}\geq\frac{2d}{n}|S|^{2}-d|S|.

So for $i\leq k$ , we have

(3)

\max_{v\in V_{i}}d_{V_{i}}(v)\geq\frac{2e_{V_{i}}}{|V_{i}|}\geq\frac{d}{n}\left(2|V_{i}|-n\right),

and so if $v\in I$ , then $|V_{i+1}|\leq|V_{i}|\left(1-\frac{2d}{n}\right)+d$ , and so at the end of the algorithm, we have $|V_{k}|\leq\frac{n}{2}+O\left(\frac{n\ln d}{d}\right)$ .

Thus, if we have an algorithm $A$ running in time $a(n)=a(n,\epsilon)$ on general graphs on $n$ vertices which outputs an $\epsilon$ -relative approximation to $i(G)$ for a general graph $G$ on $n$ vertices, then an $\epsilon$ -relative approximation to (2) may be computed in time $\binom{n}{T}\cdot a\left(\frac{n}{2}+O\left(\frac{n\ln d}{d}\right)\right)$ . Combining this with the brute-force computation of $i<T$ gives an algorithm running in time

\operatorname{poly}(n)\cdot\binom{n}{\frac{n\ln d}{d}}^{2}\cdot a\left(\frac{n}{2}+O\left(\frac{n\ln d}{d}\right)\right)

that outputs an $\epsilon$ -relative approximation to $i(G)$ for a $d$ -regular graph $G$ on $n$ vertices. So in particular, if $d=\omega(1)$ , then using the algorithm of Goldberg, Lapinskas, and Richerby [25] as a blackbox, we obtain a $2^{(0.134+o(1))n}$ time algorithm for approximating $i(G)$ in $d$ -regular graphs on $n$ vertices. We will see below in Section 6 that with more sophisticated ideas this running time can be made subexponential provided that the error $\epsilon=n^{-a}$ for some $a>0$ (Theorem 3).

3. Graph container lemmas

In this section we introduce results from the theory of graph containers that will be key for the algorithms of Theorems 1, 2 and 3. Many of the ideas here have their roots in the aforementioned container method of Sapozhenko [49].

We assume throughout that $G$ is a $d$ -regular bipartite graph on $2n$ vertices with bipartition $X, Y$ , so $|X|=|Y|=n$ . For a subset $A\subseteq X$ , we use $W$ to denote $N(A)$ . Let us define $[A]:=\{u\in X~{}|~{}N(u)\subseteq W\}$ to be the closure of $A$ . Let us call $A$ $2$ -linked if the subgraph of $G^{2}$ induced by $A$ is connected. We say that $A$ is expanding if

|W|-|[A]|\geq(C_{1}/2)\frac{\log^{2}d}{d}|W|\,,

where the constant $C_{1}>0$ will be sufficiently large and chosen later. Otherwise, we say that $A$ is non-expanding.

Let $\mathcal{G}(v,a,w)$ denote the set of $2$ -linked expanding sets $A$ such that $A\ni v$ , $|[A]|=a$ , and $|W|=w$ . Let $\mathcal{G}^{\prime}(v,a)$ be the set of $2$ -linked non-expanding sets $A\ni v$ such that $A=[A]$ , and $|A|=a$ .

Remark: Observe that if $G$ is a bipartite $\alpha$ -expander (as defined at (1)) then

(4)

|N(A)|\geq(1+\alpha)|[A]|

for each $A\subseteq X$ or $A\subseteq Y$ such that $|[A]|\leq n/2$ . Indeed, if $|[A]|\leq n/2$ , then $|N(A)|=|N([A])|\geq(1+\alpha)|[A]|$ . Since $\alpha\leq 1$ , inequality (4) implies that $|[A]|\leq|N(A)|(1-\alpha/2)$ , and so

(5)

|N(A)|-|[A]|\geq(\alpha/2)|N(A)|

for each $A$ such that $|[A]|\leq n/2$ . In the following, condition (5) is slightly more convenient to work with and motivates our definition of an expanding set.

We now state our main technical lemmas. The first bounds the number of expanding sets and the second bounds the number of non-expanding sets (and gives an algorithm to enumerate them).

Lemma 4.

There is an absolute constant $c_{1}>0$ such that for every $v, a, w$ we have

|\mathcal{G}(v,a,w)|\leq 2^{w-c_{1}(w-a)}.

Lemma 5.

There is an absolute constant $c_{2}>0$ such that for every $v, a$ , we have

|\mathcal{G}^{\prime}(v,a)|\leq 2^{c_{2}\frac{a\log^{2}d}{d}}.

Moreover, there is an algorithm running in time $2^{O\left(\frac{a\log^{2}d}{d}\right)}\cdot\operatorname{poly}(n)$ that outputs the set $\mathcal{G}^{\prime}(v,a)$ .

Recall that for a subset $A\subseteq X$ , we use $W$ to denote $N(A)$ , with $|[A]|=a$ , $|W|=w$ . Set $t=w-a$ . For every $s>0$ , let $W_{s}=\{u\in W~{}|~{}d_{[A]}(u)\geq s\}$ . We next define a notion of an approximation of the set $W$ (which in turn determines $[A]$ ).

Definition 6.

A set $F\subseteq W$ is an essential subset for $A$ if

(1)

$F\supseteq W_{d/2}$
(2)

$N(F)\supseteq[A]$ .

The next lemma gives a family $\mathcal{C}(v,a,w)\subset 2^{Y}$ that contains an essential subset for each member of $\mathcal{G}(v,a,w)$ . Crucially the set of approximating sets $\mathcal{C}(v,a,w)$ is far smaller than $\mathcal{G}(v,a,w)$ .

Lemma 7.

There is a family $\mathcal{C}(v,a,w)\subset 2^{Y}$ of size at most

2^{\frac{16w\log^{2}d}{d}}

such that $\mathcal{C}(v,a,w)$ contains an essential subset of every $2$ -linked set $A\ni v$ such that $|[A]|=a$ and $|W|=w$ . Moreover, there is an algorithm running in time $2^{\frac{16w\log^{2}d}{d}}\cdot\operatorname{poly}(n)$ that outputs the set $\mathcal{C}(v,a,w)$ .

We prove Lemma 7 below.

The following lemma of Park [46] strengthens a result of Sapozhenko [49] (the lemma is proved implicitly in [34] also).

Lemma 8.

There is an absolute constant $c_{3}>0$ such that the following holds: for every $F\subseteq X$ , let $\mathcal{G}(F,a,w)$ be the set of expanding $2$ -linked sets $A\subseteq X$ such that $|[A]|=a$ , $|W|=w$ , and $F$ is an essential subset of $A$ . Then

|\mathcal{G}(F,a,w)|\leq 2^{w-c_{3}\left(w-a\right)}.

With these lemmas in hand, we now prove Lemma 4 and Lemma 5.

Proof of Lemma 4.

First note that

(6)

\displaystyle\mathcal{G}(v,a,w)\subseteq\bigcup_{F\in\mathcal{C}(v,a,w)}\mathcal{G}(F,a,w)

and so by Lemmas 7 and 8, we have that

	$\displaystyle\|\mathcal{G}(v,a,w)\|$	$\displaystyle\leq\sum_{F\in\mathcal{C}(v,a,w)}\|\mathcal{G}(F,a,w)\|$
		$\displaystyle\leq\|\mathcal{C}(v,a,w)\|\cdot\max_{F\in\mathcal{C}(v,a,w)}\|\mathcal{G}(F,a,w)\|$
		$\displaystyle\leq 2^{\frac{16w\log^{2}d}{d}}\cdot 2^{w-c_{3}\left(w-a\right)}$
		$\displaystyle\leq 2^{w-\left(\frac{c_{3}}{2}\right)\left(w-a\right)}$

where for the last inequality we used that $w-a\geq(C_{1}/2)\frac{\log^{2}d}{d}w$ by the definition of an expanding set and assumed that $C_{1}>64/c_{3}$ . ∎

Proof of Lemma 5.

Let $F$ be an essential subset of $A$ where $A$ is non-expanding. Note that each vertex in $W\setminus F$ has at least $d/2$ neighbors in $[A]^{c}$ , and there are at most $d(w-a)$ edges between $W$ and $[A]^{c}$ . It follows that

(d/2)\cdot|W\setminus F|\leq d(w-a)\

and so

|W\setminus F|\leq 2(w-a)\leq C_{1}\frac{\log^{2}d}{d}w\,.

Moreover, $W\setminus F\subset N^{2}(F)$ , and $N^{2}(F)\leq wd^{2}$ , and so there are at most

\binom{wd^{2}}{\leq C_{1}w(\log^{2}d)/d}\leq 2^{O\left(\frac{a\log^{3}d}{d}\right)}

choices for $W$ , each of which determines a $[A]$ . Let $\mathcal{G}^{\prime}(F,a)$ denote the collection of $A\subset X$ such that $A=[A]$ , $v\in A$ , $A$ is $2$ -linked, non-expanding and $F$ is an essential subset for $A$ . Then by the above

|\mathcal{G}^{\prime}(F,a)|=2^{O\left(\frac{a\log^{3}d}{d}\right)}\,.

Moreover, we can generate the set $\mathcal{G}^{\prime}(F,a)$ in time $2^{O\left(\frac{a\log^{3}d}{d}\right)}\operatorname{poly}(n)$ by listing each set $W$ that is a union of $F$ with a subset of $N^{2}(F)$ of size at most $C_{1}w(\log^{2}d)/d$ , generating the corresponding closed set $[A]$ such that $N(A)=W$ , and checking it satisfies the required conditions.

Now, by Lemma 7,

(7)

\displaystyle\mathcal{G}^{\prime}(v,a)\subseteq\bigcup_{w\in\left[a,a\left(1+C_{1}\frac{\log^{2}d}{d}\right)\right]}\bigcup_{F\in\mathcal{C}(v,a,w)}\mathcal{G}^{\prime}(F,a)

and so

|\mathcal{G}^{\prime}(v,a)|\leq\left(C_{1}\frac{a\log^{2}d}{d}\right)\cdot 2^{O\left(\frac{a\log^{3}d}{d}\right)}=2^{O\left(\frac{a\log^{3}d}{d}\right)}\,.

Similarly, by (7), Lemma 7, and the above algorithm for generating $\mathcal{G}^{\prime}(F,a)$ , we may generate $\mathcal{G^{\prime}}(v,a)$ in time $2^{O\left(\frac{a\log^{3}d}{d}\right)}\cdot\operatorname{poly}(n)$ . ∎

We will need a covering result originally due to Lovász [42] and Stein [54].

Theorem 9.

Let $H$ be a bipartite graph on vertex sets $P$ and $Q$ where the degree of each vertex in $P$ is at least $a$ and the degree of each vertex in $Q$ is at most $b$ . Then there is subset $Q^{\prime}\subset Q$ of size at most $\frac{|Q|}{a}(1+\ln b)$ such that $P\subseteq N(Q^{\prime})$ .

We record the following corollary of Theorem 9 which we will use in the proof of Lemma 7.

Corollary 10.

Let $A\subseteq X$ be $2$ -linked and let $A\ni v$ . Then the following hold:

(1)

There exists a $2$ -linked subset $A^{\prime}\subseteq A$ of size at most $2\frac{a}{d}\ln d+2\frac{w}{d}$ such that $A^{\prime}\ni v$ and $N(A^{\prime})$ is an essential subset for $A$ .
(2)

There exists a $2$ -linked subset $A^{\prime\prime}\subseteq A$ of size at most $2\frac{a}{d}\ln d+2\frac{w}{d}+2(w-a)$ such that $N(A^{\prime\prime})=W$ .

Proof.

We begin by proving (1). Let $A_{0}\subset[A]$ be a maximal subset of vertices containing $v$ with pairwise disjoint neighborhoods. Clearly $|A_{0}|\leq\frac{w}{d}$ and $N^{2}(A_{0})\supseteq A$ . Theorem 9 guarantees a subset $A_{1}\subseteq A$ of size at most $2\frac{a}{d}\ln d$ such that $W_{d/2}\supseteq N(A_{1})$ . Suppose $A_{0}\cup A_{1}$ is not $2$ -linked, then there are at most $\frac{w}{d}$ $2$ -linked components. Indeed, this is true since $N(A_{0}\cup A_{1})\subseteq W$ and each two linked component covers at least $d$ vertices of $W$ . Since $[A]$ is $2$ -linked, it follows that one can choose a subset $A_{2}\subseteq[A]$ of size at most $\frac{w}{d}$ such that $A^{\prime}:=A_{0}\cup A_{1}\cup A_{2}$ is $2$ -linked. We note that $|A^{\prime}|\leq 2\frac{a}{d}\ln d+2\frac{w}{d}$ . To show that $N(A^{\prime})$ is an essential subset for $A$ observe that

•

$N^{2}(A^{\prime})\supseteq N^{2}(A_{0})\supseteq[A]$ , and
•

$W_{d/2}\subseteq N(A_{1})\subseteq N(A^{\prime})$ .

We now turn to (2). Note that each vertex in $W\setminus W_{d/2}$ has at least $d/2$ neighbors in $[A]^{c}$ , and there are at most $d(w-a)$ edges between $W$ and $[A]^{c}$ . It follows that

(d/2)\cdot|W\setminus W_{d/2}|\leq d(w-a)\

and so

|W\setminus W_{d/2}|\leq 2(w-a).

Let $A_{3}\subseteq A$ be a minimal cover of $W\setminus W_{d/2}$ . We have that $|A_{3}|\leq|W\setminus W_{d/2}|\leq 2(w-a)$ , and every vertex of $A_{3}$ is at a distance $2$ from some vertex in $A_{0}\subseteq A^{\prime}$ by the maximality of $A_{0}$ . Thus $A^{\prime\prime}=A^{\prime}\cup A_{3}$ is $2$ -linked, $|A^{\prime\prime}|\leq 2(w-a)+|A^{\prime}|$ and $N(A^{\prime\prime})\supseteq W$ , completing the proof. ∎

Finally we prove Lemma 7. The proof we present is different and simpler than the one in [49], whose proof works for a quantitatively weaker notion of expansion, but in return, asks that no two vertices share many common neighbors.

Proof of Lemma 7.

Let $A\ni v$ be a $2$ -linked subset as in the statement of the lemma. By Corollary 10, there exists a $2$ -linked subset $A^{\prime}\subset A$ of size at most $4\frac{w}{d}\ln d$ such that $v\ni A^{\prime}$ and $N(A^{\prime})$ is an essential subset for $A$ . In view of this, we let $\mathcal{B}(v,a,w)$ be the set of all $2$ -linked subsets of $A$ , containing $v$ , of size at most $4\frac{w}{d}\ln d$ and let

\mathcal{C}(v,a,w)=\left\{N(A):A\in\mathcal{B}(v,a,w)\right\}\,.

It remains to upper bound the size of $\mathcal{C}(v,a,w)$ and describe an algorithm that outputs the set. Note that $|\mathcal{B}(v,a,w)|=|\mathcal{C}(v,a,w)|$ and $|\mathcal{B}(v,a,w)|$ is at most the number of trees in $G^{2}$ containing $v$ as the root with at most $4\frac{w\log d}{d}$ vertices. Note that the maximum degree in $G^{2}$ is at most $d(d-1)$ . So $\mathcal{B}(v,a,w)$ can be enumerated using the following procedure:

(1)

Assume an ordering on the neighbors of every vertex in the graph $G^{2}$ . Let us use $v_{i}$ to denote the $i$ ’th neighbor of a vertex $v$ . Let us also denote $v_{0}=v$ .
(2)

Generate a list $S\in\{0,\ldots,d(d-1)\}^{8\frac{w\ln d}{d}}$ .
(3)

Consider the set $T_{S}=\left\{v^{(0)},\ldots,v^{(s)}\right\}$ where $v^{(0)}=v$ and $v^{(i)}=v^{(i-1)}_{S_{i}}$ .
(4)

If $|T_{S}|\leq 4\frac{w\ln d}{d}$ , then output $T_{S}$ .

Consider any tree in $G^{2}$ with root $v$ and $s\leq 4\frac{w\ln d}{d}$ nodes. There is at least one choice of the list $S$ that causes the above procedure to output the vertices of this tree, namely, if $(S_{1},\ldots,S_{2s})$ is the DFS traverse order and $S_{i}=0$ for $i\geq 2s$ .

For each list $S$ , the procedure takes $\operatorname{poly}(n)$ time, and the number of possible lists is $(d(d-1)+1)^{8\frac{w}{d}\ln d}\leq 2^{\frac{16w\log^{2}d}{d}}$ . Therefore there is an algorithm running in time $2^{\frac{16w\log^{2}d}{d}}\cdot\operatorname{poly}(n)$ that outputs the set $\mathcal{C}(v,a,w)$ . ∎

4. Polymer Models

In this section we introduce a variant of the polymer models used by Jenssen, Keevash, and Perkins [31] to obtain approximation algorithms for bipartite expander graphs. Polymer models originated in statistical physics (e.g. [26, 39]) as a means to study spin models on lattices. Recently they were used to design algorithms for spin models at low temperatures [28].

Fix a $d$ -regular bipartite graph $G$ with bipartition $(X,Y)$ of size $n$ each. A polymer of $G$ is a $2$ -linked, expanding subset of $X$ . Recall, from the previous section, that a set $A\subseteq X$ is expanding if

(8)

\displaystyle|N(A)|-|[A]|\geq(C_{1}/2)\frac{|N(A)|\log^{2}d}{d}

where $C_{1}$ is the constant from Theorem 1. Let $\mathcal{P}(G)$ denote the set of all polymers of $G$ . The weight of a polymer $\gamma$ is given by $2^{-|N(\gamma)|}=:w_{\gamma}$ . We call two polymers $\gamma$ and $\gamma^{\prime}$ compatible if $\gamma\cup\gamma^{\prime}$ is not $2$ -linked, and incompatible otherwise, in which case we write $\gamma\nsim\gamma^{\prime}$ . Let $\Omega(G)$ denote the collection all subsets of mutually compatible polymers (including the empty set). The polymer model partition function is

(9)

\Xi_{G}^{X}:=\sum_{\Lambda\in\Omega(G)}\prod_{\gamma\in\Lambda}w_{\gamma}\,,

and the associated Gibbs distribution $\nu^{X}_{G}$ on $\Omega(G)$ defined by

\nu^{X}_{G}(\Lambda)=\frac{\prod_{\gamma\in\Lambda}w_{\gamma}}{\Xi_{G}^{X}}\text{ for }\Lambda\in\Omega(G)\,.

When the weights of the polymer model are small enough one can hope to understand the partition function and the Gibbs distribution via perturbative techniques, and in particular, the cluster expansion.

For a tuple $\Gamma$ of polymers, the incompatibility graph, $H(\Gamma)$ , is the graph with vertex set $\Gamma$ and an edge between any two incompatible polymers. A cluster $\Gamma$ is an ordered tuple of polymers so that $H(\Gamma)$ is connected. Let us use $\mathcal{C}(G)$ to denote the set of all clusters of $G$ . The cluster expansion of $\Xi_{G}^{X}$ is the formal series expansion

(10)

\ln\Xi_{G}^{X}=\sum_{\Gamma\in\mathcal{C}(G)}\phi(H(\Gamma))\prod_{\gamma\in\Gamma}w_{\gamma}\,,

where $\phi$ is the Ursell function defined by

\phi(H)=\sum_{\begin{subarray}{c}E^{\prime}\subseteq E(H)\\ (V(H),E^{\prime})\text{ connected}\end{subarray}}(-1)^{|E^{\prime}|}\,.

Since $\mathcal{C}(G)$ is an infinite set, the series in (10) is an infinite sum. In our application, we will work with a truncated sum after after establishing a fast enough rate of convergence. For these convergence bounds, (as in [28, 31]) we use a special case of the Koteckỳ–Preiss condition [39].

Theorem 11 ([39]).

Fix functions $f:\mathcal{P}(G)\rightarrow[0,\infty)$ and $g:\mathcal{P}(G)\rightarrow[0,\infty)$ . Suppose that for every $\gamma\in\mathcal{P}(G)$ , we have

(11)

\sum_{\gamma^{\prime}\not\sim\gamma}w_{\gamma^{\prime}}e^{f(\gamma^{\prime})+g(\gamma^{\prime})}\leq f(\gamma).

Then the cluster expansion converges absolutely. Moreover, for every vertex $v$ ,

(12)

\sum_{\begin{subarray}{c}\Gamma\in\mathcal{C}(G)\\ \Gamma\ni v\end{subarray}}\left|\phi(\Gamma)\prod_{\gamma\in\Gamma}w_{\gamma}\right|e^{g(\Gamma)}\leq 1.

We will use this to obtain some relevant structural information about the polymer model. For polymers $\gamma$ , define modified polymer weights $\tilde{w}_{\gamma}=2^{-|N(\gamma)|}\cdot 2^{\frac{|\gamma|\log^{2}d}{d}}$ and let

(13)

\displaystyle\tilde{\Xi}_{G}^{X}=\sum_{\Lambda\in\Omega(G)}\prod_{\gamma\in\Lambda}\tilde{w}_{\gamma}

be the modified polymer model partition function. We first show that this polymer model satisfies the Koteckỳ-Preiss condition for a suitable choice of functions $f$ and $g$ .

Lemma 12.

Consider the modified polymer model where the polymers are all the $2$ -linked subsets $A\subseteq X$ that are expanding, and the weight of a polymer $\gamma$ is given by

\tilde{w}_{\gamma}=2^{-|N(\gamma)|}\cdot 2^{\frac{|\gamma|\log^{2}d}{d}}.

Let $f(\gamma)=\ln 2\cdot\frac{|\gamma|\log^{2}d}{d}$ , and $g(\gamma)=2\ln 2\cdot\frac{|N(\gamma)|\log^{2}d}{d}$ . Then for $d$ sufficiently large, every polymer $\gamma$ satisfies (11). The conclusion also holds for the original polymer model with weights $w_{\gamma}$ and the same choice of functions $f(\cdot),g(\cdot)$ .

Proof of Lemma 12.

It suffices to prove the claim for the modified polymer model since $\tilde{w}_{\gamma}>w_{\gamma}$ for all $\gamma$ .

Recall, from the proof of Lemma 4 that $c_{1}\geq c_{3}/2\geq 64/C_{1}$ . We evaluate

	$\displaystyle\sum_{\gamma^{\prime}\not\sim\gamma}\tilde{w}_{\gamma^{\prime}}e^{f(\gamma^{\prime})+g(\gamma^{\prime})}$	$\displaystyle\leq\sum_{v\in\gamma}\sum_{\gamma^{\prime}\not\sim v}\tilde{w}_{\gamma^{\prime}}e^{f(\gamma^{\prime})+g(\gamma^{\prime})}$
		$\displaystyle\leq\|\gamma\|\cdot\max_{v\in\gamma}\sum_{\gamma^{\prime}\not\sim v}2^{-\|N(\gamma^{\prime})\|}e^{2f(\gamma^{\prime})+g(\gamma^{\prime})}$
		$\displaystyle\leq\|\gamma\|\cdot\max_{v\in\gamma}\sum_{w\geq d}\left(\sum_{t\geq(C_{1}/2)\frac{w\log^{2}d}{d}}\|\mathcal{G}(v,w-t,w)\|2^{-w}\cdot e^{4\ln 2\cdot\frac{w\log^{2}d}{d}}\right)$
		$\displaystyle\leq\|\gamma\|\cdot\max_{v\in\gamma}\sum_{w\geq d}e^{4\ln 2\cdot\frac{w\log^{2}d}{d}}\left(\sum_{t\geq(C_{1}/2)\frac{w\log^{2}d}{d}}\|\mathcal{G}(v,w-t,w)\|2^{-w}\right)$
		$\displaystyle\leq\|\gamma\|\sum_{w\geq d}e^{4\ln 2\cdot\frac{w\log^{2}d}{d}}\left(\sum_{t\geq(C_{1}/2)\frac{w\log^{2}d}{d}}2^{-c_{1}\cdot t}\right)$
		$\displaystyle\leq d\|\gamma\|\sum_{w\geq d}e^{4\ln 2\cdot\frac{w\log^{2}d}{d}}2^{-8\frac{w\log^{2}d}{d}}$
		$\displaystyle\leq d^{2}\|\gamma\|2^{-4\log^{2}d}$
		$\displaystyle\ll\ln 2\cdot\frac{\|\gamma\|\log^{2}d}{d}=f(\gamma)\,.\qed$

Therefore, Theorem 11 gives us that

(14)

\displaystyle\sum_{\begin{subarray}{c}\Gamma\in\mathcal{C}(G)\\ \Gamma\ni v\end{subarray}}\left|\phi(\Gamma)\prod_{\gamma\in\Gamma}w_{\gamma}\right|e^{g(\gamma)}\leq 1\,,

and the same with $\tilde{w}_{\gamma}$ replacing $w_{\gamma}$ .

Now define the exponential of the truncated cluster expansion

(15)

{\Xi}_{G}^{X}(\ell):=\exp\left(\sum_{\begin{subarray}{c}\Gamma\in\mathcal{C}(G)\\ \|\Gamma\|\leq\ell\end{subarray}}\phi(\Gamma)\prod_{\gamma\in\Gamma}w_{\gamma}\right),

where $\|\Gamma\|:=\sum_{\gamma\in\Gamma}|\gamma|$ . The following lemma bounds the error in approximating $\Xi_{G}^{X}$ by ${\Xi}_{G}^{X}(\ell)$ .

Lemma 13.

We have for every $\ell\geq 1$ ,

(16)

\left|\ln\Xi_{G}^{X}-\ln{\Xi}_{G}^{X}(\ell)\right|\leq n\cdot 2^{-2\frac{\ell\log^{2}d}{d}}\,.

In particular, if $\ell\geq\frac{d}{2\log^{2}d}\log(n/\epsilon)$ , then

\left|\ln\Xi_{G}^{X}-\ln{\Xi}_{G}^{X}(\ell)\right|\leq\epsilon\,.

Proof.

First recall that for a cluster $\Gamma$ ,

g(\Gamma)=\sum_{\gamma\in\Gamma}g(\gamma)=2\ln 2\frac{\log^{2}d}{d}\sum_{\gamma\in\Gamma}|N(\gamma)|\geq 2\ln 2\frac{\log^{2}d}{d}\|\Gamma\|.

It follows from (14) that

	$\displaystyle\left\|\ln\Xi_{G}^{X}-\ln{\Xi}_{G}^{X}(\ell)\right\|$	$\displaystyle=\left\|\sum_{\begin{subarray}{c}\Gamma\in\mathcal{C}(G)\\ \\|\Gamma\\|>\ell\end{subarray}}\phi(\Gamma)\prod_{\gamma\in\Gamma}w_{\gamma}\right\|$
		$\displaystyle\leq\sum_{v}\sum_{\begin{subarray}{c}\Gamma\in\mathcal{C}(G)\\ \Gamma\ni v\\ \\|\Gamma\\|>\ell\end{subarray}}\left\|\phi(\Gamma)\prod_{\gamma\in\Gamma}w_{\gamma}\right\|$
		$\displaystyle\leq ne^{-2\ln 2\frac{\ell\log^{2}d}{d}}$
		$\displaystyle=n\cdot 2^{-2\frac{\ell\log^{2}d}{d}}\,.\qed$

Let $\|\mathbf{\Lambda}\|=\sum_{\gamma\in\mathbf{\Lambda}}|\gamma|$ for $\mathbf{\Lambda}\sim\nu_{G}^{X}$ . We have the following large deviation result for $\|\mathbf{\Lambda}\|$ , following [32, Lemma 16].

Lemma 14.

For any $\delta\in(0,1)$ , there is a $d_{0}=d_{0}(\delta)$ such for $d\geq d_{0}$ , we have

\mathbb{P}(\|\mathbf{\Lambda}\|\geq\delta n)\leq 2^{-\left(\frac{\delta n\log^{2}d}{2d}\right)}.

Proof.

With $\tilde{\Xi}_{G}^{X}$ as defined at (13), we have that $\ln\tilde{\Xi}_{G}^{X}-\ln\Xi_{G}^{X}=\ln\mathbf{E}e^{\zeta\cdot\|\mathbf{\Lambda}\|}$ for $\zeta=\frac{\ln 2\log^{2}d}{d}$ . Summing (14) over all $v$ , and using the fact that $|N(\gamma)|\geq d$ for every $\gamma$ , we get

(17)

\ln\tilde{\Xi}_{G}^{X}\leq\sum_{v}\sum_{\begin{subarray}{c}\Gamma\in\mathcal{C}(G)\\ \Gamma\ni v\end{subarray}}\left|\phi(\Gamma)\prod_{\gamma\in\Gamma}\tilde{w}_{\gamma}\right|\leq n\cdot 2^{-2\log^{2}d}.

Therefore, we have

\displaystyle\ln\mathbf{E}e^{\zeta\cdot\|\mathbf{\Lambda}\|}\leq\ln\tilde{\Xi}_{G}^{X}\leq n\cdot 2^{-2\log^{2}d}\,,

and so by Markov’s inequality we have

	$\displaystyle\mathbb{P}(\\|\mathbf{\Lambda}\\|\geq\delta n)$	$\displaystyle\leq\exp\left(-\zeta\delta n+n\cdot 2^{-2\log^{2}d}\right)$
		$\displaystyle\leq 2^{-\frac{\delta n\log^{2}d}{2d}},$

where the last inequality follows because $(1/2)\ln 2\geq(\delta\log^{2}d)^{-1}\cdot d\cdot 2^{-2\log^{2}d}$ for large enough $d$ . ∎

Approximate counting and sampling for polymer models

Here, we will use an algorithm from [31] to approximate $\Xi_{G}^{X}$ and approximately sample from $\nu_{G}^{X}$ .

Lemma 15.

Then there is an FPTAS for $\Xi_{G}^{X}$ that runs in time $(n/\epsilon)^{O(d)}$ . Moreover, for every $\epsilon>0$ there is a randomized algorithm that runs in time $(n/\epsilon)^{O(d)}$ that outputs a configuration $\Lambda\in\Omega(G)$ with distribution $\nu^{X}_{\text{alg}}$ so that $\|\nu^{X}_{\text{alg}}-\nu_{G}^{X}\|_{TV}<\epsilon$ .

To give a sense of the above algorithms, we briefly describe the FPTAS for $\Xi_{G}^{X}$ . Using Lemma 13, it is enough to compute $\Xi_{G}^{X}(\ell)$ (as defined in (15)) where $\ell=\left\lceil\frac{d}{2\log^{2}d}\log(n/\epsilon)\right\rceil$ .

This may be done by

(1)

listing all clusters $\Gamma$ of size at most $\ell$ ,
(2)

computing $\phi(\Gamma)$ ,
(3)

computing $\prod_{\gamma\in\Gamma}w_{\gamma}$ , and
(4)

evaluating $\Xi_{G}^{X}(\ell)$ by exponentiating the truncated cluster expansion.

To approximately sample from $\nu_{G}^{X}$ we appeal to the self-reducibility of the abstract polymer model, see [28, Theorem 10].

Remark 1.

In the proof of Theorem 3 in Section 6, we will in fact need to approximate the partition function of various subgraphs of $G$ . These subgraphs, $H\subseteq G$ , all have the following form: $H$ is the subgraph induced on vertex set $X^{\prime}\cup Y^{\prime}$ for some $X^{\prime}\subseteq X$ , $Y^{\prime}\subseteq Y$ such that $N(X^{\prime})\subseteq Y^{\prime}$ . For such a subgraph, we define a polymer of $H$ to be an expanding $2$ -linked subset of $X^{\prime}$ and define the partition function $\Xi_{H}^{X^{\prime}}$ in the obvious way. We note that the polymers of $H$ are a subset of the polymers of $G$ . In particular, since the polymer model on $G$ satisfies the hypothesis of Theorem 11, the polymer model on $H$ also satisfies the hypothesis of Theorem 11 with the same functions $f$ and $g$ . In particular, Lemmas 13, 14 and 15 also apply to the polymer model on $H$ .

5. An algorithm for expander graphs: proof of Theorem 1

In this section, we prove Theorem 1. We assume throughout that $G$ is a $d$ -regular bipartite $\alpha$ -expander with $\alpha=C_{1}\frac{\log d}{d}$ . We let $X, Y$ denote the vertex classes of $G$ and let $n=|X|=|Y|$ . We note that by (5), we have that $|N(A)|-|A||\geq(C_{1}/2)\frac{|N(A)|\log^{2}d}{d}$ for every $A\subset X$ or $A\subset Y$ such that $|[A]|\leq n/2$ , i.e. each such set $A$ is expanding.

First we show that $i(G)$ can be approximated well by a linear combination of the polymer model partition functions $\Xi_{G}^{X}$ and $\Xi_{G}^{Y}$ (as defined in (9)). We may then use the algorithm of Section 4 to approximate $\Xi_{G}^{X}$ and $\Xi_{G}^{Y}$ . Recall that we let $\mathcal{I}=\mathcal{I}(G)$ denote the set of independent sets of $G$ . For the sampling algorithms, we show that $\mu_{G}$ can be approximated by a mixture of probability distributions on $\mathcal{I}$ derived from the polymer measures $\nu^{X}_{G}$ , $\nu_{G}^{Y}$ . We let $\hat{\nu}^{X}_{G}$ denote the probability distribution on $\mathcal{I}$ defined as follows:

(1)

Sample a collection of compatible polymers $\Lambda$ from the measure $\nu^{X}_{G}$ .
(2)

Set $I=J\cup\bigcup_{\gamma\in\Lambda}\gamma$ where $J$ is a uniformly random subset of $Y\backslash\bigcup_{\gamma\in\Lambda}N(\gamma)$ .

We define $\hat{\nu}^{Y}_{G}$ analogously and define the mixture

\hat{\mu}_{G}=\frac{\Xi_{G}^{X}}{\Xi_{G}^{X}+\Xi_{G}^{Y}}\hat{\nu}^{X}_{G}+\frac{\Xi_{G}^{Y}}{\Xi_{G}^{X}+\Xi_{G}^{Y}}\hat{\nu}^{Y}_{G}\,.

Lemma 16.

For $n$ sufficiently large,

2^{n}\cdot\left(\Xi_{G}^{X}+\Xi_{G}^{Y}\right)

is an $\epsilon$ -relative approximation to $i(G)$ where $\epsilon=2^{-\frac{n\log^{2}d}{60d}}.$ Moreover,

\|\mu_{G}-\hat{\mu}_{G}\|_{TV}\leq 2\epsilon\,.

Proof.

Let us define

\mathcal{I}_{X}:=\{I\in\mathcal{I}~{}|~{}\text{every $2$-linked component of $I\cap X$ is expanding}\}\,

i.e. the set of all $I$ such that $\nu_{G}^{X}(I)>0$ . We note that since the independence number of $G$ is $n$ , we have that for each $I\in\mathcal{I}$ ,

\min(|[I\cap X]|,|[I\cap Y]|)\leq n/2\,.

In particular, every component of either $I\cap X$ or $I\cap Y$ is expanding. Therefore $\mathcal{I}=\mathcal{I}_{X}\cup\mathcal{I}_{Y}$ and so

(18)

\displaystyle i(G)=|\mathcal{I}_{X}|+|\mathcal{I}_{Y}|-|\mathcal{I}_{X}\cap\mathcal{I}_{Y}|.

Note that

(19)

\mathcal{I}_{X}=2^{n}\cdot\Xi_{G}^{X}

and moreover, the set of $2$ -linked components of $I\cap X$ for a uniformly chosen $I\in\mathcal{I}_{X}$ is distributed exactly accordingly to $\nu_{G}^{X}$ . It will suffice to bound the size of $\mathcal{I}_{X}\cap\mathcal{I}_{Y}$ .

Letting $\mathcal{I}^{\delta}_{X}=\{I\in\mathcal{I}_{X}:|I\cap X|\leq\delta n\}$ , for $\delta\in(0,1)$ , it follows by Lemma 14 that

(20)

|\mathcal{I}_{X}\backslash\mathcal{I}^{\delta}_{X}|\leq|\mathcal{I}_{X}|\cdot 2^{-\frac{\delta n\log^{2}d}{2d}}.

Defining $\mathcal{I}_{Y}$ , $\mathcal{I}^{\delta}_{Y}$ , analogously and taking $\delta=1/30$ we have

(21)

|\mathcal{I}^{\delta}_{X}\cap\mathcal{I}^{\delta}_{Y}|\leq\binom{2n}{\leq 2\delta n}\leq 2^{n}\cdot 2^{-\frac{n\log^{2}d}{60d}-1}\leq|\mathcal{I}_{X}|\cdot 2^{-\frac{n\log^{2}d}{60d}-1}.

By (19), (20) and (21) we conclude that

(22)

\displaystyle|\mathcal{I}_{X}\cap\mathcal{I}_{Y}|\leq|\mathcal{I}_{X}\backslash\mathcal{I}_{X}^{\delta}|+|\mathcal{I}_{Y}\backslash\mathcal{I}_{Y}^{\delta}|+|\mathcal{I}^{\delta}_{Y}\cap\mathcal{I}_{X}^{\delta}|\leq 2^{n}(\Xi_{G}^{X}+\Xi_{G}^{Y})2^{\frac{-n\log^{2}d}{60d}}\,,

and therefore, by (18),

(23)

\displaystyle 2^{n}(\Xi_{G}^{X}+\Xi_{G}^{Y})\cdot\left(1-2^{\frac{-n\log^{2}d}{60d}}\right)\leq i(G)\leq 2^{n}(\Xi_{G}^{X}+\Xi_{G}^{Y}).

This completes the proof of the first claim. For the second claim we recall the following formula for the total variation distance between discrete probability measures:

(24)

\displaystyle\|\mu_{G}-\hat{\mu}_{G}\|_{TV}=\sum_{I:\hat{\mu}_{G}(I)>\mu_{G}(I)}\hat{\mu}_{G}(I)-\mu_{G}(I)\,.

We note that for $I\in\mathcal{I}_{X}\triangle\mathcal{I}_{Y}$ , $\hat{\mu}_{G}(I)=2^{-n}(\Xi_{G}^{X}+\Xi_{G}^{Y})^{-1}$ , whereas for $I\in\mathcal{I}_{X}\cap\mathcal{I}_{Y}$ , $\hat{\mu}_{G}(I)=2^{1-n}(\Xi_{G}^{X}+\Xi_{G}^{Y})^{-1}$ . It follows from (23) that $\hat{\mu}_{G}(I)>\mu_{G}(I)$ only if $I\in\mathcal{I}_{X}\cap\mathcal{I}_{Y}$ . By (22) and (24), we then have

\|\mu_{G}-\hat{\mu}_{G}\|_{TV}\leq 2\cdot 2^{\frac{-n\log^{2}d}{60d}}\,.\qed

Now we prove Theorem 1.

Proof of Theorem 1.

Set $\epsilon_{0}=2^{-\frac{n\log^{2}d}{60d}}$ . First suppose $\epsilon\leq 2\epsilon_{0}$ , then $i(G)$ may be computed exactly and a uniformly random independent set can be sampled by brute-force in time $2^{n+o(n)}=(1/\epsilon)^{O(d/\log^{2}d)}$ .

Now suppose $\epsilon>2\epsilon_{0}$ . By Lemma 16, it is enough to compute an $(\epsilon/4)$ -relative approximation to both $\Xi_{G}^{X}$ and $\Xi_{G}^{Y}$ . By using the algorithm given by Lemma 15, this takes time $(n/\epsilon)^{O(d)}$ .

For the approximate sampling algorithm, we note that by Lemma 16, it is enough to obtain an $\epsilon/2$ -approximate sample from $\hat{\mu}_{G}$ . We do this as follows: we first compute $\epsilon/8$ -relative approximations to $\Xi_{G}^{X}$ and $\Xi_{G}^{Y}$ by computing $\Xi_{G}^{X}(\ell)$ and $\Xi_{G}^{Y}(\ell)$ respectively, with $\ell$ chosen as in Lemma 15. We then pick $X$ or $Y$ with respective probabilities $\Xi_{G}^{X}(\ell)/(\Xi_{G}^{X}(\ell)+\Xi_{G}^{Y}(\ell))$ and $\Xi_{G}^{Y}(\ell)/(\Xi_{G}^{X}(\ell)+\Xi_{G}^{Y}(\ell))$ , and then use the polymer sampling algorithm of Lemma 15 to approximately sample a configuration of compatible polymers $\Lambda$ from $X$ (resp. $Y$ ), accurate to within total variation distance $\epsilon/8$ . Given the polymer configuration $\Lambda$ we then independently select each vertex of $Y\setminus N(\Lambda)$ (resp. $X\setminus N(\Lambda)$ ) with probability $1/2$ and add these to the independent set. The distribution of the output is then within total variation distance $\epsilon/2$ of $\hat{\mu}_{G}$ . See the sampling algorithm of [31] for details on the calculation of this bound. ∎

6. An algorithm for general regular bipartite graphs: Proof of Theorem 3

In this section we prove Theorem 3, giving an algorithm that, for any constant $c>0$ , returns an $n^{-c}$ -relative approximation for the number of independent sets in a general $d$ -regular bipartite graph. As in the previous sections we let $G$ denote a $d$ -regular graph on $2n$ vertices with vertex classes $X$ and $Y$ . The algorithm proceeds by separating the contribution of expanding and non-expanding $2$ -linked sets of $X$ to the independent set count. To estimate the the contribution from non-expanding components, we use a simple argument inspired by the container method (see Lemma 17 below). To estimate the contribution from expanding components, we appeal to the algorithm of Lemma 15.

We begin with the the following lemma which will allow us to group non-expanding components according to their closure. We say that a set $A\subseteq X$ is closed if $A=[A]$ .

Lemma 17.

Let $A\subseteq X$ be a $2$ -linked, closed, non-expanding set. Then there is a randomized $\operatorname{poly}(n)\cdot{\epsilon^{-2}}\ln(1/\delta)\cdot 2^{O\left(\frac{|A|\log^{2}d}{d}\right)}$ -time algorithm that outputs an $\epsilon$ -relative approximation to the number of $2$ -linked $B\subseteq A$ such that $N(B)=N(A)$ with probability at least $1-\delta$ .

Proof.

Let $W=N(A)$ , $w=|W|$ and $a=|[A]|$ . Let

\mathcal{D}:=\{B\subseteq A~{}|~{}N(B)=W~{}\text{and}~{}B~{}\text{is $2$-linked}\}

be the set whose size we would like to estimate. By Corollary 10 there exists a set $A^{\prime}\in\mathcal{D}$ of size at most

2\frac{a}{d}\ln d+2\frac{w}{d}+2(w-a)=O\left(\frac{a\log^{2}d}{d}\right)\,.

It follows that

|\mathcal{D}|\geq 2^{a-O\left(\frac{a\log^{2}d}{d}\right)}.

Indeed every superset $B\supseteq A^{\prime}$ satisfies $N(B)=W$ and $B$ is $2$ -linked. The first property is clear, since $N(B)\supseteq N(A^{\prime\prime})=W$ . The second property holds because every vertex in $B$ is either in $A^{\prime}$ or at a distance $2$ from some vertex in $A^{\prime}$ , which is itself $2$ -linked. It follows that $|\mathcal{D}|$ can be estimated to relative error $\epsilon$ by sampling

\frac{1}{\epsilon^{2}}\ln(1/\delta)\cdot 2^{O\left(\frac{a\log^{2}d}{d}\right)}

subsets of $A$ . ∎

The algorithm

For a closed, $2$ -linked subset $A\subseteq X$ , let us denote

\mathcal{D}(A):=\#\{B\subseteq A~{}|~{}B~{}\text{is}~{}2\text{-linked},~{}N(B)=N(A)\}.

We now define an algorithm with inputs a graph $G$ on $n$ vertices and an accuracy parameter $\epsilon>0$ as follows. Let $L:=\lceil\frac{d}{2\log^{2}d}\log(2n/\epsilon)\rceil$ . If $d\leq\sqrt{n}$ , the algorithm is as follows:

(1)

List all vectors $(a_{1},\ldots,a_{\ell})$ of positive integers such that $\ell\leq n/d$ and $\sum a_{i}\leq n$ .
(2)

For each vector $(a_{1},\ldots,a_{\ell})$ from Step 1, list all sets $\{A_{1},\ldots,A_{\ell}\}$ such that the $N(A_{i})$ ’s are pairwise disjoint and $A_{i}\in\mathcal{G}^{\prime}(v_{i},a_{i})$ for some $v_{i}\in X$ for each $i$ .
(3)

For each set $\{A_{1},\ldots,A_{\ell}\}$ from Step 2, compute $\tilde{\mathcal{D}}(A_{i})$ , which is a $(\epsilon/2n)$ -relative approximation to $\mathcal{D}(A_{i})$ using Lemma 17, setting $\delta=(1/3)\cdot 2^{-n^{2}}$ for all $i$ .
(4)

For each set $A=\{A_{1},\ldots,A_{\ell}\}$ from Step 2, let $Y_{A}=Y\setminus N(\cup_{i=1}^{\ell}A_{i})$ , $X_{A}=X\setminus N^{2}(\cup_{i=1}^{\ell}A_{i})$ , $G_{A}=G[X_{A}\cup Y_{A}]$ and compute ${\Xi}^{X_{A}}_{G_{A}}(L)$ .
(5)

Output

$\sum_{\ell=0}^{n/d}\sum_{\begin{subarray}{c}A_{1},\ldots,A_{\ell}~{}\text{compatible}\\ \forall i,~{}A_{i}~{}\text{non-expanding}\\ 2\text{-linked, closed}\end{subarray}}\left(\prod_{i=1}^{\ell}\tilde{\mathcal{D}}(A_{i})\right)\cdot\left(2^{|Y|-\sum_{i=1}^{\ell}N(A_{i})}\cdot{\Xi}^{X_{A}}_{G_{A}}(L)\right)\,.$

If $d>\sqrt{n}$ , the algorithm is to run Steps 1-3 above and output

(25)

\displaystyle\sum_{\ell=0}^{n/d}\sum_{\begin{subarray}{c}A_{1},\ldots,A_{\ell}~{}\text{compatible}\\ \forall i,~{}A_{i}~{}\text{non-expanding}\\ 2\text{-linked, closed}\end{subarray}}\left(\prod_{i=1}^{\ell}\tilde{\mathcal{D}}(A_{i})\right)\cdot\left(2^{|Y|-\sum_{i=1}^{\ell}N(A_{i})}\right)\,.

Proof of Theorem 3

We first prove the correctness of the algorithm: for any $c>0$ and $\epsilon=n^{-c}$ , the output is an an $\epsilon$ -relative approximation to $i(G)$ . As before we let $X, Y$ denote the vertex classes of $G$ . Suppose first that $d\leq\sqrt{n}$ . We then have

	$\displaystyle i(G)$	$\displaystyle=2^{\|Y\|}\cdot\sum_{t=0}^{n/d}\sum_{\begin{subarray}{c}A_{1},\ldots,A_{t}~{}\text{compatible}\\ A_{i}~{}2\text{-linked}~{}\forall i\end{subarray}}\left(\prod_{i=1}^{t}2^{-N(A_{i})}\right)$
		$\displaystyle=2^{\|Y\|}\cdot\sum_{t=0}^{n/d}\sum_{\ell=0}^{t}\sum_{\begin{subarray}{c}A_{1},\ldots,A_{\ell}~{}\text{compatible}\\ \forall i,~{}A_{i}~{}\text{non-expanding},\\ 2\text{-linked}\end{subarray}}\left(\prod_{i=1}^{\ell}2^{-N(A_{i})}\cdot\sum_{\begin{subarray}{c}A_{\ell+1},\ldots,A_{t}\\ \subseteq X\setminus N^{2}(\cup_{j=1}^{\ell}A_{j})\\ \text{compatible}\\ \forall i~{}A_{i}~{}\text{expanding},\\ 2\text{-linked}\end{subarray}}\prod_{i=\ell+1}^{t}2^{-N(A_{i})}\right)$
		$\displaystyle=2^{\|Y\|}\cdot\sum_{t=0}^{n/d}\sum_{\ell=0}^{t}\sum_{\begin{subarray}{c}A_{1},\ldots,A_{\ell}~{}\text{compatible}\\ \forall i,~{}A_{i}~{}\text{non-expanding}\\ 2\text{-linked, closed}\end{subarray}}\sum_{\begin{subarray}{c}B_{1},\ldots,B_{\ell}\\ \forall i,~{}B_{i}\subseteq A_{i},\\ N(B_{i})=N(A_{i}),\\ B_{i}~{}2\text{-linked}\end{subarray}}\left(\prod_{i=1}^{\ell}2^{-N(A_{i})}\cdot\sum_{\begin{subarray}{c}A_{\ell+1},\ldots,A_{t}\\ \subseteq X\setminus N^{2}(\cup_{j=1}^{\ell}A_{j})\\ \text{compatible}\\ \forall i~{}A_{i}~{}\text{expanding},\\ 2\text{-linked}\end{subarray}}\prod_{i=\ell+1}^{t}2^{-N(A_{i})}\right)$
		$\displaystyle=\sum_{\ell=1}^{n/d}\sum_{\begin{subarray}{c}A_{1},\ldots,A_{\ell}~{}\text{compatible}\\ \forall i,~{}A_{i}~{}\text{non-expanding}\\ 2\text{-linked, closed}\end{subarray}}\sum_{\begin{subarray}{c}B_{1},\ldots,B_{\ell}\\ \forall i,~{}B_{i}\subseteq A_{i},\\ N(B_{i})=N(A_{i}),\\ B_{i}~{}2\text{-linked}\end{subarray}}\left(2^{\|Y\|-\sum_{i=1}^{\ell}N(A_{i})}\cdot\sum_{t=1}^{n/d-\ell}\sum_{\begin{subarray}{c}A_{1}^{\prime},\ldots,A_{t}^{\prime}\\ \subseteq X\setminus N^{2}(\cup_{j=1}^{\ell}A_{j})\\ \text{compatible}\\ \forall i~{}A_{i}^{\prime}~{}\text{expanding},\\ 2\text{-linked}\end{subarray}}\prod_{i=1}^{t}2^{-N(A_{i}^{\prime})}\right)$
		$\displaystyle=\sum_{\ell=1}^{n/d}\sum_{\begin{subarray}{c}A_{1},\ldots,A_{\ell}~{}\text{compatible}\\ \forall i,~{}A_{i}~{}\text{non-expanding}\\ 2\text{-linked, closed}\end{subarray}}\left(\prod_{i=1}^{\ell}\mathcal{D}(A_{i})\right)\cdot\left(2^{\|Y\|-\sum_{i=1}^{\ell}N(A_{i})}\cdot{\Xi}^{X_{A}}_{G_{A}}\right)$
		$\displaystyle=(1\pm\epsilon)\sum_{\ell=1}^{n/d}\sum_{\begin{subarray}{c}A_{1},\ldots,A_{\ell}~{}\text{compatible}\\ \forall i,~{}A_{i}~{}\text{non-expanding}\\ 2\text{-linked, closed}\end{subarray}}\left(\prod_{i=1}^{\ell}\tilde{\mathcal{D}}(A_{i})\right)\cdot\left(2^{\|Y\|-\sum_{i=1}^{\ell}N(A_{i})}\cdot{\Xi}^{X_{A}}_{G_{A}}(L)\right).$

For the last equality we used that by Remark 1, we may apply Lemma 13 to $G_{A}$ , and so ${\Xi}^{X_{A}}_{G_{A}}(L)$ is an $\epsilon$ -relative approximation to ${\Xi}^{X_{A}}_{G_{A}}$ . Observe that there are at most $2^{n^{2}}$ many choices of nonexpanding $A_{1},\ldots,A_{\ell}$ . So by union bounding over all these choices, the last summation is exactly what the algorithm outputs with probability at least $1-\delta\cdot 2^{n^{2}}=2/3$ , we have the required approximation guarantee. If $d\geq\sqrt{n}$ , then we note that since the polymers of $G_{A}$ are a subset of the the polymers of $G$ , we have by (17) that $\ln\Xi_{G_{A}}^{X_{A}}\leq\ln\Xi_{G}^{X}\leq 2^{-\Omega(\log^{2}n)}$ . It follows that $1$ is trivially an $\epsilon$ -relative approximation to $\Xi^{X_{A}}_{G_{A}}$ (recall that $\epsilon=n^{-c}$ ) and so (25) is an $\epsilon$ -relative approximation to $i(G)$ .

We now show that if $\epsilon=n^{-c}$ , with $c>0$ fixed, the above algorithm runs in time $2^{O\left(\frac{\log^{3}d}{d}n\right)}$ . We consider the algorithm step by step.

Step 1. For $\ell\leq n/d$ , and $k\leq n$ the number of vectors $(a_{1},\ldots,a_{\ell})$ of positive integers such that $\sum a_{i}=k$ (i.e. the number of ordered partitions of $k$ with $\ell$ parts) is

\binom{k-1}{\ell-1}\leq\binom{n}{n/d}=2^{O\left(\frac{\log d}{d}n\right)}\,.

Moreover, it is clear that the set of all such partitions can be listed in time $2^{O\left(\frac{\log d}{d}n\right)}$ and so Step 1 takes time $2^{O\left(\frac{\log d}{d}n\right)}$ .

Step 2. Let $(a_{1},\ldots,a_{\ell})$ be a vector of positive integers such that $\ell\leq n/d$ and $\sum a_{i}\leq n$ . We first list all tuples vertices $\{v_{1},\ldots,v_{\ell}\}\subset X$ which takes time $\binom{n}{\ell}=2^{O\left(\frac{\log d}{d}n\right)}$ . For each $\{v_{1},\ldots,v_{\ell}\}$ , we then appeal to Lemma 5 to output the tuple $(\mathcal{G}^{\prime}(v_{1},a_{1}),\ldots,\mathcal{G}^{\prime}(v_{\ell},a_{\ell}))$ in time $2^{O\left(\frac{\log^{3}d}{d}n\right)}$ . We note that by Lemma 5

|\mathcal{G}^{\prime}(v_{1},a_{1})\times\ldots\times\mathcal{G}^{\prime}(v_{\ell},a_{\ell})|\leq\prod_{i=1}^{\ell}2^{O\left(\frac{\log^{3}d}{d}a_{i}\right)}=2^{O\left(\frac{\log^{3}d}{d}n\right)}\,.

We may therefore check each element of $\mathcal{G}^{\prime}(v_{1},a_{1})\times\ldots\times\mathcal{G}^{\prime}(v_{\ell},a_{\ell})$ to see if it satisfies the required conditions and output the desired list in time $2^{O\left(\frac{\log^{3}d}{d}n\right)}$ .

Step 3. Given a set $\{A_{1},\ldots,A_{\ell}\}$ from Step 2, we use Lemma 17 to compute an $\epsilon/n$ -relative approximation $\tilde{\mathcal{D}}(A_{i})$ to $\mathcal{D}(A_{i})$ for all $i$ . This takes time

(\epsilon/n)^{-2}\ln(1/\delta)2^{O\left(\frac{\log^{2}d}{d}n\right)}=2^{O\left(\frac{\log^{2}d}{d}n\right)}

where we recall that $\epsilon=n^{-C}$ and $\delta=(1/3)\cdot 2^{-n^{2}}$ .

Step 4. If $d<\sqrt{n}$ , then given any set $A=\{A_{1},\ldots,A_{\ell}\}$ from Step 2, we may compute ${\Xi}^{X_{A}}_{G_{A}}(L)$ in time $(n/\epsilon)^{O(d)}=2^{O\left(\frac{\log^{2}d}{d}n\right)}$ by the algorithm in Section 4 restricted to polymers of $G_{A}$ . If $d\geq\sqrt{n}$ , we skip Step 4.

We conclude that the algorithm takes time $2^{O\left(\frac{\log^{3}d}{d}n\right)}$ in total.

7. Weighted independent sets: proof of Theorem 2

The proof of Theorem 2 will follow the same lines as that of Theorem 1, with the main difference being that polymer weights will now be $w_{\gamma}=\frac{\lambda^{|\gamma|}}{(1+\lambda)^{|N(\gamma)|}}$ , generalizing the $\lambda=1$ case of Theorem 1.

We assume throughout this section that $G$ is a $d$ -regular, bipartite $\alpha$ -expander with bipartition $X, Y$ of size $n$ each.

Define a polymer model with polymers consisting of the small $2$ -linked, subsets of $X$ (resp. $Y$ ) with two polymers compatible if their union is not $2$ -linked (recall that $\gamma\subset X$ is small if $|[\gamma]|\leq n/2$ ). The weight of a polymer $\gamma$ is $w_{\gamma}=\frac{\lambda^{|\gamma|}}{(1+\lambda)^{|N(\gamma)|}}$ . Let $\Xi_{G}^{X}(\lambda)$ be the polymer model partition function and $\nu_{G,\lambda}^{X}$ be the corresponding Gibbs measure on collections of compatible polymers.

Theorem 2 follows from the following two lemmas. Lemma 18 below is the analogue of Lemma 15 and Lemma 19 is the analogue of Lemma 16.

Lemma 18.

For every $\alpha>0$ , there exists $C_{2}>0$ so that if $\lambda\geq\frac{C_{2}\log d}{d^{1/4}}$ then there is an FPTAS to compute $\Xi_{G}^{X}(\lambda)$ and $\Xi_{G}^{Y}(\lambda)$ and a polynomial-time sampling scheme for $\nu_{G,\lambda}^{X}$ and $\nu_{G,\lambda}^{Y}$ .

As in Section 5, we define a probability measure on independent sets as a mixture of measures derived from the two polymer models. Define the distribution $\hat{\nu}^{X}_{G,\lambda}$ on $\mathcal{I}(G)$ as follows.

(1)

Sample a collection of compatible polymers $\Lambda$ from the measure $\nu^{X}_{G,\lambda}$ .
(2)

Set $I=J\cup\bigcup_{\gamma\in\Lambda}\gamma$ where $J$ is a random subset of $Y\backslash\bigcup_{\gamma\in\Lambda}N(\gamma)$ formed by including each vertex independently with probability $\lambda/(1+\lambda)$ .

Define $\hat{\nu}^{Y}_{G,\lambda}$ analogously and define the mixture

\hat{\mu}_{G,\lambda}=\frac{\Xi_{G}^{X}(\lambda)}{\Xi_{G}^{X}(\lambda)+\Xi_{G}^{Y}(\lambda)}\hat{\nu}^{X}_{G,\lambda}+\frac{\Xi_{G}^{Y}(\lambda)}{\Xi_{G}^{X}(\lambda)+\Xi_{G}^{Y}(\lambda)}\hat{\nu}^{Y}_{G,\lambda}\,.

Lemma 19.

For every $\alpha>0$ , there exists $C_{2}>0$ so that if $\lambda\geq\frac{C_{2}\log d}{d^{1/4}}$ then for $n$ sufficiently large,

(1+\lambda)^{n}\left(\Xi_{G}^{X}(\lambda)+\Xi_{G}^{Y}(\lambda)\right)

is an $\epsilon$ -relative approximation to $Z_{G}(\lambda)$ and

\|\mu_{G,\lambda}-\hat{\mu}_{G,\lambda}\|_{TV}<\epsilon

where $\epsilon=\exp(-\Omega(n))$ , with the implicit constant a function of $d$ .

To prove Lemmas 18 and 19 we extend the estimates of Sections 3 and 4 to the more general case. The main graph container lemma comes from the paper of Galvin and Tetali [20]. Let us define

\mathcal{W}_{\lambda}(v,a,w):=\sum_{\begin{subarray}{c}A\subset X\\ |[A]|\leq n/2,\\ A~{}\text{is $2$-linked}\\ |N(A)|=w\end{subarray}}\lambda^{|A|}(1+\lambda)^{-w},

and

\beta(\lambda):=\frac{\log^{2}(1+\lambda)}{\log(1+\lambda)+\log(2d^{5}/\alpha)}.

We use the following lemma from [20].

Lemma 20.

There are constants $c_{4}$ and $c_{5}$ such that the following holds: Let $G$ be a bipartite $\alpha$ -expander and suppose $\lambda>0$ . If $\beta(\lambda)$ satisfies

\beta(\lambda)\geq c_{4}\max\left\{\frac{\log(d^{5}/\alpha)}{\sqrt{d}},\frac{2\log^{2}d}{\alpha d}\right\}.

Then

\mathcal{W}_{\lambda}(v,a,w)\leq 2^{-c_{5}(w-a)\beta(\lambda)}.

The hypothesis of the above lemma says that $\alpha\beta(\lambda)\geq 2c_{4}\cdot\frac{\log^{2}d}{d}$ . However, in our application, we will also assume that that $\beta(\lambda)$ is also large enough to ensure

(26)

\alpha\beta(\lambda)\geq\frac{4000}{c_{5}}\cdot\frac{\log^{2}d}{d}\,.

This may be done by assuming that $\lambda\geq\frac{C\log d}{d^{1/4}}$ for a large enough constant $C_{2}$ .

We now prove Lemma 18.

Proof of Lemma 18.

As in the proof of Lemma 15, the FPTAS and polynomial-time sampling scheme will follow from verifying the Koteckỳ-Preiss condition for the polymer model.

The polymer model satisfies (11) with $f(\gamma)=c_{5}\alpha\ln 2\frac{\beta(\lambda)|\gamma|}{8}$ and $g(\gamma)=c_{5}\alpha\ln 2\frac{\beta(\lambda)|N(\gamma)|}{8}$ as shown by the following computation.

	$\displaystyle\sum_{\gamma^{\prime}\not\sim\gamma}w_{\gamma^{\prime}}e^{f(\gamma^{\prime})+g(\gamma^{\prime})}$	$\displaystyle\leq\sum_{v\in\gamma}\sum_{\gamma^{\prime}\not\sim v}w_{\gamma^{\prime}}e^{f(\gamma^{\prime})+g(\gamma^{\prime})}$
		$\displaystyle\leq\|\gamma\|\cdot\max_{v\in\gamma}\sum_{\gamma^{\prime}\not\sim v}\lambda^{\|\gamma\|}(1+\lambda)^{-\|N(\gamma^{\prime})\|}e^{f(\gamma^{\prime})+g(\gamma^{\prime})}$
		$\displaystyle\leq\|\gamma\|\cdot\max_{v\in\gamma}\sum_{w\geq d}\left(\sum_{t\geq(\alpha/2)w}\mathcal{W}_{\lambda}(v,w-t,w)\cdot e^{c_{5}\alpha\ln 2\frac{\beta(\lambda)w}{4}}\right)$
		$\displaystyle\leq\|\gamma\|\cdot\max_{v\in\gamma}\sum_{w\geq d}e^{c_{5}\alpha\ln 2\frac{\beta(\lambda)w}{4}}\left(\sum_{t\geq(\alpha/2)w}\mathcal{W}_{\lambda}(v,w-t,w)\right)$
		$\displaystyle\leq\|\gamma\|\sum_{w\geq d}2^{c_{5}\alpha\frac{\beta(\lambda)w}{4}}\left(\sum_{t\geq(\alpha/2)w}2^{-c_{5}\beta(\lambda)t}\right)$
		$\displaystyle\leq 2d\|\gamma\|\sum_{w\geq d}2^{c_{5}\alpha\frac{\beta(\lambda)w}{4}}2^{-c_{5}\alpha\frac{\beta(\lambda)w}{2}}$
		$\displaystyle\leq 4d^{2}\|\gamma\|2^{-c_{5}\alpha\frac{\beta(\lambda)d}{4}}$
		$\displaystyle\leq 4d^{2}\|\gamma\|2^{-500\log^{2}d}\cdot 2^{-c_{5}\alpha\frac{\beta(\lambda)d}{8}}$
		$\displaystyle\leq\|\gamma\|\cdot c_{5}\alpha\frac{\beta(\lambda)}{16}.$

The last inequality follows because for $d\geq 3$ , we have $2^{500\log^{2}d}<4d^{2}$ and for any $x,y>0$ , we have $2^{-x}\leq\frac{x}{2y}$ if $x\geq 500\log^{2}y$ .

Define ${\Xi}_{G}^{X}(\ell,\lambda)$ to be the exponential of the truncated cluster expansion as in (15). By the calculation of Lemma 13, we have for every $\ell\geq 1$ ,

(27)

\left|\ln\Xi_{G}^{X}(\lambda)-\ln{\Xi}_{G}^{X}(\ell,\lambda)\right|\leq n\cdot 2^{-500\frac{\log^{2}d\cdot\ell}{d}}\,.

In particular, if $\ell\geq\frac{d}{1000\log^{2}d}\log(n/\epsilon)$ , then

\left|\ln\Xi_{G}^{X}-\ln{\Xi}_{G}^{X}(\ell,\lambda)\right|\leq\epsilon\,.\qed

We now prove Lemma 19.

Proof of Lemma 19.

Consider a modified polymer model with weights $\tilde{w}_{\gamma}(\lambda)=\lambda^{|\gamma|}(1+\lambda)^{-|N(\gamma)|}2^{c_{5}\alpha\frac{\beta(\lambda)}{16}}$ . The calculation in the proof of Lemma 18 shows that the modified polymer model with weights $\tilde{w}_{\gamma}(\lambda)$ satisfies (11) with $f(\gamma)=c_{5}\alpha\ln 2\frac{\beta(\lambda)|\gamma|}{16}$ and $g(\gamma)=c_{5}\alpha\ln 2\frac{\beta(\lambda)|N(\gamma)|}{8}$ .

Let $\tilde{\Xi}_{G}^{X}(\lambda)$ denote the modified polymer model partition function as in (13). We then have

\ln\tilde{\Xi}_{G}^{X}(\lambda)-\ln\Xi_{G}^{X}(\lambda)=\ln\mathbf{E}e^{\zeta\cdot\|\mathbf{\Lambda}\|}

for $\zeta=c_{5}\alpha\ln 2\frac{\beta(\lambda)}{16}$ . Summing (14) over all $v$ , and using the fact that $|N(\gamma)|\geq d$ for every $\gamma$ , we get (as in (17))

(28)

\ln\tilde{\Xi}_{G}^{X}\leq\sum_{v}\sum_{\begin{subarray}{c}\Gamma\in\mathcal{C}(G)\\ \Gamma\ni v\end{subarray}}\left|\phi(\Gamma)\prod_{\gamma\in\Gamma}\tilde{w}_{\gamma}\right|\leq n\cdot 2^{-c_{5}\alpha\beta(\lambda)\frac{d}{8}}.

Therefore, we have

\displaystyle\ln\mathbf{E}e^{\zeta\cdot\|\mathbf{\Lambda}\|}\leq\ln\tilde{\Xi}_{G}^{X}\leq n\cdot 2^{-c_{5}\alpha\beta(\lambda)\frac{d}{8}}.

Fix any $\delta\geq\frac{10}{\sqrt{d}}$ . By Markov’s inequality, we have

	$\displaystyle\mathbb{P}(\\|\mathbf{\Lambda}\\|\geq\delta n)$	$\displaystyle\leq\exp\left(-\zeta\delta n+n\cdot 2^{-c_{5}\alpha\beta(\lambda)\frac{d}{8}}\right)$
		$\displaystyle\leq 2^{-c_{5}\alpha\frac{\beta(\lambda)}{32}\cdot\delta n}$
(29)			$\displaystyle\leq 2^{-\frac{100\log^{2}d}{d}\cdot\delta n},$

where the penultimate inequality follows because for any $x,y>0$ , we have $2^{-x\cdot y}\leq\frac{x}{2\sqrt{y}}$ if $x\geq 500\log^{2}y$ , and therefore

2^{-c_{5}\alpha\beta(\lambda)\frac{d}{8}}\leq\frac{c_{5}\alpha\beta(\lambda)}{16\sqrt{d}}\leq\frac{\zeta\delta}{2}.

The final inequality (7) follows from (26).

As in the proof of Lemma 16, let $\mathcal{I}_{X}=\{I\in\mathcal{I}:\nu_{G,\lambda}^{X}(I)>0\}$ i.e. the set of all $I$ such that each $2$ -linked component of $I\cap X$ is small. For $\delta>0$ , let $\mathcal{I}^{\delta}_{X}=\{I\in\mathcal{I}_{X}:|I\cap X|\leq\delta n\}$ and define $\mathcal{I}_{Y}$ , $\mathcal{I}^{\delta}_{Y}$ similarly. As in Lemma 16, $\mathcal{I}=\mathcal{I}_{X}\cup\mathcal{I}_{Y}$ and so

(30)

\displaystyle Z_{G}(\lambda)=\sum_{I\in\mathcal{I}_{X}}\lambda^{|I|}+\sum_{I\in\mathcal{I}_{Y}}\lambda^{|I|}-\sum_{I\in\mathcal{I}_{X}\cap\mathcal{I}_{Y}}\lambda^{|I|}=(1+\lambda)^{n}(\Xi_{G}^{X}(\lambda)+\Xi_{G}^{Y}(\lambda))-\sum_{I\in\mathcal{I}_{X}\cap\mathcal{I}_{Y}}\lambda^{|I|}\,.

Now, let $I$ be a random independent set chosen from the distribution $\nu_{G,\lambda}^{X}$ . It follows from (7) that for $\delta\geq\frac{10}{\sqrt{d}}$ ,

(31)

\mathbb{P}(|I\cap X|>\delta n)=\sum_{I\in\mathcal{I}_{X}\backslash\mathcal{I}_{X}^{\delta}}\frac{\lambda^{|I|}}{(1+\lambda)^{n}\Xi_{G}^{X}(\lambda)}\leq 2^{-\frac{100\log^{2}d}{d}\cdot\delta n}\,.

Furthermore, we have

	$\displaystyle\sum_{I\in\mathcal{I}^{\delta}_{X}\cap\mathcal{I}^{\delta}_{Y}}\frac{\lambda^{\|I\|}}{(1+\lambda)^{n}\Xi_{G}^{X}(\lambda)}$	$\displaystyle\leq\frac{\sum_{i,j\leq\delta n}\binom{n}{i}\binom{n}{j}\lambda^{i+j}}{(1+\lambda)^{n}}$
		$\displaystyle=\frac{\left(\sum_{i\leq\delta n}\binom{n}{i}\lambda^{i}\right)^{2}}{(1+\lambda)^{n}}$
		$\displaystyle=(1+\lambda)^{n}\mathbb{P}\left(\operatorname{Bin}\left(n,\frac{\lambda}{1+\lambda}\right)\leq\delta n\right)^{2}$

This quantity can be made to be at most $e^{-\frac{\delta n}{16}}$ for $\delta=\frac{\lambda}{100(1+\lambda)}$ . We note that $\delta\geq\frac{C\log^{2}d}{d^{1/4}}\geq\frac{10}{\sqrt{d}}$ for a large enough $C$ , and so

	$\displaystyle\sum_{I\in\mathcal{I}_{X}\cap\mathcal{I}_{Y}}\lambda^{\|I\|}$	$\displaystyle\leq\sum_{I\in\mathcal{I}_{X}\backslash\mathcal{I}_{X}^{\delta}}\lambda^{\|I\|}+\sum_{I\in\mathcal{I}_{Y}\backslash\mathcal{I}_{Y}^{\delta}}\lambda^{\|I\|}+\sum_{I\in\mathcal{I}^{\delta}_{X}\cap\mathcal{I}^{\delta}_{Y}}\lambda^{\|I\|}$
		$\displaystyle\leq(1+\lambda)^{n}(\Xi_{G}^{X}(\lambda)+\Xi_{G}^{Y}(\lambda))e^{-\Omega(n)}\,.$

By (30), we conclude that

\displaystyle\left|\frac{Z_{G}(\lambda)}{(1+\lambda)^{n}(\Xi_{G}^{X}(\lambda)+\Xi_{G}^{Y}(\lambda))}-1\right|=e^{-\Omega(n)}\,.

The bound on total variation distance follows in the same manner as in the proof of Lemma 16. ∎

Acknowledgements

WP is supported in part by NSF grant DMS-1847451. AP is supported in part by NSF grant CCF-1934915.

References

[1] Sanjeev Arora, Boaz Barak, and David Steurer, Subexponential algorithms for unique games and related problems, Journal of the ACM (JACM) 62 (2015), 1–25.
[2] Sanjeev Arora, Subhash A Khot, Alexandra Kolla, David Steurer, Madhur Tulsiani, and Nisheeth K Vishnoi, Unique games on expanding constraint graphs are easy, Proceedings of the fortieth annual ACM Symposium on Theory of Computing (STOC), 2008, pp. 21–28.
[3] József Balogh, Ramon I Garcia, and Lina Li, Independent sets in the middle two layers of Boolean lattice, Journal of Combinatorial Theory, Series A 178 (2021), 105341.
[4] József Balogh, Robert Morris, and Wojciech Samotij, Independent sets in hypergraphs, Journal of the American Mathematical Society 28 (2015), 669–709.
[5] Sarah Cannon and Will Perkins, Counting independent sets in unbalanced bipartite graphs, Proceedings of the Fourteenth Annual ACM-SIAM Symposium on Discrete Algorithms, SIAM, 2020, pp. 1456–1466.
[6] Charles Carlson, Ewan Davies, and Alexandra Kolla, Efficient algorithms for the Potts model on small-set expanders, arXiv preprint arXiv:2003.01154 (2020).
[7] Zongchen Chen, Andreas Galanis, Leslie A Goldberg, Will Perkins, James Stewart, and Eric Vigoda, Fast algorithms at low temperatures via Markov chains, Random Structures & Algorithms 58 (2021), 294–321.
[8] Zongchen Chen, Andreas Galanis, Daniel Štefankovič, and Eric Vigoda, Sampling colorings and independent sets of random regular bipartite graphs in the non-uniqueness region, arXiv preprint arXiv:2105.01784 (2021).
[9] Ewan Davies, Matthew Jenssen, and Will Perkins, A proof of the Upper Matching Conjecture for large graphs, Journal of Combinatorial Theory, Series B 151 (2021), 393–416.
[10] Philippe Delsarte, An algebraic approach to the association schemes of coding theory, Philips Res. Rep. Suppl. 10 (1973).
[11] Martin Dyer, Leslie Ann Goldberg, Catherine Greenhill, and Mark Jerrum, The relative complexity of approximate counting problems, Algorithmica 38 (2004), 471–500.
[12] Tobias Friedrich, Andreas Göbel, Martin S Krejca, and Marcus Pappik, Polymer dynamics via cliques: New conditions for approximations, arXiv preprint arXiv:2007.08293 (2020).
[13] Andreas Galanis, Leslie Ann Goldberg, and James Stewart, Fast algorithms for general spin systems on bipartite expanders, 45th International Symposium on Mathematical Foundations of Computer Science (MFCS 2020), Schloss Dagstuhl-Leibniz-Zentrum für Informatik, 2020.
[14] Andreas Galanis, Leslie Ann Goldberg, and James Stewart, Fast mixing via polymers for random graphs with unbounded degree, arXiv preprint arXiv:2105.00524 (2021).
[15] Andreas Galanis, Daniel Stefankovic, Eric Vigoda, and Linji Yang, Ferromagnetic Potts model: Refined #BIS-hardness and related results, SIAM Journal on Computing 45 (2016), 2004–2065.
[16] David Galvin, Sampling 3-colourings of regular bipartite graphs, Electronic Journal of Probability 12 (2007), 481–497.
[17] David Galvin, A threshold phenomenon for random independent sets in the discrete hypercube, Combinatorics, Probability and Computing 20 (2011), 27–51.
[18] David Galvin, Independent sets in the discrete hypercube, arXiv preprint arXiv:1901.01991 (2019).
[19] David Galvin and Dana Randall, Torpid mixing of local Markov chains on 3-colorings of the discrete torus, Proceedings of the eighteenth annual ACM-SIAM symposium on Discrete algorithms, 2007, pp. 376–384.
[20] David J. Galvin and Prasad Tetali, Slow mixing of Glauber dynamics for the hard-core model on regular bipartite graphs, Random Struct. Algorithms 28 (2006), 427–443.
[21] Serge Gaspers and Edward J Lee, Faster graph coloring in polynomial space, International Computing and Combinatorics Conference, Springer, 2017, pp. 371–383.
[22] Chris D Godsil and Mike W Newman, Eigenvalue bounds for independent sets, Journal of Combinatorial Theory, Series B 98 (2008), 721–734.
[23] Leslie Ann Goldberg and Mark Jerrum, The complexity of ferromagnetic Ising with local fields, Combinatorics, Probability and Computing 16 (2007), 43–61.
[24] Leslie Ann Goldberg and Mark Jerrum, Approximating the partition function of the ferromagnetic Potts model, Journal of the ACM (JACM) 59 (2012), 1–31.
[25] Leslie Ann Goldberg, John Lapinskas, and David Richerby, Faster exponential-time algorithms for approximately counting independent sets, arXiv preprint arXiv:2005.05070 (2020).
[26] Christian Gruber and Hervé Kunz, General properties of polymer systems, Communications in Mathematical Physics 22 (1971), 133–161.
[27] Willem H Haemers, Hoffman’s ratio bound, Linear Algebra and its Applications 617 (2021), 215–219.
[28] Tyler Helmuth, Will Perkins, and Guus Regts, Algorithmic Pirogov–Sinai theory, Probability Theory and Related Fields 176 (2020), 851–895.
[29] L. Ilinca and Jeff Kahn, Counting maximal antichains and independent sets, Order 30 (2013), 427–435.
[30] Matthew Jenssen and Peter Keevash, Homomorphisms from the torus, arXiv preprint arXiv:2009.08315 (2020).
[31] Matthew Jenssen, Peter Keevash, and Will Perkins, Algorithms for #BIS-hard problems on expander graphs, SIAM Journal on Computing 49 (2020), 681–710.
[32] Matthew Jenssen and Will Perkins, Independent sets in the hypercube revisited, Journal of the London Mathematical Society 102 (2020), 645–669.
[33] Matthew Jenssen, Will Perkins, and Aditya Potukuchi, Independent sets of a given size and structure in the hypercube, arXiv preprint arXiv:2106.09709 (2021).
[34] Jeff Kahn and Jinyoung Park, The number of maximal independent sets in the Hamming cube, arXiv preprint arXiv:1909.04283 (2019).
[35] Daniel J Kleitman and Kenneth J Winston, On the number of graphs without 4-cycles, Discrete Mathematics 41 (1982), 167–172.
[36] DJ Kleitman and KJ Winston, The asymptotic number of lattices, Combinatorical Mathematics, Optimal Designs and their Applications (J. Srivastava, ed.), Ann. Discrete Math 6 (1980), 243–249.
[37] Alexandra Kolla, Spectral algorithms for unique games, Computational Complexity 20 (2011), 177–206.
[38] AD Korshunov and AA Sapozhenko, The number of binary codes with distance 2, Problemy Kibernet 40 (1983), 111–130.
[39] Roman Koteckỳ and David Preiss, Cluster expansion for abstract polymer models, Communications in Mathematical Physics 103 (1986), 491–498.
[40] Chao Liao, Jiabao Lin, Pinyan Lu, and Zhenyu Mao, Counting independent sets and colorings on random regular bipartite graphs, Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques (APPROX/RANDOM 2019), Schloss Dagstuhl-Leibniz-Zentrum fuer Informatik, 2019.
[41] Jingcheng Liu and Pinyan Lu, FPTAS for #BIS with degree bounds on one side, Proceedings of the forty-seventh annual ACM symposium on Theory of Computing, 2015, pp. 549–556.
[42] L. Lovász, On the ratio of optimal integral and fractional covers, Discrete Mathematics 13 (1975), 383 – 390.
[43] László Lovász, On the Shannon capacity of a graph, IEEE Transactions on Information theory 25 (1979), 1–7.
[44] Konstantin Makarychev and Yury Makarychev, How to play unique games on expanders, International Workshop on Approximation and Online Algorithms, Springer, 2010, pp. 190–200.
[45] Shayan Oveis Gharan and Luca Trevisan, Partitioning into expanders, Proceedings of the twenty-fifth annual ACM-SIAM Symposium on Discrete Algorithms, SIAM, 2014, pp. 1256–1266.
[46] Jinyoung Park, Note on the number of balanced independent sets in the Hamming cube, arXiv preprint arXiv:2103.11198 (2021).
[47] J Scott Provan and Michael O Ball, The complexity of counting cuts and of computing the probability that a graph is connected, SIAM Journal on Computing 12 (1983), 777–788.
[48] Wojciech Samotij, Counting independent sets in graphs, European Journal of Combinatorics 48 (2015), 5–18.
[49] AA Sapozhenko, On the number of connected subsets with given cardinality of the boundary in bipartite graphs, Metody Diskret Analiz 45 (1987), 42–70.
[50] Aleksandr Antonovich Sapozhenko, On the number of independent sets in extenders, Diskretnaya Matematika 13 (2001), 56–62.
[51] Aleksandr Antonovich Sapozhenko, The number of independent sets in graphs, Moscow University Mathematics Bulletin 62 (2007), 116–118.
[52] David Saxton and Andrew Thomason, Hypergraph containers, Inventiones Mathematicae 201 (2015), 925–992.
[53] Allan Sly, Computational transition at the uniqueness threshold, 2010 IEEE 51st Annual Symposium on Foundations of Computer Science, IEEE, 2010, pp. 287–296.
[54] S.K Stein, Two combinatorial covering theorems, Journal of Combinatorial Theory, Series A 16 (1974), 391 – 397.
[55] Dror Weitz, Counting independent sets up to the tree threshold, Proceedings of the thirty-eighth annual ACM Symposium on Theory of Computing, 2006, pp. 140–149.

	$\displaystyle\|\mathcal{G}(v,a,w)\|$	$\displaystyle\leq\sum_{F\in\mathcal{C}(v,a,w)}\|\mathcal{G}(F,a,w)\|$
		$\displaystyle\leq\|\mathcal{C}(v,a,w)\|\cdot\max_{F\in\mathcal{C}(v,a,w)}\|\mathcal{G}(F,a,w)\|$
		$\displaystyle\leq 2^{\frac{16w\log^{2}d}{d}}\cdot 2^{w-c_{3}\left(w-a\right)}$
		$\displaystyle\leq 2^{w-\left(\frac{c_{3}}{2}\right)\left(w-a\right)}$

	$\displaystyle\sum_{\gamma^{\prime}\not\sim\gamma}\tilde{w}_{\gamma^{\prime}}e^{f(\gamma^{\prime})+g(\gamma^{\prime})}$	$\displaystyle\leq\sum_{v\in\gamma}\sum_{\gamma^{\prime}\not\sim v}\tilde{w}_{\gamma^{\prime}}e^{f(\gamma^{\prime})+g(\gamma^{\prime})}$
		$\displaystyle\leq\|\gamma\|\cdot\max_{v\in\gamma}\sum_{\gamma^{\prime}\not\sim v}2^{-\|N(\gamma^{\prime})\|}e^{2f(\gamma^{\prime})+g(\gamma^{\prime})}$
		$\displaystyle\leq\|\gamma\|\cdot\max_{v\in\gamma}\sum_{w\geq d}\left(\sum_{t\geq(C_{1}/2)\frac{w\log^{2}d}{d}}\|\mathcal{G}(v,w-t,w)\|2^{-w}\cdot e^{4\ln 2\cdot\frac{w\log^{2}d}{d}}\right)$
		$\displaystyle\leq\|\gamma\|\cdot\max_{v\in\gamma}\sum_{w\geq d}e^{4\ln 2\cdot\frac{w\log^{2}d}{d}}\left(\sum_{t\geq(C_{1}/2)\frac{w\log^{2}d}{d}}\|\mathcal{G}(v,w-t,w)\|2^{-w}\right)$
		$\displaystyle\leq\|\gamma\|\sum_{w\geq d}e^{4\ln 2\cdot\frac{w\log^{2}d}{d}}\left(\sum_{t\geq(C_{1}/2)\frac{w\log^{2}d}{d}}2^{-c_{1}\cdot t}\right)$
		$\displaystyle\leq d\|\gamma\|\sum_{w\geq d}e^{4\ln 2\cdot\frac{w\log^{2}d}{d}}2^{-8\frac{w\log^{2}d}{d}}$
		$\displaystyle\leq d^{2}\|\gamma\|2^{-4\log^{2}d}$
		$\displaystyle\ll\ln 2\cdot\frac{\|\gamma\|\log^{2}d}{d}=f(\gamma)\,.\qed$

	$\displaystyle\left\|\ln\Xi_{G}^{X}-\ln{\Xi}_{G}^{X}(\ell)\right\|$	$\displaystyle=\left\|\sum_{\begin{subarray}{c}\Gamma\in\mathcal{C}(G)\\ \\|\Gamma\\|>\ell\end{subarray}}\phi(\Gamma)\prod_{\gamma\in\Gamma}w_{\gamma}\right\|$
		$\displaystyle\leq\sum_{v}\sum_{\begin{subarray}{c}\Gamma\in\mathcal{C}(G)\\ \Gamma\ni v\\ \\|\Gamma\\|>\ell\end{subarray}}\left\|\phi(\Gamma)\prod_{\gamma\in\Gamma}w_{\gamma}\right\|$
		$\displaystyle\leq ne^{-2\ln 2\frac{\ell\log^{2}d}{d}}$
		$\displaystyle=n\cdot 2^{-2\frac{\ell\log^{2}d}{d}}\,.\qed$

	$\displaystyle i(G)$	$\displaystyle=2^{\|Y\|}\cdot\sum_{t=0}^{n/d}\sum_{\begin{subarray}{c}A_{1},\ldots,A_{t}~{}\text{compatible}\\ A_{i}~{}2\text{-linked}~{}\forall i\end{subarray}}\left(\prod_{i=1}^{t}2^{-N(A_{i})}\right)$
		$\displaystyle=2^{\|Y\|}\cdot\sum_{t=0}^{n/d}\sum_{\ell=0}^{t}\sum_{\begin{subarray}{c}A_{1},\ldots,A_{\ell}~{}\text{compatible}\\ \forall i,~{}A_{i}~{}\text{non-expanding},\\ 2\text{-linked}\end{subarray}}\left(\prod_{i=1}^{\ell}2^{-N(A_{i})}\cdot\sum_{\begin{subarray}{c}A_{\ell+1},\ldots,A_{t}\\ \subseteq X\setminus N^{2}(\cup_{j=1}^{\ell}A_{j})\\ \text{compatible}\\ \forall i~{}A_{i}~{}\text{expanding},\\ 2\text{-linked}\end{subarray}}\prod_{i=\ell+1}^{t}2^{-N(A_{i})}\right)$
		$\displaystyle=2^{\|Y\|}\cdot\sum_{t=0}^{n/d}\sum_{\ell=0}^{t}\sum_{\begin{subarray}{c}A_{1},\ldots,A_{\ell}~{}\text{compatible}\\ \forall i,~{}A_{i}~{}\text{non-expanding}\\ 2\text{-linked, closed}\end{subarray}}\sum_{\begin{subarray}{c}B_{1},\ldots,B_{\ell}\\ \forall i,~{}B_{i}\subseteq A_{i},\\ N(B_{i})=N(A_{i}),\\ B_{i}~{}2\text{-linked}\end{subarray}}\left(\prod_{i=1}^{\ell}2^{-N(A_{i})}\cdot\sum_{\begin{subarray}{c}A_{\ell+1},\ldots,A_{t}\\ \subseteq X\setminus N^{2}(\cup_{j=1}^{\ell}A_{j})\\ \text{compatible}\\ \forall i~{}A_{i}~{}\text{expanding},\\ 2\text{-linked}\end{subarray}}\prod_{i=\ell+1}^{t}2^{-N(A_{i})}\right)$
		$\displaystyle=\sum_{\ell=1}^{n/d}\sum_{\begin{subarray}{c}A_{1},\ldots,A_{\ell}~{}\text{compatible}\\ \forall i,~{}A_{i}~{}\text{non-expanding}\\ 2\text{-linked, closed}\end{subarray}}\sum_{\begin{subarray}{c}B_{1},\ldots,B_{\ell}\\ \forall i,~{}B_{i}\subseteq A_{i},\\ N(B_{i})=N(A_{i}),\\ B_{i}~{}2\text{-linked}\end{subarray}}\left(2^{\|Y\|-\sum_{i=1}^{\ell}N(A_{i})}\cdot\sum_{t=1}^{n/d-\ell}\sum_{\begin{subarray}{c}A_{1}^{\prime},\ldots,A_{t}^{\prime}\\ \subseteq X\setminus N^{2}(\cup_{j=1}^{\ell}A_{j})\\ \text{compatible}\\ \forall i~{}A_{i}^{\prime}~{}\text{expanding},\\ 2\text{-linked}\end{subarray}}\prod_{i=1}^{t}2^{-N(A_{i}^{\prime})}\right)$
		$\displaystyle=\sum_{\ell=1}^{n/d}\sum_{\begin{subarray}{c}A_{1},\ldots,A_{\ell}~{}\text{compatible}\\ \forall i,~{}A_{i}~{}\text{non-expanding}\\ 2\text{-linked, closed}\end{subarray}}\left(\prod_{i=1}^{\ell}\mathcal{D}(A_{i})\right)\cdot\left(2^{\|Y\|-\sum_{i=1}^{\ell}N(A_{i})}\cdot{\Xi}^{X_{A}}_{G_{A}}\right)$
		$\displaystyle=(1\pm\epsilon)\sum_{\ell=1}^{n/d}\sum_{\begin{subarray}{c}A_{1},\ldots,A_{\ell}~{}\text{compatible}\\ \forall i,~{}A_{i}~{}\text{non-expanding}\\ 2\text{-linked, closed}\end{subarray}}\left(\prod_{i=1}^{\ell}\tilde{\mathcal{D}}(A_{i})\right)\cdot\left(2^{\|Y\|-\sum_{i=1}^{\ell}N(A_{i})}\cdot{\Xi}^{X_{A}}_{G_{A}}(L)\right).$

	$\displaystyle\sum_{\gamma^{\prime}\not\sim\gamma}w_{\gamma^{\prime}}e^{f(\gamma^{\prime})+g(\gamma^{\prime})}$	$\displaystyle\leq\sum_{v\in\gamma}\sum_{\gamma^{\prime}\not\sim v}w_{\gamma^{\prime}}e^{f(\gamma^{\prime})+g(\gamma^{\prime})}$
		$\displaystyle\leq\|\gamma\|\cdot\max_{v\in\gamma}\sum_{\gamma^{\prime}\not\sim v}\lambda^{\|\gamma\|}(1+\lambda)^{-\|N(\gamma^{\prime})\|}e^{f(\gamma^{\prime})+g(\gamma^{\prime})}$
		$\displaystyle\leq\|\gamma\|\cdot\max_{v\in\gamma}\sum_{w\geq d}\left(\sum_{t\geq(\alpha/2)w}\mathcal{W}_{\lambda}(v,w-t,w)\cdot e^{c_{5}\alpha\ln 2\frac{\beta(\lambda)w}{4}}\right)$
		$\displaystyle\leq\|\gamma\|\cdot\max_{v\in\gamma}\sum_{w\geq d}e^{c_{5}\alpha\ln 2\frac{\beta(\lambda)w}{4}}\left(\sum_{t\geq(\alpha/2)w}\mathcal{W}_{\lambda}(v,w-t,w)\right)$
		$\displaystyle\leq\|\gamma\|\sum_{w\geq d}2^{c_{5}\alpha\frac{\beta(\lambda)w}{4}}\left(\sum_{t\geq(\alpha/2)w}2^{-c_{5}\beta(\lambda)t}\right)$
		$\displaystyle\leq 2d\|\gamma\|\sum_{w\geq d}2^{c_{5}\alpha\frac{\beta(\lambda)w}{4}}2^{-c_{5}\alpha\frac{\beta(\lambda)w}{2}}$
		$\displaystyle\leq 4d^{2}\|\gamma\|2^{-c_{5}\alpha\frac{\beta(\lambda)d}{4}}$
		$\displaystyle\leq 4d^{2}\|\gamma\|2^{-500\log^{2}d}\cdot 2^{-c_{5}\alpha\frac{\beta(\lambda)d}{8}}$
		$\displaystyle\leq\|\gamma\|\cdot c_{5}\alpha\frac{\beta(\lambda)}{16}.$