Perfect Matchings in Õ(n1.5 ) Time in Regular Bipartite Graphs

Similar documents
Ramsey numbers of some bipartite graphs versus complete graphs

Lecture 5. Symmetric Shearer s Lemma

Iterated Point-Line Configurations Grow Doubly-Exponentially

Linear First-Order Equations

A Sketch of Menshikov s Theorem

Lower Bounds for the Smoothed Number of Pareto optimal Solutions

Lecture Introduction. 2 Examples of Measure Concentration. 3 The Johnson-Lindenstrauss Lemma. CS-621 Theory Gems November 28, 2012

II. First variation of functionals

u!i = a T u = 0. Then S satisfies

Chromatic number for a generalization of Cartesian product graphs

The chromatic number of graph powers

Regular tree languages definable in FO and in FO mod

On the enumeration of partitions with summands in arithmetic progression

Lower Bounds for Local Monotonicity Reconstruction from Transitive-Closure Spanners

Computing Exact Confidence Coefficients of Simultaneous Confidence Intervals for Multinomial Proportions and their Functions

6 General properties of an autonomous system of two first order ODE

Least-Squares Regression on Sparse Spaces

arxiv: v1 [math.co] 15 Sep 2015

Robustness and Perturbations of Minimal Bases

Separation of Variables

On combinatorial approaches to compressed sensing

The Exact Form and General Integrating Factors

Calculus of Variations

Acute sets in Euclidean spaces

Generalized Tractability for Multivariate Problems

Math 1B, lecture 8: Integration by parts

Permanent vs. Determinant

7.1 Support Vector Machine

A new proof of the sharpness of the phase transition for Bernoulli percolation on Z d

Lower bounds on Locality Sensitive Hashing

Lecture 22. Lecturer: Michel X. Goemans Scribe: Alantha Newman (2004), Ankur Moitra (2009)

LATTICE-BASED D-OPTIMUM DESIGN FOR FOURIER REGRESSION

NOTES ON EULER-BOOLE SUMMATION (1) f (l 1) (n) f (l 1) (m) + ( 1)k 1 k! B k (y) f (k) (y) dy,

What s in an Attribute? Consequences for the Least Common Subsumer

Robust Forward Algorithms via PAC-Bayes and Laplace Distributions. ω Q. Pr (y(ω x) < 0) = Pr A k

Math 342 Partial Differential Equations «Viktor Grigoryan

Equilibrium in Queues Under Unknown Service Times and Service Value

Diophantine Approximations: Examining the Farey Process and its Method on Producing Best Approximations

arxiv: v4 [cs.ds] 7 Mar 2014

Proof of SPNs as Mixture of Trees

REAL ANALYSIS I HOMEWORK 5

On the Expansion of Group based Lifts

Survey Sampling. 1 Design-based Inference. Kosuke Imai Department of Politics, Princeton University. February 19, 2013

ALGEBRAIC AND ANALYTIC PROPERTIES OF ARITHMETIC FUNCTIONS

PDE Notes, Lecture #11

Characterizing Real-Valued Multivariate Complex Polynomials and Their Symmetric Tensor Representations

Three-Variable Bracket Polynomial for Two-Bridge Knots

Combinatorica 9(1)(1989) A New Lower Bound for Snake-in-the-Box Codes. Jerzy Wojciechowski. AMS subject classification 1980: 05 C 35, 94 B 25

SYNCHRONOUS SEQUENTIAL CIRCUITS

Sharp Thresholds. Zachary Hamaker. March 15, 2010

THE GENUINE OMEGA-REGULAR UNITARY DUAL OF THE METAPLECTIC GROUP

Implicit Differentiation

Tractability results for weighted Banach spaces of smooth functions

New bounds on Simonyi s conjecture

Selection principles and the Minimal Tower problem

On colour-blind distinguishing colour pallets in regular graphs

Two formulas for the Euler ϕ-function

TOEPLITZ AND POSITIVE SEMIDEFINITE COMPLETION PROBLEM FOR CYCLE GRAPH

LECTURE NOTES ON DVORETZKY S THEOREM

On the Aloha throughput-fairness tradeoff

Witt#5: Around the integrality criterion 9.93 [version 1.1 (21 April 2013), not completed, not proofread]

Efficient Construction of Semilinear Representations of Languages Accepted by Unary NFA

JUST THE MATHS UNIT NUMBER DIFFERENTIATION 2 (Rates of change) A.J.Hobson

Analyzing Tensor Power Method Dynamics in Overcomplete Regime

The number of K s,t -free graphs

n 1 conv(ai) 0. ( 8. 1 ) we get u1 = u2 = = ur. Hence the common value of all the Uj Tverberg's Tl1eorem

arxiv: v2 [math.pr] 27 Nov 2018

Calculus of Variations

Topic 7: Convergence of Random Variables

IPA Derivatives for Make-to-Stock Production-Inventory Systems With Backorders Under the (R,r) Policy

Logarithmic spurious regressions

Agmon Kolmogorov Inequalities on l 2 (Z d )

d-dimensional Arrangement Revisited

On non-antipodal binary completely regular codes

Euler equations for multiple integrals

Calculus in the AP Physics C Course The Derivative

Applications of the Wronskian to ordinary linear differential equations

Approximate Constraint Satisfaction Requires Large LP Relaxations

Exact Max 2-Sat: Easier and Faster

Discrete Mathematics

Designing Information Devices and Systems II Fall 2017 Note Theorem: Existence and Uniqueness of Solutions to Differential Equations

Some properties of random staircase tableaux

Chapter 6: Energy-Momentum Tensors

The Principle of Least Action

Generalizing Kronecker Graphs in order to Model Searchable Networks

Leaving Randomness to Nature: d-dimensional Product Codes through the lens of Generalized-LDPC codes

Table of Common Derivatives By David Abraham

A Unified Theorem on SDP Rank Reduction

An Optimal Algorithm for Bandit and Zero-Order Convex Optimization with Two-Point Feedback

Computing Derivatives

A PAC-Bayesian Approach to Spectrally-Normalized Margin Bounds for Neural Networks

LIAISON OF MONOMIAL IDEALS

Levy Process and Infinitely Divisible Law

Range and speed of rotor walks on trees

The Approximability and Integrality Gap of Interval Stabbing and Independence Problems

Lectures - Week 10 Introduction to Ordinary Differential Equations (ODES) First Order Linear ODEs

Convergence of Random Walks

arxiv:hep-th/ v1 3 Feb 1993

The total derivative. Chapter Lagrangian and Eulerian approaches

Mark J. Machina CARDINAL PROPERTIES OF "LOCAL UTILITY FUNCTIONS"

Transcription:

Perfect Matchings in Õ(n1.5 ) Time in Regular Bipartite Graphs Ashish Goel Michael Kapralov Sanjeev Khanna Abstract We consier the well-stuie problem of fining a perfect matching in -regular bipartite graphs with 2n vertices an m = n eges. While the best-known algorithm for general bipartite graphs (ue to Hopcroft an Karp) takes O(m n) time, in regular bipartite graphs, a perfect matching is known to be computable in O(m) time. Very recently, the O(m) boun was improve to O(min{m, n2.5 ln n }) expecte time, an expression that is boune by Õ(n1.75 ). In this paper, we further improve this result by giving an O(min{m, n2 ln 3 n }) expecte time algorithm for fining a perfect matching in regular bipartite graphs; as a function of n alone, the algorithm takes expecte time O((n ln n) 1.5 ). To obtain this result, we esign an analyze a two-stage sampling scheme that reuces the problem of fining a perfect matching in a regular bipartite graph to the same problem on a subsample bipartite graph with O(n ln n) eges. The first-stage is a sub-linear time uniform sampling that reuces the size of the input graph while maintaining certain structural properties of the original graph. The secon-stage is a non-uniform sampling that takes linear-time (on the reuce graph) an outputs a graph with O(n ln n) eges, while preserving a matching with high probability. This matching is then recovere using the Hopcroft-Karp algorithm. While the stanar analysis of Hopcroft-Karp also gives us an Õ(n1.5 ) running n2 time, we present a tighter analysis for our special case that results in the stronger Õ(min{m, }) time mentione earlier. Our proof of correctness of this sampling scheme uses a new corresponence theorem between cuts an Hall s theorem witnesses for a perfect matching in a bipartite graph that we prove. We believe this theorem may be of inepenent interest; as another example application, we show that a perfect matching in the support of an n n oubly stochastic matrix with m non-zero entries can be foun in expecte time Õ(m + n 1.5 ). Departments of Management Science an Engineering an (by courtesy) Computer Science, Stanfor University. Email: ashishg@stanfor.eu. Research supporte by NSF ITR grant 0428868, NSF CAREER awar 0339262, an a grant from the Stanfor-KAUST alliance for acaemic excellence. Institute for Computational an Mathematical Engineering, Stanfor University. Email: kapralov@stanfor.eu. Research supporte by a Stanfor Grauate Fellowship. Department of Computer an Information Science, University of Pennsylvania, Philaelphia PA. Email: sanjeev@cis.upenn.eu. Supporte in part by a Guggenheim Fellowship, an IBM Faculty Awar, an by NSF Awar CCF-0635084.

1 Introuction A bipartite graph G = (P, Q, E) with vertex set P Q an ege set E P Q is sai to be regular if every vertex has the same egree. We use m = n to enote the number of eges in G an n to represent the number of vertices in P (as a consequence of regularity, P an Q have the same size). Regular bipartite graphs are a funamental combinatorial object, an arise, among other things, in expaner constructions, scheuling, routing in switch fabrics, an task-assignment [15, 1, 6]. A regular bipartite graph of egree can be ecompose into exactly perfect matchings, a fact that is an easy consequence of Hall s theorem [4], an is closely relate to the Birkhoff-von Neumann ecomposition of a oubly stochastic matrix [3, 17]. Fining a matching in a regular bipartite graph is a well-stuie problem, starting with the algorithm of König in 1916 [13], which is now known to run in time O(mn). The well-known bipartite matching algorithm of Hopcroft an Karp [9] can be use to obtain a running time of O(m n). In graphs where is a power of 2, the following elegant iea, ue to Gabow an Kariv [7], leas to an algorithm with O(m) running time. First, compute an Euler tour of the graph (in time O(m)) an then follow this tour in an arbitrary irection. Exactly half the eges will go from left to right; these form a regular bipartite graph of egree /2. The total running time T (m) thus follows the recurrence T (m) = O(m) + T (m/2) which yiels T (m) = O(m). Extening this iea to the general case prove quite har, an after a series of improvements (eg. by Cole an Hopcroft [5], an then by Schrijver [16] to O(m)), Cole, Ost, an Schirra [6] gave an O(m) algorithm for the case of general. Their main interest was in ege coloring of general bipartite graphs, where fining perfect matchings in regular bipartite graphs is an important subroutine. Very recently, Goel, Kapralov, an Khanna [8], gave a sampling-base algorithm that computes a perfect matching in - regular bipartite graphs in O(min{m, n2.5 ln n }) expecte time, an expression that is boune by Õ(n1.75 ). The algorithm of [8] uses uniform sampling to reuce the number of eges in the input graph while preserving a perfect matching, an then runs the Hopcroft-Karp algorithm on the sample graph. Our Results an Techniques: We present a significantly faster algorithm for fining perfect matchings in regular bipartite graphs. ( ) Theorem 1.1 There is an O min{m, n2 ln 3 n } expecte time algorithm to fin a perfect matching in a - regular bipartite graph G. As a function of n alone, the running time state above is O((n ln n) 1.5 ). Since the O(m) running time is guarantee by the algorithm of Cole, Ost, an Schirra, we are only concerne with the case where is Ω( n ln n). For this regime, our algorithm reuces the perfect matching problem on a regular bipartite graph G to the same problem on a (not necessarily regular) sparse bipartite graph H with O(n ln n) eges. This reuction takes time O( n2 ln 3 n ). We then use the Hopcroft-Karp algorithm on H to recover a perfect matching. A black-box use of the analysis of the Hopcroft-Karp algorithm woul suggest a running time of O( n2 ln 3 n + n 1.5 ln n). However, we show that the final sample graph has some special structure that guarantees that the Hopcroft-Karp algorithm woul complete in time O( n2 ln 2 n ) whp. For every pair A P, B Q, we efine a witness set W (A, B) to be the set of all eges going from A to Q \ B. Of particular interest are what we call Hall witness sets, which correspon to A > B ; the well-known Hall s theorem [4] says that a bipartite graph H(P, Q, E H ) contains a perfect matching iff E H inclues an ege from each Hall witness set. Thus any approach that reuces the size of the input bipartite graph by sampling must ensure that some ege from every Hall witness set is inclue in the sample graph; otherwise the sample graph no longer contains a perfect matching. Goel, Kapralov, an Khanna [8] showe that no uniform sampling scheme on a -regular bipartite graph can reuce the number of eges to o( n2 ln n ) while preserving a perfect matching, an hence their Õ(n1.75 )-time algorithm is the best possible running time achievable via uniform sampling followe by a black-box invocation of the Hopcroft-Karp analysis. 1

In orer to get past this barrier, we use here a two-stage sampling process. The first stage is a uniform sampling (along the lines of [8]) which generates a reuce-size graph G = (P, Q, E ) that preserves not only a perfect matching but also a key relationship between the sizes of relevant witness sets an cuts in the graph G. The secon stage is to run the non-uniform Benczúr-Karger sampling scheme [2] on G to generate a graph G with Õ(n) eges while preserving a perfect matching w.h.p. Since this step requires Ω( E ) time, we crucially rely on the fact that G oes not contain too many eges. While our algorithm is easy to state an unerstan, the proof of correctness is quite involve. The Benczúr-Karger sampling was evelope to generate, for any graph, a weighte subgraph with Õ(n) eges that approximately preserves the size of all cuts in the original graph. The central iea unerlying our result is to show that there exists a collection of core witness sets that can be ientifie in an almost one-one manner with cuts in the graph such that the probability mass of eges in each witness set is comparable to the probability mass of the eges in the cut ientifie with it. Further, every witness set in the graph has a representative in this collection of core witness sets. Informally, this allows us to employ cut-preserving sampling schemes such as Benczúr-Karger as witness-preserving schemes. We note here that the natural mapping which assigns the witness set of a pair (A, B) to the cut eges associate with this pair can map arbitrarily many witness sets to the same cut an is not useful for our purposes. One of our contributions is an uncrossing theorem for witness sets, that we refer to as the proportionate uncrossing theorem. Informally speaking, it says that given any collection of witness sets R such that the probability mass of each witness set is comparable to that of its associate cut, there exists another collection T of witness sets such that (i) the natural mapping to cuts as efine above is half-injective for T, that is, at most two witness sets in T map to any given cut, (ii) the probability mass of each witness set is comparable to the probability mass of its associate cut, an (iii) any subset of eges that hits every witness set in T also hits every witness set in R. The collection T is referre to as a proportional uncrossing of R. As shown in Figure 1(a), we can not achieve an injective mapping, an hence the half-injectivity is unavoiable. We believe the half-injective corresponence between witness sets an cuts, as facilitate by the proportionate uncrossing theorem, is of inepenent interest, an will perhaps have other applications in this space of problems. We also emphasize here that the uncrossing theorem hols for all bipartite graphs, an not only regular bipartite graphs. Inee, the graph G on which we invoke this theorem oes not inherit the regularity property of the original graph G. As another illustrative example, consier the celebrate Birkhoff-von Neumann theorem [4, 17] which says that every oubly stochastic matrix can be expresse as a convex combination of permutation matrices (i.e., perfect matchings). In some applications, it is of interest to o an iterative ecomposition whereby a single matching is recovere in each iteration. The best-known boun for this problem, to our knowlege, is an O(mb) time algorithm that follows from the work of Gabow an Kariv [7]; here b enotes the maximum number of bits neee to express any entry in M. The following theorem is an easy consequence of our proportionate uncrossing result. Theorem 1.2 Given an n n oubly-stochastic matrix M with m non-zero entries, one can fin a perfect matching in the support of M in Õ(m + n1.5 ) expecte time. The proof of this theorem an a iscussion of known results about this problem are given in section 6. Though this result itself represents only a moest improvement over the earlier O(mb) running time, it is an instructive illustration of the utility of the proportionate uncrossing theorem. It is worth noting that while the analysis of Goel, Kapralov, an Khanna was along broaly similar lines (sample eges from the original graph, followe by running the Hopcroft-Karp algorithm), the proportionate uncrossing theorem evelope in this paper requires significant new ieas an is crucial to incorporating the non-uniform sampling stage into our algorithm. Further, the running time of the Hopcroft-Karp algorithm is easily seen to be Ω(m n) even for the 2-regular graph consisting of Θ( n) isjoint cycles of lengths 2, 4,..., n respectively; the stronger analysis for our special case requires both our uncrossing theorem as 2

well as a stronger ecomposition 1. As a step in this analysis, we prove the inepenently interesting fact that after sampling eges from a -regular bipartite graph with rate c ln n, for some suitable constant c, we obtain a graph that has a matching of size n O(n/) whp an such a matching can be foun in O(n/) augmenting phases of the Hopcroft-Karp algorithm whp. Organization: Section 2 reviews an presents some useful corollaries of relevant earlier work. In Section 3, we establish the proportionate uncrossing theorem. In section 4, we present an analyze our two-stage sampling scheme, an section 5 outlines the stronger analysis of the Hopcroft-Karp algorithm for our special case. Section 6 contains the proof of Theorem 1.2 an a iscussion of known results on fining perfect matchings in the support of ouble stochastic matrices. 2 Preliminaries In this section, we aapt an present recent results of Goel, Kapralov, an Khanna [8] as well as the Benczúr- Karger sampling theorem [2] for our purposes, an also prove a simple technical lemma for later use. 2.1 Bipartite Decompositions an Relevant Witness Pairs Let G = (P, Q, E) be a regular bipartite graph, with vertex set P Q an ege set E P Q. Consier any partition of P into k sets P 1, P 2,..., P k, an a partition of Q into Q 1, Q 2,..., Q k. Let G i enote the (not necessarily regular) bipartite graph (P i, Q i, E i ) where E i = E (P i Q i ). We will call this a ecomposition of G. Given A P an B Q, efine the witness set corresponing to the pair (A, B), enote W (A, B), as the set of all eges between A an Q \ B, an efine the cut C(A, B) as the set of all eges between A B an (P \ A) (Q \ B). The rest of the efinitions in this section are with respect to some arbitrary but fixe ecomposition of G. Definition 2.1 An ege (u, v) E is relevant if (u, v) E i for some i. Definition 2.2 Let E R be the set of all relevant eges. A pair (A, B) is sai to be relevant if 1. A P i an B Q i for some i, 2. A > B, an 3. There oes not exist another A P i, B Q i, such that A A, A > B, an W (A, B ) E R W (A, B) E R. Informally, a relevant pair is one which is containe completely within a single piece in the ecomposition, an is minimal with respect to that piece. The following lemma is implicit in [8] an is prove in appenix A for completeness. Lemma 2.3 Let R enote all relevant pairs (A, B) with respect to a ecomposition of G(P, Q, E), an let E R enote all relevant eges. Consier any graph G = (P, Q, E ). If for all (A, B) R, we have W (A, B) E E R, then G has a perfect matching. 1 It is known that the Hopcroft-Karp algorithm terminates quickly on bipartite expaners [14], but those techniques on t help in our setting since we start with an arbitrary regular bipartite graph. 3

2.2 A Corollary of Benczúr-Karger Sampling Scheme The Benczúr-Karger sampling theorem [2] shows that for any graph, a relatively small non-uniform ege sampling rate suffices to ensure that every cut in the graph is hit by the sample eges (i.e. it has a non-empty intersection) with high probability. The sampling rate use for each ege e inversely epens on its strength, as efine below. Definition 2.4 [2] A k-strong component of a graph H is a maximal vertex-inuce subgraph of H with ege-connectivity k. The strength of an ege e in a graph H is the maximum value of k such that a k-strong component contains e. Definition 2.5 Given a graph H = (V, E), let H [j] = (V, E [j] ) enote the subgraph of H restricte to eges of strength j or higher, where j is some integer in {1, 2,..., V }. It is easy to see that whenever a cut in a graph H(V, E) contains an ege of strength k, then the cut must contain at least k eges. Furthermore, for any 1 < j V, each connecte component of graph H [j] is containe insie some connecte component of H [j 1]. The Benczúr-Karger theorem utilizes these properties to show that it suffices to sample each ege e with probability Θ(min{1, ln n/s e }). We now exten this sampling result to any collection of ege-sets for which there exists an injection (oneone mapping) to cuts of comparable inverse strengths. The statement of our theorem 2.6 closely mirrors the Benczúr-Karger sampling theorem, an the proof is also along the same general lines. However, the proof oes not follow from the Benczúr-Karger sampling theorem in a black-box fashion, so a proof is provie in appenix B. Theorem 2.6 Let H(V, E) be any graph on n vertices, an let C enote the set of all possible ege cuts in H, an γ (0, 1] be { a constant. } Let H be a subgraph of H obtaine by sampling each ege e in H with probability p e = min 1, c ln n γs e, where s e enotes the strength of ege e, an c is a suitably large constant. Further, let X be a collection of subsets of eges, an let f be a one-one (not necessarily onto) mapping from X to C satisfying e X 1/s e > γ e f(x) 1/s e for all X X. Then Pr[No ege in X is chosen in H ] 1 n 2. X X The result below from [2] bouns the number of eges chosen by the sampling in Theorem 2.6. Theorem 2.7 Let H(V, E) be any graph on n{ vertices, an let H be a subgraph of H obtaine by sampling each ege e in H with probability p e = min 1, c ln n, where s e enotes the strength of ege e, an c is s e } any constant. Then with probability at least 1 1 n 2, the graph H contains at most c n ln n eges, where c is another suitably large constant. We conclue with a simple property of integer multisets that we will use later. A similar statement was use in [11] (lemma 4.5). A proof is provie in appenix C for completeness. Lemma 2.8 Let S 1 an S 2 be two arbitrary multisets of positive integers such that S 1 > γ S 2 for some γ > 0. Then there exists an integer j such that 1 i > γ 1. i i j an i S 1 i j an i S 2 4

3 Proportionate Uncrossing of Witness Sets Consier a bipartite graph G = (P, Q, E), with a non-negative weight function t efine on the eges. Assume further that we are given a set of relevant eges E R E. We can exten the efinition of t to sets of eges, so that t(s) = e S t(e), where S E. Definition 3.1 For any γ > 0 an A P, B Q, the pair (A, B) is sai to be γ-thick with respect to (G, t, E R ) if t(w (A, B) E R ) > γt(c(a, B)), i.e., the total weight of the relevant eges in W (A, B) is strictly more than γ times the total weight of C(A, B). A set of pairs R = {(A 1, B 1 ), (A 2, B 2 ),..., (A K, B K )} where each A i P an each B i Q is sai to be a γ-thick collection with respect to (G, t, E R ) if every pair (A i, B i ) R is γ-thick. The quantities G, t, an E R will be fixe for this section, an for brevity, we will omit the phrase with respect to (G, t, E R ) in the rest of this section. Before efining proportionate uncrossings of witness sets, we will informally point out the motivation for oing so. If a pair (A, B) is γ-thick for some constant γ, an if we know that a sampling process where ege e is chosen with probability t chooses some ege from C(A, B) with high probability, then increasing the sampling probability by a factor of 1/γ shoul result in some relevant ege from W (A, B) being chosen with high probability as well, a fact that woul be very useful in the rest of this paper. The sampling sub-routines that we employ in the rest of this paper are analyze by using union-boun over all cuts, an in orer to apply the same union boun, it woul be useful if each witness set were to correspon to a unique cut. However, in figure 1(a), we show two pairs (A, B) an (X, Y ) which are both (1/3)-thick with respect to the uniform weight function t 1 but correspon to the same cut; we call this a crossing of the pairs (A, B) an (X, Y ), rawing intuition from the figure. In general, we can have many witness sets that map to the same cut. We woul like to uncross these witness sets by fining subsets of each witness set that map to unique cuts, but there is no way to uncross figure 1(a) in this fashion. Fortunately, an somewhat surprisingly, this is the worst case: any collection of γ-thick pairs can be uncrosse into another collection such that all the pairs in the new collection are also γ-thick (hence the term proportionate uncrossing), every original witness set has a representative in this new collection, an no more than two new pairs have the same cut. Figure 1(b) shows two 1 3 -thick pairs that can be uncrosse using a single 1 3-thick representative, (A X, B Y ). We will spen the rest of this section formalizing the notion of proportionate uncrossings an proving their existence. The uncrossing process is algorithmically inefficient, but we only nee to emonstrate existence for the purpose of this paper. The arguments in this section represent the primary technical contribution of this paper; these arguments apply to bipartite graphs in general (not necessarily regular), an may be inepenently interesting. P Q P Q A B A B X Y X Y (a) (b) Figure 1: Both (a) an (b) epict two 1 3-thick pairs (A, B) an (X, Y ) that have ifferent witness sets but the same cut (i.e. W (A, B) W (X, Y ) but C(A, B) = C(X, Y )). The pairs in (a) can not be uncrosse, whereas the pairs in (b) can be uncrosse by choosing the single pair (A X, B Y ) as a representative. 5

3.1 Proportionate Uncrossings: Definitions an Properties Definition 3.2 A γ-uncrossing of a γ-thick collection R is another γ-thick collection of pairs T that satisfies the three properties below: P1: For every pair (A, B) R there exists a pair (A, B ) T such that C(A, B ) C(A, B), an W (A, B ) W (A, B). We will refer to (A, B ) as a representative of (A, B). P2 For every (A, B ) T, there exists (A, B) R such that C(A, B ) C(A, B). P3: (Half-injectivity): There can not be three istinct pairs (A, B), (A, B ), an (A, B ) in T such that C(A, B) = C(A, B ) = C(A, B ). Since T has the same (or larger) thickness as the thickness guarantee that we ha for R, it seems appropriate to refer to T as a proportionate uncrossing of R. Definition 3.3 A γ-partial-uncrossing of a γ-thick collection R is another γ-thick collection of pairs T which satisfies properties P1,P2 above but not necessarily P3. The following three lemmas follow immeiately from the two efinitions above, an it will be useful to state them explicitly. Informally, the first says that every collection is its own partial uncrossing, the secon says that uncrossings can be compose, an the thir says that the union of the partial uncrossings of two collections is a partial uncrossing of the union of the collections. Lemma 3.4 If R is a γ-thick collection, then R is a γ-partial uncrossing of itself. Lemma 3.5 If S is a γ-partial uncrossing of a γ-thick collection R, an T is a γ-uncrossing of S, then T is also a γ-uncrossing of R. Lemma 3.6 If R 1 an R 2 are two γ-thick collections, T 1 is a γ-partial-uncrossing of R 1, an T 2 is a γ- partial-uncrossing of R 2, then T 1 T 2 is a γ-partial-uncrossing of R 1 R 2. 3.2 Proportionate Uncrossings: An Existence Theorem The main technical result of this section is the following: Theorem 3.7 For every γ-thick collection R, there exists a γ-uncrossing of R. The proof is via inuction over the largest cut corresponing to any pair in the collection R; each inuctive step uncrosses the witness sets which correspons to this largest cut. Before proving this theorem, we nee to provie several useful efinitions an also establish a key lemma. Define some total orering over all subsets of E which respects set carinality, so that if E 1 < E 2 then E 1 E 2. Overloa notation to use C(R) to enote the set of cuts {C(A, B) : (A, B) R}. Analogously, use W (R) to enote the set of witness sets corresponing to pairs in R. Since C(A, B) may be equal to C(A, B ) for (A, B) (A, B ), it is possible that C(R) may be smaller than R. In fact, if R an C(R) are equal, then R is its own γ-uncrossing an the theorem is trivially true. Similarly, it is possible that W (A, B) is equal to W (A, B ) for two ifferent pairs (A, B) an (A, B ) in R. However, suppose W (A, B) = W (A, B ) an C(A, B) = C(A, B ) for two ifferent pairs (A, B) an (A, B ) in R. In this case, we can remove one of the two pairs from the collection to obtain a new collection R ; it is easy to see that a γ-uncrossing of R is also a γ-uncrossing of R. So we will assume without loss of generality that for any two pairs (A, B) an (A, B ) in R, either W (A, B) W (A, B ) or C(A, B) C(A, B ); we will call this the non-reunancy assumption. 6

We will now prove a key lemma which contains the meat of the uncrossing argument. When we use this lemma later in the proof of theorem 3.7, we will only use the fact that there exists a γ-partial-uncrossing of R, where R satisfies the preconitions of the lemma. However, the stronger claim of existence of a γ-uncrossing oes not require much aitional work an appears to be an interesting graph theoretic argument in its own right, so we prove this stronger claim. Lemma 3.8 If R is a γ-thick collection such that R > 2, R satisfies the non-reunancy assumption, an C(R) contains a single set S, then there exists a γ-uncrossing T of R. Further, for every pair (A, B) T, we have C(A, B) S. Proof: Let R = {(A 1, B 1 ), (A 2, B 2 ),..., (A J, B J )}. Since C(A i, B i ) = S for all i, we know by the nonreunancy assumption that W (A i, B i ) W (A i, B i ) for i i. We break the proof own into multiple stages. 1. Definition of Venn witnesses an Venn cuts. For any J-imensional bit-vector b {0, 1} J, efine A (b) = P \, an similarly, B (b) = Q \. b i =1 A i b i =0 A i We overloa notation an use W (b) to enote the witness set W (A (b), B (b) ) an C (b) to enote the cut set C(A (b), B (b) ). A noe u belongs to A (b) if it is in every set A i such that b i = 1 an not in any of the sets A i for which b i = 0. Thus, each A (b) correspons to one of the regions in the Venn iagram of the sets A 1, A 2,..., A J, an the analogous statement hols for each B (b). Hence, we will refer to the sets W (b) an C (b) as the Venn-witness an the Venn-cut for b, respectively, an refer to the pair (A (b), B (b) ) as a Venn pair. Also, we will use b to refer to a vector which iffers from b in every bit. 2. The special structure of Venn witnesses an Venn cuts. Consier an ege (u, v) that goes out of A (b). Suppose that ege goes to B () where b an b. Then there must exist 1 i, i J such that b i = i an b i i. Since b i = i, either u A i, v B i (if b i = i = 1) or u A i, v B i (if b i = i = 0). In either case the ege (u, v) oes not belong to the cut C(A i, B i ), an since all pairs in R have the same cut S, we conclue that (u, v) S. On the other han, since b i i, either u A i, v B i (if b i = 1, i = 0) or u A i, v B i (if b i = 0, i = 1). In either case the ege (u, v) belongs to the cut C(A i, B i ) an hence to S, which is a contraiction. Thus, any ege from A (b) goes to either B (b) or B (b). If the ege (u, v) goes to B (b) then it oes not belong to any witness set in W (R), any Venn witness set, any Venn cut, or S. If (u, v) goes to B (b) then it belongs to S, to the Venn witness set W (b), to the Venn cuts C (b) an C (b), an to no other Venn witness set or Venn cut. This ege also belongs to W (A i, B i ) for all i such that b i = 1. These observations, an the efinitions of Venn witnesses, cuts, an pairs easily lea to the following consequences: b i =1 B i b i =0 W (b) W () = if b, (1) W (A i, B i ) = b {0,1} J :b i =1 B i W (b), (2) C (b) = C (b), (3) C (b) C () = if b an b, (4) 7

an finally, ( i, 1 i J) : S = C (b), (5) b {0,1} J :b i =1 W (b) W (b) = C (b). (6) 3. The collection T. Define T to consist of all γ-thick Venn pairs (A (b), B (b) ) where b is not the all zero vector. 4. Proving that T is a γ-uncrossing of R. (P1): Fix some i, 1 i J. Since R is a γ-thick collection, it follows from the efinition that (A i, B i ) must be a γ-thick pair. From equations 2 an 1, we know that t(w (A i, B i ) E R ) = b {0,1} J :b i =1 t(w (b) E R ). We also know, from equations 4 an 5, that t(s) = b {0,1} J :b i =1 t(c (b)). Hence, there must be some b {0, 1} J such that b i = 1 an (A (b), B (b) ) is γ-thick, which in turn implies that (A (b), B (b) ) is in T. This is the representative of (A i, B i ) an hence T satisfies P1. (P2): This follows trivially from equation 5. (P3): From equation 4 we know that there are only two possible Venn pairs (specifically, (A (b), B (b) ) an (A (b), B (b) )) that have the same non-empty cut C (b). Observe that our efinition of γ-thickness involves strict inequality, an hence Venn pairs where the Venn witness set an the Venn cut are both empty can t be γ-thick an can t be in T. 5. Proving that C(X, Y ) S for all pairs (X, Y ) T. Any cut C(A, B) C(T ) is of the form C (b) for some J-imensional bit vector b. Each C (b) S, from equation 5. We will now show that this containment is strict. Suppose not, i.e., there exists some C (b) = S. By equation 3, C (b) = S as well. Since J > 2, either b or b must have two bits that are set to 1; without loss of generality, assume that b 1 = b 2 = 1. From equations 1 an 6, we know that C (b) (an hence S) is the isjoint union of W (b) an W (b). Any ege in W (b) must belong to both W (A 1, B 1 ) an W (A 2, B 2 ), whereas any ege in W (b) can not belong to either W (A 1, B 1 ) or W (A 2, B 2 ). Hence, W (A 1, B 1 ) = W (A 2, B 2 ) = W (b) which contraicts the non-reunancy assumption on R. Therefore, we must have C (b) S. Proof of Theorem 3.7: The proof will be by inuction over the largest set in C(R) accoring to the orering. Let M(R) enote this largest set. For the base case, suppose M(R) is the smallest set S uner the orering. Then S must be singleton, C(R) must have just a single set S, an W (R) must also have a single witness set, which must be the same as S since R is γ-thick. By the non-reunancy assumption, R must have at most one pair, an is its own γ-uncrossing. For the inuctive step, consier any possible cut S an assume that the theorem is true when M(R) S. We will show that the theorem is also true when M(R) = S, which will complete the inuctive proof. Suppose there is a unique (A, B) R such that C(A, B) = S. Intuitively, one woul expect this to be the easy case, since there is no uncrossing to be one for S, an inee, this case is quite straightforwar. Define R = R (A, B). Let T enote a γ-uncrossing of R, which is guarantee to exist by the inuctive hypothesis. Since T is γ-thick, so is T = T {(A, B)}. The pair (A, B) clearly has a representative in T (itself), an any (A, B ) R (A, B) has a representative in T an hence also in T. Thus, T satisfies property P1 for being a γ-uncrossing of R. Every set in C(T ) is a subset of some cut in C(R ) (by property P2) an C(A, B) is also in C(R), an hence T satisfies property P2 for being a γ-uncrossing of R. Every set in T is smaller than C(A, B) accoring to an T satisfies property P3. Hence, T also satisfies property P3. Thus, T is a γ-uncrossing of R. If there are exactly two istinct pairs (A, B) an (A, B ) in R such that C(A, B) = C(A, B ) = S, then the same argument works again, except that R = R \ {(A, B), (A, B )} an T = T {(A, B), (A, B )}. We now nee to tackle the most interesting case of the inuctive step, where there are more than two pairs in R that correspon to the same cut S. Write R = R 1 R 2 where C(A, B) S for all (A, B) R 1 8

an C(A, B) = S for all (A, B) R 2. Recall that for two ifferent pairs (A, B) an (A, B ) in R 2, we must have W (A, B) W (A, B ) by the non-reunancy assumption. From lemma 3.8, there exists a γ- partial-uncrossing, say S 2, of R 2 with the property that for every set S C(S 2 ), we have S S, an hence S S. By lemma 3.4, we know that R 1 is its own γ-partial-uncrossing. Further, by efinition of R 1, every set S C(R 1 ) must satisfy S S. Define S = R 1 S 2. By lemma 3.6, S is a γ-partial-uncrossing of R 1 R 2, i.e., of R. Further, for every cut S C(S), we have S S. Hence, by our inuctive hypothesis, there exists a γ-uncrossing of S; let T be a γ-uncrossing of S. By lemma 3.5, T is also a γ-uncrossing of R, which completes the inuctive proof. Remark 3.9 An alternate approach to relating cuts an witness sets is to suitably moify the proof of the Benczúr-Karger sampling theorem, circumventing the nee for the proportionate uncrossing theorem. The iea is base on the observation that Karger s sampling theorem also hols for vertex cuts in graphs. Since Benczúr-Karger sampling theorem is prove using multiple invocations of Karger s sampling theorem, it is possible to set up a corresponence between cuts an witness sets using a vertex-cut version of the Benczúr- Karger sampling theorem. However, we prefer to use here the approach base on the proportionate uncrossing theorem as it is an interesting combinatorial statement in its own right. 4 An Õ(n1.5 ) Time Algorithm for Fining a Perfect Matching We present here an Õ(n1.5 ) time ranomize algorithm to fin a perfect matching in a given -regular bipartite graph G(P, Q, E) on 2n vertices. Throughout this section, we follow the convention that for any pair (A, B), the sets C(A, B) an W (A, B) are efine with respect to the graph G. Our starting point is the following theorem, establishe by Goel, Kapralov, an Khanna [8] 2 Theorem 4.1 Let G(P, Q, E) be a -regular bipartite graph, ɛ any number in (0, 1 2 ), an c a suitably large constant that epens on ɛ. There exists a ecomposition of G into k = O(n/) vertex-isjoint bipartite graphs, say G 1 = (P 1, Q 1, E 1 ), G 2 = (P 2, Q 2, E 2 ),..., G k = (P k, Q k, E k ), such that 1. Each G i contains at least /2 perfect matchings, an the minimum cut in each G i is Ω( 2 /n). 2. Let R enote the set of relevant pairs with respect to this ecomposition, an E R enote the set of relevant eges. Then for each (A, B) in R, we have W (A, B) E R 1 2 C(A, B). 3. Let G (P, Q, E ) be a ranom graph generate by sampling the eges of G uniformly at ranom with cn ln n probability p =. Then with probability at least 1 1/n, for every pair (A, B) R, 2 ( ) 1 ɛ W (A, B) E E R > (1 ɛ)p W (A, B) E R > C(A, B) E. 2(1 + ɛ) The last conition above says that in aition to all cuts, all relevant witness ege sets are also preserve to within (1 ± ɛ) of their expecte value in G, with high probability. We emphasize here that the ecomposition highlighte in Theorem 4.1 will be use only in the analysis of our algorithm; the algorithm itself is oblivious to this ecomposition. Our algorithm consists of the following three steps. (S1) Generate a ranom graph G = (P, Q, E ) by sampling eges of G uniformly at ranom with probability p = c 1n ln n where c 2 1 is a constant as in Theorem 4.1 3. We choose ɛ to be any fixe constant not larger than 0.2. 2 Part 1 of theorem 4.1 correspons to theorem 2.3 in [8], part 2 is prove as part of the proof of theorem 2.1 in [8], an part 3 combines remark 2.5 in [8] with Karger s sampling theorem [10]. 3 The time require for this sampling is proportional to the number of eges chosen, assuming the graph is presente in an ajacency list representation with each list store in an array. 9

(S2) The graph G contains O( n2 ln n ) eges w.h.p. We now run the Benczúr-Karger sampling algorithm [2] that takes O( E ln 2 n) time to compute the strength s e of every ege e an samples each ege e with probability p e ; 4 here p e is as given by Theorem 2.6 with γ = 1/3. We show below that w.h.p. the graph G = (P, Q, E ) obtaine from this sampling contains a perfect matching. (S3) Finally, we run the Hopcroft-Karp algorithm to obtain a maximum carinality matching in G O(n 1.5 ln n) time since by Theorem 2.7, G contains O(n ln n) eges w.h.p. in Running time: With high probability, the running time of this algorithm is boune by O( n2 ln3 n + n 1.5 ln n). Since we can always use the algorithm of Cole, Ost, an Schirra [6] instea, the final running time is O(min{m, n2 ln3 n + n 1.5 ln n}). This reuces to O(m) if n ln n; to O(n 1.5 ln n) when n ln 2 n; an to at most O((n ln n) 1.5 ) in the narrow range n ln n < < n ln 2 n. Correctness: To prove correctness, we nee to show that G contains a perfect matching w.h.p. Theorem 4.2 The graph G contains a perfect matching with probability 1 O(1/n). Proof: Consier the ecomposition efine in Theorem 4.1. Let R enote the set of relevant pairs with respect to this ecomposition, an let E R enote the set of all relevant eges with respect to this ecomposition. We will now focus on proving that, with high probability, for every (A, B) R, W (A, B) E R E ; by Lemma 2.3, this is sufficient to prove the theorem. For convenience, efine W (A, B) = W (A, B) E an C (A, B) = C(A, B) E. Assume for now that the low-probability event in Theorem 4.1 oes not occur. Thus, by choosing ɛ 0.2, we know that for γ = 1/3, every relevant pair (A, B) R satisfies W (A, B) E R > γ C (A, B). Let s e enote the strength of e in G. Recall that G [j] = (V, E [j] ) is the graph with the same vertex set as G but consisting of only those eges in E which have strength at least j. Define W [j] (A, B) to be the set of all eges in W (A, B) E [j] ; efine C [j] (A, B) analogously. Define t(e) = 1/s e. Since W (A, B) E R > γ C (A, B), by Lemma 2.8, there must exist a j such that 1 s e (W (A,B) E R ),s e e j > γ 1 s e C (A,B),s e e j which implies that (A, B) is γ-thick with respect to (G [j], t, E R), as efine in Definition 3.1. Partition R into R [1], R [2],..., R [n], such that if (A, B) R [j] then (A, B) is γ-thick with respect to (G [j], t, E R), breaking ties arbitrarily if (A, B) can belong to multiple R [j]. Consier an arbitrary non-empty R [j]. Let T represent a γ-uncrossing of R [j], as guarantee by Theorem 3.7. By property P3, no three pairs in a γ-uncrossing can have the same cut; partition T into T 1 an T 2 such that every pair (A, B) T 1 has a unique cut C [j] (A, B) an the same hols for T 2. We focus on T 1 for now. For any (A, B) T 1, efine Y (A, B) = W [j] (A, B) E R. Define X = {Y (A, B) : (A, B) T 1 }. For any X X, efine f(x) = C [j] (A, B) for some arbitrary (A, B) T 1 such that X = Y (A, B). The function f is one-one by construction, an since (A, B) is γ-thick, we know that e X 1/s e > γ e f(x) 1/s e. Thus, X satisfies the preconitions of Theorem 2.6. Further, the sampling probability p e in step (S2) of the algorithm is chosen to correspon to γ = 1/3. Thus, with probability at least 1 1/n 2, X E is non-empty for all X X, i.e., W [j] (A, B) E R E for all (A, B) T 1. Since G [j] is a subgraph of G, we can conclue that W (A, B) E R E for all (A, B) T 1 with probability at least 1 1/n 2. 4 In fact, this sampling algorithm computes an upper boun on s e, but this only affects the running time an the number of eges sample by a constant factor. > 0, 10

Since the analogous argument hols for T 2, we obtain W (A, B) E R E for all (A, B) T with probability at least 1 2/n 2. Since T is a γ-uncrossing of R [j], we use property P1 to conclue that W (A, B) E R E for all (A, B) R [j], again with probability at least 1 2/n 2. Applying the union boun over all j, we further conclue that W (A, B) E R E for all (A, B) R with probability at least 1 2/n. As mentione before, this suffices to prove that G has a perfect matching with probability at least 1 2/n, by Lemma 2.3. We assume that conition 3 in theorem 4.1 is satisfie; this is violate with probability at most 1 n, which proves that G has a perfect matching with probability at least 1 3 n. As presente above, the algorithm takes time min{õ(n1.5 ), O(m)} with high probability, an outputs a perfect matching with probability 1 O(1/n). We conclue with two simple observations. First. it is easy to convert this into a Monte Carlo algorithm with a worst case running-time of min{õ(n1.5 ), O(m)}, or a Las Vegas algorithm with an expecte running-time of min{õ(n1.5 ), O(m)}. If either the sampling process in steps (S1) or (S2) returns too many eges, or step (S3) oes not prouce a perfect matching, then (a) abort the computation to get a Monte Carlo algorithm, or (b) run the O(m) time algorithm of Cole, Ost, an Schirra [6] to get a Las Vegas algorithm. Secon, by choosing larger constants uring steps (S1) an (S2), it is easy to amplify the success probability to be at least 1 O( 1 ) for any fixe j 1. n j 5 An Improve O ( min{n, (n 2 ln 3 n)/} ) Boun on the Runtime In this section we give an improve analysis of the runtime of the Hopcroft-Karp algorithm on the subsample graph, ultimately leaing to a boun of O ( min{n, (n 2 ln 3 n)/} ) for our algorithm. The main ingreients of our analysis are (1) a ecomposition of the graph G into O(n/) vertex-isjoint Ω()-ege-connecte subgraphs, (2) a moification of the uncrossing argument that reveals properties of sufficiently unbalance witness sets in the sample graph obtaine in step S2, an (3) an upper boun on length of the shortest augmentating path in the sample graph relative to any matching of size smaller than n 2n/. 5.1 Combinatorial uncrossings Theorem 5.2 below, which we state for general bipartite graphs, requires a variant of the uncrossing theorem that we formulate now. We introuce the efinition of combinatorial uncrossings: Definition 5.1 Let R be any collection of pairs (A, B), A P, B Q. A combinatorial uncrossing of R is a tuple (T, I), where T is another collection an I is a mapping from R to subsets of T, such that the following properties are satisfie: Q1: For all (A, B) R 1. {W (A, B )} (A,B ) I(A,B) are isjoint; 2. {C(A, B )} (A,B ) I(A,B) are isjoint; 3. {A B } (A,B ) I(A,B) are isjoint; 4. A A, B B for all (A, B ) I(A, B); 5. W (A, B) = C(A, B) = (A,B ) I(A,B) (A,B ) I(A,B) W (A, B ) C(A, B ). 11

Q2: (Half-injectivity) There cannot be three istinct pairs (A, B), (A, B ), (A, B ) in T such that C(A, B) = C(A, B ) = C(A, B ). The proof of existence of combinatorial uncrossings is along the lines of the proof of existence of γ-thick uncrossings, so we omit it here. For a graph H we enote W H (A, B) = W (A, B) E(H) an C H (A, B) = C(A, B) E(H), an omit the subscript when the unerlying graph is fixe. Theorem 5.2 There exists a constant c > 0 such that for all ɛ > 0 such that for all bipartite graphs G = (P, Q, E) on 2n vertices with a minimum cut of size at least κ the following hols. If a graph G is obtaine by sampling eges of G uniformly at ranom with probability p > c ln n, then w.h.p. for all A P, an ɛ 2 κ B Q, we have p W G (A, B) ɛp C G (A, B) W G (A, B) p W G (A, B) + ɛp C G (A, B). Proof: Define R as the set of pairs (A, B), A P V (G), B Q V (G). Denote a combinatorial uncrossing of R by (T, I). We first prove the statement for pairs from T, an then exten it to pairs from R to obtain the esire result. Consier a pair (A, B) T. Denote G (A, B) = W G (A, B) p W G (A, B). We shall write W (A, B) an C(A, B) instea of W G (A, B) an C G (A, B) in what follows for brevity. We have by Chernoff bouns that for a given pair (A, B) T [ ( ) ] ɛ C(A, B) 2 p W (A, B) Pr [ G (A, B) > ɛp C(A, B) ] < 2 exp W (A, B) 3 [ ( )] p C(A, B) 2 exp ɛ 2 3 since C(A, B) W (A, B). Since T satisfies Q2, we get that Pr [ (A, B) T : G (A, B) > ɛp C(A, B) ] < 2 exp [ ɛ 2 p C(A, B) /3 ] 4 W (A,B) W (T ) C(A,B) C(T ) exp [ ɛ 2 p C(A, B) /3 ] = O(n r ) for c = 3(r + 2) by Corollary 2.4 in [10]. This implies that for c 3(r + 2) we have with probability 1 O(n r ) for all (A, B) T G (A, B) ɛp C(A, B). (7) Now consier any pair (A, B) R. Summing (7) over all (A, B ) I(A, B) an using properties Q1.1-5, we get G (A, B) ɛp C(A, B ) = ɛp C(A, B), for all (A, B) R as require. 5.2 Decomposition of the graph G (A,B ) I(A,B) Corollary 5.6, which relates the size of sufficiently unbalance witness sets in the sample graph to the size of the corresponing cuts is the main result of this subsection. It follows from theorem 5.2 an a stronger (than [8]) ecomposition of bipartite -regular graphs that we outline now. For A V (G) we enote the set of eges in the cut (A, V (G) \ A) in G by δ(a). 12

Theorem 5.3 Any -regular graph G with 2n vertices can be ecompose into vertex-isjoint inuce subgraphs G 1 = (P 1, Q 1, E 1 ), G 2 = (P 2, Q 2, E 2 ),..., G k = (P k, Q k, E k ), where k 4n/ + 1, that satisfy the following properties: 1. The minimum cut in each G i is at least /8. 2. k+1 i=1 δ G(V (G i )) 2n. To prove Theorem 5.3, we give a proceure that ecomposes the graph G into vertex-isjoint inuce subgraphs G 1 (P 1, Q 1, E 1 ), G 2 (P 2, Q 2, E 2 ),..., G k (P k, Q k, E k ), k 4n/ + 1 such that the min-cut in G j is at least /8 an at most n eges run between pieces of the ecomposition. The proceure is as follows. Initialize H 1 := G, an set i := 1. 1. Fin a smallest proper subset X i V (H i ) such that δ Hi (X i ) < /4. If no such set exists, efine G i to be the graph H i an terminate. 2. Define G i to be the subgraph of H i inuce by vertices in X i, i.e. X i = P i Q i = V (G i ). Also, efine H i+1 to be the graph H i with vertices from X i remove. 3. Increment i an go to step 1. We now prove that the output of the ecomposition proceure satisfies the properties claime above. Lemma 5.4 The min-cut in G i is greater than /8. Proof: If G i contains a single vertex the min-cut is infinite by efinition, so we assume wlog that G i contains at least two vertices. The proof is essentially the same as the proof of property P1 of the ecomposition proceure in [8] (see Theorem 2.4). Suppose that there exists a cut (V, V c ) in G i where V V (G i ) an V c = V (G i )\V, such that δ Gi (V ) /8 (note that it is possible that V P i an V Q i ). We have δ Hi (V ) \ δ Gi (V ) + δ Hi (V c ) \ δ Gi (V c ) < /4 by the choice of X i in (1). Suppose without loss of generality that δ Hi (V ) \ δ Gi (V ) < /8. Then δ Hi (V ) < /4 an V X i, which contraicts the choice of X i as the smallest cut of value at most /4 in step (1) of the proceure. Lemma 5.5 The number of steps in the ecomposition proceure is k 4n/, an at most n eges are remove in the process. Proof: We call a vertex v V (G i ) ba if its egree in G i is smaller than /2. Note that for each 1 i k either G i contains a ba vertex or V (G i ). Note that since strictly fewer than /4 eges are remove in each iteration, the number of ba vertices create in the first j iterations is strictly less than j(/4)/(/2) = j/2. Hence, uring at least half of the j iterations at least vertices were remove from the graph, i.e. j V (G i ) (j/2) = j/2. i=1 This implies that the process terminates in at most 4n/ steps, an the number of eges remove is at most (4n/) /4 = n. Proof of Theorem 5.3: The proof follows by putting together lemmas 5.4 an 5.5. We overloa notation here by enoting W (B, A) = W (P \ A, Q \ B) = C(A, B) \ W (A, B) for A P, B Q. The main result of this subsection is 13

Corollary 5.6 Let G = (P, Q, E ) be a graph obtaine by sampling the eges of a -regular bipartite graph G = (P, Q, E) on 2n vertices inepenently with probability p. There exists a constant c > 0 such that if p > c ln n ɛ 2 then whp for all pairs (A, B), A P, B Q, A B + 2n/ one has that W (A, B) E > 1 3ɛ 1+3ɛ W (B, A) E for all ɛ < 1/10. In particular, G contains a matching of size at least n 2n/ whp. Proof: Set A i = A P i, B i = B Q i, where G i = (P i, Q i, E i ) are the pieces of the ecomposition obtaine in Theorem 5.3. For each (A i, B i ) such that G i is not an isolate vertex we have by Lemma 5.4 an Theorem 5.2 W Gi (A i, B i ) E p W Gi (A i, B i ) < ɛp C Gi (A i, B i ). If G i is an isolate vertex, we have W Gi (A i, B i ) E = p W Gi (A i, B i ) = 0. Since the latter estimate is stronger than the former, we shall not consier the isolate vertices separately in what follows. Aing these inequalities over all i we get k W Gi (A i, B i ) E p i=1 k W Gi (A i, B i ) ɛp i=1 k C Gi (A i, B i ). (8) Denote the set of eges remove uring the ecomposition process by E r. Denote E 1 = E r W (A, B) an E 2 = E r W (B, A). Since W (A, B) E = k i=1 W G i (A i, B i ) E + E 1 E an k i=1 W G i (A i, B i ) = W (A, B) E 1, this implies W (A, B) E p W (A, B) ɛp C(A, B) p E 1. (9) Likewise, since W (B, A) = W (P \ A, Q \ B), we have i=1 W (B, A) E p W (B, A) + ɛp C(A, B) + p E 2. Since A B + 2n/, we have W (A, B) W (B, A) + 2n, so W (A, B) E p W (A, B) ɛp C(A, B) p E 1 p( W (B, A) + 2n) ɛp C(A, B) p E 1 p E 2 W (B, A) E 2ɛp C(A, B) + p(2n E r ) W (B, A) E 2ɛp C(A, B) + pn. Aing (9) for the pairs (A, B) an (B, A), we get C(A, B) E (1 2ɛ)p( C(A, B) n), i.e. p C(A, B) 1 1 2ɛ C(A, B) E + pn. Hence, we have W (A, B) E W (B, A) E 2ɛp C(A, B) + pn W (B, A) E 2ɛ 1 2ɛ C(A, B) E + (1 2ɛ)pn W (B, A) E 2ɛ 1 2ɛ ( W (A, B) E + W (B, A) E ) + (1 2ɛ)pn, which implies for ɛ < 1/10. This completes the proof. W (A, B) E > 1 3ɛ 1 + 3ɛ W (B, A) E Remark 5.7 The result in corollary 5.6 is tight up to an O(ln ) factor for = Ω( n). 14

Proof: The following construction gives a lower boun of n Ω ( n ln ). Denote by Gn, the graph from Theorem 4.1 in [8] an enote by G n, a graph obtaine by sampling eges of G n, at the rate of c ln n for a constant c > 0. Define the graph G as isjoint copies of G 2 ln,, an enote the sample graph by G. Note that by Theorem 4.1 the maximum matching in each copy of G 2 ln, has size at most 2 ln 1 whp, an since the number of vertices in G is N = 2 2 ln, the maximum matching in G has size at most N Ω ( N ln ) whp. 5.3 Runtime analysis of the Hopcroft-Karp algorithm In this section we erive a boun on the runtime of the Hopcroft-Karp algorithm on the subsample graph obtaine in step S2 of our algorithm. The main object of our analysis is the alternating level graph, which we now efine. Given a partial matching of a graph G = (P, Q, E), the alternating level graph is efine inuctively. Define sets A j an B j, j = 1,... (, L as follows. Let A 0 be the set of unmatche vertices in ) P an let B 0 =. Then let B j+1 = Γ(A j ) \ i<j B i, where Γ(A) is the set of neighbours of vertices in A V (G), an let A j be the set of vertices matche to vertices from B j. The construction terminates when either B j+1 contains an unmatche vertex or when B j+1 =, an then we set L = j. We use the notation A (j) = k j A k, B (j) = k j B k. We now give an outline of the Hopcroft-Karp algorithm for convenience of the reaer. Given a non-maximum matching, the algorithm starts by constructing the alternating level graph escribe above an stops when an unmatche vertex is foun. Then the algorithm fins a maximal set of vertex-isjoint augmenting paths of length L (this can be one by epth-first search in O(m) time) an performs the augmentations, thus completing one augmentation phase. It can be shown that each augmentation phase increases the length of the shortest augmenting path. Stanar analysis of the run-time for general bipartite graphs is base on the observation that once n augmentations have been performe, the constructe matching necessarily has size at most n smaller than the maximum matching. We enote the graph obtaine by sampling eges of G inepenently with probability p = c ln n for a constant c > 0 by G, an let G be a graph obtaine by aing an arbitrary set of eges of G to G. For A V (G) enote the set of eges in the cut (A, V (G) \ A) in G by δ(a) an the set of eges in the same cut in G by δ (A). Similarly, we enote the vertex neighbourhoo of A in G by Γ(A) an the vertex neighbourhoo in G by Γ (A). We consier the alternating level graph in G an prove that whp for any partial matching of size smaller than n 2n/ for each 1 j L either B j 1 B j B j+1 = Ω() or B j expans by at least a factor of ln n in either forwar or backwar irection ( B j+1 (ln n) B j or B j 1 (ln n) B j ). This implies that L = O ( n ln ln ln n), thus yieling the same boun on the length of the shortest augmenting path by virtue of corollary 5.6. The main technical result of this subsection is Lemma 5.8 Let the set of eges E be obtaine by sampling eges of a bipartite -regular graph G = (P, Q, E) on 2n vertices uniformly with probability p. Let G = (P, Q, E ) an G = (P, Q, E E ), where E is an arbitrary subset of E. There exist constants c > 0, ɛ > 0 such that if p c ln n ɛ 2, then whp for any partial matching in G of size smaller than n 2n/ there exists an augmenting path of length O ( n ln ln ln n). The following expansion property of the graph G will be use to prove lemma 5.8: Lemma 5.9 Define γ(t) = (1 exp( t))/t. For all ɛ, t > 0 there exists a constant c > 0 that epens on t an ɛ such that if G is obtaine by sampling the eges of G inepenently with probability p > c ln n, then whp for every set A P, A t/p Γ (A) (1 ɛ)pγ(t) A. The corresponing claim also hols for B Q, B t/p. 15