arxiv:cond-mat/ v1 3 Oct 2002


Metric structure of random networks

S.N. Dorogovtsev, J.F.F. Mendes, and A.N. Samukhin

Departamento de Física and Centro de Física do Porto, Faculdade de Ciências, Universidade do Porto, Rua do Campo Alegre 687, 4169-007 Porto, Portugal
A.F. Ioffe Physico-Technical Institute, 194021 St. Petersburg, Russia

We propose a consistent approach to the statistics of the shortest paths in random graphs with a given degree distribution. This approach goes further than the usual tree ansatz and rigorously accounts for loops in a network. We calculate the distribution of shortest-path lengths (intervertex distances) in these networks and a number of related characteristics for networks with various degree distributions. We show that in the large network limit this extremely narrow intervertex distance distribution has a finite width while the mean intervertex distance grows with the size of a network. The size dependence of the mean intervertex distance is discussed for various situations.

Key words: networks, random graphs, intervertex distance, connected components
PACS numbers: 05.10.-a, 05.40.-a, 05.50.+q, 87.18.Sn

I. INTRODUCTION

An intervertex distance in a network is naturally defined as the length of the shortest path between a pair of vertices. Thus the statistics of intervertex distances, that is, the intervertex distance distribution, determines the metric structure of a random network. This distribution is the basic structural characteristic of random networks, which have been under extensive study by physicists in recent years (e.g. see Refs. [,,3,4]). Networks with fat-tailed degree distributions show a number of exciting effects (see Refs. [5,6,7]) and are especially intriguing. (By definition, the degree is the total number of connections of a vertex, sometimes called the "connectivity of a vertex"; a degree distribution is the distribution of the degrees of vertices.) The intervertex distance distribution has been obtained only for several very specific graphs.
Even for basic uncorrelated random networks with a given degree distribution, the first moment of the intervertex distance distribution, that is, the mean intervertex distance (the mean shortest-path length of a network), has only been estimated [,]. This estimate used the important fact that these networks are locally tree-like. This is not true on a large scale. In the recent paper [8] the presence of loops was taken into account (see also Ref. [9]) for estimating the mean intervertex distance of networks with a fat-tailed degree distribution.

In this paper we propose a rigorous approach which takes into account both the locally tree-like structure of uncorrelated networks and the presence of loops on a large scale. This approach allows us to explicitly calculate the intervertex distance distribution and its moments, and to describe their dependence on the size of a network.

Our approach is valid for uncorrelated random networks with a given degree distribution. These basic equilibrium networks are graphs which are maximally random under the constraint that their degree distribution is given. In graph theory these networks (loosely speaking, one of their versions) are called labelled random graphs with a given degree sequence, or the configuration model [,3,4,5]. These networks are a starting point for the study of the effects of complex degree distributions, and so are of fundamental importance.

The (uncorrelated) random graph with a given degree distribution can be constructed in the following way. Take $N$ vertices. Attach spines to the vertices, $\{q_i\}$, $i = 1, \dots, N$, according to a given sequence $\{N(q)\}$, $q = 1, 2, \dots$, where $N = \sum_q N(q)$, so that the vertices look like a family of hedgehogs. Connect pairs of spines at random. This procedure provides the maximally random graph with a given degree distribution $\Pi(q) = N(q)/N$. Without loss of generality we can set the number of zero-degree (i.e. isolated) vertices to zero, $\Pi(0) = 0$.
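The spine-matching construction described above can be sketched in a few lines of Python. This is only an illustrative sketch: the function name `configuration_model`, the example degree sequence and the fixed seed are assumptions made here, not part of the paper. Unlike the ensemble considered in the paper, the sketch does not reject tadpoles or multiple edges; it simply pairs the spines uniformly at random.

```python
import random
from collections import Counter

def configuration_model(degrees, seed=0):
    """Pair spines (stubs) uniformly at random and return the edge list.

    Illustrative sketch: self-loops and multiple edges are NOT rejected.
    """
    if sum(degrees) % 2:
        raise ValueError("the total number of spines must be even")
    rng = random.Random(seed)
    # One list entry per spine: vertex v appears degrees[v] times.
    stubs = [v for v, q in enumerate(degrees) for _ in range(q)]
    rng.shuffle(stubs)
    # Consecutive entries of the shuffled list form the edges.
    return list(zip(stubs[0::2], stubs[1::2]))

degrees = [3, 3, 2, 2, 1, 1]          # hypothetical degree sequence {q_i}
edges = configuration_model(degrees)
# Count edge ends per vertex (a self-loop contributes two ends).
realized = Counter(v for edge in edges for v in edge)
print(edges)
print([realized[v] for v in range(len(degrees))])
```

By construction every vertex ends up with exactly the prescribed number of edge ends, so the realized degree sequence coincides with the input sequence.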
The main global topological properties of such networks are governed by the parameter [,6,7]

$$ z_1 = \frac{\langle q^2\rangle}{\langle q\rangle} - 1, \tag{1} $$

which is the ratio of the mean numbers of the second- and first-nearest neighbours of a vertex in the network. Here $\langle q\rangle$ and $\langle q^2\rangle$ are the first and the second moments of the degree distribution. For $z_1 < 1$, all the connected components of the network remain finite in the infinite network limit (by definition, this is the thermodynamic limit). If $z_1 > 1$, the giant connected component arises, whose size is proportional to the size of the whole network. The condition for the emergence of the giant connected component [6,7], $z_1 > 1$, may be written as

$$ \langle q^2\rangle - 2\langle q\rangle = \sum_q q(q-2)\,\Pi(q) = \sum_{q\ge 3} q(q-2)\,\Pi(q) - \Pi(1) > 0. \tag{2} $$

So the giant connected component is formed only if the fraction of dead ends, $\Pi(1)$, is sufficiently small. If dead ends are absent, the giant connected component exists and, in the thermodynamic limit, includes almost all vertices. (We do not consider the case when the network consists solely of vertices of degree two.)

We present a consistent approach allowing rigorous calculations of the intervertex distance statistics within a giant connected component in such random networks. The main objects we consider are the sizes of the connected components of a vertex in the graph (see a schematic view of the structure of an uncorrelated network in fig. 1). The $n$-th order connected component of a vertex consists of all the vertices within the first $n$ coordination spheres of the vertex; in other words, the distance to the central vertex in the $n$-th connected component does not exceed $n$. Obviously, in a random graph the size of the connected component is a fluctuating random variable.

FIG. 1. Connected components of a vertex. The first three components, shown inside the shaded area, are trees. The higher ones, shown outside the shaded area, are assumed to contain a finite fraction of the network and, therefore, may contain closed loops.

The idea of our method is to construct a recurrence relation expressing the size distribution of the $(n+1)$-th connected component through that of the $n$-th connected component. This relation can be derived in two limiting cases: when the size of a connected component is negligibly small compared to the size of the network, and when a connected component is a finite fraction of an infinitely large network. Sewing together the results in these limiting cases yields the complete set of connected component size distributions.
In particular, this allows us to obtain the intervertex distance distribution in random graphs. The main results of this paper are as follows:

(1) We find an explicit expression for the mean intervertex distance, $\ln N/\ln z_1 + \text{const}$. In Ref. [ ] this result was obtained as an estimate; here we present an exact result with an exact constant.

(2) We obtain the form of the intervertex distance distribution and show that in the networks under consideration almost all vertices within a giant connected component are nearly equidistant. More precisely, we find that the mean square deviation of an intervertex distance is finite in the infinite network.

To find the intervertex distance distribution function, one has to solve a functional equation whose form is determined by the degree distribution of the network. Sometimes this can be done explicitly (two examples are considered in the paper). However, even in the general case, all essential features of the distance distribution can be reproduced analytically. First, the cumulative distance distribution $Q(d, N)$ (the probability that the intervertex distance is less than or equal to $d$) turns out to be a function of $l = d - \bar d(N)$ only, $Q(l, N) = Q(l)$, where $\bar d(N) \cong \ln(AN)/\ln z_1$ is the average intervertex distance and $A$ is a number of the order of unity.

Second, we find both asymptotics of $Q(l)$ at large deviations $l$ of distances from the mean value, positive and negative. At large negative $l$, the asymptotics is determined by the first two moments of the degree distribution, $Q(l) \propto z_1^{\,l}$. The asymptotics of $Q(l)$ on the other side, at large positive $l$, is determined by the vertices with the lowest degrees. Obviously, $Q(l) \to m_\infty^2$ as $l \to \infty$, where $m_\infty$ is the capacity of the giant connected component, because $m_\infty^2$ is precisely the probability that a randomly chosen pair of vertices is interconnected. If the lowest degree is either one or two, then $m_\infty^2 - Q(l)$ at $l \to +\infty$ decreases exponentially with a linear preexponential factor. If the lowest degree of a vertex is three or higher, then it decreases faster than exponentially.

The scheme suggested here works only if the parameter $z_1$ is finite, which means the convergence of the first and second moments of the degree distribution in the thermodynamic limit. This is not the case if the degree distribution asymptotically behaves as $\Pi(q) \propto q^{-\gamma}$ with $\gamma \le 3$ at large degrees $q$. We have studied the case of $\Pi(q) \propto q^{-\gamma}\exp(-q/q_0)$, $2 < \gamma < 3$, with a large but finite value of the cut-off parameter $q_0$. As a result, we have found that in this case $\bar d \propto \ln N/\ln q_0$, and $Q(l)$ is independent both of the system size and of the cut-off parameter. Again, the mean square deviation of intervertex distances is of the order of unity, and all the vertices in the giant connected component are nearly equidistant. This result is valid in the limit $N \to \infty$, when we assume that the cut-off parameter is large. In reality, in finite-size networks a degree distribution has some natural size-dependent cut-off. How the cut-off parameter varies with the size of the network depends on the details of the construction procedure. For example, in the configuration model the position of the cut-off depends on how the limit $N(q)/N \to \Pi(q)$ is approached.
We show that the picture described above remains valid if the cut-off parameter grows with the network size sufficiently slowly, namely, not faster than $\ln q_0 \sim \ln N/\ln\ln N$. Thus, for scale-free networks with $2 < \gamma < 3$, we arrive at a stable distribution of intervertex distances around their steadily growing mean value $\bar d(N)$, provided the construction procedure ensures a not very fast growth of the degree distribution cut-off with the network size. In this situation, $\bar d$ grows with $N$ slower than $\ln N$ but faster than $\ln\ln N$.

The paper is organized as follows. In Section II we define the main notions. In Section III we recall and substantially refine the approach of Ref. [ ], which is based on the tree ansatz and is valid for finite-size connected components in the infinite network. In Section IV we present the recursion relation between the sizes of the $n$-th and $(n+1)$-th connected components in the limit when these sizes are both infinite, taking loops into account. In Section V we explain how the results of the two previous sections can be sewed together in the region where the size of a connected component is large compared to unity but small compared to the size of the whole network. In Section VI the results of the previous sections are briefly summed up and general results for various quantities of interest are presented. In Section VII, as an illustrative example, we present an exact analytical solution for the uncorrelated network with the degree distribution $\Pi(q) = Cq^{-1}\zeta^{q}$, $\zeta < 1$. In Section VIII the network with the degree distribution $\Pi(q) \propto q^{-\gamma}\zeta^{q}$, $2 < \gamma < 3$ and $\zeta < 1$, $1 - \zeta \ll 1$, is studied. In Section IX we summarize the results obtained in the paper and discuss the size dependence $\bar d(N)$ in situations when it grows slower than $\ln N$, e.g. as $\ln\ln N$ or $\ln N/\ln\ln N$. Some technical details are presented in two Appendices.

II. DEFINITIONS

A graph consists of vertices connected by edges. An undirected graph is described by its symmetric adjacency matrix $\hat a$.
Elements of this matrix are either $a_{ij} = a_{ji} = 1$, if vertices $i$ and $j$ are connected, or $a_{ij} = a_{ji} = 0$ otherwise. We consider only graphs with $a_{ii} = 0$, that is, graphs without tadpoles (edges with both ends attached to the same vertex). The degree of a vertex, $q_i$ (sometimes called the vertex connectivity), is the number of edges attached to the vertex: $q_i = \sum_j a_{ij}$.

Random networks are usually described in terms of a statistical ensemble: the set of graphs $G$ with corresponding statistical weights, a non-negative function $P(g)$, $g \in G$, defined on this set [8,9,,]. Let us consider a statistical ensemble of undirected graphs, each of which contains $N$ vertices. Let us choose an ensemble characterized by a degree distribution $\Pi(q)$ and maximally random otherwise. Several ensembles, equivalent in the thermodynamic limit $N \to \infty$, may be used [ ]. For example, one can use a microcanonical one, usually referred to as the configuration model. Here we ascribe equal statistical weights to all possible graphs with $N = \sum_q N(q)$ vertices, $N(q)$ of which have degree $q$, $q = 1, 2, \dots$ (without loss of generality we can exclude the possibility that a vertex is of zero degree). We assume that in the thermodynamic limit, $N \to \infty$, $N(q)/N \to \Pi(q)$. For this ensemble, we have the degree distribution:

$$ \langle \delta_K(q_i - q)\rangle = \frac{1}{N}\sum_{i=1}^{N}\delta_K(q_i - q) = \Pi(q). \tag{3} $$

One can show that in the thermodynamic limit even the degrees of nearest-neighbour vertices are uncorrelated in such networks:

$$ N\,\langle \delta_K(q_i - q)\,a_{ij}\,\delta_K(q_j - q')\rangle = \frac{qq'}{\bar q}\,\Pi(q)\,\Pi(q'), \qquad i \ne j, \tag{4} $$

where $\bar q = L/N$ is the average vertex degree. We introduced the notation $\delta_K(q - q')$ for the Kronecker symbol $\delta_{qq'}$. Relation (4) plays a crucial role. In fact, the scheme presented here is based on this relation.

We call the set of vertices for which the shortest distance from some vertex equals $n$ the $n$-th shell of this vertex. The union of the shells of a vertex from the zeroth to the $n$-th one inclusively is called the $n$-th (connected) component of the vertex.

Following [ ], it is convenient to use the degree distribution in Z-representation (sometimes this object is called the generating function of the distribution):

$$ \varphi(x) = \frac{1}{N}\sum_{i=1}^{N}\langle x^{q_i}\rangle = \sum_q \Pi(q)\,x^{q}. \tag{5} $$

Another useful quantity is what may be called the edge multiplication distribution function $\Pi_1(q)$. This is the conditional probability that, in a connected pair of vertices, a vertex has degree $q + 1$:

$$ \Pi_1(q) = \frac{\langle a_{ij}\,\delta_K(q_i - q - 1)\rangle}{\langle a_{ij}\rangle} = \frac{\sum_j\langle a_{ij}\,\delta_K(q_i - q - 1)\rangle}{\sum_j\langle a_{ij}\rangle} = \frac{(q+1)\,\Pi(q+1)}{\bar q}, \tag{6} $$

or, in Z-representation,

$$ \varphi_1(x) = \sum_q \Pi_1(q)\,x^{q} = \frac{\langle a_{ij}\,x^{q_i - 1}\rangle}{\langle a_{ij}\rangle} = \frac{\langle q_i\,x^{q_i - 1}\rangle}{\langle q_i\rangle} = \frac{\varphi'(x)}{\varphi'(1)}. \tag{7} $$

Here we use the fact that in our ensemble all pairs of vertices are statistically equivalent. $\Pi_1(q)$ may also be thought of as the probability that, choosing a random edge (but not a vertex!) and going along it in one of the two directions, we arrive at a vertex which has degree $q + 1$, so that there exist $q$ different possibilities to move further.

III. MICROSCOPIC COMPONENTS

By microscopic components of a vertex we mean the components of a size negligible compared to the size of the network. The role of $\varphi_1(x)$ may be understood from the following reasoning. Let us choose a random vertex $i$ of degree $q_i$. Assume that its $n$-th connected component is a tree. Then it consists of $q_i$ trees, one generated by every edge attached to the vertex. Let $S_n^{(j)}$, $j = 1, \dots, q_i$, be the number of vertices in such a tree.
Obviously, the total number of vertices in the $n$-th component is $M_n = 1 + \sum_{j=1}^{q_i} S_n^{(j)}$, and the $S_n^{(j)}$, by the definition of the statistical ensemble under consideration, are identically distributed, independent (in the thermodynamic limit) random variables. For the distribution of $M_n$, we have in Z-representation:

$$ \Phi_n(x) \equiv \langle x^{M_n}\rangle = x\,\big\langle\big(\langle x^{S_n}\rangle\big)^{q}\big\rangle = x\,\varphi[F_n(x)], $$

where $F_n(x) = \langle x^{S_n}\rangle$ is the distribution function of $S_n$, the number of vertices in the $n$-th order tree formed by a randomly chosen edge. We have $S_{n+1} = 1 + \sum_{j=1}^{q} S_n^{(j)}$, where the distribution function of $q$ is $\varphi_1(x)$ in Z-representation. Then we finally obtain for the size distribution of the $n$-th component:

$$ \Phi_n(x) = x\,\varphi[F_n(x)], \qquad F_{n+1}(x) = x\,\varphi_1[F_n(x)], \qquad F_1(x) = x. \tag{8} $$

As $n \to \infty$ and $|x| < 1$, $F_n(x) \to F(x)$, where $F(x)$ describes the size distribution of finite components attached to a randomly chosen edge. Then $H(x) = x\,\varphi[F(x)]$ is the finite-component size distribution. Note that $t_c \equiv F(1) = \lim_{x\to 1}F(x)$ is the stable fixed point of the recursion relation $t_{n+1} = \varphi_1(t_n)$. Taking into account that $\varphi_1(x)$ is monotonically increasing and convex downward for $0 < x < 1$, and $\varphi_1(1) = 1$, one can conclude that $t_c = 1$ if $\varphi_1'(1) \equiv z_1 \le 1$, and $t_c < 1$ if $z_1 > 1$. In the latter case we have $H(1) = \varphi(t_c) < 1$. But $H(1)$ is the probability that a randomly chosen vertex belongs to some finite component. This means that when

$$ z_1 = \varphi_1'(1) = \frac{\varphi''(1)}{\varphi'(1)} > 1, \tag{9} $$

a giant connected component appears in the network. Its capacity (the probability that a vertex belongs to the giant component) is

$$ m_\infty = 1 - \varphi(t_c). \tag{10} $$

The average number of vertices in the $n$-th connected component of a vertex is $\langle M_n\rangle = \Phi_n'(1) = 1 + z\,F_n'(1)$, $z \equiv \varphi'(1)$. One can easily find from Eq. (8) that $F_n'(1) = (z_1^{\,n} - 1)/(z_1 - 1) \sim z_1^{\,n}$ as $n \to \infty$, where $z_1 \equiv \varphi_1'(1) = \varphi''(1)/\varphi'(1)$. Let us introduce, instead of $F_n$, the sequence of functions $f_n$ defined by

$$ F_n(x) = f_n\!\left(\frac{z_1^{\,n} - 1}{z_1 - 1}\,\ln\frac{1}{x}\right). $$

The recursion relation then turns into

$$ f_{n+1}(y) = \exp\!\left(-\frac{z_1 - 1}{z_1^{\,n+1} - 1}\,y\right)\varphi_1\!\left[f_n\!\left(y\,\frac{z_1^{\,n} - 1}{z_1^{\,n+1} - 1}\right)\right], \qquad f_1(y) = e^{-y}. $$

As $n \to \infty$, it may be replaced with

$$ f_{n+1}(y) = \varphi_1[f_n(y/z_1)], \qquad f_1(y) = e^{-y}. \tag{11} $$

From $\varphi_1(1) = 1$ it follows that $f_n(0) = F_n(1) = 1$. Also, we now have $f_n'(0) = -1$ independent of $n$. Taking into account that $\varphi_1(x)$ is analytic, monotonically growing and convex downward at $0 < x < 1$, one can prove that the sequence $f_n$ converges as $n \to \infty$ to some function $f(y)$. The latter may be found from the stationarity condition:

$$ f(y) = \varphi_1[f(y/z_1)]; \qquad f(0) = 1, \quad f'(0) = -1. \tag{12} $$

The above conditions determine $f(y)$ uniquely. One can check this, e.g., by taking successive derivatives of Eq. (12) at $y = 0$, which allows us to express $f^{(k)}(0)$ through $f^{(l)}(0)$, $l < k$. Then we have asymptotically at $n \to \infty$:

$$ \Phi_n(x) = \varphi\!\left[f\!\left(\frac{z_1^{\,n}}{z_1 - 1}\,\ln\frac{1}{x}\right)\right]. \tag{13} $$

The distribution function for the size of the $n$-th connected component, $M_n$, in the usual representation, $P_n(M)$, is the inverse Z-transform of $\Phi_n$:

$$ P_n(M) = \oint_{|x| = 1 - \delta}\frac{dx}{2\pi i}\,\Phi_n(x)\,x^{-M-1}. \tag{14} $$

Taking into account Eq. (12), we have in the limit $n \to \infty$:

$$ P_n(M) = (z_1 - 1)\,z_1^{-n}\,p\!\left[(z_1 - 1)\,z_1^{-n} M\right]; \qquad p(s) = \int_{-i\infty+\delta}^{+i\infty+\delta}\frac{dy}{2\pi i}\,g(y)\,e^{sy}, \qquad g(y) = \varphi[f(y)]. \tag{15} $$

Note the order of limiting transitions adopted in this section: $\lim_{n\to\infty}\lim_{N\to\infty}$. The first is the thermodynamic limit, and only then does the order of a connected component, $n$, tend to infinity.
Several essential assumptions were made in the derivation of the above formulae. First, it was required that $\varphi'(1)$ and $\varphi''(1)$ are finite, which means that the degree distribution $\Pi(q)$ has finite first and second moments. This will be assumed everywhere below. Second, the graph must be a tree. In fact, it is sufficient that the $n$-th connected component of a vertex is a tree. But almost all $n$-th components of an infinite uncorrelated random graph are trees at finite $n$. Indeed, the probability that two vertices in the $n$-th component are connected by an edge is proportional to the ratio of the total numbers of edges inside and outside this component. If $n$ is fixed and $N \to \infty$, this ratio scales as $M_n/N$. Our final result for the component size distribution, Eqs. (12), (15), is valid in the limit $n \to \infty$, which must be taken after the thermodynamic limit $N \to \infty$. We emphasize that the order of limits is essential here.
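As a numerical illustration of this section, the fixed point $t_c$ of the recursion $t_{n+1} = \varphi_1(t_n)$ and the resulting capacity of the giant component, one minus $\varphi$ evaluated at $t_c$, can be computed directly. The sketch below is not taken from the paper: the degree distribution `PI` (half of the vertices of degree 1, half of degree 3) and all function names are hypothetical choices made here for illustration.

```python
# Hypothetical degree distribution Pi(q): an assumption for illustration only.
PI = {1: 0.5, 3: 0.5}

def phi(x):
    """Generating function phi(x) = sum_q Pi(q) x^q."""
    return sum(p * x**q for q, p in PI.items())

def dphi(x):
    """First derivative phi'(x)."""
    return sum(p * q * x**(q - 1) for q, p in PI.items())

def phi1(x):
    """Edge generating function phi_1(x) = phi'(x) / phi'(1)."""
    return dphi(x) / dphi(1.0)

z = dphi(1.0)                                          # mean degree
z1 = sum(p * q * (q - 1) for q, p in PI.items()) / z   # phi''(1) / phi'(1)

def fixed_point(t=0.0, tol=1e-14, max_iter=10_000):
    """Iterate t -> phi_1(t) from t = 0 until successive values agree."""
    for _ in range(max_iter):
        t_next = phi1(t)
        if abs(t_next - t) < tol:
            return t_next
        t = t_next
    raise RuntimeError("iteration did not converge")

t_c = fixed_point()
capacity = 1.0 - phi(t_c)   # probability that a vertex is in the giant component
print(z1, t_c, capacity)
```

For this particular choice $\varphi_1(t) = 1/4 + 3t^2/4$, so $z_1 = 3/2 > 1$, the stable fixed point is $t_c = 1/3$, and the capacity equals $22/27$.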

IV. MACROSCOPIC COMPONENTS

Now we assume a different situation: the size of the graph, $N$, and the order of a connected component, $n$, simultaneously tend to infinity. At the same time, we assume that the distribution of the capacity, $m_n = M_n/N$, of the $n$-th connected component, $p_n(m)$, tends to some limiting $N$-independent distribution. From the results of the previous section it follows that we have to assume that $z_1^{\,n}/N$ remains constant. In this case the $n$-th connected component is no longer a tree. However, it turns out to be possible to derive an exact (in the thermodynamic limit) relation between $p_n(m)$ and $p_{n+1}(m)$.

The idea is to use the law of large numbers. Assume we have the $n$-th connected component with $M_n = Nm_n$ vertices. Also, we assume that the number of edges which connect vertices inside the $n$-th component to vertices outside this component is $L_n = Nl_n$. Due to the randomness of the graph, $m_{n+1}$ and $l_{n+1}$ would be fluctuating variables even if $m_n$ and $l_n$ are fixed. However, in the thermodynamic limit the fluctuations of the intensive variables $m_{n+1}$ and $l_{n+1}$ tend to zero. So the evolution of the $n$-th connected component, as $n$ grows, is governed by a 2-d mapping: $(m_n, l_n) \to (m_{n+1}, l_{n+1})$.

This mapping may be constructed as follows. Let $\Pi^{(n)}(q)$, or $\varphi^{(n)}(y)$ in Z-representation, be the degree distribution of vertices outside the $n$-th component. (Do not confuse $\varphi^{(n)}(y)$ with $\varphi_1(y) \equiv \varphi'(y)/\varphi'(1)$.) Their total degree is $N(1 - m_n)\,\varphi^{(n)\prime}(1)$. $Nl_n$ edges of this number are to be chosen to connect with vertices of the $(n+1)$-th shell. All such choices are equiprobable, because of the nature of the statistical ensemble of graphs under consideration. So the probability that a vertex of degree $q$ outside the $n$-th component is not connected to a vertex inside the $n$-th component equals $(1 - c_n)^q$, where $c_n = l_n/[(1 - m_n)\,\varphi^{(n)\prime}(1)]$.
Then the fraction of vertices remaining outside the $(n+1)$-th component, $1 - m_{n+1}$, is given by

$$ \frac{1 - m_{n+1}}{1 - m_n} = \sum_q \Pi^{(n)}(q)\,(1 - c_n)^q = \varphi^{(n)}(1 - c_n). \tag{16} $$

Also, we have a recursive relation for the degree distribution function:

$$ \Pi^{(n+1)}(q) = \frac{1 - m_n}{1 - m_{n+1}}\,\Pi^{(n)}(q)\,(1 - c_n)^q, \tag{17} $$

or, in Z-representation,

$$ \varphi^{(n+1)}(y) = \frac{\varphi^{(n)}[(1 - c_n)\,y]}{\varphi^{(n)}(1 - c_n)}. \tag{18} $$

Repeatedly applying Eq. (18), introducing $t_n = (1 - c_{n-1})(1 - c_{n-2})\cdots(1 - c_0)$, and using $\varphi^{(0)}(x) = \varphi(x)$, one can write:

$$ \varphi^{(n)}(y) = \frac{\varphi(t_n y)}{\varphi(t_n)}. \tag{19} $$

From Eqs. (16) and (19) we obtain the relation between $t_n$ and $m_n$:

$$ 1 - m_n = \varphi(t_n). \tag{20} $$

From the definition of $t_n$ we obtain the following equation:

$$ t_{n+1} = (1 - c_n)\,t_n = t_n - \frac{t_n l_n}{(1 - m_n)\,\varphi^{(n)\prime}(1)} = t_n - \frac{l_n}{\varphi'(t_n)}, \tag{21} $$

where Eqs. (19), (20), and the definition of $c_n$ were used. Eqs. (20) and (21) express $m_{n+1}$ in terms of $m_n$ and $l_n$.

The total degree of vertices outside the $n$-th component may be written as $N(1 - m_n)\,\varphi^{(n)\prime}(1) = Nt_n\varphi'(t_n)$, and outside the $(n+1)$-th one as $Nt_{n+1}\varphi'(t_{n+1})$. Therefore, the total degree of vertices in the $(n+1)$-th shell is $N[t_n\varphi'(t_n) - t_{n+1}\varphi'(t_{n+1})]$. Of this number, $Nl_n$ edges are attached to vertices in the $n$-th component. Each of the remaining free $N[t_n\varphi'(t_n) - t_{n+1}\varphi'(t_{n+1}) - l_n]$ edges may be attached either to a vertex outside the $(n+1)$-th component, or to some other vertex in the $(n+1)$-th shell (see the figure). The respective probabilities are in the ratio of the total degree outside the $(n+1)$-th component, $Nt_{n+1}\varphi'(t_{n+1})$, to the number of free edges in the $(n+1)$-th shell. So, for the number of edges $Nl_{n+1}$ going out from the $(n+1)$-th component, we have:

$$ l_{n+1} = \left[t_n\varphi'(t_n) - t_{n+1}\varphi'(t_{n+1}) - l_n\right]\frac{t_{n+1}\varphi'(t_{n+1})}{t_n\varphi'(t_n) - l_n} = t_{n+1}\,\varphi'(t_{n+1})\left[1 - \frac{\varphi'(t_{n+1})}{\varphi'(t_n)}\right]. \tag{22} $$

Here Eq. (21) was used in the last step. Eqs. (21) and (22) define the 2-d mapping $(t_n, l_n) \to (t_{n+1}, l_{n+1})$, where $t_n$ is related to the capacity of the $n$-th component, $m_n$, by Eq. (20). This mapping can be reduced to a 1-d one, because the first integral of the 2-d mapping can easily be found. Namely, using Eq. (21), one can express $l_n$ through $t_n$ and $t_{n+1}$, and substitute this expression into Eq. (22). The result is

$$ \frac{t_{n+1}}{\varphi'(t_n)} = \frac{t_n}{\varphi'(t_{n-1})}. \tag{23} $$

Repeatedly applying this relation and taking into account the fact that the (limiting) starting point of the $(t_n, l_n)$ sequence is $(1, 0)$, we obtain the following recursion relation:

$$ t_{n+1} = \frac{\varphi'(t_n)}{\varphi'(1)} \equiv \varphi_1(t_n). \tag{24} $$

V. SEWING TOGETHER

In the last two sections we described the recursion relations of the problem under consideration in two limiting cases. Now we must sew them together. Let us define $G_l(t)$ through the recursion relation $G_{l+1}(t) = \varphi_1[G_l(t)]$ with the initial condition $G_0(t) = t$. Introducing $f_l(x)$ by $G_l(t) = f_l\big(z_1^{\,l}\ln\frac{1}{t}\big)$, we have $f_{l+1}(x) = \varphi_1[f_l(x/z_1)]$, $f_0(x) = e^{-x}$, which exactly coincides with Eq. (11). Then in the limit $l \to \infty$ we have $G_l(t) = f\big(z_1^{\,l}\ln\frac{1}{t}\big)$, where $f(x) = \lim_{l\to\infty}f_l(x)$ must be found from Eq. (12). This is the same function as that in Eq. (15). Iterating Eq. (24) $l$ times yields

$$ t_{n+l} = f\!\left(z_1^{\,l}\ln\frac{1}{t_n}\right) \cong f\!\left[z_1^{\,l}(1 - t_n)\right]. \tag{25} $$

Here it must be assumed that $t_n \to 1$ together with $l \to \infty$. The distribution functions $P_n(t_n)$ and $P_{n+l}(t_{n+l})$ are connected by the relation $P_{n+l}(t_{n+l})\,dt_{n+l} = P_n(t_n)\,dt_n$. Therefore, we obtain

$$ P_{n+l}(t) = -z_1^{-l}\,\big(f^{-1}\big)'(t)\;P_n\!\left[1 - z_1^{-l} f^{-1}(t)\right], \tag{26} $$

where $f^{-1}$ and $(f^{-1})'$ are the inverse function of $f$ and its derivative. As $n \to \infty$ and $N \to \infty$ under the condition $z_1^{\,n} \ll N$, the distribution of the capacity of the $n$-th connected component, $m_n = M_n/N$, can be obtained from Eq. (15). But in this limit we have from Eq. (20): $m_n = M_n/N = 1 - \varphi(t_n) \cong z(1 - t_n)$, $z \equiv \varphi'(1)$. Then we obtain:

$$ P_n(t) = z(z_1 - 1)\,z_1^{-n}N\;p\!\left[z(z_1 - 1)\,z_1^{-n}N(1 - t)\right]. \tag{27} $$

Substituting Eq. (27) into Eq. (26) and denoting $n + l$ as $n$ finally yields:

$$ P_n(t) = -\nu_n\,\big(f^{-1}\big)'(t)\;p\!\left[\nu_n f^{-1}(t)\right]; \qquad \nu_n \equiv z(z_1 - 1)\,z_1^{-n}N. \tag{28} $$

This formula is valid if $N \to \infty$, $n \to \infty$ without any restriction on the order of the limits.

VI. GENERAL RESULTS

Thus, we suggest a regular procedure for calculating the statistical properties of intervertex distances in a random network. This procedure is valid for any large graph with uncorrelated vertices, provided that the degree distribution $\Pi(q)$ has finite first and second moments. Quantities of interest may be expressed in terms of the function $g(y)$ and its inverse Laplace transform $p(x)$. To obtain them one has to perform the following steps:

1. Calculate the Z-transform of the distribution function, $\varphi(x)$, Eq. (5), and $\varphi_1(x) = \varphi'(x)/\varphi'(1)$.

2. Find $f(y)$, which is the solution of the equation $f(z_1 y) = \varphi_1[f(y)]$, where $z_1 = \varphi_1'(1)$, with the conditions $f(0) = 1$, $f'(0) = -1$.

3. Obtain $g(y) = \varphi[f(y)]$.

4. Calculate $p(x)$, which is the inverse Laplace transform of $g(y)$, Eq. (15).

The most nontrivial is step 2: no general methods for the analytic solution of such functional equations are known. However, the asymptotic behaviour of $f(y)$ may be easily extracted. At $y \to 0$, we have $f(y) = 1 - y + o(y)$. Therefore, $g(y) = 1 - zy + o(y)$, $z = \varphi'(1)$. As $y \to +\infty$, we have $f(y) \to t_c$, where $t_c < 1$ is the root of the equation $t_c = \varphi_1(t_c)$. At large positive $y$, one can write $f(y) = t_c + h(y)$, where $h(y) \to 0$ when $y \to +\infty$. Then Eq. (12) can be linearized with respect to $h$, which gives:

$$ h(z_1 y) = z_c\,h(y), \tag{29} $$

where $z_c \equiv \varphi_1'(t_c) < 1$. Looking for the solution in the form $h(y) = Ay^{-\alpha}$, one can easily obtain the exponent $\alpha = -\ln z_c/\ln z_1 > 0$. Then we obtain the asymptotic behaviour of $g(y)$ at large positive $y$:

$$ g(y) = \varphi(t_c) + m_\infty\,\tilde g(y) = 1 - m_\infty + m_\infty\,\tilde g(y), \qquad \tilde g(y) \cong By^{-\alpha}, \qquad \alpha = \frac{\ln(1/z_c)}{\ln z_1}. \tag{30} $$

For $p(x)$, the inverse Laplace transform of $g(y)$, we have:

$$ \int_0^\infty dx\,p(x) = g(0) = 1, \qquad \int_0^\infty dx\,x\,p(x) = -g'(0) = z \equiv \varphi'(1). \tag{31} $$

From $g(+\infty) = 1 - m_\infty$ it follows that $p(x)$ has a $\delta$-functional part, $p(x) = (1 - m_\infty)\,\delta(x) + m_\infty\,\tilde p(x)$; $\tilde p(x)$ and $\tilde g(y)$ are related through the Laplace transform. From the asymptotic expression for $\tilde g(y)$ at large $y$ follows the one for $\tilde p(x)$ at small $x$:

$$ \tilde p(x) \cong \frac{B}{\Gamma(\alpha)}\,x^{\alpha - 1}. \tag{32} $$

Various physical quantities may be expressed in terms of the functions $g(x)$ and $p(x)$. For example, the distribution function $P_n(m)$ of the relative size of the $n$-th connected component, $m_n = M_n/N$, can be obtained from Eq. (28) and the relation $m_n = 1 - \varphi(t_n)$. We have:

$$ P_n(m) = -\nu_n\,\big(g^{-1}\big)'(1 - m)\;p\!\left[\nu_n\,g^{-1}(1 - m)\right], \tag{33} $$

where $g^{-1}$ is the function inverse to $g(x)$. One can write $P_n(m) = (1 - m_\infty)\,\delta(m) + m_\infty\,\tilde P_n(m)$, where the first term corresponds to finite connected components of the graph, and the second one to the giant connected component. The order of a connected component, $n$, and the size of the graph, $N$, enter the distribution functions in the combination $\nu_n$, which may be written as

$$ \nu_n = z_1^{\,\bar n - n}, \qquad \bar n = \frac{\ln[z(z_1 - 1)N]}{\ln z_1}. \tag{34} $$

Therefore, $P_n(m, N) = P(n - \bar n, m)$.
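Steps 1 to 3 of the procedure can be carried out numerically when no closed-form solution of the functional equation is available. The sketch below rests on assumptions made here, not in the paper: a hypothetical degree distribution `PI` (half of the vertices of degree 1, half of degree 3), and an approximation of $f(y)$ obtained by applying $\varphi_1$ a fixed number of times (`depth`) to $\exp(-y/z_1^{\text{depth}})$, which is the iteration $f_{n+1}(y) = \varphi_1[f_n(y/z_1)]$ started from an exponential.

```python
import math

# Hypothetical degree distribution Pi(q): an assumption for illustration only.
PI = {1: 0.5, 3: 0.5}

def phi(x):
    """phi(x) = sum_q Pi(q) x^q (step 1)."""
    return sum(p * x**q for q, p in PI.items())

def dphi(x):
    """phi'(x)."""
    return sum(p * q * x**(q - 1) for q, p in PI.items())

def phi1(x):
    """phi_1(x) = phi'(x) / phi'(1) (step 1)."""
    return dphi(x) / dphi(1.0)

Z1 = sum(p * q * (q - 1) for q, p in PI.items()) / dphi(1.0)  # z_1 = phi_1'(1)

def f(y, depth=40):
    """Approximate solution of f(z_1 y) = phi_1[f(y)], f(0)=1, f'(0)=-1 (step 2).

    Starts from exp(-y / z_1**depth) and applies phi_1 `depth` times.
    """
    t = math.exp(-y / Z1**depth)
    for _ in range(depth):
        t = phi1(t)
    return t

def g(y, depth=40):
    """g(y) = phi[f(y)] (step 3)."""
    return phi(f(y, depth))

for y in (0.0, 0.5, 1.0, 5.0, 50.0):
    print(y, f(y), g(y))
```

For this choice of `PI` the values of `f` decrease from 1 at $y = 0$ towards the fixed point $t_c = 1/3$ at large $y$. Step 4, the inverse Laplace transform, is not attempted here, since it requires a numerical inversion scheme that is outside the scope of this sketch.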
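If $n < n_0$, $n_0 - n \gg 1$, one can write

$$ P(n - n_0, m) = z_1^{\,n_0 - n}\; p\!\left(z_1^{\,n_0 - n}\, m\right). \tag{35} $$

This limit corresponds to the sizes of connected components being infinitely small compared with the graph size. In this case, for $m \ll z_1^{\,n - n_0}$ the small-size asymptotics of the distribution function is

$$ P(n - n_0, m) \cong \frac{B}{\Gamma(\alpha)}\, z_c^{\,n - n_0}\, m^{\alpha - 1}. \tag{36} $$

In the opposite limit, $n > n_0$, $n - n_0 \gg 1$, the contribution of the giant connected component to the distribution function is concentrated near $m = m_\infty$. Here one can write, inverting the asymptotic expression (30) for $g$:

$$ P(n - n_0, m) = \frac{z_1^{\,n_0 - n}}{\alpha B} \left(\frac{m_\infty - m}{B}\right)^{-1 - 1/\alpha} p\!\left[ z_1^{\,n_0 - n} \left(\frac{m_\infty - m}{B}\right)^{-1/\alpha} \right]. \tag{37} $$

In this case, for $z_c^{\,n - n_0} \ll m_\infty - m$, we have: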

P (n n, m) ( ) zn n c B. (38) Γ (α + ) m m Let Q n be the probability that two randomly chosen vertices are separated by a distance less than or equal to n. In fact, this is a function of ν n, or, equivalently, of n n, see Eq. (34). That is, we have Q n Q (n n ). This hull function, Q (l), can be expressed as Q (l) m dm mp (l, m) dx p (x) g ( z l x)]. (39) Here we used Eq. (33) and introduced x z l g ( m) as the integration variable. At l <, l, we have g ( z l x ) z z l x in the actual region of integration. Then, taking into account Eq. (3), we obtain Q (l) z zl. (4) As l n n +, the multiple in the square brackets in the integral in Eq. (39) becomes equal to m everywhere except at x, where this multiple is zero. Then we have: lim l + Q (l) m + dx p (x) m. (4) Here the δ-functional part of p (x) is excluded from the integral. This result is obvious the distance between two vertices is less than infinity if both the vertices belong to the giant connected component. One can show (see Appendix A) that for large positive l, ] Q (l) m B ln (/z c ) (l l ) zc l, (4) Γ (α + ) where l Γ (α + ) B ln (/z c ) α B ln (/z c ) dx x α ln x ψ (α)] x p (x) (α ) p (x)] dy y α ln y ψ (α)] y g (y) + α g (y)], (43) where ψ (α) Γ (α) /Γ (α). It is convenient to characterize the distribution of distances in the graph using the size-independent (in the thermodynamic limit) probability density of l n n, R (l): The average value and the dispersion of l are equal to ] l dx p (x) ln x + γ e ln z R (l) dq (l) m. (44) dl ln z ] dy g (y) ln y γ e, (45) ( l l) ln z { ln z { ] } dx p (x) ln x dx p (x) ln x + π 6 ] } dy g (y) ln y + dy g (y) ln y + π 6 (46) (see Appendix A). γ e.5776... is the Euler-Masceroni constant. Note that the asymptotic formulae (3), (3), (36), (37), (38) and (4) are valid only if φ (t c ) and φ (t c) φ (t c ). This conditions are violated if φ (), which is the case when t c and m. 
If $\phi_1(0) = 0$, but $\phi_1'(0) \neq 0$ (vertices of degree one are absent, but vertices of degree two are present), the asymptotics of $f(y)$

at large y is again f (y) Ay α, α ln (/z c ) / ln z, z c φ () Π () / q. In this particular case, because of g (y) φ f (y)] f (y)] y α, in all formulae α must be replaced with α, and, respectively, z c with zc. The situation is different if φ () φ (). Assume that the minimal vertex degree in the network is k 3. Then at x, φ (x) Π (k) x k and φ (x) kπ (k) x k / q. So, instead of Eq. (9), we have at large y: f (y) ζ k f (y/z )] k, ζ k kπ (k) q, (47) where ζ k only if Π (q) δ K (q k) (in this case the equation for f can be solved exactly). Its solution is f (y) ζ /(k ) k exp ( Ay b) ln (k ), b <, (48) ln z with some constant positive coefficient A. Let us prove that b, i.e. z k. Indeed, qz q (k ) q (q k) Π (q). (49) The equality is possible only in the case Π (q) δ K (q k). The asymptotics of g (y) at large y is qk g (y) Π (k) f (y)] k ( q k ) k/(k ) Π (k)] /(k ) exp ( By b), (5) where B ka. The asymptotics of the function p (x) at x can be obtained by making the saddle point evaluation of the inverse Laplace transform integral in Eq. (5). We have: p (x) ( Dx ( b)/( b)] exp Cx b/( b)), C (b b/( b) b /( b)) B /( b), D (bb)/( b)] π ( b), (5) The asymptotic expressions (36) (38) must be replaced with a new ones in this case. For brevity, we present here explicitly only the asymptotic expression for the cumulative distance distribution Q (l) when the distance deviation l is positive and large, i.e. the one which replaces Eq. (4). We have: Q (l) F dx p (x) g (x/ν) dx x ( b)/( b)] exp ( Cx b/( b) Bν b x b), (5) assuming that the actual region of integration is ν x. Here F ( q/k) k/(k ) Π (k)] /(k ) D and ν z l. Then, the saddle point calculation of this integral gives: Q (l) ] Gz l/ exp Hz l/( b), (53) where G and H are some positive numbers which can be expressed in terms of b, B, k and Π (k). VII. 
NETWORKS WITH AN EXPONENTIALLY DECAYING DEGREE DISTRIBUTION

Here we present a two-parameter family of degree distributions, for which one can obtain exact analytical expressions for $P_n(t)$, and, consequently, for the intervertex distance distribution. These are the degree distributions for which $\phi_1(x)$ is a fractional linear function. These functions form a group with respect to the operation of functional composition. Indeed, the composition of any fractional linear functions is a fractional linear function, and the inverse of any fractional linear function is also a fractional linear function. Then one can look for the solution $f(y)$ of the functional equation $f(z_1 y) = \phi_1[f(y)]$ in the form of a fractional linear function too. It is more convenient to define a one-parameter family of linear fractionals $f(x)$, and then to write $\phi_1$ as $\phi_1(x) = f[z_1 f^{-1}(x)]$.
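The group property used here can be made concrete by representing a fractional linear function $x \mapsto (ax + b)/(cx + d)$ by the matrix of its coefficients, so that composition becomes matrix multiplication and inversion becomes taking the adjugate. The sketch below is illustrative only. The helper names are hypothetical, and the particular $f(y) = (t_c y + 1 - t_c)/(y + 1 - t_c)$ is an assumed example that satisfies $f(0) = 1$, $f'(0) = -1$ and $f(+\infty) = t_c$.

```python
from fractions import Fraction

def compose(A, B):
    """Coefficient matrix of x -> A(B(x)); this is the matrix product A*B."""
    (a, b), (c, d) = A
    (e, f), (g, h) = B
    return ((a * e + b * g, a * f + b * h), (c * e + d * g, c * f + d * h))

def inverse(A):
    """Coefficient matrix of the inverse map (adjugate; overall scale is irrelevant)."""
    (a, b), (c, d) = A
    return ((d, -b), (-c, a))

def apply(A, x):
    """Evaluate x -> (a*x + b) / (c*x + d)."""
    (a, b), (c, d) = A
    return (a * x + b) / (c * x + d)

def phi1_matrix(z1, tc):
    """phi1 = f o (y -> z1*y) o f^{-1} for the assumed fractional-linear f."""
    F = ((tc, 1 - tc), (1, 1 - tc))   # f(y) = (tc*y + 1 - tc) / (y + 1 - tc)
    S = ((z1, 0), (0, 1))             # rescaling y -> z1 * y
    return compose(F, compose(S, inverse(F)))

P = phi1_matrix(Fraction(2), Fraction(3, 10))
print(apply(P, Fraction(1)), apply(P, Fraction(3, 10)))
```

Both printed values are fixed points of the resulting $\phi_1$: $x = 1$, as required by normalization, and $x = t_c$.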

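Any linear fractional $f(x)$, under the conditions $f(0) = 1$ and $f'(0) = -1$, may be written as

$$ f(y) = t_c + \frac{(1 - t_c)^2}{1 - t_c + y}. \tag{54} $$

It depends on one parameter $t_c = f(+\infty)$, whose meaningful values belong to the interval $(0, 1)$. Then $\phi_1(x)$ may be expressed as

$$ \phi_1(x) = \frac{(z_1 - 1)\, t_c + (1 - z_1 t_c)\, x}{z_1 - t_c - (z_1 - 1)\, x}. \tag{55} $$

The Z-transform of the degree distribution, $\phi(x) \propto \int dx\, \phi_1(x)$, $\phi(1) = 1$, can be restored up to an additive constant. Since $\phi(0)$ is the fraction of zero-degree vertices, whose effect on the properties of the network is trivial, it is natural to set the integration constant so that such vertices will be excluded, $\phi(0) = 0$. The result is

$$ \phi(x) = \beta x + (1 - \beta)\, \frac{\ln(1 - \zeta x) + \zeta x}{\ln(1 - \zeta) + \zeta}, \tag{56} $$

where the parameters $\beta$ and $\zeta$ are connected with $z_1$ and $t_c$ as follows:

$$ z_1 = \frac{1 - \beta}{1 - \zeta}\; \frac{\zeta^2}{\zeta^2 - \beta\,[(1 - \zeta) \ln(1 - \zeta) + \zeta]}, \qquad t_c = \beta\, \frac{1 - \zeta}{\zeta}\; \frac{\ln(1 - \zeta) + \zeta}{\beta\,[(1 - \zeta) \ln(1 - \zeta) + \zeta] - \zeta^2}. \tag{57} $$

The degree distribution in the original representation is

$$ \Pi(q) = \beta\, \delta(q - 1) - [1 - \delta(q - 1)]\, \frac{1 - \beta}{\ln(1 - \zeta) + \zeta}\; \frac{\zeta^q}{q}. \tag{58} $$

In this case, the average vertex degree is

$$ \bar q \equiv z_0 = \phi'(1) = \beta - \frac{(1 - \beta)\, \zeta^2}{(1 - \zeta)\,[\ln(1 - \zeta) + \zeta]} \tag{59} $$

and the relative size of the giant connected component is

$$ m_\infty = 1 - \phi(t_c) = \frac{(1 - t_c)\,[\beta \ln(1 - \zeta) + \zeta] - (1 - \beta) \ln z_1}{\ln(1 - \zeta) + \zeta}. \tag{60} $$

[Figure: regions labelled CDN and GCC in the plane of $\beta$ versus $\zeta$.]

FIG. 2. Phase diagram of the model with an exponentially decaying degree distribution. Here GCC indicates the presence of a giant connected component in the network, and CDN a completely disconnected network. The capacity (relative size) of the giant connected component is $m_\infty = 1$ along the line $\beta = 0$.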
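The boundary between the two phases of the diagram can be checked numerically from the degree distribution itself. The sketch below is an illustration under stated assumptions, not the authors' code: it takes $\Pi(1) = \beta$ and $\Pi(q) \propto \zeta^q/q$ for $q \ge 2$, computes $z_1 = \overline{q^2}/\bar q - 1$ by direct summation truncated at a hypothetical `q_max`, and uses the criterion $z_1 > 1$ for the giant connected component. The closed form in `beta_c` is the threshold $\beta_c(\zeta)$ of Eq. (61) as read from the text; treat it as an assumption to be verified against the sums.

```python
import math

def degree_distribution(beta, zeta, q_max=5000):
    """Pi(q) for q = 0..q_max: Pi(1) = beta, Pi(q >= 2) = -(1 - beta) zeta^q / (q L)."""
    L = math.log(1.0 - zeta) + zeta          # negative for 0 < zeta < 1
    pi = [0.0] * (q_max + 1)
    pi[1] = beta
    for q in range(2, q_max + 1):
        pi[q] = -(1.0 - beta) * zeta ** q / (q * L)
    return pi

def z1_from_moments(pi):
    """z1 = <q^2>/<q> - 1, the mean branching ratio of the network."""
    m1 = sum(q * p for q, p in enumerate(pi))
    m2 = sum(q * q * p for q, p in enumerate(pi))
    return m2 / m1 - 1.0

def beta_c(zeta):
    """Assumed closed form of the threshold: z1 = 1 at beta = beta_c(zeta)."""
    return zeta ** 3 / (zeta * (2.0 * zeta - 1.0)
                        - (1.0 - zeta) ** 2 * math.log(1.0 - zeta))

for zeta in (0.2, 0.5, 0.8):
    bc = beta_c(zeta)
    z1 = z1_from_moments(degree_distribution(bc, zeta))
    print(f"zeta={zeta}: beta_c={bc:.4f}, z1 at threshold={z1:.6f}")
```

Below the threshold ($\beta < \beta_c$) the sums give $z_1 > 1$, and above it $z_1 < 1$, which matches the statement that too many dead ends destroy the giant connected component.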

The giant connected component exists if z >, which in our case corresponds to β < β c (ζ), where β c (ζ) ζ 3 ζ (ζ ) ( ζ) ln ( ζ). (6) The phase diagram of the model is shown in fig.. It should be noted, that the giant connected component disappears if the number of one-degree vertices (dead ends) exceeds some critical value. If these vertices are absent, (almost) the entire network is a single connected component. The composition function g (x) φ f (x)], which is the Laplace transform of the distribution function p (s) of s n z (z ) z n M n, M n being the size of the n-th connected component for a large but finite n, is g (y) m + β ( t c) t c + y + β ln ( ζ) + ζ The calculation of the inverse Laplace transform is straightforward: p (x) ( m ) δ (x) + β ( t c ) exp ( t c ) x] β ln ( ζ) + ζ { ] } ( tc ) /z + y ln + ζ ( t c). (6) t c + y t c + y { x exp ( t c) x/z ] exp ( t c ) x]] ζ ( t c ) exp ( t c ) x] }. (63) Below, for the sake of simplicity, we present the results for β only, when Π () and the dead ends are absent. (See results for b in fig. 3.) In this case t c and m, i.e. the giant connected component (almost) coincides with the whole graph..8 ζ..8 ζ.4 Q(l).6.4. - -5 - -5 5.8.6.4 3 4. ζ.6 ζ.8.8 3 4-4 - 4 4 Q(l).6.4. 3 4-3 - - l.6.4. 3 - -5 5 l FIG. 3. Cumulative distance distribution function Q(l) in the model with an exponentially decaying degree distribution for various values of ζ and β. When ζ., curves,, 3, and 4 correspond to β.3,.5,.5, and.5, respectively. If ζ.4, curves,, 3, and 4 correspond to β.5,.4,.,., respectively. If ζ.6, curves,, 3, and 4 correspond to β.7,.5,.3,., respectively. If ζ.8, curves,, 3, and 4 correspond to β.8,.6,.4,., respectively. The distribution function P n (m) P (n n, m) depends upon the size of the network through n : n + ln ln ( ζ) + ζ ζ 3. (64) Its dependence on m can be represented in a parametric form by introducing a parameter t, related to the size of the connected component m as

ln ( ζt) / ( ζ)] ζ ( t) m (t) φ (t). (65) ln ( ζ) + ζ Using this parametrization, Eq. (33) can be written as: P (l, t) ν f (t) m (t) p νf (t) ] { ζt ζ t exp ( ζ) ν t ] ( + ζν t ) ( exp ν t )}, ν ( ζ) l. (66) ( t) t t t Eqs. (65) and (66) determine P (l, m) in a parametric form. At small m, P (l, m) has the asymptotics: ( P (l, m) ( ζ) q ζ exp ( ζ) ν m q ) ( + ζν m q ) ( exp ν m q )], (67) m where q φ () ζ / ( ζ) ln ( ζ) + ζ ] is the average vertex degree. Equation (68) is valid if m. On the other hand, as m, ν, we have: ( )] P (l, m) ζ ζ ( m) exp ζ ( ζ) ν ( m). (68) The cumulative distance distribution Q n is a function of l n n (N): Q (l) dt m (t) m (t) P (l, t). (69) We failed to find this integral analytically, but asymptotic expressions can be presented. As l > and l, ζ ] A, l ζ + ln ( ζ)] Q (l) A (l l ) ( ζ) l, (7) γ e ζ + ln ( ζ) ( ζ) ln ( ζ) ζ ln ( ζ). (7) Here we have ( ζ) l instead of ( ζ) l in the asymptotics, because one-degree vertices are absent in the network, but vertices of degree two are present. On the other hand, as l < and l, we have ( Q (l) ζ ) ( ζ) l. (7) ln ( ζ) + ζ The position of the center of the distance distribution and its mean square deviation are given by Eqs. (45) and (46) respectively. Calculating the integrals yields l γ e ln ( ζ) ln ( ζ) ln ( ζ) + ζ, (73) ( π l l) ln ( ζ) + ln ( ζ) 3 ln ( ζ) + ζ ] ln ( ζ). (74) ln ( ζ) + ζ VIII. POWER-LAW DEGREE DISTRIBUTION WITH AN EXPONENTIAL CUT-OFF The general scheme, introduced in this paper, is applicable only if the degree distribution has finite first and second moments in the thermodynamic limit. If, for example, the degree distribution Π (q) is asymptotically a scale-free one, Π (q) q γ, at large q, and exponent γ 3, then our considerations fail. In this section we shall consider the networks with power-law degree distributions, < γ < 3, and with an exponential cut-off at large degrees. 3
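The role of the cut-off can be illustrated numerically. The sketch below is a hedged illustration, not taken from the paper: it assumes the simple form $\Pi(q) \propto q^{-\gamma} \zeta^q$ for all $q \ge 1$ (only the large-$q$ behaviour is specified in the text), with a hypothetical truncation `q_max`, and shows that for $2 < \gamma < 3$ the first moment stays bounded as $\zeta \to 1$ while the second moment grows without limit.

```python
def moments(gamma, zeta, q_max=200_000):
    """First and second moments of Pi(q) ~ q^(-gamma) * zeta^q, q = 1..q_max."""
    norm = m1 = m2 = 0.0
    cut = zeta                      # zeta**q, updated incrementally
    for q in range(1, q_max + 1):
        w = q ** (-gamma) * cut     # unnormalized weight of degree q
        norm += w
        m1 += q * w
        m2 += q * q * w
        cut *= zeta
    return m1 / norm, m2 / norm

for zeta in (0.9, 0.99, 0.999):
    mean_q, mean_q2 = moments(2.5, zeta)
    print(f"zeta={zeta}: <q>={mean_q:.3f}, <q^2>={mean_q2:.2f}")
```

With $\gamma = 2.5$ the second moment grows roughly like $(1 - \zeta)^{-1/2}$, which is why a finite cut-off is needed before the general scheme can be applied.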

The crucial point of our formalism is to find the general solution of the recursive relation $t_{n+1} = \phi_1(t_n)$, or, more precisely, to find how this solution behaves at a large number of iterations $n$. One can see that this recursion relation is easily solvable in the following case:

$$ \phi_1(x) = 1 - (1 - t_c) \left(\frac{1 - x}{1 - t_c}\right)^{\gamma - 2}, \tag{75} $$

where $2 < \gamma < 3$ and $t_c < 1$. Indeed, we have:

$$ t_n = 1 - (1 - t_c) \left(\frac{1 - t_0}{1 - t_c}\right)^{(\gamma - 2)^n}, \tag{76} $$

that is the analytic form of the solution with any initial condition. However, we need a finite value of $z_1 = \phi_1'(1)$ to apply the general scheme. Then, let us define

$$ \phi_1(x) = \frac{1 - (1 - \zeta x)^{\gamma - 2}}{1 - (1 - \zeta)^{\gamma - 2}}, \tag{77} $$

where we set the parameter $t_c$ regulating the size of the giant connected component to be equal to zero, which means that the giant connected component contains almost all the vertices. The problem can be solved for an arbitrary $t_c$, but then the results would look essentially more cumbersome. The parameter $\zeta < 1$ corresponds to the cut-off. Indeed, the Z-transformed degree distribution $\phi(x)$ may be easily obtained from Eq. (77), by integrating its right-hand side. We have:

$$ \phi(x) = \frac{(1 - \zeta x)^{\gamma - 1} - 1 + \zeta (\gamma - 1)\, x}{(1 - \zeta)^{\gamma - 1} - 1 + \zeta (\gamma - 1)}. \tag{78} $$

Then the degree distribution is

$$ \Pi(q) = \frac{1}{(1 - \zeta)^{\gamma - 1} - 1 + \zeta (\gamma - 1)}\; \frac{\sin \pi\gamma}{\pi}\; \Gamma(\gamma)\; \frac{\Gamma(q - \gamma + 1)}{q!}\; \zeta^q, \tag{79} $$

from which one can easily see that $\Pi(q) \propto q^{-\gamma} \zeta^q$ at $q \gg 1$. In the following we shall assume $1 - \zeta \ll 1$. We have to find the solution $f(y)$ of the functional equation $f(y) = \phi_1[f(y/z_1)]$, or:

$$ 1 - \left[1 - (1 - \zeta)^{\gamma - 2}\right] f(y) = \left[1 - \zeta f(y/z_1)\right]^{\gamma - 2}, \tag{80} $$

where

$$ z_1 = \phi_1'(1) \cong (\gamma - 2)\,(1 - \zeta)^{\gamma - 3}. \tag{81} $$

The function $f(y)$ must satisfy the initial conditions $f(0) = 1$ and $f'(0) = -1$. Therefore, for small enough $y$ we have: $1 - f(y) \cong y$. This approximate equality holds when $y \ll |f'(0)/f''(0)| = 1/f''(0) \sim (1 - \zeta)^{\gamma - 2}$ (see Appendix B). On the other hand, at large enough $y$ one can set in Eq. (80) $\zeta = 1$ (but not $z_1 = \infty$!). The resulting equation can be easily solved:

$$ 1 - f(y) = \left[1 - f(y/z_1)\right]^{\gamma - 2}, \qquad 1 - f(y) = \exp\!\left(-A y^{-\vartheta}\right), \tag{82} $$

$$ \vartheta = \frac{\ln[1/(\gamma - 2)]}{\ln z_1} \cong \frac{\ln(\gamma - 2)}{(3 - \gamma) \ln(1 - \zeta) - \ln(\gamma - 2)}. \tag{83} $$

The constant $A$ must be determined by sewing together the expressions for small and large $y$ (see Appendix B), which gives: $A = 1/(e\vartheta)$.
Thus, we have:

$$ f(y) = 1 - \exp\!\left(-\frac{y^{-\vartheta}}{e\vartheta}\right). \tag{84} $$

This formula is valid if y ( ζ) γ (Appendix B), i.e. for a large enough cut-off parameter ln ζ, this formula is valid almost everywhere except small vicinity of y. Then, g (y) φ f (y)] is: g (y) { exp γ ] (γ ) y ϑ (γ ) exp eϑ ) } ( y ϑ + γ. (85) eϑ The inverse Laplace transform of g (y) is p (x) γ x ϑ { exp γ e ) ( xϑ exp eϑ ]} (γ ) xϑ eϑ (86) (see Appendix B). The region of validity of this formula is x ( ζ) γ. γ. γ.4 P(m) 5 5..4.6.8..4.6.8 γ.6 γ.8 P(m) 5 5..4.6.8 m..4.6.8 m FIG. 4. Series of the connected-component size distributions P(l, m) in the model with a power law degree distribution for various values of the γ exponent. As the distribution more and more concentrates near m, the parameter l of the curves takes the values.5,,.,.5 when γ.; the values.,.5,,.,.5 when γ.4;.,.,, 4. when γ.6; and 4.,.,., 5.,. when γ.8. Now, let us consider the distribution of the size of the n-th connected component, P n (m). Since it is impossible to calculate the inverse of g (y) analytically, we use the parametric form of Eq. (33), introducing a parameter t, which is connected with m as m (t) φ (t) (γ ) ( t) ( t) γ ]. (87) γ Then we have: P n (t) ν n f (t) m (t) p ν n f (t) ], (88) where f (t) is the inverse of the function f (y), Eq. (84) and ν n z (z ) z n N. Combining Eqs. (33), (83), (87) and (88) we obtain: where P n (t) µ n ( t) γ ] ( t) ln ( t) { exp µ n ln ( t) ] exp ]} (γ ) µn, (89) ln ( t) 5

µ n (eϑ) νn ϑ (γ )n N ϑ (eϑ) (γ ) n n, (9) ϑ ln N (ln ϑ + ) n, ln / (γ )] (9) ϑ is given by Eq. (83). Eqs. (87) and (89) define the distributions of the sizes of the n-th connected component P n (m) in the parametric form, see fig. 4. The order of a connected component, n, and the size of the system, N, enter here only in the combination n n (N), i.e. one can write P n (m) P (n n, m). One can write the following asymptotic expression for P (l, m): at small m, m exp (γ ) l], P (l, m) (γ ) (γ )l m ln 3, (9) ( q/m) where q (γ ) / (γ ) is the mean degree. When m close to, or, more precisely, when m, (γ ) l, we have: ] } 3/ P (l, m) (γ ) l γ exp { (γ ) l γ. (93) ( m) ( m) Obviously, the distribution is concentrated near m when l <, l, and near m when l >, l. The intervertex distance distribution actually depends on n n l only. For the cumulative distance distribution Q n Prob (Distance n) Q (n n ) we have: Q (l) dm mp (l, m) This integral can easily be evaluated, and we obtain dt m (t) P (l, t). (94) (γ ) µ/ Q (l) (γ ) {(γ ) K (µ /) + K (γ ) µ /] (γ ) / K (γ ) / µ /]}, (95) where µ (γ ) l and K is the McDonald function, see fig. 5. For large negative l, using the large argument asymptotics of K (z) we obtain the following expression: Q (l) ( ) γ π / (γ ) l/4 exp (γ ) l/]. (96) γ.8.6 Q(l).4. 3 4 - -5 5 l FIG. 5. Cumulative distance distribution function Q(l) in the model with a power-law degree distribution for various values of the γ exponent. Curves,, 3, and 4 correspond to γ.,.4,.6, and.8, respectively. 6

For a large positive l we have: Q (l) ( ) (γ ) (l l ) (γ ) l ln γ, l γ e 3/ γ γ ln (γ ). (97) ln / (γ )] The hull function Q (l) is characterized by the position of its center: l + dq (l) dl l dl ln (γ ) γ e ] ln (γ ) γ (98) and by its width δl + dl ( l l ) dq (l) dl ln (γ ) π 6 (γ ) ln (γ ) (γ ) ]. (99) Note that all the results of this section were obtained assuming two limiting transitions. First the size of the network tends to infinity, while the cut-off parameter is kept finite. This allows us to apply the general formalism based on Eq. (). And only afterwards the cut-off parameter ln ζ tends to infinity. This allows us to obtain the solution of Eq. () in the leading order. The limiting transitions in this section are performed precisely in this order. Situations where these two limiting transitions must be performed simultaneously will be discussed in the next, conclusive section. IX. CONCLUSIONS The most crucial restriction in our formalism is that vertices of the network are uncorrelated, so that the network is completely defined by a given degree distribution Π (q). This allowed us to trace the evolution of the n-th connected component of a vertex as the n is growing. This is possible, however, only if Π (q) has finite first and second moments, q and q. These networks contain (almost) no closed loops of finite size. Almost all loops are of the order of the average intervertex distance in the network, ln N. The problem of the intervertex distance statistics for these random network is reduced to the solution of the functional equation (). It is possible to solve it only in some particular cases. However, all the asymptotic properties of the distance distributions may be extracted from this equation. Undefined constants in the resulting asymptotic expressions (35) (53) can be found numerically, if necessary. The general results may be summarized as follows:. 
1. The average distance $\bar d$ between two vertices in the giant connected component of the network depends on the network size $N$ as

$$ \bar d = \frac{\ln(AN)}{\ln z_1}, \qquad z_1 = \frac{\overline{q^2}}{\bar q} - 1, \tag{100} $$

where $A$ is some number. We assume $z_1 > 1$, which ensures the existence of a giant connected component.

2. The mean square deviation of the intervertex distance is some finite number $\sigma^2 = \overline{(d - \bar d)^2}$. That is, in a large network almost all vertices in the giant connected component are nearly equidistant from each other: the distance is almost certainly $\bar d$ plus or minus a few links.

3. The (cumulative) intervertex distance distribution is actually a function of $d - \bar d$, $Q(d - \bar d)$. It is nearly 0 when its argument is large and negative, $d < \bar d$, $\bar d - d \gg 1$, and tends to $m_\infty^2$ (the probability that two randomly chosen vertices belong to the giant connected component) when $d > \bar d$, $d - \bar d \gg 1$. (Note that the narrowness of an intervertex distance distribution was also observed in other types of networks [,3].)

4. At large negative $l = d - \bar d$, the asymptotics of $Q(l)$ is $Q(l) \propto z_1^{\,l}$. This result is evident. Indeed, the average size of the $n$-th connected component of a vertex is $m_n \propto z_1^{\,n}$, which holds when $m_n = M_n/N \ll 1$ and these components are tree graphs.

5. The asymptotics of $Q(l)$ at large positive $l$ depends on the minimal vertex degree (we assume this degree is nonzero in any case). If $q_{\min} \equiv k = 1$, the deviation of $Q(l)$ from its limiting value behaves as $l\, z_c^{\,l}$, where $z_c < 1$ is some positive number (see the beginning of Section VI). If $k = 2$, it behaves as $l\, z_c^{\,2l}$ with the same $z_c$. So, in these two cases the probability that the distance between two vertices is essentially larger than its average decays essentially as the exponent of the deviation. If, however, $k \geq 3$, the situation is different: the tail is given by Eq. (53), with the exponent $\beta = \ln(k - 1)/\ln z_1 < 1$ (denoted $b$ in Section VI) and some positive number $H$. So, this decay is essentially more rapid than an exponential one. The origin of this difference is clear. In the first case, $k = 1$, the giant connected component contains some number of dead ends; also, when $k = 2$, long chains of vertices are present in the giant connected component. In contrast, when $k \geq 3$, the giant connected component is compact.

6. We obtain asymptotic expressions for the size distribution $P_n(m)$ for the $n$-th connected component. From this basic distribution, other valuable information about the structure of the network can be obtained. For example, from $P_n(m)$ one can obtain the length distribution of closed loops in the network.

In Sections VII and VIII our general formalism was applied to networks with specific types of degree distribution function. In Section VII we considered a two-parameter family of degree distributions. We chose $\Pi(q) \propto \zeta^q/q$ for $q \geq 2$, $\zeta < 1$, $\Pi(1) = \beta < 1$. A motivation for such a choice was that the function $\phi_1(x)$ is a linear-rational one, which allowed us to solve the main functional equation analytically.

In Section VIII we studied the problem: what are the statistics of intervertex distances in networks with a finite first and a divergent second moment of the degree distribution? We introduced a degree distribution which behaves as $\Pi(q) \propto q^{-\gamma} \zeta^q$, $2 < \gamma < 3$, at large degrees $q$.
So, the degree distribution is a power law one in the limit ζ. We found the leading contributions to the size distribution for the n-th connected component and, consequently, to the intervertex distance distribution in this limit. The results may be summarized as follows:. The mean intervertex distance is given by: d ln N ln C ln ( ζ) +. () (3 γ) ln ( ζ) ln (γ ) Note that here N, and ζ is kept small but finite, so the first term on the right-hand part is the leading one.. The intervertex distance distribution actually depends on l d d, and the form of this dependence (see Eq. (95)) appears to be independent of the cut-off position. 3. The mean square deviation of the distances (Eq. (99)) is again a finite number, it depends neither on the network size nor on the cut-off. 4. The probability to find a pair of vertices separated by a distance essentially larger than d exponentially decays with l d d, or, more precisely, as l (γ ) l. The probability to find a pair separated by a distance essentially smaller than d decays faster than an exponent, namely as (γ ) l/4 exp (γ ) l/] (here l <, l ). Now let us discuss the problem: under what conditions these results remain true if one simultaneously tends to infinity both the size of the system and the position of the cut-off in the degree distribution. That is, simultaneously, N and ζ. The main question studied in this paper is: how does the size distribution of the n-th connected component changes with its number n? In Z-representation, at sufficiently small n, this evolution is described by Eq. (8). We replace this equation with Eq. (3), provided that the function f is a solution of the functional equation (). This can be done if, on the one hand, n is large enough, so that Eq. (8) may be replaced with its asymptotic form, and, on the other hand, n is small enough the size of the connected component is still essentially smaller than the size of the network. 
Let us estimate how large $n$ must be to satisfy the first requirement. For this, let us choose some $t_0$ close enough to 1, so that $1 - \phi_1(t_0) \cong z_1 (1 - t_0)$. This means that the second term of the Taylor series of $\phi_1(t_0)$ near $t_0 = 1$, $(1/2)\,\phi_1''(1)\,(1 - t_0)^2$, is smaller than the first one. Since $z_1 = \phi_1'(1) \sim (1 - \zeta)^{\gamma - 3}$ and $\phi_1''(1) \sim (1 - \zeta)^{\gamma - 4}$, this is satisfied if at least $1 - t_0 \lesssim 1 - \zeta$. The condition for the replacement of the evolution equation (8) with its asymptotic form means that the functions $F_n(x)$ and $F_{n+1}(x)$ can be reduced to each other by the rescaling of the independent variable, $F_{n+1}(x) \cong F_n(z_1 x)$. This is true if, after $n$ iterations of the interval of linearity of the function $\phi_1(x)$, $(t_0, 1)$, $1 - t_0 \sim 1 - \zeta$, the resulting interval $(t_n, 1)$ nearly coincides with its limit at $n \to \infty$, $(0, 1)$. In other words, we must require $t_n \ll 1$. Outside the interval of linearity we can write: