RECENT COMMON ANCESTORS OF ALL PRESENT-DAY INDIVIDUALS

Size: px
Start display at page:

Download "RECENT COMMON ANCESTORS OF ALL PRESENT-DAY INDIVIDUALS"

Transcription

1 RECENT COMMON ANCESTORS OF ALL PRESENT-DAY INDIVIDUALS Joseph T. Chag Departmet of Statistics, Yale Uiversity Abstract Previous study of the time to a commo acestor of all preset-day idividuals has focused o models i which each idividual has just oe paret i the previous geeratio. For example, mitochodrial Eve is the most recet commo acestor (MRCA) whe acestry is defied oly through materal lies. I the stadard Wright-Fisher model with populatio size, the expected umber of geeratios to the MRCA is about 2, ad the stadard deviatio of this time is also of order. Here we study a two-paret aalog of the Wright-Fisher model that defies acestry usig both parets. I this model, if the populatio size is large, the umber of geeratios, T, back to a MRCA has a distributio that is cocetrated aroud lg (where lg deotes base-2 logarithm), i the sese that the ratio T /(lg ) coverges i probability to 1 as. Also, cotiuig to trace back further ito the past, at about 1.77 lg geeratios before the preset, all partial acestry of the curret populatio eds, i the followig sese: with high probability for large, i each geeratio at least 1.77 lg geeratios before the preset, all idividuals who have ay descedats amog the preset-day idividuals are actually acestors of all preset-day idividuals. COALESCENT, WRIGHT-FISHER MODEL, GALTON-WATSON PROCESS, GENEALOGICAL MODELS, POPULATION GENETICS AMS 1991 SUBJECT CLASSIFICATION: PRIMARY 92D25 SECONDARY 60J85 Ruig head: Recet commo acestors Postal address: Yale Uiversity Statistics Departmet Box Yale Statio New Have, CT Phoe: Fax: joseph.chag@yale.edu Versio: Jue 12, 1998

2 1 Itroductio Startig with the set of all of us preset-day humas, imagie tracig back i time through our mothers, our mothers mothers, ad so o. This is the materal family tree of makid, ad we are at its leaves. Recet research has suggested that the woma at the root of this tree lived roughly 100,000 or 200,000 years ago, perhaps i Africa (Ca et al., 1987; Vigilat et al., 1991). This woma has bee dubbed mitochodrial Eve, sice all preset-day huma mitochodrial DNA desceded from hers. Mitochodrial Eve was udoubtedly ot the oly woma alive at her time, so the ame Eve is misleadig, as has bee poited out by a umber of authors; see, e.g., Ayala (1995). However, this misuderstadig aside, questios of the origis of makid ad the ature of our relatioships to each other are still of kee iterest, ad the research o mitochodrial Eve has received a great deal of publicity, geeratig headlies i the popular press as well as i scietific publicatios. Svate Pääbo (1995) explais:...the recet date of our mitochodrial acestor is i a sese the really cotroversial coclusio from these studies. Everyoe agrees that we trace our acestry to Homo erectus, who emerged i Africa ad from there coloized most of Eurasia about a millio years ago or eve earlier. What the mitochodrial data seem to show, however, is that we have a much more recet acestor, oe who lived some 100,000 or 200,000 years ago. What captures the imagiatio is ot the particular choice to trace back through the materal lie, but rather it is the idea that all of preset-day humaity may have a commo acestor who lived as little as 100,000 years ago, a time that seems to may to be surprisigly recet. If we retai this idea while removig the restrictio to the materal lie, the questio becomes: How far back i time do we eed to trace the full geealogy of makid i order to fid ay idividual who is a commo acestor of all preset-day idividuals? I this paper we address this sort of questio i a simple mathematical model. The coalescet model of Kigma (1982) forms the basis of may of the calculatios, formal ad iformal, used i recet treatmets of questios about mitochodrial Eve ad related topics. The coalescet is a large-populatio limit of a umber of the fudametal models of populatio geetics, icludig the Wright-Fisher process. These models are haploid, with each idividual i a give geeratio havig a sigle paret i the previous geeratio. The Wright-Fisher model assumes radom matig, i the sese that the paret of a give idividual is equally likely to be ay of the idividuals i the previous geeratio. The stadard model also postulates a costat populatio size, which may be a effective populatio size whe modelig more geeral situatios. A umber of importat properties of the coalescet model are used i applicatios. For example, the model implies a relatioship betwee coalescece times ad populatio size: the expected coalescece time (measured i geeratios) of a large sample is about twice the populatio size. Hudso (1990) gives a survey of the theory ad applicatios of the coalescet. Here we study a atural two-paret aalog of the Wright-Fisher process. (This process was previously cosidered by Kämmerle (1991) ad Möhle (1994); see the ed of this sectio for a discussio of related work.) We assume the populatio size is costat at. Geeratios are discrete ad ooverlappig. The geealogy is formed by this radom process: i each geeratio, each idividual chooses two parets at radom from the previous geeratio. The 2

3 choices are made just as i the stadard Wright-Fisher model radomly ad equally likely over the possibilities the oly differece beig that here each idividual chooses twice istead of oce. All choices are made idepedetly. Thus, for example, it is possible that whe a idividual chooses his two parets, he chooses the same idividual twice, so that i fact he eds up with just oe paret; this happes with probability 1/. This model is desiged oly as a simple startig poit for thought; of course it is ot meat to be particularly realistic. Still, oe might worry that this simple model igores cosideratios of sex ad allows impossible geealogies. If this seems bothersome, a alterative iterpretatio of the same process is that each idividual is actually a couple, ad that the populatio cosists of moogamous couples. The the radom choices cause o cotradictios: the husbad ad wife each were bor to a couple from the previous geeratio. They could eve come from the same couple i the previous geeratio. Our iterest here is i fidig idividuals who are commo acestors of all preset-day idividuals. For coveiece, we use the abbreviatio CA to refer to a commo acestor of all preset-day idividuals, ad MRCA stads for most recet commo acestor. It turs out that mixig occurs extremely rapidly i the two-paret model, so that CA s may be foud withi a umber of geeratios that depeds logarithmically o the populatio size. I particular, our first mai result says that the umber of geeratios back to a MRCA is about lg, where lg deotes logarithm to base 2. Theorem 1. Let T deote the umber of geeratios, coutig back i time from the preset, to a MRCA of all preset-day idividuals, i a populatio of size. The T lg P 1 as. This cotrasts dramatically with the oe-paret situatio. For example if is 1 millio, the the oe-paret MRCA ( Eve ) is expected to occur about 2 millio geeratios ago, whereas a two-paret MRCA occurs with high probability withi the last 20 geeratios or so. Also, the variability i the oe-paret situatio is such that the actual time to the MRCA may easily be as small as half the expected time or as large as double the expected time, say, eve i arbitrarily large populatios. I cotrast, the time to a MRCA for the two-paret model is much less variable. For example, if the populatio is large eough, it is very ulikely that a radom realizatio of the two-paret MRCA time will differ from lg by eve oe percet. This paper also addresses a secod related questio. Imagie tracig back through the two-paret geealogy. Accordig to Theorem 1, after about lg geeratios, we will reach the most recet geeratio that cotais a CA. That geeratio might cotai just oe CA, or it might cotai more tha oe. I ay case, if we cotiue tracig back further through successive geeratios, the the title of CA becomes much less of a prestigious distictio. For example, both parets of a CA will be CA s, ad all gradparets of a CA will be CA s, ad so o. Evetually, i a give geeratio, may (ad i fact most) of the idividuals will be CA s. At some poit we reach a geeratio i which some idividuals are CA s (havig all preset-day idividuals as descedats) ad some are extict (havig o preset-day idividuals as descedats), but o idividual is itermediate (havig some but ot all preset-day idividuals as descedats). That is, at this poit, everyoe who is ot extict is a CA. This coditio persists forever as we trace back i time: every idividual is a CA or 3

4 S 0/ 0/ S S {4,5} S 0/ S S S S {4,5} 0/ {1,2,3,4} {4,5} {1,2,3,4} {2,3} S 0/ {3,5} {2,3} {4} {5} {1,2,4} Figure 1. A example illustratig the model. Here the fourth idividual i geeratio 2 is a CA. By geeratio 5, all idividuals are CA s or extict: idividuals 1,4, ad 5 are CA s, ad idividuals 2 ad 3 are extict. extict. The ext result shows that this coditio is reached very rapidly i the model studied here. Theorem 2. Let U deote the umber of geeratios, coutig back i time before the preset, to a geeratio i which each idividual is either a CA of all preset-day idividuals or a acestor of o preset-day idividual. Let γ deote the smaller of the two umbers satisfyig the equatio γe γ =2e 2, ad let ζ = 1/(lg γ) The U (1 + ζ)lg P 1 as. Thus, withi about 1.77 lg geeratios, a tiy amout of time i compariso with the order time required to get a oe-paret CA, everyoe i the populatio is either a CA of all preset-day idividuals or extict. Figure 1 shows a small example to illustrate the defiitios ad statemets. The populatio size is 5. At the bottom of the figure is geeratio 0, the preset. Goig up i the graph correspods to goig back i time, so that the top row is geeratio 5. For each idividual I i each previous geeratio, we calculate the set of preset-day idividuals (idividuals i geeratio 0) that are descedats of I. For example, the set of preset-day descedats of idividual #1 i geeratio 1 is{3, 5}. The calculatios propagate backward i time accordig to the rule: the set of descedats of a idividual I is the uio of the sets of descedats of the childre of I. For example, the set of preset-day descedats of idividual #4 i geeratio 2 is the uio {3, 5} {5} {1, 2, 4}, which is the whole populatio S. Thus, idividual #4 i geeratio 2 is a CA of the set S of all preset-day idividuals. Cotiuig backward i time, at geeratio 5 we reach the stage where each 4

5 idividual has as descedats either the whole populatio S or the empty set. That is, each idividual i geeratio 5 is either a CA or extict, havig as descedats either everybody or obody from the set of preset-day idividuals, ad all geeratios prior to geeratio 5 also have this property. I the example show, T 5 = 2 ad U 5 =5. What is the sigificace of these results? A applicatio to the world populatio of humas would be a obvious misuse. For example, we would ot claim that a commo acestor of every preset-day huma may be foud withi the last lg geeratios. Eve if we took to be 5 billio, this would imply a CA just about 32 geeratios ago perhaps 500 years or so. A importat source of the iapplicability of the model to this situatio is the obvious o-radom ature of matig i the history of makid. For example, parets are much more likely to live withi a few miles of their childre tha a thousad miles away or halfway aroud the world. So the model studied here is too simple to be directly applicable to the evolutio of makid as a whole. I such complicated situatios, the results soud a ote of cautio: if the logarithmic time to CA s seems patetly implausible, the at least oe of the assumptios of the model, such as the radom matig assumptio, must be causig a great deal of trouble. O the other had, it would be iterestig to kow whether there are simpler real-life situatios i which the assumptios of the model do apply reasoably well ad the theorems provide reasoably accurate quatitative descriptios. Perhaps a relatively homogeeous populatio lackig discerible structures (geographic or otherwise) that iteract strogly with reproductio would be a promisig cadidate. The radom time aalyzed i Theorem 2 seems of atural iterest i this process ad may also be pertiet to certai questios about species trees or populatio trees (as opposed to gee trees ). I may cotexts the species tree is cosidered to be the real object of iterest, ad we use geetic data ad gee trees to attempt to lear about the species tree. For example, for humas, chimpazees, ad gorillas, is the true species tree (HC)G, (HG)C, or (CG)H? Roughly, the coceptual framework of this questio is as follows. There were two speciatio evets that split a sigle species acestral to humas, chimpazees, ad gorillas ito the three separate moder species. The tree (HC)G, for example, says that the first such split separated the subpopulatio that evetually became moder gorillas from the remaider, which later split to become moder humas ad chimpazees. Ufortuately, more precise defiitios of the cocept of species tree that remai useful i difficult or uclear cases seem hard to come by. Oe might adopt the viewpoit that the proper startig poit for a defiitio of species tree is the full two-paret geealogy of all preset-day idividuals. Give such a defiitio, if we kew all details of this geealogy, the we could read off a aswer to the H, C, ad G questio (the aswer might be oe of the 3 choices above that is, the species tree is ot well defied or at least ot bifurcatig). Oe iterpretatio of the time U is as follows. Suppose we imagie a case where evolutio really proceeded accordig to a eat successio of speciatio evets. Uder a certai reasoable defiitio of a species tree, if the times betwee those speciatio evets exceed U, the the species tree is guarateed to be well defied ad coicide with the history of speciatio evets. This idea will be discussed more fully elsewhere. A caveat to forestall potetial misuderstadig: This paper is ot about geetics. That is, it is ot about who gets what gees; it is about somethig more primitive, amely, the acestor-descedat relatioship. Oe-paret models are appropriate i tracig the history of a sample of orecombiig gees or small bits of DNA; a sigle ucleotide desceds from a sigle ucleotide from either the mother or father, but ot both. Here we are cosiderig 5

6 acestry i the more commo, demographic sese of the word, as applied to people, for example, rather tha gees. Previous geetics research that is somewhat related, although still very differet from the preset study, cosiders models icorporatig recombiatio. This type of model has bee ivestigated i a umber of papers, icludig those of Hudso (1983) ad Griffiths ad Marjoram (1997). The history of a sample of DNA sequeces may be described by a collectio of geealogies, with each ucleotide positio i the DNA havig its ow oe-paret geealogy. The geealogies for two positios that experiece o recombiatio betwee them will be cogruet, with the paths of the two geealogies goig back through the same idividuals, whereas a recombiatio betwee two positios causes the geealogies of those positios to differ. Each of the geealogies i the collectio will have its ow MRCA (the ucleotide at its root), which may occur i differet idividuals. Each of these idividuals will be a CA i the sese cosidered i this paper, but the most recet of these idividuals is geerally ot a MRCA i our sese. Our MRCA is more recet, sice the paths from acestors to descedats cosist of all potetial paths for gees to be trasmitted, ad may iclude paths that did ot happe to be take by ay gees. No previous results about these geetic models have bee similar to the results here, for example, i gettig times of order log. This is ot surprisig, sice the asymptotics would require a assumptio that the sequece legths ad the umber of recombiatios ted to ifiity. This is aother maifestatio of the statemet that the questios we are ivestigatig here are ot fudametally geetics questios. There is some previous work o the process we study here ad related processes. Two papers of Kämmerle (1989, 1991) itroduce a geeral class of two-paret (called bisexual i those papers) versios of the Wright-Fisher ad other processes. These papers focus o two mai questios. First, they aalyze the probability of extictio of a set of idividuals i the preset geeratio, that is, the probability that the set of idividuals evetually has o descedats i some future geeratio. Secod, i a two-paret versio of the Mora model, they study the umber R (t) of idividuals t geeratios ago who have at least oe descedat i the preset geeratio. Kämmerle (1989) fids that the Markov chai {R (t) :t =0, 1,...}, suitably ormalized ad suitably iitialized (with the iitializatio essetially requirig that the chai is started i steady state), coverges weakly as to a discrete-time Orstei-Uhlebeck process. Möhle (1994) both geeralizes ad refies the results of Kämmerle. I particular, Möhle provides a detailed aalysis of the extictio probabilities i a two-paret Wright-Fisher model that approximates the probabilities up to o(1/). He also establishes weak covergece i a geeral class of two-paret models, icludig a Orstei-Uhlebeck limit for the two-paret Wright-Fisher process. Möhle also has a umber of other papers i press, icludig oe that relaxes the assumptio of costat populatio size. These previous results are complemetary to the results i this paper. The previous papers cosidered idividuals who have at least 1 descedat i a give future geeratio. Here we cosider CA s, who have as descedats all members of the future geeratio. The previous results about the process {R (t)} apply to large t, that is, to the behavior the process may geeratios before the preset, with the process i steady state. Here we focus o the behavior of a related process at small (i.e. recet) times, startig far away from steady state. We show that at about t =1.77 lg geeratios before the preset, with high probability the R (t) idividuals who have at least 1 preset-day descedat are all i fact CA s. 6

7 2 Simulatios Table 1 presets a small simulatio study cosistig of 25 trials each for = 500, = 1000, = 2000, ad = Two umbers are reported for each trial: T, the umber of geeratios back to a MRCA, ad U, the geeratio at which every idividual is either a CA or extict. = 500 = 1000 = 2000 = Table 1. A small simulatio study. For each of four populatio sizes, the two times T ad U are reported for 25 trials. I these simulatios the distributio of the time back to a MRCA is ideed quite cocetrated aroud the value lg, which is early 9 for = 500, early 10 for = 1000, ad so o. Thus, the simulatio results show that the asymptotic ( ) statemet of Theorem 1 is ot so asymptotic, i that it describes the situatio well eve for rather small values of. The behavior predicted by Theorem 2 is also reflected reasoably well i the simulatios, although oe might have guessed a umerical costat closer to 2 rather tha 1.77 from this small study. 7

8 3 Proofs 3.1 Geeral ideasad tools We start with the observatio that although Theorems 1 ad 2 are phrased i terms of coutig geeratios back i time from the preset util some coditio obtais, these results may be proved by coutig forward i time from a fixed geeratio. For example, the evet {T m} requires that a CA of all idividuals i geeratio 0 may be foud amog geeratios 1, 2,..., m. This is equivalet to requirig that if we start with geeratio m ad trace forward i time, the some idividual i geeratio m becomes a CA of all idividuals i some geeratio t { m +1, m +2,...,0}. So we will cout geeratios forward i time, ad for coveiece let us reumber geeratios so that the iitial geeratio is geeratio 0. The populatio at geeratio t 0 cosists of idividuals deoted by I t,1,i t,2,...,i t,. We ca picture I t,1,i t,2,...,i t, as dots i a array as i Figure 1, with I t,j beig the jth dot i row t. The associatio of a umber j to idividual I t,j is a arbitrary labelig of the idividuals withi geeratio t. Assiged oly as a meas of referrig to idividuals, the labels have o sigificace i the model, which does ot order the idividuals withi a geeratio. Let µ t,1,ν t,1,µ t,2,ν t,2,...,µ t,,ν t, be idepedet ad uiformly distributed o the set {1,...,}. We iterpret µ t,j ad ν t,j as labels of the parets of idividual I t,j ; that is, the parets of I t,j are I t 1,µt,j ad I t 1,νt,j. Defiig a sequece of radom sets G0 i, Gi 1,... recursively by Gi 0 = {i} ad G i t = {j : µ t,j G i t 1 or ν t,j G i t 1}, G i t is the set of labels of the descedats of I 0,i i geeratio t. Let G i t deote the cardiality of G i t. The coditioal probability that idividual I t+1,j has at least oe paret amog the G i t members of G i t is P ({µ t+1,j G i t} {ν t+1,j G i t} G i t)=(g i t/)+(g i t/) (G i t/)(g i t/). The process {G i t : t =0, 1,...} is a Markov chai with trasitio probabilities (G i t+1 G i t) Bi, 2Gi t ( G i t ) 2, (1) where Bi(, p) deotes the biomial distributio for the umber of successes i idepedet trials each havig success probability p. Throughout the proof, {G t } will deote a Markov chai with trasitio probabilities as i (1), although i differet parts of the proof we will cosider differet possible iitial values G 0. For example, takig G 0 = 1 correspods to followig the descedats of a particular idividual i geeratio 0. I the early stages of the process, while G t remais small relative to, i view of (1) the coditioal distributio of G t+1 give G t is early Poisso(2G t ), that is, the Poisso distributio with mea 2G t. I other words, while {G t } remais small, it evolves early as a Galto-Watso brachig process {Y t } with offsprig distributio Poisso(2). Kämmerle (1991) gave a formal statemet of a result of this ature. A special case of his result says that for fixed u, the joit distributio of (G 0,G 1,...,G u ) coverges to that of (Y 0,Y 1,...,Y u )as. For our purposes, we will use the followig result that allows us to approximate 8

9 probabilities for the G process by those for the Y process up to a higher order of accuracy ad over loger itervals of time that may have radom legths. Lemma 3. Let Y 0,Y 1,... deote a Galto-Watso brachig process with offsprig distributio Poisso(2). Suppose that Y 0 = G 0 =1. Defie τb Y = if{t : Y t b} ad τ0b Y = if{t : Y t =0or Y t b}, with correspodig defiitios for τb G ad τ0b G.As,ifm ad b satisfy mb 2 = o(), the P {τ G b >m} = P {τ Y b >m}(1 + o(1)) (2) ad P {τ G 0b >m} = P {τ Y 0b >m}(1 + o(1)). (3) Proof. A straightforward calculatio bouds the likelihood ratio { ) } L(y x) := P {G t+1 = y G t = x} P {Y t+1 = y Y t = x} = P Bi (, 2x ( x2 = y 2 e 2x 1 2x ) y P {Poisso(2x) =y} + x2 2, so that ( log L(y x) 2x +( y) 2x ) + x2 2 (x 2 +2xy)/. This holds wheever the deomiator P {Y t+1 = y Y t = x} is positive, that is, for all x>0 ad y 0, ad also for x = y = 0. Thus, for all such pairs of x ad y satisfyig x<bad y<b,wehave log L(y x) 3b 2 /. A similar calculatio gives the lower boud log L(y x) 5b 2 /(2)[1 + O(b/)], so that log L(y x) 3b 2 / for sufficietly large. Soifx 1,...,x m are all less tha b, the P {G 1 = x 1,...,G m = x m } = P {G 1 = x 1 G 0 =1} P {G m = x m G m 1 = x m 1 } = P {Y 1 = x 1,...,Y m = x m }L(x 1 1) L(x m x m 1 ) P {Y 1 = x 1,...,Y m = x m }e 3mb2 / ad Thus, P {G 1 = x 1,...,G m = x m } P{Y 1 = x 1,...,Y m = x m }e 3mb2 /. P {τb G >m} = P {G 1 = x 1,...,G m = x m } (4) 0 x 1 <b 0 x m<b 0 x 1 <b 0 x m<b = P {τ Y b >m}e 3mb2 / P {Y 1 = x 1,...,Y m = x m }e 3mb2 / 9

10 ad, similarly, P {τ G b >m} P {τ Y b >m}e 3mb2 /, so that, by the assumptio that mb 2 = o(), we obtai P {τ G b >m} = P {τ Y b >m}(1 + o(1)). This proves (2). The proof of (3) uses the same reasoig, with the summatios i (4) ragig over 0 <x t <brather tha 0 x t <b. The previous result will be useful because the Poisso Galto-Watso process is simple ad well uderstood. The ext lemma records a few well kow items for future referece. Lemma 4. Let Y 0,Y 1,... deote a Galto-Watso process with offsprig distributio Poisso(2). Defie the momet geeratig fuctio ψ(z) =E(z Y 1 )=e 2+2z. The extictio probability ρ = P {Y t =0for some t} is the smaller of the two solutios of ψ(ρ) =ρ, ad ρ = γ/2, where γ is as defied i Theorem 2. The t-fold compositio ψ t = ψ ψ satisfies ψ t (z) ρ for all 0 z ρ. The relatio ρ = γ/2 is cofirmed by comparig the defiitios of ρ ad γ. Despite the simple relatioship, we will keep the two differet letters i our otatio for coceptual clarity. Defiig g t = G t /, wehave E(g t+1 g t )=2g t g 2 t = g t (2 g t ). (5) That is, if the fractio of descedats of a give idividual is curretly g t, it is expected to multiply by a factor of 2 g t i the ext geeratio. For example, i the early stages of the process whe the fractio g t is small, it early doubles i expectatio i the ext geeratio. For very small g t (of the order 1/, for example) the radom variability is large; for example, the process could easily go extict. This is whe it is most useful to approximate the G process by the Poisso(2) Galto-Watso process. O the other had, for larger values of g t, the multiplicatio factor g t+1 /g t, although expected to be somewhat smaller, has much less variability. The deviatios of this factor from its expected value are bouded probabilistically by large deviatios iequalities for the biomial distributio. We will use the followig iequality of Berstei (1946) as a basic tool. Lemma 5.[Berstei s iequality] If X Bi(, p) ad r > 0, the { r 2 } P {X p + r} exp. (6) 2p(1 p)+(2/3)r Sice X Bi(, 1 p), the right side of (6) is also a upper boud for the probability P {X p r}. 3.2 Proof of Theorem 1 Outlie. The proof will be divided ito several parts. We start from geeratio 0 ad trace forward i time. Stage 1: By the ed of stage 1, we idetify a idividual I i geeratio 0 who has a umber of descedats that is small compared to, but large eough so that I is ulikely ever to become extict. I particular, we look for a geeratio t such that some idividual I i 10

11 geeratio 0 has at least lg 2 () descedats i geeratio t. With probability approachig 1, this happes i time o(lg ), egligible compared with lg ; this is show by usig Lemma 3 to approximate our process by a Poisso Galto-Watso process. The rest of the proof will show that with probability approachig 1, idividual I becomes a CA withi (1 + ɛ)(lg ) geeratios, where ɛ is a arbitrary positive umber. Stage 2: Let β (0, 1). Stage 2 follows the descedats of I util reachig a geeratio cotaiig at least β descedats. I view of (5), sice β is a small fractio of for large, throughout Stage 2 the umber of descedats i a geeratio is expected to be early double the umber of descedats i the previous geeratio. Ad lg 2 () is large eough so that the multiplicatio factor will be very close to its expected value, with high probability. So stage 2 should ot take much more tha about lg( β )=β lg() geeratios. Stage 3: This stage brigs the cout of descedats of I up from β to (1/2). Sice the fractio of descedats durig stage 3 stays below 1/2, the expected multiplicatio factor is at least 2 1/2 = 3/2. Agai, this multiplicatio factor is very reliable, so that with high probability stage 3 takes o more tha about log 3/2 {(/2)/( β )} geeratios. We ca make this a arbitrarily small fractio of lg by choosig β close eough to 1. Stage 4: Now we switch to lookig at the fractio B t of idividuals i a geeratio who are ot descedats of idividual I. This fractio is expected to square each geeratio. This causes B t to decrease very quickly. Fixig α (1/2, 2/3), we show that stage 4, which takes the fractio B t from 1/2 dow to α, takes oly order lg lg time. Stage 5: This completes the process, edig whe the B process hits 0, ad idividual I has become a CA. We show that this takes just oe geeratio with high probability. Upper boud: Combiig the results of Stages 1 through 5 gives the probabilistic upper boud lim P {T (1 + ɛ)lg} =1. Lower boud: Here we show that lim P {T (1 ɛ)lg} = 1. This is doe by usig Berstei s iequality to prove a assertio of the followig form: For positive r ad δ, oce the process of descedats of ay give idividual reaches a power r of, it is very ulikely to icrease by a factor of more tha 2 + δ i a geeratio, whereas it would have to do so i order to have T < (1 ɛ)lg. Stage 1. Here we will show that with high probability, withi a umber of geeratios egligible compared to lg, we ca fid a geeratio with at least lg 2 idividuals who share a commo acestor. For simplicity we give a crude argumet that circumvets the eed to cosider ay depedece amog the processes {G i t : t 0} startig from differet idividuals I 0,i. This could also be doe alog the lies of the argumet i Lemma 19 below, where we eed to cofrot this depedece. Lemma 6. Defie τ b = if{t : G t b}. Assumig that G 0 =1, lim if P {τ lg 2 3lglg} > 0. Proof. Let b ad m deote lg 2 ad 3lglg, respectively. Let {Y t } be a Galto-Watso process with offsprig distributio Poisso(2), ad defie M t = Y t 2 t. The process {M t } is a oegative martigale that coverges almost surely to a limit M, say, with 11

12 P {M =0} = ρ<1. Note that P {τb Y >m} P {Y m <b} = P {M m <b2 m }. Therefore, usig Fatou s lemma ad the assumptio that b2 m 0, lim sup P {τb Y >m} lim sup P {M m <b2 m } P(lim sup{m m <b2 m }) = P {M m <b2 m ifiitely ofte} P {M =0} = ρ<1. By Lemma 3, P {τ b >m} = P {τ Y b >m}(1 + o(1)) as. Therefore, lim sup P {τ b >m} lim sup P {τ Y b >m} ρ<1. So lim if P {τ b m} 1 ρ>0. Propositio 7. Let G i t deote the umber of descedats i geeratio t of idividual I 0,i (the ith idividual i geeratio 0), ad let G t = max 1 i {G i t}. Defie τb G = if{t : G t b}. The τ G lg 2 = o P (lg ). Proof. We use a geometric trials argumet. Let m = 3lglg, ad choose a sequece {k } with k ad k m = o(lg ). Perform a sequece of k trials as follows. For the first trial, start with idividual I 0,1, ad follow his progey for m geeratios. We say the trial is a success if I 0,1 has at least lg 2 descedats i geeratio m ; by Lemma 6this happes with probability at least c, say, where c>0. If the trial is a failure, start a ew trial, followig the progey of idividual I m,1 for m more geeratios. Ad so o. We stop at the first success, havig foud a idividual with at least lg 2 descedats. The probability that this sequece of trials fails to termiate by geeratio k m is at most (1 c) k, which teds to 0. Thus, with probability tedig to 1, there is a κ {0,...,k 1} such that idividual I κm,1 has at least lg 2 descedats i geeratio (κ +1)m. Let I deote ay acestor of I κm,1 i geeratio 0. We will show i the remaider of the proof that for each ɛ>0, with probability tedig to 1 as, idividual I becomes a CA withi (1 + ɛ)lg geeratios. Stage 2. The followig simple cosequece of Berstei s iequality will be a coveiet tool. Lemma 8. If δ 3/4 ad G t δ/20, the P {G t+1 (2 δ)g t G t } exp( δ 2 G t /5). The ext result shows that the probability that Stage 2 takes more tha lg geeratios approaches 0 as. I fact, we show that this probability is o(1/); this will be used i the proof of Theorem 2. Propositio 9. Assume that G 0 lg 2, ad let 0 <β<1. Defie T 2 = if{t : G t β }. The P {T 2 > lg } = o(1/) as. Proof. Take 0 <δ<3/4 such that lg(2 δ) >β, ad defie ( ) β b() = log 2 δ lg 2. 12

13 Note that b() β lg lg, lg(2 δ) at least for 3, so that P {T 2 > lg } P {T 2 >b()}. We will show that P {T 2 >b()} = o(1/). The iequality T 2 >b() implies that G t+1 < (2 δ)g t for some 0 t b() 1. The first such t must also satisfy G t lg 2. Thus, b() 1 P {T 2 >b()} P {G t+1 < (2 δ)g t,g t lg 2, T 2 >b()} b() 1 t=0 t=0 P { G t+1 < (2 δ)g t, lg 2 G t β}. However, β δ/20 for sufficietly large. Therefore, o the evet {lg 2 G t β },we may apply Lemma 8 to obtai ( ) P {G t+1 < (2 δ)g t G t } exp δ2 5 lg2 = (δ2 /5)(lg e)(lg ). Thus, P {T 2 >b()} b() (δ2 /5)(lg e)(lg ) = o(1/) as. Stage 3. This stage starts i a geeratio i which the umber of descedats of I is just over β ad eds whe the umber of descedats i a geeratio reaches (1/2). Defiig g t = G t /, wehavee(g t+1 g t )=g t (2 g t ). The idea is that if g t 1/2, the i the ext geeratio g t is expected to multiply by a factor of 2 g t 3/2. So with high probability, throughout stage 3, at each geeratio the umber of descedats will multiply by at least 2, say, sice 2 < 3/2. So to get from β to (1/2), we should eed at most log 2 (1/2)1 β = 2[(1 β)lg 1] geeratios. Propositio 10. Assume G 0 β, ad defie T 3 = if{t : G t (1/2)}. The P {T 3 > 2(1 β)lg} = o(1/) as. Proof. The proof is similar to that of Propositio 9. For β G t /2, a straightforward calculatio usig Berstei s iequality gives P {G t+1 2G t G t } exp(.001g t ) exp(.001 β ). Note that log 2 {(/2)/β } = 2(1 β)lg 2. So if T 3 > 2(1 β)lg, the we must have G t+1 2G t for some t<2(1 β)lg satisfyig β G t /2. Thus, P {T 3 > 2(1 β)lg} 2(1 β)(lg ) exp(.001 β )=o(1/) as. 13

14 Stage 4. Let B t deote 1 G t /, the fractio of idividuals i geeratio t who are ot descedats of the chose idividual I. The (B t+1 B t,b t 1,...) 1 Bi(, B2 t ), (7) sice a idividual is ot a descedat of I whe both of his parets fail to be descedats of I. Fix α (1/2, 2/3). Stage 4 takes the B t process from 1/2 dowto α. The idea is this. Sice E(B t+1 B t )=Bt 2, we expect B t to square each geeratio. We will show that the probability P {B t+1 B 3/2 t } is small throughout stage 4 (ote 3/2 < 2). This will be good eough, sice if B t+1 <B 3/2 t holds throughout stage 4, the stage 4 is completed i order lg lg time. Propositio 11. Cosider a process B 0,B 1,... satisfyig (7), ad suppose B 0 1/2. Let α (1/2, 2/3) ad defie T 4 = if{t : B t α }. The P {T 4 2lglg} = o(1/) as. Proof. By Berstei s iequality, P {B t+1 B 3/2 t B t } = P {Bi(, Bt 2 ) B 3/2 t B t } { 2 Bt 3 (1 B 1/2 t ) 2 } exp 2Bt 2(1 B2 t )+(2/3)B3/2 t (1 B 1/2 t ) { B t (1 B 1/2 t ) 2 } = exp 2(1 Bt 2)+(2/3)B 1/2 t (1 B 1/2. t ) If α B t 1/2, the (1 B 1/2 t ) , ad B t (1 B 1/2 t ) 2 2(1 Bt 2)+(2/3)B 1/2 t (1 B 1/2 t ) α 2+(2/3) (the last iequality holdig for 6 2/α ), so that P {B t+1 B 3/2 t { B t } exp (3/2)α}. α/ (3/2)α For 2, if B 0 1/2 ad B t+1 B 3/2 t B 2lglg 1 α. Therefore, for t =0, 1,..., 2lglg 1, the {T 4 > 2lglg } {B 2lglg > α } 2lglg 1 t=0 {B t+1 >B 3/2 t, α <B t 1/2}, so that P {T 4 > 2lglg } 2lglg 1 t=0 2lglg exp P {B t+1 B 3/2 t, α <B t 1/2} { (3/2)α} = o(1/). 14

15 Stage 5. This stage starts with the {B t } process below α ad eds whe it hits 0. We show that with high probability this takes just oe geeratio. Propositio 12. Suppose B 0 α. The P {B 1 =0} 1 as. Proof. Sice B 1 (1/)Bi(, B0 2 ) ad 2α >1, we have P {B 1 =0} =(1 B0 2) (1 2α ) 1. Upper boud. Propositio 13. For each ɛ>0, P {T > (1 + ɛ)lg} 0 as. Proof. Defie T 1 to be the time at which stage 1 eds. The we kow that T 1 is fiite with probability 1, ad, for arbitrary positive ξ, P {T 1 >ξlg } 0as. At the ed of stage 1 we have foud a idividual I, say, i geeratio 0 who has at least lg 2 () descedats i geeratio T 1. Let G t deote the umber of descedats of I i geeratio t, ad let τ(b) deote if{t : G t b}. Our previous results have show that Thus, P {τ( β ) T 1 > lg } = o(1/), P {τ(/2) τ( β ) > 2(1 β)lg τ( β ) < } = o(1/), P {τ( 1 α ) τ(/2) > 2lglg τ(/2) < } = o(1/), P {τ() τ( 1 α ) > 1 τ( 1 α ) < } = o(1). P {T >ξlg +lg + 2(1 β)lg + 2 lg lg +1} P {T 1 >ξlg } + P {T 1 <,τ( β ) T 1 > lg } + P {τ( β ) <,τ(/2) τ( β ) > 2(1 β)lg} + P {τ(/2) <,τ( 1 α ) τ(/2) > 2lglg} + P {τ( 1 α ) <,τ() τ( 1 α ) > 1} = o(1) + o(1/) + o(1/) + o(1/) + o(1) = o(1). Give ɛ>0, takig ξ ad β such that ξ + 2(1 β) <ɛ, we see that P {T > (1 + ɛ)lg} 0. Lower boud. We will use Berstei s iequality i the followig form. Lemma 14. For δ 3/2, P {G t+1 (2 + δ)g t G t } exp[ δ 2 G t /5]. Propositio 15. For each ɛ>0, P {T < (1 ɛ)lg} 0. 15

16 Proof. Fix ɛ (0, 1). Proceedig forward i time from geeratio 0, we wat to show that the probability that oe of the idividuals i geeratio 0 becomes a CA before geeratio (1 ɛ)lg teds to 1 as. Defie G 0 = 1 ad (G t+1 G t,...,g 0 ) Bi(, 2G t / (G t /) 2 ). Here we thik of G t as the umber of descedats of idividual I 0,1 i geeratio t. Fix r (0,ɛ) so that 2 (1 r)/(1 ɛ) (2, 3.5). Let { G t } evolve like {G t } except that it is trucated (or reflected ) below at the value r. That is, ( G t+1 G t,..., G 0 ) max Bi, 2 G t ( ) 2 Gt, r. Defiig τ G = if{t : G t = } ad τ G = if{t : G t = }, obviously P {τ G u} P {τ G u} for all u. Sice G 0 = r,ifτ G (1 ɛ)lg, the we must have G t+1 G t 2 (1 r)/(1 ɛ) for some t< (1 ɛ)lg. Defiig δ =2 (1 r)/(1 ɛ) 2 (0, 3/2), by Lemma 14 the probability of this is at most (1 ɛ)lg exp( δ 2 r /5), which is o(1/) as. Thus, we have show that the probability that idividual I 0,1 has become a CA by geeratio (1 ɛ)lg is o(1/). So the evet that at least oe of the idividuals i geeratio 0 becomes a CA by geeratio (1 ɛ)lg is a uio of such evets of probability o(1/), ad hece has probability that teds to 0 as. 3.3 Proof of Theorem 2 Idea. The idea of the proof is as follows. Defie t = (ζ ɛ)lg ad u = (ζ + ɛ)lg. For each i =1,...,, the process {G i t : t =0, 1,...} follows the descedats of idividual I 0,i.We are waitig util all of the processes {G 1 t },...,{G t } have reached either 0 or (some will reach 0 ad some will reach ). The key igrediet of the argumet is this assertio: With high probability, there are may i s such that G i t [1, lg 2 ()] ad there is o i such that G i u [1, lg 2 ()]. This follows from Lemma 3 together with a aalysis of the Galto-Watso process with offsprig distributio Poisso(2). For a upper boud, cosider the situatio at time u. Some of the processes have become extict ad reached 0, ad we are just waitig for the other, oextict processes to reach 0 or. The key assertio says that with high probability, all of the oextict processes have reached values above lg 2 (). This level is high eough so that with high probability these processes will all icrease predictably ad reach withi (1 + ɛ)lg additioal geeratios; this was show i the proof of Theorem 1. So with high probability, U u +(1+ɛ)lg. For a lower boud, the key assertio states that with high probability may of the processes are i the iterval [1, lg 2 ()] at time t.itisvery ulikely that all of these will go extict. Furthermore, sice these processes are startig from at most lg 2 () at time t, with high probability it will take more tha (1 ɛ)lg additioal geeratios for ay of them to reach. SoU >t +(1 ɛ)lg with high probability. A brachig process result. Lemma 16. Let {Y t } be a Galto-Watso process whose offsprig distributio is Poisso with mea 2, startig at Y 0 =1. Defie γ as i Theorem 2, ad let b 1,b 2,... be positive itegers 16

17 satisfyig lg(b t )=o(t) as t. The 1 lim t t lg P {1 Y t b t } =lg(γ) Proof. We use a umber of results from chapter 1 of Athreya ad Ney (1972). First, the Mootoe Ratio Lemma says that for each k there is a λ k < such that Also, Λ(s) := P {Y t = k} P {Y t =1} λ k as t. λ k s k < for all s (0, 1). Fially, usig the otatio ad facts collected i Lemma 4, we have k=1 P {Y t =1} = ψ t(0) = ψ [ψ t 1 (0)]ψ t 1(0) = ψ [ψ t 1 (0)]P {Y t 1 =1}, so that P {Y t =1} P {Y t 1 =1} ψ (ρ) =2ρ = γ. I particular, (1/t)lgP{Y t =1} lg γ. For s (0, 1), b t P {Y t = k} P {Y t =1} λ k k=1 k=1 b t b t P {Y t =1}s bt λ k s k k=1 P {Y t =1}s bt Λ(s). (8) If we take s close to 1 (e.g. s =1 b 1 t, say), the the term s bt will remai bouded ad preset o difficulty. So we would like to kow how Λ(s) grows as s 1. Defie ϕ to be the iverse fuctio ψ 1, ad ϕ k to be the k-fold compositio ϕ ϕ. By equatio (6) o page 12 of Athreya ad Ney (1972), for each s (ρ, 1), Λ(ϕ(s)) = γ 1 [Λ(s) Λ(e 2 )] γ 1 Λ(s). Therefore, sice ρ<1/2 <ϕ(1/2) <ϕ 2 (1/2) < < 1, Λ(ϕ k (1/2)) γ k Λ(1/2). However, sice ψ (1) = 2, we may choose a umber Φ so that ϕ k (1/2) 1 (1.9) k Φ ad, therefore, Λ(1 (1.9) k Φ) γ k Λ(1/2) 17

18 hold for all sufficietly large k. From this, it follows that Λ(1 y) Λ(1/2)(y/Φ) (lgγ)/(lg1.9) y 2lgγ holds for all sufficietly small positive y. Now substitutig s =1 b 1 t i (8), there is a fiite costat C such that b t k=1 P {Y t = k} Cγ t Λ(1 b 1 t ) Cγ t b 2lg(1/γ) t. Thus, as log as b t grows subgeometrically, that is, lg(b t )=o(t), we have lim t (1/t)lgP {1 Y t b t } lg(γ). Combiig this with the fact that lim t (1/t)lgP {Y t =1} =lg(γ) completes the proof. Upper boud. Lemma 17. Let I 0,i deote idividual i i geeratio 0. Defie G i t to be the umber descedats of I 0,i i geeratio t; i particular, G i 0 =1for all i =1,...,. Also defie τ i 0,b = if{t : G i t =0or G i t b}, ad let The P (A ) 0 as. A = {τ i 0,lg 2 > (ζ + ɛ)lg}. (9) i=1 Proof. Defie τ Y 0b = if{t : Y t =0orY t b}. Sice {τ Y 0b >t} {1 Y t <b}, Lemma 16ad (3) give lim (1/t)lgP {τ i 0b >t} lg(γ) if lg(b) =o(t) ad tb 2 = o(). (10) Lettig ɛ>0 ad applyig (10) to t =(ζ + ɛ)(lg ) ad b =lg 2 () gives lg P {τ i 0,lg 2 > (ζ + ɛ)(lg )} (lg(γ)+δ)(ζ + ɛ)(lg ) () for all δ ad all sufficietly large. Takig δ sufficietly small, from the defiitio of ζ we see that P {τ i 0,lg 2 > (ζ + ɛ)(lg )} = o(1/) as, () so that P (A )=o(1). We have show that, with high probability, all idividuals i geeratio 0 have either o descedats or more tha lg 2 () descedats i geeratio (ζ + ɛ)lg for ɛ>0. Next we will show that for ay give ɛ>0, with high probability, each idividual havig more tha lg 2 () descedats i geeratio (ζ + ɛ)lg becomes a CA withi (1 + ɛ)lg additioal geeratios. Most of the work required to prove this has already bee doe i the proof of 18

19 Theorem 1; the extra igrediet is the followig lemma, which takes a closer look at stage 5. We retai the defiitio B t =1 (G t /) from above. Lemma 18. Let α (1/2, 2/3) ad take k(α) > 1/(2α 1). Suppose that B 0 α ad defie T 5 = if{t : B t =0}. The P {T 5 >k(α)} = o(1/) as. Proof. Sice (B t+1 B t ) 1 Bi(, B2 t ), o the evet {B t α } we have P {B t+1 > 0 B t } =1 (1 B 2 t ) 1 (1 2B 2 t )=2B 2 t 2 1 2α, where the first iequality holds for sufficietly large (sice α>1/2 implies that B 2 t is arbitrarily small for sufficietly large ). I particular, P {0 <B t+1 α 0 <B t α } 2 1 2α for sufficietly large. (11) Next, by Berstei s iequality, o the evet {B t α }, P {B t+1 > α B t } = P { 1 Bi(, B2 t ) >Bt 2 +( α Bt 2 ) B t } [ 2 ( α Bt 2 ) 2 ] exp 2Bt 2(1 B2 t )+ 2 3 ( α Bt 2 [ ] ) exp 2 2α 2 1 2α α Sice the expoet is asymptotic to (3/2) 1 α, clearly P {B t+1 > α B t } exp[ 1 α ] o {B t α } for sufficietly large. Assumig that B 0 α, Therefore, sice we obtai k k 1 {B t > α } {B t α,b t+1 > α }. t=0 t=0 P {B t α,b t+1 > α } = E [ {B t α }P {B t+1 > α B t } ] exp[ 1 α ], Thus, usig (11) ad (12),. [ k ] P {B t > α } k exp[ 1 α ]. (12) t=0 ( k ) ( k ) P {T 5 >k} P {B t > α } + P {0 <B t α } t=0 t=0 k exp[ 1 α ]+(2 1 2α ) k. 19

20 Applyig this to k = k(α) > 1/(2α 1) gives the desired result. Proof of the upper boud i Theorem 2. Let U deote the time at which everyoe from geeratio 0 has become a CA or extict. Recall the defiitio of A from (9), ad let τ i (b) = if{t : G i t b}. Sice {U > (1 + ζ +2ɛ)lg} A [A c {U > (1 + ζ +2ɛ)lg}] A {τ i (lg 2 ) (ζ + ɛ)lg, τ i () > (1 + ζ +2ɛ)lg}, i=1 to show that P {U > (1 + ζ +2ɛ)lg} = o(1), by Lemma 17 it suffices to show that P {τ 1 (lg 2 ) (ζ + ɛ)lg, τ 1 () > (1 + ζ +2ɛ)lg} = o(1/). To see this, observe that the results of Stages 2 through 4 from the proof of Theorem 1 show that ad Lemma 18 gives Cosequetly, P {τ 1 ( β ) τ 1 (lg 2 ) > lg τ 1 (lg 2 ) < } = o(1/), P {τ 1 (/2) τ 1 ( β ) > 2(1 β)lg τ 1 ( β ) < } = o(1/), P {τ 1 ( 1 α ) τ 1 (/2) > 2lglg τ 1 (/2) < } = o(1/), P {τ 1 () τ 1 ( 1 α ) >k(α) τ 1 ( 1 α ) < } = o(1/). P {τ 1 () τ 1 (lg 2 ) > lg + 2(1 β)lg + 2 lg lg + k(α) τ 1 (lg 2 ) < } = o(1/). Choosig β sufficietly close to 1, we see that for ay give ɛ>0, Thus, as desired. P {τ 1 () τ 1 (lg 2 ) > (1 + ɛ)lg τ 1 (lg 2 ) < } = o(1/). P {τ 1 (lg 2 ) (ζ + ɛ)lg, τ 1 () > (1 + ζ +2ɛ)lg} P {τ 1 (lg 2 ) <,τ 1 () τ 1 (lg 2 ) > (1 + ɛ)lg} = o(1/), Lower boud. The proof goes as follows. First we show that at time t = (ζ ɛ)lg, there are may idividuals i who have G i t [1, lg 2 ]. The probability that all of these idividuals evetually become extict is egligibly small. I the probable evet that ot all of these idividuals become extict, the time U must wait for at least oe of them to become a CA. From the previous results we kow that this will take a additioal (1 ɛ)lg geeratios. Here is some otatio that will be used throughout the proof. Let t deote (ζ ɛ)lg. For 1 i, defie J i to be the evet {G i t [1, lg 2 ] for all t t }; we will also deote by J i 20

21 the idicator radom variable correspodig to this evet. Thus, J i = 1 meas that idividual i i geeratio 0 does ot become extict by time t ad that the umber of descedats of this idividual also remais relatively small (o more tha lg 2 ()) up to time t. At time t these idividuals still have a chace to become CA s, but they have ot yet made much progress toward doig so. The umber of such idividuals is N = i=1 J i. The ext lemma shows that there is little depedece betwee the umbers of descedats of differet idividuals i the early stages of the process. The lemma gives a upper boud o a probability; a similar lower boud may be obtaied, but it is ot eeded i the remaider of the proof. Lemma 19. P (J 1 J 2 ) [P (J 1 )] 2 (1 + o(1)) as. Proof. Cosider idividuals I 0,1 ad I 0,2, that is, idividuals 1 ad 2 i geeratio 0. Let A t deote the umber of idividuals i geeratio t who are descedats of I 0,1 but ot of I 0,2. Let C t deote the umber of idividuals i geeratio t who are descedats of I 0,2 but ot of I 0,1. Let B t deote the umber of idividuals i geeratio t who are descedats of both I 0,1 ad I 0,2. This otatio is local to this proof; i particular, B t has a differet meaig here tha it did i the proof of Theorem 1. So G 1 t = A t + B t ad G 2 t = C t + B t. Lettig H t =(A t,b t,c t ), the process {H t } is a Markov chai. For coveiece we use the otatio P H (a t,b t,c t ) for P {A t = a t,b t = b t,c t = c t }, P H (a t+1,b t+1,c t+1 a t,b t,c t ) for P {A t+1 = a t+1,b t+1 = b t+1,c t+1 = c t+1 A t = a t,b t = b t,c t = c t }, ad so o. We begi by observig that P (J 1 J 2 ) P (J 1 J 2 {B t = 0 for all t t }). (13) This is easy to see ituitively: If A t ad C t are both bouded by lg 2 () ad B t = 0, the the coditioal probability that B t+1 > 0 is at most 2A t C t / = O(lg 4 ()/). This suggests that for each s t, give the evet J 1 J 2, the coditioal probability that B t is positive for the first time at t = s is O(lg 4 ()/). Addig these probabilities over all s t = O(lg ) would the give P (B t > 0 for some t t J 1 J 2 )=O(lg 5 ()/). This is correct, ad the relatio [ ( lg 5 )] () P (J 1 J 2 )=P(J 1 J 2 {B t = 0 for all t t }) 1+O follows from a rather tedious calculatio whose details we omit. The calculatio bouds ratios of biomial probabilities i similar way to a argumet that is give later i this proof. Lettig L deote the iterval [1, lg 2 ], we wat a upper boud o the probability P (J 1 J 2 {B t = 0 for all t t }) = P H (a 1, 0,c 1,a 2, 0,c 2,...,a t, 0,c t ) a 1 L a t L c 1 L c t L = P H (a 1, 0,c 1 ) P H (a 2, 0,c 2 a 1, 0,c 1 ) a 1,c 1 L a 2,c 2 L P H (a t, 0,c t a t 1, 0,c t 1). a t,c t L 21

22 Defiig ad we may write α s = 2a s a s(a s +2c s ) 2, β s = 2a sc s 2, γ s = 2c s c s(c s +2a s ) 2, P H (a t, 0,c t a t 1, 0,c t 1 )=P {Bi(, α t 1 )=a t }P {Bi( a t, P {Bi( a t c t, β t 1 1 α t 1 γ t 1 )=0}. γ t 1 1 α t 1 )=c t } We wat to compare this to the aalogous probability for two idepedet {G t } processes, that is, to P G (a t a t 1 )P G (c t c t 1 )=P {Bi(, α t 1 + β t 1 )=a t }P {Bi(, γ t 1 + β t 1 )=c t }. The ratio is the product of three terms: P H (a t, 0,c t a t 1, 0,c t 1 ) P G (a t a t 1 )P G (c t c t 1 ) (14) P {Bi(, α t 1 )=a t } P {Bi(, α t 1 + β t 1 )=a t }, (15) γ P {Bi( a t, t 1 1 α t 1 )=c t } P {Bi(, γ t 1 + β t 1 )=c t }, (16) ad β t 1 P {Bi( a t c t, )=0}. (17) 1 α t 1 γ t 1 We boud the third term (17) simply by 1; i fact it is close to 1. The term (15) is αt 1 at (1 α ( ) t 1) at (α t 1 + β t 1 ) at (1 α t 1 β t 1 ) β t 1 1+ at 1 α t 1 β t 1 ( lg 4 ) () =1+O, sice β t 1 β t 1 2lg4 () 1 α t 1 β t 1 2 for a t 1,c t 1 lg 2 (). By a similar calculatio, (16) is also 1 + O we obtai ( P H (a t, 0,c t a t 1, 0,c t 1 ) lg 4 ) P G (a t a t 1 )P G (c t c t 1 ) =1+O (). 22 ( ) 1 lg 4 (). Multiplyig,

23 Thus, P (J 1 J 2 {B t = 0 for all t t }) = P H (a 1, 0,c 1 ) P H (a 2, 0,c 2 a 1, 0,c 1 ) a 1,c 1 L a 2,c 2 L P H (a t, 0,c t a t 1, 0,c t 1) a t,c t L P G (a 1 1)P G (c 1 1) P G (a 2 a 1 )P G (c 2 c 1 ) a 1,c 1 L a 2,c 2 L [ ( lg 4 )] t () P G (a t a t 1)P G (c t c t 1) 1+O a t,c t L = P G (a 1 1)P G (a 2 a 1 ) P G (a t a t 1) a t,...,a t L P G (c 1 1)P G (c 2 a 1 ) P G (c t a t 1) c t,...,c t L [ ( [ ] 2 lg = P {G 1 5 )] () t L for all t t } 1+O ( lg =[P (J 1 )] [1+O 2 5 )] (). This completes the proof. [ 1+O ( lg 4 () )] t Lemma 20. N i probability as. Proof. We will show that the mea ad stadard deviatio of N satisfy EN ad SD(N )=o(e(n )). To see that EN = P J 1, begi with (3), which gives P (J 1 ) P {Y t [1, lg 2 ()] for all t t }. This last probability is very close to P {Y t [1, lg 2 ()]}. Ideed, the differece P {Y t [1, lg 2 ()]} P{Y t [1, lg 2 ()] for all t t } = P {Y t [1, lg 2 ()],Y t > lg 2 () for some t<t }, (18) is the probability that the Y process exceeds lg 2 () some time before t but the decreases to be below lg 2 () at time t. Sice the Berstei iequality applied to the Poisso distributio gives P {Y t+1 Y t Y t } exp[ (3/14)Y t ] exp[ (3/14) lg 2 ()] o {Y t > lg 2 ()}, the differece (18) is bouded by t exp[ (3/14) lg 2 ()] = o(1/). By Lemma 16, lg P {Y t [1, lg 2 ()]} (ζ ɛ)lg 1 t lg P {Y t [1, lg 2 ()]} lg γ = 1 ζ. 23

24 which implies that P {Y t [1, lg 2 ()]}. Thus, P (J 1 ) P {Y t [1, lg 2 ()] for all t t } = [P {Y t [1, lg 2 ()]} + o(1/)]. Fially, to see that SD(N )=o(e(n )), we apply Lemma 19 to obtai Var(N )=E(N) 2 (EN ) 2 = P J 1 + ( 1)P (J 1 J 2 ) (P J 1 ) 2 P J 1 + ( 1)[P (J 1 )] 2 (1 + o(1)) (P J 1 ) 2 = o( 2 (PJ 1 ) 2 )=o((en ) 2 ). Proof of the lower boud i Theorem 2. Defiig W = {i : G i t [1, lg 2 ()]}, wehave P {U t +(1 ɛ)lg()} P {W = } + P {evetual extictio for all i W } +P {G i t = for some i W +(1 ɛ)lg() }. (19) P The cardiality of the set W is N. Sice N, clearly the probability that all idividuals {I 0,i : i W } evetually become extict coverges to 0; this is a easy cosequece of results about extictio probabilities of Kämmerle (1991) or Möhle (1994). So it remais to show that the last probability i (19) teds to 0. To see this, takig i W, observe that for the evet {G i t = } to occur the +(1 ɛ)lg() {Gi t} process must go from below lg 2 () at time t to at time t +(1 ɛ)lg(). That is, the process must go from below lg 2 () to withi a time spa of at most (1 ɛ)lg() geeratios. However, by the proof of Propositio 15, we kow that this has probability o(1/), so that, takig the uio over i W gives a total probability of o(1). 4 Discussio A motivatio behid this study was the iterest surroudig the idea of all of makid havig a recet commo acestor. I thikig about a mathematical treatmet of that idea, it seemed atural to remove the restrictio to the materal lie ad cosider a two-paret model. We have see that CA s occur very recetly i the two-paret model studied here. The most recet CA occurs, with high probability, about lg geeratios ago. Withi 1.77 lg geeratios, with high probability, all idividuals who are ot extict are CA s. These results describe the behavior of populatios satisfyig certai assumptios of radom matig ad so o. If our world really satisfied such assumptios, the athropological excitemet about the recetess of mitochodrial Eve would be misplaced: I oly a tiy fractio of the time back to mitochodrial Eve, commo acestors of makid would aboud, ad i fact a radomly chose idividual would be a CA with probability about 0.8. If we wish to uderstad aalogous questios i more complicated models that could better address pheomea such as the evolutio of makid, further study is required. For example, the absece of geographic structure is a key feature limitig the applicability of the model studied here to such situatios. 24

The coalescent coalescence theory

The coalescent coalescence theory The coalescet coalescece theory Peter Beerli September 1, 009 Historical ote Up to 198 most developmet i populatio geetics was prospective ad developed expectatios based o situatios of today. Most work

More information

6.3 Testing Series With Positive Terms

6.3 Testing Series With Positive Terms 6.3. TESTING SERIES WITH POSITIVE TERMS 307 6.3 Testig Series With Positive Terms 6.3. Review of what is kow up to ow I theory, testig a series a i for covergece amouts to fidig the i= sequece of partial

More information

Infinite Sequences and Series

Infinite Sequences and Series Chapter 6 Ifiite Sequeces ad Series 6.1 Ifiite Sequeces 6.1.1 Elemetary Cocepts Simply speakig, a sequece is a ordered list of umbers writte: {a 1, a 2, a 3,...a, a +1,...} where the elemets a i represet

More information

4.3 Growth Rates of Solutions to Recurrences

4.3 Growth Rates of Solutions to Recurrences 4.3. GROWTH RATES OF SOLUTIONS TO RECURRENCES 81 4.3 Growth Rates of Solutios to Recurreces 4.3.1 Divide ad Coquer Algorithms Oe of the most basic ad powerful algorithmic techiques is divide ad coquer.

More information

Application to Random Graphs

Application to Random Graphs A Applicatio to Radom Graphs Brachig processes have a umber of iterestig ad importat applicatios. We shall cosider oe of the most famous of them, the Erdős-Réyi radom graph theory. 1 Defiitio A.1. Let

More information

Chapter 3. Strong convergence. 3.1 Definition of almost sure convergence

Chapter 3. Strong convergence. 3.1 Definition of almost sure convergence Chapter 3 Strog covergece As poited out i the Chapter 2, there are multiple ways to defie the otio of covergece of a sequece of radom variables. That chapter defied covergece i probability, covergece i

More information

Convergence of random variables. (telegram style notes) P.J.C. Spreij

Convergence of random variables. (telegram style notes) P.J.C. Spreij Covergece of radom variables (telegram style otes).j.c. Spreij this versio: September 6, 2005 Itroductio As we kow, radom variables are by defiitio measurable fuctios o some uderlyig measurable space

More information

MA131 - Analysis 1. Workbook 2 Sequences I

MA131 - Analysis 1. Workbook 2 Sequences I MA3 - Aalysis Workbook 2 Sequeces I Autum 203 Cotets 2 Sequeces I 2. Itroductio.............................. 2.2 Icreasig ad Decreasig Sequeces................ 2 2.3 Bouded Sequeces..........................

More information

(A sequence also can be thought of as the list of function values attained for a function f :ℵ X, where f (n) = x n for n 1.) x 1 x N +k x N +4 x 3

(A sequence also can be thought of as the list of function values attained for a function f :ℵ X, where f (n) = x n for n 1.) x 1 x N +k x N +4 x 3 MATH 337 Sequeces Dr. Neal, WKU Let X be a metric space with distace fuctio d. We shall defie the geeral cocept of sequece ad limit i a metric space, the apply the results i particular to some special

More information

Discrete Mathematics and Probability Theory Spring 2016 Rao and Walrand Note 19

Discrete Mathematics and Probability Theory Spring 2016 Rao and Walrand Note 19 CS 70 Discrete Mathematics ad Probability Theory Sprig 2016 Rao ad Walrad Note 19 Some Importat Distributios Recall our basic probabilistic experimet of tossig a biased coi times. This is a very simple

More information

7.1 Convergence of sequences of random variables

7.1 Convergence of sequences of random variables Chapter 7 Limit Theorems Throughout this sectio we will assume a probability space (, F, P), i which is defied a ifiite sequece of radom variables (X ) ad a radom variable X. The fact that for every ifiite

More information

Sequences I. Chapter Introduction

Sequences I. Chapter Introduction Chapter 2 Sequeces I 2. Itroductio A sequece is a list of umbers i a defiite order so that we kow which umber is i the first place, which umber is i the secod place ad, for ay atural umber, we kow which

More information

Discrete Mathematics for CS Spring 2007 Luca Trevisan Lecture 22

Discrete Mathematics for CS Spring 2007 Luca Trevisan Lecture 22 CS 70 Discrete Mathematics for CS Sprig 2007 Luca Trevisa Lecture 22 Aother Importat Distributio The Geometric Distributio Questio: A biased coi with Heads probability p is tossed repeatedly util the first

More information

Sequences. Notation. Convergence of a Sequence

Sequences. Notation. Convergence of a Sequence Sequeces A sequece is essetially just a list. Defiitio (Sequece of Real Numbers). A sequece of real umbers is a fuctio Z (, ) R for some real umber. Do t let the descriptio of the domai cofuse you; it

More information

Sequences A sequence of numbers is a function whose domain is the positive integers. We can see that the sequence

Sequences A sequence of numbers is a function whose domain is the positive integers. We can see that the sequence Sequeces A sequece of umbers is a fuctio whose domai is the positive itegers. We ca see that the sequece 1, 1, 2, 2, 3, 3,... is a fuctio from the positive itegers whe we write the first sequece elemet

More information

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 19 11/17/2008 LAWS OF LARGE NUMBERS II THE STRONG LAW OF LARGE NUMBERS

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 19 11/17/2008 LAWS OF LARGE NUMBERS II THE STRONG LAW OF LARGE NUMBERS MASSACHUSTTS INSTITUT OF TCHNOLOGY 6.436J/5.085J Fall 2008 Lecture 9 /7/2008 LAWS OF LARG NUMBRS II Cotets. The strog law of large umbers 2. The Cheroff boud TH STRONG LAW OF LARG NUMBRS While the weak

More information

7.1 Convergence of sequences of random variables

7.1 Convergence of sequences of random variables Chapter 7 Limit theorems Throughout this sectio we will assume a probability space (Ω, F, P), i which is defied a ifiite sequece of radom variables (X ) ad a radom variable X. The fact that for every ifiite

More information

Discrete Mathematics and Probability Theory Summer 2014 James Cook Note 15

Discrete Mathematics and Probability Theory Summer 2014 James Cook Note 15 CS 70 Discrete Mathematics ad Probability Theory Summer 2014 James Cook Note 15 Some Importat Distributios I this ote we will itroduce three importat probability distributios that are widely used to model

More information

A sequence of numbers is a function whose domain is the positive integers. We can see that the sequence

A sequence of numbers is a function whose domain is the positive integers. We can see that the sequence Sequeces A sequece of umbers is a fuctio whose domai is the positive itegers. We ca see that the sequece,, 2, 2, 3, 3,... is a fuctio from the positive itegers whe we write the first sequece elemet as

More information

Random Models. Tusheng Zhang. February 14, 2013

Random Models. Tusheng Zhang. February 14, 2013 Radom Models Tusheg Zhag February 14, 013 1 Radom Walks Let me describe the model. Radom walks are used to describe the motio of a movig particle (object). Suppose that a particle (object) moves alog the

More information

Product measures, Tonelli s and Fubini s theorems For use in MAT3400/4400, autumn 2014 Nadia S. Larsen. Version of 13 October 2014.

Product measures, Tonelli s and Fubini s theorems For use in MAT3400/4400, autumn 2014 Nadia S. Larsen. Version of 13 October 2014. Product measures, Toelli s ad Fubii s theorems For use i MAT3400/4400, autum 2014 Nadia S. Larse Versio of 13 October 2014. 1. Costructio of the product measure The purpose of these otes is to preset the

More information

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

Discrete Mathematics for CS Spring 2008 David Wagner Note 22 CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 22 I.I.D. Radom Variables Estimatig the bias of a coi Questio: We wat to estimate the proportio p of Democrats i the US populatio, by takig

More information

Lecture 2. The Lovász Local Lemma

Lecture 2. The Lovász Local Lemma Staford Uiversity Sprig 208 Math 233A: No-costructive methods i combiatorics Istructor: Ja Vodrák Lecture date: Jauary 0, 208 Origial scribe: Apoorva Khare Lecture 2. The Lovász Local Lemma 2. Itroductio

More information

Discrete Mathematics and Probability Theory Spring 2012 Alistair Sinclair Note 15

Discrete Mathematics and Probability Theory Spring 2012 Alistair Sinclair Note 15 CS 70 Discrete Mathematics ad Probability Theory Sprig 2012 Alistair Siclair Note 15 Some Importat Distributios The first importat distributio we leared about i the last Lecture Note is the biomial distributio

More information

Statistics 511 Additional Materials

Statistics 511 Additional Materials Cofidece Itervals o mu Statistics 511 Additioal Materials This topic officially moves us from probability to statistics. We begi to discuss makig ifereces about the populatio. Oe way to differetiate probability

More information

An Introduction to Randomized Algorithms

An Introduction to Randomized Algorithms A Itroductio to Radomized Algorithms The focus of this lecture is to study a radomized algorithm for quick sort, aalyze it usig probabilistic recurrece relatios, ad also provide more geeral tools for aalysis

More information

Math 216A Notes, Week 5

Math 216A Notes, Week 5 Math 6A Notes, Week 5 Scribe: Ayastassia Sebolt Disclaimer: These otes are ot early as polished (ad quite possibly ot early as correct) as a published paper. Please use them at your ow risk.. Thresholds

More information

1 Statement of the Game

1 Statement of the Game ANALYSIS OF THE CHOW-ROBBINS GAME JON LU May 10, 2016 Abstract Flip a coi repeatedly ad stop wheever you wat. Your payoff is the proportio of heads ad you wish to maximize this payoff i expectatio. I this

More information

Lesson 10: Limits and Continuity

Lesson 10: Limits and Continuity www.scimsacademy.com Lesso 10: Limits ad Cotiuity SCIMS Academy 1 Limit of a fuctio The cocept of limit of a fuctio is cetral to all other cocepts i calculus (like cotiuity, derivative, defiite itegrals

More information

Discrete Mathematics for CS Spring 2005 Clancy/Wagner Notes 21. Some Important Distributions

Discrete Mathematics for CS Spring 2005 Clancy/Wagner Notes 21. Some Important Distributions CS 70 Discrete Mathematics for CS Sprig 2005 Clacy/Wager Notes 21 Some Importat Distributios Questio: A biased coi with Heads probability p is tossed repeatedly util the first Head appears. What is the

More information

HOMEWORK 2 SOLUTIONS

HOMEWORK 2 SOLUTIONS HOMEWORK SOLUTIONS CSE 55 RANDOMIZED AND APPROXIMATION ALGORITHMS 1. Questio 1. a) The larger the value of k is, the smaller the expected umber of days util we get all the coupos we eed. I fact if = k

More information

Advanced Stochastic Processes.

Advanced Stochastic Processes. Advaced Stochastic Processes. David Gamarik LECTURE 2 Radom variables ad measurable fuctios. Strog Law of Large Numbers (SLLN). Scary stuff cotiued... Outlie of Lecture Radom variables ad measurable fuctios.

More information

Chapter 6 Infinite Series

Chapter 6 Infinite Series Chapter 6 Ifiite Series I the previous chapter we cosidered itegrals which were improper i the sese that the iterval of itegratio was ubouded. I this chapter we are goig to discuss a topic which is somewhat

More information

Problems from 9th edition of Probability and Statistical Inference by Hogg, Tanis and Zimmerman:

Problems from 9th edition of Probability and Statistical Inference by Hogg, Tanis and Zimmerman: Math 224 Fall 2017 Homework 4 Drew Armstrog Problems from 9th editio of Probability ad Statistical Iferece by Hogg, Tais ad Zimmerma: Sectio 2.3, Exercises 16(a,d),18. Sectio 2.4, Exercises 13, 14. Sectio

More information

The Random Walk For Dummies

The Random Walk For Dummies The Radom Walk For Dummies Richard A Mote Abstract We look at the priciples goverig the oe-dimesioal discrete radom walk First we review five basic cocepts of probability theory The we cosider the Beroulli

More information

MA131 - Analysis 1. Workbook 3 Sequences II

MA131 - Analysis 1. Workbook 3 Sequences II MA3 - Aalysis Workbook 3 Sequeces II Autum 2004 Cotets 2.8 Coverget Sequeces........................ 2.9 Algebra of Limits......................... 2 2.0 Further Useful Results........................

More information

Problem Set 2 Solutions

Problem Set 2 Solutions CS271 Radomess & Computatio, Sprig 2018 Problem Set 2 Solutios Poit totals are i the margi; the maximum total umber of poits was 52. 1. Probabilistic method for domiatig sets 6pts Pick a radom subset S

More information

Random Walks on Discrete and Continuous Circles. by Jeffrey S. Rosenthal School of Mathematics, University of Minnesota, Minneapolis, MN, U.S.A.

Random Walks on Discrete and Continuous Circles. by Jeffrey S. Rosenthal School of Mathematics, University of Minnesota, Minneapolis, MN, U.S.A. Radom Walks o Discrete ad Cotiuous Circles by Jeffrey S. Rosethal School of Mathematics, Uiversity of Miesota, Mieapolis, MN, U.S.A. 55455 (Appeared i Joural of Applied Probability 30 (1993), 780 789.)

More information

The Growth of Functions. Theoretical Supplement

The Growth of Functions. Theoretical Supplement The Growth of Fuctios Theoretical Supplemet The Triagle Iequality The triagle iequality is a algebraic tool that is ofte useful i maipulatig absolute values of fuctios. The triagle iequality says that

More information

Randomized Algorithms I, Spring 2018, Department of Computer Science, University of Helsinki Homework 1: Solutions (Discussed January 25, 2018)

Randomized Algorithms I, Spring 2018, Department of Computer Science, University of Helsinki Homework 1: Solutions (Discussed January 25, 2018) Radomized Algorithms I, Sprig 08, Departmet of Computer Sciece, Uiversity of Helsiki Homework : Solutios Discussed Jauary 5, 08). Exercise.: Cosider the followig balls-ad-bi game. We start with oe black

More information

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering CEE 5 Autum 005 Ucertaity Cocepts for Geotechical Egieerig Basic Termiology Set A set is a collectio of (mutually exclusive) objects or evets. The sample space is the (collectively exhaustive) collectio

More information

MA131 - Analysis 1. Workbook 7 Series I

MA131 - Analysis 1. Workbook 7 Series I MA3 - Aalysis Workbook 7 Series I Autum 008 Cotets 4 Series 4. Defiitios............................... 4. Geometric Series........................... 4 4.3 The Harmoic Series.........................

More information

Understanding Samples

Understanding Samples 1 Will Moroe CS 109 Samplig ad Bootstrappig Lecture Notes #17 August 2, 2017 Based o a hadout by Chris Piech I this chapter we are goig to talk about statistics calculated o samples from a populatio. We

More information

Sequences, Mathematical Induction, and Recursion. CSE 2353 Discrete Computational Structures Spring 2018

Sequences, Mathematical Induction, and Recursion. CSE 2353 Discrete Computational Structures Spring 2018 CSE 353 Discrete Computatioal Structures Sprig 08 Sequeces, Mathematical Iductio, ad Recursio (Chapter 5, Epp) Note: some course slides adopted from publisher-provided material Overview May mathematical

More information

Seunghee Ye Ma 8: Week 5 Oct 28

Seunghee Ye Ma 8: Week 5 Oct 28 Week 5 Summary I Sectio, we go over the Mea Value Theorem ad its applicatios. I Sectio 2, we will recap what we have covered so far this term. Topics Page Mea Value Theorem. Applicatios of the Mea Value

More information

CHAPTER 10 INFINITE SEQUENCES AND SERIES

CHAPTER 10 INFINITE SEQUENCES AND SERIES CHAPTER 10 INFINITE SEQUENCES AND SERIES 10.1 Sequeces 10.2 Ifiite Series 10.3 The Itegral Tests 10.4 Compariso Tests 10.5 The Ratio ad Root Tests 10.6 Alteratig Series: Absolute ad Coditioal Covergece

More information

Math 2784 (or 2794W) University of Connecticut

Math 2784 (or 2794W) University of Connecticut ORDERS OF GROWTH PAT SMITH Math 2784 (or 2794W) Uiversity of Coecticut Date: Mar. 2, 22. ORDERS OF GROWTH. Itroductio Gaiig a ituitive feel for the relative growth of fuctios is importat if you really

More information

62. Power series Definition 16. (Power series) Given a sequence {c n }, the series. c n x n = c 0 + c 1 x + c 2 x 2 + c 3 x 3 +

62. Power series Definition 16. (Power series) Given a sequence {c n }, the series. c n x n = c 0 + c 1 x + c 2 x 2 + c 3 x 3 + 62. Power series Defiitio 16. (Power series) Give a sequece {c }, the series c x = c 0 + c 1 x + c 2 x 2 + c 3 x 3 + is called a power series i the variable x. The umbers c are called the coefficiets of

More information

Sequences and Series of Functions

Sequences and Series of Functions Chapter 6 Sequeces ad Series of Fuctios 6.1. Covergece of a Sequece of Fuctios Poitwise Covergece. Defiitio 6.1. Let, for each N, fuctio f : A R be defied. If, for each x A, the sequece (f (x)) coverges

More information

NUMERICAL METHODS FOR SOLVING EQUATIONS

NUMERICAL METHODS FOR SOLVING EQUATIONS Mathematics Revisio Guides Numerical Methods for Solvig Equatios Page 1 of 11 M.K. HOME TUITION Mathematics Revisio Guides Level: GCSE Higher Tier NUMERICAL METHODS FOR SOLVING EQUATIONS Versio:. Date:

More information

CS284A: Representations and Algorithms in Molecular Biology

CS284A: Representations and Algorithms in Molecular Biology CS284A: Represetatios ad Algorithms i Molecular Biology Scribe Notes o Lectures 3 & 4: Motif Discovery via Eumeratio & Motif Represetatio Usig Positio Weight Matrix Joshua Gervi Based o presetatios by

More information

Entropy Rates and Asymptotic Equipartition

Entropy Rates and Asymptotic Equipartition Chapter 29 Etropy Rates ad Asymptotic Equipartitio Sectio 29. itroduces the etropy rate the asymptotic etropy per time-step of a stochastic process ad shows that it is well-defied; ad similarly for iformatio,

More information

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1. Eco 325/327 Notes o Sample Mea, Sample Proportio, Cetral Limit Theorem, Chi-square Distributio, Studet s t distributio 1 Sample Mea By Hiro Kasahara We cosider a radom sample from a populatio. Defiitio

More information

Output Analysis and Run-Length Control

Output Analysis and Run-Length Control IEOR E4703: Mote Carlo Simulatio Columbia Uiversity c 2017 by Marti Haugh Output Aalysis ad Ru-Legth Cotrol I these otes we describe how the Cetral Limit Theorem ca be used to costruct approximate (1 α%

More information

Math 155 (Lecture 3)

Math 155 (Lecture 3) Math 55 (Lecture 3) September 8, I this lecture, we ll cosider the aswer to oe of the most basic coutig problems i combiatorics Questio How may ways are there to choose a -elemet subset of the set {,,,

More information

Probability, Expectation Value and Uncertainty

Probability, Expectation Value and Uncertainty Chapter 1 Probability, Expectatio Value ad Ucertaity We have see that the physically observable properties of a quatum system are represeted by Hermitea operators (also referred to as observables ) such

More information

On forward improvement iteration for stopping problems

On forward improvement iteration for stopping problems O forward improvemet iteratio for stoppig problems Mathematical Istitute, Uiversity of Kiel, Ludewig-Mey-Str. 4, D-24098 Kiel, Germay irle@math.ui-iel.de Albrecht Irle Abstract. We cosider the optimal

More information

7 Sequences of real numbers

7 Sequences of real numbers 40 7 Sequeces of real umbers 7. Defiitios ad examples Defiitio 7... A sequece of real umbers is a real fuctio whose domai is the set N of atural umbers. Let s : N R be a sequece. The the values of s are

More information

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4 MATH 30: Probability ad Statistics 9. Estimatio ad Testig of Parameters Estimatio ad Testig of Parameters We have bee dealig situatios i which we have full kowledge of the distributio of a radom variable.

More information

Mathematical Induction

Mathematical Induction Mathematical Iductio Itroductio Mathematical iductio, or just iductio, is a proof techique. Suppose that for every atural umber, P() is a statemet. We wish to show that all statemets P() are true. I a

More information

SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES

SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES Read Sectio 1.5 (pages 5 9) Overview I Sectio 1.5 we lear to work with summatio otatio ad formulas. We will also itroduce a brief overview of sequeces,

More information

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10 DS 00: Priciples ad Techiques of Data Sciece Date: April 3, 208 Name: Hypothesis Testig Discussio #0. Defie these terms below as they relate to hypothesis testig. a) Data Geeratio Model: Solutio: A set

More information

n outcome is (+1,+1, 1,..., 1). Let the r.v. X denote our position (relative to our starting point 0) after n moves. Thus X = X 1 + X 2 + +X n,

n outcome is (+1,+1, 1,..., 1). Let the r.v. X denote our position (relative to our starting point 0) after n moves. Thus X = X 1 + X 2 + +X n, CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 9 Variace Questio: At each time step, I flip a fair coi. If it comes up Heads, I walk oe step to the right; if it comes up Tails, I walk oe

More information

If a subset E of R contains no open interval, is it of zero measure? For instance, is the set of irrationals in [0, 1] is of measure zero?

If a subset E of R contains no open interval, is it of zero measure? For instance, is the set of irrationals in [0, 1] is of measure zero? 2 Lebesgue Measure I Chapter 1 we defied the cocept of a set of measure zero, ad we have observed that every coutable set is of measure zero. Here are some atural questios: If a subset E of R cotais a

More information

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample. Statistical Iferece (Chapter 10) Statistical iferece = lear about a populatio based o the iformatio provided by a sample. Populatio: The set of all values of a radom variable X of iterest. Characterized

More information

4. Partial Sums and the Central Limit Theorem

4. Partial Sums and the Central Limit Theorem 1 of 10 7/16/2009 6:05 AM Virtual Laboratories > 6. Radom Samples > 1 2 3 4 5 6 7 4. Partial Sums ad the Cetral Limit Theorem The cetral limit theorem ad the law of large umbers are the two fudametal theorems

More information

Recursive Algorithms. Recurrences. Recursive Algorithms Analysis

Recursive Algorithms. Recurrences. Recursive Algorithms Analysis Recursive Algorithms Recurreces Computer Sciece & Egieerig 35: Discrete Mathematics Christopher M Bourke cbourke@cseuledu A recursive algorithm is oe i which objects are defied i terms of other objects

More information

x a x a Lecture 2 Series (See Chapter 1 in Boas)

x a x a Lecture 2 Series (See Chapter 1 in Boas) Lecture Series (See Chapter i Boas) A basic ad very powerful (if pedestria, recall we are lazy AD smart) way to solve ay differetial (or itegral) equatio is via a series expasio of the correspodig solutio

More information

SEQUENCES AND SERIES

SEQUENCES AND SERIES 9 SEQUENCES AND SERIES INTRODUCTION Sequeces have may importat applicatios i several spheres of huma activities Whe a collectio of objects is arraged i a defiite order such that it has a idetified first

More information

ECE 901 Lecture 12: Complexity Regularization and the Squared Loss

ECE 901 Lecture 12: Complexity Regularization and the Squared Loss ECE 90 Lecture : Complexity Regularizatio ad the Squared Loss R. Nowak 5/7/009 I the previous lectures we made use of the Cheroff/Hoeffdig bouds for our aalysis of classifier errors. Hoeffdig s iequality

More information

Chapter 6 Overview: Sequences and Numerical Series. For the purposes of AP, this topic is broken into four basic subtopics:

Chapter 6 Overview: Sequences and Numerical Series. For the purposes of AP, this topic is broken into four basic subtopics: Chapter 6 Overview: Sequeces ad Numerical Series I most texts, the topic of sequeces ad series appears, at first, to be a side topic. There are almost o derivatives or itegrals (which is what most studets

More information

Lecture 2: April 3, 2013

Lecture 2: April 3, 2013 TTIC/CMSC 350 Mathematical Toolkit Sprig 203 Madhur Tulsiai Lecture 2: April 3, 203 Scribe: Shubhedu Trivedi Coi tosses cotiued We retur to the coi tossig example from the last lecture agai: Example. Give,

More information

Frequentist Inference

Frequentist Inference Frequetist Iferece The topics of the ext three sectios are useful applicatios of the Cetral Limit Theorem. Without kowig aythig about the uderlyig distributio of a sequece of radom variables {X i }, for

More information

OPTIMAL ALGORITHMS -- SUPPLEMENTAL NOTES

OPTIMAL ALGORITHMS -- SUPPLEMENTAL NOTES OPTIMAL ALGORITHMS -- SUPPLEMENTAL NOTES Peter M. Maurer Why Hashig is θ(). As i biary search, hashig assumes that keys are stored i a array which is idexed by a iteger. However, hashig attempts to bypass

More information

Chapter 7: Numerical Series

Chapter 7: Numerical Series Chapter 7: Numerical Series Chapter 7 Overview: Sequeces ad Numerical Series I most texts, the topic of sequeces ad series appears, at first, to be a side topic. There are almost o derivatives or itegrals

More information

It is always the case that unions, intersections, complements, and set differences are preserved by the inverse image of a function.

It is always the case that unions, intersections, complements, and set differences are preserved by the inverse image of a function. MATH 532 Measurable Fuctios Dr. Neal, WKU Throughout, let ( X, F, µ) be a measure space ad let (!, F, P ) deote the special case of a probability space. We shall ow begi to study real-valued fuctios defied

More information

CS / MCS 401 Homework 3 grader solutions

CS / MCS 401 Homework 3 grader solutions CS / MCS 401 Homework 3 grader solutios assigmet due July 6, 016 writte by Jāis Lazovskis maximum poits: 33 Some questios from CLRS. Questios marked with a asterisk were ot graded. 1 Use the defiitio of

More information

Kinetics of Complex Reactions

Kinetics of Complex Reactions Kietics of Complex Reactios by Flick Colema Departmet of Chemistry Wellesley College Wellesley MA 28 wcolema@wellesley.edu Copyright Flick Colema 996. All rights reserved. You are welcome to use this documet

More information

Optimally Sparse SVMs

Optimally Sparse SVMs A. Proof of Lemma 3. We here prove a lower boud o the umber of support vectors to achieve geeralizatio bouds of the form which we cosider. Importatly, this result holds ot oly for liear classifiers, but

More information

Simulation. Two Rule For Inverting A Distribution Function

Simulation. Two Rule For Inverting A Distribution Function Simulatio Two Rule For Ivertig A Distributio Fuctio Rule 1. If F(x) = u is costat o a iterval [x 1, x 2 ), the the uiform value u is mapped oto x 2 through the iversio process. Rule 2. If there is a jump

More information

Lecture 3: August 31

Lecture 3: August 31 36-705: Itermediate Statistics Fall 018 Lecturer: Siva Balakrisha Lecture 3: August 31 This lecture will be mostly a summary of other useful expoetial tail bouds We will ot prove ay of these i lecture,

More information

Chapter 6 Sampling Distributions

Chapter 6 Sampling Distributions Chapter 6 Samplig Distributios 1 I most experimets, we have more tha oe measuremet for ay give variable, each measuremet beig associated with oe radomly selected a member of a populatio. Hece we eed to

More information

On Random Line Segments in the Unit Square

On Random Line Segments in the Unit Square O Radom Lie Segmets i the Uit Square Thomas A. Courtade Departmet of Electrical Egieerig Uiversity of Califoria Los Ageles, Califoria 90095 Email: tacourta@ee.ucla.edu I. INTRODUCTION Let Q = [0, 1] [0,

More information

Hashing and Amortization

Hashing and Amortization Lecture Hashig ad Amortizatio Supplemetal readig i CLRS: Chapter ; Chapter 7 itro; Sectio 7.. Arrays ad Hashig Arrays are very useful. The items i a array are statically addressed, so that isertig, deletig,

More information

Lecture 27. Capacity of additive Gaussian noise channel and the sphere packing bound

Lecture 27. Capacity of additive Gaussian noise channel and the sphere packing bound Lecture 7 Ageda for the lecture Gaussia chael with average power costraits Capacity of additive Gaussia oise chael ad the sphere packig boud 7. Additive Gaussia oise chael Up to this poit, we have bee

More information

Distribution of Random Samples & Limit theorems

Distribution of Random Samples & Limit theorems STAT/MATH 395 A - PROBABILITY II UW Witer Quarter 2017 Néhémy Lim Distributio of Radom Samples & Limit theorems 1 Distributio of i.i.d. Samples Motivatig example. Assume that the goal of a study is to

More information

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen)

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen) Goodess-of-Fit Tests ad Categorical Data Aalysis (Devore Chapter Fourtee) MATH-252-01: Probability ad Statistics II Sprig 2019 Cotets 1 Chi-Squared Tests with Kow Probabilities 1 1.1 Chi-Squared Testig................

More information

1 Introduction to reducing variance in Monte Carlo simulations

1 Introduction to reducing variance in Monte Carlo simulations Copyright c 010 by Karl Sigma 1 Itroductio to reducig variace i Mote Carlo simulatios 11 Review of cofidece itervals for estimatig a mea I statistics, we estimate a ukow mea µ = E(X) of a distributio by

More information

1 Hash tables. 1.1 Implementation

1 Hash tables. 1.1 Implementation Lecture 8 Hash Tables, Uiversal Hash Fuctios, Balls ad Bis Scribes: Luke Johsto, Moses Charikar, G. Valiat Date: Oct 18, 2017 Adapted From Virgiia Williams lecture otes 1 Hash tables A hash table is a

More information

ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 1 MATH00030 SEMESTER / Statistics

ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 1 MATH00030 SEMESTER / Statistics ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 1 MATH00030 SEMESTER 1 018/019 DR. ANTHONY BROWN 8. Statistics 8.1. Measures of Cetre: Mea, Media ad Mode. If we have a series of umbers the

More information

Notes for Lecture 11

Notes for Lecture 11 U.C. Berkeley CS78: Computatioal Complexity Hadout N Professor Luca Trevisa 3/4/008 Notes for Lecture Eigevalues, Expasio, ad Radom Walks As usual by ow, let G = (V, E) be a udirected d-regular graph with

More information

This is an introductory course in Analysis of Variance and Design of Experiments.

This is an introductory course in Analysis of Variance and Design of Experiments. 1 Notes for M 384E, Wedesday, Jauary 21, 2009 (Please ote: I will ot pass out hard-copy class otes i future classes. If there are writte class otes, they will be posted o the web by the ight before class

More information

1 Review and Overview

1 Review and Overview DRAFT a fial versio will be posted shortly CS229T/STATS231: Statistical Learig Theory Lecturer: Tegyu Ma Lecture #3 Scribe: Migda Qiao October 1, 2013 1 Review ad Overview I the first half of this course,

More information

18.440, March 9, Stirling s formula

18.440, March 9, Stirling s formula Stirlig s formula 8.44, March 9, 9 The factorial fuctio! is importat i evaluatig biomial, hypergeometric, ad other probabilities. If is ot too large,! ca be computed directly, by calculators or computers.

More information

Information Theory and Statistics Lecture 4: Lempel-Ziv code

Information Theory and Statistics Lecture 4: Lempel-Ziv code Iformatio Theory ad Statistics Lecture 4: Lempel-Ziv code Łukasz Dębowski ldebowsk@ipipa.waw.pl Ph. D. Programme 203/204 Etropy rate is the limitig compressio rate Theorem For a statioary process (X i)

More information

Discrete Mathematics and Probability Theory Spring 2013 Anant Sahai Lecture 18

Discrete Mathematics and Probability Theory Spring 2013 Anant Sahai Lecture 18 EECS 70 Discrete Mathematics ad Probability Theory Sprig 2013 Aat Sahai Lecture 18 Iferece Oe of the major uses of probability is to provide a systematic framework to perform iferece uder ucertaity. A

More information

Lecture 6: Integration and the Mean Value Theorem. slope =

Lecture 6: Integration and the Mean Value Theorem. slope = Math 8 Istructor: Padraic Bartlett Lecture 6: Itegratio ad the Mea Value Theorem Week 6 Caltech 202 The Mea Value Theorem The Mea Value Theorem abbreviated MVT is the followig result: Theorem. Suppose

More information

Let us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f.

Let us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f. Lecture 5 Let us give oe more example of MLE. Example 3. The uiform distributio U[0, ] o the iterval [0, ] has p.d.f. { 1 f(x =, 0 x, 0, otherwise The likelihood fuctio ϕ( = f(x i = 1 I(X 1,..., X [0,

More information

Math 113, Calculus II Winter 2007 Final Exam Solutions

Math 113, Calculus II Winter 2007 Final Exam Solutions Math, Calculus II Witer 7 Fial Exam Solutios (5 poits) Use the limit defiitio of the defiite itegral ad the sum formulas to compute x x + dx The check your aswer usig the Evaluatio Theorem Solutio: I this

More information

Approximations and more PMFs and PDFs

Approximations and more PMFs and PDFs Approximatios ad more PMFs ad PDFs Saad Meimeh 1 Approximatio of biomial with Poisso Cosider the biomial distributio ( b(k,,p = p k (1 p k, k λ: k Assume that is large, ad p is small, but p λ at the limit.

More information