arxiv: v1 [cs.ne] 2 Nov 2017


Noname manuscript No. (will be inserted by the editor)

Running Time Analysis of the (1+1)-EA for OneMax and LeadingOnes under Bit-wise Noise

Chao Qian · Chao Bian · Wu Jiang · Ke Tang

Received: date / Accepted: date

A preliminary version of this paper has appeared at GECCO'17.

C. Qian, School of Computer Science and Technology, University of Science and Technology of China, Hefei, China. chaoqian@ustc.edu.cn
C. Bian, School of Computer Science and Technology, University of Science and Technology of China, Hefei, China. biancht@mail.ustc.edu.cn
W. Jiang, School of Computer Science and Technology, University of Science and Technology of China, Hefei, China. jw1992@mail.ustc.edu.cn
K. Tang, Department of Computer Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China. tangk3@sustc.edu.cn

Abstract In many real-world optimization problems, the objective function evaluation is subject to noise, and we cannot obtain the exact objective value. Evolutionary algorithms (EAs), a type of general-purpose randomized optimization algorithm, have been shown to be able to solve noisy optimization problems well. However, previous theoretical analyses of EAs mainly focused on noise-free optimization, which makes the theoretical understanding largely insufficient. Meanwhile, the few existing theoretical studies under noise often considered the one-bit noise model, which flips a randomly chosen bit of a solution before evaluation, while in many realistic applications, several bits of a solution can be changed simultaneously. In this paper, we study a natural extension of one-bit noise, the bit-wise noise model, which independently flips each bit of a solution with some probability. We analyze the running time of the (1+1)-EA solving OneMax and LeadingOnes under bit-wise noise for the first time, and derive the ranges of the noise level

for polynomial and super-polynomial running time bounds. The analysis on LeadingOnes under bit-wise noise can be easily transferred to one-bit noise, and improves the previously known results. Since our analysis discloses that the (1+1)-EA can be efficient only under low noise levels, we also study whether the sampling strategy can bring robustness to noise. We prove that using sampling can significantly increase the largest noise level allowing a polynomial running time, that is, sampling is robust to noise.

Keywords Noisy optimization · evolutionary algorithms · sampling · running time analysis · computational complexity

1 Introduction

In real-world optimization tasks, the exact objective (i.e., fitness) function evaluation of candidate solutions is often impossible; instead, we can obtain only a noisy one due to a wide range of uncertainties [22]. For example, in machine learning, a prediction model is evaluated only on a limited amount of data, which makes the estimated performance deviate from the true performance; in social network analysis, computing the influence spread objective of a set of users is #P-hard [8], and thus is often estimated by simulating the random diffusion process [23], which brings noise. In the presence of noise, the difficulty of solving an optimization problem may increase. Evolutionary algorithms (EAs) [5], inspired by natural phenomena, are a type of randomized metaheuristic optimization algorithm. They are likely to be able to handle noise, since the corresponding natural phenomena have been well processed in noisy natural environments. In fact, EAs have been successfully applied to solve many noisy optimization problems [7,24].

Compared with the application, the theoretical analysis of EAs is far behind. But in the last two decades, much effort has been devoted to the running time analysis (an essential theoretical aspect) of EAs. Numerous analytical results for EAs solving synthetic problems as well as combinatorial problems have been derived, e.g., [4, 25].
Meanwhile, a few general approaches for running time analysis have been proposed, e.g., drift analysis [,3,2], fitness-level methods [9,33], and switch analysis [35].

However, previous running time analyses of EAs mainly focused on noise-free optimization, where the fitness evaluation is exact. Only a few pieces of work on noisy evolutionary optimization have been reported. Droste [4] first analyzed the (1+1)-EA on the OneMax problem in the presence of one-bit noise and showed that the tight range of the noise probability $p$ allowing a polynomial running time is $O(\log n / n)$, where $n$ is the problem size. Gießen and Kötzing [20] recently studied the LeadingOnes problem, and proved that the expected running time is polynomial if $p \le 1/(6en^2)$ and exponential if $p = 1/2$.

For inefficient optimization of the (1+1)-EA under high noise levels, some implicit mechanisms of EAs were proved to be robust to noise. In [20], it was

shown that the (µ+1)-EA with a small population of size $\Theta(\log n)$ can solve OneMax in polynomial time even if the probability of one-bit noise reaches 1. The robustness of populations to noise was also proved in the setting of non-elitist EAs [0, 27]. However, Friedrich et al. [8] showed the limitation of populations by proving that the (µ+1)-EA needs super-polynomial time for solving OneMax under additive Gaussian noise $N(0, \sigma^2)$ with $\sigma^2 \ge n^3$. This difficulty can be overcome by the compact genetic algorithm (cGA) [8] and a simple Ant Colony Optimization (ACO) algorithm [7], both of which find the optimal solution in polynomial time with a high probability. ACO was also shown to be able to efficiently find solutions with reasonable approximations on some instances of single destination shortest paths problems with edge weights disturbed by noise [2,6,34].

The ability of explicit noise handling strategies was also theoretically studied. Qian et al. [30] proved that the threshold selection strategy is robust to noise: the expected running time of the (1+1)-EA using threshold selection on OneMax under one-bit noise is always polynomial regardless of the noise level. For the (1+1)-EA solving OneMax and LeadingOnes under one-bit or additive Gaussian noise, the sampling strategy was shown to be able to reduce the running time from exponential to polynomial in high noise levels [29]. Akimoto et al. [2] also proved that sampling with a large sample size can make optimization under additive unbiased noise behave as optimization in a noise-free environment. The interplay between sampling and implicit noise-handling mechanisms (e.g., crossover) has been statistically studied in [9].

The studies mentioned above mainly considered the one-bit noise model, which flips a random bit of a solution before evaluation with probability $p$. However, a noise model that can change several bits of a solution simultaneously may be more realistic and needs to be studied, as mentioned in the first noisy theoretical work [4].
In this paper, we study the bit-wise noise model, which is characterized by a pair $(p, q)$ of parameters. It happens with probability $p$, and independently flips each bit of a solution with probability $q$ before evaluation. We analyze the running time of the (1+1)-EA solving OneMax and LeadingOnes under bit-wise noise with two specific parameter settings $(p, 1/n)$ and $(1, q)$. The ranges of $p$ and $q$ for a polynomial upper bound and a super-polynomial lower bound are derived, as shown in the middle rows of Table 1. For the (1+1)-EA on LeadingOnes, we also transfer the running time bounds from bit-wise noise $(p, 1/n)$ to one-bit noise by using the same proof procedure. As shown in the bottom right of Table 1, our results improve the previously known ones [20].

Note that for the (1+1)-EA solving the LeadingOnes problem, the current analysis (as shown in the last column of Table 1) does not cover all the ranges of $p$ and $q$. We thus conduct experiments to estimate the expected running time for the uncovered values of $p$ and $q$. The empirical results show that the currently derived ranges of $p$ and $q$ allowing a polynomial running time are possibly tight.

Table 1 For the running time of the (1+1)-EA on OneMax and LeadingOnes under prior noise models, the ranges of noise parameters for a polynomial upper bound and a super-polynomial lower bound are shown below.

(1+1)-EA | OneMax | LeadingOnes
bit-wise noise $(p, 1/n)$ | $O(\log n/n)$, $\omega(\log n/n)$ | $O(\log n/n^2)$, $\omega(\log n/n)$
bit-wise noise $(1, q)$ | $O(\log n/n^2)$, $\omega(\log n/n^2)$ [20] | $O(\log n/n^3)$, $\omega(\log n/n^2)$
one-bit noise | $O(\log n/n)$, $\omega(\log n/n)$ [4] | $[0, 1/(6en^2)]$, $1/2$ [20]; $O(\log n/n^2)$, $\omega(\log n/n)$

Table 2 For the running time of the (1+1)-EA using sampling on OneMax and LeadingOnes under prior noise models, the ranges of noise parameters for a polynomial upper bound and a super-polynomial lower bound are shown below.

(1+1)-EA using sampling | OneMax | LeadingOnes
bit-wise noise $(p, 1/n)$ | $[0, 1]$, $\emptyset$ | $[0, 1]$, $\emptyset$
bit-wise noise $(1, q)$ | $1/2 - 1/n^{O(1)}$, $1/2 - 1/n^{\omega(1)} \cup [1/2, 1]$ | $O(\log n/n)$, $\omega(\log n/n)$
one-bit noise | $[0, 1]$, $\emptyset$ | $[0, 1]$, $\emptyset$

From the results in Table 1, we find that the (1+1)-EA is efficient only under low noise levels. For example, for the (1+1)-EA solving OneMax under bit-wise noise $(p, 1/n)$, the expected running time is polynomial only when $p = O(\log n/n)$. We then study whether the sampling strategy can bring robustness to noise. Sampling is a popular way to cope with noise in fitness evaluation [3], which, instead of evaluating the fitness of one solution only once, evaluates the fitness several times and then uses the average to approximate the true fitness. We analyze the running time of the (1+1)-EA using sampling under both bit-wise noise and one-bit noise. The ranges of $p$ and $q$ for a polynomial upper bound and a super-polynomial lower bound are shown in Table 2. Note that the analysis covers all the ranges of $p$ and $q$. Compared with the results in Table 1, we find that using sampling significantly improves the noise-tolerance ability. For example, by using sampling, the (1+1)-EA now can always solve OneMax under bit-wise noise $(p, 1/n)$ in polynomial time.

From the analysis procedure, we also find the reason why sampling is effective or not. Let $f(x)$ and $f^n(x)$ denote the true and noisy fitness of a solution, respectively.
For two solutions $x$ and $y$ with $f(x) > f(y)$, when the noise level is high (i.e., the values of $p$ and $q$ are large), the probability $P(f^n(x) \le f^n(y))$ (i.e., the truly worse solution $y$ appears to be better) becomes large, which will mislead the search direction and then lead to a super-polynomial running time. In such a situation, if the expected gap between $f^n(x)$ and $f^n(y)$ is positive, sampling will increase this trend and make $P(f^n(x) \le f^n(y))$ sufficiently small; if it is negative (e.g., on OneMax under bit-wise noise $(1, q)$ with $q \ge 1/2$), sampling will continue to increase $P(f^n(x) \le f^n(y))$, and obviously will not work. We also note that if the positive gap between $f^n(x)$ and $f^n(y)$ is too small (e.g., on OneMax under bit-wise noise $(1, q)$ with $q = 1/2 - 1/n^{\omega(1)}$), a polynomial sample size will not be sufficient and sampling also fails to guarantee a polynomial running time.

This paper extends our preliminary work [28]. Since the theoretical analysis on the LeadingOnes problem is not complete, we add experiments to complement the theoretical results (i.e., Section 4.4). We also add the robustness analysis of sampling to noise (i.e., Section 5). Note that the robustness of sampling to one-bit noise has been studied in our previous work [29]. It was shown that sampling can reduce the running time of the (1+1)-EA from exponential to polynomial on OneMax when the noise probability $p = 1$ as well as on LeadingOnes when $p = 1/2$. Therefore, our results here are more general. We prove that sampling is effective for any value of $p$, as shown in the last row of Table 2. Furthermore, we analyze the robustness of sampling to bit-wise noise for the first time.

The rest of this paper is organized as follows. Section 2 introduces some preliminaries. The running time analysis of the (1+1)-EA on OneMax and LeadingOnes under noise is presented in Sections 3 and 4, respectively. Section 5 gives the analysis of the (1+1)-EA using sampling. Section 6 concludes the paper.

2 Preliminaries

In this section, we first introduce the optimization problems, evolutionary algorithms and noise models studied in this paper, respectively, then introduce the sampling strategy, and finally present the analysis tools that we use throughout this paper.

2.1 OneMax and LeadingOnes

In this paper, we use two well-known pseudo-Boolean functions OneMax and LeadingOnes. The OneMax problem as presented in Definition 1 aims to maximize the number of 1-bits of a solution. The LeadingOnes problem as presented in Definition 2 aims to maximize the number of consecutive 1-bits counting from the left of a solution. Their optimal solution is $11\ldots1$ (briefly denoted as $1^n$). It has been shown that the expected running time of the (1+1)-EA on OneMax and LeadingOnes is $\Theta(n \log n)$ and $\Theta(n^2)$, respectively [5].
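Definition 1 (OneMax) The OneMax Problem of size $n$ is to find an $n$ bits binary string $x^*$ such that
$$x^* = \arg\max_{x \in \{0,1\}^n} \Big( f(x) = \sum_{i=1}^{n} x_i \Big).$$

Definition 2 (LeadingOnes) The LeadingOnes Problem of size $n$ is to find an $n$ bits binary string $x^*$ such that
$$x^* = \arg\max_{x \in \{0,1\}^n} \Big( f(x) = \sum_{i=1}^{n} \prod_{j=1}^{i} x_j \Big).$$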
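As a concrete reading of Definitions 1 and 2, the two fitness functions can be sketched in Python as follows. The function names `onemax` and `leadingones` and the list-of-ints representation of a solution are illustrative choices, not part of the original text.

```python
from typing import Sequence


def onemax(x: Sequence[int]) -> int:
    """Number of 1-bits of x (Definition 1)."""
    return sum(x)


def leadingones(x: Sequence[int]) -> int:
    """Number of consecutive 1-bits counted from the left of x (Definition 2)."""
    count = 0
    for bit in x:
        if bit != 1:
            break  # the first 0-bit ends the run of leading 1-bits
        count += 1
    return count
```

For both functions the all-ones string of length n attains the maximum value n.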

2.2 Bit-wise Noise

There are mainly two kinds of noise models: prior and posterior [20,22]. The prior noise comes from the variation on a solution, while the posterior noise comes from the variation on the fitness of a solution. Previous theoretical analyses often focused on a specific prior noise model, one-bit noise. As presented in Definition 3, it flips a random bit of a solution before evaluation with probability $p$. However, in many realistic applications, noise can change several bits of a solution simultaneously rather than only one bit. We thus consider the bit-wise noise model. As presented in Definition 4, it happens with probability $p$, and independently flips each bit of a solution with probability $q$ before evaluation.

To the best of our knowledge, only bit-wise noise with $p = 1$ and $q \in [0, 1]$ has been recently studied. Gießen and Kötzing [20] proved that for the (1+1)-EA on OneMax, the expected running time is polynomial if $q = O(\log n/n^2)$ and super-polynomial if $q = \omega(\log n/n^2)$. In this paper, we study two specific bit-wise noise models: $p \in [0, 1] \wedge q = 1/n$ and $p = 1 \wedge q \in [0, 1]$, which are briefly denoted as bit-wise noise $(p, 1/n)$ and bit-wise noise $(1, q)$, respectively.

Definition 3 (One-bit Noise) Given a parameter $p \in [0, 1]$, let $f^n(x)$ and $f(x)$ denote the noisy and true fitness of a binary solution $x \in \{0, 1\}^n$, respectively, then
$$f^n(x) = \begin{cases} f(x) & \text{with probability } 1 - p, \\ f(x') & \text{with probability } p, \end{cases}$$
where $x'$ is generated by flipping a uniformly randomly chosen bit of $x$.

Definition 4 (Bit-wise Noise) Given parameters $p, q \in [0, 1]$, let $f^n(x)$ and $f(x)$ denote the noisy and true fitness of a binary solution $x \in \{0, 1\}^n$, respectively, then
$$f^n(x) = \begin{cases} f(x) & \text{with probability } 1 - p, \\ f(x') & \text{with probability } p, \end{cases}$$
where $x'$ is generated by independently flipping each bit of $x$ with probability $q$.

2.3 (1+1)-EA

The (1+1)-EA as described in Algorithm 1 is studied in this paper. For noisy optimization, only a noisy fitness value $f^n(x)$ instead of the exact one $f(x)$ can be accessed, and thus step 4 of Algorithm 1 changes to be "if $f^n(x') \ge f^n(x)$". Note that the reevaluation strategy is used as in [2,4,20].
That is, besides evaluating $f^n(x')$, $f^n(x)$ will be reevaluated in each iteration of the (1+1)-EA. The running time is usually defined as the number of fitness evaluations needed to find an optimal solution w.r.t. the true fitness function $f$ for the first time [2,4,20].

Algorithm 1 ((1+1)-EA) Given a function $f$ over $\{0, 1\}^n$ to be maximized, it consists of the following steps:
1. $x :=$ uniformly randomly selected from $\{0, 1\}^n$.
2. Repeat until the termination condition is met
3.   $x' :=$ flip each bit of $x$ independently with prob. $1/n$.
4.   if $f(x') \ge f(x)$
5.     $x := x'$.

2.4 Sampling

In noisy evolutionary optimization, sampling as described in Definition 5 has often been used to reduce the negative effect of noise [,6]. It approximates the true fitness $f(x)$ using the average of a number of random evaluations. For the (1+1)-EA using sampling, step 4 of Algorithm 1 changes to be "if $\hat{f}(x') \ge \hat{f}(x)$". Note that $m = 1$ is equivalent to sampling not being used. The effectiveness of sampling was not theoretically analyzed until recently. Qian et al. [29] proved that sampling is robust to one-bit noise and additive Gaussian noise. Particularly, under one-bit noise, it was shown that sampling can reduce the running time from exponential to polynomial for the (1+1)-EA solving OneMax when the noise probability $p = 1$ and LeadingOnes when $p = 1/2$.

Definition 5 (Sampling) Sampling first evaluates the fitness of a solution $m$ times independently and obtains the noisy fitness values $f^n_1(x), \ldots, f^n_m(x)$, and then outputs their average, i.e.,
$$\hat{f}(x) = \frac{1}{m} \sum_{i=1}^{m} f^n_i(x).$$

2.5 Analysis Tools

The process of the (1+1)-EA solving OneMax or LeadingOnes can be directly modeled as a Markov chain $\{\xi_t\}_{t=0}^{+\infty}$. We only need to take the solution space $\{0, 1\}^n$ as the chain's state space (i.e., $\xi_t \in \mathcal{X} = \{0, 1\}^n$), and take the optimal solution $1^n$ as the chain's optimal state (i.e., $\mathcal{X}^* = \{1^n\}$). Given a Markov chain $\{\xi_t\}_{t=0}^{+\infty}$ and $\xi_{\hat{t}} = x$, we define its first hitting time (FHT) as $\tau = \min\{t \mid \xi_{\hat{t}+t} \in \mathcal{X}^*, t \ge 0\}$. The mathematical expectation of $\tau$, $E(\tau \mid \xi_{\hat{t}} = x) = \sum_{i=0}^{+\infty} i \cdot P(\tau = i)$, is called the expected first hitting time (EFHT) starting from $\xi_{\hat{t}} = x$.
If $\xi_0$ is drawn from a distribution $\pi_0$, $E(\tau \mid \xi_0 \sim \pi_0) = \sum_{x \in \mathcal{X}} \pi_0(x) E(\tau \mid \xi_0 = x)$ is called the EFHT of the Markov chain over the initial distribution $\pi_0$. Thus, the expected running time of the (1+1)-EA starting from $\xi_0 \sim \pi_0$ is equal to $1 + 2 \cdot E(\tau \mid \xi_0 \sim \pi_0)$, where the term 1 corresponds to evaluating the initial solution, and the factor 2 corresponds to evaluating the offspring solution $x'$ and reevaluating the parent solution $x$ in each iteration. If using sampling, the expected running time of the (1+1)-EA is $m + 2m \cdot E(\tau \mid \xi_0 \sim \pi_0)$, since estimating the fitness of a solution needs $m$ independent fitness evaluations. Note that we consider the expected running time of the (1+1)-EA starting from a uniform initial distribution in this paper.

In the following, we give three drift theorems that will be used to derive the EFHT of Markov chains in the paper.

Lemma 1 (Additive Drift [2]) Given a Markov chain $\{\xi_t\}_{t=0}^{+\infty}$ and a distance function $V(x)$, if for any $t \ge 0$ and any $\xi_t$ with $V(\xi_t) > 0$, there exists a real number $c > 0$ such that
$$E(V(\xi_t) - V(\xi_{t+1}) \mid \xi_t) \ge c,$$
then the EFHT satisfies that $E(\tau \mid \xi_0) \le V(\xi_0)/c$.

Lemma 2 (Simplified Drift [26]) Let $X_t$, $t \ge 0$, be real-valued random variables describing a stochastic process. Suppose there exists an interval $[a, b] \subseteq \mathbb{R}$, two constants $\delta, \epsilon > 0$ and, possibly depending on $l := b - a$, a function $r(l)$ satisfying $1 \le r(l) = o(l/\log(l))$ such that for all $t \ge 0$ the following two conditions hold:
1. $E(X_t - X_{t+1} \mid a < X_t < b) \le -\epsilon$,
2. $P(|X_{t+1} - X_t| \ge j \mid X_t > a) \le \frac{r(l)}{(1+\delta)^j}$ for $j \in \mathbb{N}_0$.
Then there is a constant $c > 0$ such that for $T := \min\{t \ge 0 : X_t \le a \mid X_0 \ge b\}$ it holds $P(T \le 2^{cl/r(l)}) = 2^{-\Omega(l/r(l))}$.

Lemma 3 (Simplified Drift with Self-loops [3]) Let $X_t$, $t \ge 0$, be real-valued random variables describing a stochastic process. Suppose there exists an interval $[a, b] \subseteq \mathbb{R}$, two constants $\delta, \epsilon > 0$ and, possibly depending on $l := b - a$, a function $r(l)$ satisfying $1 \le r(l) = o(l/\log(l))$ such that for all $t \ge 0$ the following two conditions hold:
1. $\forall a < i < b$: $E(X_t - X_{t+1} \mid X_t = i) \le -\epsilon \cdot P(X_{t+1} \ne i \mid X_t = i)$,
2. $\forall i > a, j \in \mathbb{N}_0$: $P(|X_{t+1} - X_t| \ge j \mid X_t = i) \le \frac{r(l)}{(1+\delta)^j} \cdot P(X_{t+1} \ne i \mid X_t = i)$.
Then there is a constant $c > 0$ such that for $T := \min\{t \ge 0 : X_t \le a \mid X_0 \ge b\}$ it holds $P(T \le 2^{cl/r(l)}) = 2^{-\Omega(l/r(l))}$.
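The preliminaries above (Definitions 3 to 5, Algorithm 1 with reevaluation, and the evaluation count $m + 2m$ per hitting-time step) can be put together in a minimal Python sketch. All function and parameter names (`one_bit_noise`, `bit_wise_noise`, `one_plus_one_ea`, `max_iters`) are illustrative assumptions, and the sketch additionally assumes that the optimum has true fitness n, which holds for OneMax and LeadingOnes. The true fitness is consulted only to detect the first hitting time, not by the selection step.

```python
import random
from typing import Callable, List, Tuple

BitString = List[int]


def onemax(x: BitString) -> int:
    return sum(x)


def one_bit_noise(f, x: BitString, p: float, rng: random.Random) -> int:
    """Definition 3: with probability p, evaluate x with one uniformly chosen bit flipped."""
    if rng.random() < p:
        y = list(x)
        i = rng.randrange(len(y))
        y[i] = 1 - y[i]
        return f(y)
    return f(x)


def bit_wise_noise(f, x: BitString, p: float, q: float, rng: random.Random) -> int:
    """Definition 4: with probability p, flip each bit independently with probability q."""
    if rng.random() < p:
        return f([1 - b if rng.random() < q else b for b in x])
    return f(x)


def one_plus_one_ea(f, noisy: Callable[[BitString], float], n: int, m: int,
                    rng: random.Random, max_iters: int) -> Tuple[bool, int, int]:
    """(1+1)-EA with reevaluation and sample size m (m = 1 means no sampling).

    Returns (optimum found, iterations, fitness evaluations).
    """
    def estimate(x: BitString) -> float:
        # Definition 5: average of m independent noisy evaluations.
        return sum(noisy(x) for _ in range(m)) / m

    x = [rng.randrange(2) for _ in range(n)]
    estimate(x)              # evaluation of the initial solution
    evaluations = m
    iterations = 0
    while f(x) != n and iterations < max_iters:  # assumes the optimal value is n
        y = [1 - b if rng.random() < 1.0 / n else b for b in x]
        fy, fx = estimate(y), estimate(x)        # offspring and reevaluated parent
        evaluations += 2 * m
        if fy >= fx:
            x = y
        iterations += 1
    return f(x) == n, iterations, evaluations
```

A possible call is `one_plus_one_ea(onemax, lambda x: bit_wise_noise(onemax, x, 0.1, 1 / 20, rng), 20, 1, rng, 10**5)` with `rng = random.Random(0)`. The returned evaluation count always equals m + 2m times the number of iterations, mirroring the running time expression in the text.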

3 The OneMax problem

In this section, we analyze the running time of the (1+1)-EA on OneMax under bit-wise noise. Note that for bit-wise noise $(1, q)$, it has been proved that the expected running time is polynomial if and only if $q = O(\log n/n^2)$, as shown in Theorem 1.

Theorem 1 [20] For the (1+1)-EA on OneMax under bit-wise noise $(1, q)$, the expected running time is polynomial if $q = O(\log n/n^2)$ and super-polynomial if $q = \omega(\log n/n^2)$.

For bit-wise noise $(p, 1/n)$, we prove in Theorems 2 and 3 that the tight range of $p$ allowing a polynomial running time is $O(\log n/n)$. Instead of using the original drift theorems, we apply the upper and lower bounds of the (1+1)-EA on noisy OneMax in [20]. Let $x^k$ denote any solution with $k$ 1-bits, and $f^n(x^k)$ denote its noisy objective value, which is a random variable. Lemma 4 intuitively means that if the probability of recognizing the truly better solution by noisy evaluation is large, the running time can be polynomially upper bounded. On the contrary, Lemma 5 shows that if the probability of making a right comparison is small, the running time can be exponentially lower bounded. Both of them are proved by applying standard drift theorems, and can be used to simplify our analysis. Note that the original upper bound of the (1+1)-EA on noisy OneMax (i.e., Theorem 5 in [20]) requires that Eq. (1) holds with only $j = k$, but the proof actually also requires that noisy OneMax satisfies the monotonicity property, i.e., for all $j < k < n$, $P(f^n(x^k) < f^n(x^{k+1})) \le P(f^n(x^j) < f^n(x^{k+1}))$. We have combined these two conditions in Lemma 4 by requiring Eq. (1) to hold with any $j \le k$ instead of only $j = k$.

Lemma 4 [20] Suppose there is a positive constant $c \le 1/15$ and some $2 < l \le n/2$ such that
$$\forall j \le k < n: \ P(f^n(x^j) < f^n(x^{k+1})) \ge 1 - \frac{l}{n};$$
$$\forall j \le k < n - l: \ P(f^n(x^j) < f^n(x^{k+1})) \ge 1 - c\,\frac{n-k}{n},$$ (1)
then the (1+1)-EA optimizes $f$ in expectation in $O(n \log n) + n 2^{O(l)}$ iterations.

Lemma 5 [20] Suppose there is some $l \le n/4$ and a constant $c \ge 16$ such that
$$\forall n - l \le k < n: \ P(f^n(x^k) < f^n(x^{k+1})) \le 1 - c\,\frac{n-k}{n},$$
then the (1+1)-EA optimizes $f$ in $2^{\Omega(l)}$ iterations with a high probability.
Theorem 2 For the (1+1)-EA on OneMax under bit-wise noise $(p, 1/n)$, the expected running time is polynomial if $p = O(\log n/n)$.

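Proof We prove it by using Lemma 4. For any positive constant $b$, suppose that $p \le b \log n/n$. We set the two parameters in Lemma 4 as $c = \min\{\frac{1}{15}, b\}$ and $l = \frac{2b \log n}{c} \in (2, \frac{n}{2}]$. For any $j \le k < n$, $f^n(x^j) \ge f^n(x^{k+1})$ implies that $f^n(x^j) \ge k + 1$ or $f^n(x^{k+1}) \le k$, either of which happens with probability at most $p$. By the union bound, we get
$$\forall j \le k < n: \ P(f^n(x^j) \ge f^n(x^{k+1})) \le 2p \le \frac{2b \log n}{n} = \frac{lc}{n} \le \frac{l}{n}.$$
For any $j \le k < n - l$, we easily get
$$P(f^n(x^j) \ge f^n(x^{k+1})) \le \frac{lc}{n} < c\,\frac{n-k}{n}.$$
By Lemma 4, we know that the expected running time is $O(n \log n) + n 2^{O(2b \log n/c)}$, i.e., polynomial.

Theorem 3 For the (1+1)-EA on OneMax under bit-wise noise $(p, 1/n)$, the expected running time is super-polynomial if $p = \omega(\log n/n) \cap 1 - \omega(\log n/n)$ and exponential if $p = 1 - O(\log n/n)$.

Proof We use Lemma 5 to prove it. Let $c = 16$. The case $p = \omega(\log n/n) \cap 1 - \omega(\log n/n)$ is first analyzed. For any positive constant $b$, let $l = b \log n$. For any $k \ge n - l$, we get
$$P(f^n(x^k) \ge f^n(x^{k+1})) \ge P(f^n(x^k) = k) \cdot P(f^n(x^{k+1}) \le k).$$
To make $f^n(x^k) = k$, it is sufficient that the noise does not happen, i.e., $P(f^n(x^k) = k) \ge 1 - p$. To make $f^n(x^{k+1}) \le k$, it is sufficient to flip one 1-bit and keep other bits unchanged by noise, i.e., $P(f^n(x^{k+1}) \le k) \ge p \cdot \frac{k+1}{n}(1 - \frac{1}{n})^{n-1}$. Thus,
$$P(f^n(x^k) \ge f^n(x^{k+1})) \ge (1 - p) \cdot p\,\frac{k+1}{en} = \omega(\log n/n).$$
Since $c\,\frac{n-k}{n} \le \frac{cl}{n} = \frac{cb \log n}{n}$, the condition of Lemma 5 holds. Thus, the expected running time is $2^{\Omega(b \log n)}$ (where $b$ is any constant), i.e., super-polynomial.

For the case $p = 1 - O(\log n/n)$, let $l = \sqrt{n}$. We use another lower bound $p(1 - \frac{1}{n})^n$ for $P(f^n(x^k) = k)$, since it is sufficient that no bit flips by noise. Thus, we have
$$P(f^n(x^k) \ge f^n(x^{k+1})) \ge p\Big(1 - \frac{1}{n}\Big)^n \cdot p\,\frac{k+1}{en} = \Omega(1).$$
Since $c\,\frac{n-k}{n} \le \frac{c\sqrt{n}}{n} = \frac{c}{\sqrt{n}}$, the condition of Lemma 5 holds. Thus, the expected running time is $2^{\Omega(\sqrt{n})}$, i.e., exponential.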
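The union-bound step in the proof of Theorem 2 can be checked numerically on small instances. The following Python sketch computes the exact distribution of the noisy OneMax value under bit-wise noise $(p, q)$ and from it the exact misranking probability $P(f^n(x^j) \ge f^n(x^{k+1}))$, assuming the two noisy evaluations are independent. The helper names (`binom_pmf`, `noisy_onemax_dist`, `misrank_probability`) are hypothetical and only serve this illustration.

```python
from math import comb
from typing import List


def binom_pmf(trials: int, prob: float) -> List[float]:
    """Probability mass function of the binomial distribution B(trials, prob)."""
    return [comb(trials, s) * prob**s * (1 - prob)**(trials - s)
            for s in range(trials + 1)]


def noisy_onemax_dist(n: int, k: int, p: float, q: float) -> List[float]:
    """Distribution of the noisy OneMax value of a solution with k 1-bits.

    With probability 1 - p the value is k; with probability p each bit is
    flipped independently with probability q, giving k - a + b when a 1-bits
    and b 0-bits are flipped.
    """
    dist = [0.0] * (n + 1)
    dist[k] += 1.0 - p
    flipped_ones = binom_pmf(k, q)
    flipped_zeros = binom_pmf(n - k, q)
    for a, pa in enumerate(flipped_ones):
        for b, pb in enumerate(flipped_zeros):
            dist[k - a + b] += p * pa * pb
    return dist


def misrank_probability(n: int, j: int, k: int, p: float, q: float) -> float:
    """Exact P(f^n(x^j) >= f^n(x^{k+1})) for independent noisy evaluations."""
    worse = noisy_onemax_dist(n, j, p, q)
    better = noisy_onemax_dist(n, k + 1, p, q)
    return sum(worse[u] * better[v] for u in range(n + 1) for v in range(u + 1))
```

For example, with n = 20, p = 0.05 and q = 1/20, evaluating `misrank_probability(20, j, k, 0.05, 1 / 20)` over all j ≤ k < 20 should never exceed 2p, in line with the bound used in the proof.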

4 The LeadingOnes problem

In this section, we first analyze the running time of the (1+1)-EA on the LeadingOnes problem under bit-wise noise $(p, 1/n)$ and bit-wise noise $(1, q)$, respectively. Then, we transfer the analysis from bit-wise noise $(p, 1/n)$ to one-bit noise; the results are complementary to the known ones recently derived in [20]. However, our analysis does not cover all the ranges of $p$ and $q$. For those values of $p$ and $q$ where no theoretical results are known, we conduct experiments to empirically investigate the running time.

4.1 Bit-wise Noise $(p, 1/n)$

For bit-wise noise $(p, 1/n)$, we prove in Theorems 4-6 that the expected running time is polynomial if $p = O(\log n/n^2)$ and super-polynomial if $p = \omega(\log n/n)$. Their proofs are accomplished by applying additive drift analysis, the simplified drift theorem with self-loops and the simplified drift theorem, respectively.

Theorem 4 For the (1+1)-EA on LeadingOnes under bit-wise noise $(p, 1/n)$, the expected running time is polynomial if $p = O(\log n/n^2)$.

Proof We use Lemma 1 to prove it. For any positive constant $b$, suppose that $p \le b \log n/n^2$. Let $LO(x)$ denote the true number of leading 1-bits of a solution $x$. We first construct a distance function $V(x)$ as, for any $x$ with $LO(x) = i$,
$$V(x) = \Big(1 + \frac{c}{n}\Big)^n - \Big(1 + \frac{c}{n}\Big)^i,$$
where $c = 4b \log n + 1$. It is easy to verify that $V(x \in \mathcal{X}^* = \{1^n\}) = 0$ and $V(x \notin \mathcal{X}^*) > 0$.

Then, we investigate $E(V(\xi_t) - V(\xi_{t+1}) \mid \xi_t = x)$ for any $x$ with $LO(x) < n$ (i.e., $x \notin \mathcal{X}^*$). Assume that currently $LO(x) = i$, where $0 \le i \le n - 1$. Let $P_{mut}(x, x')$ denote the probability of generating $x'$ by mutation on $x$. We divide the drift into two parts: positive $E^+$ and negative $E^-$. That is,
$$E(V(\xi_t) - V(\xi_{t+1}) \mid \xi_t = x) = E^+ - E^-,$$
where
$$E^+ = \sum_{x': LO(x') > i} P_{mut}(x, x') \cdot P(f^n(x') \ge f^n(x)) \cdot (V(x) - V(x')),$$ (2)
$$E^- = \sum_{x': LO(x') < i} P_{mut}(x, x') \cdot P(f^n(x') \ge f^n(x)) \cdot (V(x') - V(x)).$$ (3)
For the positive drift, we need to consider that the number of leading 1-bits is increased. By mutation, we have
$$\sum_{x': LO(x') > i} P_{mut}(x, x') = P(LO(x') \ge i + 1) = \Big(1 - \frac{1}{n}\Big)^i \frac{1}{n},$$ (4)

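since it needs to flip the $(i+1)$-th bit (which must be 0) of $x$ and keep the $i$ leading 1-bits unchanged. For any $x'$ with $LO(x') \ge i + 1$, $f^n(x') < f^n(x)$ implies that $f^n(x') \le i - 1$ or $f^n(x) \ge i + 1$. Note that,
$$P(f^n(x') \le i - 1) = p\Big(1 - \Big(1 - \frac{1}{n}\Big)^i\Big),$$ (5)
since at least one of the first $i$ leading 1-bits of $x'$ needs to be flipped by noise;
$$P(f^n(x) \ge i + 1) = p\Big(1 - \frac{1}{n}\Big)^i \frac{1}{n},$$ (6)
since it needs to flip the first 0-bit of $x$ and keep the leading 1-bits unchanged by noise. By the union bound, we get
$$P(f^n(x') \ge f^n(x)) = 1 - P(f^n(x') < f^n(x)) \ge 1 - p\Big(1 - \Big(1 - \frac{1}{n}\Big)^{i+1}\Big) \ge 1 - p\,\frac{i+1}{n} \ge \frac{1}{2},$$ (7)
where the last inequality is by $p = O(\log n/n^2)$. Furthermore, for any $x'$ with $LO(x') \ge i + 1$,
$$V(x) - V(x') \ge \Big(1 + \frac{c}{n}\Big)^{i+1} - \Big(1 + \frac{c}{n}\Big)^i = \frac{c}{n}\Big(1 + \frac{c}{n}\Big)^i.$$ (8)
By combining Eqs. (4), (7) and (8), we have
$$E^+ \ge \Big(1 - \frac{1}{n}\Big)^i \frac{1}{n} \cdot \frac{1}{2} \cdot \frac{c}{n}\Big(1 + \frac{c}{n}\Big)^i \ge \frac{c}{6n^2}\Big(1 + \frac{c}{n}\Big)^i,$$
where the last inequality is by $(1 - \frac{1}{n})^i \ge (1 - \frac{1}{n})^{n-1} \ge \frac{1}{e} \ge \frac{1}{3}$.

For the negative drift, we need to consider that the number of leading 1-bits is decreased. By mutation, we have
$$\sum_{x': LO(x') < i} P_{mut}(x, x') = P(LO(x') \le i - 1) = 1 - \Big(1 - \frac{1}{n}\Big)^i,$$ (9)
since it needs to flip at least one leading 1-bit of $x$. For any $x'$ with $LO(x') \le i - 1$ (where $i \ge 1$), $f^n(x') \ge f^n(x)$ implies that $f^n(x') \ge i$ or $f^n(x) \le i - 1$. Note that,
$$P(f^n(x') \ge i) \le p \cdot \frac{1}{n}\Big(1 - \frac{1}{n}\Big)^{i-1},$$ (10)
since for the first $i$ bits of $x'$, it needs to flip the 0-bits (whose number is at least 1) and keep the 1-bits unchanged by noise;
$$P(f^n(x) \le i - 1) = p\Big(1 - \Big(1 - \frac{1}{n}\Big)^i\Big),$$ (11)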

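since at least one leading 1-bit of $x$ needs to be flipped by noise. By the union bound, we get
$$P(f^n(x') \ge f^n(x)) \le p - p\Big(1 - \frac{2}{n}\Big)\Big(1 - \frac{1}{n}\Big)^{i-1} \le p\,\frac{i+1}{n}.$$ (12)
Furthermore, for any $x'$ with $LO(x') \le i - 1$,
$$V(x') - V(x) \le \Big(1 + \frac{c}{n}\Big)^i - 1.$$ (13)
By combining Eqs. (9), (12) and (13), we have
$$E^- \le \Big(1 - \Big(1 - \frac{1}{n}\Big)^i\Big) \cdot p\,\frac{i+1}{n} \cdot \Big(\Big(1 + \frac{c}{n}\Big)^i - 1\Big) \le \Big(1 - \frac{1}{e}\Big) p \Big(1 + \frac{c}{n}\Big)^i \le \frac{2p}{3}\Big(1 + \frac{c}{n}\Big)^i.$$
Thus, by subtracting $E^-$ from $E^+$, we have
$$E(V(\xi_t) - V(\xi_{t+1}) \mid \xi_t = x) \ge \Big(1 + \frac{c}{n}\Big)^i \cdot \Big(\frac{c}{6n^2} - \frac{2p}{3}\Big) \ge \frac{4b \log n + 1}{6n^2} - \frac{2b \log n}{3n^2} = \frac{1}{6n^2},$$ (14)
where the second inequality is by $c = 4b \log n + 1$ and $p \le b \log n/n^2$. Note that $V(x) \le (1 + \frac{c}{n})^n \le e^c = e^{4b \log n + 1} = e n^{4b}$. By Lemma 1, we get
$$E(\tau \mid \xi_0) \le 6n^2 \cdot e n^{4b} = O(n^{4b+2}),$$
i.e., the expected running time is polynomial.

Theorem 5 For the (1+1)-EA on LeadingOnes under bit-wise noise $(p, 1/n)$, if $p = \omega(\log n/n) \cap o(1)$, the expected running time is super-polynomial.

Proof We use Lemma 3 to prove it. Let $X_t = |x|_0$ be the number of 0-bits of the solution $x$ after $t$ iterations of the (1+1)-EA. Let $c$ be any positive constant. We consider the interval $[0, c \log n]$, i.e., the parameters $a = 0$ (i.e., the global optimum) and $b = c \log n$ in Lemma 3.

Then, we analyze the drift $E(X_t - X_{t+1} \mid X_t = i)$ for $1 \le i < c \log n$. As in the proof of Theorem 4, we divide the drift into two parts: positive $E^+$ and negative $E^-$. That is,
$$E(X_t - X_{t+1} \mid X_t = i) = E^+ - E^-,$$
where
$$E^+ = \sum_{x': |x'|_0 < i} P_{mut}(x, x') \cdot P(f^n(x') \ge f^n(x)) \cdot (i - |x'|_0),$$
$$E^- = \sum_{x': |x'|_0 > i} P_{mut}(x, x') \cdot P(f^n(x') \ge f^n(x)) \cdot (|x'|_0 - i).$$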

For the positive drift, we need to consider that the number of 0-bits is decreased. For mutation on $x$ (where $|x|_0 = i$), let $X$ and $Y$ denote the number of flipped 0-bits and 1-bits, respectively. Then, $X \sim B(i, \frac{1}{n})$ and $Y \sim B(n - i, \frac{1}{n})$, where $B(\cdot, \cdot)$ is the binomial distribution. To estimate an upper bound on $E^+$, we assume that the offspring solution $x'$ with $|x'|_0 < i$ is always accepted. Thus, we have
$$E^+ \le \sum_{x': |x'|_0 < i} P_{mut}(x, x')(i - |x'|_0) = \sum_{k=1}^{i} k \cdot P(X - Y = k)$$ (15)
$$= \sum_{k=1}^{i} k \cdot \sum_{j=k}^{i} P(X = j) \cdot P(Y = j - k) = \sum_{j=1}^{i} \sum_{k=1}^{j} k \cdot P(X = j) \cdot P(Y = j - k) \le \sum_{j=1}^{i} j \cdot P(X = j) = \frac{i}{n}.$$
For the negative drift, we need to consider that the number of 0-bits is increased. We analyze the $n - i$ cases where only one 1-bit is flipped (i.e., $|x'|_0 = i + 1$), each of which happens with probability $\frac{1}{n}(1 - \frac{1}{n})^{n-1}$. Assume that $LO(x) = k \le n - i$. If the $j$-th (where $1 \le j \le k$) leading 1-bit is flipped, the offspring solution $x'$ will be accepted (i.e., $f^n(x') \ge f^n(x)$) if $f^n(x') \ge j - 1$ and $f^n(x) \le j - 1$. Note that,
$$P(f^n(x') \ge j - 1) = 1 - p + p\Big(1 - \frac{1}{n}\Big)^{j-1} \ge 1 - p\,\frac{j-1}{n} \ge \frac{1}{2},$$ (16)
where the equality is since it needs to keep the $j - 1$ leading 1-bits of $x'$ unchanged, and the last inequality is by $p = o(1)$;
$$P(f^n(x) \le j - 1) = p\Big(1 - \Big(1 - \frac{1}{n}\Big)^j\Big) \ge \frac{pj}{en} \ge \frac{pj}{3n},$$ (17)
where the equality is since at least one of the first $j$ leading 1-bits of $x$ needs to be flipped by noise. Thus, we get
$$P(f^n(x') \ge f^n(x)) \ge \frac{1}{2} \cdot \frac{pj}{3n} = \frac{pj}{6n}.$$ (18)
If one of the $n - i - k$ non-leading 1-bits is flipped, $LO(x') = LO(x) = k$. We can use the same analysis procedure as Eq. (7) in the proof of Theorem 4 to derive that
$$P(f^n(x') \ge f^n(x)) \ge 1 - p\,\frac{k+1}{n} \ge \frac{1}{2},$$ (19)

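where the second inequality is by $p = o(1)$. Combining all the $n - i$ cases, we get
$$E^- \ge \frac{1}{n}\Big(1 - \frac{1}{n}\Big)^{n-1} \cdot \Big(\sum_{j=1}^{k} \frac{pj}{6n} + \frac{n - i - k}{2}\Big) \cdot (i + 1 - i)$$ (20)
$$\ge \frac{1}{en}\Big(\frac{pk(k+1)}{12n} + \frac{n - i - k}{2}\Big) \ge \frac{pk^2}{36n^2} + \frac{n - i - k}{6n}.$$
By subtracting $E^-$ from $E^+$, we get
$$E(X_t - X_{t+1} \mid X_t = i) \le \frac{i}{n} - \frac{pk^2}{36n^2} - \frac{n - i - k}{6n}.$$
To investigate condition (1) of Lemma 3, we also need to analyze the probability $P(X_{t+1} \ne i \mid X_t = i)$. For $X_{t+1} \ne i$, it is necessary that at least one bit of $x$ is flipped and the offspring $x'$ is accepted. We consider two cases: (1) at least one of the $k$ leading 1-bits of $x$ is flipped; (2) the $k$ leading 1-bits of $x$ are not flipped and at least one of the last $n - k$ bits is flipped. For case (1), the mutation probability is $1 - (1 - \frac{1}{n})^k$ and the acceptance probability is at most $p\,\frac{k+1}{n}$ by Eq. (12). For case (2), the mutation probability is $(1 - \frac{1}{n})^k (1 - (1 - \frac{1}{n})^{n-k}) \le \frac{n-k}{n}$ and the acceptance probability is at most 1. Thus, we have
$$P(X_{t+1} \ne i \mid X_t = i) \le p + \frac{n-k}{n}.$$ (21)
When $k < n - np$, we have
$$E(X_t - X_{t+1} \mid X_t = i) \le \frac{i}{n} - \frac{n - i - k}{6n} \le -\frac{\frac{n-k}{2} + \frac{np}{2} - 7c \log n}{6n} \le -\frac{n-k}{12n} \le -\frac{1}{24}\Big(p + \frac{n-k}{n}\Big),$$ (22)
where the second inequality is by $n - k > np$ and $i < c \log n$, the third inequality is by $p = \omega(\log n/n)$ and the last is by $n - k > np$. When $k \ge n - np$, we have
$$E(X_t - X_{t+1} \mid X_t = i) \le \frac{i}{n} - \frac{pk^2}{36n^2} \le \frac{c \log n}{n} - \frac{p}{144} \le -\frac{p}{288} \le -\frac{1}{576}\Big(p + \frac{n-k}{n}\Big),$$ (23)
where the second inequality is by $p = o(1)$ and $i < c \log n$, the third is by $p = \omega(\log n/n)$ and the last is by $k \ge n - np$. Combining Eqs. (21), (22) and (23), we get that condition (1) of Lemma 3 holds with $\epsilon = \frac{1}{576}$.

For condition (2) of Lemma 3, we need to show $P(|X_{t+1} - X_t| \ge j \mid X_t = i) \le \frac{r(l)}{(1+\delta)^j} \cdot P(X_{t+1} \ne i \mid X_t = i)$ for $i \ge 1$. For $P(X_{t+1} \ne i \mid X_t = i)$, we analyze the $n$ cases where only one bit is flipped. Using the similar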

analysis procedure as $E^-$, except that flipping any bit rather than only a 1-bit is considered here, we easily get
$$P(X_{t+1} \ne i \mid X_t = i) \ge \frac{pk(k+1)}{36n^2} + \frac{n-k}{6n}.$$ (24)
For $|X_{t+1} - X_t| \ge j$, it is necessary that at least $j$ bits of $x$ are flipped and the offspring solution $x'$ is accepted. We consider two cases: (1) at least one of the $k$ leading 1-bits is flipped; (2) the $k$ leading 1-bits are not flipped. For case (1), the mutation probability is at most $\frac{k}{n}\binom{n-1}{j-1}\frac{1}{n^{j-1}}$ and the acceptance probability is at most $p\,\frac{k+1}{n}$ by Eq. (12). For case (2), the mutation probability is at most $(1 - \frac{1}{n})^k \binom{n-k}{j}\frac{1}{n^j}$ and the acceptance probability is at most 1. Thus, we have
$$P(|X_{t+1} - X_t| \ge j \mid X_t = i) \le \frac{k}{n}\binom{n-1}{j-1}\frac{1}{n^{j-1}} \cdot p\,\frac{k+1}{n} + \Big(1 - \frac{1}{n}\Big)^k \binom{n-k}{j}\frac{1}{n^j}$$ (25)
$$\le \frac{pk(k+1)}{n^2} \cdot \frac{4}{2^j} + \frac{n-k}{n} \cdot \frac{2}{2^j} \le \Big(\frac{pk(k+1)}{36n^2} + \frac{n-k}{6n}\Big) \cdot \frac{144}{2^j}.$$
By combining Eq. (24) with Eq. (25), we get that condition (2) of Lemma 3 holds with $\delta = 1$ and $r(l) = 144$.

Note that $l = b - a = c \log n$. By Lemma 3, the expected running time is $2^{\Omega(c \log n)}$, where $c$ is any positive constant. Thus, the expected running time is super-polynomial.

Theorem 6 For the (1+1)-EA on LeadingOnes under bit-wise noise $(p, 1/n)$, the expected running time is exponential if $p = \Omega(1)$.

Proof We use Lemma 2 to prove it. Let $X_t = i$ be the number of 0-bits of the solution $x$ after $t$ iterations of the (1+1)-EA. We consider the interval $i \in [0, n^{1/2}]$. To analyze the drift $E(X_t - X_{t+1} \mid X_t = i) = E^+ - E^-$, we use the same analysis procedure as Theorem 5. For the positive drift, we have $E^+ \le \frac{i}{n} = o(1)$. For the negative drift, we re-analyze Eqs. (18) and (19). From Eqs. (16) and (17), we get that $P(f^n(x') \ge j - 1) \ge p(1 - \frac{1}{n})^{j-1}$ and $P(f^n(x) \le j - 1) \ge \frac{pj}{3n}$. Thus, Eq. (18) becomes
$$P(f^n(x') \ge f^n(x)) \ge p^2\,\frac{j}{3n}\Big(1 - \frac{1}{n}\Big)^{j-1}.$$ (26)
For Eq. (19), we need to analyze the acceptance probability for $LO(x') = LO(x) = k$. Since it is sufficient to keep the first $(k+1)$ bits of $x$ and $x'$ unchanged in noise, Eq. (19) becomes
$$P(f^n(x') \ge f^n(x)) \ge p^2\Big(1 - \frac{1}{n}\Big)^{2(k+1)} \ge p^2\Big(1 - \frac{k+1}{n}\Big)^2.$$ (27)

By applying the above two inequalities to Eq. (20), we have
$$E^- \ge \frac{1}{en}\Big(\sum_{j=1}^{k} p^2\,\frac{j}{3n}\Big(1 - \frac{1}{n}\Big)^{j-1} + (n - i - k)\,p^2\Big(1 - \frac{k+1}{n}\Big)^2\Big) = \Omega(1),$$
where the equality is by $p = \Omega(1)$. Thus, $E^+ - E^- = -\Omega(1)$. That is, condition (1) of Lemma 2 holds. Since it is necessary to flip at least $j$ bits of $x$, we have
$$P(|X_{t+1} - X_t| \ge j \mid X_t \ge 1) \le \binom{n}{j}\frac{1}{n^j} \le \frac{1}{j!} \le 2 \cdot \frac{1}{2^j},$$
which implies that condition (2) of Lemma 2 holds with $\delta = 1$ and $r(l) = 2$. Note that $l = n^{1/2}$. Thus, by Lemma 2, the expected running time is exponential.

4.2 Bit-wise Noise $(1, q)$

For bit-wise noise $(1, q)$, we prove in Theorems 7-9 that the expected running time is polynomial if $q = O(\log n/n^3)$ and super-polynomial if $q = \omega(\log n/n^2)$. The proof idea is similar to that for bit-wise noise $(p, 1/n)$. The main difference caused by the change of noise is the probability of accepting the offspring solution, i.e., $P(f^n(x') \ge f^n(x))$.

Theorem 7 For the (1+1)-EA on LeadingOnes under bit-wise noise $(1, q)$, the expected running time is polynomial if $q = O(\log n/n^3)$.

Proof The proof is very similar to that of Theorem 4. The change of noise only affects the probability of accepting the offspring solution in the analysis. For any positive constant $b$, suppose that $q \le b \log n/n^3$.

For the positive drift $E^+$, we need to re-analyze $P(f^n(x') \ge f^n(x))$ (i.e., Eq. (7) in the proof of Theorem 4) for the parent $x$ with $LO(x) = i$ and the offspring $x'$ with $LO(x') \ge i + 1$. By bit-wise noise $(1, q)$, Eqs. (5) and (6) change to
$$P(f^n(x') \le i - 1) = 1 - (1 - q)^i; \quad P(f^n(x) \ge i + 1) = (1 - q)^i q.$$
Thus, by the union bound, Eq. (7) becomes
$$P(f^n(x') \ge f^n(x)) \ge 1 - \big(1 - (1 - q)^i + (1 - q)^i q\big)$$ (28)
$$= (1 - q)^{i+1} \ge 1 - q(i + 1) \ge 1/2,$$
where the last inequality is by $q = O(\log n/n^3)$.

For the negative drift $E^-$, we need to re-analyze $P(f^n(x') \ge f^n(x))$ (i.e., Eq. (12) in the proof of Theorem 4) for the parent $x$ with $LO(x) = i$ (where

$i \ge 1$) and the offspring $x'$ with $LO(x') \le i-1$. By bit-wise noise $(1, q)$, Eqs. (4.1) and (4.1) change to

$$P(f^n(x') \ge i) \le q(1-q)^{i-1}, \qquad P(f^n(x) \le i-1) = 1 - (1-q)^i.$$

Thus, by the union bound, Eq. (4.1) becomes

$$P(f^n(x') \ge f^n(x)) \le q(1-q)^{i-1} + 1 - (1-q)^i = 1 - (1-q)^{i-1}(1-2q) \le 1 - (1-(i-1)q)(1-2q) \le (i+1)q, \tag{29}$$

where the second inequality is by $(1-q)^{i-1} \ge 1-(i-1)q$ and $1-2q > 0$ for $q = O(\log n/n^3)$. By applying Eq. (28) and Eq. (29) to $E^+$ and $E^-$, respectively, Eq. (4.1) changes to $E(V(\xi_t) - V(\xi_{t+1}) \mid \xi_t = x) \ge$ + c ) i 4b log c ) i c 2qi + ) 62 3 ) 2b log

That is, the condition of Lemma 1 still holds with $\frac{1}{6n^2}$. Thus, the expected running time is polynomial.

Theorem 8 For the (1+1)-EA on LeadingOnes under bit-wise noise $(1, q)$, if $q = \omega(\log n/n^2) \cap o(1/n)$, the expected running time is super-polynomial.

Proof We use the same analysis procedure as Theorem 5. The only difference is the probability of accepting the offspring solution due to the change of noise. For the positive drift, we still have $E^+ \le \frac{i}{n}$, since we optimistically assume that $x'$ is always accepted in the proof of Theorem 5. For the negative drift, we need to re-analyze $P(f^n(x') \ge f^n(x))$ for the parent solution $x$ with $LO(x) = k$ and the offspring solution $x'$ with $LO(x') = j-1$ (where $1 \le j \le k+1$). For $j \le k$, to derive a lower bound on $P(f^n(x') \ge f^n(x))$, we consider the $j$ cases where $f^n(x) = l$ and $f^n(x') \ge l$ for $0 \le l \le j-1$. Since $P(f^n(x) = l) = (1-q)^l q$ and $P(f^n(x') \ge l) = (1-q)^l$, Eq. (4.1) changes to

$$P(f^n(x') \ge f^n(x)) \ge \sum_{l=0}^{j-1} (1-q)^l q \cdot (1-q)^l = \frac{1-(1-q)^{2j}}{1+(1-q)} \ge \frac{1-(1-q)^{2j}}{2} \ge (1-q)^{2j}\frac{qj}{1-q} \ge \frac{qj}{2}, \tag{30}$$

where the last inequality is by $(1-q)^{2j} \ge 1 - 2qj \ge 1/2$ since $q = o(1/n)$. For $j = k+1$ (i.e., $LO(x') = LO(x) = k$), we can use the same analysis as Eq. (28) to derive a lower bound $1/2$, since the last inequality of Eq. (28) still holds with $q = o(1/n)$. Thus, Eq. (4.1) also holds here, i.e.,

$$P(f^n(x') \ge f^n(x)) \ge \frac{1}{2}. \tag{31}$$
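The chain of inequalities in Eq. (30) can be checked numerically. The following is a minimal sketch, not part of the original analysis: it evaluates the geometric sum over the $j$ cases, compares it with the closed form $\frac{1-(1-q)^{2j}}{2-q}$, and confirms the final bound $qj/2$. The function names and the values $n = 100$, $q = 0.001$ are illustrative assumptions, with $q$ chosen small enough that $(1-q)^{2j} \ge 1/2$ for all $j \le n$.

```python
def acceptance_sum(q: float, j: int) -> float:
    """Sum over l = 0..j-1 of P(f^n(x) = l) * P(f^n(x') >= l) = (1-q)^l * q * (1-q)^l."""
    return sum((1 - q) ** l * q * (1 - q) ** l for l in range(j))


def acceptance_closed_form(q: float, j: int) -> float:
    """Closed form of the geometric sum: (1 - (1-q)^(2j)) / (2 - q)."""
    return (1 - (1 - q) ** (2 * j)) / (2 - q)


def bound_holds(q: float, j: int) -> bool:
    """Check the last step of the chain: the sum is at least q*j/2."""
    return acceptance_sum(q, j) >= q * j / 2


if __name__ == "__main__":
    n, q = 100, 0.001  # hypothetical values chosen so that q is well below 1/n
    for j in (1, 10, 50, 100):
        print(j, acceptance_sum(q, j), acceptance_closed_form(q, j), q * j / 2)
```

The check only illustrates the inequality for one parameter setting; the proof itself covers every $q = o(1/n)$.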

By applying Eqs. (30) and (31) to $E^-$, Eq. (4.1) changes to

$$E^- \ge \frac{qk^2}{12n} + \frac{n-i-k}{6n}.$$

Thus, we have

$$E(X_t - X_{t+1} \mid X_t = i) = E^+ - E^- \le \frac{i}{n} - \frac{qk^2}{12n} - \frac{n-i-k}{6n}.$$

For the upper bound analysis of $P(X_{t+1} \neq i \mid X_t = i)$ in the proof of Theorem 5, we only need to replace the acceptance probability $p\frac{k+1}{n}$ in the case of $LO(x') < LO(x)$ with $(k+1)q$ (i.e., Eq. (29)). Thus, Eq. (4.1) changes to

$$P(X_{t+1} \neq i \mid X_t = i) \le \frac{k}{n}(k+1)q + \frac{n-k}{n} \le kq + \frac{n-k}{n}.$$

To compare $E(X_t - X_{t+1} \mid X_t = i)$ with $P(X_{t+1} \neq i \mid X_t = i)$, we consider two cases: k < 2 q and k 2 q. By using $q = \omega(\log n/n^2)$ and applying the same analysis procedure as Eqs. (4.1) and (4.1), we can derive that condition (1) of Lemma 3 holds with $\epsilon = \frac{1}{192}$.

For the lower bound analysis of $P(X_{t+1} \neq i \mid X_t = i)$, by applying Eqs. (30) and (31), Eq. (4.1) changes to

$$P(X_{t+1} \neq i \mid X_t = i) \ge \frac{qk(k+1)}{12n} + \frac{n-k}{6n}.$$

For the analysis of $|X_{t+1} - X_t| \ge j$, by replacing the acceptance probability $p\frac{k+1}{n}$ in the case of $LO(x') < LO(x)$ with $(k+1)q$, Eq. (4.1) changes to

$$P(|X_{t+1} - X_t| \ge j \mid X_t = i) \le \left(\frac{qk(k+1)}{n} + \frac{n-k}{n}\right)\cdot\frac{4}{2^j} \le \left(\frac{qk(k+1)}{12n} + \frac{n-k}{6n}\right)\cdot\frac{48}{2^j}.$$

That is, condition (2) of Lemma 3 holds with $\delta = 1$, $r(l) = 48$. Thus, the expected running time is super-polynomial.

Theorem 9 For the (1+1)-EA on LeadingOnes under bit-wise noise $(1, q)$, the expected running time is exponential if $q = \Omega(1/n)$.

Proof We use Lemma 2 to prove it. Let $X_t = i$ be the number of 0-bits of the solution $x$ after $t$ iterations of the (1+1)-EA. We consider the interval $i \in [0, n^{1/2}]$. To analyze the drift $E(X_t - X_{t+1} \mid X_t = i)$, we use the same analysis procedure as the proof of Theorem 5. We first consider $q = \Omega(1/n) \cap o(1)$. We need to analyze the probability $P(f^n(x') \ge f^n(x))$, where the offspring solution $x'$ is generated by flipping only one 1-bit of $x$. Let $LO(x) = k$. For the case where the $j$-th (where $1 \le j \le k$) leading 1-bit is flipped, as the analysis of Eq. (30), we get

$$P(f^n(x') \ge f^n(x)) \ge \frac{1-(1-q)^{2j}}{2} \ge (1-q)^{2j}\frac{qj}{1-q}.$$

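If $(1-q)^{2j} < \frac{1}{2}$, $\frac{1-(1-q)^{2j}}{2} \ge \frac{1}{4}$; otherwise, $(1-q)^{2j}\frac{qj}{1-q} \ge \frac{qj}{2}$. Thus, we have

$$P(f^n(x') \ge f^n(x)) \ge \min\{1/4, qj/2\}.$$

For the case that flips one non-leading 1-bit (i.e., $LO(x') = LO(x) = k$), to derive a lower bound on $P(f^n(x') \ge f^n(x))$, we consider $f^n(x) = l$ and $f^n(x') \ge l$ for $0 \le l \le k$. Thus,

$$\begin{aligned} P(f^n(x') \ge f^n(x)) &\ge \sum_{l=0}^{k-1} (1-q)^l q \cdot (1-q)^l + (1-q)^{k+1}(1-q)^k \ge \frac{1-(1-q)^{2k}}{2} + (1-q)^{2k+1} \\ &= \frac{1}{2} + (1-q)^{2k}\left(\frac{1}{2} - q\right) \ge \frac{1}{2}, \end{aligned}$$

where the last inequality is by $q = o(1)$. By applying the above two inequalities to Eq. (4.1), we get

$$E^- \ge \frac{1}{en}\left(\sum_{j=1}^{k} \min\left\{\frac{1}{4}, \frac{qj}{2}\right\} + \frac{n-i-k}{2}\right).$$

If $k \ge \frac{n}{2}$, $\sum_{j=1}^{k} \min\{\frac{1}{4}, \frac{qj}{2}\} = \Omega(n)$ since $q = \Omega(1/n)$. If $k < \frac{n}{2}$, $\frac{n-i-k}{2} = \Omega(n)$ since $i \le n^{1/2}$. Thus, $E^- = \Omega(1)$.

For $q = \Omega(1)$, we use the trivial lower bound $q$ for the probability of accepting the offspring solution $x'$, since it is sufficient to flip the first leading 1-bit of $x$ by noise. Then,

$$E^- \ge \frac{1}{en}\left(kq + (n-i-k)q\right) = \frac{(n-i)q}{en} = \Omega(1).$$

Thus, for $q = \Omega(1/n)$, we have

$$E(X_t - X_{t+1} \mid X_t = i) = E^+ - E^- \le \frac{i}{n} - \Omega(1) = -\Omega(1).$$

That is, condition (1) of Lemma 2 holds. Its condition (2) trivially holds with $\delta = 1$ and $r(l) = 2$. Thus, the expected running time is exponential.

4.3 One-bit Noise

For the (1+1)-EA on LeadingOnes under one-bit noise, it has been known that the running time is polynomial if $p \le 1/(6en^2)$ and exponential if $p = 1/2$ [20]. We extend this result by proving in Theorem 10 that the running time is polynomial if $p = O(\log n/n^2)$ and super-polynomial if $p = \omega(\log n/n)$. The proof can be accomplished in the same way as that of Theorems 4, 5 and 6 for bit-wise noise $(p, \frac{1}{n})$. This is because although the probabilities $P(f^n(x') \ge f^n(x))$ of accepting the offspring solution are different, their bounds used in the proofs for bit-wise noise $(p, \frac{1}{n})$ still hold for one-bit noise.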

Theorem 10 For the (1+1)-EA on LeadingOnes under one-bit noise, the expected running time is polynomial if $p = O(\log n/n^2)$, super-polynomial if $p = \omega(\log n/n) \cap o(1)$ and exponential if $p = \Omega(1)$.

Proof We re-analyze $P(f^n(x') \ge f^n(x))$ for one-bit noise, and show that the bounds on $P(f^n(x') \ge f^n(x))$ used in the proofs for bit-wise noise $(p, \frac{1}{n})$ still hold for one-bit noise. For the proof of Theorem 4, Eqs. (4.1) and (4.1) change to

$$P(f^n(x') \le i-1) = p\frac{i}{n}, \qquad P(f^n(x) \ge i+1) = p\frac{1}{n},$$

and thus Eq. (4.1) still holds; Eqs. (4.1) and (4.1) change to

$$P(f^n(x') \ge i) \le p\frac{1}{n}, \qquad P(f^n(x) \le i-1) = p\frac{i}{n},$$

and thus Eq. (4.1) still holds. For the proof of Theorem 5, Eqs. (4.1) and (4.1) change to

$$P(f^n(x') \ge j-1) = 1 - p\frac{j-1}{n}, \qquad P(f^n(x) \le j-1) = p\frac{j}{n},$$

and thus Eq. (4.1) still holds. For the proof of Theorem 6, Eq. (4.1) still holds by the above two equalities; Eq. (4.1) still holds since the probability of keeping the first $(k+1)$ bits of a solution unchanged in one-bit noise is $1 - p\frac{k+1}{n} \ge p(1 - \frac{k+1}{n})$.

4.4 Experiments

In the above three subsections, we have proved that for the (1+1)-EA solving the LeadingOnes problem, if under bit-wise noise $(p, \frac{1}{n})$, the expected running time is polynomial when $p = O(\log n/n^2)$ and super-polynomial when $p = \omega(\log n/n)$; if under bit-wise noise $(1, q)$, the expected running time is polynomial when $q = O(\log n/n^3)$ and super-polynomial when $q = \omega(\log n/n^2)$; if under one-bit noise, the expected running time is polynomial when $p = O(\log n/n^2)$ and super-polynomial when $p = \omega(\log n/n)$. However, the current analysis does not cover all the ranges of $p$ and $q$. We thus have conducted experiments to complement the theoretical results.

For bit-wise noise $(p, \frac{1}{n})$, we do not know whether the running time is polynomial or super-polynomial when $p = \omega(\log n/n^2) \cap O(\log n/n)$. We empirically estimate the expected running time for $p = (\log n/n)^2$, $\log n/n^{3/2}$ and $\log n/n$. On each problem size $n$, we run the (1+1)-EA 1000 times independently. In each run, we record the number of fitness evaluations until an optimal solution w.r.t. the true fitness function is found for the first time. Then the total number of evaluations of the 1000 runs are averaged as the estimation of the expected running time.
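The experimental procedure described above can be sketched as follows. This is a minimal illustration under stated assumptions rather than the authors' code: bit-wise noise $(p, \frac{1}{n})$ is taken to mean that, with probability $p$, each bit of a copy of the solution is flipped independently with probability $1/n$ before evaluation; the parent is assumed to be re-evaluated in every iteration (two noisy evaluations per iteration); and a run stops once the current solution is optimal with respect to the true fitness. All function names and the `max_evals` safety cap are hypothetical.

```python
import math
import random


def leading_ones(x):
    """True LeadingOnes fitness: number of consecutive 1-bits from the left."""
    count = 0
    for bit in x:
        if bit != 1:
            break
        count += 1
    return count


def noisy_leading_ones(x, p, rng):
    """One noisy evaluation under (assumed) bit-wise noise (p, 1/n)."""
    n = len(x)
    if rng.random() < p:
        # Flip each bit of a copy independently with probability 1/n.
        x = [b ^ 1 if rng.random() < 1.0 / n else b for b in x]
    return leading_ones(x)


def run_one_plus_one_ea(n, p, rng, max_evals=10**7):
    """Return the number of noisy evaluations until the true optimum is held
    by the current solution (or until the illustrative cap is reached)."""
    x = [rng.randint(0, 1) for _ in range(n)]
    evals = 0
    while leading_ones(x) < n and evals < max_evals:
        # Bit-wise mutation: flip each bit independently with probability 1/n.
        y = [b ^ 1 if rng.random() < 1.0 / n else b for b in x]
        fx = noisy_leading_ones(x, p, rng)  # parent is re-evaluated (assumption)
        fy = noisy_leading_ones(y, p, rng)
        evals += 2
        if fy >= fx:
            x = y
    return evals


def estimate_running_time(n, p, runs, seed=0):
    """Average the evaluation counts of independent runs."""
    rng = random.Random(seed)
    return sum(run_one_plus_one_ea(n, p, rng) for _ in range(runs)) / runs


if __name__ == "__main__":
    n = 16
    p = (math.log(n) / n) ** 2  # one of the noise levels studied in the text
    print(estimate_running_time(n, p, runs=50))
```

The paper averages 1000 runs per problem size; the small `runs` value here only keeps the sketch fast.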
Figure 1 The expected running time for the (1+1)-EA solving LeadingOnes under bit-wise noise $(p, \frac{1}{n})$, where the y-axis ("Estimated ratio") is the logarithm of the estimated expected running time divided by $\log n$ and the x-axis is the problem size $n$. Panels: (a) $p = (\log n/n)^2$; (b) $p = \log n/n^{3/2}$; (c) $p = \log n/n$. Note that a logarithmic scale is used for the x-axis ($\in [5, 30]$) in subfigure (a).

Figure 2 The expected running time for the (1+1)-EA solving LeadingOnes under bit-wise noise $(1, q)$, where the y-axis ("Estimated ratio") is the logarithm of the estimated expected running time divided by $\log n$ and the x-axis is the problem size $n$. Panels: (a) $q = (\log n)^2/n^3$; (b) $q = \log n/n^{5/2}$; (c) $q = \log n/n^2$. Note that a logarithmic scale is used for the x-axis ($\in [5, 30]$) in subfigure (a).

Figure 3 The expected running time for the (1+1)-EA solving LeadingOnes under one-bit noise, where the y-axis ("Estimated ratio") is the logarithm of the estimated expected running time divided by $\log n$ and the x-axis is the problem size $n$. Panels: (a) $p = (\log n/n)^2$; (b) $p = \log n/n^{3/2}$; (c) $p = \log n/n$. Note that a logarithmic scale is used for the x-axis ($\in [5, 30]$) in subfigure (a).

To show the relationship between the expected running time and the problem size $n$ clearly, we plot the curve of $\log(\text{expected running time})/\log n$, as shown in Figure 1. Note that in subfigure (a), the problem size $n$ is in the range from 5 to 30, and a base-$e$ logarithmic scale is used. We can observe that all the curves grow in a closely linear trend. These empirical results imply that the expected running time for $p = (\log n/n)^2$, $\log n/n^{3/2}$ and $\log n/n$ is approximately in the order of $n^{\Theta(\log n)}$, $n^{\Theta(n)}$ and $n^{\Theta(n)}$, respectively.

For bit-wise noise $(1, q)$, the expected running time is theoretically not known when $q = \omega(\log n/n^3) \cap O(\log n/n^2)$. We thus empirically estimate the expected running time for $q = (\log n)^2/n^3$, $\log n/n^{5/2}$ and $\log n/n^2$. From Figure 2, we can also observe that all the curves grow in a closely linear trend.
Therefore, the observation suggests that the expected running time for $q = (\log n)^2/n^3$, $\log n/n^{5/2}$ and $\log n/n^2$ is approximately in the order of $n^{\Theta(\log n)}$, $n^{\Theta(n)}$ and $n^{\Theta(n)}$, respectively.
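The reading of the plots can be made concrete with a small sketch. The plotted quantity is $\log T/\log n$ for an estimated running time $T$: if this ratio grows linearly in $n$, then $T = n^{\Theta(n)}$, and if it grows linearly in $\log n$, then $T = n^{\Theta(\log n)}$. The running times below are synthetic values of the form $T = n^{cn}$ with an assumed constant $c$, used only to show the diagnostic; they are not measurements from the paper, and the function names are hypothetical.

```python
import math


def growth_ratio(n: int, running_time: float) -> float:
    """The plotted quantity: log(estimated running time) / log(n)."""
    return math.log(running_time) / math.log(n)


def slope(xs, ys):
    """Least-squares slope of ys against xs."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    num = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    den = sum((x - mean_x) ** 2 for x in xs)
    return num / den


if __name__ == "__main__":
    c = 0.25  # assumed constant for the synthetic curve T = n^(c*n)
    sizes = [5, 10, 15, 20, 25, 30]
    ratios = [growth_ratio(n, float(n) ** (c * n)) for n in sizes]
    # For T = n^(c*n) the ratio equals c*n, so the fitted slope recovers c.
    print(slope(sizes, ratios))
```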

Figure 3 shows the empirical results for one-bit noise, which are similar to those observed for bit-wise noise $(p, \frac{1}{n})$. That is, the expected running time for $p = (\log n/n)^2$, $\log n/n^{3/2}$ and $\log n/n$ is approximately in the order of $n^{\Theta(\log n)}$, $n^{\Theta(n)}$ and $n^{\Theta(n)}$, respectively.

Thus, these empirical results disclose that the current range of $p$ or $q$ allowing a polynomial running time may be tight, that is, the expected running time is super-polynomial for the range of $p$ or $q$ uncovered by the theoretical analysis. The rigorous analysis is not easy. We may need to analyze transition probabilities between fitness levels more precisely, and design an ingenious distance function or use more advanced analysis tools. We leave it as future work.

5 The Robustness of Sampling to Noise

From the derived results in the above two sections, we can observe that the (1+1)-EA is efficient for solving OneMax and LeadingOnes only under low noise levels. For example, for the (1+1)-EA solving OneMax under bit-wise noise $(p, \frac{1}{n})$, the optimal solution can be found in polynomial time only when $p = O(\log n/n)$. In this section, we show that using the sampling strategy can significantly increase the largest noise level allowing a polynomial running time. For example, if using sampling, the (1+1)-EA can always solve OneMax under bit-wise noise $(p, \frac{1}{n})$ in polynomial time, regardless of the value of $p$.

5.1 The OneMax Problem

We prove in Theorems 11 and 14 that under bit-wise noise $(p, \frac{1}{n})$ or one-bit noise, the (1+1)-EA can always solve OneMax in polynomial time by using sampling. For bit-wise noise $(1, q)$, the tight range of $q$ allowing a polynomial running time is $1/2 - 1/n^{O(1)}$, as shown in Theorems 12 and 13. Let $x^k$ denote any solution with $k$ number of 1-bits, and $f^n(x^k)$ denote its noisy objective value. For proving polynomial upper bounds, we use Lemma 4, which gives a sufficient condition based on the probability $P(f^n(x^j) < f^n(x^{k+1}))$ for $j \le k$. But for the (1+1)-EA using sampling, the probability changes to be $P(\hat{f}(x^j) < \hat{f}(x^{k+1}))$, where $\hat{f}(x^j) = \frac{1}{m}\sum_{i=1}^{m} f^n_i(x^j)$ as shown in Definition 5.
Lemma 4 requires a lower bound on $P(\hat{f}(x^j) < \hat{f}(x^{k+1}))$. Our proof idea as presented in Lemma 6 is to derive a lower bound on the expectation of $f^n(x^{k+1}) - f^n(x^j)$ and then apply Chebyshev's inequality. We will directly use Lemma 6 in the following proofs. For proving super-polynomial lower bounds, we use Lemma 5 by replacing $P(f^n(x^k) < f^n(x^{k+1}))$ with $P(\hat{f}(x^k) < \hat{f}(x^{k+1}))$. Let $poly(n)$ indicate any polynomial of $n$.

Before giving the proof, we first intuitively explain why sampling is always effective for bit-wise noise $(p, \frac{1}{n})$ and one-bit noise, while it fails for bit-wise noise $(1, q)$ when $q = 1/2 - 1/n^{\omega(1)}$ or $q \ge 1/2$. For two solutions $x$ and $y$

with $f(x) > f(y)$, if under bit-wise noise $(p, \frac{1}{n})$ and one-bit noise, the noisy fitness $f^n(x)$ is larger than $f^n(y)$ in expectation, and using sampling will increase this trend and make the probability of accepting the true worse solution $y$ sufficiently small. If under bit-wise noise $(1, q)$, when $q = 1/2 - 1/n^{\omega(1)}$, although the noisy fitness $f^n(x)$ is still larger in expectation, the gap is very small (in the order of $1/n^{\omega(1)}$) and a polynomial sample size is not sufficient to make the probability of accepting the true worse solution $y$ small enough; when $q \ge 1/2$, the noisy fitness $f^n(x)$ is smaller in expectation, and using sampling will increase this trend and it obviously does not work.

Lemma 6 Suppose there exists a real number $\delta > 0$ such that

$$\forall\, 0 \le j \le k < n: \; E(f^n(x^{k+1}) - f^n(x^j)) \ge \delta,$$

then the (1+1)-EA using sampling with $m = n^3/\delta^2$ needs polynomial number of iterations in expectation for solving noisy OneMax.

Proof We use Lemma 4 to prove it. For any $0 \le j \le k < n$, let $Y_{k,j} = f^n(x^{k+1}) - f^n(x^j)$ and $\hat{Y}_{k,j} = \hat{f}(x^{k+1}) - \hat{f}(x^j)$. We then need to analyze the probability $P(\hat{f}(x^j) < \hat{f}(x^{k+1})) = P(\hat{Y}_{k,j} > 0)$. Denote the expectation $E(Y_{k,j})$ as $\mu_{k,j}$ and the variance $Var(Y_{k,j})$ as $\sigma^2_{k,j}$. It is easy to verify that $E(\hat{Y}_{k,j}) = \mu_{k,j}$ and $Var(\hat{Y}_{k,j}) = \sigma^2_{k,j}/m$. By Chebyshev's inequality, we have

$$P(\hat{Y}_{k,j} \le 0) \le P(|\hat{Y}_{k,j} - \mu_{k,j}| \ge \mu_{k,j}/2) \le 4\sigma^2_{k,j}/(m\mu^2_{k,j}).$$

Since $\mu_{k,j} \ge \delta > 0$, $\sigma^2_{k,j} = E(Y^2_{k,j}) - \mu^2_{k,j} \le n^2$ and $m = n^3/\delta^2$, we have

$$P(\hat{Y}_{k,j} \le 0) \le 4/n \le \log n/(15n),$$

where the last inequality holds with sufficiently large $n$. Let $l = \log n$. Then, $P(\hat{Y}_{k,j} > 0) \ge 1 - \frac{\log n}{15n} > 1 - \frac{l}{n}$. Let $c = \frac{1}{15}$. For $k < n - l$, $P(\hat{Y}_{k,j} > 0) \ge 1 - c\frac{n-k}{n}$. Thus, the condition of Lemma 4 (i.e., Eq. (4)) holds. We then get that the expected number of iterations is $O(n\log n) + n2^{O(\log n)} = n^{O(1)}$, i.e., polynomial.

Theorem 11 For the (1+1)-EA on OneMax under bit-wise noise $(p, \frac{1}{n})$, if using sampling with $m = 4n^3$, the expected running time is polynomial.

Proof We use Lemma 6 to prove it. Since $E(f^n(x^j)) = j(1 - \frac{p}{n}) + (n-j)\frac{p}{n} = (1 - \frac{2p}{n})j + p$, we have, for any $0 \le j \le k < n$,

$$E(f^n(x^{k+1}) - f^n(x^j)) = \left(1 - \frac{2p}{n}\right)(k + 1 - j) \ge 1 - \frac{2p}{n} \ge 1/2,$$

where the last inequality holds with $n \ge 4$.
Thus, by Lemma 6, we get that the expected number of iterations of the (1+1)-EA using sampling with $m = 4n^3$ is polynomial. Since each iteration takes $2m = 8n^3$ number of fitness evaluations, the expected running time is also polynomial.
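The quantities used in this proof can be illustrated with a short sketch. It is not the authors' code: bit-wise noise $(p, \frac{1}{n})$ is assumed to flip, with probability $p$, each bit of a copy of the solution independently with probability $1/n$, and all function names are hypothetical. The sketch computes the closed-form expectation $E(f^n(x^j)) = (1 - \frac{2p}{n})j + p$, the sample size $m = n^3/\delta^2$ of Lemma 6, and a Monte Carlo estimate of the sampled fitness $\hat{f}$ for comparison.

```python
import math
import random


def expected_noisy_onemax(n: int, j: int, p: float) -> float:
    """Closed form E(f^n(x^j)) = (1 - 2p/n) * j + p for a solution with j 1-bits."""
    return (1 - 2 * p / n) * j + p


def sample_size(n: int, delta: float) -> int:
    """Sample size m = n^3 / delta^2 from Lemma 6 (rounded up)."""
    return math.ceil(n ** 3 / delta ** 2)


def noisy_onemax(x, p, rng):
    """One noisy OneMax evaluation under (assumed) bit-wise noise (p, 1/n)."""
    n = len(x)
    if rng.random() < p:
        x = [b ^ 1 if rng.random() < 1.0 / n else b for b in x]
    return sum(x)


def sampled_fitness(x, p, m, rng):
    """Sampling: average of m independent noisy evaluations."""
    return sum(noisy_onemax(x, p, rng) for _ in range(m)) / m


if __name__ == "__main__":
    n, j, p = 10, 4, 0.5           # illustrative values
    m = sample_size(n, 0.5)        # delta = 1/2 gives m = 4 * n^3
    x = [1] * j + [0] * (n - j)
    rng = random.Random(0)
    print(m, expected_noisy_onemax(n, j, p), sampled_fitness(x, p, m, rng))
```

With $\delta = 1/2$ the helper returns $m = 4n^3$, matching the sample size in Theorem 11; each iteration of the sampling-based (1+1)-EA would call `sampled_fitness` twice, giving the $2m$ evaluations per iteration mentioned above.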

Theorem 12 For the (1+1)-EA on OneMax under bit-wise noise $(1, q)$ with $q = 1/2 - 1/n^{O(1)}$, if using sampling, there exists some $m = O(poly(n))$ such that the expected running time is polynomial.

Proof We use Lemma 6 to prove it. Since $q = 1/2 - 1/n^{O(1)}$, there exists a positive constant $c$ such that $q \le 1/2 - 1/n^c$. It is easy to verify that $E(f^n(x^j)) = j(1-q) + (n-j)q = (1-2q)j + nq$. Thus, for any $0 \le j \le k < n$,

$$E(f^n(x^{k+1}) - f^n(x^j)) = (1-2q)(k + 1 - j) \ge 1 - 2q \ge 2/n^c.$$

By Lemma 6, we get that if using sampling with $m = n^{3+2c}/4$, the expected number of iterations is polynomial, and then the expected running time is polynomial. Thus, the theorem holds.

Theorem 13 For the (1+1)-EA on OneMax under bit-wise noise $(1, q)$ with $q = 1/2 - 1/n^{\omega(1)}$ or $q \ge 1/2$, if using sampling with any $m = O(poly(n))$, the expected running time is exponential.

Proof We use Lemma 5 to prove it. Note that for the (1+1)-EA using sampling, we have to analyze $P(\hat{f}(x^k) < \hat{f}(x^{k+1}))$ instead of $P(f^n(x^k) < f^n(x^{k+1}))$. Let $Z$ denote a random variable which satisfies that $P(Z = 0) = q$ and $P(Z = 1) = 1 - q$. In the following proof, each $Z_i$ is an independent random variable, which has the same distribution as $Z$. We have $f^n(x^k) = \sum_{i=1}^{k} Z_i + \sum_{i=k+1}^{n} (1 - Z_i)$, and then,

$$\begin{aligned} f^n(x^{k+1}) - f^n(x^k) &= \sum_{i=1}^{k+1} Z_i + \sum_{i=k+2}^{n} (1 - Z_i) - \sum_{i=n+1}^{n+k} Z_i - \sum_{i=n+k+1}^{2n} (1 - Z_i) \\ &= \sum_{i=1}^{n+1} Z_i - \sum_{i=n+2}^{2n} Z_i - 1. \end{aligned}$$

Since $\hat{f}(x^k) = \frac{1}{m}\sum_{i=1}^{m} f^n_i(x^k)$, which is the average of $m$ independent evaluations, we have

$$\begin{aligned} m\left(\hat{f}(x^{k+1}) - \hat{f}(x^k)\right) &= \sum_{j=0}^{m-1}\sum_{i=2jn+1}^{2jn+n+1} Z_i - \sum_{j=0}^{m-1}\sum_{i=2jn+n+2}^{2(j+1)n} Z_i - m \\ &= \sum_{j=0}^{m-1}\sum_{i=2jn+1}^{2jn+2} Z_i + \sum_{j=0}^{m-1}\sum_{i=2jn+3}^{2jn+n+1} Z_i - \sum_{j=0}^{m-1}\sum_{i=2jn+n+2}^{2(j+1)n} Z_i - m \\ &= \sum_{j=0}^{m-1}\sum_{i=2jn+1}^{2jn+2} Z_i + Z^* - m, \end{aligned}$$

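where $Z^* = \sum_{j=0}^{m-1}\sum_{i=2jn+3}^{2jn+n+1} Z_i - \sum_{j=0}^{m-1}\sum_{i=2jn+n+2}^{2(j+1)n} Z_i$. To make $\hat{f}(x^k) \ge \hat{f}(x^{k+1})$, it is sufficient that $Z^* \le 0$ and $\sum_{j=0}^{m-1}\sum_{i=2jn+1}^{2jn+2} Z_i \le m$. That is,

$$P(\hat{f}(x^k) \ge \hat{f}(x^{k+1})) \ge P(Z^* \le 0) \cdot P\left(\sum_{j=0}^{m-1}\sum_{i=2jn+1}^{2jn+2} Z_i \le m\right). \tag{32}$$

Since $Z^*$ is the difference between the sum of the same number of $Z_i$, $Z^*$ has the same distribution as $-Z^*$. Thus, $P(Z^* \le 0) + P(Z^* \ge 0) = P(Z^* \le 0) + P(-Z^* \le 0) = 2P(Z^* \le 0) \ge 1$, which implies that

$$P(Z^* \le 0) \ge 1/2. \tag{33}$$

We then investigate $P(\sum_{j=0}^{m-1}\sum_{i=2jn+1}^{2jn+2} Z_i \le m)$. Since $\sum_{j=0}^{m-1}\sum_{i=2jn+1}^{2jn+2} Z_i$ is the sum of $2m$ independent random variables which have the same distribution as $Z$, we have

$$P\left(\sum_{j=0}^{m-1}\sum_{i=2jn+1}^{2jn+2} Z_i \le m\right) = \sum_{t=0}^{m} \binom{2m}{t}(1-q)^t q^{2m-t},$$

$$P\left(\sum_{j=0}^{m-1}\sum_{i=2jn+1}^{2jn+2} Z_i > m\right) = \sum_{t=m+1}^{2m} \binom{2m}{t}(1-q)^t q^{2m-t} = \sum_{t=0}^{m-1} \binom{2m}{t}(1-q)^{2m-t} q^t.$$

For any $t < m$, let $r = \frac{(1-q)^t q^{2m-t}}{(1-q)^{2m-t} q^t} = \left(\frac{q}{1-q}\right)^{2m-2t}$. If $q \ge 1/2$, we have $r \ge 1$. If $q = 1/2 - 1/n^{\omega(1)}$, we have

$$r \ge \left(\frac{q}{1-q}\right)^{2m} = \left(1 - \frac{1-2q}{1-q}\right)^{2m} \ge \left(1 - \frac{4}{n^{\omega(1)}}\right)^{2m} \ge \left(\frac{1}{e}\right)^{2m/(n^{\omega(1)}/4 - 1)} \ge \frac{1}{e},$$

where the first inequality is by $q \le 1/2$, the second inequality is by $1 - 2q = 2/n^{\omega(1)}$ and $1 - q \ge 1/2$, and the last is by $m = O(poly(n))$. Thus, $P(\sum_{j=0}^{m-1}\sum_{i=2jn+1}^{2jn+2} Z_i \le m) > \frac{1}{3} \cdot P(\sum_{j=0}^{m-1}\sum_{i=2jn+1}^{2jn+2} Z_i > m)$, which implies that

$$P\left(\sum_{j=0}^{m-1}\sum_{i=2jn+1}^{2jn+2} Z_i \le m\right) > 1/4. \tag{34}$$

By applying Eqs. (33) and (34) to Eq. (32), we get

$$P(\hat{f}(x^k) \ge \hat{f}(x^{k+1})) \ge 1/8.$$

Let $c = 16$ and $l = n/128$. For any $n - l \le k < n$,

$$P(\hat{f}(x^k) < \hat{f}(x^{k+1})) = 1 - P(\hat{f}(x^k) \ge \hat{f}(x^{k+1})) \le 1 - \frac{cl}{n} \le 1 - \frac{c(n-k)}{n},$$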

i.e., the condition of Lemma 5 holds. Thus, the expected number of iterations is $2^{\Omega(n/128)}$, and the expected running time is exponential.

Theorem 14 For the (1+1)-EA on OneMax under one-bit noise, if using sampling with $m = 4n^3$, the expected running time is polynomial.

Proof It is easy to verify that the expectation of $f^n(x^j)$ (i.e., $E(f^n(x^j))$) under one-bit noise is the same as that under bit-wise noise $(p, \frac{1}{n})$. Thus, the proof can be finished in the same way as that of Theorem 11.

5.2 The LeadingOnes Problem

The bit-wise noise $(p, \frac{1}{n})$ model is first considered. We prove in Theorem 15 that the (1+1)-EA using sampling can solve the LeadingOnes problem in polynomial time, regardless of the value of $p$. The proof idea is similar to that of Theorem 4. The main difference is the probability of accepting the offspring solution $x'$, which is changed from $P(f^n(x') \ge f^n(x))$ to $P(\hat{f}(x') \ge \hat{f}(x))$ due to sampling. Lemma 7 gives some bounds on this probability, which will be used in the proof of Theorem 15. Let $LO(x)$ denote the true number of leading 1-bits of a solution $x$.

Lemma 7 For the LeadingOnes problem under bit-wise noise $(p, \frac{1}{n})$, if using sampling with $m = 144n^6$, it holds that
(1) for any $x$ with $LO(x) = i < n$ and $y$ with $LO(y) \le i-2$ or $LO(y) = i-1 \wedge y_{i+1} = 0$, $P(\hat{f}(x) \le \hat{f}(y)) \le 1/n^2$.
(2) for any $y$ with $LO(y) < n$, $P(\hat{f}(1^n) \le \hat{f}(y)) \le 1/(4n^4)$.

Proof The proof is finished by deriving a lower bound on the expectation of $f^n(x) - f^n(y)$ (which is equal to the expectation of $\hat{f}(x) - \hat{f}(y)$) and then applying Chebyshev's inequality. We first consider case (1). For any $x$ with $LO(x) = i < n$,

$$\begin{aligned} E(f^n(x)) &\ge (1-p)i + p\sum_{j=1}^{i}\left(1-\frac{1}{n}\right)^{j-1}\frac{1}{n}(j-1) + p\left(1-\frac{1}{n}\right)^{i}\frac{1}{n}(i+1) + p\left(1-\frac{1}{n}\right)^{i+1} i, \\ E(f^n(x)) &\le (1-p)i + p\sum_{j=1}^{i}\left(1-\frac{1}{n}\right)^{j-1}\frac{1}{n}(j-1) + p\left(1-\frac{1}{n}\right)^{i}\frac{1}{n}\, n + p\left(1-\frac{1}{n}\right)^{i+1} i. \end{aligned} \tag{35}$$

Note that when flipping the first 0-bit of $x$ and keeping the $i$ leading 1-bits unchanged, the fitness is at least $i+1$ and at most $n$. Then for any $i < n$, we have

$$E(f^n(x) - f^n(y) \mid LO(x) = i \wedge LO(y) = i-1) \tag{36}$$

Running Time Analysis of the (1+1)-EA for OneMax and LeadingOnes under Bit-wise Noise

Running Time Analysis of the (1+1)-EA for OneMax and LeadingOnes under Bit-wise Noise Ruig Time Aalysis of the +-EA for OeMax ad LeadigOes uder Bit-wise Noise Chao Qia Uiversity of Sciece ad Techology of Chia Hefei 3007, Chia chaoqia@ustc.edu.c Wu Jiag Uiversity of Sciece ad Techology of

More information

On the Effectiveness of Sampling for Evolutionary Optimization in Noisy Environments

On the Effectiveness of Sampling for Evolutionary Optimization in Noisy Environments O the Effectiveess of Samplig for Evolutioary Optimizatio i Noisy Eviromets Chao Qia,2 chaoqia@ustc.edu.c Yag Yu 2 yuy@ju.edu.c Ke Tag ketag@ustc.edu.c Yaochu Ji 3 yaochu.ji@surrey.ac.uk Xi Yao,4 x.yao@cs.bham.ac.uk

More information

arxiv: v1 [cs.ai] 20 Nov 2013

arxiv: v1 [cs.ai] 20 Nov 2013 Aalyzig Evolutioary Optimizatio i Noisy Eviromets Chao Qia, Yag Yu, Zhi-Hua Zhou Natioal Key Laboratory for Novel Software Techology Najig Uiversity, Najig 20023, Chia arxiv:3.4987v [cs.ai] 20 Nov 203

More information

Analyzing Evolutionary Optimization in Noisy Environments

Analyzing Evolutionary Optimization in Noisy Environments Evolutioary Computatio /EVCO_a_0070-Qia Jauary 5, 206 6:7 Aalyzig Evolutioary Optimizatio i Noisy Eviromets Chao Qia qiac@lamda.ju.edu.c Natioal Key Laboratory for Novel Software Techology, Najig Uiversity,

More information

Machine Learning Brett Bernstein

Machine Learning Brett Bernstein Machie Learig Brett Berstei Week 2 Lecture: Cocept Check Exercises Starred problems are optioal. Excess Risk Decompositio 1. Let X = Y = {1, 2,..., 10}, A = {1,..., 10, 11} ad suppose the data distributio

More information

Problem Set 2 Solutions

Problem Set 2 Solutions CS271 Radomess & Computatio, Sprig 2018 Problem Set 2 Solutios Poit totals are i the margi; the maximum total umber of poits was 52. 1. Probabilistic method for domiatig sets 6pts Pick a radom subset S

More information

General Lower Bounds for the Running Time of Evolutionary Algorithms

General Lower Bounds for the Running Time of Evolutionary Algorithms Geeral Lower Bouds for the Ruig Time of Evolutioary Algorithms Dirk Sudholt Iteratioal Computer Sciece Istitute, Berkeley, CA 94704, USA Abstract. We preset a ew method for provig lower bouds i evolutioary

More information

Lecture 3: August 31

Lecture 3: August 31 36-705: Itermediate Statistics Fall 018 Lecturer: Siva Balakrisha Lecture 3: August 31 This lecture will be mostly a summary of other useful expoetial tail bouds We will ot prove ay of these i lecture,

More information

An Introduction to Randomized Algorithms

An Introduction to Randomized Algorithms A Itroductio to Radomized Algorithms The focus of this lecture is to study a radomized algorithm for quick sort, aalyze it usig probabilistic recurrece relatios, ad also provide more geeral tools for aalysis

More information

Selection Hyper-heuristics Can Provably be Helpful in Evolutionary Multi-objective Optimization

Selection Hyper-heuristics Can Provably be Helpful in Evolutionary Multi-objective Optimization Selectio Hyper-heuristics Ca Provably be Helpful i Evolutioary Multi-objective Optimizatio Chao Qia 1,2, Ke Tag 1, ad Zhi-Hua Zhou 2 1 UBRI, School of Computer Sciece ad Techology, Uiversity of Sciece

More information

Application to Random Graphs

Application to Random Graphs A Applicatio to Radom Graphs Brachig processes have a umber of iterestig ad importat applicatios. We shall cosider oe of the most famous of them, the Erdős-Réyi radom graph theory. 1 Defiitio A.1. Let

More information

Random Variables, Sampling and Estimation

Random Variables, Sampling and Estimation Chapter 1 Radom Variables, Samplig ad Estimatio 1.1 Itroductio This chapter will cover the most importat basic statistical theory you eed i order to uderstad the ecoometric material that will be comig

More information

The Growth of Functions. Theoretical Supplement

The Growth of Functions. Theoretical Supplement The Growth of Fuctios Theoretical Supplemet The Triagle Iequality The triagle iequality is a algebraic tool that is ofte useful i maipulatig absolute values of fuctios. The triagle iequality says that

More information

Basics of Probability Theory (for Theory of Computation courses)

Basics of Probability Theory (for Theory of Computation courses) Basics of Probability Theory (for Theory of Computatio courses) Oded Goldreich Departmet of Computer Sciece Weizma Istitute of Sciece Rehovot, Israel. oded.goldreich@weizma.ac.il November 24, 2008 Preface.

More information

1 Inferential Methods for Correlation and Regression Analysis

1 Inferential Methods for Correlation and Regression Analysis 1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet

More information

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING Lectures MODULE 5 STATISTICS II. Mea ad stadard error of sample data. Biomial distributio. Normal distributio 4. Samplig 5. Cofidece itervals

More information

Estimation for Complete Data

Estimation for Complete Data Estimatio for Complete Data complete data: there is o loss of iformatio durig study. complete idividual complete data= grouped data A complete idividual data is the oe i which the complete iformatio of

More information

Problem Set 4 Due Oct, 12

Problem Set 4 Due Oct, 12 EE226: Radom Processes i Systems Lecturer: Jea C. Walrad Problem Set 4 Due Oct, 12 Fall 06 GSI: Assae Gueye This problem set essetially reviews detectio theory ad hypothesis testig ad some basic otios

More information

Element sampling: Part 2

Element sampling: Part 2 Chapter 4 Elemet samplig: Part 2 4.1 Itroductio We ow cosider uequal probability samplig desigs which is very popular i practice. I the uequal probability samplig, we ca improve the efficiecy of the resultig

More information

Mixtures of Gaussians and the EM Algorithm

Mixtures of Gaussians and the EM Algorithm Mixtures of Gaussias ad the EM Algorithm CSE 6363 Machie Learig Vassilis Athitsos Computer Sciece ad Egieerig Departmet Uiversity of Texas at Arligto 1 Gaussias A popular way to estimate probability desity

More information

Reinforcement Learning Based Dynamic Selection of Auxiliary Objectives with Preserving of the Best Found Solution

Reinforcement Learning Based Dynamic Selection of Auxiliary Objectives with Preserving of the Best Found Solution Reiforcemet Learig Based Dyamic Selectio of Auxiliary Objectives with Preservig of the Best Foud Solutio arxiv:1704.07187v1 [cs.ne] 24 Apr 2017 Abstract Efficiecy of sigle-objective optimizatio ca be improved

More information

ECE 901 Lecture 14: Maximum Likelihood Estimation and Complexity Regularization

ECE 901 Lecture 14: Maximum Likelihood Estimation and Complexity Regularization ECE 90 Lecture 4: Maximum Likelihood Estimatio ad Complexity Regularizatio R Nowak 5/7/009 Review : Maximum Likelihood Estimatio We have iid observatios draw from a ukow distributio Y i iid p θ, i,, where

More information

arxiv: v1 [cs.ne] 4 Sep 2017

arxiv: v1 [cs.ne] 4 Sep 2017 Theoretical Aalysis of Stochastic Search Algorithms Per Kristia Lehre School of Computer Sciece, Uiversity of Birmigham, Birmigham, UK Pietro S. Oliveto Departmet of Computer Sciece, Uiversity of Sheffield,

More information

Lecture 2 February 8, 2016

Lecture 2 February 8, 2016 MIT 6.854/8.45: Advaced Algorithms Sprig 206 Prof. Akur Moitra Lecture 2 February 8, 206 Scribe: Calvi Huag, Lih V. Nguye I this lecture, we aalyze the problem of schedulig equal size tasks arrivig olie

More information

Lecture 2: Monte Carlo Simulation

Lecture 2: Monte Carlo Simulation STAT/Q SCI 43: Itroductio to Resamplig ethods Sprig 27 Istructor: Ye-Chi Che Lecture 2: ote Carlo Simulatio 2 ote Carlo Itegratio Assume we wat to evaluate the followig itegratio: e x3 dx What ca we do?

More information

Expectation and Variance of a random variable

Expectation and Variance of a random variable Chapter 11 Expectatio ad Variace of a radom variable The aim of this lecture is to defie ad itroduce mathematical Expectatio ad variace of a fuctio of discrete & cotiuous radom variables ad the distributio

More information

w (1) ˆx w (1) x (1) /ρ and w (2) ˆx w (2) x (2) /ρ.

w (1) ˆx w (1) x (1) /ρ and w (2) ˆx w (2) x (2) /ρ. 2 5. Weighted umber of late jobs 5.1. Release dates ad due dates: maximimizig the weight of o-time jobs Oce we add release dates, miimizig the umber of late jobs becomes a sigificatly harder problem. For

More information

Optimally Sparse SVMs

Optimally Sparse SVMs A. Proof of Lemma 3. We here prove a lower boud o the umber of support vectors to achieve geeralizatio bouds of the form which we cosider. Importatly, this result holds ot oly for liear classifiers, but

More information

Advanced Stochastic Processes.

Advanced Stochastic Processes. Advaced Stochastic Processes. David Gamarik LECTURE 2 Radom variables ad measurable fuctios. Strog Law of Large Numbers (SLLN). Scary stuff cotiued... Outlie of Lecture Radom variables ad measurable fuctios.

More information

Hashing and Amortization

Hashing and Amortization Lecture Hashig ad Amortizatio Supplemetal readig i CLRS: Chapter ; Chapter 7 itro; Sectio 7.. Arrays ad Hashig Arrays are very useful. The items i a array are statically addressed, so that isertig, deletig,

More information

Stochastic Simulation

Stochastic Simulation Stochastic Simulatio 1 Itroductio Readig Assigmet: Read Chapter 1 of text. We shall itroduce may of the key issues to be discussed i this course via a couple of model problems. Model Problem 1 (Jackso

More information

Monte Carlo Integration

Monte Carlo Integration Mote Carlo Itegratio I these otes we first review basic umerical itegratio methods (usig Riema approximatio ad the trapezoidal rule) ad their limitatios for evaluatig multidimesioal itegrals. Next we itroduce

More information

ECE 901 Lecture 12: Complexity Regularization and the Squared Loss

ECE 901 Lecture 12: Complexity Regularization and the Squared Loss ECE 90 Lecture : Complexity Regularizatio ad the Squared Loss R. Nowak 5/7/009 I the previous lectures we made use of the Cheroff/Hoeffdig bouds for our aalysis of classifier errors. Hoeffdig s iequality

More information

ON POINTWISE BINOMIAL APPROXIMATION

ON POINTWISE BINOMIAL APPROXIMATION Iteratioal Joural of Pure ad Applied Mathematics Volume 71 No. 1 2011, 57-66 ON POINTWISE BINOMIAL APPROXIMATION BY w-functions K. Teerapabolar 1, P. Wogkasem 2 Departmet of Mathematics Faculty of Sciece

More information

Analysis of the Chow-Robbins Game with Biased Coins

Analysis of the Chow-Robbins Game with Biased Coins Aalysis of the Chow-Robbis Game with Biased Cois Arju Mithal May 7, 208 Cotets Itroductio to Chow-Robbis 2 2 Recursive Framework for Chow-Robbis 2 3 Geeralizig the Lower Boud 3 4 Geeralizig the Upper Boud

More information

Vector Quantization: a Limiting Case of EM

Vector Quantization: a Limiting Case of EM . Itroductio & defiitios Assume that you are give a data set X = { x j }, j { 2,,, }, of d -dimesioal vectors. The vector quatizatio (VQ) problem requires that we fid a set of prototype vectors Z = { z

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Posted-Price, Sealed-Bid Auctions

Posted-Price, Sealed-Bid Auctions Posted-Price, Sealed-Bid Auctios Professors Greewald ad Oyakawa 207-02-08 We itroduce the posted-price, sealed-bid auctio. This auctio format itroduces the idea of approximatios. We describe how well this

More information

CS322: Network Analysis. Problem Set 2 - Fall 2009

CS322: Network Analysis. Problem Set 2 - Fall 2009 Due October 9 009 i class CS3: Network Aalysis Problem Set - Fall 009 If you have ay questios regardig the problems set, sed a email to the course assistats: simlac@staford.edu ad peleato@staford.edu.

More information
