arxiv: v1 [cs.ne] 2 Nov 2017

Size: px

Start display at page:

Download "arxiv: v1 [cs.ne] 2 Nov 2017"

Wesley Nelson
6 years ago
Views:

1 Noame mauscript No. will be iserted by the editor) Ruig Time Aalysis of the +)-EA for OeMax ad LeadigOes uder Bit-wise Noise Chao Qia Chao Bia Wu Jiag Ke Tag Received: date / Accepted: date arxiv: v [cs.ne] 2 Nov 207 Abstract I may real-world optimizatio problems, the objective fuctio evaluatio is subject to oise, ad we caot obtai the exact objective value. Evolutioary algorithms EAs), a type of geeral-purpose radomized optimizatio algorithm, have show able to solve oisy optimizatio problems well. However, previous theoretical aalyses of EAs maily focused o oise-free optimizatio, which makes the theoretical uderstadig largely isufficiet. Meawhile, the few existig theoretical studies uder oise ofte cosidered the oe-bit oise model, which flips a radomly chose bit of a solutio before evaluatio; while i may realistic applicatios, several bits of a solutio ca be chaged simultaeously. I this paper, we study a atural extesio of oe-bit oise, the bit-wise oise model, which idepedetly flips each bit of a solutio with some probability. We aalyze the ruig time of the +)-EA solvig OeMax ad LeadigOes uder bit-wise oise for the first time, ad derive the rages of the oise level A prelimiary versio of this paper has appeared at GECCO 7. C. Qia School of Computer Sciece ad Techology, Uiversity of Sciece ad Techology of Chia, Hefei , Chia chaoqia@ustc.edu.c C. Bia School of Computer Sciece ad Techology, Uiversity of Sciece ad Techology of Chia, Hefei , Chia biacht@mail.ustc.edu.c W. Jiag School of Computer Sciece ad Techology, Uiversity of Sciece ad Techology of Chia, Hefei , Chia jw992@mail.ustc.edu.c K. Tag Departmet of Computer Sciece ad Egieerig, Souther Uiversity of Sciece ad Techology, Shezhe 58055, Chia tagk3@sustc.edu.c

2 2 Chao Qia et al. for polyomial ad super-polyomial ruig time bouds. The aalysis o LeadigOes uder bit-wise oise ca be easily trasferred to oe-bit oise, ad improves the previously kow results. Sice our aalysis discloses that the +)-EA ca be efficiet oly uder low oise levels, we also study whether the samplig strategy ca brig robustess to oise. We prove that usig samplig ca sigificatly icrease the largest oise level allowig a polyomial ruig time, that is, samplig is robust to oise. Keywords Noisy optimizatio evolutioary algorithms samplig ruig time aalysis computatioal complexity Itroductio I real-world optimizatio tasks, the exact objective i.e., fitess) fuctio evaluatio of cadidate solutios is ofte impossible, istead we ca obtai oly a oisy oe due to a wide rage of ucertaities [22]. For example, i machie learig, a predictio model is evaluated oly o a limited amout of data, which makes the estimated performace deviated from the true performace; i social etwork aalysis, computig the ifluece spread objective of a set of users is #P-hard [8], ad thus is ofte estimated by simulatig the radom diffusio process [23], which brigs oise. I the presece of oise, the difficulty of solvig a optimizatio problem may icrease. Evolutioary algorithms EAs) [5], ispired by atural pheomea, are a type of radomized metaheuristic optimizatio algorithm. They are likely to be able to hadle oise, sice the correspodig atural pheomea have bee well processed i oisy atural eviromets. I fact, EAs have bee successfully applied to solve may oisy optimizatio problems [7,24]. Compared with the applicatio, the theoretical aalysis of EAs is far behid. But i the last two decades, much effort has bee devoted to the ruig time aalysis a essetial theoretical aspect) of EAs. Numerous aalytical results for EAs solvig sythetic problems as well as combiatorial problems have bee derived, e.g., [4, 25]. Meawhile, a few geeral approaches for ruig time aalysis have bee proposed, e.g., drift aalysis [,3,2], fitess-level methods [9,33], ad switch aalysis [35]. However, previous ruig time aalyses of EAs maily focused o oisefree optimizatio, where the fitess evaluatio is exact. Oly a few pieces of work o oisy evolutioary optimizatio have bee reported. Droste [4] first aalyzed the +)-EA o the OeMax problem i the presece of oebit oise ad showed that the tight rage of the oise probability p allowig a polyomial ruig time is Olog /), where is the problem size. Gieße ad Kötzig [20] recetly studied the LeadigOes problem, ad proved that the expected ruig time is polyomial if p /6e 2 ) ad expoetial if p = /2. For iefficiet optimizatio of the +)-EA uder high oise levels, some implicit mechaisms of EAs were proved to be robust to oise. I [20], it was

3 Ruig Time Aalysis of the +)-EA uder Bit-wise Noise 3 show that the µ+)-ea with a small populatio of size Θlog ) ca solve OeMax i polyomial time eve if the probability of oe-bit oise reaches. The robustess of populatios to oise was also proved i the settig of o-elitist EAs [0, 27]. However, Friedrich et al. [8] showed the limitatio of populatios by provig that the µ+)-ea eeds super-polyomial time for solvig OeMax uder additive Gaussia oise N 0, σ 2 ) with σ 2 3. This difficulty ca be overcome by the compact geetic algorithm cga) [8] ad a simple At Coloy Optimizatio ACO) algorithm [7], both of which fid the optimal solutio i polyomial time with a high probability. ACO was also show able to efficietly fid solutios with reasoable approximatios o some istaces of sigle destiatio shortest paths problems with edge weights disturbed by oise [2,6,34]. The ability of explicit oise hadlig strategies was also theoretically studied. Qia et al. [30] proved that the threshold selectio strategy is robust to oise: the expected ruig time of the +)-EA usig threshold selectio o OeMax uder oe-bit oise is always polyomial regardless of the oise level. For the +)-EA solvig OeMax ad LeadigOes uder oe-bit or additive Gaussia oise, the samplig strategy was show able to reduce the ruig time from expoetial to polyomial i high oise levels [29]. Akimoto et al. [2] also proved that samplig with a large sample size ca make optimizatio uder additive ubiased oise behave as optimizatio i a oise-free eviromet. The iterplay betwee samplig ad implicit oise-hadlig mechaisms e.g., crossover) has bee statistically studied i [9]. The studies metioed above maily cosidered the oe-bit oise model, which flips a radom bit of a solutio before evaluatio with probability p. However, the oise model, which ca chage several bits of a solutio simultaeously, may be more realistic ad eeds to be studied, as metioed i the first oisy theoretical work [4]. I this paper, we study the bit-wise oise model, which is characterized by a pair p, q) of parameters. It happes with probability p, ad idepedetly flips each bit of a solutio with probability q before evaluatio. We aalyze the ruig time of the +)-EA solvig OeMax ad LeadigOes uder bit-wise oise with two specific parameter settigs p, ) ad, q). The rages of p ad q for a polyomial upper boud ad a super-polyomial lower boud are derived, as show i the middle row of Table. For the +)-EA o LeadigOes, we also trasfer the ruig time bouds from bit-wise oise p, ) to oe-bit oise by usig the same proof procedure. As show i the bottom right of Table, our results improve the previously kow oes [20]. Note that for the +)-EA solvig the LeadigOes problem, the curret aalysis as show i the last colum of Table ) does ot cover all the rages of p ad q. We thus coduct experimets to estimate the expected ruig time for the ucovered values of p ad q. The empirical results show that the curretly derived rages of p ad q allowig a polyomial ruig time are possibly tight.

4 4 Chao Qia et al. Table For the ruig time of the +)-EA o OeMax ad LeadigOes uder prior oise models, the rages of oise parameters for a polyomial upper boud ad a super-polyomial lower boud are show below. +)-EA OeMax LeadigOes bit-wise oise p, ) Olog /), ωlog /) Olog /2 ), ωlog /) bit-wise oise, q) Olog / 2 ), ωlog / 2 ) [20] Olog / 3 ), ωlog / 2 ) oe-bit oise Olog /), ωlog /) [4] [0, /6e 2 )], /2 [20]; Olog / 2 ), ωlog /) Table 2 For the ruig time of the +)-EA usig samplig o OeMax ad LeadigOes uder prior oise models, the rages of oise parameters for a polyomial upper boud ad a super-polyomial lower boud are show below. +)-EA usig samplig OeMax LeadigOes bit-wise oise p, ) [0, ], [0, ], bit-wise oise, q) /2 / O), /2 / ω) [/2, ] Olog /), ωlog /) oe-bit oise [0, ], [0, ], From the results i Table, we fid that the +)-EA is efficiet oly uder low oise levels. For example, for the +)-EA solvig OeMax uder bit-wise oise p, ), the expected ruig time is polyomial oly whe p = Olog /). We the study whether the samplig strategy ca brig robustess to oise. Samplig is a popular way to cope with oise i fitess evaluatio [3], which, istead of evaluatig the fitess of oe solutio oly oce, evaluates the fitess several times ad the uses the average to approximate the true fitess. We aalyze the ruig time of the +)-EA usig samplig uder both bit-wise oise ad oe-bit oise. The rages of p ad q for a polyomial upper boud ad a super-polyomial lower boud are show i Table 2. Note that the aalysis covers all the rages of p ad q. Compared with the results i Table, we fid that usig samplig sigificatly improve the oise-tolerace ability. For example, by usig samplig, the +)-EA ow ca always solve OeMax uder bit-wise oise p, ) i polyomial time. From the aalysis procedure, we also fid the reaso why samplig is effective or ot. Let fx) ad f x) deote the true ad oisy fitess of a solutio, respectively. For two solutios x ad y with fx) > fy), whe the oise level is high i.e., the values of p ad q are large), the probability Pf x) f y)) i.e., the true worse solutio y appears to be better) becomes large, which will mislead the search directio ad the lead to a super-polyomial ruig time. I such a situatio, if the expected gap betwee f x) ad f y) is positive, samplig will icrease this tred ad make Pf x) f y)) sufficietly small; if it is egative e.g., o OeMax uder bit-wise oise, q) with q /2), samplig will cotiue to icrease Pf x) f y)), ad obviously will ot work. We also ote that if the positive gap betwee f x) ad f y) is too small e.g., o OeMax uder bit-

5 Ruig Time Aalysis of the +)-EA uder Bit-wise Noise 5 wise oise, q) with q = /2 / ω) ), a polyomial sample size will be ot sufficiet ad samplig also fails to guaratee a polyomial ruig time. This paper exteds our prelimiary work [28]. Sice the theoretical aalysis o the LeadigOes problem is ot complete, we add experimets to complemet the theoretical results i.e., Sectio 4.4). We also add the robustess aalysis of samplig to oise i.e., Sectio 5). Note that the robustess of samplig to oe-bit oise has bee studied i our previous work [29]. It was show that samplig ca reduce the ruig time of the +)-EA from expoetial to polyomial o OeMax whe the oise probability p = as well as o LeadigOes whe p = /2. Therefore, our results here are more geeral. We prove that samplig is effective for ay value of p, as show i the last row of Table 2. Furthermore, we aalyze the robustess of samplig to bit-wise oise for the first time. The rest of this paper is orgaized as follows. Sectio 2 itroduces some prelimiaries. The ruig time aalysis of the +)-EA o OeMax ad LeadigOes uder oise is preseted i Sectios 3 ad 4, respectively. Sectio 5 gives the aalysis of the +)-EA usig samplig. Sectio 6 cocludes the paper. 2 Prelimiaries I this sectio, we first itroduce the optimizatio problems, evolutioary algorithms ad oise models studied i this paper, respectively, the itroduce the samplig strategy, ad fially preset the aalysis tools that we use throughout this paper. 2. OeMax ad LeadigOes I this paper, we use two well-kow pseudo-boolea fuctios OeMax ad LeadigOes. The OeMax problem as preseted i Defiitio aims to maximize the umber of -bits of a solutio. The LeadigOes problem as preseted i Defiitio 2 aims to maximize the umber of cosecutive -bits coutig from the left of a solutio. Their optimal solutio is... briefly deoted as ). It has bee show that the expected ruig time of the +)-EA o OeMax ad LeadigOes is Θ log ) ad Θ 2 ), respectively [5]. Defiitio OeMax) The OeMax Problem of size is to fid a bits biary strig x such that x = arg max x {0,} fx) = ). i= x i Defiitio 2 LeadigOes) The LeadigOes Problem of size is to fid a bits biary strig x such that x = arg max x {0,} fx) = ) i. i= j= x j

6 6 Chao Qia et al. 2.2 Bit-wise Noise There are maily two kids of oise models: prior ad posterior [20,22]. The prior oise comes from the variatio o a solutio, while the posterior oise comes from the variatio o the fitess of a solutio. Previous theoretical aalyses ofte focused o a specific prior oise model, oe-bit oise. As preseted i Defiitio 3, it flips a radom bit of a solutio before evaluatio with probability p. However, i may realistic applicatios, oise ca chage several bits of a solutio simultaeously rather tha oly oe bit. We thus cosider the bit-wise oise model. As preseted i Defiitio 4, it happes with probability p, ad idepedetly flips each bit of a solutio with probability q before evaluatio. To the best of our kowledge, oly bit-wise oise with p = ad q [0, ] has bee recetly studied. Gieße ad Kötzig [20] proved that for the +)-EA o OeMax, the expected ruig time is polyomial if q = Olog / 2 ) ad super-polyomial if q = ωlog / 2 ). I this paper, we study two specific bit-wise oise models: p [0, ] q = ad p = q [0, ], which are briefly deoted as bit-wise oise p, ) ad bit-wise oise, q), respectively. Defiitio 3 Oe-bit Noise) Give a parameter p [0, ], let f x) ad fx) deote the oisy ad true fitess of a biary solutio x {0, }, respectively, the { f fx) with probability p, x) = fx ) with probability p, where x is geerated by flippig a uiformly radomly chose bit of x. Defiitio 4 Bit-wise Noise) Give parameters p, q [0, ], let f x) ad fx) deote the oisy ad true fitess of a biary solutio x {0, }, respectively, the { f fx) with probability p, x) = fx ) with probability p, where x is geerated by idepedetly flippig each bit of x with probability q )-EA The +)-EA as described i Algorithm is studied i this paper. For oisy optimizatio, oly a oisy fitess value f x) istead of the exact oe fx) ca be accessed, ad thus step 4 of Algorithm chages to be if f x ) f x). Note that the reevaluatio strategy is used as i [2,4,20]. That is, besides evaluatig f x ), f x) will be reevaluated i each iteratio of the

7 Ruig Time Aalysis of the +)-EA uder Bit-wise Noise 7 +)-EA. The ruig time is usually defied as the umber of fitess evaluatios eeded to fid a optimal solutio w.r.t. the true fitess fuctio f for the first time [2,4,20]. Algorithm +)-EA) Give a fuctio f over {0, } to be maximized, it cosists of the followig steps:. x := uiformly radomly selected from {0, }. 2. Repeat util the termiatio coditio is met 3. x := flip each bit of x idepedetly with prob. /. 4. if fx ) fx) 5. x := x. 2.4 Samplig I oisy evolutioary optimizatio, samplig as described i Defiitio 5 has ofte bee used to reduce the egative effect of oise [,6]. It approximates the true fitess fx) usig the average of a umber of radom evaluatios. For the +)-EA usig samplig, step 4 of Algorithm chages to be if ˆfx ) ˆfx). Note that m = is equivalet to that samplig is ot used. The effectiveess of samplig was ot theoretically aalyzed util recetly. Qia et al. [29] proved that samplig is robust to oe-bit oise ad additive Gaussia oise. Particularly, uder oe-bit oise, it was show that samplig ca reduce the ruig time from expoetial to polyomial for the +)-EA solvig OeMax whe the oise probability p = ad LeadigOes whe p = /2. Defiitio 5 Samplig) Samplig first evaluates the fitess of a solutio m times idepedetly ad obtais the oisy fitess values f x),..., f mx), ad the outputs their average, i.e., ˆfx) = m m i= f i x). 2.5 Aalysis Tools The process of the +)-EA solvig OeMax or LeadigOes ca be directly modeled as a Markov chai {ξ t } + t=0. We oly eed to take the solutio space {0, } as the chai s state space i.e., ξ t X = {0, } ), ad take the optimal solutio as the chai s optimal state i.e., X = { }). Give a Markov chai {ξ t } + t=0 ad ξˆt = x, we defie its first hittig time FHT) as τ = mi{t ξˆt+t X, t 0}. The mathematical expectatio of τ, Eτ ξˆt = x) = + i=0 i Pτ = i), is called the expected first hittig time EFHT) startig from ξˆt = x. If ξ 0 is draw from a distributio π 0, Eτ ξ 0 π 0 ) = x X π 0x)Eτ ξ 0 = x) is called the EFHT of the Markov chai over the iitial distributio π 0. Thus, the expected ruig time of the

8 8 Chao Qia et al. +)-EA startig from ξ 0 π 0 is equal to + 2 Eτ ξ 0 π 0 ), where the term correspods to evaluatig the iitial solutio, ad the factor 2 correspods to evaluatig the offsprig solutio x ad reevaluatig the paret solutio x i each iteratio. If usig samplig, the expected ruig time of the +)-EA is m + 2m Eτ ξ 0 π 0 ), sice estimatig the fitess of a solutio eeds m umber of idepedet fitess evaluatios. Note that we cosider the expected ruig time of the +)-EA startig from a uiform iitial distributio i this paper. I the followig, we give three drift theorems that will be used to derive the EFHT of Markov chais i the paper. Lemma Additive Drift [2]) Give a Markov chai {ξ t } + t=0 ad a distace fuctio V x), if for ay t 0 ad ay ξ t with V ξ t ) > 0, there exists a real umber c > 0 such that EV ξ t ) V ξ t+ ) ξ t ) c, the the EFHT satisfies that Eτ ξ 0 ) V ξ 0 )/c. Lemma 2 Simplified Drift [26]) Let X t, t 0, be real-valued radom variables describig a stochastic process. Suppose there exists a iterval [a, b] R, two costats δ, ɛ > 0 ad, possibly depedig o l := b a, a fuctio rl) satisfyig rl) = ol/ logl)) such that for all t 0 the followig two coditios hold:. EX t X t+ a < X t < b) ɛ, 2. P X t+ X t j X t > a) rl) + δ) j for j N 0. The there is a costat c > 0 such that for T := mi{t 0 : X t a X 0 b} it holds PT 2 cl/rl) ) = 2 Ωl/rl)). Lemma 3 Simplified Drift with Self-loops [3]) Let X t, t 0, be real-valued radom variables describig a stochastic process. Suppose there exists a iterval [a, b] R, two costats δ, ɛ > 0 ad, possibly depedig o l := b a, a fuctio rl) satisfyig rl) = ol/ logl)) such that for all t 0 the followig two coditios hold:. a < i < b : EX t X t+ X t = i) ɛ PX t+ i X t = i), 2. i > a, j N 0 : P X t+ X t j X t = i) rl) + δ) j PX t+ i X t = i). The there is a costat c > 0 such that for T := mi{t 0 : X t a X 0 b} it holds PT 2 cl/rl) ) = 2 Ωl/rl)).

9 Ruig Time Aalysis of the +)-EA uder Bit-wise Noise 9 3 The OeMax problem I this sectio, we aalyze the ruig time of the +)-EA o OeMax uder bit-wise oise. Note that for bit-wise oise, q), it has bee proved that the expected ruig time is polyomial if ad oly if q = Olog / 2 ), as show i Theorem. Theorem [20] For the +)-EA o OeMax uder bit-wise oise, q), the expected ruig time is polyomial if q = Olog / 2 ) ad super-polyomial if q = ωlog / 2 ). For bit-wise oise p, ), we prove i Theorems 2 ad 3 that the tight rage of p allowig a polyomial ruig time is Olog /). Istead of usig the origial drift theorems, we apply the upper ad lower bouds of the +)-EA o oisy OeMax i [20]. Let x k deote ay solutio with k umber of -bits, ad f x k ) deote its oisy objective value, which is a radom variable. Lemma 4 ituitively meas that if the probability of recogizig the true better solutio by oisy evaluatio is large, the ruig time ca be polyomially upper bouded. O the cotrary, Lemma 5 shows that if the probability of makig a right compariso is small, the ruig time ca be expoetially lower bouded. Both of them are proved by applyig stadard drift theorems, ad ca be used to simplify our aalysis. Note that i the origial upper boud of the +)-EA o oisy OeMax i.e., Theorem 5 i [20]), it requires that Eq. 4) holds with oly j = k, but the proof actually also requires that oisy OeMax satisfies the mootoicity property, i.e., for all j < k <, Pf x k ) < f x k+ )) Pf x j ) < f x k+ )). We have combied these two coditios i Lemma 4 by requirig Eq. 4) to hold with ay j k istead of oly j = k. Lemma 4 [20] Suppose there is a positive costat c /5 ad some 2 < l /2 such that j k < : Pf x j ) < f x k+ )) l ; j k < l : Pf x j ) < f x k+ )) c k, ) the the +)-EA optimizes f i expectatio i O log ) + 2 Ol) iteratios. Lemma 5 [20] Suppose there is some l /4 ad a costat c 6 such that l k < : Pf x k ) < f x k+ )) c k, the the +)-EA optimizes f i 2 Ωl) iteratios with a high probability. Theorem 2 For the +)-EA o OeMax uder bit-wise oise p, ), the expected ruig time is polyomial if p = Olog /).

10 0 Chao Qia et al. Proof We prove it by usig Lemma 4. For ay positive costat b, suppose that p b log /. We set the two parameters i Lemma 4 as c = mi{ 5, b} 2b log ad l = c 2, 2 ]. For ay j k <, f x j ) f x k+ ) implies that f x j ) k + or f x k+ ) k, either of which happes with probability at most p. By the uio boud, we get j k <, Pf x j ) f x k+ )) 2p For ay j k < l, we easily get 2b log = lc l. Pf x j ) f x k+ )) lc < c k. By Lemma 4, we kow that the expected ruig time is O log ) + 2 O2b log /c), i.e., polyomial. Theorem 3 For the +)-EA o OeMax uder bit-wise oise p, ), the expected ruig time is super-polyomial if p = ωlog /) ωlog /) ad expoetial if p = Olog /). Proof We use Lemma 5 to prove it. Let c = 6. The case p = ωlog /) ωlog /) is first aalyzed. For ay positive costat b, let l = b log. For ay k l, we get Pf x k ) f x k+ )) Pf x k ) = k) Pf x k+ ) k). To make f x k ) = k, it is sufficiet that the oise does ot happe, i.e., Pf x k ) = k) p. To make f x k+ ) k, it is sufficiet to flip oe -bit ad keep other bits uchaged by oise, i.e., Pf x k+ ) k) p k+ ). Thus, Sice c k c l Pf x k ) f x k+ )) p) p k + e = ωlog /). cb log =, the coditio of Lemma 5 holds. Thus, the expected ruig time is 2 Ωb log ) where b is ay costat), i.e., superpolyomial. For the case p = Olog /), let l =. We use aother lower boud p ) for Pf x k ) = k), sice it is sufficiet that o bit flips by oise. Thus, we have Pf x k ) f x k+ )) p, the coditio of Lemma 5 holds. Thus, the expected ru- Sice c k ig time is 2 Ω ), i.e., expoetial. c ) p k + e = Ω).

11 Ruig Time Aalysis of the +)-EA uder Bit-wise Noise 4 The LeadigOes problem I this sectio, we first aalyze the ruig time of the +)-EA o the LeadigOes problem uder bit-wise oise p, ) ad bit-wise oise, q), respectively. The, we trasfer the aalysis from bit-wise oise p, ) to oebit oise; the results are complemetary to the kow oes recetly derived i [20]. However, our aalysis does ot cover all the rages of p ad q. For those values of p ad q where o theoretical results are kow, we coduct experimets to empirically ivestigate the ruig time. 4. Bit-wise Noise p, ) For bit-wise oise p, ), we prove i Theorems 4-6 that the expected ruig time is polyomial if p = Olog / 2 ) ad super-polyomial if p = ωlog /). Their proofs are accomplished by applyig additive drift aalysis, the simplified drift theorem with self-loops ad the simplified drift theorem, respectively. Theorem 4 For the +)-EA o LeadigOes uder bit-wise oise p, ), the expected ruig time is polyomial if p = Olog / 2 ). Proof We use Lemma to prove it. For ay positive costat b, suppose that p b log / 2. Let LOx) deote the true umber of leadig -bits of a solutio x. We first costruct a distace fuctio V x) as, for ay x with LOx) = i, V x) = + c ) + c i, ) where c = 4b log +. It is easy to verify that V x X = { }) = 0 ad V x / X ) > 0. The, we ivestigate EV ξ t ) V ξ t+ ) ξ t = x) for ay x with LOx) < i.e., x / X ). Assume that curretly LOx) = i, where 0 i. Let P mut x, x ) deote the probability of geeratig x by mutatio o x. We divide the drift ito two parts: positive E + ad egative E. That is, where E + = E = x :LOx )>i x :LOx )<i EV ξ t ) V ξ t+ ) ξ t = x) = E + E, P mut x, x ) Pf x ) f x)) V x) V x )), P mut x, x ) Pf x ) f x)) V x ) V x)). For the positive drift, we eed to cosider that the umber of leadig -bits is icreased. By mutatio, we have P mut x, x ) = PLOx ) i + ) = ) i, 4) x :LOx )>i 2) 3)

12 2 Chao Qia et al. sice it eeds to flip the i + )-th bit which must be 0) of x ad keep the i leadig -bits uchaged. For ay x with LOx ) i +, f x ) < f x) implies that f x ) i or f x) i +. Note that, Pf x ) i ) = p ) ) i, 5) sice at least oe of the first i leadig -bits of x eeds to be flipped by oise; Pf x) i + ) = p ) i, 6) sice it eeds to flip the first 0-bit of x ad keep the leadig -bits uchaged by oise. By the uio boud, we get Pf x ) f x)) = Pf x ) < f x)) p ) ) i+ p i + 2, 7) where the last iequality is by p = Olog / 2 ). Furthermore, for ay x with V x ) i +, V x) V x ) + c ) i+ + c ) i c = + c i. 8) ) By combiig Eqs. 4.), 4.) ad 4.), we have E + ) i 2 c + c ) i c c ) i, where the last iequality is by )i ) e 3. For the egative drift, we eed to cosider that the umber of leadig -bits is decreased. By mutatio, we have P mut x, x ) = PLOx ) i ) = ) i, 9) x :LOx )<i sice it eeds to flip at least oe leadig -bit of x. For ay x with LOx ) i where i ), f x ) f x) implies that f x ) i or f x) i. Note that, Pf x ) i) p ) i, 0) sice for the first i bits of x, it eeds to flip the 0-bits whose umber is at least ) ad keep the -bits uchaged by oise; Pf x) i ) = p ) ) i, )

13 Ruig Time Aalysis of the +)-EA uder Bit-wise Noise 3 sice at least oe leadig -bit of x eeds to be flipped by oise. By the uio boud, we get Pf x ) f x)) p p 2 ) ) i p i +. 2) Furthermore, for ay x with LOx ) i, V x ) V x) + ) c i. 3) By combiig Eqs. 4.), 4.) ad 4.), we have E ) ) i p i + + c ) ) i ) p + c ) i 2p + c i. e 3 ) Thus, by subtractig E from E +, we have EV ξ t ) V ξ t+ ) ξ t = x) + c ) i c 6 2 2p ) 3 + c ) ) i 4b log + 2b log , 4) where the secod iequality is by c = 4b log + ad p b log / 2. Note that V x) + c ) e c = e 4b log + = e 4b. By Lemma, we get Eτ ξ 0 ) 6 2 e 4b = O 4b+2 ), i.e., the expected ruig time is polyomial. Theorem 5 For the +)-EA o LeadigOes uder bit-wise oise p, ), if p = ωlog /) o), the expected ruig time is super-polyomial. Proof We use Lemma 3 to prove it. Let X t = x 0 be the umber of 0-bits of the solutio x after t iteratios of the +)-EA. Let c be ay positive costat. We cosider the iterval [0, c log ], i.e., the parameters a = 0 i.e., the global optimum) ad b = c log i Lemma 3. The, we aalyze the drift EX t X t+ X t = i) for i < c log. As i the proof of Theorem 4, we divide the drift ito two parts: positive E + ad egative E. That is, EX t X t+ X t = i) = E + E, where E + = P mut x, x ) Pf x ) f x)) i x 0 ), E = x : x 0<i x : x 0>i P mut x, x ) Pf x ) f x)) x 0 i).

14 4 Chao Qia et al. For the positive drift, we eed to cosider that the umber of 0-bits is decreased. For mutatio o x where x 0 = i), let X ad Y deote the umber of flipped 0-bits ad -bits, respectively. The, X Bi, ) ad Y B i, ), where B, ) is the biomial distributio. To estimate a upper boud o E +, we assume that the offsprig solutio x with x 0 < i is always accepted. Thus, we have E + x : x 0<i = i = i i P mut x, x )i x 0 ) = k= k i j= j k= j=k j= j PX = j) = i. i k PX Y = k) 5) k= PX = j) PY = j k) k PX = j) PY = j k) For the egative drift, we eed to cosider that the umber of 0-bits is icreased. We aalyze the i cases where oly oe -bit is flipped i.e., x 0 = i + ), which happes with probability ). Assume that LOx) = k i. If the j-th where j k) leadig -bit is flipped, the offsprig solutio x will be accepted i.e., f x ) f x)) if f x ) j ad f x) j. Note that, Pf x ) j ) = p + p ) j p j 2, 6) where the equality is sice it eeds to keep the j leadig -bits of x uchaged, ad the last iequality is by p = o); Pf x) j ) = p ) ) j 7) = p ) j + ) j ) p e j pj 3, where the equality is sice at least oe of the first j leadig -bits of x eeds to be flipped by oise. Thus, we get Pf x ) f x)) pj 6. 8) If oe of the i k o-leadig -bits is flipped, LOx ) = LOx) = k. We ca use the same aalysis procedure as Eq. 4.) i the proof of Theorem 4 to derive that Pf x ) f x)) p k + 2, 9)

15 Ruig Time Aalysis of the +)-EA uder Bit-wise Noise 5 where the secod iequality is by p = o). Combiig all the i cases, we get E k pj ) 6 + i k i + i) 20) 2 j= pkk + ) + i k ) pk2 e i k. 6 By subtractig E from E +, we get EX t X t+ X t = i) i pk i k. 6 To ivestigate coditio ) of Lemma 3, we also eed to aalyze the probability PX t+ i X t = i). For X t+ i, it is ecessary that at least oe bit of x is flipped ad the offsprig x is accepted. We cosider two cases: ) at least oe of the k leadig -bits of x is flipped; 2) the k leadig - bits of x are ot flipped ad at least oe of the last k bits is flipped. For case ), the mutatio probability is )k ad the acceptace probability is at most p k+ by Eq. 4.). For case 2), the mutatio probability is )k ) k ) k ad the acceptace probability is at most. Thus, we have Whe k < p, we have PX t+ i X t = i) p + k. 2) EX t X t+ X t = i) i i k 6 k 2 p/2 7c log 6 k 2 24 p + k ), 22) where the secod iequality is by k > p ad i < c log, the third iequality is by p = ωlog /) ad the last is by k > p. Whe k p, we have EX t X t+ X t = i) i pk ) c log p 44 p 288 p + k ), 576 where the secod iequality is by p = o) ad i < c log, the third is by p = ωlog /) ad the last is by k p. Combiig Eqs. 4.), 4.) ad 4.), we get that coditio ) of Lemma 3 holds with ɛ = 576. For coditio 2) of Lemma 3, we eed to show P X t+ X t j X t = i) rl) +δ) PX j t+ i X t = i) for i. For PX t+ i X t = i), we aalyze the cases where oly oe bit is flipped. Usig the similar

16 6 Chao Qia et al. aalysis procedure as E, except that flippig ay bit rather tha oly -bit is cosidered here, we easily get PX t+ i X t = i) pkk + ) k 6. 24) For X t+ X t j, it is ecessary that at least j bits of x are flipped ad the offsprig solutio x is accepted. We cosider two cases: ) at least oe of the k leadig -bits is flipped; 2) the k leadig -bits are ot flipped. For j ) j case ), the mutatio probability is at most k ad the acceptace probability is at most p k+ by Eq. 4.). For case 2), the mutatio probability is at most )k ) k j ad the acceptace probability is at most. Thus, j we have P X t+ X t j X t = i) 25) k ) + pk j j + ) k ) k j j pkk + ) j + k 2 pkk + ) 2 j k ) j. By combiig Eq. 4.) with Eq. 4.), we get that coditio 2) of Lemma 3 holds with δ = ad rl) = 44. Note that l = b a = c log. By Lemma 3, the expected ruig time is 2 Ωc log ), where c is ay positive costat. Thus, the expected ruig time is super-polyomial. Theorem 6 For the +)-EA o LeadigOes uder bit-wise oise p, ), the expected ruig time is expoetial if p = Ω). Proof We use Lemma 2 to prove it. Let X t = i be the umber of 0-bits of the solutio x after t iteratios of the +)-EA. We cosider the iterval i [0, /2 ]. To aalyze the drift EX t X t+ X t = i) = E + E, we use the same aalysis procedure as Theorem 5. For the positive drift, we have = o). For the egative drift, we re-aalyze Eqs. 4.) ad 4.). E + i From Eqs. 4.) ad 4.), we get that Pf x ) j ) p j Pf x) j ) pj 3. Thus, Eq. 4.) becomes Pf x ) f x)) p2 j 3 ) ad j ). 26) For Eq. 4.), we eed to aalyze the acceptace probability for LOx ) = LOx) = k. Sice it is sufficiet to keep the first k + ) bits of x ad x uchaged i oise, Eq. 4.) becomes Pf x ) f x)) p 2 ) 2k+) p 2 k + ) 2. 27)

17 Ruig Time Aalysis of the +)-EA uder Bit-wise Noise 7 By applyig the above two iequalities to Eq. 4.), we have k E p2 j j + ) i k) k)2 e = Ω), j= where the equality is by p = Ω). Thus, E + E = Ω). That is, coditio ) of Lemma 2 holds. Sice it is ecessary to flip at least j bits of x, we have ) P X t+ X t j X t ) j j j! 2 2 j, which implies that coditio 2) of Lemma 2 holds with δ = ad rl) = 2. Note that l = /2. Thus, by Lemma 2, the expected ruig time is expoetial. 4.2 Bit-wise Noise, q) For bit-wise oise, q), we prove i Theorems 7-9 that the expected ruig time is polyomial if q = Olog / 3 ) ad super-polyomial if q = ωlog / 2 ). The proof idea is similar to that for bit-wise oise p, ). The mai differece led by the chage of oise is the probability of acceptig the offsprig solutio, i.e., Pf x ) f x)). Theorem 7 For the +)-EA o LeadigOes uder bit-wise oise, q), the expected ruig time is polyomial if q = Olog / 3 ). Proof The proof is very similar to that of Theorem 4. The chage of oise oly affects the probability of acceptig the offsprig solutio i the aalysis. For ay positive costat b, suppose that q b log / 3. For the positive drift E +, we eed to re-aalyze Pf x ) f x)) i.e., Eq. 4.) i the proof of Theorem 4) for the paret x with LOx) = i ad the offsprig x with LOx ) i +. By bit-wise oise, q), Eqs. 4.) ad 4.) chage to Pf x ) i ) = q) i ; Pf x) i + ) = q) i q. Thus, by the uio boud, Eq. 4.) becomes Pf x ) f x)) q) i + q) i q) 28) = q) i+ qi + ) /2, where the last iequality is by q = Olog / 3 ). For the egative drift E, we eed to re-aalyze Pf x ) f x)) i.e., Eq. 4.) i the proof of Theorem 4) for the paret x with LOx) = i where

18 8 Chao Qia et al. i ) ad the offsprig x with LOx ) i. By bit-wise oise, q), Eqs. 4.) ad 4.) chage to Pf x ) i) q q) i, Pf x) i ) = q) i. Thus, by the uio boud, Eq. 4.) becomes Pf x ) f x)) q q) i + q) i 29) = q) i 2q) i )q) 2q) i + )q, where the secod iequality is by q) i i )q ad 2q > 0 for q = Olog / 3 ). By applyig Eq. 4.2) ad Eq. 4.2) to E + ad E, respectively, Eq. 4.) chages to EV ξ t ) V ξ t+ ) ξ t = x) + c ) i 4b log c ) i c 2qi + ) 62 3 ) 2b log That is, the coditio of Lemma still holds with 6 2. Thus, the expected ruig time is polyomial. Theorem 8 For the +)-EA o LeadigOes uder bit-wise oise, q), if q = ωlog / 2 ) o/), the expected ruig time is super-polyomial. Proof We use the same aalysis procedure as Theorem 5. The oly differece is the probability of acceptig the offsprig solutio due to the chage of oise. For the positive drift, we still have E + i, sice we optimistically assume that x is always accepted i the proof of Theorem 5. For the egative drift, we eed to re-aalyze Pf x ) f x)) for the paret solutio x with LOx) = k ad the offsprig solutio x with LOx ) = j where j k + ). For j k, to derive a lower boud o Pf x ) f x)), we cosider the j cases where f x) = l ad f x ) l for 0 l j. Sice Pf x) = l) = q) l q ad Pf x ) l) = q) l, Eq. 4.) chages to ) j Pf x ) f x)) q) l q q) l l=0 = 2 q)2j + q ) 2j ) q q)2j 2 q) 2j qj q qj 2, 30) where the last iequality is by q) 2j 2qj /2 sice q = o/). For j = k+ i.e., LOx ) = LOx) = k), we ca use the same aalysis as Eq. 4.2) to derive a lower boud /2, sice the last iequality of Eq. 4.2) still holds with q = o/). Thus, Eq. 4.) also holds here, i.e., Pf x ) f x)) 2. 3)

19 Ruig Time Aalysis of the +)-EA uder Bit-wise Noise 9 By applyig Eqs. 4.2) ad 4.2) to E, Eq. 4.) chages to Thus, we have E qk2 2 + i k. 6 EX t X t+ X t = i) = E + E i qk2 2 i k. 6 For the upper boud aalysis of PX t+ i X t = i) i the proof of Theorem 5, we oly eed to replace the acceptace probability p k+ i the case of LOx ) < LOx) with k + )q i.e., Eq. 4.2)). Thus, Eq. 4.) chages to PX t+ i X t = i) k + )q + k q + k. To compare EX t X t+ X t = i) with PX t+ i X t = i), we cosider two cases: k < 2 q ad k 2 q. By usig q = ωlog / 2 ) ad applyig the same aalysis procedure as Eqs. 4.) ad 4.), we ca derive that coditio ) of Lemma 3 holds with ɛ = 92. For the lower boud aalysis of PX t+ i X t = i), by applyig Eqs. 4.2) ad 4.2), Eq. 4.) chages to PX t+ i X t = i) qkk + ) 2 + k 6. For the aalysis of X t+ X t j, by replacig the acceptace probability i the case of LOx ) < LOx) with k + )q, Eq. 4.) chages to p k+ qkk + ) P X t+ X t j X t = i) 4 2 j + k qkk + ) + k j ) 48 2 j. That is, coditio 2) of Lemma 3 holds with δ =, rl) = 48. Thus, the expected ruig time is super-polyomial. Theorem 9 For the +)-EA o LeadigOes uder bit-wise oise, q), the expected ruig time is expoetial if q = Ω/). Proof We use Lemma 2 to prove it. Let X t = i be the umber of 0-bits of the solutio x after t iteratios of the +)-EA. We cosider the iterval i [0, /2 ]. To aalyze the drift EX t X t+ X t = i), we use the same aalysis procedure as the proof of Theorem 5. We first cosider q = Ω/) o). We eed to aalyze the probability Pf x ) f x)), where the offsprig solutio x is geerated by flippig oly oe -bit of x. Let LOx) = k. For the case where the j-th where j k) leadig -bit is flipped, as the aalysis of Eq. 4.2), we get Pf x ) f x)) q)2j 2 q) 2j qj q.

20 20 Chao Qia et al. If q) 2j < 2, q)2j 2 qj 4 ; otherwise, q)2j q qj 2 Pf x ) f x)) mi{/4, qj/2}.. Thus, we have For the case that flips oe o-leadig -bit i.e., LOx ) = LOx) = k), to derive a lower boud o Pf x ) f x)), we cosider f x) = l ad f x ) l for 0 l k. Thus, k Pf x ) f x)) q) l q q) l + q) k+ q) k q)2k 2 l=0 + q) 2k+ = 2 + q)2k 2 q ) 2, where the last iequality is by q = o). By applyig the above two iequalities to Eq. 4.), we get E k { mi e 4, qj } + i k. 2 2 j= If k 2, k j= mi{ 4, qj 2 } = Ω) sice q = Ω/). If k < 2, i k 2 = Ω) sice i. Thus, E = Ω). For q = Ω), we use the trivial lower boud q for the probability of acceptig the offsprig solutio x, sice it is sufficiet to flip the first leadig -bit of x by oise. The, E i)q kq + i k)q) = = Ω). e e Thus, for q = Ω/), we have EX t X t+ X t = i) = E + E i Ω) = Ω). That is, coditio ) of Lemma 2 holds. Its coditio 2) trivially holds with δ = ad rl) = 2. Thus, the expected ruig time is expoetial. 4.3 Oe-bit Noise For the +)-EA o LeadigOes uder oe-bit oise, it has bee kow that the ruig time is polyomial if p /6e 2 ) ad expoetial if p = /2 [20]. We exted this result by provig i Theorem 0 that the ruig time is polyomial if p = Olog / 2 ) ad super-polyomial if p = ωlog /). The proof ca be accomplished as same as that of Theorems 4, 5 ad 6 for bit-wise oise p, ). This is because although the probabilities Pf x ) f x)) of acceptig the offsprig solutio are differet, their bouds used i the proofs for bit-wise oise p, ) still hold for oe-bit oise.

21 Ruig Time Aalysis of the +)-EA uder Bit-wise Noise 2 Theorem 0 For the +)-EA o LeadigOes uder oe-bit oise, the expected ruig time is polyomial if p = Olog / 2 ), super-polyomial if p = ωlog /) o) ad expoetial if p = Ω). Proof We re-aalyze Pf x ) f x)) for oe-bit oise, ad show that the bouds o Pf x ) f x)) used i the proofs for bit-wise oise p, ) still hold for oe-bit oise. For the proof of Theorem 4, Eqs. 4.) ad 4.) chage to Pf x ) i ) = p i, Pf x) i + ) = p, ad thus Eq. 4.) still holds; Eqs. 4.) ad 4.) chage to Pf x ) i) p, Pf x) i ) = p i, ad thus Eq. 4.) still holds. For the proof of Theorem 5, Eqs. 4.) ad 4.) chage to Pf x ) j ) = p j, Pf x) j ) = p j, ad thus Eq. 4.) still holds. For the proof of Theorem 6, Eq. 4.) still holds by the above two equalities; Eq. 4.) still holds sice the probability of keepig the first k + ) bits of a solutio uchaged i oe-bit oise is p k+ p k+ ). 4.4 Experimets I the above three subsectios, we have proved that for the +)-EA solvig the LeadigOes problem, if uder bit-wise oise p, ), the expected ruig time is polyomial whe p = Olog / 2 ) ad super-polyomial whe p = ωlog /); if uder bit-wise oise, q), the expected ruig time is polyomial whe q = Olog / 3 ) ad super-polyomial whe q = ωlog / 2 ); if uder oe-bit oise, the expected ruig time is polyomial whe p = Olog / 2 ) ad super-polyomial whe p = ωlog /). However, the curret aalysis does ot cover all the rages of p ad q. We thus have coducted experimets to complemet the theoretical results. For bit-wise oise p, ), we do ot kow whether the ruig time is polyomial or super-polyomial whe p = ωlog / 2 ) Olog /). We empirically estimate the expected ruig time for p = log /) 2, log / 3/2 ad log /. O each problem size, we ru the +)-EA 000 times idepedetly. I each ru, we record the umber of fitess evaluatios util a optimal solutio w.r.t. the true fitess fuctio is foud for the first time. The the total umber of evaluatios of the 000 rus are averaged as the estimatio of the expected ruig time. To show the relatioship betwee the expected ruig time ad the problem size clearly, we plot the curve

22 22 Chao Qia et al Estimated ratio Problem size Estimated ratio Problem size Estimated ratio Problem size a) p = log /) 2 b) p = log / 3/2 c) p = log / Figure The expected ruig time for the +)-EA solvig LeadigOes uder bit-wise oise p, ), where the y-axis is the logarithm of the estimated expected ruig time) divided by log. Note that a logarithmic scale is used for the x-axis [5, 30]) i subfigure a). Estimated ratio Problem size Estimated ratio Problem size Estimated ratio Problem size a) q = log ) 2 / 3 b) q = log / 5/2 c) q = log / 2 Figure 2 The expected ruig time for the +)-EA solvig LeadigOes uder bit-wise oise, q), where the y-axis is the logarithm of the estimated expected ruig time) divided by log. Note that a logarithmic scale is used for the x-axis [5, 30]) i subfigure a). Estimated ratio Problem size Estimated ratio Problem size Estimated ratio Problem size a) p = log /) 2 b) p = log / 3/2 c) p = log / Figure 3 The expected ruig time for the +)-EA solvig LeadigOes uder oe-bit oise, where the y-axis is the logarithm of the estimated expected ruig time) divided by log. Note that a logarithmic scale is used for the x-axis [5, 30]) i subfigure a). of logexpected ruig time)/ log, as show i Figure. Note that i subfigure a), the problem size is i the rage from 5 to 30, ad a base e logarithmic scale is used. We ca observe that all the curves grow i a closely liear tred. These empirical results imply that the expected ruig time for p = log /) 2, log / 3/2 ad log / is approximately i the order of Θlog ), Θ) ad Θ), respectively. For bit-wise oise, q), the expected ruig time is theoretically ot kow whe q = ωlog / 3 ) Olog / 2 ). We thus empirically estimate the expected ruig time for q = log ) 2 / 3, log / 5/2 ad log / 2. From Figure 2, we ca also observe that all the curves grow i a closely liear tred. Therefore, the observatio suggests that the expected ruig time for q = log ) 2 / 3, log / 5/2 ad log / 2 is approximately i the order of Θlog ), Θ) ad Θ), respectively.

23 Ruig Time Aalysis of the +)-EA uder Bit-wise Noise 23 Figure 3 shows the empirical results for oe-bit oise, which are similar to that observed for bit-wise oise p, ). That is, the expected ruig time for p = log /) 2, log / 3/2 ad log / is approximately i the order of Θlog ), Θ) ad Θ), respectively. Thus, these empirical results disclose that the curret rage of p or q allowig a polyomial ruig time may be tight, that is, the expected ruig time is super-polyomial for the ucovered rage of p or q i theoretical aalysis. The rigorous aalysis is ot easy. We may eed to aalyze trasitio probabilities betwee fitess levels more precisely, ad desig a igeious distace fuctio or use more advaced aalysis tools. We leave it as a future work. 5 The Robustess of Samplig to Noise From the derived results i the above two sectios, we ca observe that the +)-EA is efficiet for solvig OeMax ad LeadigOes oly uder low oise levels. For example, for the +)-EA solvig OeMax uder bitwise oise p, ), the optimal solutio ca be foud i polyomial time oly whe p = Olog /). I this sectio, we show that usig the samplig strategy ca sigificatly icrease the largest oise level allowig a polyomial ruig time. For example, if usig samplig, the +)-EA ca always solve OeMax uder bit-wise oise p, ) i polyomial time, regardless of the value of p. 5. The OeMax Problem We prove i Theorems ad 4 that uder bit-wise oise p, ) or oe-bit oise, the +)-EA ca always solve OeMax i polyomial time by usig samplig. For bit-wise oise, q), the tight rage of q allowig a polyomial ruig time is /2 / O), as show i Theorems 2 ad 3. Let x k deote ay solutio with k umber of -bits, ad f x k ) deote its oisy objective value. For provig polyomial upper bouds, we use Lemma 4, which gives a sufficiet coditio based o the probability Pf x j ) < f x k+ )) for j k. But for the +)-EA usig samplig, the probability chages to be P ˆfx j ) < ˆfx k+ )), where ˆfx j ) = m m i= f i xj ) as show i Defiitio 5. Lemma 4 requires a lower boud o P ˆfx j ) < ˆfx k+ )). Our proof idea as preseted i Lemma 6 is to derive a lower boud o the expectatio of f x k+ ) f x j ) ad the apply Chebyshev s iequality. We will directly use Lemma 6 i the followig proofs. For provig super-polyomial lower bouds, we use Lemma 5 by replacig Pf x k ) < f x k+ )) with P ˆfx k ) < ˆfx k+ )). Let poly) idicate ay polyomial of. Before givig the proof, we first ituitively explai why samplig is always effective for bit-wise oise p, ) ad oe-bit oise, while it fails for bitwise oise, q) whe q = /2 / ω) or q /2. For two solutios x ad y

24 24 Chao Qia et al. with fx) > fy), if uder bit-wise oise p, ) ad oe-bit oise, the oisy fitess f x) is larger tha f y) i expectatio, ad usig samplig will icrease this tred ad make the probability of acceptig the true worse solutio y sufficietly small. If uder bit-wise oise, q), whe q = /2 / ω), although the oisy fitess f x) is still larger i expectatio, the gap is very small i the order of / ω) ) ad a polyomial sample size is ot sufficiet to make the probability of acceptig the true worse solutio y small eough; whe q /2, the oisy fitess f x) is smaller i expectatio, ad usig samplig will icrease this tred ad it obviously does ot work. Lemma 6 Suppose there exists a real umber δ > 0 such that j k < : Ef x k+ ) f x j )) δ, the the +)-EA usig samplig with m = 3 /δ 2 eeds polyomial umber of iteratios i expectatio for solvig oisy OeMax. Proof We use Lemma 4 to prove it. For ay j k <, let Y k,j = f x k+ ) f x j ) ad Ŷk,j = ˆfx k+ ) ˆfx j ). We the eed to aalyze the probability P ˆfx j ) < ˆfx k+ )) = PŶk,j > 0). Deote the expectatio EY k,j ) as µ k,j ad the variace VarY k,j ) as σ 2 k,j. It is easy to verify that EŶk,j) = µ k,j ad VarŶk,j) = σk,j 2 /m. By Chebyshev s iequality, we have PŶk,j 0) P Ŷk,j µ k,j µ k,j /2) 4σ 2 k,j/mµ 2 k,j). Sice µ k,j δ > 0, σ 2 k,j = EY 2 k,j ) µ2 k,j 2 ad m = 3 /δ 2, we have PŶk,j 0) 4/ log /5), where the last iequality holds with sufficietly large. Let l = log. The, PŶk,j > 0) log 5 > l. Let c = 5. For k < l, PŶk,j > 0) c k. Thus, the coditio of Lemma 4 i.e., Eq. 4)) holds. We the get that the expected umber of iteratios is O log ) + 2 Olog ) = O), i.e., polyomial. Theorem For the +)-EA o OeMax uder bit-wise oise p, ), if usig samplig with m = 4 3, the expected ruig time is polyomial. Proof We use Lemma 6 to prove it. Sice Ef x j )) = j p ) + j) p = 2p )j + p, we have, for ay j k <, Ef x k+ ) f x j )) = 2p ) k + j) 2p /2, where the last iequality holds with 4. Thus, by Lemma 6, we get that the expected umber of iteratios of the +)-EA usig samplig with m = 4 3 is polyomial. Sice each iteratio takes 2m = 8 3 umber of fitess evaluatios, the expected ruig time is also polyomial.

25 Ruig Time Aalysis of the +)-EA uder Bit-wise Noise 25 Theorem 2 For the +)-EA o OeMax uder bit-wise oise, q) with q = /2 / O), if usig samplig, there exists some m = Opoly)) such that the expected ruig time is polyomial. Proof We use Lemma 6 to prove it. Sice q = /2 / O), there exists a positive costat c such that q /2 / c. It is easy to verify that Ef x j )) = j q) + j)q = 2q)j + q. Thus, for ay j k <, Ef x k+ ) f x j )) = 2q)k + j) 2q 2/ c. By Lemma 6, we get that if usig samplig with m = 3+2c /4, the expected umber of iteratios is polyomial, ad the the expected ruig time is polyomial. Thus, the theorem holds. Theorem 3 For the +)-EA o OeMax uder bit-wise oise, q) with q = /2 / ω) or q /2, if usig samplig with ay m = Opoly)), the expected ruig time is expoetial. Proof We use Lemma 5 to prove it. Note that for the +)-EA usig samplig, we have to aalyze P ˆfx k )< ˆfx k+ )) istead of Pf x k )<f x k+ )). Let Z deote a radom variable which satisfies that PZ = 0) = q ad PZ = ) = q. I the followig proof, each Z i is a idepedet radom variable, which has the same distributio as Z. We have f x k ) = k i= Z i + i=k+ Z i), ad the, f x k+ ) f x k ) k+ = Z i + Z i ) i= + = Z i i= i=k+2 2 i=+2 Z i. +k i=+ Z i 2 i=+k+ Z i ) Sice ˆfx k ) = m m i= f i xk ), which is the average of m idepedet evaluatios, we have m ˆfx k+ ) ˆfx k )) = = = m 2j+2 j=0 i=2j+ m 2j+2 j=0 i=2j+ m m Z i + 2j++ j=0 i=2j+ 2j++ j=0 i=2j+3 Z i + Z m, m Z i m Z i 2j+) j=0 i=2j++2 2j+) j=0 i=2j++2 Z i m Z i m

26 26 Chao Qia et al. where Z = m 2j++ j=0 i=2j+3 Z i m 2j+) j=0 i=2j++2 Z i. To make ˆfx k ) ˆfx k+ ), it is sufficiet that Z 0 ad m 2j+2 j=0 i=2j+ Z i m. That is, P ˆfx k ) ˆfx m k+ )) PZ 0) P 2j+2 j=0 i=2j+ Z i m. 32) Sice Z is the differece betwee the sum of the same umber of Z i, Z has the same distributio as Z. Thus, PZ 0) + PZ 0) = PZ 0) + P Z 0) = 2PZ 0), which implies that PZ 0) /2. 33) We the ivestigate P m 2j+2 j=0 i=2j+ Z i m). Sice m 2j+2 j=0 i=2j+ Z i is the sum of 2m idepedet radom variables which have the same distributio as Z, we have m 2j+2 m ) P Z i m 2m = q) t q 2m t, t m P 2j+2 j=0 i=2j+ j=0 i=2j+ Z i > m = 2m t=m+ For ay t < m, let r = q)t q 2m t q) 2m t q t q = /2 / ω), we have q r q 4 ω) t=0 ) m 2m ) 2m q) t q 2m t = q) 2m t q t. t t t=0 = q q )2m 2t. If q /2, we have r. If ) 2m = 2q q ) 2m e ) 2m ) 2m/ ω) /4 ) e, where the first iequality is by q /2, the secod iequality is by 2q = 2/ ω) ad q /2, ad the last is by m = Opoly)). Thus, P m 2j+2 j=0 i=2j+ Z i m) > /3 P m 2j+2 j=0 i=2j+ Z i > m), which implies that m 2j+2 P Z i m > /4. 34) j=0 i=2j+ By applyig Eqs. 5.) ad 5.) to Eq. 5.), we get P ˆfx k ) ˆfx k+ )) /8. Let c = 6 ad l = /28. For ay l k <, P ˆfx k ) < ˆfx k+ )) = P ˆfx k ) ˆfx k+ )) cl c k),

27 Ruig Time Aalysis of the +)-EA uder Bit-wise Noise 27 i.e., the coditio of Lemma 5 holds. Thus, the expected umber of iteratios is 2 Ω/28), ad the expected ruig time is expoetial. Theorem 4 For the +)-EA o OeMax uder oe-bit oise, if usig samplig with m = 4 3, the expected ruig time is polyomial. Proof It is easy to verify that the expectatio of f x j ) i.e., Ef x j ))) uder oe-bit oise is as same as that uder bit-wise oise p, ). Thus, the proof ca be fiished as same as that of Theorem. 5.2 The LeadigOes Problem The bit-wise oise p, ) model is first cosidered. We prove i Theorem 5 that the +)-EA usig samplig ca solve the LeadigOes problem i polyomial time, regardless of the value of p. The proof idea is similar to that of Theorem 4. The mai differece is the probability of acceptig the offsprig solutio x, which is chaged from Pf x ) f x)) to P ˆfx ) ˆfx)) due to samplig. Lemma 7 gives some bouds o this probability, which will be used i the proof of Theorem 5. Let LOx) deote the true umber of leadig -bits of a solutio x. Lemma 7 For the LeadigOes problem uder bit-wise oise p, ), if usig samplig with m = 44 6, it holds that ) for ay x with LOx) = i < ad y with LOy) i 2 or LOy) = i y i+ = 0, P ˆfx) ˆfy)) / 2. 2) for ay y with LOy) <, P ˆf ) ˆfy)) /4 4 ). Proof The proof is fiished by derivig a lower boud o the expectatio of f x) f y) which is equal to the expectatio of ˆfx) ˆfy)) ad the applyig Chebyshev s iequality. We first cosider case ). For ay x with LOx) = i <, i Ef x)) p) i + p ) j j ) + p j= ) i i + ) + p i Ef x)) p) i + + p j= p ) j ) i + p ) i+ i, 35) j ) ) i+ i. Note that whe flippig the first 0-bit of x ad keepig the i leadig -bits uchaged, the fitess is at least i + ad at most. The for ay i <, we have Ef x) f y) LOx) = i LOy) = i ) 36)

Running Time Analysis of the (1+1)-EA for OneMax and LeadingOnes under Bit-wise Noise

Running Time Analysis of the (1+1)-EA for OneMax and LeadingOnes under Bit-wise Noise Ruig Time Aalysis of the +-EA for OeMax ad LeadigOes uder Bit-wise Noise Chao Qia Uiversity of Sciece ad Techology of Chia Hefei 3007, Chia chaoqia@ustc.edu.c Wu Jiag Uiversity of Sciece ad Techology of