arxiv: v1 [cs.ai] 20 Nov 2013
Analyzing Evolutionary Optimization in Noisy Environments

Chao Qian, Yang Yu, Zhi-Hua Zhou
National Key Laboratory for Novel Software Technology, Nanjing University, Nanjing 210023, China

arXiv:1311.4987v1 [cs.AI] 20 Nov 2013

Abstract

Many optimization tasks have to be handled in noisy environments, where we cannot obtain the exact evaluation of a solution but only a noisy one. For noisy optimization tasks, evolutionary algorithms (EAs), a kind of stochastic metaheuristic search algorithm, have been widely and successfully applied. Previous work mainly focuses on empirically studying and designing EAs for noisy optimization, while the theoretical counterpart has been little investigated. In this paper, we investigate a largely ignored question, i.e., whether an optimization problem will always become harder for EAs in a noisy environment. We prove that the answer is negative, with respect to the measurement of the expected running time. The result implies that, for optimization tasks that are already quite hard to solve, the noise may not have a negative effect, and that the easier a task is, the more negatively it is affected by the noise. On a representative problem where the noise has a strong negative effect, we examine two commonly employed mechanisms in EAs for dealing with noise, the re-evaluation and the threshold selection strategies. The analysis discloses that neither strategy is effective, i.e., they do not make the EA more noise tolerant. We then find that a small modification of the threshold selection allows it to be proven as an effective strategy for dealing with the noise in the problem.

Key words: Noisy optimization, evolutionary algorithms, re-evaluation, threshold selection, running time, computational complexity

Corresponding author addresses: qianc@lamda.nju.edu.cn (Chao Qian), yuy@nju.edu.cn (Yang Yu), zhouzh@nju.edu.cn (Zhi-Hua Zhou)

Preprint submitted for review October 8, 2018
1. Introduction

Optimization tasks often encounter noisy environments. For example, in airplane design, every prototype is evaluated by simulations, so the evaluation result may not be perfect due to the simulation error; and in machine learning, a prediction model is evaluated only on a limited amount of data, so the estimated performance is shifted from the true performance. Noisy environments could change the property of an optimization problem, thus traditional optimization techniques may have low efficacy. Evolutionary algorithms (EAs) [4], in contrast, have been widely and successfully adopted for noisy optimization tasks [5, 22, 9, 8].

EAs are a kind of randomized metaheuristic optimization algorithms, inspired by natural phenomena including evolution of species, swarm cooperation, immune system, etc. EAs typically involve a cycle of three stages: the reproduction stage produces new solutions based on the currently maintained solutions; the evaluation stage evaluates the newly generated solutions; the selection stage wipes out bad solutions. An inspiration for using EAs for noisy optimization is that the corresponding natural phenomena have proceeded successfully in noisy environments, and hence the algorithmic simulations are also likely to be able to handle noise.

Besides, improved mechanisms have been invented for better handling noise. Two representative strategies are re-evaluation and threshold selection. By the re-evaluation strategy [2, 6], whenever the fitness (also called cost or objective value) of a solution is required, EAs make an independent evaluation of the solution regardless of whether the solution has been evaluated before, such that the fitness is smoothed. By the threshold selection strategy [23, 6, 5], in the selection stage EAs accept a newly generated solution only if its fitness is larger than the fitness of the old solution by at least a threshold, such that the risk of accepting a bad solution due to noise is reduced.
An assumption implied by using a noise handling mechanism in EAs is that the noise makes the optimization harder, so that a better handling mechanism can reduce the negative effect of the noise [3, 7, 26, 3]. This paper first investigates whether this assumption is true. We start by presenting experimental evidence using the (1+1)-EA optimizing the hardest case in the pseudo-Boolean function class [25]. The experimental results indicate that the noise makes the optimization easier rather than harder, under the measurement of expected running time. Following the experimental evidence, we then derive sufficient theoretical conditions under which the noise will make the optimization easier or harder. By verifying the conditions, we present proofs that, for the (1+λ)-EA (a class of EAs employing offspring population size λ), the noise will make the optimization easier on the hardest case in the pseudo-Boolean function class, while harder on the easiest case. The proofs imply that we need to take care of the noise only when the optimization is moderately or less complex, and can ignore this issue when the optimization task itself is quite hard.
For the situations where the noise needs to be handled, this paper examines the re-evaluation and the threshold selection strategies for their polynomial noise tolerance (PNT). For a kind of noise, the PNT of an EA is the maximum noise level such that the expected running time of the algorithm is polynomial. The closer the PNT is to 1, the better the noise tolerance is. Taking the easiest pseudo-Boolean function case as the representative problem, we analyze the PNT for different configurations of the (1+1)-EA with respect to the one-bit noise, whose level is characterized by the noise probability. For the (1+1)-EA (without any noise handling strategy), we prove that the PNT has a lower bound $1 - 1/\Omega(poly(n))$ and an upper bound $1 - 1/O(2^n poly(n))$. Since the (1+1)-EA with re-evaluation has the PNT $\Theta(\log(n)/n)$ [10], it is surprising that the re-evaluation makes the PNT much worse. We further prove that for the (1+1)-EA with re-evaluation using threshold selection, when the threshold is 1, the PNT is not less than $1/(2e)$, and when the threshold is 2, the PNT has a lower bound $1 - 1/\Omega(poly(n))$ and an upper bound $1 - 1/O(2^n poly(n))$. The PNT bounds indicate that threshold selection improves the re-evaluation strategy; however, no improvements over the (1+1)-EA are found. We then introduce a small modification into the threshold selection strategy to turn the original hard threshold into a smooth threshold. We prove that with the smooth threshold selection strategy the PNT is 1, i.e., the (1+1)-EA is always a polynomial algorithm regardless of the probability of one-bit noise on the problem.

The rest of this paper is organized as follows. Section 2 introduces some background. Section 3 shows that the noise may not always be bad, and presents a sufficient condition for that. Section 4 analyzes noise handling strategies. Section 5 concludes.

2. Background

2.1. Noisy Optimization

A general optimization problem can be represented as $\arg\max_x f(x)$, where the objective $f$ is also called fitness in the context of evolutionary computation.
In real-world optimization tasks, the fitness evaluation for a solution is usually disturbed by noise, and consequently we cannot obtain the exact fitness value but only a noisy one. In this paper, we will consider the following kinds of noise, and we will always denote $f^N(x)$ and $f(x)$ as the noisy and true fitness of a solution $x$, respectively.

additive noise: $f^N(x) = f(x) + \delta$, where $\delta$ is uniformly selected from $[\delta_1, \delta_2]$ at random.

multiplicative noise: $f^N(x) = f(x) \cdot \delta$, where $\delta$ is uniformly selected from $[\delta_1, \delta_2]$ at random.

one-bit noise: $f^N(x) = f(x)$ with probability $(1 - p_n)$ $(0 \le p_n \le 1)$; otherwise, $f^N(x) = f(x')$, where $x'$ is generated by flipping a uniformly randomly chosen bit of $x \in \{0,1\}^n$. This noise is for problems where solutions are represented as binary strings.

Additive and multiplicative noise have often been used for analyzing the effect of noise [7, 2]. One-bit noise is specifically for optimizing pseudo-Boolean problems over $\{0,1\}^n$, and is also the noise investigated in the only previous work analyzing the running time of EAs in noisy optimization [10]. For one-bit noise, $p_n$ controls the noise level. In this paper we assume that the parameters of the environment (i.e., $p_n$, $\delta_1$ and $\delta_2$) do not change over time.

It is possible that a large noise could make an optimization problem extremely hard for particular algorithms. We are interested in the noise level under which an algorithm can still have polynomial running time. We define the polynomial noise tolerance (PNT) in Definition 1, which characterizes the maximum noise level allowing a polynomial expected running time. Note that the noise level can be measured by the adjusting parameter, e.g., $\delta_1, \delta_2$ for the additive and multiplicative noise, and $p_n$ for the one-bit noise. We will study the PNT of EAs for analyzing the effectiveness of noise handling strategies.

Definition 1 (Polynomial Noise Tolerance (PNT))
The polynomial noise tolerance of an algorithm on a problem, with respect to a kind of noise, is the maximum noise level such that the algorithm has expected running time polynomial in the problem size.

2.2. Evolutionary Algorithms

Evolutionary algorithms (EAs) [4] are a kind of population-based metaheuristic optimization algorithms. Although there exist many variants, the common procedure of EAs can be described as follows:
1. Generate an initial set of solutions (called population);
2. Reproduce new solutions from the current population;
3. Evaluate the newly generated solutions;
4. Update the population by removing bad solutions;
5. Repeat steps 2-4 until some criterion is met.
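The three noise models of Section 2.1 can be sketched in a few lines of Python. This is only an illustrative sketch: the function names, the list-of-bits representation of a solution, and the `rng` parameter are assumptions made here and do not come from the paper.

```python
import random

def additive_noise(f, x, d1, d2, rng=random):
    """Noisy fitness f(x) + delta, with delta uniform in [d1, d2]."""
    return f(x) + rng.uniform(d1, d2)

def multiplicative_noise(f, x, d1, d2, rng=random):
    """Noisy fitness f(x) * delta, with delta uniform in [d1, d2]."""
    return f(x) * rng.uniform(d1, d2)

def one_bit_noise(f, x, p_n, rng=random):
    """With probability p_n, evaluate a copy of x with one uniformly
    chosen bit flipped; otherwise return the true fitness f(x)."""
    if rng.random() < p_n:
        i = rng.randrange(len(x))
        y = list(x)
        y[i] = 1 - y[i]
        return f(y)
    return f(x)
```

For instance, with `f = sum` (the number of 1 bits) and `x = [1, 0, 1, 1]`, `one_bit_noise(sum, x, 0.0)` always returns the true fitness 3, while `one_bit_noise(sum, x, 1.0)` returns either 2 or 4.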
The (1+1)-EA, as in Algorithm 1, is a simple EA for maximizing pseudo-Boolean problems over $\{0,1\}^n$, which reflects the common structure of EAs. It maintains only one solution, and repeatedly improves the current solution by using bit-wise mutation (i.e., the 3rd step of Algorithm 1). It has been widely used for the running time analysis of EAs, e.g., [7, 2].
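A minimal Python sketch of the (1+1)-EA described above is given here, assuming a noise-free fitness function and single evaluation of the parent. The names `one_plus_one_ea`, `is_optimal` and `max_iters` are hypothetical helpers introduced for illustration; the stopping test in particular is an assumption, since the algorithm itself only speaks of "the termination condition".

```python
import random

def one_plus_one_ea(f, n, p, is_optimal, max_iters=100_000, rng=random):
    """Maximize f over {0,1}^n with bit-wise mutation probability p.

    Returns the final solution and the number of fitness evaluations used.
    `is_optimal` is an illustrative stopping test, not part of the algorithm.
    """
    x = [rng.randint(0, 1) for _ in range(n)]   # random initial solution
    fx = f(x)                                   # evaluated once and reused
    evals = 1
    for _ in range(max_iters):
        if is_optimal(x):
            break
        # flip each bit independently with probability p
        y = [1 - b if rng.random() < p else b for b in x]
        fy = f(y)
        evals += 1
        if fy >= fx:                            # accept if not worse
            x, fx = y, fy
    return x, evals
```

Calling `one_plus_one_ea(sum, 8, 1/8, all)` maximizes the number of 1 bits of an 8-bit string; the returned evaluation count corresponds to the running time measure used in this paper.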
Algorithm 1 ((1+1)-EA)
Given a pseudo-Boolean function $f$ with solution length $n$, it consists of the following steps:
1. $x :=$ randomly selected from $\{0,1\}^n$.
2. Repeat until the termination condition is met
3.   $x' :=$ flip each bit of $x$ with probability $p$.
4.   if $f(x') \ge f(x)$
5.     $x := x'$.
where $p \in (0, 0.5)$ is the mutation probability.

The (1+λ)-EA, as in Algorithm 2, applies an offspring population size λ. In each iteration, it first generates λ offspring solutions by independently mutating the current solution λ times, and then selects the best solution from the current solution and the offspring solutions as the next solution. It has been used to disclose the effect of offspring population size by running time analysis [20, 24]. Note that the (1+1)-EA is a special case of the (1+λ)-EA with λ = 1.

Algorithm 2 ((1+λ)-EA)
Given a pseudo-Boolean function $f$ with solution length $n$, it consists of the following steps:
1. $x :=$ randomly selected from $\{0,1\}^n$.
2. Repeat until the termination condition is met
3.   $i := 1$.
4.   Repeat until $i > \lambda$.
5.     $x_i :=$ flip each bit of $x$ with probability $p$.
6.     $i := i + 1$.
7.   $x := \arg\max_{x' \in \{x, x_1, \ldots, x_\lambda\}} f(x')$.
where $p \in (0, 0.5)$ is the mutation probability.

The running time of EAs is usually defined as the number of fitness evaluations (i.e., computing $f(\cdot)$) until an optimal solution is found for the first time, since the fitness evaluation is the computational process with the highest cost of the algorithm [7, 28].

2.3. Markov Chain Modeling

We will analyze EAs by modeling them as Markov chains in this paper. Here, we first give some preliminaries.

EAs generate solutions only based on their currently maintained solutions, thus they can be modeled and analyzed as Markov chains, e.g., [7, 28]. A Markov chain $\{\xi_t\}_{t=0}^{+\infty}$ modeling an EA is constructed by taking the EA's population space $\mathcal{X}$ as the chain's state space, i.e., $\xi_t \in \mathcal{X}$. Let $\mathcal{X}^* \subseteq \mathcal{X}$ denote the set of all optimal populations, each of which contains at least one optimal solution. The goal of the EA is to reach $\mathcal{X}^*$ from an initial population. Thus, the process of an EA seeking $\mathcal{X}^*$ can be analyzed by studying the corresponding Markov chain.

A Markov chain $\{\xi_t\}_{t=0}^{+\infty}$ ($\xi_t \in \mathcal{X}$) is a random process, where $\forall t \ge 0$, $\xi_{t+1}$ depends only on $\xi_t$. A Markov chain $\{\xi_t\}_{t=0}^{+\infty}$ is said to be homogeneous, if

$$\forall t \ge 0, \forall x, y \in \mathcal{X}: P(\xi_{t+1} = y \mid \xi_t = x) = P(\xi_1 = y \mid \xi_0 = x). \quad (1)$$

In this paper, we always denote $\mathcal{X}$ and $\mathcal{X}^*$ as the state space and the optimal state space of a Markov chain, respectively.

Given a Markov chain $\{\xi_t\}_{t=0}^{+\infty}$ and $\xi_{\hat{t}} = x$, we define the first hitting time (FHT) of the chain as a random variable $\tau$ such that $\tau = \min\{t \mid \xi_{\hat{t}+t} \in \mathcal{X}^*, t \ge 0\}$. That is, $\tau$ is the number of steps needed to reach the optimal state space for the first time starting from $\xi_{\hat{t}} = x$. The mathematical expectation of $\tau$, $E[[\tau \mid \xi_{\hat{t}} = x]] = \sum_{i=0}^{+\infty} i P(\tau = i)$, is called the expected first hitting time (EFHT) of this chain starting from $\xi_{\hat{t}} = x$. If $\xi_0$ is drawn from a distribution $\pi_0$, $E[[\tau \mid \xi_0 \sim \pi_0]] = \sum_{x \in \mathcal{X}} \pi_0(x) E[[\tau \mid \xi_0 = x]]$ is called the expected first hitting time of the Markov chain over the initial distribution $\pi_0$.

For the corresponding EA, the running time is the number of calls to the fitness function until meeting an optimal solution for the first time. Thus, the expected running time starting from $\xi_0$ and that starting from $\xi_0 \sim \pi_0$ are respectively equal to

$$N_1 + N_2 \cdot E[[\tau \mid \xi_0]] \quad \text{and} \quad N_1 + N_2 \cdot E[[\tau \mid \xi_0 \sim \pi_0]], \quad (2)$$

where $N_1$ and $N_2$ are the number of fitness evaluations for the initial population and each iteration, respectively. For example, for the (1+1)-EA, $N_1 = 1$ and $N_2 = 1$; for the (1+λ)-EA, $N_1 = 1$ and $N_2 = \lambda$.

Note that, when the expected running time of an EA on a problem is considered in this paper without specifying the initial population, it is the expected running time starting from a uniform initial distribution $\pi_u$, i.e., $N_1 + N_2 \cdot E[[\tau \mid \xi_0 \sim \pi_u]] = N_1 + N_2 \cdot \sum_{x \in \mathcal{X}} \frac{1}{|\mathcal{X}|} E[[\tau \mid \xi_0 = x]]$.

The following two lemmas on the EFHT of Markov chains [4] will be used in this paper.

Lemma 1
Given a Markov chain $\{\xi_t\}_{t=0}^{+\infty}$, we have
$$\forall x \in \mathcal{X}^*: E[[\tau \mid \xi_t = x]] = 0;$$
$$\forall x \notin \mathcal{X}^*: E[[\tau \mid \xi_t = x]] = 1 + \sum_{y \in \mathcal{X}} P(\xi_{t+1} = y \mid \xi_t = x) E[[\tau \mid \xi_{t+1} = y]].$$

Lemma 2
Given a homogeneous Markov chain $\{\xi_t\}_{t=0}^{+\infty}$, it holds that
$$\forall t_1, t_2 \ge 0, x \in \mathcal{X}: E[[\tau \mid \xi_{t_1} = x]] = E[[\tau \mid \xi_{t_2} = x]].$$
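For a small homogeneous chain, Lemmas 1 and 2 together turn the EFHT into the solution of a linear system: $E(x) = 0$ on optimal states and $E(x) = 1 + \sum_y P(x, y) E(y)$ elsewhere. The following Python sketch solves that system exactly with rational arithmetic. The dictionary encoding of the transition matrix and the function name are assumptions made for illustration only.

```python
from fractions import Fraction

def expected_first_hitting_times(P, optimal):
    """P maps each state to {next_state: probability}; `optimal` is a set.

    Solves E(x) = 1 + sum_y P(x, y) E(y) for non-optimal x by Gauss-Jordan
    elimination. Raises StopIteration if the system is singular, e.g. when
    the optimal set cannot be reached from some state.
    """
    states = [s for s in P if s not in optimal]
    idx = {s: i for i, s in enumerate(states)}
    k = len(states)
    # augmented matrix of (I - Q) E = 1, Q restricted to non-optimal states
    A = [[Fraction(0)] * (k + 1) for _ in range(k)]
    for s in states:
        i = idx[s]
        A[i][i] += 1
        A[i][k] = Fraction(1)
        for y, pr in P[s].items():
            if y not in optimal:
                A[i][idx[y]] -= Fraction(pr)
    for c in range(k):
        piv = next(r for r in range(c, k) if A[r][c] != 0)
        A[c], A[piv] = A[piv], A[c]
        inv = 1 / A[c][c]
        A[c] = [v * inv for v in A[c]]
        for r in range(k):
            if r != c and A[r][c] != 0:
                factor = A[r][c]
                A[r] = [vr - factor * vc for vr, vc in zip(A[r], A[c])]
    E = {s: Fraction(0) for s in optimal}
    E.update({s: A[idx[s]][k] for s in states})
    return E
```

As a toy example (not taken from the paper), consider three states where state 0 is optimal, state 1 moves to 0 with probability 1/2, and state 2 moves to 1 with probability 1/2, each staying put otherwise. The recursion gives $E(1) = 2$ and $E(2) = 4$, which is what the solver returns for `P = {0: {0: 1}, 1: {0: Fraction(1, 2), 1: Fraction(1, 2)}, 2: {1: Fraction(1, 2), 2: Fraction(1, 2)}}` and `optimal = {0}`.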
For analyzing the EFHT of Markov chains, drift analysis [7, 8] is a commonly used tool, which will also be used in this paper. To use drift analysis, one needs to construct a function $V(x)$ ($x \in \mathcal{X}$) to measure the distance of a state $x$ to the optimal state space $\mathcal{X}^*$. The distance function $V(x)$ satisfies that $V(x \in \mathcal{X}^*) = 0$ and $V(x \notin \mathcal{X}^*) > 0$. Then, by investigating the progress on the distance to $\mathcal{X}^*$ in each step, i.e., $E[[V(\xi_t) - V(\xi_{t+1}) \mid \xi_t]]$, an upper (lower) bound of the EFHT can be derived by dividing the initial distance by a lower (upper) bound of the progress.

Lemma 3 (Drift Analysis [7, 8])
Given a Markov chain $\{\xi_t\}_{t=0}^{+\infty}$ and a distance function $V(x)$, if it satisfies that for any $t \ge 0$ and any $\xi_t$ with $V(\xi_t) > 0$,
$$0 < c_l \le E[[V(\xi_t) - V(\xi_{t+1}) \mid \xi_t]] \le c_u,$$
then the EFHT of this chain satisfies that
$$V(\xi_0)/c_u \le E[[\tau \mid \xi_0]] \le V(\xi_0)/c_l,$$
where $c_l, c_u$ are constants.

2.4. Pseudo-Boolean Functions

The pseudo-Boolean function class in Definition 2 is a large function class which only requires the solution space to be $\{0,1\}^n$ and the objective space to be $\mathbb{R}$. Many well-known NP-hard problems (e.g., the vertex cover problem and the 0-1 knapsack problem) belong to this class. Diverse pseudo-Boolean problems with different structures and difficulties have been used for analyzing the running time of EAs, and then to disclose properties of EAs, e.g., [, 7, 2]. Note that we consider only maximization problems in this paper, since minimizing $f$ is equivalent to maximizing $-f$.

Definition 2 (Pseudo-Boolean Function)
A function in the pseudo-Boolean function class has the form $f: \{0,1\}^n \to \mathbb{R}$.

The $I_{hardest}$ (also called Trap) problem in Definition 3 is a special instance in this class, which is to maximize the number of 0 bits of a solution except for the global optimum $11\ldots1$ (briefly denoted as $1^n$). Its optimal function value is $2n$, and the function value for any non-optimal solution is not larger than 0. It has been widely used in the theoretical analysis of EAs, and the expected running time of the (1+1)-EA with mutation probability $\frac{1}{n}$ has been proved to be $\Theta(n^n)$ [2].
It has also been recognized as the hardest instance in the pseudo-Boolean function class with a unique global optimum for the (1+1)-EA [25].

Definition 3 ($I_{hardest}$ Problem)
The $I_{hardest}$ Problem of size $n$ is to find an $n$ bits binary string $x^*$ such that
$$x^* = \arg\max_{x \in \{0,1\}^n} \Big( f(x) = 3n \prod_{i=1}^{n} x_i - \sum_{i=1}^{n} x_i \Big),$$
where $x_i$ is the $i$-th bit of a solution $x \in \{0,1\}^n$.

The $I_{easiest}$ (also called OneMax) problem in Definition 4 is to maximize the number of 1 bits of a solution. The optimal solution is $1^n$, which has the maximal function value $n$. The running time of EAs has been well studied on this problem [7, 2, 27]. Particularly, the expected running time of the (1+1)-EA with mutation probability $\frac{1}{n}$ on it has been proved to be $\Theta(n \log n)$ [2]. It has also been recognized as the easiest instance in the pseudo-Boolean function class with a unique global optimum for the (1+1)-EA [25].

Definition 4 ($I_{easiest}$ Problem)
The $I_{easiest}$ Problem of size $n$ is to find an $n$ bits binary string $x^*$ such that
$$x^* = \arg\max_{x \in \{0,1\}^n} \Big( f(x) = \sum_{i=1}^{n} x_i \Big),$$
where $x_i$ is the $i$-th bit of a solution $x \in \{0,1\}^n$.

3. Noise is Not Always Bad

3.1. Empirical Evidence

It has been observed that noisy fitness evaluation can make an optimization harder for EAs, since it may make a bad solution have a better fitness, and then mislead the search direction of EAs. Droste [10] proved that the running time of the (1+1)-EA can increase from polynomial to exponential due to the presence of noise. However, when studying the running time of the (1+1)-EA solving the hardest case $I_{hardest}$ in the pseudo-Boolean function class, we have observed the opposite: noise can also make an optimization easier for EAs, which means that the presence of the noise decreases the running time of EAs for finding the optimal solution.

For the $I_{hardest}$ problem over $\{0,1\}^n$, there are $2^n$ possible solutions, which are denoted by their corresponding integer values $0, 1, \ldots, 2^n - 1$, respectively. Then, we estimate the expected running time of the (1+1)-EA maximizing $I_{hardest}$ when starting from every solution. For each initial solution, we repeat independent runs 1000 times, and then the average running time is recorded as an estimation of the expected running time (briefly called ERT). We run the (1+1)-EA without noise, with additive noise and with multiplicative noise, respectively. For the mutation probability of the (1+1)-EA, we use the common setting $p = \frac{1}{n}$. For additive noise, $\delta_1 = -n$ and $\delta_2 = n$, and for multiplicative noise, $\delta_1 = 0.1$ and $\delta_2 = 10$.
The results for $n = 3, 4, 5$ are plotted in Figure 1. We can observe that the curves for these two kinds of noise are always under the curve without noise, which shows that the $I_{hardest}$ problem becomes easier for the (1+1)-EA in a noisy environment. Note that the three curves meet at the last point, since the initial solution $2^n - 1$ is the optimal solution and the ERT = 1.
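The experiment above can be approximated with a short Monte Carlo simulation. The sketch below is an illustration under stated assumptions rather than the authors' code: it uses single evaluation of the parent, stops once the current solution is $1^n$, and the helper names (`trap`, `run_once`, `estimate_ert`, `additive`, `multiplicative`, `no_noise`), the seed and the `max_evals` guard are all hypothetical.

```python
import random

def trap(x):
    """I_hardest from Definition 3: 3n * prod(x_i) - sum(x_i)."""
    n = len(x)
    return 3 * n * int(all(x)) - sum(x)

def no_noise(v, rng):
    return v

def additive(d1, d2):
    return lambda v, rng: v + rng.uniform(d1, d2)

def multiplicative(d1, d2):
    return lambda v, rng: v * rng.uniform(d1, d2)

def run_once(start, noise, rng, max_evals=10**6):
    """Fitness evaluations used by a (1+1)-EA with p = 1/n until the
    current solution is 1^n (or the max_evals guard is hit)."""
    n = len(start)
    x = list(start)
    fx = noise(trap(x), rng)      # parent evaluated once, value reused
    evals = 1
    while not all(x) and evals < max_evals:
        y = [1 - b if rng.random() < 1.0 / n else b for b in x]
        fy = noise(trap(y), rng)
        evals += 1
        if fy >= fx:
            x, fx = y, fy
    return evals

def estimate_ert(start, noise, runs=1000, seed=0):
    """Average number of evaluations over independent runs."""
    rng = random.Random(seed)
    return sum(run_once(start, noise, rng) for _ in range(runs)) / runs
```

For $n = 3$, calling `estimate_ert([0, 0, 0], no_noise)`, `estimate_ert([0, 0, 0], additive(-3, 3))` and `estimate_ert([0, 0, 0], multiplicative(0.1, 10))` gives estimates that can be compared in the way the curves of Figure 1 are compared. Starting from `[1, 1, 1]` returns exactly 1, matching the remark that the curves meet at the last point.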
[Figure 1: Estimated ERT comparison for (1+1)-EA solving the $I_{hardest}$ problem with or without noise. Panels: (a) n = 3, (b) n = 4, (c) n = 5. Horizontal axis: initial solution; vertical axis: estimated ERT; curves: without noise, additive, multiplicative.]

3.2. A Sufficient Condition

In this section, by comparing the expected running time of EAs with and without noise, we derive a sufficient condition under which the noise will make an optimization easier for EAs.

Most practical EAs employ time-invariant operators, thus we can model an EA without noise by a homogeneous Markov chain. For an EA with noise, since noise may change over time, we can only model it by a general (not necessarily homogeneous) Markov chain. Note that the two EAs with and without noise differ only in whether the fitness evaluation is disturbed by noise, thus they must have the same values of $N_1$ and $N_2$ in their running time Eq.2. Then, comparing their expected running time is equivalent to comparing the EFHT of their corresponding Markov chains.

We first define a partition of the state space of a homogeneous Markov chain based on the EFHT, and then define a jumping probability of a Markov chain from one state to one state space in one step. It is easy to see that $\mathcal{X}_0$ in Definition 5 is just $\mathcal{X}^*$, since $E[[\tau \mid \xi_0 \in \mathcal{X}^*]] = 0$.

Definition 5 (EFHT-Partition)
For a homogeneous Markov chain $\{\xi_t\}_{t=0}^{+\infty}$, the EFHT-Partition is a partition of $\mathcal{X}$ into non-empty subspaces $\{\mathcal{X}_0, \mathcal{X}_1, \ldots, \mathcal{X}_m\}$ such that
(1) $\forall x, y \in \mathcal{X}_i$, $E[[\tau \mid \xi_0 = x]] = E[[\tau \mid \xi_0 = y]]$;
(2) $E[[\tau \mid \xi_0 \in \mathcal{X}_0]] < E[[\tau \mid \xi_0 \in \mathcal{X}_1]] < \ldots < E[[\tau \mid \xi_0 \in \mathcal{X}_m]]$.

Definition 6
For a Markov chain $\{\xi_t\}_{t=0}^{+\infty}$, $P_\xi^t(x, \mathcal{X}') = \sum_{y \in \mathcal{X}'} P(\xi_{t+1} = y \mid \xi_t = x)$ is the probability of jumping from state $x$ to state space $\mathcal{X}' \subseteq \mathcal{X}$ in one step at time $t$.
Theorem 1
Given an EA $\mathcal{A}$ and a problem $f$, let a Markov chain $\{\xi_t\}_{t=0}^{+\infty}$ and a homogeneous Markov chain $\{\xi'_t\}_{t=0}^{+\infty}$ model $\mathcal{A}$ running on $f$ with noise and without noise respectively, and denote $\{\mathcal{X}_0, \mathcal{X}_1, \ldots, \mathcal{X}_m\}$ as the EFHT-Partition of $\{\xi'_t\}_{t=0}^{+\infty}$. If for all $t \ge 0$, $x \in \mathcal{X} - \mathcal{X}_0$, and for all integers $i \in [0, m-1]$,
$$\sum_{j=0}^{i} P_\xi^t(x, \mathcal{X}_j) \ge \sum_{j=0}^{i} P_{\xi'}^t(x, \mathcal{X}_j), \quad (3)$$
then noise makes $f$ easier for $\mathcal{A}$, i.e., for all $x \in \mathcal{X}$, $E[[\tau \mid \xi_0 = x]] \le E[[\tau' \mid \xi'_0 = x]]$.

The condition of this theorem (i.e., Eq.3) intuitively means that the presence of noise leads to a larger probability of jumping into good states (i.e., $\mathcal{X}_j$ with small $j$ values), starting from which the EA needs less time for finding the optimal solution. For the proof, we need the following lemma, which is proved in the appendix.

Lemma 4
Let $m$ ($m \ge 1$) be an integer. If it satisfies that
(1) $\forall 0 \le i \le m$, $P_i, Q_i \ge 0$, and $\sum_{i=0}^{m} P_i = \sum_{i=0}^{m} Q_i = 1$;
(2) $0 \le E_0 < E_1 < \ldots < E_m$;
(3) $\forall 0 \le k \le m-1$, $\sum_{i=0}^{k} P_i \le \sum_{i=0}^{k} Q_i$,
then it holds that $\sum_{i=0}^{m} P_i E_i \ge \sum_{i=0}^{m} Q_i E_i$.

Proof of Theorem 1. We use Lemma 3 to derive a bound on $E[[\tau \mid \xi_0]]$, based on which this theorem holds.

For using Lemma 3 to analyze $E[[\tau \mid \xi_0]]$, we first construct a distance function $V(x)$ as
$$\forall x \in \mathcal{X}, V(x) = E[[\tau' \mid \xi'_0 = x]], \quad (4)$$
which satisfies that $V(x \in \mathcal{X}^*) = 0$ and $V(x \notin \mathcal{X}^*) > 0$ by Lemma 1.

Then, we investigate $E[[V(\xi_t) - V(\xi_{t+1}) \mid \xi_t = x]]$ for any $x$ with $V(x) > 0$ (i.e., $x \notin \mathcal{X}^*$).
$$E[[V(\xi_t) - V(\xi_{t+1}) \mid \xi_t = x]] = V(x) - E[[V(\xi_{t+1}) \mid \xi_t = x]]$$
$$= V(x) - \sum_{y \in \mathcal{X}} P(\xi_{t+1} = y \mid \xi_t = x) V(y)$$
$$= E[[\tau' \mid \xi'_0 = x]] - \sum_{y \in \mathcal{X}} P(\xi_{t+1} = y \mid \xi_t = x) E[[\tau' \mid \xi'_0 = y]] \quad \text{(by Eq.4)}$$
$$= 1 + \sum_{y \in \mathcal{X}} P(\xi'_1 = y \mid \xi'_0 = x) E[[\tau' \mid \xi'_1 = y]] - \sum_{y \in \mathcal{X}} P(\xi_{t+1} = y \mid \xi_t = x) E[[\tau' \mid \xi'_0 = y]] \quad \text{(by Lemma 1)}$$
$$= 1 + \sum_{y \in \mathcal{X}} P(\xi'_{t+1} = y \mid \xi'_t = x) E[[\tau' \mid \xi'_0 = y]] - \sum_{y \in \mathcal{X}} P(\xi_{t+1} = y \mid \xi_t = x) E[[\tau' \mid \xi'_0 = y]]$$
(by Eq.1 and Lemma 2, since $\{\xi'_t\}_{t=0}^{+\infty}$ is homogeneous)
$$= 1 + \sum_{j=0}^{m} \big( P_{\xi'}^t(x, \mathcal{X}_j) - P_\xi^t(x, \mathcal{X}_j) \big) E[[\tau' \mid \xi'_0 \in \mathcal{X}_j]]. \quad \text{(by Definitions 5 and 6)}$$

Since $\sum_{j=0}^{m} P_{\xi'}^t(x, \mathcal{X}_j) = \sum_{j=0}^{m} P_\xi^t(x, \mathcal{X}_j) = 1$, $E[[\tau' \mid \xi'_0 \in \mathcal{X}_j]]$ increases with $j$ and Eq.3 holds, by Lemma 4 we have
$$\sum_{j=0}^{m} P_{\xi'}^t(x, \mathcal{X}_j) E[[\tau' \mid \xi'_0 \in \mathcal{X}_j]] \ge \sum_{j=0}^{m} P_\xi^t(x, \mathcal{X}_j) E[[\tau' \mid \xi'_0 \in \mathcal{X}_j]].$$
Thus, we have, for all $t \ge 0$ and all $x \notin \mathcal{X}^*$,
$$E[[V(\xi_t) - V(\xi_{t+1}) \mid \xi_t = x]] \ge 1.$$
Thus, by Lemma 3, we get for all $x \in \mathcal{X}$,
$$E[[\tau \mid \xi_0 = x]] \le V(x) = E[[\tau' \mid \xi'_0 = x]], \quad \text{(the equality is by Eq.4)}$$
which implies that noise leads to less time for finding the optimal solution, i.e., noise makes optimization easier.

We prove below that the experimental example satisfies this sufficient condition. We consider the (1+λ)-EA, which covers the (1+1)-EA and is much more general. Let $\{\xi_t\}_{t=0}^{+\infty}$ and $\{\xi'_t\}_{t=0}^{+\infty}$ model the (1+λ)-EA with and without noise for maximizing the $I_{hardest}$ problem, respectively. The $I_{hardest}$ problem is to maximize the number of 0 bits except for the optimal solution $1^n$. It is not hard to see that the EFHT $E[[\tau' \mid \xi'_0 = x]]$ only depends on $|x|_0$ (i.e., the number of 0 bits). We denote $E_1(j)$ as $E[[\tau' \mid \xi'_0 = x]]$ with $|x|_0 = j$. The order of $E_1(j)$ is shown in Lemma 5, the proof of which is in the Appendix.

Lemma 5
For any mutation probability $0 < p < 0.5$, it holds that $E_1(0) < E_1(1) < E_1(2) < \ldots < E_1(n)$.

Theorem 2
Either additive noise with $\delta_2 - \delta_1 \le 2n$ or multiplicative noise with $\delta_2 > \delta_1 > 0$ makes the $I_{hardest}$ problem easier for the (1+λ)-EA with mutation probability less than 0.5.

Proof. The proof is by showing that the condition of Theorem 1 (i.e., Eq.3) holds here. By Lemma 5, the EFHT-Partition of $\{\xi'_t\}_{t=0}^{+\infty}$ is $\mathcal{X}_i = \{x \in \{0,1\}^n \mid |x|_0 = i\}$ ($0 \le i \le n$), and $m$ in Theorem 1 equals $n$ here. Let $f^N(x)$ and $f(x)$ denote the noisy and true fitness, respectively.

For any $x \in \mathcal{X}_k$ ($k \ge 1$), we denote $P(0)$ and $P(j)$ ($1 \le j \le n$) as the probability that, for the λ offspring solutions $x_1, \ldots, x_\lambda$ generated by bit-wise mutation on $x$, $\min\{|x_1|_0, \ldots, |x_\lambda|_0\} = 0$ (i.e., the least number of 0 bits is 0), and $\min\{|x_1|_0, \ldots, |x_\lambda|_0\} > 0 \wedge \max\{|x_1|_0, \ldots, |x_\lambda|_0\} = j$ (i.e., the largest number of 0 bits is $j$ while the least number of 0 bits is larger than 0), respectively.
Then, we analyze the one-step transition probabilities from $x$ for both $\{\xi'_t\}_{t=0}^{+\infty}$ (i.e., without noise) and $\{\xi_t\}_{t=0}^{+\infty}$ (i.e., with noise).

For $\{\xi'_t\}_{t=0}^{+\infty}$, because only the optimal solution or the solution with the largest number of 0 bits among the parent solution and the λ offspring solutions will be accepted, we have
$$P_{\xi'}^t(x, \mathcal{X}_0) = P(0); \quad \forall 1 \le j \le k-1: P_{\xi'}^t(x, \mathcal{X}_j) = 0;$$
$$P_{\xi'}^t(x, \mathcal{X}_k) = \sum_{j=1}^{k} P(j); \quad \forall k+1 \le j \le n: P_{\xi'}^t(x, \mathcal{X}_j) = P(j). \quad (5)$$

For $\{\xi_t\}_{t=0}^{+\infty}$ with additive noise, since $\delta_2 - \delta_1 \le 2n$, we have
$$f^N(1^n) \ge f(1^n) + \delta_1 \ge 2n + \delta_2 - 2n = \delta_2; \quad \forall y \ne 1^n, f^N(y) \le f(y) + \delta_2 \le \delta_2.$$
For multiplicative noise, since $\delta_2 > \delta_1 > 0$, we have $f^N(1^n) > 0$ and $\forall y \ne 1^n, f^N(y) \le 0$. Thus, for these two kinds of noise, we have $\forall y \ne 1^n, f^N(1^n) \ge f^N(y)$, which implies that if the optimal solution is generated, it will always be accepted. Thus, noting that $\mathcal{X}_0 = \{1^n\}$, we have
$$P_\xi^t(x, \mathcal{X}_0) = P(0). \quad (6)$$
Because the fitness evaluation is disturbed by noise, the solution with the largest number of 0 bits among the parent solution and the λ offspring solutions may be rejected. Thus, we have
$$\forall k+1 \le i \le n: \sum_{j=i}^{n} P_\xi^t(x, \mathcal{X}_j) \le \sum_{j=i}^{n} P(j). \quad (7)$$
By combining Eq.5, Eq.6 and Eq.7, we have
$$\forall 1 \le i \le n: \sum_{j=i}^{n} P_\xi^t(x, \mathcal{X}_j) \le \sum_{j=i}^{n} P_{\xi'}^t(x, \mathcal{X}_j).$$
Since $\sum_{j=0}^{n} P_\xi^t(x, \mathcal{X}_j) = \sum_{j=0}^{n} P_{\xi'}^t(x, \mathcal{X}_j) = 1$, the above inequality is equivalent to
$$\forall 0 \le i \le n-1: \sum_{j=0}^{i} P_\xi^t(x, \mathcal{X}_j) \ge \sum_{j=0}^{i} P_{\xi'}^t(x, \mathcal{X}_j),$$
which implies that the condition Eq.3 of Theorem 1 holds. Thus, the $I_{hardest}$ problem becomes easier for the (1+λ)-EA under these two kinds of noise.

Theorem 1 gives a sufficient condition under which noise makes optimization easier. If the inequality direction in its condition Eq.3 is reversed, which means that noise leads to a smaller probability of jumping to good states, it becomes a sufficient condition under which noise makes optimization harder. We state this in Theorem 3, the proof of which is the same as that of Theorem 1 except that the inequality direction is reversed.
Theorem 3
Given an EA $\mathcal{A}$ and a problem $f$, let a Markov chain $\{\xi_t\}_{t=0}^{+\infty}$ and a homogeneous Markov chain $\{\xi'_t\}_{t=0}^{+\infty}$ model $\mathcal{A}$ running on $f$ with noise and without noise respectively, and denote $\{\mathcal{X}_0, \mathcal{X}_1, \ldots, \mathcal{X}_m\}$ as the EFHT-Partition of $\{\xi'_t\}_{t=0}^{+\infty}$. If for all $t \ge 0$, $x \in \mathcal{X} - \mathcal{X}_0$, and for all integers $i \in [0, m-1]$,
$$\sum_{j=0}^{i} P_\xi^t(x, \mathcal{X}_j) \le \sum_{j=0}^{i} P_{\xi'}^t(x, \mathcal{X}_j), \quad (8)$$
then noise makes $f$ harder for $\mathcal{A}$, i.e., for all $x \in \mathcal{X}$, $E[[\tau \mid \xi_0 = x]] \ge E[[\tau' \mid \xi'_0 = x]]$.

Then we apply this condition to the case where the (1+λ)-EA is used for optimizing the easiest case $I_{easiest}$ in the pseudo-Boolean function class. Let $\{\xi_t\}_{t=0}^{+\infty}$ and $\{\xi'_t\}_{t=0}^{+\infty}$ model the (1+λ)-EA with and without noise for maximizing the $I_{easiest}$ problem, respectively. It is not hard to see that the EFHT $E[[\tau' \mid \xi'_0 = x]]$ only depends on $|x|_0$. We denote $E_2(j)$ as $E[[\tau' \mid \xi'_0 = x]]$ with $|x|_0 = j$. The order of $E_2(j)$ is shown in Lemma 6, the proof of which is in the Appendix.

Lemma 6
For any mutation probability $0 < p < 0.5$, it holds that $E_2(0) < E_2(1) < E_2(2) < \ldots < E_2(n)$.

Theorem 4
Any noise makes the $I_{easiest}$ problem harder for the (1+λ)-EA with mutation probability less than 0.5.

Proof. We use Theorem 3 to prove it. By Lemma 6, the EFHT-Partition of $\{\xi'_t\}_{t=0}^{+\infty}$ is $\mathcal{X}_i = \{x \in \{0,1\}^n \mid |x|_0 = i\}$ ($0 \le i \le n$).

For any non-optimal solution $x \in \mathcal{X}_k$ ($k > 0$), we denote $P(j)$ ($0 \le j \le n$) as the probability that the least number of 0 bits among the λ offspring solutions generated by bit-wise mutation on $x$ is $j$. For $\{\xi'_t\}_{t=0}^{+\infty}$, because the solution with the least number of 0 bits among the parent solution and the λ offspring solutions will be accepted, we have
$$\forall 0 \le j \le k-1: P_{\xi'}^t(x, \mathcal{X}_j) = P(j); \quad P_{\xi'}^t(x, \mathcal{X}_k) = \sum_{j=k}^{n} P(j); \quad \forall k+1 \le j \le n: P_{\xi'}^t(x, \mathcal{X}_j) = 0.$$
For $\{\xi_t\}_{t=0}^{+\infty}$, because the fitness evaluation is disturbed by noise, the solution with the least number of 0 bits among the parent solution and the λ offspring solutions may be rejected. Thus, we have
$$\forall 0 \le i \le k-1: \sum_{j=0}^{i} P_\xi^t(x, \mathcal{X}_j) \le \sum_{j=0}^{i} P(j).$$
Then, we get
$$\forall 0 \le i \le n-1: \sum_{j=0}^{i} P_\xi^t(x, \mathcal{X}_j) \le \sum_{j=0}^{i} P_{\xi'}^t(x, \mathcal{X}_j).$$
This implies that the condition Eq.8 of Theorem 3 holds. Thus, by Theorem 3, noise makes the $I_{easiest}$ problem harder for the (1+λ)-EA.
3.3. Discussion

We have shown that noise makes the $I_{hardest}$ and $I_{easiest}$ problems easier and harder, respectively, for the (1+λ)-EA. These two problems are known to be the hardest and the easiest instance respectively in the pseudo-Boolean function class with a unique global optimum for the (1+1)-EA [25]. We can intuitively interpret the discovered effect of noise for EAs on these two problems. For the $I_{hardest}$ problem, the EA searches along the deceptive direction, while noise can add some randomness that gives the EA some possibility to run along the right direction; for the $I_{easiest}$ problem, the EA searches along the right direction, while noise can only harm the optimization process. We thus hypothesize that we need to take care of the noise only when the optimization problem is moderately or less complex.

To further verify our hypothesis, we employ the $Jump_{m,n}$ problem, which is a problem with adjustable difficulty and can be configured as $I_{easiest}$ when $m = 1$ and $I_{hardest}$ when $m = n$.

Definition 7 ($Jump_{m,n}$ Problem)
The $Jump_{m,n}$ Problem of size $n$ with $1 \le m \le n$ is to find an $n$ bits binary string $x^*$ such that
$$x^* = \arg\max_{x \in \{0,1\}^n} \Big( Jump_{m,n}(x) = \begin{cases} m + \sum_{i=1}^{n} x_i & \text{if } \sum_{i=1}^{n} x_i \le n - m \text{ or } \sum_{i=1}^{n} x_i = n \\ n - \sum_{i=1}^{n} x_i & \text{otherwise} \end{cases} \Big),$$
where $x_i$ is the $i$-th bit of a solution $x \in \{0,1\}^n$.

We test the (1+1)-EA with mutation probability $\frac{1}{n}$ on $Jump_{m,n}$. It is known that the expected running time of the (1+1)-EA on $Jump_{m,n}$ is $\Theta(n^m + n \log n)$ [2], which implies that $Jump_{m,n}$ with a larger value of $m$ is harder. In the experiment, we set $n = 5$, and for noise, we use the additive noise with $\delta_1 = -0.5n \wedge \delta_2 = 0.5n$, the multiplicative noise with $\delta_1 = 1 \wedge \delta_2 = 2$, and the one-bit noise with $p_n = 0.5$, respectively. We record the expected running time gap starting from each initial solution
$$gap = (E[[\tau]] - E[[\tau']]) / E[[\tau']],$$
where $E[[\tau]]$ and $E[[\tau']]$ denote the expected running time of the EA optimizing the problem with and without noise, respectively. A larger gap means that the noise has a more negative effect, while a smaller gap means that the noise has a less negative effect.

For each initial solution and each configuration of noise, we run the (1+1)-EA 1000 times, estimate the expected running time by the average running time, and thus estimate the gap. The results are plotted in Figure 2. We can observe that the gaps for larger $m$ are lower (i.e., the negative effect of noise decreases as the problem hardness increases), and the gaps for large $m$ tend to be 0 or negative (i.e., noise can have no effect or a positive effect when the optimization is quite hard). These empirical observations support our hypothesis that the noise should be handled carefully only when the optimization is moderately or less complex.
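The $Jump_{m,n}$ fitness of Definition 7 and the gap measure can be written down directly. The following Python sketch assumes a solution is a list of 0/1 integers; the function names are illustrative only.

```python
def jump(m, x):
    """Jump_{m,n}(x) for a bit list x of length n, with 1 <= m <= n."""
    n = len(x)
    ones = sum(x)
    if ones <= n - m or ones == n:
        return m + ones
    return n - ones

def gap(ert_with_noise, ert_without_noise):
    """Relative running time gap (E[tau] - E[tau']) / E[tau']."""
    return (ert_with_noise - ert_without_noise) / ert_without_noise
```

With $m = 1$ the function equals the number of 1 bits plus one, so it orders solutions exactly as $I_{easiest}$ does. With $m = n$ every non-optimal solution is scored by its number of 0 bits and $1^n$ scores $2n$, which is the same ordering as $I_{hardest}$.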
[Figure 2: Estimated ERT gap for (1+1)-EA solving the $Jump_{m,5}$ problem with or without noise. Panels: (a) additive noise, (b) multiplicative noise, (c) one-bit noise. Horizontal axis: initial solution; vertical axis: estimated gap; curves: m = 1, 2, 3, 4, 5.]

4. On the Usefulness of Noise Handling Strategies

4.1. Re-evaluation

There are naturally two fitness evaluation options for EAs [2, 2, 6, 9]:

single-evaluation: we evaluate a solution once, and use the evaluated fitness for this solution in the future.

re-evaluation: every time we access the fitness of a solution, we obtain it by a new evaluation.

For example, for the (1+1)-EA in Algorithm 1, if using re-evaluation, both $f(x')$ and $f(x)$ will be calculated in each iteration, so $f(x)$ is recalculated every time; if using single-evaluation, only $f(x')$ will be calculated and the previously obtained fitness $f(x)$ will be reused. Intuitively, re-evaluation can smooth noise and thus could be better for noisy optimization, but it also increases the fitness evaluation cost and thus increases the running time. Its usefulness has not yet been clear. Note that the analysis in the previous section assumes single-evaluation.

In this section, we take the $I_{easiest}$ problem, where noise has been proved to have a strong negative effect in the previous section, as the representative problem, and compare these two options for the (1+1)-EA with mutation probability $\frac{1}{n}$ solving this problem under one-bit noise, to show whether re-evaluation is useful. Note that for one-bit noise, $p_n$ controls the noise level, that is, noise becomes stronger as $p_n$ gets larger, and it is also the variable of the PNT.

Theorem 5
The PNT of the (1+1)-EA using single-evaluation with mutation probability $\frac{1}{n}$ on the $I_{easiest}$ problem is lower bounded by $1 - 1/\Omega(poly(n))$ and upper bounded by $1 - 1/O(2^n poly(n))$, where $poly(n)$ indicates any polynomial of $n$, with respect to one-bit noise.

The theorem is straightforwardly derived from the following lemma.
16 Lemma 7 For (+)-EA usig sigle-evaluatio with mutatio probability o I easiest problem uder oe-bit oise, the expected ruig time is O( 2 + /( p )) ad Ω(p /(2 ( p ))). Proof. Let L deote the oisy fitess value f N (x) of the curret solutio x. Because (+)-EA does ot accept a solutio with a smaller fitess (i.e., the 4th step of Algorithm ) ad it does t re-evaluate the fitess of the curret solutio x, L (0 L ) will ever decrease. We first aalyze the expected steps util L icreases whe startig from L = i (deoted by E[[i]]), ad the sum up them to get a upper boud E[[i]] for the expected steps util L reaches the maximum value. For E[[i]], we aalyze the probability P that L icreases i two steps whe L = i, the E[[i]] = 2 P. Note that, oe-bit oise ca make L be x, x or x +, where x = x i is the umber of bits. Whe aalyzig the oisy fitess f N (x ) of the offsprig x i each step, we eed to first cosider bit-wise mutatio o x ad the oe radom bit flip for oise. Whe 0 < L <, x = L, L or L +. () For x = L, P L+ ( )( ) p L + L+ ( )( ) ( p ) L ( )( ) ( p ), sice it is sufficiet to flip oe 0 bit for mutatio ad oe 0 bit for oise i the first step, or flip oe 0 bit for mutatio ad o bit for oise i the first step ad flip oe 0 bit for mutatio ad o bit for oise i the secod step. (2) For x = L, P ( ) p L + L ( ) ( p ), sice it is sufficiet to flip o bit for mutatio ad oe 0 bit for oise, or flip oe 0 bit for mutatio ad o bit for oise i the first step. (3) For x = L +, P ( ) L ( p + p ), sice it is sufficiet to flip o bit for mutatio ad o bit or oe 0 bit for oise i the first step. Thus, for these three cases, we have P p ( L L )( ) + ( )2( ) ( p ) 2 L (p + ( p ) 2 ( L)( L ) 2 3( L)( L ) ) e 2 2 4e 2 2, where the is by ( ) e ad the 2 is by 0 p. L Whe L = 0, x = 0 or. By cosiderig case (2) ad (3), we ca get the same lower boud for P. Whe L = ad the optimal solutio has ot bee foud, x = 2 or. By cosiderig case () ad (2), we ca get P 3/(2e 2 2 ). 
Based on the above analysis, we can get that the expected steps until $L = n$ is at most
$$\sum_{i=0}^{n-1} E[[i]] \le 2\left(\sum_{L=0}^{n-2}\frac{4e^2n^2}{3(n-L)(n-L-1)} + \frac{2e^2n^2}{3}\right),$$
i.e., $O(n^2)$.
When $L = n$, $|x|_1 = n-1$ or $n$ (i.e., the optimal solution has been found). If $|x|_1 = n-1$, the optimal solution will be generated and accepted in one step with probability $\frac{1}{n}\left(1-\frac{1}{n}\right)^{n-1}(1-p_n) \ge \frac{1-p_n}{en}$, because it needs to flip the unique 0 bit for mutation and no bit for noise. This implies that the expected steps for finding the optimal solution is at most $\frac{en}{1-p_n}$. Thus, we can get the upper bound $O(n^2 + \frac{n}{1-p_n})$ for the expected running time of the whole process.

Then, we are to analyze the lower bound. Assume that the initial solution $x_{init}$ has $n-1$ 1 bits, i.e., $|x_{init}|_1 = n-1$. If the fitness of $x_{init}$ is evaluated as $n$, which happens with probability $p_n \cdot \frac{1}{n}$, then before finding the optimal solution, the solution will always have $n-1$ 1 bits and its fitness will always be $n$. From the above analysis, we know that in such a situation, the probability of generating and accepting the optimal solution in one step is $\frac{1}{n}\left(1-\frac{1}{n}\right)^{n-1}(1-p_n) \le \frac{1}{n}(1-p_n)$. Thus, the expected running time for finding the optimal solution when starting from $|x_{init}|_1 = n-1$ is at least $\frac{p_n}{n} \cdot \frac{n}{1-p_n} = \frac{p_n}{1-p_n}$. Because the initial solution is uniformly distributed over $\{0,1\}^n$, the probability that the algorithm starts from $|x_{init}|_1 = n-1$ is $\frac{n}{2^n}$. Thus, we can get the lower bound $\Omega(\frac{n p_n}{2^n(1-p_n)})$ for the expected running time of the whole process.

Theorem 6. The PNT of (1+1)-EA using re-evaluation with mutation probability $\frac{1}{n}$ on the $I_{easiest}$ problem is $\Theta(\frac{\log(n)}{n})$, with respect to one-bit noise.

Theorem 6 follows directly from the following lemma.

Lemma 8 ([10]). For (1+1)-EA using re-evaluation with mutation probability $\frac{1}{n}$ on the $I_{easiest}$ problem under one-bit noise, the expected running time is polynomial when $p_n \in O(\log(n)/n)$, and the running time is polynomial with super-polynomially small probability when $p_n \in \omega(\log(n)/n)$.

4.2. Threshold Selection

During the process of evolutionary optimization, most of the improvements in one generation are small. When using re-evaluation, due to noisy fitness evaluation, a considerable portion of these improvements are not real: a worse solution appears to have a better fitness and then survives to replace the truly better solution, which appears to have a worse fitness.
This may mislead the search direction of EAs, and then reduce the efficiency of EAs or make EAs get trapped in a local optimal solution, as observed in Section 4.1. To deal with this problem, a selection strategy for EAs handling noise was proposed [23].

threshold selection: an offspring solution will be accepted only if its fitness is larger than that of the parent solution by at least a predefined threshold $\tau \ge 0$.
For example, for (1+1)-EA with threshold selection as in Algorithm 3, the 4th step changes to "if $f(x') \ge f(x) + \tau$" rather than "if $f(x') \ge f(x)$" in Algorithm 1. Such a strategy can reduce the risk of accepting a bad solution due to noise. Although the good local performance (i.e., the progress of one step) of EAs with threshold selection has been shown on some problems [23, 6, 5], its usefulness for the global performance (i.e., the running time until finding the optimal solution) of EAs under noise is not yet clear.

Algorithm 3 ((1+1)-EA with threshold selection). Given a pseudo-Boolean function $f$ with solution length $n$, and a predefined threshold $\tau \ge 0$, it consists of the following steps:
1. $x$ := randomly selected from $\{0,1\}^n$.
2. Repeat until the termination condition is met
3.    $x'$ := flip each bit of $x$ with probability $p$.
4.    if $f(x') \ge f(x) + \tau$
5.       $x := x'$.
where $p \in (0, 0.5)$ is the mutation probability.

In this section, we compare the running time of (1+1)-EA with and without threshold selection solving the $I_{easiest}$ problem under one-bit noise to show whether threshold selection is useful. Note that the analysis here assumes re-evaluation.

Algorithm 4 shows a random walk on a graph. Lemma 9 gives an upper bound on the expected steps for a random walk to visit each vertex of a graph at least once, which will be used in the following analysis.

Algorithm 4 (Random Walk). Given an undirected connected graph $G = (V, E)$ with vertex set $V$ and edge set $E$, it consists of the following steps:
1. start at a vertex $v \in V$.
2. Repeat until the termination condition is met
3.    choose a neighbor $u$ of $v$ in $G$ uniformly at random.
4.    set $v := u$.

Lemma 9 ([1]). Given an undirected connected graph $G = (V, E)$, the expected cover time of a random walk on $G$ is upper bounded by $2|E|(|V|-1)$, where the cover time of a random walk on $G$ is the number of steps until each vertex $v \in V$ has been visited at least once.

Theorem 7. The PNT of (1+1)-EA using re-evaluation with threshold selection $\tau = 1$ and mutation probability $\frac{1}{n}$ on the $I_{easiest}$ problem is not less than $\frac{1}{2e}$, with respect to one-bit noise.
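The cover-time bound of Lemma 9 is applied later to a path graph, so a quick empirical look at that special case may help. The sketch below simulates Algorithm 4 on the path {0, 1, ..., n} and compares the average cover time with $2|E|(|V|-1) = 2n^2$. The function names, the number of trials and the seed are illustrative choices, and the simulation is only a sanity check of the bound on one graph, not a proof.

```python
import random

def cover_time_on_path(num_vertices, start, rng):
    """Steps of a simple random walk until all vertices 0..num_vertices-1 of a path are visited."""
    v = start
    visited = {v}
    steps = 0
    while len(visited) < num_vertices:
        if v == 0:                       # left end: the only neighbor is 1
            v = 1
        elif v == num_vertices - 1:      # right end: the only neighbor is num_vertices - 2
            v = num_vertices - 2
        else:                            # interior vertex: two neighbors, chosen uniformly
            v += rng.choice((-1, 1))
        visited.add(v)
        steps += 1
    return steps

def mean_cover_time(num_vertices, start, trials=2000, seed=0):
    """Average cover time over independent walks (requires num_vertices >= 2)."""
    rng = random.Random(seed)
    total = sum(cover_time_on_path(num_vertices, start, rng) for _ in range(trials))
    return total / trials

if __name__ == "__main__":
    n = 10                                # path {0, ..., n}: n + 1 vertices, n edges
    bound = 2 * n * ((n + 1) - 1)         # Lemma 9: 2 |E| (|V| - 1) = 2 n^2
    print("estimated cover time from 0:", mean_cover_time(n + 1, start=0))
    print("Lemma 9 bound:", bound)
```

On a path the bound is loose by a constant factor, which is harmless for the asymptotic running-time statements that use it.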
Theorem 7 can be directly derived from the following lemma.

Lemma 10. For (1+1)-EA using re-evaluation with threshold selection $\tau = 1$ and mutation probability $\frac{1}{n}$ on the $I_{easiest}$ problem under one-bit noise, the expected running time is $O(n^3)$ when $p_n \le \frac{1}{2e}$.

Proof. We denote the number of 1 bits of the current solution $x$ by $L$ ($0 \le L \le n$). Let $P_d$ denote the probability that the offspring solution $x'$ generated by bit-wise mutation on $x$ has $L+d$ ($-L \le d \le n-L$) 1 bits, and let $P'_d$ denote the probability that the next solution after bit-wise mutation and selection has $L+d$ 1 bits. Then, we analyze $P'_d$. We consider $0 \le L \le n-1$. Note that one-bit noise can change the true fitness of a solution by at most 1, i.e., $|f^N(x) - f(x)| \le 1$.

(1) When $d \le -2$, $f^N(x') \le L+d+1 \le L-1 \le f^N(x)$. Because an offspring solution will be accepted only if $f^N(x') \ge f^N(x) + 1$, the offspring solution $x'$ will be discarded in this case, which implies that $\forall d \le -2: P'_d = 0$.

(2) When $d = -1$, the offspring solution $x'$ will be accepted only if $f^N(x') = L \wedge f^N(x) = L-1$, the probability of which is $p_n\frac{n-L+1}{n} \cdot p_n\frac{L}{n}$, since it needs to flip one 0 bit of $x'$ and flip one 1 bit of $x$. Thus, $P'_{-1} = P_{-1}\left(p_n\frac{L}{n} \cdot p_n\frac{n-L+1}{n}\right)$.

(3) When $d = 1$: if $f^N(x) = L-1$, the probability of which is $p_n\frac{L}{n}$, the offspring solution $x'$ will be accepted, since $f^N(x') \ge L+1-1 = L > f^N(x)$; if $f^N(x) = L \wedge f^N(x') \ge L+1$, the probability of which is $(1-p_n)\left(1-p_n+p_n\frac{n-L-1}{n}\right)$, $x'$ will be accepted; if $f^N(x) = L+1 \wedge f^N(x') = L+2$, the probability of which is $p_n\frac{n-L}{n} \cdot p_n\frac{n-L-1}{n}$, $x'$ will be accepted; otherwise, $x'$ will be discarded. Thus,
$$P'_1 = P_1\left(p_n\frac{L}{n} + (1-p_n)\left(1-p_n+p_n\frac{n-L-1}{n}\right) + p_n\frac{n-L}{n} \cdot p_n\frac{n-L-1}{n}\right).$$

(4) When $d \ge 2$, it is easy to see that $P'_d > 0$. Because we are to get the upper bound of the expected running time for finding the optimal solution for the first time, we pessimistically assume that $\forall d \ge 2: P'_d = 0$.

Then, we compare $P'_1$ with $P'_{-1}$.
$$P'_1 \ge P_1 \cdot p_n\frac{L}{n} \ge \frac{n-L}{n}\left(1-\frac{1}{n}\right)^{n-1} p_n\frac{L}{n} \ge p_n\frac{L(n-L)}{en^2},$$
where the second inequality is by $P_1 \ge \frac{n-L}{n}\left(1-\frac{1}{n}\right)^{n-1}$, since it is sufficient to flip just one 0 bit, and the last inequality is by $\left(1-\frac{1}{n}\right)^{n-1} \ge \frac{1}{e}$.
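$$P'_{-1} = P_{-1}\left(p_n\frac{L}{n} \cdot p_n\frac{n-L+1}{n}\right) \le \frac{L}{n}\left(p_n\frac{L}{n} \cdot p_n\frac{n-L+1}{n}\right) \le p_n\frac{L(n-L+1)}{2en^2} \le p_n\frac{L(n-L)}{en^2},$$
where the first inequality is by $P_{-1} \le \frac{L}{n}$, since it is necessary to flip at least one 1 bit, the second inequality is by $p_n \le \frac{1}{2e}$ (together with $L \le n$), and the last inequality is by $n-L+1 \le 2(n-L)$, which holds because $n-L \ge 1$.

Thus, we have for all $0 \le L \le n-1$, $P'_1 \ge P'_{-1}$. Because we are to get the upper bound of the expected running time for finding $1^n$, we can pessimistically assume that $P'_1 = P'_{-1}$. Then, we can view the evolutionary process as a random walk on the path $\{0, 1, 2, \ldots, n\}$. We call a step that jumps to a neighbor state a relevant step. Thus, by Lemma 9, it needs at most $2n^2$ expected relevant steps to find $1^n$. Because the probability of a relevant step is at least
$$P'_1 \ge P_1(1-p_n)^2 \ge \frac{n-L}{n}\left(1-\frac{1}{n}\right)^{n-1}\left(1-\frac{1}{2e}\right)^2 \ge \left(1-\frac{1}{2e}\right)^2\Big/(en),$$
the expected running time for a relevant step is $O(n)$. Thus, the expected running time of (1+1)-EA with $\tau = 1$ on the $I_{easiest}$ problem with $p_n \le \frac{1}{2e}$ is upper bounded by $O(n^3)$.

Theorem 8. The PNT of (1+1)-EA using re-evaluation with threshold selection $\tau = 2$ and mutation probability $\frac{1}{n}$ on the $I_{easiest}$ problem is lower bounded by $1 - 1/\Omega(\mathrm{poly}(n))$ and upper bounded by $1 - 1/O(2^n\mathrm{poly}(n))$, where $\mathrm{poly}(n)$ indicates any polynomial of $n$, with respect to one-bit noise.

Theorem 8 can be directly derived from the following lemma.

Lemma 11. For (1+1)-EA using re-evaluation with threshold selection $\tau = 2$ and mutation probability $\frac{1}{n}$ on the $I_{easiest}$ problem under one-bit noise, the expected running time is $O(n\log n/(p_n(1-p_n)))$ and $\Omega(n^2/(2^n p_n(1-p_n)))$.

Proof. Let $L$ ($0 \le L \le n$) denote the number of 1 bits of the current solution $x$. Here, an offspring solution $x'$ will be accepted only if $f^N(x') - f^N(x) \ge 2$. As in the proof of Lemma 10, we can derive
$$\forall d \le -1: P'_d = 0;$$
$$P'_1 = P_1\left(p_n\frac{L}{n}\left((1-p_n) + p_n\frac{n-L-1}{n}\right) + (1-p_n)\left(p_n\frac{n-L-1}{n}\right)\right);$$
$$\forall d \ge 2: P'_d > 0.$$
Thus, $L$ will never decrease in the evolution process, and it can increase in one step with probability
$$\sum_{d>0} P'_d > P'_1 \ge \frac{n-L}{n}\left(1-\frac{1}{n}\right)^{n-1}\left((1-p_n)p_n\frac{n-1}{n} + \frac{L(n-L-1)}{n^2}p_n^2\right) \ge \frac{(n-L)(1-p_n)p_n}{2en}.$$
Then, we can get that the expected steps until $L = n$ (i.e., the optimal solution is found) is at most $\sum_{L=0}^{n-1}\frac{2en}{(1-p_n)p_n(n-L)}$, i.e., $O(\frac{n\log n}{p_n(1-p_n)})$.

Then, we are to analyze the lower bound. Assume that the initial solution $x_{init}$ has $n-1$ 1 bits. Before finding the optimal solution, the solution $x$ in the population will always satisfy $|x|_1 = n-1$ because $\forall d \le -1: P'_d = 0$. The optimal solution (i.e., $|x|_1 = n$) will be found in one step with probability
$$P'_1 = P_1 \cdot p_n(1-p_n)\frac{n-1}{n} = \frac{1}{n}\left(1-\frac{1}{n}\right)^{n-1} p_n(1-p_n)\frac{n-1}{n} \le \frac{p_n(1-p_n)}{en},$$
where the inequality uses $\left(1-\frac{1}{n}\right)^{n} \le \frac{1}{e}$. Thus, the expected steps for finding the optimal solution when starting from $|x_{init}|_1 = n-1$ is at least $\frac{en}{p_n(1-p_n)}$.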
By the uniform distribution of the initial solution, the probability that $|x_{init}|_1 = n-1$ is $\frac{n}{2^n}$. Thus, we can get the lower bound $\Omega(\frac{n^2}{2^n p_n(1-p_n)})$ for the expected running time of the whole process.
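The acceptance probabilities used in the proofs of Lemma 10 and Lemma 11 can be checked mechanically by enumerating the noisy values of parent and offspring. The sketch below does this with exact rational arithmetic, assuming that the fitness counts 1 bits and that parent and offspring are re-evaluated independently under one-bit noise; all function names are illustrative and not taken from the paper.

```python
from fractions import Fraction

def noisy_value_dist(ones, n, p):
    """Distribution of the noisy fitness of a string with `ones` 1 bits under one-bit noise."""
    dist = {ones: 1 - p}                               # no noise
    if ones > 0:
        dist[ones - 1] = p * Fraction(ones, n)         # noise flips a 1 bit
    if ones < n:
        dist[ones + 1] = p * Fraction(n - ones, n)     # noise flips a 0 bit
    return dist

def accept_prob(parent_ones, child_ones, n, p, tau):
    """Probability that f^N(child) - f^N(parent) >= tau with independent re-evaluations."""
    parent = noisy_value_dist(parent_ones, n, p)
    child = noisy_value_dist(child_ones, n, p)
    return sum(pa * pc
               for fa, pa in parent.items()
               for fc, pc in child.items()
               if fc - fa >= tau)

def closed_form_tau1(L, n, p):
    """Factor multiplying P_1 in the expression for P'_1 in the proof of Lemma 10."""
    return (p * Fraction(L, n)
            + (1 - p) * (1 - p + p * Fraction(n - L - 1, n))
            + p * Fraction(n - L, n) * p * Fraction(n - L - 1, n))

def closed_form_tau2(L, n, p):
    """Simplified factor multiplying P_1 in the proof of Lemma 11."""
    return (1 - p) * p * Fraction(n - 1, n) + p * p * Fraction(L * (n - L - 1), n * n)

if __name__ == "__main__":
    n, p = 7, Fraction(1, 3)
    for L in range(n):
        print(L, accept_prob(L, L + 1, n, p, 1), accept_prob(L, L + 1, n, p, 2))
```

For every $L$ the enumeration agrees with the closed forms, and with $\tau = 2$ an offspring with fewer 1 bits is never accepted, which is the fact that the lower-bound argument of Lemma 11 relies on.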
4.3. Smooth Threshold Selection

We propose the smooth threshold selection as in Definition 8, which modifies the original threshold selection by changing the hard threshold value to a smooth one. We are to show that, by such a small modification, the PNT of (1+1)-EA on the $I_{easiest}$ problem is improved to 1, which means that the expected running time of (1+1)-EA is always polynomial regardless of the one-bit noise level.

Definition 8 (Smooth Threshold Selection). Let $\delta$ be the gap between the fitness of the offspring solution $x'$ and the parent solution $x$, i.e., $\delta = f(x') - f(x)$. Then, the selection process will behave as follows:
(1) if $\delta \le 0$, $x'$ will be rejected;
(2) if $\delta = 1$, $x'$ will be accepted with probability $\frac{1}{5n}$;
(3) if $\delta > 1$, $x'$ will be accepted.

Theorem 9. The PNT of (1+1)-EA using re-evaluation with smooth threshold selection and mutation probability $\frac{1}{n}$ on the $I_{easiest}$ problem is 1, with respect to one-bit noise.

Proof. We first analyze $P'_d$ in the same way as in the proof of Lemma 10. The only difference is that when the fitness gap between the offspring and the parent solution is 1, the offspring solution will be accepted with probability $\frac{1}{5n}$ here, while it is always accepted in the proof of Lemma 10. Thus, for smooth threshold selection, we can similarly derive
$$\forall d \le -2: P'_d = 0;$$
$$P'_{-1} = P_{-1}\left(p_n\frac{L}{n} \cdot p_n\frac{n-L+1}{n}\right)\frac{1}{5n};$$
$$P'_1 = P_1\left(p_n\frac{L}{n}\left(p_n\frac{L+1}{n} \cdot \frac{1}{5n} + (1-p_n) + p_n\frac{n-L-1}{n}\right) + (1-p_n)\left((1-p_n)\frac{1}{5n} + p_n\frac{n-L-1}{n}\right) + p_n\frac{n-L}{n} \cdot p_n\frac{n-L-1}{n} \cdot \frac{1}{5n}\right);$$
$$\forall d \ge 2: P'_d > 0.$$
Note that $L$ ($0 \le L \le n$) denotes the number of 1 bits of the current solution $x$. Our goal is to reach $L = n$. If starting from $L = n-1$, $L$ will reach $n$ in one step with probability
$$P'_1 \ge P_1\left(p_n\frac{L}{n} \cdot p_n\frac{L+1}{n} \cdot \frac{1}{5n} + (1-p_n)(1-p_n)\frac{1}{5n}\right) \ge \frac{1}{5en^2}\left(\frac{n-1}{n}p_n^2 + (1-p_n)^2\right) \in \Omega\left(\frac{1}{n^2}\right),$$
where the second inequality is by $L = n-1$ and $\left(1-\frac{1}{n}\right)^{n-1} \ge \frac{1}{e}$, and the last step is by $0 \le p_n \le 1$. Thus, for reaching $L = n$, we need to reach $L = n-1$ for $O(n^2)$ times in expectation.
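As a sanity check on this first step of the proof, the acceptance rule of Definition 8 can be enumerated exactly in the same style as the threshold rule. The sketch below assumes that the fitness counts 1 bits, that one-bit noise is applied independently to parent and offspring, and that the acceptance probability for a gap of 1 is $\frac{1}{5n}$; the function names are illustrative.

```python
from fractions import Fraction

def noisy_value_dist(ones, n, p):
    """Distribution of the noisy fitness of a string with `ones` 1 bits under one-bit noise."""
    dist = {ones: 1 - p}
    if ones > 0:
        dist[ones - 1] = p * Fraction(ones, n)
    if ones < n:
        dist[ones + 1] = p * Fraction(n - ones, n)
    return dist

def smooth_accept_prob(parent_ones, child_ones, n, p):
    """Acceptance probability under Definition 8 with independent re-evaluations."""
    q = Fraction(1, 5 * n)                 # acceptance probability when the gap equals 1
    total = Fraction(0)
    for fa, pa in noisy_value_dist(parent_ones, n, p).items():
        for fc, pc in noisy_value_dist(child_ones, n, p).items():
            gap = fc - fa
            if gap > 1:
                total += pa * pc           # case (3): always accepted
            elif gap == 1:
                total += pa * pc * q       # case (2): accepted with probability 1/(5n)
    return total                           # case (1), gap <= 0, contributes nothing

def lower_bound_at_last_level(n, p):
    """Bracket of the bound used for L = n - 1, without the mutation factor P_1."""
    return Fraction(1, 5 * n) * (p * p * Fraction(n - 1, n) + (1 - p) ** 2)

if __name__ == "__main__":
    n = 9
    for p in (Fraction(0), Fraction(1, 4), Fraction(1, 2), Fraction(1)):
        print(p, smooth_accept_prob(n - 1, n, n, p), lower_bound_at_last_level(n, p))
```

The exact value at $L = n-1$ never falls below the bracket for any noise level, including $p_n = 1$, which is the property that makes the resulting bound independent of $p_n$.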
Then, we analyze the expected running time until $L = n-1$. In this process, we can pessimistically assume that $L = n$ will never be reached, because our final goal is to get the upper bound on the expected running time for reaching $L = n$. For $0 \le L \le n-2$, we have
$$\frac{P'_1}{P'_{-1}} \ge \frac{P_1\left(p_n\frac{L}{n} \cdot p_n\frac{n-L-1}{n}\right)}{P_{-1}\left(p_n\frac{L}{n} \cdot p_n\frac{n-L+1}{n}\right)\frac{1}{5n}} \ge \frac{\frac{n-L}{n}\left(1-\frac{1}{n}\right)^{n-1}\left(p_n\frac{L}{n} \cdot p_n\frac{n-L-1}{n}\right)}{\frac{L}{n}\left(p_n\frac{L}{n} \cdot p_n\frac{n-L+1}{n}\right)\frac{1}{5n}} \ge \frac{5n(n-L)(n-L-1)}{eL(n-L+1)} > 1,$$
where the last inequality holds because $\frac{n}{L} \ge 1$ and $\frac{(n-L)(n-L-1)}{n-L+1} \ge \frac{2}{3}$ for $n-L \ge 2$.

Again, we can pessimistically assume that $P'_1 = P'_{-1}$ and $\forall d \ge 2: P'_d = 0$, because we are to get the upper bound on the expected running time until $L = n-1$. Then, we can view the evolutionary process for reaching $L = n-1$ as a random walk on the path $\{0, 1, 2, \ldots, n-1\}$. We call a step that jumps to a neighbor state a relevant step. Thus, by Lemma 9, it needs at most $2(n-1)^2$ expected relevant steps to reach $L = n-1$. Because the probability of a relevant step is at least
$$P'_1 \ge P_1\left((1-p_n)(1-p_n)\frac{1}{5n} + p_n\frac{n-L}{n} \cdot p_n\frac{n-L-1}{n} \cdot \frac{1}{5n}\right) \ge \frac{n-L}{5en^2}\left((1-p_n)^2 + p_n^2\frac{(n-L)(n-L-1)}{n^2}\right) \ge \frac{2}{5en^2}\left((1-p_n)^2 + \frac{2p_n^2}{n^2}\right) \in \Omega\left(\frac{1}{n^4}\right),$$
where the last step holds because the bracket is minimized at $p_n = \frac{n^2}{n^2+2}$ with value $\frac{2}{n^2+2}$, the expected running time for a relevant step is $O(n^4)$. Then, the expected running time for reaching $L = n-1$ is $O(n^6)$.

Thus, the expected running time of the whole optimization process is $O(n^8)$ for any $p_n \in [0, 1]$, and then this theorem holds.

From the proof of Theorem 9 we can draw an intuitive understanding of why the smooth threshold selection can be better than the original threshold selections. By changing the hard threshold to a smooth threshold, it not only makes the probability of accepting a falsely better solution in one step small enough, i.e., $P'_{-1} \le P'_1$, but also makes the probability of producing progress in one step large enough, i.e., $P'_1$ is not small.

5. Discussions and Conclusions

This paper studies theoretical issues of noisy optimization by evolutionary algorithms. First, we discover that an optimization problem may become easier instead of harder in a noisy environment. We then derive a sufficient condition under which noise makes optimization easier or harder. By checking this condition, we have shown that for (1+λ)-EA, noise makes the optimization on the hardest and the easiest case in the pseudo-Boolean function class easier and harder, respectively. We also hypothesize that we need to take care of noise only when the optimization problem is moderately or less complex. Experiments on the $\mathrm{Jump}_{m,n}$ problem, which has an adjustable difficulty parameter, supported our hypothesis.

For problems where the noise has a negative effect, we then study the usefulness of two commonly employed noise-handling strategies, re-evaluation and threshold selection. The study takes the easiest case in the pseudo-Boolean function class as the representative problem, where the noise significantly harms the expected running time of the (1+1)-EA. We use the polynomial noise tolerance (PNT) level as the performance measure, and analyze the PNT of each EA.

The re-evaluation strategy seems to be a reasonable method for reducing random noise. However, we derive that the (1+1)-EA with single-evaluation has a PNT lower bound $1 - 1/\Omega(\mathrm{poly}(n))$ from Theorem 5, which is close to 1, whilst the (1+1)-EA with re-evaluation has the PNT $\Theta(\log(n)/n)$, which can be quite close to zero when $n$ is large. It is surprising to see that the re-evaluation strategy leads to a much worse noise tolerance than using no noise handling method at all.

The re-evaluation with threshold selection strategy has a better PNT compared with re-evaluation alone. When the threshold is 1, we derive a PNT lower bound $\frac{1}{2e}$ from Theorem 7, and when the threshold is 2, we obtain $1 - 1/\Omega(\mathrm{poly}(n))$ from Theorem 8. The improvement over re-evaluation alone could be explained by the threshold selection filtering out fake progress caused by the noise. However, it still shows no improvement over the (1+1)-EA without any noise handling method.

We then propose the smooth threshold selection, which acts like the threshold selection with threshold 2 but accepts progress of 1 with a probability. We prove that the (1+1)-EA with the smooth threshold selection has the PNT 1 from Theorem 9, which exceeds that of (1+1)-EA without any noise handling method.
Our explanation is that, like the original threshold selection, the proposed one filters out fake progress, while it also keeps some chance of accepting real progress.

Although the EAs and problems investigated in this paper are simple and specifically used for the theoretical analysis of EAs, the analysis still disclosed counter-intuitive results and, particularly, demonstrated that theoretical investigation is essential in designing better noise handling strategies. We are optimistic that our findings may be helpful for practical uses of EAs, which will be studied in the future.

6. Acknowledgements

to be added...
References

[1] R. Aleliunas, R. Karp, R. Lipton, L. Lovasz, and C. Rackoff. Random walks, universal traversal sequences, and the complexity of maze problems. In Proceedings of the 20th Annual Symposium on Foundations of Computer Science (FOCS'79), San Juan, Puerto Rico, 1979.
[2] D. V. Arnold and H.-G. Beyer. Local performance of the (1+1)-ES in a noisy environment. IEEE Transactions on Evolutionary Computation, 6(1):30-41, 2002.
[3] D. V. Arnold and H.-G. Beyer. A comparison of evolution strategies with other direct search methods in the presence of noise. Computational Optimization and Applications, 24(1):135-159, 2003.
[4] T. Bäck. Evolutionary Algorithms in Theory and Practice: Evolution Strategies, Evolutionary Programming, Genetic Algorithms. Oxford University Press, Oxford, UK, 1996.
[5] T. Bartz-Beielstein. New experimentalism applied to evolutionary computation. PhD thesis, University of Dortmund.
[6] T. Beielstein and S. Markon. Threshold selection, hypothesis tests, and DOE methods. In Proceedings of the IEEE Congress on Evolutionary Computation (CEC'02), Honolulu, HI, 2002.
[7] H.-G. Beyer. Evolutionary algorithms in noisy environments: theoretical issues and guidelines for practice. Computer Methods in Applied Mechanics and Engineering, 186(2), 2000.
[8] S.-J. Chang, H.-S. Hou, and Y.-K. Su. Automated passive filter synthesis using a novel tree representation and genetic programming. IEEE Transactions on Evolutionary Computation, 10(1):93-100, 2006.
[9] Y. Chang and S. Chen. A new query reweighting method for document retrieval based on genetic algorithms. IEEE Transactions on Evolutionary Computation, 10(5):617-622, 2006.
[10] S. Droste. Analysis of the (1+1) EA for a noisy OneMax. In Proceedings of the 6th ACM Annual Conference on Genetic and Evolutionary Computation (GECCO'04), Seattle, WA, 2004.
[11] S. Droste, T. Jansen, and I. Wegener. A rigorous complexity analysis of the (1+1) evolutionary algorithm for linear functions with Boolean inputs. Evolutionary Computation, 6(2):185-196, 1998.
[12] S. Droste, T. Jansen, and I. Wegener. On the analysis of the (1+1) evolutionary algorithm. Theoretical Computer Science, 276(1-2):51-81, 2002.
Part III. Areal Data Aalysis 0. Comparative Tests amog Spatial Regressio Models While the otio of relative likelihood values for differet models is somewhat difficult to iterpret directly (as metioed above),
More informationUpper and Lower Bounds on Unrestricted Black-Box Complexity of Jump n,l
Upper ad Lower Bouds o Urestricted Black-Box Complexity of Jump,l Maxim Buzdalov 1, Mikhail Kever 1, ad Bejami Doerr 1 ITMO Uiversity, 49 Kroverkskiy av., Sait-Petersburg, Russia, 197101 mbuzdalov@gmail.com,
More informationFrequentist Inference
Frequetist Iferece The topics of the ext three sectios are useful applicatios of the Cetral Limit Theorem. Without kowig aythig about the uderlyig distributio of a sequece of radom variables {X i }, for