University of Texas at El Paso. Vladik Kreinovich University of Texas at El Paso,

Size: px
Start display at page:

Download "University of Texas at El Paso. Vladik Kreinovich University of Texas at El Paso,"

Transcription

1 Uiversity of Texas at El Paso Departmetal Techical Reports (CS) Departmet of Computer Sciece How to Deal with Ucertaities i Computig: from Probabilistic ad Iterval Ucertaity to Combiatio of Differet Approaches, with Applicatios to Egieerig ad Bioiformatics Vladik Kreiovich Uiversity of Texas at El Paso, vladik@utep.edu Follow this ad additioal works at: Part of the Computer Scieces Commos Commets: Techical Report: UTEP-CS To appear i Proceedigs of the Eighth Iteratioal Coferece o the Applicatios of Digital Iformatio ad Web Techologies ICADIWT'2017, Ciudad Juarez, Chihuahua, Mexico, March 29-31, 2017 Recommeded Citatio Kreiovich, Vladik, "How to Deal with Ucertaities i Computig: from Probabilistic ad Iterval Ucertaity to Combiatio of Differet Approaches, with Applicatios to Egieerig ad Bioiformatics" (2017). Departmetal Techical Reports (CS) This Article is brought to you for free ad ope access by the Departmet of Computer Sciece at DigitalCommos@UTEP. It has bee accepted for iclusio i Departmetal Techical Reports (CS) by a authorized admiistrator of DigitalCommos@UTEP. For more iformatio, please cotact lweber@utep.edu.

2 How to Deal with Ucertaities i Computig: from Probabilistic ad Iterval Ucertaity to Combiatio of Differet Approaches, with Applicatios to Egieerig ad Bioiformatics Vladik KREINOVICH a,1 a Departmet of Computer Sciece, Uiversity of Texas at El Paso, USA Abstract. Most data processig techiques traditioally used i scietific ad egieerig practice are statistical. These techiques are based o the assumptio that we kow the probability distributios of measuremet errors etc. I practice, ofte, we do ot kow the distributios, we oly kow the boud o the measuremet accuracy hece, after the get the measuremet result x, the oly iformatio that we have about the actual (ukow) value x of the measured quatity is that x belogs to the iterval [ x, x + ]. Techiques for data processig uder such iterval ucertaity are called iterval computatios; these techiques have bee developed sice 1950s. I may practical problems, we have a combiatio of differet types of ucertaity, where we kow the probability distributio for some quatities, itervals for other quatities, ad expert iformatio for yet other quatities. The purpose of this paper is to describe the theoretical backgroud for iterval ad combied techiques ad to briefly describe the existig practical applicatios. Keywords. ucertaity, iterval computatios, probabilistic ucertaity 1. Data Processig uder Ucertaity: A Geeral Problem Need for idirect measuremets. I may practical situatios, we are iterested i the value of a quatity y which is difficult (or eve impossible) to measure directly, such as the amout of oil i a give area or tomorrow s temperature. Sice we caot measure y directly, a atural idea is to measure y idirectly, i.e., fid some easier-to-measure quatities x 1,...,x which are related to y by a kow depedece y = f (x 1,...,x ), ad use the results x i of measurig x i to estimate y as f ( x 1,..., x ). 1 Author: Vladik Kreiovich, Departmet of Computer Sciece, Uiversity of Texas at El Paso, 500 W. Uiversity, El Paso, TX 79968, USA; vladik@utep.edu.

3 x 1 x 2 f ỹ = f ( x 1,..., x ) x Idirect measuremets are also kow as data processig. Need to take ucertaity ito accout. Measuremets are ever 100% accurate: the measuremet result x i is, i geeral, somewhat differet from the actual (ukow) def values x i, ad we have a o-zero measuremet error x i = x i x i. Hece, the result ỹ = f ( x 1,..., x ) of idirect measuremet is, i geeral, differet, from the actual value y = f (x 1,...,x ). How big is the resultig ucertaity y def = ỹ y? x 1 x 2... f y x Probabilistic ad iterval ucertaity. Traditioal approach to ucertaity assumes that we kow the probability distributios for x i (usually, they are assumed to be Gaussia). I may practical situatios, we ca ideed determie these probability distributios by calibratig the correspodig measurig istrumet (MI), i.e., to comparig its result with the results of measurig the same quatity by a much more accurate ( stadard ) MI. However, there are two case whe the calibratio is ot possible: state-of-the-art measuremets, e.g., i fudametal sciece, whe we use the best MI available ad so, o more-accurate MIs are available for compariso, ad maufacturig, whe calibratio of each MI is possible, but would be prohibitively expesive. I such cases, all we kow is the upper boud i o the (absolute value of the) measuremet error x i, the boud provided by the maufacturer of the MI; see, e.g., [13]. I this case, oce we have the measuremet result x i, the oly thig we kow about the actual (ukow) value x i is that x i belogs to the iterval [ x i i, x i + i ]. Algorithms for processig such iterval ucertaity are kow as iterval computatio; see, e.g., [3,8].

4 2. Iterval Computatios Iterval computatios: a problem. We kow a algorithm y = f (x 1,...,x ) ad we kow itervals x i = [x i,x i ]. We wat to compute the rage of possible values of y: y = [y,y] = { f (x 1,...,x ) x 1 [x 1,x 1 ],...,x [x,x ]}. x 1 x 2 f y = f (x 1,...,x ) x I geeral, this problem is NP-hard eve for quadratic f ; see, e.g., [5]. So, from the practical viewpoit, we face the followig challeges: whe are feasible algorithms possible? ad whe computig y is ot feasible, how ca we fid a good approximatio Y y. Iterval computatios: a brief history. Iterval computatios ca be traced to Archimedes who used lower ad upper bouds to estimate π. I moder times, this area was revived by three 1950s pioeers: M. Warmus (Polad), T. Suaga (Japa), ad R. Moore (USA). The early 1960s witessed the first boom, icludig takig iterval ucertaity ito accout whe plaig spaceflights to the Moo. Curret applicatios rage from desigig elemetary particle colliders to checkig whether a comet will hit the Earth to robotics ad chemical egieerig; see, e.g., [3,8]. But why ot use Maximum Etropy (MaxEt) approach? Ofte i practice, may differet probability distributios are cosistet with the same observatios. Istead of cosiderig all possible distributios as iterval computatios do why ot select oe of these distributios e.g., the oe with the largest etropy. For example, if all we kow is that x belogs to a iterval, the this MaxEt approach leads to a uiform distributio o this iterval. For several variables, if we have o iformatio about their depedece, this approach cocludes that these variables are idepedet. This idea is reasoable, but it has a serious limitatio. As a example, let us take the simplest possible data proceedig algorithm y = x x. I this case, y = x x. Let us assume that all measuremet errors x i are limited to the iterval [, ]. The, the worst case situatio is whe x i = ad thus, y =. O the other had, if we use the MaxEt approach, the for large, due to Cetral Limit Theorem, y is close to ormal, with σ =. So, if we, as usual, igore 3 deviatios exceedig 6σ, we get a upper boud but it is possible that =. Coclusio: usig a sigle distributio ca be very misleadig, especially if we wat guarateed results ad this is very importat i high-risk applicatio areas such as space exploratio or uclear egieerig.

5 Iterval arithmetic: foudatios of iterval techiques. The geeral problem of iterval computatios is to compute the rage of a give fuctio f (x 1,...,x ) o give itervals x i. For arithmetic operatios f (x 1,x 2 ) (ad for elemetary fuctios), we have explicit formulas for the rage; e.g., whe x 1 x 1 = [x 1,x 1 ] ad x 2 x 2 = [x 2,x 2 ], the: The rage x 1 + x 2 for x 1 + x 2 is [x 1 + x 2,x 1 + x 2 ]. The rage x 1 x 2 for x 1 x 2 is [x 1 x 2,x 1 x 2 ]. The rage x 1 x 2 for x 1 x 2 is [y,y], where y = mi(x 1 x 2,x 1 x 2,x 1 x 2,x 1 x 2 ) ad y = max(x 1 x 2,x 1 x 2,x 1 x 2,x 1 x 2 ). The rage 1/x 1 for 1/x 1 is [1/x 1,1/x 1 ] (if 0 x 1 ). Straightforward iterval computatios: example. Iside the computer, each computatio is a sequece of arithmetic operatios. For example, computig f (x) = (x 2) (x + 2) meas that we compute r 1 := x 2, the r 2 := x + 2, ad fially, r 3 := r 1 + r 2. To fid the rage of fuctio o a give iterval, a seemigly atural idea is to perform the same operatios, but with itervals istead of umbers. For example, for x [1,2], we compute r 1 := [1,2] [2,2] = [ 1,0], r 2 := [1,2]+[2,2] = [3,4], ad r 3 := [ 1,0] [3,4] = [ 4,0]. However, the actual rage is arrower: f (x) = [ 3,0]. Thus, we eed to come up with more efficiet ways of computig a eclosure Y y. First idea: use of mootoicity. For arithmetic operatios, we had exact rages because +,, ad are mootoic i each variable. I geeral, if f (x 1,...,x ) is (ostrictly) icreasig ( f ) i each x i, the f (x 1,...,x ) = [ f (x 1,...,x ), f (x 1,...,x )]. We ca get similar simplifyig formulas if f for some x i ad f for other x j. How ca we check whether f? It is well kow that f i x i if f x i 0 for all values x = (x 1,...,x ). So, to check for mootoicity, it is sufficiet to check whether the rage [r i,r i ] of f x i o x i has r i 0. Differetiatio ca easily doe, e.g., by Automatic Differetiatio (AD) tools. Estimatig rages of f x i ca be doe, e.g., by straightforward iterval computatios. So, if the rage [r i,r i ] of each f x i o x i has r i 0, the f (x 1,...,x ) = [ f (x 1,...,x ), f (x 1,...,x )]. I the above example of f (x) = (x 2) (x + 2) o x = [1,2], we have d f dx = 1 (x+2)+(x 2) 1 = 2x. The rage of this derivative is [r,r] = [2,4], with 2 0. Thus, we get f ([1,2]) = [ f (1), f (2)] = [ 3,0], which is the exact rage. Secod idea: cetered form. Not all fuctios are mootoic. I such cases, we ca use aother idea from calculus the Itermediate Value Theorem, accordig to which

6 f (x 1,...,x ) = f ( x 1,..., x ) + f (χ) (x i x i ) x i for some χ i x i. We ca thus coclude that f (x 1,...,x ) Y, where Y = ỹ + f (x 1,...,x ) [ i, i ]. x i Here, differetiatio ca be performed by the AD tools, ad the rage of the derivatives ca be estimated, if appropriate, by mootoicity, or by straightforward iterval computatios or itself by cetered form (this will take more time but lead to a more accurate estimatio). As a example, we ca take a o-mootoic fuctio f (x) = x (1 x) o the iterval x = [0,1]. Computig this fuctio meas computig r 1 := 1 x ad r 2 := x r 1. Thus, straightforward iterval computatios lead to r 1 := [1,1] [0,1] = [0,1] ad r 2 := [0, 1] [0, 1] = [0, 1], while the actual rage is [0, 0.25]. Here, x = [ x, x + ], with x = 0.5 ad = 0.5, ad d f = 1 (1 x) + x ( 1) = dx 1 2x, thus d f (x) = 1 2 [0,1] = [ 1,1]. So, we get the estimate dx Y = f ( x) + d f (x) [, ] = 0.5 (1 0.5) + [ 1,1] [ 0.5,0.5] = dx [ 0.5,0.5] = [ 0.25,0.75]. The upper boud is better tha for straightforward iterval computatios, but the lower boud is worse. This example shows that we eed other ideas. Third idea: bisectio. The cetered form is based o the first order formula whose accuracy is O( 2 i ). Thus, if he itervals are too wide, we ca split oe of them i half decreasig the error from 2 i to 2 i /4 ad take the uio of the resultig rages. For example, for f (x) = x (1 x) o x x = [0,1], we split ito x = [0,0.5] ad x = [0.5,1]. O both subitervals, f (x) is mootoic: e.g., o x, 1 2 x = 1 2 [0.5,1] = [ 1,0] with r 0. So, we get the exact bouds f (x ) = [ f (0), f (0.5)] = [0,0.25] ad f (x ) = [ f (1), f (0.5)] = [0,0.25], ad the exact overall estimate f (x ) f (x ) = [0,0.25]. Of course, this is a toy example; i practice, we do ot always get the exact rage, but by combiig the above three ideas, we get reasoable estimates for the rage. Alterative approach: affie arithmetic. So far, we compute the rage of x (1 x) by multiplyig rages of x ad 1 x. We igored the fact that both factors deped o x ad are, thus, depedet. A atural idea is thus, for each itermediate result a, to keep a explicit depedece o x i = x i x i (at least its liear terms). I other words, we represet each itermediate result as a = a 0 + a i x i + [a,a]. We start with x i = x i x i, i.e., with

7 x i + 0 x x i 1 + ( 1) x i + 0 x i x + [0,0], where a 0 = x i, a i = 1, a j = 0 for j i, ad [a,a] = [0,0]. The, for a = a 0 + a i x i + a ad b = b 0 + b i x i + b, we get: Additio: c 0 = a 0 + b 0, c i = a i + b i, c = a + b. Subtractio: c 0 = a 0 b 0, c i = a i b i, c = a b. Multiplicatio: c 0 = a 0 b 0, c i = a 0 b i + b 0 a i, ad c = a 0 b + b 0 a + i j a i b j [ i, i ] [ j, j ] + a i b i [ i, i ] 2 + i ( ) ( ) a i [ i, i ] i b + b i [ i, i ] i a + a b. For example, for f (x) = x (1 x) o x [0,1], = 1, x = 0.5, ad = 0.5. The computer computes f (x) by computig r 1 := 1 x ad r 2 := x r 1. I affie arithmetic, we start with x = 0.5 x + [0,0], the compute r 1 := 1 (0.5 x) = x ad r 2 := (0.5 x) (0.5 + x), i.e., r 2 = x [, ] 2 = [ 2,0]. The resultig rage is y = [ 0.25,0] = [0,0.25], which is the exact rage. I our simple example, we got the exact rage. However, i geeral, rage estimatio is NP-hard. This meas that ay feasible (polyomial-time) algorithm will sometimes lead to excess width: Y y. I particular, affie arithmetic may lead to excess width. How to get more accurate estimates? Oe idea is to use bisectio. Aother idea is that, i additio to liear terms i x i, we also keep quadratic ad higher order terms i the Taylor expasio. This Taylor arithmetic has ideed bee successful, e.g., i collider desig. Iterval computatios vs. affie arithmetic: comparative aalysis. Affie arithmetic is usually more accurate, but it is also slower. Ideed, i iterval computatios, for each itermediate result a, we compute two values: edpoits a ad a of [a,a]. I affie arithmetic, for each a, we compute + 3 values: a 0, a 1,..., a, a, ad a. Thus, affie arithmetic is times slower. Solvig systems of equatios: extedig kow algorithms to situatios with iterval ucertaity. I may practical situatios, we eed to fid the ukows y i from the system of equatios g i (y 1,...,y ) = a i. May algorithms y j = f j (a 1,...,a m ) are kow for the cases whe we kow the values a i. I practice, however, we ofte a i with iterval ucertaity: we oly kow that a i [a i,a i ]. I this case, we wat to fid the correspodig rages of y j. A atural idea is to apply iterval computatios techiques to fid the rage f j ([a 1,a 1 ],...,[a,a ]). For specific equatios, we ofte already kow which ideas work best. For example, for liear equatios Ay = b, we ca use the kow fact that y is mootoic i b. Solvig systems of equatios whe o algorithm is kow. The mai idea is that we parse each equatio ito elemetary costraits, ad the use iterval computatios to improve origial rages util we get a arrow rage (= solutio).

8 First example: the equatio x x 2 = 0.5 for x [0,1] (this equatio has o solutio). Parsig leads to r 1 = x 2, ad 0.5(= r 2 ) = x r 1. From r 1 = x 2, we extract two rules: (1) x r 1 = x 2 ad (2) r 1 x = r 1. From 0.5 = x r 1, we extract two more rules: (3) x r 1 = x 0.5 ad (4) r 1 x = r We will apply these rules oe by oe, the agai, util we coverge. We start with x = [0,1], r = (, ). Rule (1) leads to r = [0,1] 2 = [0,1], so the ew rage for r is r ew = (, ) [0,1] = [0,1]. Now, Rule (2) leads to x ew = [0,1] [0,1] = [0,1] o chage. Rule (3) leads to r ew = ([0,1] 0.5) [0,1] = [ 0.5,0.5] [0,1] = [0,0.5]. Rule (4) leads to x ew = ([0,0.5] + 0.5) [0,1] = [0.5,1] [0,1] = [0.5,1]. Rule (1) lead to r ew = [0.5,1] 2 [0,0.5] = [0.25,0.5]. Rule (2) leads to x ew = [0.25,0.5] [0.5,1] = [0.5,0.71]; here, we roud a dow ad a up, to guaratee eclosure. Rule (3) leads to r ew = ([0.5,0.71] 0.5) [0.25,5] = [0.0.21] [0.25,0.5], i.e., r ew = /0. We coclude that the origial equatio has o solutios. Aother example: equatio x x 2 = 0 for x [0,1]. Parsig leads to r 1 = x 2, ad 0 = x r 1. So, we get the followig rules: (1) r = x 2 ; (2) x = r; (3) r = x; (4) x = r. We start with x = [0,1] ad r = (, ). The problem is that after Rule (1), we re stuck with x = r = [0,1]. A atural solutio is thus to bisect x = [0,1] ito [0,0.5] ad [0.5,1]. For the 1st subiterval: Rule (1) leads to r ew = [0,0.5] 2 [0,0.5] = [0,0.25]; Rule (4) leads to x ew = [0,0.25]; Rule (1) leads to r ew = [0,0.25] 2 = [0,0.0625]; Rule (4) leads to x ew = [0,0.0625]; etc. we coverge to x = 0. For the secod subiterval, we coverge to x = 1. Optimizatio: extedig kow algorithms to situatios with iterval ucertaity. I optimizatio, we eed to fid y 1,...,y m for which g(y 1,...,y m,a 1,...,a m ) max. I may practical situatios, we kow a i with iterval ucertaity: a i [a i,a i ]. We wat to fid the correspodig rages of y j. Ofte, for the case of exactly kow a i, we have a algorithm y j = f j (a 1,...,a ) for solvig the optimizatio problem. This is true, e.g., for a quadratic objective fuctio g. I this case, we ca apply iterval computatios techiques to fid the rage f j ([a 1,a 1 ],...,[a,a ]). For specific f, we ofte already kow which ideas work best, ad thus, we get a eve better solutio. Optimizatio whe o algorithm is kow. The mai idea is to divide the origial box x ito subboxes b. The, if max x b g(x) < g(x ) for a kow x, we dismiss the subbox b. As a example, let us take g(x) = x (1 x) for x = [0,1]. Let s divide [0,1] ito 10 subboxes b = [0,0.1],[0.1,0.2],... For each b, we fid g( b); the largest is = The, we compute G(b) = g( b) + (1 2 b) [, ], ad dismiss subboxes for which Y < For example, for [0.2,0.3], we have

9 0.25 (1 0.25) + (1 2 [0.2,0.3]) [ 0.05,0.05]. Here, Y = < , so we dismiss [0.2,0.3]. As a result, we keep oly boxes [0.3,0.7]. Further subdivisio will get us closer ad closer to x = A Example Where We Need to Combie Differet Approaches to Ucertaity: Chip Desig Formulatio of the problem. Oe of the mai problems of computer chip desig is to estimate the clock cycle o the desig stage. The clock cycle of a chip is costraied by the maximum path delay over all the circuit paths D def = max(d 1,...,D N ). The path delay D i alog the i-th path is the sum of the delays correspodig to the gates ad wires alog this path. Each of these delays, i tur, depeds o several factors such as the variatio caused by the curret desig practices, evirometal desig characteristics (e.g., variatios i temperature ad i supply voltage), etc.; see, e.g., [12]. Traditioal (iterval) approach to estimatig the clock cycle. The traditioal approach assumes that each factor takes the worst possible value. The resultig time delay correspods to the case whe all the factors are at their worst. This is ot be a very realistic case: differet factors are usually idepedet, ad thus, combiatio of may worst cases is improbable. As a result, the worst-case estimates are 30% above the observed clock time. Sice the clock time is set too high, chips are over-desiged ad uder-performig. Robust statistical methods are eeded. I the ideal case, whe we kow all the probability distributios, we ca use Mote-Carlo simulatios to estimate the correspodig ucertaity. I practice, we oly have partial iformatio about the distributios of some of the parameters. Usually, we kow the mea, ad we kow some characteristic of the deviatio from the mea e.g., the iterval that is guarateed to cotai possible values of this parameter. A possible approach would be to apply Mote-Carlo methods with several possible distributios. The problem is that there are ifiitely may possible distributios, ad so there is o guaratee that the result of usig fiitely may of them is a valid boud for all possible distributios. It is therefore desirable to provide robust bouds, i.e., bouds that work for all possible distributios. Towards a mathematical formulatio of the problem. Each gate delay d depeds o the differece x 1,...,x betwee the actual ad the omial values of the parameters. These differeces are usually small, so we ca safely igore terms which are quadratic ad higher order i x i ad thus assume that d is a liear fuctio of x i. Each path delay D i is the sum of gate delays, thus D i is also a liear fuctio of x j : D i = a i + j=1 a i j x j for some a i ad a i j. The desired maximum delay D = maxd i is thus the maximum of liear fuctios. It is kow that every maximum of liear fuctios is covex. Thus, the depedece D = F(x 1,...,x ) is covex. i

10 We kow that the factors x i are idepedet. We kow the distributios of some of the factors. For others, we kow rages [x j,x j ] ad meas E j. We are give the largest allowed probability ε > 0 of the fault. We eed to fid the smallest y 0 for which, for all possible distributios, we have y y 0 with the probability 1 ε. Additioal property: depedecy is o-degeerate. Sometimes, we lear additioal iformatio about oe of the factors x j. For example, we lear that x j actually belogs to a proper subiterval of the origial iterval [x j,x j ]. As a result, the origial class P of possible distributios is replaced with a smaller oe P P. I this case, the ew value y 0 ca oly decrease: y 0 y 0. If x j is irrelevat for y, the y 0 = y 0. Our assumptio is that irrelevat variables have bee weeded out. I this case, if we arrow dow oe of the itervals [x j,x j ], the resultig value y 0 decreases: y 0 < y 0. Formulatio of the problem i precise terms. GIVEN:, k, ε > 0; a covex fuctio y = F(x 1,...,x ) 0; k cdfs F j (x), k + 1 j ; itervals x 1,...,x k, values E 1,...,E k, TAKE: all joit probability distributios o R for which: all x i are idepedet, x j x j, E[x j ] = E j for j k, ad x j have distributio F j (x) for j > k. FIND: the smallest y 0 s.t. for all such distributios, F(x 1,...,x ) y 0 with probability 1 ε. WHEN: the problem is o-degeerate if we arrow dow oe of the itervals x j, y 0 decreases. Mai result ad how we ca use it. Our mai result is that y 0 is attaied whe for def each j from 1 to k, we take x j = x j with probability p j = x j E j, ad x j = x j with the x j x j remaiig probability p j def = E j x j x j x j. (The proof is give i the Appedix.) Thus, we arrive at the followig algorithm: simulate these distributios for x j, j < k; simulate kow distributios for j > k; use the simulated values x (s) j sort N values y (s) : y (1) y (2)... y (Ni ); take y (Ni (1 ε)) as y 0. to fid y (s) = F(x (s) 1,...,x(s) ); Commet about Mote-Carlo techiques. It is usually believed that Mote-Carlo methods are iferior to aalytical: they are approximate, they require large computatio time; ad simulatios for several distributios, may mis-calculate the (desired) maxi-

11 mum over all distributios. I our case, however, the value correspodig to the selected distributios ideed provide the desired maximum value y 0. This is a particular case of the geeral fact that justified Mote-Carlo methods ofte lead to faster computatios tha aalytical techiques a good example is multi-d itegratio, where Mote-Carlo methods were origially iveted. 4. Combiig Iterval ad Probabilistic Ucertaity: Geeral Case Need for such a combiatio. Chip desig is just oe of the examples of the practical situatios i which some iformatio comes i probabilistic form ad some i iterval form. So, we eed to lear how to combie these two types of ucertaity. How to represet a probability distributio. There are may ways to represet a probability distributio. A atural idea is to look for a objective. The objective of processig ucertai data is to make decisios, ad it is kow that ratioal decisio makig is equivalet to maximizig the expected values of a appropriate fuctio u(x, a) called utility: E x [u(x,a)] max; see, e.g., [2,7,9,14]. a I may practical situatios, the fuctio u(x) is smooth. We ca therefore expad u(x) i Taylor series ad keep oly the first few terms: we have u(x) = u(x 0 )+ (x x 0 ) u (x 0 ) +... The, to fid the expected value, it is eough to kow the momets E[(x x 0 ) k ]. If we do ot kow the exact momets, the we should kow (iterval) bouds o momets. Sometimes, we have a threshold-type u(x): e.g., whe a fie is imposed if the cocetratio of a certai pollutats exceeds a certai threshold. I this case, to compute E[u], we eed to kow cdf F(x) = Prob(ξ x). I the case of ucertaity, we have bouds [F(x),F(x)]; such bouds are kow as p-boxes. Extesio of iterval arithmetic to probabilistic case: successes ad challeges. To process the correspodig ucertaity, it is reasoable to parse the origial algorithm ito a sequece of elemetary operatios +,,, 1/x, max, mi, ad the use the formulas for these elemetary operatios. Explicit formulas are kow for itervals, for p-boxes, ad for itervals + 1st momets E i. I some cases, these formulas are easy to get, e.g., for +,, ad for product of idepedet x i ; see, e.g., [1,6,10]. For multiplicatio y = x 1 x 2, whe we have o iformatio about the correlatio, we get o-trivial formulas: E = max(p 1 + p 2 1,0) x 1 x 2 + mi(p 1,1 p 2 ) x 1 x 2 + mi(1 p 1, p 2 ) x 1 x 2 + max(1 p 1 p 2,0) x 1 x 2 ; E = mi(p 1, p 2 ) x 1 x 2 + max(p 1 p 2,0) x 1 x 2 + max(p 2 p 1,0) x 1 x 2 + mi(1 p 1,1 p 2 ) x 1 x 2, where p i def = (E i x i )/(x i x i ). The mai remaiig challeges are to deal with situatios whe we kow itervals + 1st ad 2d momets, or whe we ow momets + p-boxes. I some cases, such formulas are kow. Let us describe oe of these cases.

12 5. Case Study: Bioiformatics Practical problem. It is importat to fid geetic differece betwee cacer cells ad healthy cells. Ideal solutio to this problem is to directly measure cocetratios c of the gee i cacer cells ad h i healthy cells. I reality, such cells are difficult to separate. Thus, what we really measure is ot c, but y i x i c+(1 x i ) h, where x i is the percetage of cacer cells i i-th sample. This amout ca be described as a x i + h y i, where a def = c h. Ideal case. If we kow x i exactly, we ca use the Least Squares Method (a x i +h y i ) 2 C(x,y) mi. We the fid a = ad h = E(y) a E(x), where E(x) = a,h V (x) 1 x i, V (x) = 1 1 (x i E(x)) 2, ad C(x,y) = 1 1 (x i E(x)) (y i E(y)). Real-life case. I practice, experts maually cout x i, ad oly provide iterval bouds x i, e.g., x i [0.7,0.8]. We the eed to fid the rage of a ad h correspodig to all possible values x i [x i,x i ]. Solvig the problem. I geeral, we kow itervals x 1 = [x 1,x 1 ],..., x = [x,x ], ad we eed to compute the rage of E(x), V (x), etc. I geeral, this problem is NP-hard eve for the populatio variace V (x) [10]. However, there are efficiet algorithms for V ad C(x,y) for reasoable situatios; see, e.g., [10]. So, to solve our problem, we use these algorithms to fid itervals for C(x,y) ad for V (x) ad the divide these itervals. 6. Aother Case Study: Detectig Outliers Outlier detectio is importat. I may applicatio areas, it is importat to detect outliers, i.e., uusual, abormal values: i medicie, uusual values may idicate disease; i geophysics, abormal values may idicate a mieral deposit (or a erroeous measuremet result); i structural itegrity testig, abormal values may idicate faults i a structure, etc. The traditioal egieerig approach is that a ew measuremet result x is classified as a outlier if x [L,U], where L def = E k 0 σ ad U def = E + k 0 σ, ad k 0 > 1 is preselected. Most frequetly, k 0 = 2, 3, or 6. Outlier detectio uder iterval ucertaity. I may practical situatios, we oly have itervals x i = [x i,x i ]. Differet values x i x i lead to differet itervals [L,U]. We ca therefore distiguish betwee possible ad guarateed outliers. A possible outlier is whe x is outside some k 0 -sigma iterval. Fidig possible outliers is importat i structural itegrity so as ot to miss a fault. A guarateed outlier is a value which is outside all k 0 -sigma itervals. For example, before a breast surgery, we wat to make sure that there is a micro-calcificatio. Oe ca easily check that x is a possible outlier if x [L,U], ad x is a guarateed outlier if x [L,U]. So, to detect outliers, we must fid the rages [L,L] ad [U,U] of L = E k 0 σ ad U = E + k 0 σ. Algorithms for computig such rages have ideed bee proposed; see, e.g., [10].

13 7. Fuzzy Ucertaity Need for fuzzy ucertaity. Kowledge ofte comes from experts, ad expert usually describe their kowledge by usig imprecise ( fuzzy ) words from atural laguage such as small. To describe this kowledge i precise terms, a special techique called fuzzy was iveted; see, e.g., [4,11,15]. I this techiques, for each possible value x of the correspodig quatity, we ask the expert to estimate the degree µ(x) to which this value satisfies the appropriate property (i.e., to what extet this value is small). The expert ca do it, e.g., by markig a value o a scale from 0 to 10; if the expert marks 7, we take µ(x) = 7/10. The resultig fuctio µ(x) is kow as a membership fuctio or a fuzzy set. Data processig uder fuzzy ucertaity. We have a data processig algorithm y = f (x 1,...,x ) ad fuzzy sets µ i (x i ). We eed to come up with a fuzzy set µ(y) that describes the resultig ucertaity about y. To come up with the appropriate formulas, let us take ito accout that y is a possible value of the possible value of the correspodig quatity Y if ad oly if there exist umbers x 1,...,x such that each x i is a possible value of X i ad f (x 1,...,x ) = y. We kow the degrees µ i (x i ) to which each x i is a possible value of X i. So, if use mi(µ(a), µ(b)) to describe our degree of belief i A& B, ad max(d(a),d(b)) for A B, we coclude that µ(y) = max mi(µ 1(x 1 ),..., µ (x )). x 1,...,x : f (x 1,...,x )=y This formula is kow as Zadeh s extesio priciple. Reductio to iterval computatios. How ca we compute µ(y)? A direct optimizatio would be computatioally expesive. Good ew is that such computatios ca be simplified if we represet each fuzzy set µ i by its α-cuts X i (α) def = {x i : µ i (x i ) α}. It turs out that for cotiuous f, for every α, the α-cut Y (α) is equal to the rage of f o α-cuts of X i : Y (α) = f (X 1 (α),...,x (α)). Thus, to compute µ(y), we ca simply, for α = 0,0.1,0.2,...,1 apply iterval computatios techiques to compute Y (α). Refereces [1] S. Ferso, V. Kreiovich, L. Gizburg, D.S. Myers, ad K. Setz, Costructig Probability Boxes ad Dempster-Shafer Structures, Sadia Natioal Laboratories, Report SAND , Jauary [2] P.C. Fishbur, Utility Theory for Decisio Makig, Joh Wiley & Sos Ic., New York, [3] L. Jauli, M. Kiefer, O. Dicrit, ad E. Walter, Applied Iterval Aalysis, Spriger, Lodo, [4] G. Klir ad B. Yua, Fuzzy Sets ad Fuzzy Logic, Pretice Hall, Upper Saddle River, New Jersey, [5] V. Kreiovich, A. Lakeyev, J. Roh, ad P. Kahl, Computatioal Complexity ad Feasibility of Data Processig ad Iterval Computatios, Kluwer, Dordrecht, [6] V. Kreiovich, Iterval computatios ad iterval-related statistical techiques: tools for estimatig ucertaity of the results of data processig ad idirect measuremets, I: F. Pavese ad A.B. Forbes (eds.), Data Modelig for Metrology ad Testig i Measuremet Sciece, Birkhauser-Spriger, Bosto, 2009, pp

14 [7] R.D. Luce ad R. Raiffa, Games ad Decisios: Itroductio ad Critical Survey, Dover, New York, [8] R.E. Moore, R.B. Kearfott, ad M.J. Cloud, Itroductio to Iterval Aalysis, SIAM, Philadelphia, [9] H.T. Nguye, O. Kosheleva, ad V. Kreiovich, Decisio makig beyod Arrow s impossibility theorem, with the aalysis of effects of collusio ad mutual attractio, Iteratioal Joural of Itelliget Systems 24(1) (2009) [10] H.T. Nguye, V. Kreiovich, B. Wu, ad G. Xiag, Computig Statistics uder Iterval ad Fuzzy Ucertaity, Spriger Verlag, Berli, Heidelberg, [11] H.T. Nguye ad E.A. Walker, A First Course i Fuzzy Logic, Chapma ad Hall/CRC, Boca Rato, Florida, [12] M. Orshasky, W.-S. Wag, G. Xiag, ad V. Kreiovich, Iterval-based robust statistical techiques for o-egative covex fuctios, with applicatio to timig aalysis of computer chips, Proceedigs of the Secod Iteratioal Workshop o Reliable Egieerig Computig REC 2006, Savaah, Georgia, February 22-24, 2006, pp [13] S.G. Rabiovich, Measuremet Errors ad Ucertaity: Theory ad Practice, Spriger Verlag, Berli, [14] H. Raiffa, Decisio Aalysis, Addiso-Wesley, Readig, Massachusetts, [15] L.A. Zadeh, Fuzzy sets, Iformatio ad Cotrol 8(1965) A. Proof of the Result about Chips: Mai Idea Let us fix the optimal distributios for x 2,...,x ; the, Prob(D y 0 ) = p 1 (x 1 ) p 2 (x 2 )... (x 1,...,x ):D(x 1,...,x ) y 0 So, Prob(D y 0 ) = N c i q i, where v i are possible values of x 1, p i is the probability of v i, i=0 ad c i do ot deped o x 1. The restrictios o q i are: q i 0, N i=0 q i = 1, ad N q i v i = E 1. Thus, the worst-case distributio for x 1 is a solutio to the followig liear programmig (LP) problem: Miimize N c i q i uder the costraits N q i = 1 ad N i=0 i=0 q i 0, i = 0,1,2,...,N. i=0 i=0 q i v i = E 1, It is kow that i LP with N +1 ukows q 0,q 1,...,q N, N +1 costraits are equalities. I our case, we have 2 equalities, so at least N 1 costraits q i 0 are equalities. Hece, o more tha 2 values q i are o-0. If the correspodig values v or v of x 1 for which the probability is o-zero are i (x 1,x 1 ), the for [v,v ] x 1 we get the same y 0 i cotradictio to o-degeeracy. Thus, the worst-case distributio is located at x 1 ad x 1. The coditio that the mea of x 1 is E 1 leads to the desired formulas for p 1 ad p 1. Similarly, we get formulas for all other values p i ad p i. The statemet is prove.

6.3 Testing Series With Positive Terms

6.3 Testing Series With Positive Terms 6.3. TESTING SERIES WITH POSITIVE TERMS 307 6.3 Testig Series With Positive Terms 6.3. Review of what is kow up to ow I theory, testig a series a i for covergece amouts to fidig the i= sequece of partial

More information

Towards Decision Making under Interval, Set-Valued, Fuzzy, and Z-Number Uncertainty: A Fair Price Approach

Towards Decision Making under Interval, Set-Valued, Fuzzy, and Z-Number Uncertainty: A Fair Price Approach Towards Decisio Makig uder Iterval, Set-Valued, Fuzzy, ad Z-Number Ucertaity: A Fair Price Approach Joe Lorkowski, Rafik Aliev, ad Vladik Kreiovich Abstract I this paper, we explore oe of the possible

More information

A Simple Probabilistic Explanation of Term Frequency-Inverse Document Frequency (tf-idf) Heuristic (and Variations Motivated by This Explanation)

A Simple Probabilistic Explanation of Term Frequency-Inverse Document Frequency (tf-idf) Heuristic (and Variations Motivated by This Explanation) Uiversity of Texas at El Paso DigitalCommos@UTEP Departmetal Techical Reports (CS) Departmet of Computer Sciece 5-2014 A Simple Probabilistic Explaatio of Term Frequecy-Iverse Documet Frequecy (tf-idf)

More information

The standard deviation of the mean

The standard deviation of the mean Physics 6C Fall 20 The stadard deviatio of the mea These otes provide some clarificatio o the distictio betwee the stadard deviatio ad the stadard deviatio of the mea.. The sample mea ad variace Cosider

More information

A statistical method to determine sample size to estimate characteristic value of soil parameters

A statistical method to determine sample size to estimate characteristic value of soil parameters A statistical method to determie sample size to estimate characteristic value of soil parameters Y. Hojo, B. Setiawa 2 ad M. Suzuki 3 Abstract Sample size is a importat factor to be cosidered i determiig

More information

Math 113 Exam 3 Practice

Math 113 Exam 3 Practice Math Exam Practice Exam will cover.-.9. This sheet has three sectios. The first sectio will remid you about techiques ad formulas that you should kow. The secod gives a umber of practice questios for you

More information

Lecture 2: Monte Carlo Simulation

Lecture 2: Monte Carlo Simulation STAT/Q SCI 43: Itroductio to Resamplig ethods Sprig 27 Istructor: Ye-Chi Che Lecture 2: ote Carlo Simulatio 2 ote Carlo Itegratio Assume we wat to evaluate the followig itegratio: e x3 dx What ca we do?

More information

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting Lecture 6 Chi Square Distributio (χ ) ad Least Squares Fittig Chi Square Distributio (χ ) Suppose: We have a set of measuremets {x 1, x, x }. We kow the true value of each x i (x t1, x t, x t ). We would

More information

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n. Jauary 1, 2019 Resamplig Methods Motivatio We have so may estimators with the property θ θ d N 0, σ 2 We ca also write θ a N θ, σ 2 /, where a meas approximately distributed as Oce we have a cosistet estimator

More information

1 Introduction to reducing variance in Monte Carlo simulations

1 Introduction to reducing variance in Monte Carlo simulations Copyright c 010 by Karl Sigma 1 Itroductio to reducig variace i Mote Carlo simulatios 11 Review of cofidece itervals for estimatig a mea I statistics, we estimate a ukow mea µ = E(X) of a distributio by

More information

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting Lecture 6 Chi Square Distributio (χ ) ad Least Squares Fittig Chi Square Distributio (χ ) Suppose: We have a set of measuremets {x 1, x, x }. We kow the true value of each x i (x t1, x t, x t ). We would

More information

ANALYSIS OF EXPERIMENTAL ERRORS

ANALYSIS OF EXPERIMENTAL ERRORS ANALYSIS OF EXPERIMENTAL ERRORS All physical measuremets ecoutered i the verificatio of physics theories ad cocepts are subject to ucertaities that deped o the measurig istrumets used ad the coditios uder

More information

1 Approximating Integrals using Taylor Polynomials

1 Approximating Integrals using Taylor Polynomials Seughee Ye Ma 8: Week 7 Nov Week 7 Summary This week, we will lear how we ca approximate itegrals usig Taylor series ad umerical methods. Topics Page Approximatig Itegrals usig Taylor Polyomials. Defiitios................................................

More information

There is no straightforward approach for choosing the warmup period l.

There is no straightforward approach for choosing the warmup period l. B. Maddah INDE 504 Discrete-Evet Simulatio Output Aalysis () Statistical Aalysis for Steady-State Parameters I a otermiatig simulatio, the iterest is i estimatig the log ru steady state measures of performace.

More information

Across-the-Board Spending Cuts Are Very Inefficient: A Proof

Across-the-Board Spending Cuts Are Very Inefficient: A Proof Uiversity of Texas at El Paso DigitalCommos@UTEP Departmetal Techical Reports (CS) Departmet of Computer Sciece 6-2015 Across-the-Board Spedig Cuts Are Very Iefficiet: A Proof Vladik Kreiovich Uiversity

More information

Chapter 6 Sampling Distributions

Chapter 6 Sampling Distributions Chapter 6 Samplig Distributios 1 I most experimets, we have more tha oe measuremet for ay give variable, each measuremet beig associated with oe radomly selected a member of a populatio. Hece we eed to

More information

Chapter 10: Power Series

Chapter 10: Power Series Chapter : Power Series 57 Chapter Overview: Power Series The reaso series are part of a Calculus course is that there are fuctios which caot be itegrated. All power series, though, ca be itegrated because

More information

Math 113 Exam 3 Practice

Math 113 Exam 3 Practice Math Exam Practice Exam 4 will cover.-., 0. ad 0.. Note that eve though. was tested i exam, questios from that sectios may also be o this exam. For practice problems o., refer to the last review. This

More information

MAT1026 Calculus II Basic Convergence Tests for Series

MAT1026 Calculus II Basic Convergence Tests for Series MAT026 Calculus II Basic Covergece Tests for Series Egi MERMUT 202.03.08 Dokuz Eylül Uiversity Faculty of Sciece Departmet of Mathematics İzmir/TURKEY Cotets Mootoe Covergece Theorem 2 2 Series of Real

More information

Bayesian Methods: Introduction to Multi-parameter Models

Bayesian Methods: Introduction to Multi-parameter Models Bayesia Methods: Itroductio to Multi-parameter Models Parameter: θ = ( θ, θ) Give Likelihood p(y θ) ad prior p(θ ), the posterior p proportioal to p(y θ) x p(θ ) Margial posterior ( θ, θ y) is Iterested

More information

Properties and Hypothesis Testing

Properties and Hypothesis Testing Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.

More information

Math 113 Exam 4 Practice

Math 113 Exam 4 Practice Math Exam 4 Practice Exam 4 will cover.-.. This sheet has three sectios. The first sectio will remid you about techiques ad formulas that you should kow. The secod gives a umber of practice questios for

More information

1 Inferential Methods for Correlation and Regression Analysis

1 Inferential Methods for Correlation and Regression Analysis 1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet

More information

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING Lectures MODULE 5 STATISTICS II. Mea ad stadard error of sample data. Biomial distributio. Normal distributio 4. Samplig 5. Cofidece itervals

More information

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4 MATH 30: Probability ad Statistics 9. Estimatio ad Testig of Parameters Estimatio ad Testig of Parameters We have bee dealig situatios i which we have full kowledge of the distributio of a radom variable.

More information

w (1) ˆx w (1) x (1) /ρ and w (2) ˆx w (2) x (2) /ρ.

w (1) ˆx w (1) x (1) /ρ and w (2) ˆx w (2) x (2) /ρ. 2 5. Weighted umber of late jobs 5.1. Release dates ad due dates: maximimizig the weight of o-time jobs Oce we add release dates, miimizig the umber of late jobs becomes a sigificatly harder problem. For

More information

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

Discrete Mathematics for CS Spring 2008 David Wagner Note 22 CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 22 I.I.D. Radom Variables Estimatig the bias of a coi Questio: We wat to estimate the proportio p of Democrats i the US populatio, by takig

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2016 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Analysis of Experimental Measurements

Analysis of Experimental Measurements Aalysis of Experimetal Measuremets Thik carefully about the process of makig a measuremet. A measuremet is a compariso betwee some ukow physical quatity ad a stadard of that physical quatity. As a example,

More information

ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 1 MATH00030 SEMESTER / Statistics

ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 1 MATH00030 SEMESTER / Statistics ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 1 MATH00030 SEMESTER 1 018/019 DR. ANTHONY BROWN 8. Statistics 8.1. Measures of Cetre: Mea, Media ad Mode. If we have a series of umbers the

More information

Error & Uncertainty. Error. More on errors. Uncertainty. Page # The error is the difference between a TRUE value, x, and a MEASURED value, x i :

Error & Uncertainty. Error. More on errors. Uncertainty. Page # The error is the difference between a TRUE value, x, and a MEASURED value, x i : Error Error & Ucertaity The error is the differece betwee a TRUE value,, ad a MEASURED value, i : E = i There is o error-free measuremet. The sigificace of a measuremet caot be judged uless the associate

More information

Infinite Sequences and Series

Infinite Sequences and Series Chapter 6 Ifiite Sequeces ad Series 6.1 Ifiite Sequeces 6.1.1 Elemetary Cocepts Simply speakig, a sequece is a ordered list of umbers writte: {a 1, a 2, a 3,...a, a +1,...} where the elemets a i represet

More information

We are mainly going to be concerned with power series in x, such as. (x)} converges - that is, lims N n

We are mainly going to be concerned with power series in x, such as. (x)} converges - that is, lims N n Review of Power Series, Power Series Solutios A power series i x - a is a ifiite series of the form c (x a) =c +c (x a)+(x a) +... We also call this a power series cetered at a. Ex. (x+) is cetered at

More information

Optimally Sparse SVMs

Optimally Sparse SVMs A. Proof of Lemma 3. We here prove a lower boud o the umber of support vectors to achieve geeralizatio bouds of the form which we cosider. Importatly, this result holds ot oly for liear classifiers, but

More information

CHAPTER I: Vector Spaces

CHAPTER I: Vector Spaces CHAPTER I: Vector Spaces Sectio 1: Itroductio ad Examples This first chapter is largely a review of topics you probably saw i your liear algebra course. So why cover it? (1) Not everyoe remembers everythig

More information

THE KALMAN FILTER RAUL ROJAS

THE KALMAN FILTER RAUL ROJAS THE KALMAN FILTER RAUL ROJAS Abstract. This paper provides a getle itroductio to the Kalma filter, a umerical method that ca be used for sesor fusio or for calculatio of trajectories. First, we cosider

More information

CHAPTER 10 INFINITE SEQUENCES AND SERIES

CHAPTER 10 INFINITE SEQUENCES AND SERIES CHAPTER 10 INFINITE SEQUENCES AND SERIES 10.1 Sequeces 10.2 Ifiite Series 10.3 The Itegral Tests 10.4 Compariso Tests 10.5 The Ratio ad Root Tests 10.6 Alteratig Series: Absolute ad Coditioal Covergece

More information

CS434a/541a: Pattern Recognition Prof. Olga Veksler. Lecture 5

CS434a/541a: Pattern Recognition Prof. Olga Veksler. Lecture 5 CS434a/54a: Patter Recogitio Prof. Olga Veksler Lecture 5 Today Itroductio to parameter estimatio Two methods for parameter estimatio Maimum Likelihood Estimatio Bayesia Estimatio Itroducto Bayesia Decisio

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Math 2784 (or 2794W) University of Connecticut

Math 2784 (or 2794W) University of Connecticut ORDERS OF GROWTH PAT SMITH Math 2784 (or 2794W) Uiversity of Coecticut Date: Mar. 2, 22. ORDERS OF GROWTH. Itroductio Gaiig a ituitive feel for the relative growth of fuctios is importat if you really

More information

Advanced Analysis. Min Yan Department of Mathematics Hong Kong University of Science and Technology

Advanced Analysis. Min Yan Department of Mathematics Hong Kong University of Science and Technology Advaced Aalysis Mi Ya Departmet of Mathematics Hog Kog Uiversity of Sciece ad Techology September 3, 009 Cotets Limit ad Cotiuity 7 Limit of Sequece 8 Defiitio 8 Property 3 3 Ifiity ad Ifiitesimal 8 4

More information

Lesson 10: Limits and Continuity

Lesson 10: Limits and Continuity www.scimsacademy.com Lesso 10: Limits ad Cotiuity SCIMS Academy 1 Limit of a fuctio The cocept of limit of a fuctio is cetral to all other cocepts i calculus (like cotiuity, derivative, defiite itegrals

More information

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10 DS 00: Priciples ad Techiques of Data Sciece Date: April 3, 208 Name: Hypothesis Testig Discussio #0. Defie these terms below as they relate to hypothesis testig. a) Data Geeratio Model: Solutio: A set

More information

SNAP Centre Workshop. Basic Algebraic Manipulation

SNAP Centre Workshop. Basic Algebraic Manipulation SNAP Cetre Workshop Basic Algebraic Maipulatio 8 Simplifyig Algebraic Expressios Whe a expressio is writte i the most compact maer possible, it is cosidered to be simplified. Not Simplified: x(x + 4x)

More information

Stat 421-SP2012 Interval Estimation Section

Stat 421-SP2012 Interval Estimation Section Stat 41-SP01 Iterval Estimatio Sectio 11.1-11. We ow uderstad (Chapter 10) how to fid poit estimators of a ukow parameter. o However, a poit estimate does ot provide ay iformatio about the ucertaity (possible

More information

Output Analysis (2, Chapters 10 &11 Law)

Output Analysis (2, Chapters 10 &11 Law) B. Maddah ENMG 6 Simulatio Output Aalysis (, Chapters 10 &11 Law) Comparig alterative system cofiguratio Sice the output of a simulatio is radom, the comparig differet systems via simulatio should be doe

More information

This is an introductory course in Analysis of Variance and Design of Experiments.

This is an introductory course in Analysis of Variance and Design of Experiments. 1 Notes for M 384E, Wedesday, Jauary 21, 2009 (Please ote: I will ot pass out hard-copy class otes i future classes. If there are writte class otes, they will be posted o the web by the ight before class

More information

Chapter 6 Infinite Series

Chapter 6 Infinite Series Chapter 6 Ifiite Series I the previous chapter we cosidered itegrals which were improper i the sese that the iterval of itegratio was ubouded. I this chapter we are goig to discuss a topic which is somewhat

More information

ECE 901 Lecture 12: Complexity Regularization and the Squared Loss

ECE 901 Lecture 12: Complexity Regularization and the Squared Loss ECE 90 Lecture : Complexity Regularizatio ad the Squared Loss R. Nowak 5/7/009 I the previous lectures we made use of the Cheroff/Hoeffdig bouds for our aalysis of classifier errors. Hoeffdig s iequality

More information

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara Poit Estimator Eco 325 Notes o Poit Estimator ad Cofidece Iterval 1 By Hiro Kasahara Parameter, Estimator, ad Estimate The ormal probability desity fuctio is fully characterized by two costats: populatio

More information

Seunghee Ye Ma 8: Week 5 Oct 28

Seunghee Ye Ma 8: Week 5 Oct 28 Week 5 Summary I Sectio, we go over the Mea Value Theorem ad its applicatios. I Sectio 2, we will recap what we have covered so far this term. Topics Page Mea Value Theorem. Applicatios of the Mea Value

More information

EECS564 Estimation, Filtering, and Detection Hwk 2 Solns. Winter p θ (z) = (2θz + 1 θ), 0 z 1

EECS564 Estimation, Filtering, and Detection Hwk 2 Solns. Winter p θ (z) = (2θz + 1 θ), 0 z 1 EECS564 Estimatio, Filterig, ad Detectio Hwk 2 Sols. Witer 25 4. Let Z be a sigle observatio havig desity fuctio where. p (z) = (2z + ), z (a) Assumig that is a oradom parameter, fid ad plot the maximum

More information

Convergence of random variables. (telegram style notes) P.J.C. Spreij

Convergence of random variables. (telegram style notes) P.J.C. Spreij Covergece of radom variables (telegram style otes).j.c. Spreij this versio: September 6, 2005 Itroductio As we kow, radom variables are by defiitio measurable fuctios o some uderlyig measurable space

More information

Tests of Hypotheses Based on a Single Sample (Devore Chapter Eight)

Tests of Hypotheses Based on a Single Sample (Devore Chapter Eight) Tests of Hypotheses Based o a Sigle Sample Devore Chapter Eight MATH-252-01: Probability ad Statistics II Sprig 2018 Cotets 1 Hypothesis Tests illustrated with z-tests 1 1.1 Overview of Hypothesis Testig..........

More information

1 Review of Probability & Statistics

1 Review of Probability & Statistics 1 Review of Probability & Statistics a. I a group of 000 people, it has bee reported that there are: 61 smokers 670 over 5 960 people who imbibe (drik alcohol) 86 smokers who imbibe 90 imbibers over 5

More information

4.3 Growth Rates of Solutions to Recurrences

4.3 Growth Rates of Solutions to Recurrences 4.3. GROWTH RATES OF SOLUTIONS TO RECURRENCES 81 4.3 Growth Rates of Solutios to Recurreces 4.3.1 Divide ad Coquer Algorithms Oe of the most basic ad powerful algorithmic techiques is divide ad coquer.

More information

Expectation and Variance of a random variable

Expectation and Variance of a random variable Chapter 11 Expectatio ad Variace of a radom variable The aim of this lecture is to defie ad itroduce mathematical Expectatio ad variace of a fuctio of discrete & cotiuous radom variables ad the distributio

More information

Lecture 3: August 31

Lecture 3: August 31 36-705: Itermediate Statistics Fall 018 Lecturer: Siva Balakrisha Lecture 3: August 31 This lecture will be mostly a summary of other useful expoetial tail bouds We will ot prove ay of these i lecture,

More information

Properties and Tests of Zeros of Polynomial Functions

Properties and Tests of Zeros of Polynomial Functions Properties ad Tests of Zeros of Polyomial Fuctios The Remaider ad Factor Theorems: Sythetic divisio ca be used to fid the values of polyomials i a sometimes easier way tha substitutio. This is show by

More information

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering CEE 5 Autum 005 Ucertaity Cocepts for Geotechical Egieerig Basic Termiology Set A set is a collectio of (mutually exclusive) objects or evets. The sample space is the (collectively exhaustive) collectio

More information

10-701/ Machine Learning Mid-term Exam Solution

10-701/ Machine Learning Mid-term Exam Solution 0-70/5-78 Machie Learig Mid-term Exam Solutio Your Name: Your Adrew ID: True or False (Give oe setece explaatio) (20%). (F) For a cotiuous radom variable x ad its probability distributio fuctio p(x), it

More information

Machine Learning Theory Tübingen University, WS 2016/2017 Lecture 12

Machine Learning Theory Tübingen University, WS 2016/2017 Lecture 12 Machie Learig Theory Tübige Uiversity, WS 06/07 Lecture Tolstikhi Ilya Abstract I this lecture we derive risk bouds for kerel methods. We will start by showig that Soft Margi kerel SVM correspods to miimizig

More information

The random version of Dvoretzky s theorem in l n

The random version of Dvoretzky s theorem in l n The radom versio of Dvoretzky s theorem i l Gideo Schechtma Abstract We show that with high probability a sectio of the l ball of dimesio k cε log c > 0 a uiversal costat) is ε close to a multiple of the

More information

11 Correlation and Regression

11 Correlation and Regression 11 Correlatio Regressio 11.1 Multivariate Data Ofte we look at data where several variables are recorded for the same idividuals or samplig uits. For example, at a coastal weather statio, we might record

More information

Frequentist Inference

Frequentist Inference Frequetist Iferece The topics of the ext three sectios are useful applicatios of the Cetral Limit Theorem. Without kowig aythig about the uderlyig distributio of a sequece of radom variables {X i }, for

More information

Linear regression. Daniel Hsu (COMS 4771) (y i x T i β)2 2πσ. 2 2σ 2. 1 n. (x T i β y i ) 2. 1 ˆβ arg min. β R n d

Linear regression. Daniel Hsu (COMS 4771) (y i x T i β)2 2πσ. 2 2σ 2. 1 n. (x T i β y i ) 2. 1 ˆβ arg min. β R n d Liear regressio Daiel Hsu (COMS 477) Maximum likelihood estimatio Oe of the simplest liear regressio models is the followig: (X, Y ),..., (X, Y ), (X, Y ) are iid radom pairs takig values i R d R, ad Y

More information

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 3 9/11/2013. Large deviations Theory. Cramér s Theorem

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 3 9/11/2013. Large deviations Theory. Cramér s Theorem MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/5.070J Fall 203 Lecture 3 9//203 Large deviatios Theory. Cramér s Theorem Cotet.. Cramér s Theorem. 2. Rate fuctio ad properties. 3. Chage of measure techique.

More information

MIDTERM 3 CALCULUS 2. Monday, December 3, :15 PM to 6:45 PM. Name PRACTICE EXAM SOLUTIONS

MIDTERM 3 CALCULUS 2. Monday, December 3, :15 PM to 6:45 PM. Name PRACTICE EXAM SOLUTIONS MIDTERM 3 CALCULUS MATH 300 FALL 08 Moday, December 3, 08 5:5 PM to 6:45 PM Name PRACTICE EXAM S Please aswer all of the questios, ad show your work. You must explai your aswers to get credit. You will

More information

ECE 8527: Introduction to Machine Learning and Pattern Recognition Midterm # 1. Vaishali Amin Fall, 2015

ECE 8527: Introduction to Machine Learning and Pattern Recognition Midterm # 1. Vaishali Amin Fall, 2015 ECE 8527: Itroductio to Machie Learig ad Patter Recogitio Midterm # 1 Vaishali Ami Fall, 2015 tue39624@temple.edu Problem No. 1: Cosider a two-class discrete distributio problem: ω 1 :{[0,0], [2,0], [2,2],

More information

An Introduction to Randomized Algorithms

An Introduction to Randomized Algorithms A Itroductio to Radomized Algorithms The focus of this lecture is to study a radomized algorithm for quick sort, aalyze it usig probabilistic recurrece relatios, ad also provide more geeral tools for aalysis

More information

CSE 527, Additional notes on MLE & EM

CSE 527, Additional notes on MLE & EM CSE 57 Lecture Notes: MLE & EM CSE 57, Additioal otes o MLE & EM Based o earlier otes by C. Grat & M. Narasimha Itroductio Last lecture we bega a examiatio of model based clusterig. This lecture will be

More information

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample. Statistical Iferece (Chapter 10) Statistical iferece = lear about a populatio based o the iformatio provided by a sample. Populatio: The set of all values of a radom variable X of iterest. Characterized

More information

62. Power series Definition 16. (Power series) Given a sequence {c n }, the series. c n x n = c 0 + c 1 x + c 2 x 2 + c 3 x 3 +

62. Power series Definition 16. (Power series) Given a sequence {c n }, the series. c n x n = c 0 + c 1 x + c 2 x 2 + c 3 x 3 + 62. Power series Defiitio 16. (Power series) Give a sequece {c }, the series c x = c 0 + c 1 x + c 2 x 2 + c 3 x 3 + is called a power series i the variable x. The umbers c are called the coefficiets of

More information

Problem Set 4 Due Oct, 12

Problem Set 4 Due Oct, 12 EE226: Radom Processes i Systems Lecturer: Jea C. Walrad Problem Set 4 Due Oct, 12 Fall 06 GSI: Assae Gueye This problem set essetially reviews detectio theory ad hypothesis testig ad some basic otios

More information

Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y.

Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y. Testig Statistical Hypotheses Recall the study where we estimated the differece betwee mea systolic blood pressure levels of users of oral cotraceptives ad o-users, x - y. Such studies are sometimes viewed

More information

Part I: Covers Sequence through Series Comparison Tests

Part I: Covers Sequence through Series Comparison Tests Part I: Covers Sequece through Series Compariso Tests. Give a example of each of the followig: (a) A geometric sequece: (b) A alteratig sequece: (c) A sequece that is bouded, but ot coverget: (d) A sequece

More information

Journal of Multivariate Analysis. Superefficient estimation of the marginals by exploiting knowledge on the copula

Journal of Multivariate Analysis. Superefficient estimation of the marginals by exploiting knowledge on the copula Joural of Multivariate Aalysis 102 (2011) 1315 1319 Cotets lists available at ScieceDirect Joural of Multivariate Aalysis joural homepage: www.elsevier.com/locate/jmva Superefficiet estimatio of the margials

More information

CHAPTER 8 FUNDAMENTAL SAMPLING DISTRIBUTIONS AND DATA DESCRIPTIONS. 8.1 Random Sampling. 8.2 Some Important Statistics

CHAPTER 8 FUNDAMENTAL SAMPLING DISTRIBUTIONS AND DATA DESCRIPTIONS. 8.1 Random Sampling. 8.2 Some Important Statistics CHAPTER 8 FUNDAMENTAL SAMPLING DISTRIBUTIONS AND DATA DESCRIPTIONS 8.1 Radom Samplig The basic idea of the statistical iferece is that we are allowed to draw ifereces or coclusios about a populatio based

More information

Lecture 7: October 18, 2017

Lecture 7: October 18, 2017 Iformatio ad Codig Theory Autum 207 Lecturer: Madhur Tulsiai Lecture 7: October 8, 207 Biary hypothesis testig I this lecture, we apply the tools developed i the past few lectures to uderstad the problem

More information

4. Partial Sums and the Central Limit Theorem

4. Partial Sums and the Central Limit Theorem 1 of 10 7/16/2009 6:05 AM Virtual Laboratories > 6. Radom Samples > 1 2 3 4 5 6 7 4. Partial Sums ad the Cetral Limit Theorem The cetral limit theorem ad the law of large umbers are the two fudametal theorems

More information

MA131 - Analysis 1. Workbook 3 Sequences II

MA131 - Analysis 1. Workbook 3 Sequences II MA3 - Aalysis Workbook 3 Sequeces II Autum 2004 Cotets 2.8 Coverget Sequeces........................ 2.9 Algebra of Limits......................... 2 2.0 Further Useful Results........................

More information

2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2

2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2 Chapter 8 Comparig Two Treatmets Iferece about Two Populatio Meas We wat to compare the meas of two populatios to see whether they differ. There are two situatios to cosider, as show i the followig examples:

More information

Chapter 6 Principles of Data Reduction

Chapter 6 Principles of Data Reduction Chapter 6 for BST 695: Special Topics i Statistical Theory. Kui Zhag, 0 Chapter 6 Priciples of Data Reductio Sectio 6. Itroductio Goal: To summarize or reduce the data X, X,, X to get iformatio about a

More information

BIOSTATISTICS. Lecture 5 Interval Estimations for Mean and Proportion. dr. Petr Nazarov

BIOSTATISTICS. Lecture 5 Interval Estimations for Mean and Proportion. dr. Petr Nazarov Microarray Ceter BIOSTATISTICS Lecture 5 Iterval Estimatios for Mea ad Proportio dr. Petr Nazarov 15-03-013 petr.azarov@crp-sate.lu Lecture 5. Iterval estimatio for mea ad proportio OUTLINE Iterval estimatios

More information

Statistics 511 Additional Materials

Statistics 511 Additional Materials Cofidece Itervals o mu Statistics 511 Additioal Materials This topic officially moves us from probability to statistics. We begi to discuss makig ifereces about the populatio. Oe way to differetiate probability

More information

GUIDELINES ON REPRESENTATIVE SAMPLING

GUIDELINES ON REPRESENTATIVE SAMPLING DRUGS WORKING GROUP VALIDATION OF THE GUIDELINES ON REPRESENTATIVE SAMPLING DOCUMENT TYPE : REF. CODE: ISSUE NO: ISSUE DATE: VALIDATION REPORT DWG-SGL-001 002 08 DECEMBER 2012 Ref code: DWG-SGL-001 Issue

More information

Chapter 9: Numerical Differentiation

Chapter 9: Numerical Differentiation 178 Chapter 9: Numerical Differetiatio Numerical Differetiatio Formulatio of equatios for physical problems ofte ivolve derivatives (rate-of-chage quatities, such as velocity ad acceleratio). Numerical

More information

Measure and Measurable Functions

Measure and Measurable Functions 3 Measure ad Measurable Fuctios 3.1 Measure o a Arbitrary σ-algebra Recall from Chapter 2 that the set M of all Lebesgue measurable sets has the followig properties: R M, E M implies E c M, E M for N implies

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Study on Coal Consumption Curve Fitting of the Thermal Power Based on Genetic Algorithm

Study on Coal Consumption Curve Fitting of the Thermal Power Based on Genetic Algorithm Joural of ad Eergy Egieerig, 05, 3, 43-437 Published Olie April 05 i SciRes. http://www.scirp.org/joural/jpee http://dx.doi.org/0.436/jpee.05.34058 Study o Coal Cosumptio Curve Fittig of the Thermal Based

More information

Discrete Mathematics for CS Spring 2007 Luca Trevisan Lecture 22

Discrete Mathematics for CS Spring 2007 Luca Trevisan Lecture 22 CS 70 Discrete Mathematics for CS Sprig 2007 Luca Trevisa Lecture 22 Aother Importat Distributio The Geometric Distributio Questio: A biased coi with Heads probability p is tossed repeatedly util the first

More information

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc. Chapter 22 Comparig Two Proportios Copyright 2010, 2007, 2004 Pearso Educatio, Ic. Comparig Two Proportios Read the first two paragraphs of pg 504. Comparisos betwee two percetages are much more commo

More information

Let us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f.

Let us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f. Lecture 5 Let us give oe more example of MLE. Example 3. The uiform distributio U[0, ] o the iterval [0, ] has p.d.f. { 1 f(x =, 0 x, 0, otherwise The likelihood fuctio ϕ( = f(x i = 1 I(X 1,..., X [0,

More information

Sample Size Determination (Two or More Samples)

Sample Size Determination (Two or More Samples) Sample Sie Determiatio (Two or More Samples) STATGRAPHICS Rev. 963 Summary... Data Iput... Aalysis Summary... 5 Power Curve... 5 Calculatios... 6 Summary This procedure determies a suitable sample sie

More information

Math 113, Calculus II Winter 2007 Final Exam Solutions

Math 113, Calculus II Winter 2007 Final Exam Solutions Math, Calculus II Witer 7 Fial Exam Solutios (5 poits) Use the limit defiitio of the defiite itegral ad the sum formulas to compute x x + dx The check your aswer usig the Evaluatio Theorem Solutio: I this

More information

Modeling and Estimation of a Bivariate Pareto Distribution using the Principle of Maximum Entropy

Modeling and Estimation of a Bivariate Pareto Distribution using the Principle of Maximum Entropy Sri Laka Joural of Applied Statistics, Vol (5-3) Modelig ad Estimatio of a Bivariate Pareto Distributio usig the Priciple of Maximum Etropy Jagathath Krisha K.M. * Ecoomics Research Divisio, CSIR-Cetral

More information

Statistical inference: example 1. Inferential Statistics

Statistical inference: example 1. Inferential Statistics Statistical iferece: example 1 Iferetial Statistics POPULATION SAMPLE A clothig store chai regularly buys from a supplier large quatities of a certai piece of clothig. Each item ca be classified either

More information

ME NUMERICAL METHODS Fall 2007

ME NUMERICAL METHODS Fall 2007 ME - 310 NUMERICAL METHODS Fall 2007 Group 02 Istructor: Prof. Dr. Eres Söylemez (Rm C205, email:eres@metu.edu.tr ) Class Hours ad Room: Moday 13:40-15:30 Rm: B101 Wedesday 12:40-13:30 Rm: B103 Course

More information

Advanced Stochastic Processes.

Advanced Stochastic Processes. Advaced Stochastic Processes. David Gamarik LECTURE 2 Radom variables ad measurable fuctios. Strog Law of Large Numbers (SLLN). Scary stuff cotiued... Outlie of Lecture Radom variables ad measurable fuctios.

More information

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01 ENGI 44 Cofidece Itervals (Two Samples) Page -0 Two Sample Cofidece Iterval for a Differece i Populatio Meas [Navidi sectios 5.4-5.7; Devore chapter 9] From the cetral limit theorem, we kow that, for sufficietly

More information