Lecture 1: Probability and Statistics (K.K. Gan)

Benjamin Disraeli, British statesman and literary figure (1804-1881): "There are three kinds of lies: lies, damned lies, and statistics." (quote via Wikipedia; popularized in the US by Mark Twain)
- the statement shows the persuasive power of numbers
  - use of statistics to bolster weak arguments
  - tendency of people to disparage statistics that do not support their positions
The purpose of P3700:
- how to understand the statistical uncertainty of an observation/measurement
- how to use statistics to argue against a weak argument (or bolster a weak argument?)
- how to argue against people disparaging statistics that do not support their positions
- how to lie with statistics?

Why is there statistical uncertainty?
- You sold 7 pieces of cryogenic equipment last month.
  - You know how to count, and 7 is the exact number of pieces sold: there is no uncertainty on 7!
  - However, if you use the statistic of 7 to predict future sales or compare with past sales, there is an uncertainty on 7:
    - the number of pieces sold next month could be 5, 8, or 10!
    - must include the uncertainty in the calculation
- What is the uncertainty on 7? Lecture 2: √7 ≈ 2.6
  - there is a 68% chance that the expected number of pieces sold per month is 4.4 to 9.6
  - however, the number sold per month is a discrete number, so there is a ~68% chance that the expected number sold per month is 4 to 10
  - should use Poisson statistics as in Lecture 2 for a more precise prediction
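A minimal numerical sketch of this point (assuming numpy is available; treating 7 as the true mean monthly rate is an assumption for illustration): simulating many Poisson-distributed months shows the counts scatter by about √7 ≈ 2.6 around the mean.

```python
# Sketch: Poisson scatter of a monthly count whose true mean rate is 7.
import numpy as np

rng = np.random.default_rng(seed=1)
true_rate = 7.0                                  # assumed true mean sales/month
months = rng.poisson(lam=true_rate, size=100_000)

print("mean of simulated counts:", months.mean())    # ~7
print("std dev of simulated counts:", months.std())  # ~sqrt(7) ~ 2.65
within = np.mean(np.abs(months - true_rate) <= np.sqrt(true_rate))
print("fraction within +/- sqrt(7):", within)        # ~0.66 (discrete counts)
```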

Introduction:
- Understanding of many physical phenomena depends on statistical and probabilistic concepts:
  - Statistical Mechanics (physics of systems composed of many parts: gases, liquids, solids)
    - 1 mole of anything contains 6x10^23 particles (Avogadro's number)
    - impossible to keep track of all 6x10^23 particles even with the fastest computer imaginable
    - resort to learning about the group properties of all the particles
    - partition function: calculate energy, entropy, pressure... of a system
  - Quantum Mechanics (physics at the atomic or smaller scale)
    - wavefunction = probability amplitude
    - probability of an electron being located at (x, y, z) at a certain time
- Understanding/interpretation of experimental data depends on statistical and probabilistic concepts:
  - how do we extract the best value of a quantity from a set of measurements?
  - how do we decide if our experiment is consistent/inconsistent with a given theory?
  - how do we decide if our experiment is internally consistent?
  - how do we decide if our experiment is consistent with other experiments?
- In this course we will concentrate on the above experimental issues!

Definition of probability:
- Suppose we have N trials and a specified event occurs r times.
  - example: rolling a die, where the event could be rolling a 6
  - define the probability (P) of an event (E) occurring as:
    P(E) = r/N   as N → ∞
  - examples:
    - six-sided die: P(6) = 1/6
    - coin toss: P(heads) = 0.5
      - P(heads) should approach 0.5 the more times you toss the coin; for a single coin toss we can never get P(heads) = 0.5!
- By definition, probability is a non-negative real number bounded by 0 ≤ P ≤ 1
  - if P = 0 the event never occurs
  - if P = 1 the event always occurs
  - the sum (or integral) of all probabilities, if the events are mutually exclusive, must equal 1
- Events are independent if: P(A∩B) = P(A)P(B)   (∩ intersection, ∪ union)
  - coin tosses are independent events: the result of the next toss does not depend on the previous toss
- Events are mutually exclusive (disjoint) if: P(A∩B) = 0, or equivalently P(A∪B) = P(A) + P(B)
  - in coin tossing, we either get a head or a tail
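A small sketch of the frequency definition (assuming numpy is available): the empirical frequency r/N of heads only approaches P(heads) = 0.5 as the number of tosses grows.

```python
# Sketch: r/N for a fair coin at increasing N.
import numpy as np

rng = np.random.default_rng(seed=2)
for n in (10, 100, 10_000, 1_000_000):
    r = rng.integers(0, 2, size=n).sum()   # number of heads in n fair tosses
    print(f"N = {n:>9}: r/N = {r / n:.4f}")
```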

Probability can be a discrete or a continuous variable.
- Discrete probability: P can have certain values only.
  - examples:
    - tossing a six-sided die: P(x_i) = P_i, where x_i = 1, 2, 3, 4, 5, 6 and P_i = 1/6 for all x_i
    - tossing a coin: only 2 choices, heads or tails
  - for both of the above discrete examples (and in general), when we sum over all mutually exclusive possibilities:
    Σ_i P(x_i) = 1
- Continuous probability: P can be any number between 0 and 1.
  - define a probability density function (pdf) f(x):
    f(x) dx = dP(x ≤ α ≤ x + dx), with α a continuous variable
  - the probability for x to be in the range a ≤ x ≤ b is:
    P(a ≤ x ≤ b) = ∫_a^b f(x) dx
  - just like the discrete case, the sum of all probabilities must equal 1:
    ∫_{-∞}^{+∞} f(x) dx = 1   (f(x) is normalized to one)
  - the probability for x to be exactly some number a is zero, since:
    ∫_{x=a}^{x=a} f(x) dx = 0
- Notation: x_i is called a random variable.
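A numerical sketch of these pdf properties (assuming scipy is available; the standard Gaussian is just a convenient example pdf, not part of this slide):

```python
# Sketch: normalization, interval probability, and zero-width intervals.
import numpy as np
from scipy.integrate import quad

f = lambda x: np.exp(-x**2 / 2) / np.sqrt(2 * np.pi)  # standard Gaussian pdf

total, _ = quad(f, -np.inf, np.inf)
p_ab, _ = quad(f, -1.0, 1.0)
print("integral of f over all x:", total)        # 1.0 (normalized)
print("P(-1 <= x <= 1):", p_ab)                  # ~0.683
print("P(x exactly 0):", quad(f, 0.0, 0.0)[0])   # 0: a point has zero width
```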

Examples of some common P(x)'s and f(x)'s:
- Discrete = P(x): binomial, Poisson
- Continuous = f(x): uniform (i.e. constant), Gaussian, exponential, chi square
How do we describe a probability distribution?
- mean, mode, median, and variance
- for a continuous distribution, these quantities are defined by:
  - Mean (average):        µ = ∫_{-∞}^{+∞} x f(x) dx
  - Mode (most probable):  ∂f(x)/∂x |_{x=a} = 0
  - Median (50% point):    0.5 = ∫_{-∞}^{a} f(x) dx
  - Variance (width of distribution): σ² = ∫_{-∞}^{+∞} f(x) (x - µ)² dx
- for a discrete distribution, the mean and variance are defined by:
  µ = (1/n) Σ x_i
  σ² = (1/n) Σ (x_i - µ)²
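A short sketch of the discrete definitions (assuming numpy is available), using a fair six-sided die as the example distribution:

```python
# Sketch: mean and variance of a fair die from the definitions above.
import numpy as np

x = np.arange(1, 7)          # outcomes 1..6
p = np.full(6, 1 / 6)        # P_i = 1/6 for every outcome

mu = np.sum(x * p)                       # mean: sum of x_i P_i
var = np.sum(p * (x - mu) ** 2)          # variance: sum of P_i (x_i - mu)^2
print("mean =", mu, " variance =", var)  # 3.5 and ~2.917
```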

Some continuous pdfs:
- Probability is the area under the curves!
[Figures: a symmetric distribution (Gaussian) with mode, median, and mean at the same x and width σ; an asymmetric distribution showing the mean, median, and mode at different locations]
- For a Gaussian pdf, the mean, mode, and median are all at the same x.
- For most pdfs, the mean, mode, and median are at different locations.
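A sketch of this contrast (assuming scipy and numpy are available; the exponential is a stand-in for the asymmetric pdf in the figure):

```python
# Sketch: mean, median, and mode for a symmetric vs an asymmetric pdf.
import numpy as np
from scipy.stats import expon, norm

x = np.linspace(0, 10, 100_001)
for name, dist in (("exponential", expon()), ("gaussian", norm(loc=5))):
    mode = x[np.argmax(dist.pdf(x))]          # most probable value on the grid
    print(f"{name:>11}: mode = {mode:.2f}, "
          f"median = {dist.median():.2f}, mean = {dist.mean():.2f}")
# exponential: mode 0.00, median 0.69, mean 1.00 -> all different
# gaussian:    mode 5.00, median 5.00, mean 5.00 -> all the same
```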

Calculation of mean and variance:
- example: a discrete data set consisting of three numbers: {1, 2, 3}
  - the average (µ) is just:
    µ = Σ x_i / n = (1 + 2 + 3)/3 = 2
  - complication: suppose some measurements are more precise than others.
    - if each measurement x_i has a weight w_i associated with it:
      µ = Σ w_i x_i / Σ w_i   (weighted average)
  - the variance (σ²), or average squared deviation from the mean, is just:
    σ² = (1/n) Σ (x_i - µ)²
    - the variance describes the width of the pdf!
    - σ is called the standard deviation
  - rewrite the above expression by expanding the summations:
    σ² = (1/n) [Σ x_i² - 2µ Σ x_i + n µ²]
       = (1/n) Σ x_i² - 2µ² + µ²
       = (1/n) Σ x_i² - µ²
       = <x²> - <x>²   (< > denotes average)
    - the n in the denominator would be n - 1 if we determined the average (µ) from the data itself
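A quick numerical check of the expansion (assuming numpy is available; the weights are made-up values for illustration):

```python
# Sketch: sigma^2 = <x^2> - <x>^2 on {1, 2, 3}, plus a weighted mean.
import numpy as np

data = np.array([1.0, 2.0, 3.0])
mu = data.mean()                            # (1+2+3)/3 = 2
var = np.mean(data**2) - mu**2              # 14/3 - 4 ~ 0.667
print("mu =", mu, " sigma^2 =", var)
print("check:", np.mean((data - mu)**2))    # same value, direct definition

w = np.array([1.0, 2.0, 1.0])               # hypothetical weights
print("weighted mean =", np.sum(w * data) / np.sum(w))
```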

- using the definition of µ from above, we have for our example of {1, 2, 3}:
  σ² = (1/n) Σ x_i² - µ² = 4.67 - 2² = 0.67
- the case where the measurements have different weights is more complicated:
  σ² = Σ w_i (x_i - µ)² / Σ w_i = Σ w_i x_i² / Σ w_i - µ²
  - µ is the weighted mean
  - if we calculated µ from the data, σ² gets multiplied by a factor n/(n-1)
- example: a continuous probability distribution, f(x) = sin²x for 0 ≤ x ≤ 2π
  - has two modes!
  - has the same mean and median, but they differ from the mode(s)
  - f(x) is not properly normalized: ∫_0^{2π} sin²x dx = π ≠ 1
  - normalized pdf: f(x) = sin²x / π
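A sketch of the normalization check (assuming scipy is available):

```python
# Sketch: sin^2(x) on [0, 2*pi] integrates to pi, so f(x) = sin^2(x)/pi
# is the properly normalized pdf.
import numpy as np
from scipy.integrate import quad

raw, _ = quad(lambda x: np.sin(x)**2, 0, 2 * np.pi)
print("integral of sin^2 over [0, 2pi]:", raw)           # pi ~ 3.1416
norm_, _ = quad(lambda x: np.sin(x)**2 / np.pi, 0, 2 * np.pi)
print("integral of normalized pdf:", norm_)              # 1.0
```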

- for continuous probability distributions, the mean, mode, and median are calculated using integrals or derivatives:
  - mean:   µ = (1/π) ∫_0^{2π} x sin²x dx = π
  - mode:   ∂(sin²x)/∂x = 0  →  x = π/2, 3π/2
  - median: (1/π) ∫_0^{α} sin²x dx = 1/2  →  α = π
- example: the Gaussian distribution function, a continuous probability distribution:
  p(x) = (1 / (σ√(2π))) e^{-(x-µ)² / (2σ²)}
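A numerical sketch of these integrals (assuming scipy is available; the root-finder is one way to solve the median equation, not part of the lecture):

```python
# Sketch: mean and median of f(x) = sin^2(x)/pi on [0, 2*pi]; both are pi.
import numpy as np
from scipy.integrate import quad
from scipy.optimize import brentq

pdf = lambda x: np.sin(x)**2 / np.pi

mean, _ = quad(lambda x: x * pdf(x), 0, 2 * np.pi)
# median: the point a where the cumulative integral reaches 0.5
median = brentq(lambda a: quad(pdf, 0, a)[0] - 0.5, 0.1, 2 * np.pi)
print("mean =", mean, " median =", median)    # both ~3.1416
```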

Accuracy and Precision:
- Accuracy: how close the experimental measurement is to the true value of the quantity being measured.
- Precision: how well the experimental result has been determined, without regard to the true value of the quantity being measured.
  - just because an experiment is precise does not mean it is accurate!!
[Figures: measurements that are accurate but not precise, and precise but not accurate]
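A simulated illustration of the distinction (assuming numpy is available; the true value and the two hypothetical instruments are made up for this sketch):

```python
# Sketch: a biased instrument with small scatter is precise but not accurate;
# an unbiased instrument with large scatter is accurate but not precise.
import numpy as np

rng = np.random.default_rng(seed=3)
true_value = 10.0
precise_biased   = rng.normal(loc=10.5, scale=0.05, size=1000)  # offset, tight
accurate_diffuse = rng.normal(loc=10.0, scale=0.50, size=1000)  # centered, wide

for name, m in (("precise, not accurate", precise_biased),
                ("accurate, not precise", accurate_diffuse)):
    print(f"{name}: mean = {m.mean():.3f}, std = {m.std():.3f}")
```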

Measurement Errors (Uncertainties)
- Use results from probability and statistics as a way of indicating how good a measurement is.
- most common quality indicator:
  relative precision = [uncertainty of measurement] / measurement
  - example: we measure a table to be 10 inches with an uncertainty of 1 inch
    relative precision = 1/10 = 0.1, or 10% (percent relative precision)
- the uncertainty in a measurement is usually the square root of the variance:
  σ = standard deviation
  - usually calculated using the technique of propagation of errors (Lecture 4)
Statistical and Systematic Errors
- Results from experiments are often presented as:
  N ± XX ± YY
  - N: value of the quantity measured (or determined) by the experiment
  - XX: statistical error, usually assumed to be from a Gaussian distribution
    - with the assumption of Gaussian statistics we can say (calculate) something about how well our experiment agrees with other experiments and/or theories
    - expect a 68% chance that the true value is between N - XX and N + XX
  - YY: systematic error; hard to estimate, distribution of errors usually not known
  - examples:
    mass of proton = 0.9382769 ± 0.0000027 GeV (only statistical error given)
    mass of W boson = 80.8 ± 1.5 ± 2.4 GeV
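A trivial sketch of the quality indicator, using the table example from this slide:

```python
# Sketch: relative precision = uncertainty / measurement.
measurement = 10.0   # inches
uncertainty = 1.0    # inches
relative_precision = uncertainty / measurement
print(f"relative precision = {relative_precision:.0%}")   # 10%
```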

What's the difference between statistical and systematic errors?
  N ± XX ± YY
- statistical errors are random in the sense that if we repeat the measurement enough times: XX → 0
- systematic errors do not → 0 with repetition
  - examples of sources of systematic errors:
    - a voltmeter not calibrated properly
    - a ruler not the length we think it is (a meter stick might really be < 1 meter!)
  - because of systematic errors, an experimental result can be precise, but not accurate!
How do we combine systematic and statistical errors to get one estimate of precision?
- big problem! two choices:
  - σ_tot = XX + YY: add them linearly
  - σ_tot = (XX² + YY²)^{1/2}: add them in quadrature
    - widely accepted practice if XX and YY are not correlated (errors not of the same origin, e.g. not both from the same voltmeter)
    - gives a smaller σ_tot!
Some other ways of quoting experimental results:
- lower limit: the mass of particle X is > 100 GeV
- upper limit: the mass of particle X is < 100 GeV
- asymmetric errors: mass of particle X = 100 (+4, -3) GeV
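A sketch of the two combination rules (assuming numpy is available), using the W boson errors quoted on the previous slide:

```python
# Sketch: combining a statistical error XX with a systematic error YY.
import numpy as np

xx, yy = 1.5, 2.4                 # GeV: statistical and systematic errors
linear = xx + yy                  # conservative linear sum
quadrature = np.hypot(xx, yy)     # sqrt(XX^2 + YY^2), for uncorrelated errors
print(f"linear: {linear:.1f} GeV, quadrature: {quadrature:.1f} GeV")
# quadrature (~2.8 GeV) is smaller than the linear sum (3.9 GeV)
```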

How to present your measured values:
- Don't quote any measurement to more than three significant digits
  - three significant digits means you measured a quantity to 1 part in a thousand, i.e. 0.1% precision
  - it is difficult to achieve 0.1% precision
  - acceptable to quote more than three significant digits if you have a large data sample (e.g. large simulations)
- Don't quote any uncertainty to more than two significant digits
- Measurement and uncertainty should have the same number of digits:
  991 ± 57
  0.231 ± 0.013
  (5.98 ± 0.43)x10^-5
- follow this rule in the lab report!
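A sketch of these rounding rules (the quote() helper is hypothetical, written for this note, not from the lecture):

```python
# Sketch: round the uncertainty to two significant digits and the value to
# the same decimal place, per the presentation rules above.
from math import floor, log10

def quote(value, err):
    # decimal place that keeps two significant digits of the uncertainty
    digits = 1 - int(floor(log10(abs(err))))
    v, e = round(value, digits), round(err, digits)
    if digits <= 0:
        v, e = int(v), int(e)
    return f"{v} +/- {e}"

print(quote(991.234, 56.89))      # 991 +/- 57
print(quote(0.23117, 0.01294))    # 0.231 +/- 0.013
```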