Statistical Methods. Sample spaces and events. Axioms of Probability. UCLA M284 Principles of Neuroimaging A
|
|
- Winfred Phillips
- 5 years ago
- Views:
Transcription
1 UCL M84 riciples of euroimagig Statistical Methods Istructor: : Ivo Diov, sst. rof. I Statistics ad eurology Uiversity of Califoria, Los geles, Witer Outlie robability Theory xioms Basic riciples for probability modelig ad computatio Law of Total robability & Bayesia Theorem Data Summaries ad ED Distributios ( Experimets & Demos ( Statistical Iferece Hypothesis Testig & Cofidece itervals arameter Estimatio arametric vs. o-parametric iferece ( CLT & LL Liear modelig Simple liear regressio, Multiple liear regressio OV & GLM Outlie robability Theory xioms Basic riciples for probability modelig ad computatio Law of Total robability & Bayesia Theorem Distributios ( Experimets & Demos ( Statistical Iferece Hypothesis Testig & Cofidece itervals arameter Estimatio arametric vs. o-parametric iferece ( Liear modelig Simple liear regressio, Multiple liear regressio OV & GLM Sample spaces ad evets sample space, S,, for a radom experimet is the set of all possible outcomes of the experimet. E.g., Roll a pair of fair Hexagoal dice, S? evet is a collectio of outcomes. E.g., E {a eve sum turs up} evet occurs if ay outcome maig up that evet occurs. E.g., E occurs if total sum is oe of: {, 4, 6, 8, 0 or } E? R.V.: X D + D : S R xioms of robability Let. be a fuctio, that has these 3 properties. for ay evet E, 0 E 0.. S, where S is the sample space. 3. For ay fiite (or ifiite collectio of mutually exclusive evets y fuctio that satisfies the above three axioms is a probability fuctio. U U ( (
2 The complemet of a evet,, deoted, c, `, occurs if ad oly if does ot occur. S The complemet of a evet (a Sample space cotaiig evet (b Evet shaded (c shaded roperties of robability Fuctios.E C E.If E ad E are logically equivalet, the EE. E: ot all cars are worth > $0K. E: Some cars are worth $0K. The EE. 3.E E mi (E,( E. robability ad Ve diagrams i + ( + ( r+ + i i < i i < i<... < ir ( i I i ( I I I i i... i + ( i I i I... I i r + Melaoma type of si cacer type of si cacer a example of laws of coditioal probabilities 400 Melaoma atiets by Type ad Site Site Head ad Row Type ec Tru Extremities Totals Hutchiso's melaomic frecle 0 34 Superficial odular Idetermiat Colum Totals Cotigecy table based o Melaoma histological type ad its locatio Coditioal robability The coditioal probability of occurrig give that B occurs is give by Remars I pr( B, how should the symbol is read give that. How do we iterpret the fact that: The evet always occurs whe B occurs?? What ca you say about pr( B? B pr( B pr( ad B pr(b B Suppose we select oe out of the 400 patiets i the study ad we wat to fid the probability that the cacer is o the extremities give that it is of type odular: 73/5 C. o Extremities odular #odular patiets with cacer o extremities #odular patiets Whe drawig a probability tree for a particular problem, how do you ow what evets to use for the first fa of braches ad which evets to use for the subsequet brachig? (at each brachig stage coditio o all the ifo available up to here. E.g., at first brachig use all simple evets, o prior r is available. t 3-rd brachig coditio of the previous evets, etc..
3 Statistical idepedece Evets ad B are statistically idepedet if owig whether B has occurred gives o ew iformatio about the chaces of occurrig, i.e. if pr( B pr( Similarly, B B, sice B B & / BB/ B If ad B are statistically idepedet,, the Ivertig Coditioal robabilities B B x B B x pr( ad B pr( pr( B Formula summary cot. Multiplicatio Rule uder idepedece: If ad B are idepedet evets, the B B If,,, are mutually idepedet,... If {{,,,, } are a partitio of the sample space (mutually exclusive ad U i S the for ay evet B Ex: B B + B B B B + B Law of Total robability S B B B Bayesia Rule If {{,,,, } are a o-trivial partitio of the sample space (mutually exclusive ad U i S, i >0 the for ay o-trivial evet ad B ( B>0 I i B i B / B [B i x i ] / B B i i B Bayesia Rule i B i i D the test perso has the disease. B T the test result is positive. Ex: (Laboratory blood test ssume: Fid: positive Test Disease 0.95 Disease positive Test? positive Test o Disease0.0 D T? Disease D IT T D D D T T c c T D D + T D D
4 Classes vs. Evidece Coditioig Classes: healthy(c, cacer Evidece: positive mammogram (pos, egative mammogram (eg If a woma has a positive mammogram result, what is the probability that she has breast cacer? evidece class class class evidece cacer evidece class class 0. 0 classes pos cacer 0. 8 ( pos healthy 0. cacer pos? C CxC/[ CxC+ HxH] C 0.8x0.0 / [0.8x x0.99]? Test Results Bayesia Rule (differet data/example! egative ositive Total True Disease State o Disease Disease OK ( False egative II ( False ositive I ( C ( C ( C T I D T D D OK ( Total ower of Test T C D / Sesitivity: T/(T+F /( Specificity: T/(T+F /( Factors affectig the power ED Larger: Causes: Sample size (positive Sample variace (egative Effect size (positive The chose level for α (positive robability Theory xioms Basic riciples for probability modelig ad computatio Law of Total robability & Bayesia Theorem Data Summaries ad ED Distributios ( Experimets & Demos ( Statistical Iferece Hypothesis Testig & Cofidece itervals arameter Estimatio arametric vs. o-parametric iferece ( Liear modelig Simple liear regressio, Multiple liear regressio OV & GLM The Big Three: Ceter,, Spread ad Shape! Ceter: Cetral tedecy, the middle, as a sigle umber value of parameter value of parameter value of parameter value of parameter (a Uimodal (b Bimodal (c Trimodal Mode: The most frequet score i the distributio. Media: The cetermost score if there are a odd umber of scores or the average of the two cetermost scores if there are a eve umber of scores. Mea: The sum of all (umeric observatios divided by the umber of scores (arithmetic average. (d Symmetric (e ositively sewed (log upper tail (f egatively sewed (log lower tail (g Symmetric (h Bimodal with gap (i Expoetial shape 4
5 Variability ot oly iterested i a distributio s middle. lso iterested i its spread (deviatio or variability. Fudametal characteristics of distributios (as models: - Cetral tedecy - Variability - Shape How ca we describe variability with a sigle umber? Shape: Sewess & Kurtosis What do we mea by symmetry ad positive ad egative sewess? Kurtosis? roperties?!? Sewess ( SD 3 ( Y Y ( Y Y 3 ; Kurtosis ( SD Sewess is liearly ivariat S(aX+b S(X Sewess is a measure of usymmetry Kurtosis is (also liearly ivariat a measure of flatess Both are used to quatify departures from Stdormal Sewess(Stdorm0; Kurtosis(Stdorm3 4 4 Distributios robability Theory xioms Basic riciples for probability modelig ad computatio Law of Total robability & Bayesia Theorem Distributios ( Experimets & Demos ( Statistical Iferece Hypothesis Testig & Cofidece itervals arameter Estimatio arametric vs. o-parametric iferece ( Liear modelig Simple liear regressio, Multiple liear regressio OV & GLM edu.ucla.stat.socr.distributios.beroullidistributio edu.ucla.stat.socr.distributios.biomialdistributio edu.ucla.stat.socr.distributios.birthdaydistributio edu.ucla.stat.socr.distributios.diedistributio edu.ucla.stat.socr.distributios.discretercsiedistributio edu.ucla.stat.socr.distributios.discreteuiformdistributio edu.ucla.stat.socr.distributios.geometricdistributio edu.ucla.stat.socr.distributios.hypergeometricdistributio edu.ucla.stat.socr.distributios.egativebiomialdistributio edu.ucla.stat.socr.distributios.oitmassdistributio edu.ucla.stat.socr.distributios.oissodistributio edu.ucla.stat.socr.distributios.oerdicedistributio edu.ucla.stat.socr.distributios.walmaxdistributio edu.ucla.stat.socr.distributios.walositiodistributio Discrete Distributio Models What, whe ad how to use? Examples? Example: Hypergeometric Distributio The three assumptios that lead to a hypergeometric distributio:. The populatio or set to be sampled cosists of idividuals, objects, or elemets (a fiite populatio.. Each idividual ca be characterized as a success (S or failure (F, ad there are M successes i the populatio. 3. sample of idividuals is selected without replacemet i such a way that each subset of size is equally liely to be chose. Hypergeometric Distributio If X is the umber of S s i a completely radom sample of size draw w/o replacemet from a populatio cosistig of M S s ad ( M F s, the the probability distributio of X, called the hypergeometric distributio, is give by M M max(0, + M x mi(, M x x X ( x hxm ( ;,, 5
6 Hypergeometric Mea ad Variace M M M EX ( VX ( Ball_ad_Ur_Experimet HyperGeometric Distributio & Biomial pproximatio to HyperGeometric Computatio of a Biomial pmf x b( x;, p p x ( 0 x x p Hypergeometric Distributio & Biomial Biomial approximatio to Hypergeometric M is small (usually < 0., the p approaches HyperGeom( x;,, M Bi( x;, p M / p Ex: 4,000 out of 0,000 residets are agaist a ew tax. 5 residets are selected at radom ad surveyed. at most 7 favor the ew tax? HyperGeometric ad Biomial Experimet/Distributios oisso Distributio Defiitio Used to model couts umber of arrivals (( o a give iterval The oisso distributio is also sometimes referred to as the distributio of rare evets. Examples of oisso distributed variables are umber of accidets per perso, umber of sweepstaes wo per perso, or the umber of catastrophic defects foud i a productio process. Fuctioal Brai Imagig - ositro Emissio Tomography (ET Fuctioal Brai Imagig ositro Emissio Tomography (ET Isotope Eergy (MeV Rage(mm /-life ppl. C mi receptors 5O.7.5 mi stroe/activatio 8 F mi eurology 4I ~ days ocology 6
7 oisso Distributio Mea Used to model couts umber of arrivals (( o a give iterval e Y~oisso(, the Y, 0,,,! Mea of Y, μ Y,, sice e E( Y e! e 0 e (! 0 0 e! e! e 0,,, (! oisso Distributio - Variace Y~oisso(, the Y e, 0,,,! Variace of Y, σ Y ½, sice σ e Y Var( Y (... 0! For example, suppose that Y deotes the umber of bloced shots (arrivals i a radomly sampled game for the UCL Bruis me's basetball team. The a oisso distributio with h mea4 may be used to model Y. oisso as a approximatio to Biomial Suppose we have a sequece of Biomial(, p models, with lim( p, as ifiity. For each 0<y<, if Y ~ Biomial(, p, the Y y But this coverges to: p y y ( p y y y p p y ( WHY? y e y! p Thus, Biomial(, p oisso( oisso as a approximatio to Biomial Rule of thumb is that approximatio is good if: >00 p<0.0 p <0 The, Biomial(, p oisso( Validate usig: ml Biomial, HyperGeometric ad oisso Experimets Example usig oisso approx to Biomial Suppose Disease Fid the probability that a village of 5,000 people has > people havig the disease! Y~ Biomial(5,000, 0.000, fid Y>. ote that Z~oisso( p 5,000 x Z > Z 0.5 e 0! e!.5 z e! z.5 e z! Why bother discussig distributios? rovide a rich source of (aalytical Models. Geeral properties of processes may be studies without regard to the uderlyig molecular, physiological, geotypic or pheotypic properties or characteristics of the pheomeo. Easy to fit models to data ad mae iferece usig the model istead of limited data. Low computatioal costs (efficiecy( efficiecy What else? Example: 7
8 Cotiuous Distributio Models Beta Distributio, edu.ucla.stat.socr.distributios.betadistributio Beta (Geeralized Distributio, edu.ucla.stat.socr.distributios.betageeraldistributio CauchyDistributio, edu.ucla.stat.socr.distributios.cauchydistributio CauchyDistributio Chi-Square Distributio, edu.ucla.stat.socr.distributios.chisquaredistributio ChiSquareDistributio Circle Distributio, edu.ucla.stat.socr.distributios.circledistributio CircleDistributio Cotiuous Uiform Distributio, edu.ucla.stat.socr.distributios.cotiuousuiformdistributio CotiuousUiformDistributio Expoetial Distributio, edu.ucla.stat.socr.distributios.expoetialdistributio Fisher's F Distributio, edu.ucla.stat.socr.distributios.fisherdistributio Gamma Distributio, edu.ucla.stat.socr.distributios.gammadistributio GeeralCauchyDistributio, edu.ucla.stat.socr.distributios.geeralcauchydistributio Gilbrats Distributio, edu.ucla.stat.socr.distributios.gilbratsdistributio GumbelDistributio, edu.ucla.stat.socr.distributios.gumbeldistributio Half-ormal Distributio, edu.ucla.stat.socr.distributios.halformaldistributio Laplace Distributio, edu.ucla.stat.socr.distributios.laplacedistributio Logistic Distributio, edu.ucla.stat.socr.distributios.logisticdistributio Log-ormal Distributio, edu.ucla.stat.socr.distributios.logormaldistributio Maxwell Distributio, edu.ucla.stat.socr.distributios.maxwelldistributio MixtureDistributio, edu.ucla.stat.socr.distributios.mixturedistributio ormal Distributio, edu.ucla.stat.socr.distributios.ormaldistributio areto Distributio, edu.ucla.stat.socr.distributios.aretodistributio Rayleigh Distributio, edu.ucla.stat.socr.distributios.rayleighdistributio RayleighDistributio Studet's T Distributio, edu.ucla.stat.socr.distributios.studetdistributio StudetDistributio Triagle Distributio, edu.ucla.stat.socr.distributios.triagledistributio Weibull Distributio, edu.ucla.stat.socr.distributios.weibulldistributio What, whe ad how to use? Examples? Logormal (Y μ,σ Relatio amog Distributios ormal (X μ,σ X ly X Y e α U X β α Beta α, β α β Uiform(X α, β μ Z X σ χ Uiform(U 0, i X ( β α U + α Z i ormal (Z 0, Chi-square ( χ α /, β Gamma α, β α X β lu df Weibull γ, β γ Expoetial(X β T df (0, df Cauchy (0, The ormal Distributio ormal desity curve ca be summarized with the followig formula: y μ σ f ( y e σ π Every ormal curve uses this formula, what maes them differet is what gets plugged i for μ ad σ Each ormal curve is cetered at μ ad the width depeds o σ ( small tall, large short/wide. d-dimesioal dimesioal Gaussia distributios with mea vector μ ad covariace matrix Σ : T ( x Exp ( x μ Σ ( x μ p (π d / Σ / The ormal Distributio Each ormal curve is characterized by it's μ ad σ Y μ 3σ μ σ μ σ μ μ + σ μ + σ μ + 3σ μ μ μ + σ μ σ μ 3σ σ μ + σ μ + 3σ If radom variable Y is ormal with mea μ ad stadard deviatio σ,, we write Y ~ ( μ, σ Y Defiitio of the expected value,, i geeral The expected value: E(X x all x x x x dx all X Sum of (value times probability of value Example I the at least oe of each or at most 3 childre example, where X {umber of Girls} we have: X 0 3 pr(x E( X x x x
9 The expected value ad populatio mea μ X E(X is called the mea of the distributio of X. μ X E(X is usually called the populatio mea. μx is the poit where the bar graph of X x balaces. opulatio stadard deviatio The populatio stadard deviatio is sd(x E[(X - μ ] ote that if X is a RV, the (X-μ is also a RV, ad so is (X-μ. Hece, the expectatio, E[(X-μ ], maes sese. opulatio mea & stadard deviatio Expected value: E ( X x X x Variace Stadard Deviatio x ( x E( x X Var ( X x x ( x E( x X SD ( X Var( X x x For ay radom variable X E(aX +b a E(X +b+ ad SD(aX +b a SD(X Chebyshev s Theorem pplies to all distributios where σ, μ < afuty Chebyshev (Пафнутий Чебышёв (8( K Chebyshov, Tchebycheff or Tschebyscheff. μ σ < X< μ+ σ for > Chebyshev s Theorem Gives a lower boud for the probability that a value of a radom variable, with fiite variace,, lies withi a certai distace from the variable's mea; equivaletly, the theorem provides a upper boud for the probability that values lie outside the same distace from the mea. The theorem applies eve to o "bell-shaped" distributios ad puts bouds o how much of the data is or is ot "i the middle". Let X be a radom variable with mea μ ad fiite variace σ. ow, for ay real umber > 0, X μ X μ < σ σ Oly the cases > provide useful iformatio. Why? 9
10 Marov s s & Chebyshev s Iequalities Marov's iequality: (Marov was a studet of (Marov was a studet of Chebyshev If Y 0 & d > 0 E( Y Y d d Sice, if d, if Y d X, 0, otherwise ote Y 0, X 0 The : E( Y E( X d Y { d} Let Y X E(X ad d with > 0 E ( ( ( X E(X Y d X E(X Var(X σ ( X - E(X ( X - E(X σ Let /σ σ Chebyshev s Theorem pplies to all distributios, where mea umber exists (σ,( ofμ< Stadard Distace from Deviatios the Mea μ± σ μ± 3σ Miimum roportio of Values Fallig Withi Distace K -/ 0.75 K 3 -/ K 4 μ± 4σ -/ Coefficiet of Variatio Ratio of the stadard deviatio to the mea, expressed as a percetage Measuremet of relative dispersio Coefficiet of Variatio a example μ σ C. V. ( 00 CV.. ( 00 CV.. ( 00 μ 46. ( ( σ σ μ μ 84 0 σ σ μ Outlie robability Theory xioms Basic riciples for probability modelig ad computatio Law of Total robability & Bayesia Theorem Data Summaries ad ED Distributios ( Experimets & Demos ( Statistical Iferece arameter Estimatio Hypothesis Testig & Cofidece itervals arametric vs. o-parametric iferece ( CLT Liear modelig Simple liear regressio, Multiple liear regressio OV & GLM arameters, Estimators, Estimates parameter is a characteristic of process, populatio or distributio E.g., mea, st quartile, SD, mi, max, rage, sewess,, 97 th percetile, etc. estimator is a abstract rule for calculatig a quatity (or parameter from sample data. estimate is the value obtaied whe real data are plugged-i the estimator rule. 0
Discrete Random Variables and Probability Distributions. Random Variables. Discrete Models
UCLA STAT 35 Applied Computatioal ad Iteractive Probability Istructor: Ivo Diov, Asst. Prof. I Statistics ad Neurology Teachig Assistat: Chris Barr Uiversity of Califoria, Los Ageles, Witer 006 http://www.stat.ucla.edu/~diov/
More informationProbability and statistics: basic terms
Probability ad statistics: basic terms M. Veeraraghava August 203 A radom variable is a rule that assigs a umerical value to each possible outcome of a experimet. Outcomes of a experimet form the sample
More informationLecture 7: Properties of Random Samples
Lecture 7: Properties of Radom Samples 1 Cotiued From Last Class Theorem 1.1. Let X 1, X,...X be a radom sample from a populatio with mea µ ad variace σ
More informationResampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.
Jauary 1, 2019 Resamplig Methods Motivatio We have so may estimators with the property θ θ d N 0, σ 2 We ca also write θ a N θ, σ 2 /, where a meas approximately distributed as Oce we have a cosistet estimator
More informationChapter 6 Sampling Distributions
Chapter 6 Samplig Distributios 1 I most experimets, we have more tha oe measuremet for ay give variable, each measuremet beig associated with oe radomly selected a member of a populatio. Hece we eed to
More informationRandom Variables, Sampling and Estimation
Chapter 1 Radom Variables, Samplig ad Estimatio 1.1 Itroductio This chapter will cover the most importat basic statistical theory you eed i order to uderstad the ecoometric material that will be comig
More informationExpectation and Variance of a random variable
Chapter 11 Expectatio ad Variace of a radom variable The aim of this lecture is to defie ad itroduce mathematical Expectatio ad variace of a fuctio of discrete & cotiuous radom variables ad the distributio
More informationJoint Probability Distributions and Random Samples. Jointly Distributed Random Variables. Chapter { }
UCLA STAT A Applied Probability & Statistics for Egieers Istructor: Ivo Diov, Asst. Prof. I Statistics ad Neurology Teachig Assistat: Neda Farziia, UCLA Statistics Uiversity of Califoria, Los Ageles, Sprig
More informationKLMED8004 Medical statistics. Part I, autumn Estimation. We have previously learned: Population and sample. New questions
We have previously leared: KLMED8004 Medical statistics Part I, autum 00 How kow probability distributios (e.g. biomial distributio, ormal distributio) with kow populatio parameters (mea, variace) ca give
More informationStatistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.
Statistical Iferece (Chapter 10) Statistical iferece = lear about a populatio based o the iformatio provided by a sample. Populatio: The set of all values of a radom variable X of iterest. Characterized
More information1 of 7 7/16/2009 6:06 AM Virtual Laboratories > 6. Radom Samples > 1 2 3 4 5 6 7 6. Order Statistics Defiitios Suppose agai that we have a basic radom experimet, ad that X is a real-valued radom variable
More informationTopic 9: Sampling Distributions of Estimators
Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be
More informationStatistical and Mathematical Methods DS-GA 1002 December 8, Sample Final Problems Solutions
Statistical ad Mathematical Methods DS-GA 00 December 8, 05. Short questios Sample Fial Problems Solutios a. Ax b has a solutio if b is i the rage of A. The dimesio of the rage of A is because A has liearly-idepedet
More informationProbability and Statistics
robability ad Statistics rof. Zheg Zheg Radom Variable A fiite sigle valued fuctio.) that maps the set of all eperimetal outcomes ito the set of real umbers R is a r.v., if the set ) is a evet F ) for
More information7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals
7-1 Chapter 4 Part I. Samplig Distributios ad Cofidece Itervals 1 7- Sectio 1. Samplig Distributio 7-3 Usig Statistics Statistical Iferece: Predict ad forecast values of populatio parameters... Test hypotheses
More informationCEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering
CEE 5 Autum 005 Ucertaity Cocepts for Geotechical Egieerig Basic Termiology Set A set is a collectio of (mutually exclusive) objects or evets. The sample space is the (collectively exhaustive) collectio
More informationDiscrete Mathematics for CS Spring 2008 David Wagner Note 22
CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 22 I.I.D. Radom Variables Estimatig the bias of a coi Questio: We wat to estimate the proportio p of Democrats i the US populatio, by takig
More informationBinomial Distribution
0.0 0.5 1.0 1.5 2.0 2.5 3.0 0 1 2 3 4 5 6 7 0.0 0.5 1.0 1.5 2.0 2.5 3.0 Overview Example: coi tossed three times Defiitio Formula Recall that a r.v. is discrete if there are either a fiite umber of possible
More informationIE 230 Probability & Statistics in Engineering I. Closed book and notes. No calculators. 120 minutes.
Closed book ad otes. No calculators. 120 miutes. Cover page, five pages of exam, ad tables for discrete ad cotiuous distributios. Score X i =1 X i / S X 2 i =1 (X i X ) 2 / ( 1) = [i =1 X i 2 X 2 ] / (
More informationTopic 9: Sampling Distributions of Estimators
Topic 9: Samplig Distributios of Estimators Course 003, 2016 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be
More informationModule 1 Fundamentals in statistics
Normal Distributio Repeated observatios that differ because of experimetal error ofte vary about some cetral value i a roughly symmetrical distributio i which small deviatios occur much more frequetly
More informationSimulation. Two Rule For Inverting A Distribution Function
Simulatio Two Rule For Ivertig A Distributio Fuctio Rule 1. If F(x) = u is costat o a iterval [x 1, x 2 ), the the uiform value u is mapped oto x 2 through the iversio process. Rule 2. If there is a jump
More information7.1 Convergence of sequences of random variables
Chapter 7 Limit Theorems Throughout this sectio we will assume a probability space (, F, P), i which is defied a ifiite sequece of radom variables (X ) ad a radom variable X. The fact that for every ifiite
More information4. Partial Sums and the Central Limit Theorem
1 of 10 7/16/2009 6:05 AM Virtual Laboratories > 6. Radom Samples > 1 2 3 4 5 6 7 4. Partial Sums ad the Cetral Limit Theorem The cetral limit theorem ad the law of large umbers are the two fudametal theorems
More informationLecture 4. Random variable and distribution of probability
Itroductio to theory of probability ad statistics Lecture. Radom variable ad distributio of probability dr hab.iż. Katarzya Zarzewsa, prof.agh Katedra Eletroii, AGH e-mail: za@agh.edu.pl http://home.agh.edu.pl/~za
More informationParameter, Statistic and Random Samples
Parameter, Statistic ad Radom Samples A parameter is a umber that describes the populatio. It is a fixed umber, but i practice we do ot kow its value. A statistic is a fuctio of the sample data, i.e.,
More informationTopic 9: Sampling Distributions of Estimators
Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be
More informationDistribution of Random Samples & Limit theorems
STAT/MATH 395 A - PROBABILITY II UW Witer Quarter 2017 Néhémy Lim Distributio of Radom Samples & Limit theorems 1 Distributio of i.i.d. Samples Motivatig example. Assume that the goal of a study is to
More informationParameter, Statistic and Random Samples
Parameter, Statistic ad Radom Samples A parameter is a umber that describes the populatio. It is a fixed umber, but i practice we do ot kow its value. A statistic is a fuctio of the sample data, i.e.,
More informationLecture 5. Random variable and distribution of probability
Itroductio to theory of probability ad statistics Lecture 5. Radom variable ad distributio of probability prof. dr hab.iż. Katarzya Zarzewsa Katedra Eletroii, AGH e-mail: za@agh.edu.pl http://home.agh.edu.pl/~za
More informationInferential Statistics. Inference Process. Inferential Statistics and Probability a Holistic Approach. Inference Process.
Iferetial Statistics ad Probability a Holistic Approach Iferece Process Chapter 8 Poit Estimatio ad Cofidece Itervals This Course Material by Maurice Geraghty is licesed uder a Creative Commos Attributio-ShareAlike
More informationMATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4
MATH 30: Probability ad Statistics 9. Estimatio ad Testig of Parameters Estimatio ad Testig of Parameters We have bee dealig situatios i which we have full kowledge of the distributio of a radom variable.
More informationf X (12) = Pr(X = 12) = Pr({(6, 6)}) = 1/36
Probability Distributios A Example With Dice If X is a radom variable o sample space S, the the probablity that X takes o the value c is Similarly, Pr(X = c) = Pr({s S X(s) = c} Pr(X c) = Pr({s S X(s)
More informationProbability 2 - Notes 10. Lemma. If X is a random variable and g(x) 0 for all x in the support of f X, then P(g(X) 1) E[g(X)].
Probability 2 - Notes 0 Some Useful Iequalities. Lemma. If X is a radom variable ad g(x 0 for all x i the support of f X, the P(g(X E[g(X]. Proof. (cotiuous case P(g(X Corollaries x:g(x f X (xdx x:g(x
More information7.1 Convergence of sequences of random variables
Chapter 7 Limit theorems Throughout this sectio we will assume a probability space (Ω, F, P), i which is defied a ifiite sequece of radom variables (X ) ad a radom variable X. The fact that for every ifiite
More informationClass 23. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700
Class 23 Daiel B. Rowe, Ph.D. Departmet of Mathematics, Statistics, ad Computer Sciece Copyright 2017 by D.B. Rowe 1 Ageda: Recap Chapter 9.1 Lecture Chapter 9.2 Review Exam 6 Problem Solvig Sessio. 2
More informationRule of probability. Let A and B be two events (sets of elementary events). 11. If P (AB) = P (A)P (B), then A and B are independent.
Percetile: the αth percetile of a populatio is the value x 0, such that P (X x 0 ) α% For example the 5th is the x 0, such that P (X x 0 ) 5% 05 Rule of probability Let A ad B be two evets (sets of elemetary
More informationLast Lecture. Wald Test
Last Lecture Biostatistics 602 - Statistical Iferece Lecture 22 Hyu Mi Kag April 9th, 2013 Is the exact distributio of LRT statistic typically easy to obtai? How about its asymptotic distributio? For testig
More informationFACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures
FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING Lectures MODULE 5 STATISTICS II. Mea ad stadard error of sample data. Biomial distributio. Normal distributio 4. Samplig 5. Cofidece itervals
More informationBIOS 4110: Introduction to Biostatistics. Breheny. Lab #9
BIOS 4110: Itroductio to Biostatistics Brehey Lab #9 The Cetral Limit Theorem is very importat i the realm of statistics, ad today's lab will explore the applicatio of it i both categorical ad cotiuous
More informationElements of Statistical Methods Lots of Data or Large Samples (Ch 8)
Elemets of Statistical Methods Lots of Data or Large Samples (Ch 8) Fritz Scholz Sprig Quarter 2010 February 26, 2010 x ad X We itroduced the sample mea x as the average of the observed sample values x
More informationStat 421-SP2012 Interval Estimation Section
Stat 41-SP01 Iterval Estimatio Sectio 11.1-11. We ow uderstad (Chapter 10) how to fid poit estimators of a ukow parameter. o However, a poit estimate does ot provide ay iformatio about the ucertaity (possible
More informationData Description. Measure of Central Tendency. Data Description. Chapter x i
Data Descriptio Describe Distributio with Numbers Example: Birth weights (i lb) of 5 babies bor from two groups of wome uder differet care programs. Group : 7, 6, 8, 7, 7 Group : 3, 4, 8, 9, Chapter 3
More informationQuick Review of Probability
Quick Review of Probability Berli Che Departmet of Computer Sciece & Iformatio Egieerig Natioal Taiwa Normal Uiversity Refereces: 1. W. Navidi. Statistics for Egieerig ad Scietists. Chapter & Teachig Material.
More informationQuick Review of Probability
Quick Review of Probability Berli Che Departmet of Computer Sciece & Iformatio Egieerig Natioal Taiwa Normal Uiversity Refereces: 1. W. Navidi. Statistics for Egieerig ad Scietists. Chapter 2 & Teachig
More informationMATH/STAT 352: Lecture 15
MATH/STAT 352: Lecture 15 Sectios 5.2 ad 5.3. Large sample CI for a proportio ad small sample CI for a mea. 1 5.2: Cofidece Iterval for a Proportio Estimatig proportio of successes i a biomial experimet
More informationThis section is optional.
4 Momet Geeratig Fuctios* This sectio is optioal. The momet geeratig fuctio g : R R of a radom variable X is defied as g(t) = E[e tx ]. Propositio 1. We have g () (0) = E[X ] for = 1, 2,... Proof. Therefore
More informationIE 230 Seat # Name < KEY > Please read these directions. Closed book and notes. 60 minutes.
IE 230 Seat # Name < KEY > Please read these directios. Closed book ad otes. 60 miutes. Covers through the ormal distributio, Sectio 4.7 of Motgomery ad Ruger, fourth editio. Cover page ad four pages of
More informationAgreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times
Sigificace level vs. cofidece level Agreemet of CI ad HT Lecture 13 - Tests of Proportios Sta102 / BME102 Coli Rudel October 15, 2014 Cofidece itervals ad hypothesis tests (almost) always agree, as log
More informationComputing Confidence Intervals for Sample Data
Computig Cofidece Itervals for Sample Data Topics Use of Statistics Sources of errors Accuracy, precisio, resolutio A mathematical model of errors Cofidece itervals For meas For variaces For proportios
More informationStatistical inference: example 1. Inferential Statistics
Statistical iferece: example 1 Iferetial Statistics POPULATION SAMPLE A clothig store chai regularly buys from a supplier large quatities of a certai piece of clothig. Each item ca be classified either
More informationLecture 1 Probability and Statistics
Wikipedia: Lecture 1 Probability ad Statistics Bejami Disraeli, British statesma ad literary figure (1804 1881): There are three kids of lies: lies, damed lies, ad statistics. popularized i US by Mark
More informationChapter 6 Principles of Data Reduction
Chapter 6 for BST 695: Special Topics i Statistical Theory. Kui Zhag, 0 Chapter 6 Priciples of Data Reductio Sectio 6. Itroductio Goal: To summarize or reduce the data X, X,, X to get iformatio about a
More informationGoodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen)
Goodess-of-Fit Tests ad Categorical Data Aalysis (Devore Chapter Fourtee) MATH-252-01: Probability ad Statistics II Sprig 2019 Cotets 1 Chi-Squared Tests with Kow Probabilities 1 1.1 Chi-Squared Testig................
More informationHypothesis Testing. Evaluation of Performance of Learned h. Issues. Trade-off Between Bias and Variance
Hypothesis Testig Empirically evaluatig accuracy of hypotheses: importat activity i ML. Three questios: Give observed accuracy over a sample set, how well does this estimate apply over additioal samples?
More informationUCLA STAT 110B Applied Statistics for Engineering and the Sciences
UCLA STAT 110B Applied Statistics for Egieerig ad the Scieces Istructor: Ivo Diov, Asst. Prof. I Statistics ad Neurology Teachig Assistats: Bria Ng, UCLA Statistics Uiversity of Califoria, Los Ageles,
More informationA quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population
A quick activity - Cetral Limit Theorem ad Proportios Lecture 21: Testig Proportios Statistics 10 Coli Rudel Flip a coi 30 times this is goig to get loud! Record the umber of heads you obtaied ad calculate
More informationStatistics 511 Additional Materials
Cofidece Itervals o mu Statistics 511 Additioal Materials This topic officially moves us from probability to statistics. We begi to discuss makig ifereces about the populatio. Oe way to differetiate probability
More informationEcon 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara
Poit Estimator Eco 325 Notes o Poit Estimator ad Cofidece Iterval 1 By Hiro Kasahara Parameter, Estimator, ad Estimate The ormal probability desity fuctio is fully characterized by two costats: populatio
More informationFrequentist Inference
Frequetist Iferece The topics of the ext three sectios are useful applicatios of the Cetral Limit Theorem. Without kowig aythig about the uderlyig distributio of a sequece of radom variables {X i }, for
More informationLecture 1. Statistics: A science of information. Population: The population is the collection of all subjects we re interested in studying.
Lecture Mai Topics: Defiitios: Statistics, Populatio, Sample, Radom Sample, Statistical Iferece Type of Data Scales of Measuremet Describig Data with Numbers Describig Data Graphically. Defiitios. Example
More informationn outcome is (+1,+1, 1,..., 1). Let the r.v. X denote our position (relative to our starting point 0) after n moves. Thus X = X 1 + X 2 + +X n,
CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 9 Variace Questio: At each time step, I flip a fair coi. If it comes up Heads, I walk oe step to the right; if it comes up Tails, I walk oe
More informationContinuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised
Questio 1. (Topics 1-3) A populatio cosists of all the members of a group about which you wat to draw a coclusio (Greek letters (μ, σ, Ν) are used) A sample is the portio of the populatio selected for
More informationIntroduction to Probability and Statistics Twelfth Edition
Itroductio to Probability ad Statistics Twelfth Editio Robert J. Beaver Barbara M. Beaver William Medehall Presetatio desiged ad writte by: Barbara M. Beaver Itroductio to Probability ad Statistics Twelfth
More informationLecture 2: Monte Carlo Simulation
STAT/Q SCI 43: Itroductio to Resamplig ethods Sprig 27 Istructor: Ye-Chi Che Lecture 2: ote Carlo Simulatio 2 ote Carlo Itegratio Assume we wat to evaluate the followig itegratio: e x3 dx What ca we do?
More information(6) Fundamental Sampling Distribution and Data Discription
34 Stat Lecture Notes (6) Fudametal Samplig Distributio ad Data Discriptio ( Book*: Chapter 8,pg5) Probability& Statistics for Egieers & Scietists By Walpole, Myers, Myers, Ye 8.1 Radom Samplig: Populatio:
More informationFinal Review for MATH 3510
Fial Review for MATH 50 Calculatio 5 Give a fairly simple probability mass fuctio or probability desity fuctio of a radom variable, you should be able to compute the expected value ad variace of the variable
More informationLecture 2: Poisson Sta*s*cs Probability Density Func*ons Expecta*on and Variance Es*mators
Lecture 2: Poisso Sta*s*cs Probability Desity Fuc*os Expecta*o ad Variace Es*mators Biomial Distribu*o: P (k successes i attempts) =! k!( k)! p k s( p s ) k prob of each success Poisso Distributio Note
More informationChapter 2 Descriptive Statistics
Chapter 2 Descriptive Statistics Statistics Most commoly, statistics refers to umerical data. Statistics may also refer to the process of collectig, orgaizig, presetig, aalyzig ad iterpretig umerical data
More informationApproximations and more PMFs and PDFs
Approximatios ad more PMFs ad PDFs Saad Meimeh 1 Approximatio of biomial with Poisso Cosider the biomial distributio ( b(k,,p = p k (1 p k, k λ: k Assume that is large, ad p is small, but p λ at the limit.
More informationSTAT 203 Chapter 18 Sampling Distribution Models
STAT 203 Chapter 18 Samplig Distributio Models Populatio vs. sample, parameter vs. statistic Recall that a populatio cotais the etire collectio of idividuals that oe wats to study, ad a sample is a subset
More informationTABLES AND FORMULAS FOR MOORE Basic Practice of Statistics
TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics Explorig Data: Distributios Look for overall patter (shape, ceter, spread) ad deviatios (outliers). Mea (use a calculator): x = x 1 + x 2 + +
More informationSummary. Recap ... Last Lecture. Summary. Theorem
Last Lecture Biostatistics 602 - Statistical Iferece Lecture 23 Hyu Mi Kag April 11th, 2013 What is p-value? What is the advatage of p-value compared to hypothesis testig procedure with size α? How ca
More information1 Inferential Methods for Correlation and Regression Analysis
1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet
More informationSDS 321: Introduction to Probability and Statistics
SDS 321: Itroductio to Probability ad Statistics Lecture 23: Cotiuous radom variables- Iequalities, CLT Puramrita Sarkar Departmet of Statistics ad Data Sciece The Uiversity of Texas at Austi www.cs.cmu.edu/
More informationTABLES AND FORMULAS FOR MOORE Basic Practice of Statistics
TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics Explorig Data: Distributios Look for overall patter (shape, ceter, spread) ad deviatios (outliers). Mea (use a calculator): x = x 1 + x 2 + +
More informationTopic 6 Sampling, hypothesis testing, and the central limit theorem
CSE 103: Probability ad statistics Fall 2010 Topic 6 Samplig, hypothesis testig, ad the cetral limit theorem 61 The biomial distributio Let X be the umberofheadswhe acoiofbiaspistossedtimes The distributio
More informationIntroductory statistics
CM9S: Machie Learig for Bioiformatics Lecture - 03/3/06 Itroductory statistics Lecturer: Sriram Sakararama Scribe: Sriram Sakararama We will provide a overview of statistical iferece focussig o the key
More informationCS 330 Discussion - Probability
CS 330 Discussio - Probability March 24 2017 1 Fudametals of Probability 11 Radom Variables ad Evets A radom variable X is oe whose value is o-determiistic For example, suppose we flip a coi ad set X =
More informationComparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading
Topic 15 - Two Sample Iferece I STAT 511 Professor Bruce Craig Comparig Two Populatios Research ofte ivolves the compariso of two or more samples from differet populatios Graphical summaries provide visual
More informationEcon 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.
Eco 325/327 Notes o Sample Mea, Sample Proportio, Cetral Limit Theorem, Chi-square Distributio, Studet s t distributio 1 Sample Mea By Hiro Kasahara We cosider a radom sample from a populatio. Defiitio
More informationChapter 8: STATISTICAL INTERVALS FOR A SINGLE SAMPLE. Part 3: Summary of CI for µ Confidence Interval for a Population Proportion p
Chapter 8: STATISTICAL INTERVALS FOR A SINGLE SAMPLE Part 3: Summary of CI for µ Cofidece Iterval for a Populatio Proportio p Sectio 8-4 Summary for creatig a 100(1-α)% CI for µ: Whe σ 2 is kow ad paret
More informationBasics of Probability Theory (for Theory of Computation courses)
Basics of Probability Theory (for Theory of Computatio courses) Oded Goldreich Departmet of Computer Sciece Weizma Istitute of Sciece Rehovot, Israel. oded.goldreich@weizma.ac.il November 24, 2008 Preface.
More informationLecture 19: Convergence
Lecture 19: Covergece Asymptotic approach I statistical aalysis or iferece, a key to the success of fidig a good procedure is beig able to fid some momets ad/or distributios of various statistics. I may
More informationBig Picture. 5. Data, Estimates, and Models: quantifying the accuracy of estimates.
5. Data, Estimates, ad Models: quatifyig the accuracy of estimates. 5. Estimatig a Normal Mea 5.2 The Distributio of the Normal Sample Mea 5.3 Normal data, cofidece iterval for, kow 5.4 Normal data, cofidece
More information1 Models for Matched Pairs
1 Models for Matched Pairs Matched pairs occur whe we aalyse samples such that for each measuremet i oe of the samples there is a measuremet i the other sample that directly relates to the measuremet i
More informationGG313 GEOLOGICAL DATA ANALYSIS
GG313 GEOLOGICAL DATA ANALYSIS 1 Testig Hypothesis GG313 GEOLOGICAL DATA ANALYSIS LECTURE NOTES PAUL WESSEL SECTION TESTING OF HYPOTHESES Much of statistics is cocered with testig hypothesis agaist data
More informationConvergence of random variables. (telegram style notes) P.J.C. Spreij
Covergece of radom variables (telegram style otes).j.c. Spreij this versio: September 6, 2005 Itroductio As we kow, radom variables are by defiitio measurable fuctios o some uderlyig measurable space
More information[ ] ( ) ( ) [ ] ( ) 1 [ ] [ ] Sums of Random Variables Y = a 1 X 1 + a 2 X 2 + +a n X n The expected value of Y is:
PROBABILITY FUNCTIONS A radom variable X has a probabilit associated with each of its possible values. The probabilit is termed a discrete probabilit if X ca assume ol discrete values, or X = x, x, x 3,,
More informationUnderstanding Samples
1 Will Moroe CS 109 Samplig ad Bootstrappig Lecture Notes #17 August 2, 2017 Based o a hadout by Chris Piech I this chapter we are goig to talk about statistics calculated o samples from a populatio. We
More informationLimit Theorems. Convergence in Probability. Let X be the number of heads observed in n tosses. Then, E[X] = np and Var[X] = np(1-p).
Limit Theorems Covergece i Probability Let X be the umber of heads observed i tosses. The, E[X] = p ad Var[X] = p(-p). L O This P x p NM QP P x p should be close to uity for large if our ituitio is correct.
More informationCHAPTER 8 FUNDAMENTAL SAMPLING DISTRIBUTIONS AND DATA DESCRIPTIONS. 8.1 Random Sampling. 8.2 Some Important Statistics
CHAPTER 8 FUNDAMENTAL SAMPLING DISTRIBUTIONS AND DATA DESCRIPTIONS 8.1 Radom Samplig The basic idea of the statistical iferece is that we are allowed to draw ifereces or coclusios about a populatio based
More informationBasis for simulation techniques
Basis for simulatio techiques M. Veeraraghava, March 7, 004 Estimatio is based o a collectio of experimetal outcomes, x, x,, x, where each experimetal outcome is a value of a radom variable. x i. Defiitios
More informationTests of Hypotheses Based on a Single Sample (Devore Chapter Eight)
Tests of Hypotheses Based o a Sigle Sample Devore Chapter Eight MATH-252-01: Probability ad Statistics II Sprig 2018 Cotets 1 Hypothesis Tests illustrated with z-tests 1 1.1 Overview of Hypothesis Testig..........
More informationPRACTICE PROBLEMS FOR THE FINAL
PRACTICE PROBLEMS FOR THE FINAL Math 36Q Fall 25 Professor Hoh Below is a list of practice questios for the Fial Exam. I would suggest also goig over the practice problems ad exams for Exam ad Exam 2 to
More informationAxioms of Measure Theory
MATH 532 Axioms of Measure Theory Dr. Neal, WKU I. The Space Throughout the course, we shall let X deote a geeric o-empty set. I geeral, we shall ot assume that ay algebraic structure exists o X so that
More informationEstimation for Complete Data
Estimatio for Complete Data complete data: there is o loss of iformatio durig study. complete idividual complete data= grouped data A complete idividual data is the oe i which the complete iformatio of
More informationClosed book and notes. No calculators. 60 minutes, but essentially unlimited time.
IE 230 Seat # Closed book ad otes. No calculators. 60 miutes, but essetially ulimited time. Cover page, four pages of exam, ad Pages 8 ad 12 of the Cocise Notes. This test covers through Sectio 4.7 of
More informationLecture 6: Coupon Collector s problem
Radomized Algorithms Lecture 6: Coupo Collector s problem Sotiris Nikoletseas Professor CEID - ETY Course 2017-2018 Sotiris Nikoletseas, Professor Radomized Algorithms - Lecture 6 1 / 16 Variace: key features
More informationFormulas and Tables for Gerstman
Formulas ad Tables for Gerstma Measuremet ad Study Desig Biostatistics is more tha a compilatio of computatioal techiques! Measuremet scales: quatitative, ordial, categorical Iformatio quality is primary
More information