Computing Confidence Intervals for Sample Data
|
|
- Abel Richards
- 5 years ago
- Views:
Transcription
1 Computig Cofidece Itervals for Sample Data Topics Use of Statistics Sources of errors Accuracy, precisio, resolutio A mathematical model of errors Cofidece itervals For meas For variaces For proportios How may measuremets are eeded for desired error? 1
2 What are statistics? A brach of mathematics dealig with the collectio, aalysis, iterpretatio, ad presetatio of masses of umerical data. Merriam-Webster We are most iterested i aalysis ad iterpretatio here. Lies, dam lies, ad statistics! 3 What is a statistic? A quatity that is computed from a sample [of data]. Merriam-Webster A estimate of a populatio parameter 4
3 Statistical Iferece populatio statistics iferece parameters sample 5 Why do we eed statistics? A set of experimetal measuremets costitute a sample of the uderlyig process/system beig measured Use statistical techiques to ifer the true value of the metric Use statistical techiques to quatify the amout of imprecisio due to radom experimetal errors 6 3
4 Experimetal errors Errors oise i measured values Systematic errors Result of a experimetal mistake Typically produce costat or slowly varyig bias Cotrolled through skill of experimeter Examples Temperature chage causes clock drift Forget to clear cache before timig ru 7 Experimetal errors Radom errors Upredictable, o-determiistic Ubiased equal probability of icreasig or decreasig measured value Result of Limitatios of measurig tool Observer readig output of tool Radom processes withi system Typically caot be cotrolled Use statistical tools to characterize ad quatify 8 4
5 Example: Quatizatio Radom error 9 Quatizatio error Timer resolutio quatizatio error Repeated measuremets X ± Δ Completely upredictable 10 5
6 A Model of Errors Error Measured value Probability -E x E ½ +E x + E ½ 11 A Model of Errors Error 1 Error Measured value Probability -E -E x E ¼ -E +E x ¼ +E -E x ¼ +E +E x + E ¼ 1 6
7 A Model of Errors 13 Probability of Obtaiig a Specific Measured Value 14 7
8 A Model of Errors Pr(X=x i ) = Pr(measure x i ) = umber of paths from real value to x i Pr(X=x i ) ~ biomial distributio As umber of error sources becomes large, Biomial Gaussia (Normal) Thus, the bell curve 15 Frequecy of Measurig Specific Values Accuracy Precisio Mea of measured values Resolutio True value 16 8
9 Accuracy, Precisio, Resolutio Systematic errors accuracy How close mea of measured values is to true value Radom errors precisio Repeatability of measuremets Characteristics of tools resolutio Smallest icremet betwee measured values 17 Quatifyig Accuracy, Precisio, Resolutio Accuracy Hard to determie true accuracy Relative to a predefied stadard o E.g. defiitio of a secod Resolutio Depedet o tools Precisio Quatify amout of imprecisio usig statistical tools 18 9
10 Cofidece Iterval for the Mea 1-α α/ c1 c α/ 19 Statistical Iferece populatio statistics iferece parameters sample 0 10
11 Why do we eed statistics? A set of experimetal measuremets costitute a sample of the uderlyig process/system beig measured Use statistical techiques to ifer the true value of the metric Use statistical techiques to quatify the amout of imprecisio due to radom experimetal errors Assumptio: radom errors ormally distributed 1 Iterval Estimate a b The iterval estimate of the populatio parameter will have a specified cofidece or probability of correctly estimatig the populatio parameter. 11
12 Properties of Poit Estimators I statistics, poit estimatio ivolves the use of sample data to calculate a sigle value which is to serve as a "best guess" for a ukow (fixed or radom) populatio parameter. Example of poit estimator: sample mea. Properties: Ubiasedess: the expected value of all possible sample statistics (of give size ) is equal to the populatio parameter. E[ X ] = µ E [ s ] =! Efficiecy: precisio as estimator of the populatio parameter. Cosistecy: as the sample size icreases the sample statistic becomes a better estimator of the populatio parameter. 3 Ubiasedess of the Mea ' i= 1 X i X = & E$ $ E[ X ] = % ' i= # Xi!!" = ' i= 1 ' i= 1 µ 1 µ = = µ E[ X ] i = 4 1
13 Sample size= % of populatio Sample 1 Sample Sample E[sample] Populatio Error Sample Average % Sample Variace % Efficiecy (average) 18% 18% 80% Efficiecy (variace) 59% 1% 173% 5 Sample size = 87 10% of populatio Sample 1 Sample Sample Populatio % Rel. Error Sample Average % Sample Variace % Efficiecy (average) 7.5% 5.7% 4.5% Efficiecy (variace).9% 10.0% 0.1% 6 13
14 Cofidece Iterval Estimatio of the Mea Kow populatio stadard deviatio. Ukow populatio stadard deviatio: Large samples: sample stadard deviatio is a good estimate for populatio stadard deviatio. OK to use ormal distributio. Small samples ad origial variable is ormally distributed: use t distributio with -1 degrees of freedom. 7 Cetral Limit Theorem If the observatios i a sample are idepedet ad come from the same populatio that has mea µ ad stadard deviatio σ the the sample mea for large samples has a ormal distributio with mea µ ad stadard deviatio σ/ x ~ N( µ,! / ) The stadard deviatio of the sample mea is called the stadard error. 8 14
15 c =Q(1-α/) c 1 =Q(α/) µ N ( µ,! / ) 0 1-α α/ α/ c 1 c 1-α α/ α/ x 1 0 x c N(0,1) = c1 = µ + µ + " " z1#! / z! / = x =z 1-α/ x 1 =z α/ = - z 1-α/ µ # " z1#! / 9 Cofidece Iterval - large (>30) samples 100 (1-α)% cofidece iterval for the populatio mea: s ( x " z1 "! /, x + z1 "! / s ) x : sample mea s: sample stadard deviatio : sample size z : (1-α/)-quatile of a uit ormal variate ( N(0,1)). 1 "! / 30 15
16 Populatio Sample Average Sample Variace Efficiecy (average) 7.5% 5.7% 4.5% Efficiecy (variace).9% 10.0% 0.1% 95% iterval lower % iterval upper Mea i iterval YES YES YES 99% iterval lower % iterval upper Mea i iterval YES YES YES 90% iterval lower % iterval upper Mea i iterval YES YES YES I Excel: ½ iterval = CONFIDENCE(1-0.95,s,) iterval size α Note that the higher the cofidece level the larger the iterval 31 µ Sample Iterval iclude µ? 1 YES YES 3 NO 100 YES 100 (1 α) of the 100 samples iclude the populatio mea µ. 3 16
17 Cofidece Iterval Estimatio of the Mea Kow populatio stadard deviatio. Ukow populatio stadard deviatio: Large samples: sample stadard deviatio is a good estimate for populatio stadard deviatio. OK to use ormal distributio. Small samples ad origial variable is ormally distributed: use t distributio with -1 degrees of freedom. 33 Studet s t distributio t( v) ~ N(0,1)! ( v) / v v: umber of degrees of freedom. v! ( ) : chi-square distributio with v degrees of freedom. Equal to the sum of squares of v uit ormal variates. the pdf of a t-variate is similar to that of a N(0,1). for v > 30 a t distributio ca be approximated by N(0,1)
18 Cofidece Iterval (small samples) For samples from a ormal distributio N(µ,σ ), (X " µ) /(# / ) has a N(0,1) distributio ad ( "1)s /# has a chi-square distributio with -1 degrees of freedom Thus, (X " µ) / s / has a t distributio with -1 degrees of freedom 35 Cofidece Iterval (small samples, ormally distributed populatio) 100 (1-α)% cofidece iterval for the populatio mea: x s ( x! t[1!" / ;! 1], x + t[1!" / ;! 1] s ) : sample mea s: sample stadard deviatio : sample size t [ 1! " / ;! 1] : critical value of the t distributio with -1 degrees of freedom for a area of α/ for the upper tail
19 Usig the t Distributio. Sample size= E[sample] Populatio Error Sample Average % Sample Variace % Efficiecy (average) 18% 18% 80% Efficiecy (variace) 59% 1% 173% 95% iterval lower %,-1 critical value % iterval upper Mea i iteval YES YES YES I Excel: TINV(1-0.95,15-1) α 37 How may measuremets do we eed for a desired iterval width? Width of iterval iversely proportioal to Wat to miimize umber of measuremets Fid cofidece iterval for mea, such that: Pr(actual mea i iterval) = (1 α) [(1! e) x,(1 e x] ( 1 + c, c ) = ) 38 19
20 How may measuremets? z ( c 1' ( / 1, c) = (1 m e) = x m z s = xe & = $ % z 1' ( / 1' ( / xe x s #! " s 39 How may measuremets? But depeds o kowig mea ad stadard deviatio! Estimate s with small umber of measuremets Use this s to fid eeded for desired iterval width 40 0
21 How may measuremets? Mea = 7.94 s Stadard deviatio =.14 s Wat 90% cofidece mea is withi 7% of actual mea. 41 How may measuremets? Mea = 7.94 s Stadard deviatio =.14 s Wat 90% cofidece mea is withi 7% of actual mea. α = 0.90 (1-α/) = 0.95 Error = ± 3.5% e =
22 How may measuremets? & z1 '( / s # = = $ % xe! " & 1.895(.14) # = $! % 0.035(7.94) " measuremets 90% chace true mea is withi ± 3.5% iterval 43 Cofidece Iterval Estimates for Proportios
23 Cofidece Iterval for Proportios For categorical data: E.g. file types {html, html, gif, jpg, html, pdf, ps, html, pdf } If 1 of observatios are of type html, the the sample proportio of html files is p = 1 /. The populatio proportio is π. Goal: provide cofidece iterval for the populatio proportio π. 45 Cofidece Iterval for Proportios The samplig distributio of the proportio formed by computig p from all possible samples of size from a populatio of size N with replacemet teds to a ormal with mea π ad stadard error! ( 1#! ). " p = The ormal distributio is beig used to approximate the biomial. So, "!
24 Cofidece Iterval for Proportios The (1-α)% cofidece iterval for π is p(1! p), p + z ( p! z1! " / 1! " / p(1! p) ) p: sample proportio. : sample size z : (1-α/)-quatile of a uit ormal variate ( N(0,1)). 1 "! / 47 Example 1 Oe thousad etries are selected from a Web log. Six hudred ad fifty correspod to gif files. Fid 90% ad 95% cofidece itervals for the proportio of files that are gif files. Sample size () 1000 No. gif files i sample 650 Sample proportio (p) 0.65 *p 650 > 10 OK 90% cofidece iterval alpha alpha/ 0.95 z Lower boud 0.65 Upper boud % cofidece iterval alpha alpha/ z Lower boud 0.60 Upper boud I Excel: NORMSINV(1-0.1/) NORMSINV(1-0.05/) 48 4
25 Example How much time does processor sped i OS? Iterrupt every 10 ms Icremet couters = umber of iterrupts m = umber of iterrupts whe PC withi OS 49 Proportios How much time does processor sped i OS? Iterrupt every 10 ms Icremet couters = umber of iterrupts m = umber of iterrupts whe PC withi OS Ru for 1 miute = 6000 m =
26 Proportios p(1! p) ( c 1, c) = p m z1!" / (1! ) = m = (0.1018,0.1176) 95% cofidece iterval for proportio So 95% certai processor speds % of its time i OS 51 Number of measuremets for proportios (1! e) p = ep = = p! z z 1! " / z 1! " / 1! " / p(1! p(1! p) p(1! ( ep) p) p) 5 6
27 Number of measuremets for proportios How log to ru OS experimet? Wat 95% cofidece ± 0.5% 53 Number of measuremets for proportios How log to ru OS experimet? Wat 95% cofidece ± 0.5% e = p =
28 Number of measuremets for proportios = z 1"# / p (1" p ) (ep) = (1.960) (0.1097)(1" ) 0.005(0.1097) [ ] =1,47,10 10 ms iterrupts 3.46 hours 55 Cofidece Iterval Estimatio for Variaces 8
29 Cofidece Iterval for the Variace If the origial variable is ormally distributed the the chi-square distributio ca be used to develop a cofidece iterval estimate of the populatio variace. The (1-α)% cofidece iterval for! is ( # 1) s! U $ " $ ( # 1) s! L! L! U : lower critical value of : upper critical value of!! 57 Chi-square distributio Not symmetric! α/ Q(α/) 1-α α/ Q(1-α/) 58 9
30 95% cofidece iterval for the populatio variace for a sample of size 100 for a N(3,) populatio. 1-α/ average variace.1716 std deviatio lower critical value of chi-square for 95% upper critical value of chi-square for 95% lower boud for cofidece iterval for the variace upper boud for cofidece iterval for the variace I Excel: CHIINV (0.975, 99) CHIINV (0.05, 99) α/ The populatio variace (4 i this case) is i the iterval (3.6343, 6.36) with 95% cofidece. 59 Cofidece Iterval for the Variace If the populatio is ot ormally distributed, the cofidece iterval, especially for small samples, is ot very accurate
31 Key Assumptio Measuremet errors are Normally distributed. Is this true for most measuremets o real computer systems? 1-α α/ c1 c α/ 61 Key Assumptio Saved by the Cetral Limit Theorem Sum of a large umber of values from ay distributio will be Normally (Gaussia) distributed. What is a large umber? Typically assumed to be > 6 or
32 Normalizig data for cofidece itervals If the uderlyig distributio of the data beig measured is ot ormal, the the data must be ormalized Fid the arithmetic mea of four or more radomly selected measuremets Fid cofidece itervals for the meas of these average values o We ca o loger obtai a cofidece iterval for the idividual values o Variace for the aggregated evets teds to be smaller tha the variace of the idividual evets 63 Summary Use statistics to Deal with oisy measuremets Estimate the true value from sample data Errors i measuremets are due to: Accuracy, precisio, resolutio of tools Other sources of oise Systematic, radom errors 64 3
33 Summary (cot d): Model errors with bell curve Accuracy Precisio Mea of measured values Resolutio True value 65 Summary (cot d) Use cofidece itervals to quatify precisio Cofidece itervals for Mea of samples Proportios Variace Cofidece level Pr(populatio parameter withi computed iterval) Compute umber of measuremets eeded for desired iterval width 66 33
7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals
7-1 Chapter 4 Part I. Samplig Distributios ad Cofidece Itervals 1 7- Sectio 1. Samplig Distributio 7-3 Usig Statistics Statistical Iferece: Predict ad forecast values of populatio parameters... Test hypotheses
More informationTopic 9: Sampling Distributions of Estimators
Topic 9: Samplig Distributios of Estimators Course 003, 2016 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be
More informationChapter 6 Sampling Distributions
Chapter 6 Samplig Distributios 1 I most experimets, we have more tha oe measuremet for ay give variable, each measuremet beig associated with oe radomly selected a member of a populatio. Hece we eed to
More informationTopic 9: Sampling Distributions of Estimators
Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be
More informationTopic 9: Sampling Distributions of Estimators
Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be
More informationLinear Regression Models
Liear Regressio Models Dr. Joh Mellor-Crummey Departmet of Computer Sciece Rice Uiversity johmc@cs.rice.edu COMP 528 Lecture 9 15 February 2005 Goals for Today Uderstad how to Use scatter diagrams to ispect
More informationStatistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.
Statistical Iferece (Chapter 10) Statistical iferece = lear about a populatio based o the iformatio provided by a sample. Populatio: The set of all values of a radom variable X of iterest. Characterized
More informationRandom Variables, Sampling and Estimation
Chapter 1 Radom Variables, Samplig ad Estimatio 1.1 Itroductio This chapter will cover the most importat basic statistical theory you eed i order to uderstad the ecoometric material that will be comig
More informationMOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.
XI-1 (1074) MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. R. E. D. WOOLSEY AND H. S. SWANSON XI-2 (1075) STATISTICAL DECISION MAKING Advaced
More informationModule 1 Fundamentals in statistics
Normal Distributio Repeated observatios that differ because of experimetal error ofte vary about some cetral value i a roughly symmetrical distributio i which small deviatios occur much more frequetly
More informationLecture 5. Materials Covered: Chapter 6 Suggested Exercises: 6.7, 6.9, 6.17, 6.20, 6.21, 6.41, 6.49, 6.52, 6.53, 6.62, 6.63.
STT 315, Summer 006 Lecture 5 Materials Covered: Chapter 6 Suggested Exercises: 67, 69, 617, 60, 61, 641, 649, 65, 653, 66, 663 1 Defiitios Cofidece Iterval: A cofidece iterval is a iterval believed to
More information1 Inferential Methods for Correlation and Regression Analysis
1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet
More informationStatistical inference: example 1. Inferential Statistics
Statistical iferece: example 1 Iferetial Statistics POPULATION SAMPLE A clothig store chai regularly buys from a supplier large quatities of a certai piece of clothig. Each item ca be classified either
More informationMBACATÓLICA. Quantitative Methods. Faculdade de Ciências Económicas e Empresariais UNIVERSIDADE CATÓLICA PORTUGUESA 9. SAMPLING DISTRIBUTIONS
MBACATÓLICA Quatitative Methods Miguel Gouveia Mauel Leite Moteiro Faculdade de Ciêcias Ecoómicas e Empresariais UNIVERSIDADE CATÓLICA PORTUGUESA 9. SAMPLING DISTRIBUTIONS MBACatólica 006/07 Métodos Quatitativos
More informationMATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4
MATH 30: Probability ad Statistics 9. Estimatio ad Testig of Parameters Estimatio ad Testig of Parameters We have bee dealig situatios i which we have full kowledge of the distributio of a radom variable.
More informationSampling Distributions, Z-Tests, Power
Samplig Distributios, Z-Tests, Power We draw ifereces about populatio parameters from sample statistics Sample proportio approximates populatio proportio Sample mea approximates populatio mea Sample variace
More informationInterval Estimation (Confidence Interval = C.I.): An interval estimate of some population parameter is an interval of the form (, ),
Cofidece Iterval Estimatio Problems Suppose we have a populatio with some ukow parameter(s). Example: Normal(,) ad are parameters. We eed to draw coclusios (make ifereces) about the ukow parameters. We
More information(7 One- and Two-Sample Estimation Problem )
34 Stat Lecture Notes (7 Oe- ad Two-Sample Estimatio Problem ) ( Book*: Chapter 8,pg65) Probability& Statistics for Egieers & Scietists By Walpole, Myers, Myers, Ye Estimatio 1 ) ( ˆ S P i i Poit estimate:
More informationHypothesis Testing. Evaluation of Performance of Learned h. Issues. Trade-off Between Bias and Variance
Hypothesis Testig Empirically evaluatig accuracy of hypotheses: importat activity i ML. Three questios: Give observed accuracy over a sample set, how well does this estimate apply over additioal samples?
More informationTopic 10: Introduction to Estimation
Topic 0: Itroductio to Estimatio Jue, 0 Itroductio I the simplest possible terms, the goal of estimatio theory is to aswer the questio: What is that umber? What is the legth, the reactio rate, the fractio
More informationMATH/STAT 352: Lecture 15
MATH/STAT 352: Lecture 15 Sectios 5.2 ad 5.3. Large sample CI for a proportio ad small sample CI for a mea. 1 5.2: Cofidece Iterval for a Proportio Estimatig proportio of successes i a biomial experimet
More informationBig Picture. 5. Data, Estimates, and Models: quantifying the accuracy of estimates.
5. Data, Estimates, ad Models: quatifyig the accuracy of estimates. 5. Estimatig a Normal Mea 5.2 The Distributio of the Normal Sample Mea 5.3 Normal data, cofidece iterval for, kow 5.4 Normal data, cofidece
More informationConfidence Level We want to estimate the true mean of a random variable X economically and with confidence.
Cofidece Iterval 700 Samples Sample Mea 03 Cofidece Level 095 Margi of Error 0037 We wat to estimate the true mea of a radom variable X ecoomically ad with cofidece True Mea μ from the Etire Populatio
More informationStatistics 511 Additional Materials
Cofidece Itervals o mu Statistics 511 Additioal Materials This topic officially moves us from probability to statistics. We begi to discuss makig ifereces about the populatio. Oe way to differetiate probability
More informationStat 421-SP2012 Interval Estimation Section
Stat 41-SP01 Iterval Estimatio Sectio 11.1-11. We ow uderstad (Chapter 10) how to fid poit estimators of a ukow parameter. o However, a poit estimate does ot provide ay iformatio about the ucertaity (possible
More informationAnalysis of Experimental Data
Aalysis of Experimetal Data 6544597.0479 ± 0.000005 g Quatitative Ucertaity Accuracy vs. Precisio Whe we make a measuremet i the laboratory, we eed to kow how good it is. We wat our measuremets to be both
More informationEcon 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara
Poit Estimator Eco 325 Notes o Poit Estimator ad Cofidece Iterval 1 By Hiro Kasahara Parameter, Estimator, ad Estimate The ormal probability desity fuctio is fully characterized by two costats: populatio
More informationENGI 4421 Confidence Intervals (Two Samples) Page 12-01
ENGI 44 Cofidece Itervals (Two Samples) Page -0 Two Sample Cofidece Iterval for a Differece i Populatio Meas [Navidi sectios 5.4-5.7; Devore chapter 9] From the cetral limit theorem, we kow that, for sufficietly
More informationDirection: This test is worth 250 points. You are required to complete this test within 50 minutes.
Term Test October 3, 003 Name Math 56 Studet Number Directio: This test is worth 50 poits. You are required to complete this test withi 50 miutes. I order to receive full credit, aswer each problem completely
More informationDiscrete Mathematics for CS Spring 2008 David Wagner Note 22
CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 22 I.I.D. Radom Variables Estimatig the bias of a coi Questio: We wat to estimate the proportio p of Democrats i the US populatio, by takig
More informationChapter 6. Sampling and Estimation
Samplig ad Estimatio - 34 Chapter 6. Samplig ad Estimatio 6.. Itroductio Frequetly the egieer is uable to completely characterize the etire populatio. She/he must be satisfied with examiig some subset
More informationBIOS 4110: Introduction to Biostatistics. Breheny. Lab #9
BIOS 4110: Itroductio to Biostatistics Brehey Lab #9 The Cetral Limit Theorem is very importat i the realm of statistics, ad today's lab will explore the applicatio of it i both categorical ad cotiuous
More information- E < p. ˆ p q ˆ E = q ˆ = 1 - p ˆ = sample proportion of x failures in a sample size of n. where. x n sample proportion. population proportion
1 Chapter 7 ad 8 Review for Exam Chapter 7 Estimates ad Sample Sizes 2 Defiitio Cofidece Iterval (or Iterval Estimate) a rage (or a iterval) of values used to estimate the true value of the populatio parameter
More informationChapter 8: Estimating with Confidence
Chapter 8: Estimatig with Cofidece Sectio 8.2 The Practice of Statistics, 4 th editio For AP* STARNES, YATES, MOORE Chapter 8 Estimatig with Cofidece 8.1 Cofidece Itervals: The Basics 8.2 8.3 Estimatig
More informationCHAPTER 8 FUNDAMENTAL SAMPLING DISTRIBUTIONS AND DATA DESCRIPTIONS. 8.1 Random Sampling. 8.2 Some Important Statistics
CHAPTER 8 FUNDAMENTAL SAMPLING DISTRIBUTIONS AND DATA DESCRIPTIONS 8.1 Radom Samplig The basic idea of the statistical iferece is that we are allowed to draw ifereces or coclusios about a populatio based
More informationStatisticians use the word population to refer the total number of (potential) observations under consideration
6 Samplig Distributios Statisticias use the word populatio to refer the total umber of (potetial) observatios uder cosideratio The populatio is just the set of all possible outcomes i our sample space
More informationLecture 5: Parametric Hypothesis Testing: Comparing Means. GENOME 560, Spring 2016 Doug Fowler, GS
Lecture 5: Parametric Hypothesis Testig: Comparig Meas GENOME 560, Sprig 2016 Doug Fowler, GS (dfowler@uw.edu) 1 Review from last week What is a cofidece iterval? 2 Review from last week What is a cofidece
More informationContinuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised
Questio 1. (Topics 1-3) A populatio cosists of all the members of a group about which you wat to draw a coclusio (Greek letters (μ, σ, Ν) are used) A sample is the portio of the populatio selected for
More informationParameter, Statistic and Random Samples
Parameter, Statistic ad Radom Samples A parameter is a umber that describes the populatio. It is a fixed umber, but i practice we do ot kow its value. A statistic is a fuctio of the sample data, i.e.,
More informationParameter, Statistic and Random Samples
Parameter, Statistic ad Radom Samples A parameter is a umber that describes the populatio. It is a fixed umber, but i practice we do ot kow its value. A statistic is a fuctio of the sample data, i.e.,
More informationInferential Statistics. Inference Process. Inferential Statistics and Probability a Holistic Approach. Inference Process.
Iferetial Statistics ad Probability a Holistic Approach Iferece Process Chapter 8 Poit Estimatio ad Cofidece Itervals This Course Material by Maurice Geraghty is licesed uder a Creative Commos Attributio-ShareAlike
More informationAP Statistics Review Ch. 8
AP Statistics Review Ch. 8 Name 1. Each figure below displays the samplig distributio of a statistic used to estimate a parameter. The true value of the populatio parameter is marked o each samplig distributio.
More informationUNIT 8: INTRODUCTION TO INTERVAL ESTIMATION
STATISTICAL METHODS FOR BUSINESS UNIT 8: INTRODUCTION TO INTERVAL ESTIMATION 8..- Itroductio to iterval estimatio 8..- Cofidece itervals. Costructio ad characteristics 8.3.- Cofidece itervals for the mea
More informationExpectation and Variance of a random variable
Chapter 11 Expectatio ad Variace of a radom variable The aim of this lecture is to defie ad itroduce mathematical Expectatio ad variace of a fuctio of discrete & cotiuous radom variables ad the distributio
More informationFACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures
FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING Lectures MODULE 5 STATISTICS II. Mea ad stadard error of sample data. Biomial distributio. Normal distributio 4. Samplig 5. Cofidece itervals
More informationSimulation. Two Rule For Inverting A Distribution Function
Simulatio Two Rule For Ivertig A Distributio Fuctio Rule 1. If F(x) = u is costat o a iterval [x 1, x 2 ), the the uiform value u is mapped oto x 2 through the iversio process. Rule 2. If there is a jump
More informationDirection: This test is worth 150 points. You are required to complete this test within 55 minutes.
Term Test 3 (Part A) November 1, 004 Name Math 6 Studet Number Directio: This test is worth 10 poits. You are required to complete this test withi miutes. I order to receive full credit, aswer each problem
More informationProperties and Hypothesis Testing
Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.
More informationLecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting
Lecture 6 Chi Square Distributio (χ ) ad Least Squares Fittig Chi Square Distributio (χ ) Suppose: We have a set of measuremets {x 1, x, x }. We kow the true value of each x i (x t1, x t, x t ). We would
More informationSample Size Determination (Two or More Samples)
Sample Sie Determiatio (Two or More Samples) STATGRAPHICS Rev. 963 Summary... Data Iput... Aalysis Summary... 5 Power Curve... 5 Calculatios... 6 Summary This procedure determies a suitable sample sie
More informationLecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting
Lecture 6 Chi Square Distributio (χ ) ad Least Squares Fittig Chi Square Distributio (χ ) Suppose: We have a set of measuremets {x 1, x, x }. We kow the true value of each x i (x t1, x t, x t ). We would
More informationClass 23. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700
Class 23 Daiel B. Rowe, Ph.D. Departmet of Mathematics, Statistics, ad Computer Sciece Copyright 2017 by D.B. Rowe 1 Ageda: Recap Chapter 9.1 Lecture Chapter 9.2 Review Exam 6 Problem Solvig Sessio. 2
More informationBIOSTATISTICS. Lecture 5 Interval Estimations for Mean and Proportion. dr. Petr Nazarov
Microarray Ceter BIOSTATISTICS Lecture 5 Iterval Estimatios for Mea ad Proportio dr. Petr Nazarov 15-03-013 petr.azarov@crp-sate.lu Lecture 5. Iterval estimatio for mea ad proportio OUTLINE Iterval estimatios
More informationBayesian Methods: Introduction to Multi-parameter Models
Bayesia Methods: Itroductio to Multi-parameter Models Parameter: θ = ( θ, θ) Give Likelihood p(y θ) ad prior p(θ ), the posterior p proportioal to p(y θ) x p(θ ) Margial posterior ( θ, θ y) is Iterested
More informationNCSS Statistical Software. Tolerance Intervals
Chapter 585 Itroductio This procedure calculates oe-, ad two-, sided tolerace itervals based o either a distributio-free (oparametric) method or a method based o a ormality assumptio (parametric). A two-sided
More informationStatistical Intervals for a Single Sample
3/5/06 Applied Statistics ad Probability for Egieers Sixth Editio Douglas C. Motgomery George C. Ruger Chapter 8 Statistical Itervals for a Sigle Sample 8 CHAPTER OUTLINE 8- Cofidece Iterval o the Mea
More informationSTAT 350 Handout 19 Sampling Distribution, Central Limit Theorem (6.6)
STAT 350 Hadout 9 Samplig Distributio, Cetral Limit Theorem (6.6) A radom sample is a sequece of radom variables X, X 2,, X that are idepedet ad idetically distributed. o This property is ofte abbreviated
More informationLecture 2: Monte Carlo Simulation
STAT/Q SCI 43: Itroductio to Resamplig ethods Sprig 27 Istructor: Ye-Chi Che Lecture 2: ote Carlo Simulatio 2 ote Carlo Itegratio Assume we wat to evaluate the followig itegratio: e x3 dx What ca we do?
More informationActivity 3: Length Measurements with the Four-Sided Meter Stick
Activity 3: Legth Measuremets with the Four-Sided Meter Stick OBJECTIVE: The purpose of this experimet is to study errors ad the propagatio of errors whe experimetal data derived usig a four-sided meter
More informationError & Uncertainty. Error. More on errors. Uncertainty. Page # The error is the difference between a TRUE value, x, and a MEASURED value, x i :
Error Error & Ucertaity The error is the differece betwee a TRUE value,, ad a MEASURED value, i : E = i There is o error-free measuremet. The sigificace of a measuremet caot be judged uless the associate
More informationTABLES AND FORMULAS FOR MOORE Basic Practice of Statistics
TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics Explorig Data: Distributios Look for overall patter (shape, ceter, spread) ad deviatios (outliers). Mea (use a calculator): x = x 1 + x 2 + +
More informationChapter 7 Student Lecture Notes 7-1
Chapter 7 Studet Lecture otes 7-1 Basic Busiess Statistics (9 th Editio) Chapter 7 Samplig Distributios 24 Pretice-Hall, Ic. Chap 7-1 Chapter Topics Samplig Distributio of the Mea The Cetral Limit Theorem
More informationClass 27. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700
Class 7 Daiel B. Rowe, Ph.D. Departmet of Mathematics, Statistics, ad Computer Sciece Copyright 013 by D.B. Rowe 1 Ageda: Skip Recap Chapter 10.5 ad 10.6 Lecture Chapter 11.1-11. Review Chapters 9 ad 10
More informationTABLES AND FORMULAS FOR MOORE Basic Practice of Statistics
TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics Explorig Data: Distributios Look for overall patter (shape, ceter, spread) ad deviatios (outliers). Mea (use a calculator): x = x 1 + x 2 + +
More informationConfidence Intervals รศ.ดร. อน นต ผลเพ ม Assoc.Prof. Anan Phonphoem, Ph.D. Intelligent Wireless Network Group (IWING Lab)
Cofidece Itervals รศ.ดร. อน นต ผลเพ ม Assoc.Prof. Aa Phophoem, Ph.D. aa.p@ku.ac.th Itelliget Wireless Network Group (IWING Lab) http://iwig.cpe.ku.ac.th Computer Egieerig Departmet Kasetsart Uiversity,
More informationInstructor: Judith Canner Spring 2010 CONFIDENCE INTERVALS How do we make inferences about the population parameters?
CONFIDENCE INTERVALS How do we make ifereces about the populatio parameters? The samplig distributio allows us to quatify the variability i sample statistics icludig how they differ from the parameter
More informationA quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population
A quick activity - Cetral Limit Theorem ad Proportios Lecture 21: Testig Proportios Statistics 10 Coli Rudel Flip a coi 30 times this is goig to get loud! Record the umber of heads you obtaied ad calculate
More informationSTATS 200: Introduction to Statistical Inference. Lecture 1: Course introduction and polling
STATS 200: Itroductio to Statistical Iferece Lecture 1: Course itroductio ad pollig U.S. presidetial electio projectios by state (Source: fivethirtyeight.com, 25 September 2016) Pollig Let s try to uderstad
More informationDS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10
DS 00: Priciples ad Techiques of Data Sciece Date: April 3, 208 Name: Hypothesis Testig Discussio #0. Defie these terms below as they relate to hypothesis testig. a) Data Geeratio Model: Solutio: A set
More informationAgreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times
Sigificace level vs. cofidece level Agreemet of CI ad HT Lecture 13 - Tests of Proportios Sta102 / BME102 Coli Rudel October 15, 2014 Cofidece itervals ad hypothesis tests (almost) always agree, as log
More informationConfidence Intervals QMET103
Cofidece Itervals QMET103 Library, Teachig ad Learig CONFIDENCE INTERVALS provide a iterval estimate of the ukow populatio parameter. What is a cofidece iterval? Statisticias have a habit of hedgig their
More informationLecture 3. Properties of Summary Statistics: Sampling Distribution
Lecture 3 Properties of Summary Statistics: Samplig Distributio Mai Theme How ca we use math to justify that our umerical summaries from the sample are good summaries of the populatio? Lecture Summary
More informationExam 2 Instructions not multiple versions
Exam 2 Istructios Remove this sheet of istructios from your exam. You may use the back of this sheet for scratch work. This is a closed book, closed otes exam. You are ot allowed to use ay materials other
More informationLecture 7: Non-parametric Comparison of Location. GENOME 560, Spring 2016 Doug Fowler, GS
Lecture 7: No-parametric Compariso of Locatio GENOME 560, Sprig 2016 Doug Fowler, GS (dfowler@uw.edu) 1 Review How ca we set a cofidece iterval o a proportio? 2 Review How ca we set a cofidece iterval
More informationBinomial Distribution
0.0 0.5 1.0 1.5 2.0 2.5 3.0 0 1 2 3 4 5 6 7 0.0 0.5 1.0 1.5 2.0 2.5 3.0 Overview Example: coi tossed three times Defiitio Formula Recall that a r.v. is discrete if there are either a fiite umber of possible
More informationOctober 25, 2018 BIM 105 Probability and Statistics for Biomedical Engineers 1
October 25, 2018 BIM 105 Probability ad Statistics for Biomedical Egieers 1 Populatio parameters ad Sample Statistics October 25, 2018 BIM 105 Probability ad Statistics for Biomedical Egieers 2 Ifereces
More informationLecture 5. Random variable and distribution of probability
Itroductio to theory of probability ad statistics Lecture 5. Radom variable ad distributio of probability prof. dr hab.iż. Katarzya Zarzewsa Katedra Eletroii, AGH e-mail: za@agh.edu.pl http://home.agh.edu.pl/~za
More informationChapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.
Chapter 22 Comparig Two Proportios Copyright 2010 Pearso Educatio, Ic. Comparig Two Proportios Comparisos betwee two percetages are much more commo tha questios about isolated percetages. Ad they are more
More informationCommon Large/Small Sample Tests 1/55
Commo Large/Small Sample Tests 1/55 Test of Hypothesis for the Mea (σ Kow) Covert sample result ( x) to a z value Hypothesis Tests for µ Cosider the test H :μ = μ H 1 :μ > μ σ Kow (Assume the populatio
More informationRead through these prior to coming to the test and follow them when you take your test.
Math 143 Sprig 2012 Test 2 Iformatio 1 Test 2 will be give i class o Thursday April 5. Material Covered The test is cummulative, but will emphasize the recet material (Chapters 6 8, 10 11, ad Sectios 12.1
More informationLecture 1. Statistics: A science of information. Population: The population is the collection of all subjects we re interested in studying.
Lecture Mai Topics: Defiitios: Statistics, Populatio, Sample, Radom Sample, Statistical Iferece Type of Data Scales of Measuremet Describig Data with Numbers Describig Data Graphically. Defiitios. Example
More informationSTA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:
STA 2023 Module 10 Comparig Two Proportios Learig Objectives Upo completig this module, you should be able to: 1. Perform large-sample ifereces (hypothesis test ad cofidece itervals) to compare two populatio
More informationFormulas and Tables for Gerstman
Formulas ad Tables for Gerstma Measuremet ad Study Desig Biostatistics is more tha a compilatio of computatioal techiques! Measuremet scales: quatitative, ordial, categorical Iformatio quality is primary
More informationStatistics Lecture 27. Final review. Administrative Notes. Outline. Experiments. Sampling and Surveys. Administrative Notes
Admiistrative Notes s - Lecture 7 Fial review Fial Exam is Tuesday, May 0th (3-5pm Covers Chapters -8 ad 0 i textbook Brig ID cards to fial! Allowed: Calculators, double-sided 8.5 x cheat sheet Exam Rooms:
More informationSampling Error. Chapter 6 Student Lecture Notes 6-1. Business Statistics: A Decision-Making Approach, 6e. Chapter Goals
Chapter 6 Studet Lecture Notes 6-1 Busiess Statistics: A Decisio-Makig Approach 6 th Editio Chapter 6 Itroductio to Samplig Distributios Chap 6-1 Chapter Goals After completig this chapter, you should
More informationChapter 1 (Definitions)
FINAL EXAM REVIEW Chapter 1 (Defiitios) Qualitative: Nomial: Ordial: Quatitative: Ordial: Iterval: Ratio: Observatioal Study: Desiged Experimet: Samplig: Cluster: Stratified: Systematic: Coveiece: Simple
More informationBasis for simulation techniques
Basis for simulatio techiques M. Veeraraghava, March 7, 004 Estimatio is based o a collectio of experimetal outcomes, x, x,, x, where each experimetal outcome is a value of a radom variable. x i. Defiitios
More informationPower and Type II Error
Statistical Methods I (EXST 7005) Page 57 Power ad Type II Error Sice we do't actually kow the value of the true mea (or we would't be hypothesizig somethig else), we caot kow i practice the type II error
More informationUNIVERSITY OF TORONTO Faculty of Arts and Science APRIL/MAY 2009 EXAMINATIONS ECO220Y1Y PART 1 OF 2 SOLUTIONS
PART of UNIVERSITY OF TORONTO Faculty of Arts ad Sciece APRIL/MAY 009 EAMINATIONS ECO0YY PART OF () The sample media is greater tha the sample mea whe there is. (B) () A radom variable is ormally distributed
More informationKLMED8004 Medical statistics. Part I, autumn Estimation. We have previously learned: Population and sample. New questions
We have previously leared: KLMED8004 Medical statistics Part I, autum 00 How kow probability distributios (e.g. biomial distributio, ormal distributio) with kow populatio parameters (mea, variace) ca give
More informationLecture 22: Review for Exam 2. 1 Basic Model Assumptions (without Gaussian Noise)
Lecture 22: Review for Exam 2 Basic Model Assumptios (without Gaussia Noise) We model oe cotiuous respose variable Y, as a liear fuctio of p umerical predictors, plus oise: Y = β 0 + β X +... β p X p +
More informationChapter 8: STATISTICAL INTERVALS FOR A SINGLE SAMPLE. Part 3: Summary of CI for µ Confidence Interval for a Population Proportion p
Chapter 8: STATISTICAL INTERVALS FOR A SINGLE SAMPLE Part 3: Summary of CI for µ Cofidece Iterval for a Populatio Proportio p Sectio 8-4 Summary for creatig a 100(1-α)% CI for µ: Whe σ 2 is kow ad paret
More informationExam II Covers. STA 291 Lecture 19. Exam II Next Tuesday 5-7pm Memorial Hall (Same place as exam I) Makeup Exam 7:15pm 9:15pm Location CB 234
STA 291 Lecture 19 Exam II Next Tuesday 5-7pm Memorial Hall (Same place as exam I) Makeup Exam 7:15pm 9:15pm Locatio CB 234 STA 291 - Lecture 19 1 Exam II Covers Chapter 9 10.1; 10.2; 10.3; 10.4; 10.6
More informationBiostatistics for Med Students. Lecture 2
Biostatistics for Med Studets Lecture 2 Joh J. Che, Ph.D. Professor & Director of Biostatistics Core UH JABSOM JABSOM MD7 February 22, 2017 Lecture Objectives To uderstad basic research desig priciples
More informationµ and π p i.e. Point Estimation x And, more generally, the population proportion is approximately equal to a sample proportion
Poit Estimatio Poit estimatio is the rather simplistic (ad obvious) process of usig the kow value of a sample statistic as a approximatio to the ukow value of a populatio parameter. So we could for example
More informationLecture 4. Random variable and distribution of probability
Itroductio to theory of probability ad statistics Lecture. Radom variable ad distributio of probability dr hab.iż. Katarzya Zarzewsa, prof.agh Katedra Eletroii, AGH e-mail: za@agh.edu.pl http://home.agh.edu.pl/~za
More informationComparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading
Topic 15 - Two Sample Iferece I STAT 511 Professor Bruce Craig Comparig Two Populatios Research ofte ivolves the compariso of two or more samples from differet populatios Graphical summaries provide visual
More informationEconomics Spring 2015
1 Ecoomics 400 -- Sprig 015 /17/015 pp. 30-38; Ch. 7.1.4-7. New Stata Assigmet ad ew MyStatlab assigmet, both due Feb 4th Midterm Exam Thursday Feb 6th, Chapters 1-7 of Groeber text ad all relevat lectures
More information1.010 Uncertainty in Engineering Fall 2008
MIT OpeCourseWare http://ocw.mit.edu.00 Ucertaity i Egieerig Fall 2008 For iformatio about citig these materials or our Terms of Use, visit: http://ocw.mit.edu.terms. .00 - Brief Notes # 9 Poit ad Iterval
More informationProbability and statistics: basic terms
Probability ad statistics: basic terms M. Veeraraghava August 203 A radom variable is a rule that assigs a umerical value to each possible outcome of a experimet. Outcomes of a experimet form the sample
More information