MA20226: STATISTICS 2A 2011/12 Assessed Coursework Sheet One. Set: Lecture 10 12:15-13:05, Thursday 3rd November, 2011 EB1.1

Size: px
Start display at page:

Download "MA20226: STATISTICS 2A 2011/12 Assessed Coursework Sheet One. Set: Lecture 10 12:15-13:05, Thursday 3rd November, 2011 EB1.1"

Transcription

1 MA20226: STATISTICS 2A 2011/12 Assessed Coursework Sheet One Preamble Set: Lecture 10 12:15-13:05, Thursday 3rd November, 2011 EB1.1 Due: Please hand the work in to the coursework drop-box on level 1 of 4W by 14:30 on Thursday 17th November, The work should be accompanied by a completed Coursework Cover Sheet, available either in lectures, the Departmental Office, or If the work is submitted after the deadline, without an agreed extension or mitigating circumstances, it will be assessed at a maximum mark of the pass mark (40%). If you submit work more than five days after the submission date, you will normally receive a mark of 0, unless you have been granted an extension or a panel has agreed that there are Individual Mitigating Circumstances (IMCs). Time: The average student should spend around three hours on this assignment. Conditions: The assignment should be your own work. You should attempt all questions. You may also consult me, or your tutor, for general advice. It should be completed during the practical session in Week 6 and in your own time. Value: This assignment carries 50% of the total marks for the coursework. Hence, it carries % of the total marks for the course. This assignment contains nine questions and there are a possible 25 marks available. Aim: The objective of this coursework is to enable you to construct confidence intervals using the statistical package R and to gain an insight into the properties of such intervals. In Section 1 we state how you can use R to find quantiles and probabilities from the normal and χ 2 -distributions. In Section 2 we look at how to construct confidence intervals while in Section 3 we investigate the properties of confidence intervals using a sampling experiment. Contact details Simon Shaw Room: 4W s.shaw@bath.ac.uk Webpage: 1

2 1 Using R to find quantiles and probabilities Rather than using tables, we may use R to find quantiles and probabilities. Normal distribution: If Z N(0, 1), let P (Z z p ) = p. For a given p, z p may be found using the command qnorm(p). For a given z p, p may be found using the command pnorm(z p ). For example, > qnorm(0.975) # finds z such that P(Z <= z) = [1] > pnorm(1.96) # P(Z <= 1.96) [1] > pnorm(qnorm(0.975)) [1] If you want to compute an upper tail probability, say P (Z > z p ), then you can either use the result that P (Z > z p ) = 1 P (Z z p ) or change the default in pnorm to calculate the upper tail. The same approach can be used with qnorm. > 1-pnorm(1.96) # P(Z > 1.96) = 1 - P(Z <= 1.96) [1] > pnorm(1.96,lower.tail=false) # calculates using the upper tail, P(Z > 1.96) [1] > qnorm(0.975) # finds z such that P(Z <= z) = [1] > qnorm(0.025) # finds z such that P(Z <= z) = [1] > qnorm(0.025,lower.tail=false) # calculates using the upper tail, so finds z such that [1] P(Z > z) = χ 2 -distribution: If χ 2 ν is a χ 2 -distribution with ν degrees of freedom, let P (χ 2 ν χ 2 ν,p) = p. For a given p, χ 2 ν,p may be found using the command qchisq(p, ν). For a given χ 2 ν,p, p may be found using the command pchisq(χ 2 ν,p, ν). If you want to work with the upper tail, say P (χ 2 ν > χ 2 ν,p), then you can either use P (χ 2 ν > χ 2 ν,p) = 1 P (χ 2 ν χ 2 ν,p) or change the default in pchisq to calculate the upper tail. The same approach can be used with qchisq. For example, > qchisq(0.95, 10) # finds y such that P(Y <= y) = 0.95 when Y is a chi-square [1] with 10 degrees of freedom > qchisq(0.05, 10) # finds y such that P(Y <= y) = 0.05 [1] > qchisq(0.05, 10, lower.tail=false) # calculates using the upper tail, finds y such that [1] P(Y > y) = 0.05 > qchisq(0.95, 10, lower.tail=false) # calculates using the upper tail, finds y such that [1] P(Y > y) = 0.95 > 1-pchisq(18.307, 10) # P(Y > ) = 1 - P(Y <= ) [1] > pchisq(18.307, 10, lower.tail=false) # calculates using the upper tail, P(Y > ) [1] Constructing confidence intervals The following function calculates a 95% confidence interval for µ when observations X 1,..., X n are assumed to be iid N(µ, σ 2 ) with σ 2 assumed known. 2

3 > CIfun <- function(i, sigmasq) # i is a vector containing the observations + # sigmasq is the known variance + n <- length(i) # find the number of elements in i + smean <- mean(i) # calculate the mean of i + z <- qnorm(0.975) # appropriate z-value + lo <- smean - z*sqrt(sigmasq/n) # lower bound + hi <- smean + z*sqrt(sigmasq/n) # upper bound + c(lo, hi) # return confidence interval as a vector + } The code may be downloaded from The following data, which are assumed to come from a normal distribution with mean µ, representing the passage time of light, and variance σ 2, may be regarded as Newcomb s measurements of the passage time of light The data set may be scanned into R from a text file using the command scan. > newcomb <- scan(" Read 20 items Assuming σ 2 = 40, we can find a 95% confidence interval for the true passage time of light > CIfun(newcomb,40) [1] Write a function, which you should call gcifun, that allows you to construct a 100(1 α)% confidence interval for µ when σ 2 is assumed known. Your function should have three arguments: the first, i, corresponding to the data vector, the second, sigmasq, corresponding to the assumed variance and a third corresponding to the value α. Thus, for example, your answers from gcifun(newcomb,40,0.05) and CIfun(newcomb,40) should be the same. Hand in a copy of your code. [2] 2. Use your function, gcifun, to calculate a 91.8% confidence interval for µ, the true passage time of light. [1] 3. What assumptions have you made in calculating the confidence interval, and to what extent do they seem justified here? [2] 4. Suppose that we now assume σ 2 to be unknown. (a) Write a function, chigcifun, that allows you to construct a 100(1 α)% confidence interval for σ 2. Hand in a copy of your code. [3] (b) Use your function to calculate a 97.2% confidence interval for σ 2 for Newcomb s measurements of the passage time of light. Comment upon whether your interval does or does not support the previously assumed value of σ 2 = 40. [3] You often need to extract elements from vectors that satisfy certain criteria. This can be done by using a relational expression instead of the index. For example, newcomb[newcomb > 0] will produce the vector whose elements are those contained in newcomb which are positive. 5. Omitting the two smallest data values in Newcomb s measurements, recalculate the 91.8% confidence interval for µ (assuming σ 2 = 40) and the 97.2% confidence interval for σ 2 (now assumed unknown). Explain carefully any differences in the results compared with those using all 20 data values. [6] 3

4 3 Confidence intervals for multiple samples R provides a simple way to execute a loop where each iteration of the loop returns a value, perhaps from the application of the same function. First, we recall that in R we may use a colon to create a sequence of integers. For example, > 1:10 [1] > 16:9 [1] The function sapply is used to perform our loop. It takes two arguments: a sequence e.g. of integers and a function with at least one argument. The function is called once for each value in the sequence. > sapply(1:10, function(i)i}) [1] > sapply(16:9,function(i)sqrt(i)}) [1] > sapply(1:5,function(i)rnorm(2)}) [,1] [,2] [,3] [,4] [,5] [1,] [2,] The final example is interesting on two counts. Firstly, although i appears in the parenthesis for the function, it does not appear in the curly brackets. sapply always requires that the function definition has at least one argument and we have used a dummy one here. Secondly, the output is a matrix. At each stage of the loop, we take a sample of two observations from the standard normal distribution: the ith iteration produces the ith column. > normsample <- sapply(1:100,function(i)rnorm(25,6,3)}) The matrix normsample contains 100 random samples of size 25 from a normal distribution with mean µ = 6 and standard deviation σ = 3 so that the variance σ 2 = 9. Note that > gcifun(normsample[,47],9,0.14) [1] produces an 86% confidence interval for µ for the 47th sample of size Use the sapply function to create a matrix, which you should call meanconf. The ith column should contain the 86% confidence interval for µ for the ith sample of size 25 with the lower bounds on the first row and the upper bounds on the second row. Thus, the result of meanconf[,47] and gcifun(normsample[,47],9,0.14) should be identical. What R command did you use to create meanconf? [1] The function ciplot can be used to plot n confidence intervals, for a parameter θ, contained in a 2 n matrix whose first row contains the lower bounds and second row the upper bounds. The true value of the parameter θ is also specified to ciplot and this is drawn on the plot. The intervals that contain the parameter are coloured red; the others blue. ciplot <- function(confint, true) n <- length(confint[1,]) # find number of confidence intervals x <- matrix(c(1:n,1:n),nrow=n,ncol=2) y <- c(t(x)) # produces vector with y[2i-1] = y[2i] = i z <- c(confint) # vector with z[2i-1] lower bound, z[2i] upper bound of ith ci plot(z, y, type="n", ylab="sample number") # plot end points of ci 4

5 abline(v=true) for (i in 1:n) a <- 2*i-1 b <- 2*i if (z[a] <= true & z[b] >= true) lines(z[a:b],y[a:b],col=2) } else lines(z[a:b],y[a:b],col=4) }}} # draw vertical line at true value of parameter # interval contains true value of parameter # join endpoints of ci with red line # join endpoints of ci with blue line The code may be downloaded from 7. How many, and why, of the intervals in meanconf do you expect to contain µ = 6 and how many actually do? (The following R commands achieve this. > mulo <- meanconf[1,] > muhi <- meanconf[2,] > sum(mulo <= 6 & muhi >= 6) Note that inside the parenthesis we have a logical vector which is TRUE if and only if the designated interval contains the value 6. The sum function then totals up how many TRUE statements occur.) Use the function ciplot to plot your 100 confidence intervals. Hand in a copy of your plot. [2] 8. Use the sapply function to create a matrix, which you should call varconf. The ith column should contain the 92% confidence interval for σ 2 for the ith sample of size 25. The lower bounds should be on the first row and the upper bounds on the second row. How many, and why, of these intervals do you expect to contain σ 2 = 9 and how many actually do? Use ciplot to plot your 100 confidence intervals. Hand in a copy of your plot. [2] 9. Briefly comment upon the two plots you have obtained, in each case making reference to the typical location of the actual parameter in the confidence interval and highlighting any differences in the two plots. [3] 5

Homework for 1/13 Due 1/22

Homework for 1/13 Due 1/22 Name: ID: Homework for 1/13 Due 1/22 1. [ 5-23] An irregularly shaped object of unknown area A is located in the unit square 0 x 1, 0 y 1. Consider a random point distributed uniformly over the square;

More information

Y i. is the sample mean basal area and µ is the population mean. The difference between them is Y µ. We know the sampling distribution

Y i. is the sample mean basal area and µ is the population mean. The difference between them is Y µ. We know the sampling distribution 7.. In this problem, we envision the sample Y, Y,..., Y 9, where Y i basal area of ith tree measured in sq inches, i,,..., 9. We assume the population distribution is N µ, 6, and µ is the population mean

More information

7 Random samples and sampling distributions

7 Random samples and sampling distributions 7 Random samples and sampling distributions 7.1 Introduction - random samples We will use the term experiment in a very general way to refer to some process, procedure or natural phenomena that produces

More information

Robustness and Distribution Assumptions

Robustness and Distribution Assumptions Chapter 1 Robustness and Distribution Assumptions 1.1 Introduction In statistics, one often works with model assumptions, i.e., one assumes that data follow a certain model. Then one makes use of methodology

More information

Chapter 8 - Statistical intervals for a single sample

Chapter 8 - Statistical intervals for a single sample Chapter 8 - Statistical intervals for a single sample 8-1 Introduction In statistics, no quantity estimated from data is known for certain. All estimated quantities have probability distributions of their

More information

MA554 Assessment 1 Cosets and Lagrange s theorem

MA554 Assessment 1 Cosets and Lagrange s theorem MA554 Assessment 1 Cosets and Lagrange s theorem These are notes on cosets and Lagrange s theorem; they go over some material from the lectures again, and they have some new material it is all examinable,

More information

MAS361. MAS361 1 Turn Over SCHOOL OF MATHEMATICS AND STATISTICS. Medical Statistics

MAS361. MAS361 1 Turn Over SCHOOL OF MATHEMATICS AND STATISTICS. Medical Statistics t r r t r r t r t s s MAS361 SCHOOL OF MATHEMATICS AND STATISTICS Medical Statistics Autumn Semester 2015 16 2 hours t s 2 r t t 1 t t r t t r s t rs t2 r t s q st s r t r t r 2 t st s rs q st s rr2 q

More information

Inference for Proportions, Variance and Standard Deviation

Inference for Proportions, Variance and Standard Deviation Inference for Proportions, Variance and Standard Deviation Sections 7.10 & 7.6 Cathy Poliak, Ph.D. cathy@math.uh.edu Office Fleming 11c Department of Mathematics University of Houston Lecture 12 Cathy

More information

Outline. Unit 3: Inferential Statistics for Continuous Data. Outline. Inferential statistics for continuous data. Inferential statistics Preliminaries

Outline. Unit 3: Inferential Statistics for Continuous Data. Outline. Inferential statistics for continuous data. Inferential statistics Preliminaries Unit 3: Inferential Statistics for Continuous Data Statistics for Linguists with R A SIGIL Course Designed by Marco Baroni 1 and Stefan Evert 1 Center for Mind/Brain Sciences (CIMeC) University of Trento,

More information

Lecture 3. Biostatistics in Veterinary Science. Feb 2, Jung-Jin Lee Drexel University. Biostatistics in Veterinary Science Lecture 3

Lecture 3. Biostatistics in Veterinary Science. Feb 2, Jung-Jin Lee Drexel University. Biostatistics in Veterinary Science Lecture 3 Lecture 3 Biostatistics in Veterinary Science Jung-Jin Lee Drexel University Feb 2, 2015 Review Let S be the sample space and A, B be events. Then 1 P (S) = 1, P ( ) = 0. 2 If A B, then P (A) P (B). In

More information

Hypothesis Testing One Sample Tests

Hypothesis Testing One Sample Tests STATISTICS Lecture no. 13 Department of Econometrics FEM UO Brno office 69a, tel. 973 442029 email:jiri.neubauer@unob.cz 12. 1. 2010 Tests on Mean of a Normal distribution Tests on Variance of a Normal

More information

MODULE 9 NORMAL DISTRIBUTION

MODULE 9 NORMAL DISTRIBUTION MODULE 9 NORMAL DISTRIBUTION Contents 9.1 Characteristics of a Normal Distribution........................... 62 9.2 Simple Areas Under the Curve................................. 63 9.3 Forward Calculations......................................

More information

Content by Week Week of October 14 27

Content by Week Week of October 14 27 Content by Week Week of October 14 27 Learning objectives By the end of this week, you should be able to: Understand the purpose and interpretation of confidence intervals for the mean, Calculate confidence

More information

STAT 513 fa 2018 Lec 02

STAT 513 fa 2018 Lec 02 STAT 513 fa 2018 Lec 02 Inference about the mean and variance of a Normal population Karl B. Gregory Fall 2018 Inference about the mean and variance of a Normal population Here we consider the case in

More information

Confidence Intervals. Confidence interval for sample mean. Confidence interval for sample mean. Confidence interval for sample mean

Confidence Intervals. Confidence interval for sample mean. Confidence interval for sample mean. Confidence interval for sample mean Confidence Intervals Confidence interval for sample mean The CLT tells us: as the sample size n increases, the sample mean is approximately Normal with mean and standard deviation Thus, we have a standard

More information

MATH 360. Probablity Final Examination December 21, 2011 (2:00 pm - 5:00 pm)

MATH 360. Probablity Final Examination December 21, 2011 (2:00 pm - 5:00 pm) Name: MATH 360. Probablity Final Examination December 21, 2011 (2:00 pm - 5:00 pm) Instructions: The total score is 200 points. There are ten problems. Point values per problem are shown besides the questions.

More information

Post-exam 2 practice questions 18.05, Spring 2014

Post-exam 2 practice questions 18.05, Spring 2014 Post-exam 2 practice questions 18.05, Spring 2014 Note: This is a set of practice problems for the material that came after exam 2. In preparing for the final you should use the previous review materials,

More information

The Chi-Square and F Distributions

The Chi-Square and F Distributions Department of Psychology and Human Development Vanderbilt University Introductory Distribution Theory 1 Introduction 2 Some Basic Properties Basic Chi-Square Distribution Calculations in R Convergence

More information

EC2001 Econometrics 1 Dr. Jose Olmo Room D309

EC2001 Econometrics 1 Dr. Jose Olmo Room D309 EC2001 Econometrics 1 Dr. Jose Olmo Room D309 J.Olmo@City.ac.uk 1 Revision of Statistical Inference 1.1 Sample, observations, population A sample is a number of observations drawn from a population. Population:

More information

Inference for Single Proportions and Means T.Scofield

Inference for Single Proportions and Means T.Scofield Inference for Single Proportions and Means TScofield Confidence Intervals for Single Proportions and Means A CI gives upper and lower bounds between which we hope to capture the (fixed) population parameter

More information

Normal (Gaussian) distribution The normal distribution is often relevant because of the Central Limit Theorem (CLT):

Normal (Gaussian) distribution The normal distribution is often relevant because of the Central Limit Theorem (CLT): Lecture Three Normal theory null distributions Normal (Gaussian) distribution The normal distribution is often relevant because of the Central Limit Theorem (CLT): A random variable which is a sum of many

More information

Homework 7: Solutions. P3.1 from Lehmann, Romano, Testing Statistical Hypotheses.

Homework 7: Solutions. P3.1 from Lehmann, Romano, Testing Statistical Hypotheses. Stat 300A Theory of Statistics Homework 7: Solutions Nikos Ignatiadis Due on November 28, 208 Solutions should be complete and concisely written. Please, use a separate sheet or set of sheets for each

More information

AMS 132: Discussion Section 2

AMS 132: Discussion Section 2 Prof. David Draper Department of Applied Mathematics and Statistics University of California, Santa Cruz AMS 132: Discussion Section 2 All computer operations in this course will be described for the Windows

More information

ME3620. Theory of Engineering Experimentation. Spring Chapter IV. Decision Making for a Single Sample. Chapter IV

ME3620. Theory of Engineering Experimentation. Spring Chapter IV. Decision Making for a Single Sample. Chapter IV Theory of Engineering Experimentation Chapter IV. Decision Making for a Single Sample Chapter IV 1 4 1 Statistical Inference The field of statistical inference consists of those methods used to make decisions

More information

Will Landau. Feb 28, 2013

Will Landau. Feb 28, 2013 Iowa State University The F Feb 28, 2013 Iowa State University Feb 28, 2013 1 / 46 Outline The F The F Iowa State University Feb 28, 2013 2 / 46 The normal (Gaussian) distribution A random variable X is

More information

Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2

Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2 Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2 Fall, 2013 Page 1 Random Variable and Probability Distribution Discrete random variable Y : Finite possible values {y

More information

Confidence Intervals for Normal Data Spring 2018

Confidence Intervals for Normal Data Spring 2018 Confidence Intervals for Normal Data 18.05 Spring 2018 Agenda Exam on Monday April 30. Practice questions posted. Friday s class is for review (no studio) Today Review of critical values and quantiles.

More information

Two-Sample Inferential Statistics

Two-Sample Inferential Statistics The t Test for Two Independent Samples 1 Two-Sample Inferential Statistics In an experiment there are two or more conditions One condition is often called the control condition in which the treatment is

More information

Exercise I.1 I.2 I.3 I.4 II.1 II.2 III.1 III.2 III.3 IV.1 Question (1) (2) (3) (4) (5) (6) (7) (8) (9) (10) Answer

Exercise I.1 I.2 I.3 I.4 II.1 II.2 III.1 III.2 III.3 IV.1 Question (1) (2) (3) (4) (5) (6) (7) (8) (9) (10) Answer Solutions to Exam in 02402 December 2012 Exercise I.1 I.2 I.3 I.4 II.1 II.2 III.1 III.2 III.3 IV.1 Question (1) (2) (3) (4) (5) (6) (7) (8) (9) (10) Answer 3 1 5 2 5 2 3 5 1 3 Exercise IV.2 IV.3 IV.4 V.1

More information

MAT2377. Ali Karimnezhad. Version December 13, Ali Karimnezhad

MAT2377. Ali Karimnezhad. Version December 13, Ali Karimnezhad MAT2377 Ali Karimnezhad Version December 13, 2016 Ali Karimnezhad Comments These slides cover material from Chapter 4. In class, I may use a blackboard. I recommend reading these slides before you come

More information

Week 14 Comparing k(> 2) Populations

Week 14 Comparing k(> 2) Populations Week 14 Comparing k(> 2) Populations Week 14 Objectives Methods associated with testing for the equality of k(> 2) means or proportions are presented. Post-testing concepts and analysis are introduced.

More information

7.3 The Chi-square, F and t-distributions

7.3 The Chi-square, F and t-distributions 7.3 The Chi-square, F and t-distributions Ulrich Hoensch Monday, March 25, 2013 The Chi-square Distribution Recall that a random variable X has a gamma probability distribution (X Gamma(r, λ)) with parameters

More information

MAS223 Statistical Inference and Modelling Exercises

MAS223 Statistical Inference and Modelling Exercises MAS223 Statistical Inference and Modelling Exercises The exercises are grouped into sections, corresponding to chapters of the lecture notes Within each section exercises are divided into warm-up questions,

More information

One-Sample Numerical Data

One-Sample Numerical Data One-Sample Numerical Data quantiles, boxplot, histogram, bootstrap confidence intervals, goodness-of-fit tests University of California, San Diego Instructor: Ery Arias-Castro http://math.ucsd.edu/~eariasca/teaching.html

More information

SOLUTION FOR HOMEWORK 4, STAT 4352

SOLUTION FOR HOMEWORK 4, STAT 4352 SOLUTION FOR HOMEWORK 4, STAT 4352 Welcome to your fourth homework. Here we begin the study of confidence intervals, Errors, etc. Recall that X n := (X 1,...,X n ) denotes the vector of n observations.

More information

INTERVAL ESTIMATION AND HYPOTHESES TESTING

INTERVAL ESTIMATION AND HYPOTHESES TESTING INTERVAL ESTIMATION AND HYPOTHESES TESTING 1. IDEA An interval rather than a point estimate is often of interest. Confidence intervals are thus important in empirical work. To construct interval estimates,

More information

Statistical inference (estimation, hypothesis tests, confidence intervals) Oct 2018

Statistical inference (estimation, hypothesis tests, confidence intervals) Oct 2018 Statistical inference (estimation, hypothesis tests, confidence intervals) Oct 2018 Sampling A trait is measured on each member of a population. f(y) = propn of individuals in the popn with measurement

More information

Lecture 12: Small Sample Intervals Based on a Normal Population Distribution

Lecture 12: Small Sample Intervals Based on a Normal Population Distribution Lecture 12: Small Sample Intervals Based on a Normal Population MSU-STT-351-Sum-17B (P. Vellaisamy: MSU-STT-351-Sum-17B) Probability & Statistics for Engineers 1 / 24 In this lecture, we will discuss (i)

More information

Chapter 8 of Devore , H 1 :

Chapter 8 of Devore , H 1 : Chapter 8 of Devore TESTING A STATISTICAL HYPOTHESIS Maghsoodloo A statistical hypothesis is an assumption about the frequency function(s) (i.e., PDF or pdf) of one or more random variables. Stated in

More information

4 Resampling Methods: The Bootstrap

4 Resampling Methods: The Bootstrap 4 Resampling Methods: The Bootstrap Situation: Let x 1, x 2,..., x n be a SRS of size n taken from a distribution that is unknown. Let θ be a parameter of interest associated with this distribution and

More information

ECON 4130 Supplementary Exercises 1-4

ECON 4130 Supplementary Exercises 1-4 HG Set. 0 ECON 430 Sulementary Exercises - 4 Exercise Quantiles (ercentiles). Let X be a continuous random variable (rv.) with df f( x ) and cdf F( x ). For 0< < we define -th quantile (or 00-th ercentile),

More information

7 Estimation. 7.1 Population and Sample (P.91-92)

7 Estimation. 7.1 Population and Sample (P.91-92) 7 Estimation MATH1015 Biostatistics Week 7 7.1 Population and Sample (P.91-92) Suppose that we wish to study a particular health problem in Australia, for example, the average serum cholesterol level for

More information

Lecture 11. Multivariate Normal theory

Lecture 11. Multivariate Normal theory 10. Lecture 11. Multivariate Normal theory Lecture 11. Multivariate Normal theory 1 (1 1) 11. Multivariate Normal theory 11.1. Properties of means and covariances of vectors Properties of means and covariances

More information

Generalized linear models

Generalized linear models Generalized linear models Douglas Bates November 01, 2010 Contents 1 Definition 1 2 Links 2 3 Estimating parameters 5 4 Example 6 5 Model building 8 6 Conclusions 8 7 Summary 9 1 Generalized Linear Models

More information

Bias Variance Trade-off

Bias Variance Trade-off Bias Variance Trade-off The mean squared error of an estimator MSE(ˆθ) = E([ˆθ θ] 2 ) Can be re-expressed MSE(ˆθ) = Var(ˆθ) + (B(ˆθ) 2 ) MSE = VAR + BIAS 2 Proof MSE(ˆθ) = E((ˆθ θ) 2 ) = E(([ˆθ E(ˆθ)]

More information

WUCT121. Discrete Mathematics. Numbers

WUCT121. Discrete Mathematics. Numbers WUCT121 Discrete Mathematics Numbers 1. Natural Numbers 2. Integers and Real Numbers 3. The Principle of Mathematical Induction 4. Elementary Number Theory 5. Congruence Arithmetic WUCT121 Numbers 1 Section

More information

Math493 - Fall HW 4 Solutions

Math493 - Fall HW 4 Solutions Math493 - Fall 2017 - HW 4 Solutions Renato Feres - Wash. U. Preliminaries We have up to this point ignored a central aspect of the Monte Carlo method: How to estimate errors? Clearly, the larger the sample

More information

Lecture 10: Generalized likelihood ratio test

Lecture 10: Generalized likelihood ratio test Stat 200: Introduction to Statistical Inference Autumn 2018/19 Lecture 10: Generalized likelihood ratio test Lecturer: Art B. Owen October 25 Disclaimer: These notes have not been subjected to the usual

More information

Statistics 3858 : Maximum Likelihood Estimators

Statistics 3858 : Maximum Likelihood Estimators Statistics 3858 : Maximum Likelihood Estimators 1 Method of Maximum Likelihood In this method we construct the so called likelihood function, that is L(θ) = L(θ; X 1, X 2,..., X n ) = f n (X 1, X 2,...,

More information

Confidence intervals for parameters of normal distribution.

Confidence intervals for parameters of normal distribution. Lecture 5 Confidence intervals for parameters of normal distribution. Let us consider a Matlab example based on the dataset of body temperature measurements of 30 individuals from the article []. The dataset

More information

18.05 Practice Final Exam

18.05 Practice Final Exam No calculators. 18.05 Practice Final Exam Number of problems 16 concept questions, 16 problems. Simplifying expressions Unless asked to explicitly, you don t need to simplify complicated expressions. For

More information

CBA4 is live in practice mode this week exam mode from Saturday!

CBA4 is live in practice mode this week exam mode from Saturday! Announcements CBA4 is live in practice mode this week exam mode from Saturday! Material covered: Confidence intervals (both cases) 1 sample hypothesis tests (both cases) Hypothesis tests for 2 means as

More information

Introduction to Statistics

Introduction to Statistics MTH4106 Introduction to Statistics Notes 15 Spring 2013 Testing hypotheses about the mean Earlier, we saw how to test hypotheses about a proportion, using properties of the Binomial distribution It is

More information

COS 341: Discrete Mathematics

COS 341: Discrete Mathematics COS 341: Discrete Mathematics Final Exam Fall 2006 Print your name General directions: This exam is due on Monday, January 22 at 4:30pm. Late exams will not be accepted. Exams must be submitted in hard

More information

Z-tables. January 12, This tutorial covers how to find areas under normal distributions using a z-table.

Z-tables. January 12, This tutorial covers how to find areas under normal distributions using a z-table. Z-tables January 12, 2019 Contents The standard normal distribution Areas above Areas below the mean Areas between two values of Finding -scores from areas Z tables in R: Questions This tutorial covers

More information

COS 341: Discrete Mathematics

COS 341: Discrete Mathematics COS 341: Discrete Mathematics Midterm Exam Fall 2006 Print your name General directions: This exam is due on Monday, November 13 at 4:30pm. Late exams will not be accepted. Exams must be submitted in hard

More information

41.2. Tests Concerning a Single Sample. Introduction. Prerequisites. Learning Outcomes

41.2. Tests Concerning a Single Sample. Introduction. Prerequisites. Learning Outcomes Tests Concerning a Single Sample 41.2 Introduction This Section introduces you to the basic ideas of hypothesis testing in a non-mathematical way by using a problem solving approach to highlight the concepts

More information

Stat 135, Fall 2006 A. Adhikari HOMEWORK 6 SOLUTIONS

Stat 135, Fall 2006 A. Adhikari HOMEWORK 6 SOLUTIONS Stat 135, Fall 2006 A. Adhikari HOMEWORK 6 SOLUTIONS 1a. Under the null hypothesis X has the binomial (100,.5) distribution with E(X) = 50 and SE(X) = 5. So P ( X 50 > 10) is (approximately) two tails

More information

CSE548, AMS542: Analysis of Algorithms, Spring 2014 Date: May 12. Final In-Class Exam. ( 2:35 PM 3:50 PM : 75 Minutes )

CSE548, AMS542: Analysis of Algorithms, Spring 2014 Date: May 12. Final In-Class Exam. ( 2:35 PM 3:50 PM : 75 Minutes ) CSE548, AMS54: Analysis of Algorithms, Spring 014 Date: May 1 Final In-Class Exam ( :35 PM 3:50 PM : 75 Minutes ) This exam will account for either 15% or 30% of your overall grade depending on your relative

More information

The t-statistic. Student s t Test

The t-statistic. Student s t Test The t-statistic 1 Student s t Test When the population standard deviation is not known, you cannot use a z score hypothesis test Use Student s t test instead Student s t, or t test is, conceptually, very

More information

Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing

Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing 1 In most statistics problems, we assume that the data have been generated from some unknown probability distribution. We desire

More information

Math 31 Lesson Plan. Day 5: Intro to Groups. Elizabeth Gillaspy. September 28, 2011

Math 31 Lesson Plan. Day 5: Intro to Groups. Elizabeth Gillaspy. September 28, 2011 Math 31 Lesson Plan Day 5: Intro to Groups Elizabeth Gillaspy September 28, 2011 Supplies needed: Sign in sheet Goals for students: Students will: Improve the clarity of their proof-writing. Gain confidence

More information

The Normal Distribution

The Normal Distribution The Mary Lindstrom (Adapted from notes provided by Professor Bret Larget) February 10, 2004 Statistics 371 Last modified: February 11, 2004 The The (AKA Gaussian Distribution) is our first distribution

More information

10-701/15-781, Machine Learning: Homework 4

10-701/15-781, Machine Learning: Homework 4 10-701/15-781, Machine Learning: Homewor 4 Aarti Singh Carnegie Mellon University ˆ The assignment is due at 10:30 am beginning of class on Mon, Nov 15, 2010. ˆ Separate you answers into five parts, one

More information

Principal Moderator s Report

Principal Moderator s Report Principal Moderator s Report Centres are reminded that the deadline for coursework marks (and scripts if there are 10 or fewer from the centre) is December 10 for this specification. Moderators were pleased

More information

The t-test Pivots Summary. Pivots and t-tests. Patrick Breheny. October 15. Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/18

The t-test Pivots Summary. Pivots and t-tests. Patrick Breheny. October 15. Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/18 and t-tests Patrick Breheny October 15 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/18 Introduction The t-test As we discussed previously, W.S. Gossett derived the t-distribution as a way of

More information

Comparing two independent samples

Comparing two independent samples In many applications it is necessary to compare two competing methods (for example, to compare treatment effects of a standard drug and an experimental drug). To compare two methods from statistical point

More information

UNIVERSITY OF TORONTO Faculty of Arts and Science

UNIVERSITY OF TORONTO Faculty of Arts and Science UNIVERSITY OF TORONTO Faculty of Arts and Science December 2013 Final Examination STA442H1F/2101HF Methods of Applied Statistics Jerry Brunner Duration - 3 hours Aids: Calculator Model(s): Any calculator

More information

STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015

STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015 STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots March 8, 2015 The duality between CI and hypothesis testing The duality between CI and hypothesis

More information

Confidence Intervals with σ unknown

Confidence Intervals with σ unknown STAT 141 Confidence Intervals and Hypothesis Testing 10/26/04 Today (Chapter 7): CI with σ unknown, t-distribution CI for proportions Two sample CI with σ known or unknown Hypothesis Testing, z-test Confidence

More information

Null Hypothesis Significance Testing p-values, significance level, power, t-tests Spring 2017

Null Hypothesis Significance Testing p-values, significance level, power, t-tests Spring 2017 Null Hypothesis Significance Testing p-values, significance level, power, t-tests 18.05 Spring 2017 Understand this figure f(x H 0 ) x reject H 0 don t reject H 0 reject H 0 x = test statistic f (x H 0

More information

Statistical Inference

Statistical Inference Statistical Inference Classical and Bayesian Methods Class 5 AMS-UCSC Tue 24, 2012 Winter 2012. Session 1 (Class 5) AMS-132/206 Tue 24, 2012 1 / 11 Topics Topics We will talk about... 1 Confidence Intervals

More information

CS446: Machine Learning Spring Problem Set 4

CS446: Machine Learning Spring Problem Set 4 CS446: Machine Learning Spring 2017 Problem Set 4 Handed Out: February 27 th, 2017 Due: March 11 th, 2017 Feel free to talk to other members of the class in doing the homework. I am more concerned that

More information

CS173 Lecture B, November 3, 2015

CS173 Lecture B, November 3, 2015 CS173 Lecture B, November 3, 2015 Tandy Warnow November 3, 2015 CS 173, Lecture B November 3, 2015 Tandy Warnow Announcements Examlet 7 is a take-home exam, and is due November 10, 11:05 AM, in class.

More information

Chapter 10: Inferences based on two samples

Chapter 10: Inferences based on two samples November 16 th, 2017 Overview Week 1 Week 2 Week 4 Week 7 Week 10 Week 12 Chapter 1: Descriptive statistics Chapter 6: Statistics and Sampling Distributions Chapter 7: Point Estimation Chapter 8: Confidence

More information

CSC411 Fall 2018 Homework 5

CSC411 Fall 2018 Homework 5 Homework 5 Deadline: Wednesday, Nov. 4, at :59pm. Submission: You need to submit two files:. Your solutions to Questions and 2 as a PDF file, hw5_writeup.pdf, through MarkUs. (If you submit answers to

More information

Statistical Inference

Statistical Inference Statistical Inference Classical and Bayesian Methods Revision Class for Midterm Exam AMS-UCSC Th Feb 9, 2012 Winter 2012. Session 1 (Revision Class) AMS-132/206 Th Feb 9, 2012 1 / 23 Topics Topics We will

More information

Designing Information Devices and Systems I Spring 2018 Homework 11

Designing Information Devices and Systems I Spring 2018 Homework 11 EECS 6A Designing Information Devices and Systems I Spring 28 Homework This homework is due April 8, 28, at 23:59. Self-grades are due April 2, 28, at 23:59. Submission Format Your homework submission

More information

LECTURE 5 HYPOTHESIS TESTING

LECTURE 5 HYPOTHESIS TESTING October 25, 2016 LECTURE 5 HYPOTHESIS TESTING Basic concepts In this lecture we continue to discuss the normal classical linear regression defined by Assumptions A1-A5. Let θ Θ R d be a parameter of interest.

More information

Power. Week 8: Lecture 1 STAT: / 48

Power. Week 8: Lecture 1 STAT: / 48 Power STAT:5201 Week 8: Lecture 1 1 / 48 Power We have already described Type I and II errors. Decision Reality/True state Accept H o Reject H o H o is true good Type I error H o is false Type II error

More information

Using Tables and Graphing Calculators in Math 11

Using Tables and Graphing Calculators in Math 11 Using Tables and Graphing Calculators in Math 11 Graphing calculators are not required for Math 11, but they are likely to be helpful, primarily because they allow you to avoid the use of tables in some

More information

Introduction to Statistical Data Analysis Lecture 5: Confidence Intervals

Introduction to Statistical Data Analysis Lecture 5: Confidence Intervals Introduction to Statistical Data Analysis Lecture 5: Confidence Intervals James V. Lambers Department of Mathematics The University of Southern Mississippi James V. Lambers Statistical Data Analysis 1

More information

Introduction to R and Programming

Introduction to R and Programming Introduction to R and Programming Nathaniel E. Helwig Assistant Professor of Psychology and Statistics University of Minnesota (Twin Cities) Updated 04-Jan-2017 Nathaniel E. Helwig (U of Minnesota) Introduction

More information

Null Hypothesis Significance Testing p-values, significance level, power, t-tests

Null Hypothesis Significance Testing p-values, significance level, power, t-tests Null Hypothesis Significance Testing p-values, significance level, power, t-tests 18.05 Spring 2014 January 1, 2017 1 /22 Understand this figure f(x H 0 ) x reject H 0 don t reject H 0 reject H 0 x = test

More information

Exam 2 Practice Questions, 18.05, Spring 2014

Exam 2 Practice Questions, 18.05, Spring 2014 Exam 2 Practice Questions, 18.05, Spring 2014 Note: This is a set of practice problems for exam 2. The actual exam will be much shorter. Within each section we ve arranged the problems roughly in order

More information

Basic Statistics. 1. Gross error analyst makes a gross mistake (misread balance or entered wrong value into calculation).

Basic Statistics. 1. Gross error analyst makes a gross mistake (misread balance or entered wrong value into calculation). Basic Statistics There are three types of error: 1. Gross error analyst makes a gross mistake (misread balance or entered wrong value into calculation). 2. Systematic error - always too high or too low

More information

Introduction to Statistical Data Analysis Lecture 8: Correlation and Simple Regression

Introduction to Statistical Data Analysis Lecture 8: Correlation and Simple Regression Introduction to Statistical Data Analysis Lecture 8: and James V. Lambers Department of Mathematics The University of Southern Mississippi James V. Lambers Statistical Data Analysis 1 / 40 Introduction

More information

Confidence Intervals, Testing and ANOVA Summary

Confidence Intervals, Testing and ANOVA Summary Confidence Intervals, Testing and ANOVA Summary 1 One Sample Tests 1.1 One Sample z test: Mean (σ known) Let X 1,, X n a r.s. from N(µ, σ) or n > 30. Let The test statistic is H 0 : µ = µ 0. z = x µ 0

More information

Math 51 First Exam October 19, 2017

Math 51 First Exam October 19, 2017 Math 5 First Exam October 9, 27 Name: SUNet ID: ID #: Complete the following problems. In order to receive full credit, please show all of your work and justify your answers. You do not need to simplify

More information

Statistics for scientists and engineers

Statistics for scientists and engineers Statistics for scientists and engineers February 0, 006 Contents Introduction. Motivation - why study statistics?................................... Examples..................................................3

More information

probability George Nicholson and Chris Holmes 31st October 2008

probability George Nicholson and Chris Holmes 31st October 2008 probability George Nicholson and Chris Holmes 31st October 2008 This practical focuses on understanding probabilistic and statistical concepts using simulation and plots in R R. It begins with an introduction

More information

ESTIMATION BY CONFIDENCE INTERVALS

ESTIMATION BY CONFIDENCE INTERVALS ESTIMATION BY CONFIDENCE INTERVALS Introduction We are now in the knowledge that a population parameter can be estimated from sample data by calculating the corresponding point estimate. This chapter is

More information

SCIENCE PROGRAM CALCULUS III

SCIENCE PROGRAM CALCULUS III SCIENCE PROGRAM CALCULUS III Discipline: Mathematics Semester: Winter 2005 Course Code: 201-DDB-05 Instructor: Objectives: 00UV, 00UU Office: Ponderation: 3-2-3 Tel.: 457-6610 Credits: 2 2/3 Local: Course

More information

Gov Univariate Inference II: Interval Estimation and Testing

Gov Univariate Inference II: Interval Estimation and Testing Gov 2000-5. Univariate Inference II: Interval Estimation and Testing Matthew Blackwell October 13, 2015 1 / 68 Large Sample Confidence Intervals Confidence Intervals Example Hypothesis Tests Hypothesis

More information

Solutions to Practice Test 2 Math 4753 Summer 2005

Solutions to Practice Test 2 Math 4753 Summer 2005 Solutions to Practice Test Math 4753 Summer 005 This test is worth 00 points. Questions 5 are worth 4 points each. Circle the letter of the correct answer. Each question in Question 6 9 is worth the same

More information

18.05 Final Exam. Good luck! Name. No calculators. Number of problems 16 concept questions, 16 problems, 21 pages

18.05 Final Exam. Good luck! Name. No calculators. Number of problems 16 concept questions, 16 problems, 21 pages Name No calculators. 18.05 Final Exam Number of problems 16 concept questions, 16 problems, 21 pages Extra paper If you need more space we will provide some blank paper. Indicate clearly that your solution

More information

MATH 137 : Calculus 1 for Honours Mathematics. Online Assignment #2. Introduction to Sequences

MATH 137 : Calculus 1 for Honours Mathematics. Online Assignment #2. Introduction to Sequences 1 MATH 137 : Calculus 1 for Honours Mathematics Online Assignment #2 Introduction to Sequences Due by 9:00 pm on WEDNESDAY, September 19, 2018 Instructions: Weight: 2% This assignment covers the topics

More information

Econometrics A. Simple linear model (2) Keio University, Faculty of Economics. Simon Clinet (Keio University) Econometrics A October 16, / 11

Econometrics A. Simple linear model (2) Keio University, Faculty of Economics. Simon Clinet (Keio University) Econometrics A October 16, / 11 Econometrics A Keio University, Faculty of Economics Simple linear model (2) Simon Clinet (Keio University) Econometrics A October 16, 2018 1 / 11 Estimation of the noise variance σ 2 In practice σ 2 too

More information

6.2 Area Under the Standard Normal Curve

6.2 Area Under the Standard Normal Curve 6.2 Area Under the Standard Normal Curve Tom Lewis Fall Term 2009 Tom Lewis () 6.2 Area Under the Standard Normal Curve Fall Term 2009 1 / 6 Outline 1 The cumulative distribution function 2 The z α notation

More information

Confidence intervals CE 311S

Confidence intervals CE 311S CE 311S PREVIEW OF STATISTICS The first part of the class was about probability. P(H) = 0.5 P(T) = 0.5 HTTHHTTTTHHTHTHH If we know how a random process works, what will we see in the field? Preview of

More information