Non-parametric Inference and Resampling
|
|
- Maurice Gilmore
- 5 years ago
- Views:
Transcription
1 Non-parametric Inference and Resampling Exercises by David Wozabal (Last update 3. Juni 2013) 1 Basic Facts about Rank and Order Statistics students were asked about the amount of time they spend surfing Internet per month : Student i Amount of time x i (in Hours) (a) What are the corresponding values for the ordered statistics? (b) Determine the rank statistics (r 1,..., r 10 ). 1.2 Suppose that we have a sample of size n for the setting of Exercise 1.1. In terms of the order statistics, what statistic will you consider to study the following : (a) the least amount of time spent (b) the highest amount of time spent (c) the median of the amount of time spent. Hint: distinguish between cases, when n is odd and when n is even. (d) the range of the amount of time spent (e) Determine the values of the statistics discussed in (a)-(d) based on the data in Exercise The empirical distribution function ˆF n corresponding to the sample x 1,..., x n is given by: ˆF n (t) = 1. Number of samples not exceeding t. n (a) Calculate (and draw the graph of) the empirical distribution function for Exercise 1.1. (b) Can you express the empirical distribution function in terms of order statistics. 1.4 Prove that for a random sample X = (X 1,..., X n ), where R i are the rank statistics of X. R i = n(n + 1) Prove that for a density f that is invariant with respect to permutation, i.e., f(x 1,..., x n ) = f(x s(1),..., x s(1) ), s S n and for X f the following holds (a) P(R = r) = (n!) 1 for R the vector of ranks. (b) f(x (1),..., x (n) ) = n!f(x (1),..., x (n) ) (c) R is independent of the vector of order statistics X ().
2 Hint: Use that f(x (1),..., x (n) ) = s S n f(x (s(1)),..., x (s(n)) ) and that P(R = r X () = x () ) = f(x (s(1)),..., x (s(n)) ) f(x (1),..., x (n) ) 1.6 Let X = (X 1,..., X n ) be a random vector with a continuous density, such that R is uniformly distributed in S n. Show that (a) P(R i = k) = 1 n, 1 i, k n (b) P(R i = k, R j = h) = 1 n(n 1), i j, k h (c) P(R i = R j ) = 0, 1 i j n. 1.7 Let X 1,..., X n be i.i.d. random variables. Show that Hint: Use the fact that f X(1),X (n) (x, y) = n(n 1) (F (y) F (x)) n 2 f(x)f(y), < x y <. P(X (1) x, X (n) y) = P(X (n) y) P(X (1) > x, X (n) y) to obtain the joint distribution of (X (1), X (n) ). 1.8 Show that the density of the midrange V n = (X (1) + X (n) )/2 for an i.i.d. sample X 1,... X n from an absolutely continuous distribution F with density f is given by f Vn (v) = 2n(n 1) v {F (2v x 1 ) F (x 1 )} n 2 f(x 1 )f(2v x 1 )dx 1, < v <. Hint: Use the joint density of (X (1), X (n) ) from 1.7 to calculate the joint density of (X (1), V n ) by finding a transformation T : R 2 R 2, such that T (X (1), X (n) ) = (X (1), V n ) and noting that using the change of variable formula P(T (X) A) = P(X T 1 (A)) = f X (x)dx = f X (T 1 (x)) J T 1(x) dx x T 1 (A) where J T 1 is the determinant of the Jacobian of T 1. Now integrate out the dependence on x (1). 1.9 Calculate the explicit form for the density of the midrange V n, when X i F = U(0, 1) for all 1 i n For an i.i.d. sample X 1,..., X n, calculate the density of the order statistic X (i) for 1 i n Using example 1.10 calculate the density of U (i) for U i i.i.d. with U i U(0, 1) for all 1 i n For the U 1,... U n defined in Exercise 1.11 prove that E(U (i) ) = Hint: Use the following result (Beta function) where Γ is the Gamma function. B(x, y) = 1 0 i n + 1. x A t x 1 (1 t) y 1 dt = Γ(x)Γ(y) Γ(x + y) 2
3 1.13 For the U 1,... U n defined in Exercise 1.11 prove that Var(U (i) ) = i(n i + 1) (n + 1) 2 (n + 2). Hint: Again use the Beta function and it s relation to the Gamma function Show that in the i.i.d. case the joint density of two arbitrary order statistics (X (j), X (i) ) with 1 i < j n is given by (for < x i < x j < ) f X(i),X (j) (x i, x j ) = n! (i 1)!(j i 1)!(n j)! F (x i) i 1 (F (x j ) F (x i )) j i 1 (1 F (x j )) n j f(x i )f(x j ). Hint: Use the formula for the joint density f of all the order statistics (why does the this hold?) f(x (1),..., x (n) ) = r S n f(x (r1 ),..., x (rn)) = n!f(x (1),..., x (n) ) and integrate out the orders k : k i, k j Let U 1,..., U n be a i.i.d. sample from the standard uniform distribution U(0, 1). Use 1.14 and 1.10 to show that the random variables U (i) /U (j) and U (j) for 1 i < j n, are statistically independent and calculate the respective distribution functions Prove that cov( ˆF n (x), ˆF n (y)) = c(f X (x), F X (y))/n, where c(s, t) = min(s, t) st and ˆF n is the empirical distribution function of a random sample of size n from the population F X. Hint: Write ˆF n (x) = n 1 n 1 {t:x i t}(x) Let ˆF n be the empirical cdf of a random sample of size n from a continuous cdf F X. Show that for < x < y <, cov( ˆF n (x), ˆF n (y)) = F X(x)(1 F X (y)) n Hint: Use the result from the last exercise Show that for an i.i.d. sample (X 1,..., X n ) with ranks (R 1,..., R n ) (a) Cov(R i, R j ) = n (b) Corr(R i, R j ) = n+1 n Non-parametric tests 2.1 Use the sign test and the following i.i.d. sample , , , , , , , , , to test the null hypothesis that the median φ of the underlying distribution equals 0, against H 1 : φ 0 and H 1 : φ > 0 (at the level α = 0.05). 2.2 Repeat the test in 2.1 with the Wilcoxon test. 2.3 Let X 1,..., X 48 be a random sample from a distribution with distribution function F. To test H 0 : F (41) = 1 4 against H 1 : F (41) < 1 4, use the statistic Y, which is the number of samples less than or equal to 41. If the observed value of Y is smaller than 7 reject H 0 and accept H 1. If p = F (41), find the power function K(p), 0 < p < 1 4 of the test. Approximate α = K( 1 4 ). 3
4 2.4 A statistician decides that working with numbers is a tedious job and designs a universal test, which rejects the null hypothesis in favor of the alternative hypothesis if a coin toss (whose outcome is independent of the testing problem in question) yields heads. What is the power of this test? 2.5 Calculate the expectation and the variance of the test statistic of the sign test under the null hypothesis of the test. 2.6 Calculate the expectation and the variance of the test statistic of the Wilcoxon test under the null hypothesis of the test. Hint: The Wilcoxon test statistic for testing H 0 : X = M is given by W = Z i rank( D i ) where D i = X i M and Z i = sign(d i ). Show that W has the same distribution as n V i, where V i are independent random variables with P(V i = i) = P(V i = i) = 2 1. Use this representation to calculate mean and variance. 2.7 (Gibbons & Chakraborti, 2003); A random sample of 10 observations is drawn from a normal population with mean µ and variance 1. Instead of a normal-theory test, the ordinary sign test is used for H 0 : µ = 0, H 1 : µ > 0, and the null hypothesis is rejected if th test statistic K 8. (a) Plot the power curve using the exact distribution of K. (b) Plot the power curve using the normal approximation to the distribution of K. 2.8 (Gibbons & Chakraborti, 2003); Hoskin et al. (1986) investigated the change in fatal motor-vehicle accidents after the legal minimum drinking age was raised in 10 states. Their data were the ratios of the number of single-vehicle nighttime fatalities to the number of licensed drivers in the affected age group before and after the laws were changed to raise the drinking age, shown in Table 1. The researchers hypothesized that raising the minimum drinking age resulted in a reduced median fatality ratio. Investigate this hypothesis. State Affected Ages Ratio before Ratio after Florida Georgia Illinois Iowa Maine Michigan Montana Nebraska New Hampshire Tennessee Show that for X symmetric around 0, X and Z = 1 {1} (sign(x)) are independent Check whether the distribution of the Wilcoxon test statistic W = Z i rank( D i ) is asymptotically normally distributed. Use the following 4
5 Theorem 2.1. Let U 1,..., U n be a independent sample, where E(U i ) = µ i and Var(U i ) = σi 2. If the Liapounov condition n lim E( U i µ i 3 ) n ( n ) 3 = 0 σ2 2 i holds then n U i n µ i n σ2 n N(0, 1) Write a computer program that calculates the exact distribution of the test statistic of the Wilcoxon test Rank-Sum-Test: Let (X 1,..., X m ) and (Y 1,..., Y n ) be two samples such that Z = (Z 1,..., Z k ) = (X 1,..., X m, Y 1,..., Y n ) with k = n + m is an i.i.d. sample from a continuous distribution and R = (R 1,..., R k ) the corresponding ranks. Define T = m R i. Show that (a) E(T ) = m k+1 2 (b) Var(T ) = m(k m)(k+1) 12 Hint: Use example Density Estimation In this section K will always be a symmetric density and K h (x) = h 1 K ( x h). A kernel density estimator ˆf h (x; X 1,..., X n ) based on the random sample X 1,..., X n following density f is given by ˆf h (x; X 1,..., X n ) = 1 K h (x X i ). n Often the dependence on the random variables X 1,..., X n is suppressed in the notation and we write simply ˆf h (x; X 1,..., X n ) = ˆf(x, h). 3.1 Show that ˆf h (x; X 1,..., X n ) is a density. 3.2 The Mean Squared Error of an estimator ˆθ for θ is given by [ MSE(ˆθ) = E (ˆθ θ) 2]. Show that with Bias(ˆθ) = E(ˆθ) θ. MSE(ˆθ) = Var(ˆθ) + Bias(ˆθ) Let f g(x) = f(x y)g(y)dy be the convolution of f and g. Show that 5
6 (a) E( ˆf(x; h)) = K h(x y)f(y)dy = (K h f)(x) (b) Var( ˆf(x; h)) = n 1 ( (K 2 h f)(x) (K h f) 2 (x) ). 3.4 Show that the MSE of ˆf(x; h) is MSE (f(x; h)) = n 1 ( (K 2 h f)(x) (K h f) 2 (x) ) + ((K h f)(x) f(x)) Let f be a continuous, bounded function. Show that lim h 0 f(x)k h (x ˆx)dx = f(ˆx). 3.6 The Integrated Squared Error of ˆf(x; X 1,..., X n ) is given as ISE( ˆf(x, h)) = while the Mean Integrated Squared Error is given by [ ] MISE( ˆf(x, h)) = E ( ˆf(x, h) f(x)) 2 dx = 1 n ( ˆf(x, h) f(x)) 2 dx (K 2 h f)(x) (K h f) 2 (x)dx + [(K h f)(x) f(x)] 2 dx where (f g)(x) = f(y)g(x y)dy = f(x y)g(y)dy is the convolution. Using the above show that MISE( ˆf(x, h)) = 1 nh 2 An asymptotic analysis of the MISE yields with K 2 (x)dx + (1 1 n ) (K h f) 2 (x)dx (K h f)(x)f(x)dx + f(x) 2 dx. MISE( ˆf(x, h)) = AMISE( ˆf(x, h)) + o ( (nh) 1 + h 4) AMISE( ˆf(x, h)) = (nh) 1 K(x) 2 dx + 1 ( 4 h4 ) 2 x 2 K(x)dx f (x) 2 dx. The first term in the above (asymptotically) corresponds to the variance of ˆf(x, h) while the second term captures the (asymptotic) bias of ˆf(x, h), showing a trade-off similar to the general formula in 3.2 in dependence on the parameter h. 3.7 Demonstrate the variance bias trade-off discussed above by sampling from the logistic distribution with 1 F (x) = 1 + e x. Sample from the distribution, fit a kernel density estimator with K being the density of the standard normal distribution and vary the values of h. Plot the results against f and complement this visual analysis by calculating the two components of the AMISE for each run. 3.8 Use the formula for the AMISE above to calculate the (asymptotically) optimal bandwidth h (i.e. the value of h for which the AMISE is minimal) and use h to obtain the asymptotic rate of convergence of the kernel density estimate. 6
7 4 Non-Parametric Regression In regression theory the aim is to find a relationship between two random variables X and Y by considering the following optimization (regression) problem min E [ (Y m(x)) 2]. (1) m:r R If m is restricted to be a linear function, the above problem is the classical linear regression problem. In locally weighted polynomial regression, we determine the regression function at a point x by solving the problem min β (Y i β 0 β 1 (X i x) β p (X i x) p ) 2 K h (X i x) with β = (β 0, β 1,..., β p ). We then set m(x) = m(x, p, h) = β 0. To make the problem conform with the usual regression setting, we define the (n p) matrix X x = 1 (X 1 x) (X 1 x) p... 1 (X n x) (X n x) p and the (n n) matrix W x = diag (K h (X 1 x),..., K h (X n x)) where diag(x 1,..., x n ) is the matrix A = (a i,j ) with a i,i = x i and a i,j = 0 if i j. We can then write (Y i β 0 β 1 (X i x) β p (X i x) p ) 2 K h (X i x) = (Y X x β) W x (Y X x β) with Y = (Y 1,..., Y n ). Now standard least squares theory can be used to get the optimal coefficients ˆβ = ( ˆβ 0, ˆβ 1,..., ˆβ p ) ˆβ = (X x W x X x ) 1 X x W x Y (2) and therefore ˆm(x, p, h) = e 1 (X x W x X x ) 1 X x W x Y. Exercises 4.1 Show that m(x) = E(Y X = x) in (1). Hint: To calculate the expectation the joint density f X,Y of (X, Y ) is needed, which can used in the following form f X,Y (x, y) = f Y,X=x (y)f X (x) where f Y,X=x (y) is the density of Y conditional on X = x. We can then write min E [ (Y m(x)) 2] = min (y m(x)) 2 f Y,X=x (y)f X (x)dydx. m:r R m:r R 4.2 Show (2). 4.3 The Nadaraja-Watson estimator: Show that for p = 0, we have that ˆm(x, 0, h) = β 0 = K h (x i x)y i n K h(x i x). 7
8 4.4 Show that for p = 1, it holds that ˆm(x, 1, h) = β 0 = (ŝ 2 (x, h) ŝ 1 (x, h))(x i x)k h (x i x)y i ŝ 2 (x, h)ŝ 0 (x, h) ŝ 1 (x, h) 2 where Hint: Show that with ŝ r (x, h) = n 1 (x i x) r K h (x i x). ŝ 0 (x, h) ŝ p (x, h) ˆt 0 (x, h) ˆm(x, p, h) = ŝ p (x, h) ŝ 2p (x, h) ˆt p (x, h) ˆt r (x, h) = n 1 (x i x) r K h (x i x)y i. 8
Non-parametric Inference and Resampling
Non-parametric Inference and Resampling Exercises by David Wozabal (Last update. Juni 010) 1 Basic Facts about Rank and Order Statistics 1.1 10 students were asked about the amount of time they spend surfing
More informationMaster s Written Examination - Solution
Master s Written Examination - Solution Spring 204 Problem Stat 40 Suppose X and X 2 have the joint pdf f X,X 2 (x, x 2 ) = 2e (x +x 2 ), 0 < x < x 2
More informationEC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix)
1 EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix) Taisuke Otsu London School of Economics Summer 2018 A.1. Summation operator (Wooldridge, App. A.1) 2 3 Summation operator For
More informationMathematical statistics
October 4 th, 2018 Lecture 12: Information Where are we? Week 1 Week 2 Week 4 Week 7 Week 10 Week 14 Probability reviews Chapter 6: Statistics and Sampling Distributions Chapter 7: Point Estimation Chapter
More informationAdvanced Statistics II: Non Parametric Tests
Advanced Statistics II: Non Parametric Tests Aurélien Garivier ParisTech February 27, 2011 Outline Fitting a distribution Rank Tests for the comparison of two samples Two unrelated samples: Mann-Whitney
More informationTest Code: STA/STB (Short Answer Type) 2013 Junior Research Fellowship for Research Course in Statistics
Test Code: STA/STB (Short Answer Type) 2013 Junior Research Fellowship for Research Course in Statistics The candidates for the research course in Statistics will have to take two shortanswer type tests
More informationPart IB Statistics. Theorems with proof. Based on lectures by D. Spiegelhalter Notes taken by Dexter Chua. Lent 2015
Part IB Statistics Theorems with proof Based on lectures by D. Spiegelhalter Notes taken by Dexter Chua Lent 2015 These notes are not endorsed by the lecturers, and I have modified them (often significantly)
More informationChapters 9. Properties of Point Estimators
Chapters 9. Properties of Point Estimators Recap Target parameter, or population parameter θ. Population distribution f(x; θ). { probability function, discrete case f(x; θ) = density, continuous case The
More informationCentral Limit Theorem ( 5.3)
Central Limit Theorem ( 5.3) Let X 1, X 2,... be a sequence of independent random variables, each having n mean µ and variance σ 2. Then the distribution of the partial sum S n = X i i=1 becomes approximately
More information5 Introduction to the Theory of Order Statistics and Rank Statistics
5 Introduction to the Theory of Order Statistics and Rank Statistics This section will contain a summary of important definitions and theorems that will be useful for understanding the theory of order
More informationFall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.
1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n
More information6 The normal distribution, the central limit theorem and random samples
6 The normal distribution, the central limit theorem and random samples 6.1 The normal distribution We mentioned the normal (or Gaussian) distribution in Chapter 4. It has density f X (x) = 1 σ 1 2π e
More information36. Multisample U-statistics and jointly distributed U-statistics Lehmann 6.1
36. Multisample U-statistics jointly distributed U-statistics Lehmann 6.1 In this topic, we generalize the idea of U-statistics in two different directions. First, we consider single U-statistics for situations
More informationReview. December 4 th, Review
December 4 th, 2017 Att. Final exam: Course evaluation Friday, 12/14/2018, 10:30am 12:30pm Gore Hall 115 Overview Week 2 Week 4 Week 7 Week 10 Week 12 Chapter 6: Statistics and Sampling Distributions Chapter
More informationECE 650 Lecture 4. Intro to Estimation Theory Random Vectors. ECE 650 D. Van Alphen 1
EE 650 Lecture 4 Intro to Estimation Theory Random Vectors EE 650 D. Van Alphen 1 Lecture Overview: Random Variables & Estimation Theory Functions of RV s (5.9) Introduction to Estimation Theory MMSE Estimation
More informationLecture 7 Introduction to Statistical Decision Theory
Lecture 7 Introduction to Statistical Decision Theory I-Hsiang Wang Department of Electrical Engineering National Taiwan University ihwang@ntu.edu.tw December 20, 2016 1 / 55 I-Hsiang Wang IT Lecture 7
More informationNonparametric Econometrics
Applied Microeconometrics with Stata Nonparametric Econometrics Spring Term 2011 1 / 37 Contents Introduction The histogram estimator The kernel density estimator Nonparametric regression estimators Semi-
More informationUnbiased Estimation. Binomial problem shows general phenomenon. An estimator can be good for some values of θ and bad for others.
Unbiased Estimation Binomial problem shows general phenomenon. An estimator can be good for some values of θ and bad for others. To compare ˆθ and θ, two estimators of θ: Say ˆθ is better than θ if it
More informationECO Class 6 Nonparametric Econometrics
ECO 523 - Class 6 Nonparametric Econometrics Carolina Caetano Contents 1 Nonparametric instrumental variable regression 1 2 Nonparametric Estimation of Average Treatment Effects 3 2.1 Asymptotic results................................
More informationf (1 0.5)/n Z =
Math 466/566 - Homework 4. We want to test a hypothesis involving a population proportion. The unknown population proportion is p. The null hypothesis is p = / and the alternative hypothesis is p > /.
More informationNADARAYA WATSON ESTIMATE JAN 10, 2006: version 2. Y ik ( x i
NADARAYA WATSON ESTIMATE JAN 0, 2006: version 2 DATA: (x i, Y i, i =,..., n. ESTIMATE E(Y x = m(x by n i= ˆm (x = Y ik ( x i x n i= K ( x i x EXAMPLES OF K: K(u = I{ u c} (uniform or box kernel K(u = u
More informationProblem 1 (20) Log-normal. f(x) Cauchy
ORF 245. Rigollet Date: 11/21/2008 Problem 1 (20) f(x) f(x) 0.0 0.1 0.2 0.3 0.4 0.0 0.2 0.4 0.6 0.8 4 2 0 2 4 Normal (with mean -1) 4 2 0 2 4 Negative-exponential x x f(x) f(x) 0.0 0.1 0.2 0.3 0.4 0.5
More informationMaster s Written Examination
Master s Written Examination Option: Statistics and Probability Spring 016 Full points may be obtained for correct answers to eight questions. Each numbered question which may have several parts is worth
More informationBootstrap tests. Patrick Breheny. October 11. Bootstrap vs. permutation tests Testing for equality of location
Bootstrap tests Patrick Breheny October 11 Patrick Breheny STA 621: Nonparametric Statistics 1/14 Introduction Conditioning on the observed data to obtain permutation tests is certainly an important idea
More informationStatement: With my signature I confirm that the solutions are the product of my own work. Name: Signature:.
MATHEMATICAL STATISTICS Homework assignment Instructions Please turn in the homework with this cover page. You do not need to edit the solutions. Just make sure the handwriting is legible. You may discuss
More informationUQ, Semester 1, 2017, Companion to STAT2201/CIVL2530 Exam Formulae and Tables
UQ, Semester 1, 2017, Companion to STAT2201/CIVL2530 Exam Formulae and Tables To be provided to students with STAT2201 or CIVIL-2530 (Probability and Statistics) Exam Main exam date: Tuesday, 20 June 1
More informationWhat to do today (Nov 22, 2018)?
What to do today (Nov 22, 2018)? Part 1. Introduction and Review (Chp 1-5) Part 2. Basic Statistical Inference (Chp 6-9) Part 3. Important Topics in Statistics (Chp 10-13) Part 4. Further Topics (Selected
More informationMAS223 Statistical Inference and Modelling Exercises
MAS223 Statistical Inference and Modelling Exercises The exercises are grouped into sections, corresponding to chapters of the lecture notes Within each section exercises are divided into warm-up questions,
More informationPermutation tests. Patrick Breheny. September 25. Conditioning Nonparametric null hypotheses Permutation testing
Permutation tests Patrick Breheny September 25 Patrick Breheny STA 621: Nonparametric Statistics 1/16 The conditioning idea In many hypothesis testing problems, information can be divided into portions
More informationOn the Robust Modal Local Polynomial Regression
International Journal of Statistical Sciences ISSN 683 5603 Vol. 9(Special Issue), 2009, pp 27-23 c 2009 Dept. of Statistics, Univ. of Rajshahi, Bangladesh On the Robust Modal Local Polynomial Regression
More informationReview of Statistics
Review of Statistics Topics Descriptive Statistics Mean, Variance Probability Union event, joint event Random Variables Discrete and Continuous Distributions, Moments Two Random Variables Covariance and
More informationOne-Sample Numerical Data
One-Sample Numerical Data quantiles, boxplot, histogram, bootstrap confidence intervals, goodness-of-fit tests University of California, San Diego Instructor: Ery Arias-Castro http://math.ucsd.edu/~eariasca/teaching.html
More informationDensity estimation Nonparametric conditional mean estimation Semiparametric conditional mean estimation. Nonparametrics. Gabriel Montes-Rojas
0 0 5 Motivation: Regression discontinuity (Angrist&Pischke) Outcome.5 1 1.5 A. Linear E[Y 0i X i] 0.2.4.6.8 1 X Outcome.5 1 1.5 B. Nonlinear E[Y 0i X i] i 0.2.4.6.8 1 X utcome.5 1 1.5 C. Nonlinearity
More informationNONPARAMETRIC INFERENCE AND RESAMPLING
NONPARAMETRIC INFERENCE AND RESAMPLING SS 2011 OVERVIEW Introduction Rank and Order Statistics Tests for Randomness Tests for Goodness of Fit Tests for Location Kernel Density Estimation LITERATURE Jean
More informationNonparametric Methods
Nonparametric Methods Michael R. Roberts Department of Finance The Wharton School University of Pennsylvania July 28, 2009 Michael R. Roberts Nonparametric Methods 1/42 Overview Great for data analysis
More informationPractice Problems Section Problems
Practice Problems Section 4-4-3 4-4 4-5 4-6 4-7 4-8 4-10 Supplemental Problems 4-1 to 4-9 4-13, 14, 15, 17, 19, 0 4-3, 34, 36, 38 4-47, 49, 5, 54, 55 4-59, 60, 63 4-66, 68, 69, 70, 74 4-79, 81, 84 4-85,
More informationQuantitative Economics for the Evaluation of the European Policy. Dipartimento di Economia e Management
Quantitative Economics for the Evaluation of the European Policy Dipartimento di Economia e Management Irene Brunetti 1 Davide Fiaschi 2 Angela Parenti 3 9 ottobre 2015 1 ireneb@ec.unipi.it. 2 davide.fiaschi@unipi.it.
More informationRecall that in order to prove Theorem 8.8, we argued that under certain regularity conditions, the following facts are true under H 0 : 1 n
Chapter 9 Hypothesis Testing 9.1 Wald, Rao, and Likelihood Ratio Tests Suppose we wish to test H 0 : θ = θ 0 against H 1 : θ θ 0. The likelihood-based results of Chapter 8 give rise to several possible
More informationChapter 10. U-statistics Statistical Functionals and V-Statistics
Chapter 10 U-statistics When one is willing to assume the existence of a simple random sample X 1,..., X n, U- statistics generalize common notions of unbiased estimation such as the sample mean and the
More informationOrder Statistics and Distributions
Order Statistics and Distributions 1 Some Preliminary Comments and Ideas In this section we consider a random sample X 1, X 2,..., X n common continuous distribution function F and probability density
More information2 Functions of random variables
2 Functions of random variables A basic statistical model for sample data is a collection of random variables X 1,..., X n. The data are summarised in terms of certain sample statistics, calculated as
More informationModel Specification Testing in Nonparametric and Semiparametric Time Series Econometrics. Jiti Gao
Model Specification Testing in Nonparametric and Semiparametric Time Series Econometrics Jiti Gao Department of Statistics School of Mathematics and Statistics The University of Western Australia Crawley
More informationRegression #3: Properties of OLS Estimator
Regression #3: Properties of OLS Estimator Econ 671 Purdue University Justin L. Tobias (Purdue) Regression #3 1 / 20 Introduction In this lecture, we establish some desirable properties associated with
More informationConditioning Nonparametric null hypotheses Permutation testing. Permutation tests. Patrick Breheny. October 5. STA 621: Nonparametric Statistics
Permutation tests October 5 The conditioning idea In most hypothesis testing problems, information can be divided into portions that pertain to the hypothesis and portions that do not The usual approach
More information2. Variance and Covariance: We will now derive some classic properties of variance and covariance. Assume real-valued random variables X and Y.
CS450 Final Review Problems Fall 08 Solutions or worked answers provided Problems -6 are based on the midterm review Identical problems are marked recap] Please consult previous recitations and textbook
More informationKRUSKAL-WALLIS ONE-WAY ANALYSIS OF VARIANCE BASED ON LINEAR PLACEMENTS
Bull. Korean Math. Soc. 5 (24), No. 3, pp. 7 76 http://dx.doi.org/34/bkms.24.5.3.7 KRUSKAL-WALLIS ONE-WAY ANALYSIS OF VARIANCE BASED ON LINEAR PLACEMENTS Yicheng Hong and Sungchul Lee Abstract. The limiting
More informationProbability Theory and Statistics. Peter Jochumzen
Probability Theory and Statistics Peter Jochumzen April 18, 2016 Contents 1 Probability Theory And Statistics 3 1.1 Experiment, Outcome and Event................................ 3 1.2 Probability............................................
More informationModelling Non-linear and Non-stationary Time Series
Modelling Non-linear and Non-stationary Time Series Chapter 2: Non-parametric methods Henrik Madsen Advanced Time Series Analysis September 206 Henrik Madsen (02427 Adv. TS Analysis) Lecture Notes September
More informationMathematical statistics
October 18 th, 2018 Lecture 16: Midterm review Countdown to mid-term exam: 7 days Week 1 Chapter 1: Probability review Week 2 Week 4 Week 7 Chapter 6: Statistics Chapter 7: Point Estimation Chapter 8:
More informationUnbiased Estimation. Binomial problem shows general phenomenon. An estimator can be good for some values of θ and bad for others.
Unbiased Estimation Binomial problem shows general phenomenon. An estimator can be good for some values of θ and bad for others. To compare ˆθ and θ, two estimators of θ: Say ˆθ is better than θ if it
More informationECON 721: Lecture Notes on Nonparametric Density and Regression Estimation. Petra E. Todd
ECON 721: Lecture Notes on Nonparametric Density and Regression Estimation Petra E. Todd Fall, 2014 2 Contents 1 Review of Stochastic Order Symbols 1 2 Nonparametric Density Estimation 3 2.1 Histogram
More informationAsymptotic Statistics-III. Changliang Zou
Asymptotic Statistics-III Changliang Zou The multivariate central limit theorem Theorem (Multivariate CLT for iid case) Let X i be iid random p-vectors with mean µ and and covariance matrix Σ. Then n (
More informationOn robust and efficient estimation of the center of. Symmetry.
On robust and efficient estimation of the center of symmetry Howard D. Bondell Department of Statistics, North Carolina State University Raleigh, NC 27695-8203, U.S.A (email: bondell@stat.ncsu.edu) Abstract
More informationCopula Regression RAHUL A. PARSA DRAKE UNIVERSITY & STUART A. KLUGMAN SOCIETY OF ACTUARIES CASUALTY ACTUARIAL SOCIETY MAY 18,2011
Copula Regression RAHUL A. PARSA DRAKE UNIVERSITY & STUART A. KLUGMAN SOCIETY OF ACTUARIES CASUALTY ACTUARIAL SOCIETY MAY 18,2011 Outline Ordinary Least Squares (OLS) Regression Generalized Linear Models
More informationMinimum Hellinger Distance Estimation in a. Semiparametric Mixture Model
Minimum Hellinger Distance Estimation in a Semiparametric Mixture Model Sijia Xiang 1, Weixin Yao 1, and Jingjing Wu 2 1 Department of Statistics, Kansas State University, Manhattan, Kansas, USA 66506-0802.
More information1. Point Estimators, Review
AMS571 Prof. Wei Zhu 1. Point Estimators, Review Example 1. Let be a random sample from. Please find a good point estimator for Solutions. There are the typical estimators for and. Both are unbiased estimators.
More informationSTATS 200: Introduction to Statistical Inference. Lecture 29: Course review
STATS 200: Introduction to Statistical Inference Lecture 29: Course review Course review We started in Lecture 1 with a fundamental assumption: Data is a realization of a random process. The goal throughout
More informationSTAT 285: Fall Semester Final Examination Solutions
Name: Student Number: STAT 285: Fall Semester 2014 Final Examination Solutions 5 December 2014 Instructor: Richard Lockhart Instructions: This is an open book test. As such you may use formulas such as
More informationMath 494: Mathematical Statistics
Math 494: Mathematical Statistics Instructor: Jimin Ding jmding@wustl.edu Department of Mathematics Washington University in St. Louis Class materials are available on course website (www.math.wustl.edu/
More informationIntroduction to Estimation Methods for Time Series models. Lecture 1
Introduction to Estimation Methods for Time Series models Lecture 1 Fulvio Corsi SNS Pisa Fulvio Corsi Introduction to Estimation () Methods for Time Series models Lecture 1 SNS Pisa 1 / 19 Estimation
More informationIntroduction to Simple Linear Regression
Introduction to Simple Linear Regression Yang Feng http://www.stat.columbia.edu/~yangfeng Yang Feng (Columbia University) Introduction to Simple Linear Regression 1 / 68 About me Faculty in the Department
More information1 of 7 7/16/2009 6:12 AM Virtual Laboratories > 7. Point Estimation > 1 2 3 4 5 6 1. Estimators The Basic Statistical Model As usual, our starting point is a random experiment with an underlying sample
More informationNonparametric tests. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 704: Data Analysis I
1 / 16 Nonparametric tests Timothy Hanson Department of Statistics, University of South Carolina Stat 704: Data Analysis I Nonparametric one and two-sample tests 2 / 16 If data do not come from a normal
More informationThis does not cover everything on the final. Look at the posted practice problems for other topics.
Class 7: Review Problems for Final Exam 8.5 Spring 7 This does not cover everything on the final. Look at the posted practice problems for other topics. To save time in class: set up, but do not carry
More informationMathematics Qualifying Examination January 2015 STAT Mathematical Statistics
Mathematics Qualifying Examination January 2015 STAT 52800 - Mathematical Statistics NOTE: Answer all questions completely and justify your derivations and steps. A calculator and statistical tables (normal,
More informationResampling and the Bootstrap
Resampling and the Bootstrap Axel Benner Biostatistics, German Cancer Research Center INF 280, D-69120 Heidelberg benner@dkfz.de Resampling and the Bootstrap 2 Topics Estimation and Statistical Testing
More informationStatistics 100A Homework 5 Solutions
Chapter 5 Statistics 1A Homework 5 Solutions Ryan Rosario 1. Let X be a random variable with probability density function a What is the value of c? fx { c1 x 1 < x < 1 otherwise We know that for fx to
More informationLecture 8: Information Theory and Statistics
Lecture 8: Information Theory and Statistics Part II: Hypothesis Testing and I-Hsiang Wang Department of Electrical Engineering National Taiwan University ihwang@ntu.edu.tw December 23, 2015 1 / 50 I-Hsiang
More informationFinal Overview. Introduction to ML. Marek Petrik 4/25/2017
Final Overview Introduction to ML Marek Petrik 4/25/2017 This Course: Introduction to Machine Learning Build a foundation for practice and research in ML Basic machine learning concepts: max likelihood,
More informationMaster s Written Examination
Master s Written Examination Option: Statistics and Probability Spring 05 Full points may be obtained for correct answers to eight questions Each numbered question (which may have several parts) is worth
More informationEXAMINERS REPORT & SOLUTIONS STATISTICS 1 (MATH 11400) May-June 2009
EAMINERS REPORT & SOLUTIONS STATISTICS (MATH 400) May-June 2009 Examiners Report A. Most plots were well done. Some candidates muddled hinges and quartiles and gave the wrong one. Generally candidates
More informationRandom Variables. Random variables. A numerically valued map X of an outcome ω from a sample space Ω to the real line R
In probabilistic models, a random variable is a variable whose possible values are numerical outcomes of a random phenomenon. As a function or a map, it maps from an element (or an outcome) of a sample
More informationExpectations and Variance
4. Model parameters and their estimates 4.1 Expected Value and Conditional Expected Value 4. The Variance 4.3 Population vs Sample Quantities 4.4 Mean and Variance of a Linear Combination 4.5 The Covariance
More informationBias Variance Trade-off
Bias Variance Trade-off The mean squared error of an estimator MSE(ˆθ) = E([ˆθ θ] 2 ) Can be re-expressed MSE(ˆθ) = Var(ˆθ) + (B(ˆθ) 2 ) MSE = VAR + BIAS 2 Proof MSE(ˆθ) = E((ˆθ θ) 2 ) = E(([ˆθ E(ˆθ)]
More informationNonparametric estimation using wavelet methods. Dominique Picard. Laboratoire Probabilités et Modèles Aléatoires Université Paris VII
Nonparametric estimation using wavelet methods Dominique Picard Laboratoire Probabilités et Modèles Aléatoires Université Paris VII http ://www.proba.jussieu.fr/mathdoc/preprints/index.html 1 Nonparametric
More informationMathematics Ph.D. Qualifying Examination Stat Probability, January 2018
Mathematics Ph.D. Qualifying Examination Stat 52800 Probability, January 2018 NOTE: Answers all questions completely. Justify every step. Time allowed: 3 hours. 1. Let X 1,..., X n be a random sample from
More informationDESIGN-ADAPTIVE MINIMAX LOCAL LINEAR REGRESSION FOR LONGITUDINAL/CLUSTERED DATA
Statistica Sinica 18(2008), 515-534 DESIGN-ADAPTIVE MINIMAX LOCAL LINEAR REGRESSION FOR LONGITUDINAL/CLUSTERED DATA Kani Chen 1, Jianqing Fan 2 and Zhezhen Jin 3 1 Hong Kong University of Science and Technology,
More informationEvaluating Classifiers. Lecture 2 Instructor: Max Welling
Evaluating Classifiers Lecture 2 Instructor: Max Welling Evaluation of Results How do you report classification error? How certain are you about the error you claim? How do you compare two algorithms?
More information18.440: Lecture 28 Lectures Review
18.440: Lecture 28 Lectures 18-27 Review Scott Sheffield MIT Outline Outline It s the coins, stupid Much of what we have done in this course can be motivated by the i.i.d. sequence X i where each X i is
More informationUnit 2 Rational Functionals Exercises MHF 4UI Page 1
Unit 2 Rational Functionals Exercises MHF 4UI Page Exercises 2.: Division of Polynomials. Divide, assuming the divisor is not equal to zero. a) x 3 + 2x 2 7x + 4 ) x + ) b) 3x 4 4x 2 2x + 3 ) x 4) 7. *)
More informationCh 2: Simple Linear Regression
Ch 2: Simple Linear Regression 1. Simple Linear Regression Model A simple regression model with a single regressor x is y = β 0 + β 1 x + ɛ, where we assume that the error ɛ is independent random component
More informationMS&E 226: Small Data
MS&E 226: Small Data Lecture 6: Bias and variance (v5) Ramesh Johari ramesh.johari@stanford.edu 1 / 49 Our plan today We saw in last lecture that model scoring methods seem to be trading off two different
More informationContinuous Random Variables
1 / 24 Continuous Random Variables Saravanan Vijayakumaran sarva@ee.iitb.ac.in Department of Electrical Engineering Indian Institute of Technology Bombay February 27, 2013 2 / 24 Continuous Random Variables
More information6 Single Sample Methods for a Location Parameter
6 Single Sample Methods for a Location Parameter If there are serious departures from parametric test assumptions (e.g., normality or symmetry), nonparametric tests on a measure of central tendency (usually
More information2 (Statistics) Random variables
2 (Statistics) Random variables References: DeGroot and Schervish, chapters 3, 4 and 5; Stirzaker, chapters 4, 5 and 6 We will now study the main tools use for modeling experiments with unknown outcomes
More informationRegression Estimation - Least Squares and Maximum Likelihood. Dr. Frank Wood
Regression Estimation - Least Squares and Maximum Likelihood Dr. Frank Wood Least Squares Max(min)imization Function to minimize w.r.t. β 0, β 1 Q = n (Y i (β 0 + β 1 X i )) 2 i=1 Minimize this by maximizing
More informationBias-Variance Tradeoff. David Dalpiaz STAT 430, Fall 2017
Bias-Variance Tradeoff David Dalpiaz STAT 430, Fall 2017 1 Announcements Homework 03 released Regrade policy Style policy? 2 Statistical Learning Supervised Learning Regression Parametric Non-Parametric
More informationComputational treatment of the error distribution in nonparametric regression with right-censored and selection-biased data
Computational treatment of the error distribution in nonparametric regression with right-censored and selection-biased data Géraldine Laurent 1 and Cédric Heuchenne 2 1 QuantOM, HEC-Management School of
More informationRegression and Statistical Inference
Regression and Statistical Inference Walid Mnif wmnif@uwo.ca Department of Applied Mathematics The University of Western Ontario, London, Canada 1 Elements of Probability 2 Elements of Probability CDF&PDF
More informationThe Statistical Property of Ordinary Least Squares
The Statistical Property of Ordinary Least Squares The linear equation, on which we apply the OLS is y t = X t β + u t Then, as we have derived, the OLS estimator is ˆβ = [ X T X] 1 X T y Then, substituting
More informationProbability and Probability Distributions. Dr. Mohammed Alahmed
Probability and Probability Distributions 1 Probability and Probability Distributions Usually we want to do more with data than just describing them! We might want to test certain specific inferences about
More informationSTAT 135 Lab 5 Bootstrapping and Hypothesis Testing
STAT 135 Lab 5 Bootstrapping and Hypothesis Testing Rebecca Barter March 2, 2015 The Bootstrap Bootstrap Suppose that we are interested in estimating a parameter θ from some population with members x 1,...,
More informationHistogram Härdle, Müller, Sperlich, Werwatz, 1995, Nonparametric and Semiparametric Models, An Introduction
Härdle, Müller, Sperlich, Werwatz, 1995, Nonparametric and Semiparametric Models, An Introduction Tine Buch-Kromann Construction X 1,..., X n iid r.v. with (unknown) density, f. Aim: Estimate the density
More informationTime Series and Forecasting Lecture 4 NonLinear Time Series
Time Series and Forecasting Lecture 4 NonLinear Time Series Bruce E. Hansen Summer School in Economics and Econometrics University of Crete July 23-27, 2012 Bruce Hansen (University of Wisconsin) Foundations
More informationMAT 271E Probability and Statistics
MAT 71E Probability and Statistics Spring 013 Instructor : Class Meets : Office Hours : Textbook : Supp. Text : İlker Bayram EEB 1103 ibayram@itu.edu.tr 13.30 1.30, Wednesday EEB 5303 10.00 1.00, Wednesday
More informationQualifying Exam CS 661: System Simulation Summer 2013 Prof. Marvin K. Nakayama
Qualifying Exam CS 661: System Simulation Summer 2013 Prof. Marvin K. Nakayama Instructions This exam has 7 pages in total, numbered 1 to 7. Make sure your exam has all the pages. This exam will be 2 hours
More informationClass 26: review for final exam 18.05, Spring 2014
Probability Class 26: review for final eam 8.05, Spring 204 Counting Sets Inclusion-eclusion principle Rule of product (multiplication rule) Permutation and combinations Basics Outcome, sample space, event
More informationMULTIVARIATE PROBABILITY DISTRIBUTIONS
MULTIVARIATE PROBABILITY DISTRIBUTIONS. PRELIMINARIES.. Example. Consider an experiment that consists of tossing a die and a coin at the same time. We can consider a number of random variables defined
More informationSummary of basic probability theory Math 218, Mathematical Statistics D Joyce, Spring 2016
8. For any two events E and F, P (E) = P (E F ) + P (E F c ). Summary of basic probability theory Math 218, Mathematical Statistics D Joyce, Spring 2016 Sample space. A sample space consists of a underlying
More informationChapter 4. Chapter 4 sections
Chapter 4 sections 4.1 Expectation 4.2 Properties of Expectations 4.3 Variance 4.4 Moments 4.5 The Mean and the Median 4.6 Covariance and Correlation 4.7 Conditional Expectation SKIP: 4.8 Utility Expectation
More information