Efficient and Robust Scale Estimation
|
|
- Andrew Silvester Barrett
- 6 years ago
- Views:
Transcription
1 Efficient and Robust Scale Estimation Garth Tarr, Samuel Müller and Neville Weber School of Mathematics and Statistics THE UNIVERSITY OF SYDNEY
2 Outline Introduction and motivation The robust scale estimator P n Properties of P n in finite samples Summary and key references
3 History of relative efficiencies at the Gaussian (n = 20) SD Relative Efficiency Year
4 History of relative efficiencies at the Gaussian (n = 20) SD Relative Efficiency IQR MAD Year
5 History of relative efficiencies at the Gaussian (n = 20) SD Relative Efficiency IQR MAD Q n S n Year
6 History of relative efficiencies at the Gaussian (n = 20) 1.00 SD 0.85 P n Relative Efficiency IQR MAD Q n S n Year
7 U-quantile statistics Given data X = (X 1,..., X n ) and a symmetric kernel h : R 2 R a U-statistic of order 2 is defined as: ( ) n 1 U n (X) := h(x i, X j ). (1) 2 i<j
8 U-quantile statistics Given data X = (X 1,..., X n ) and a symmetric kernel h : R 2 R a U-statistic of order 2 is defined as: ( ) n 1 U n (X) := h(x i, X j ). (1) 2 i<j Let H(t) = P (h(x i, X j ) t) be the cdf of the kernels with corresponding empirical distribution function, H n (t) := ( ) n 1 I{h(X i, X j ) t}, for t R. (2) 2 i<j
9 U-quantile statistics Given data X = (X 1,..., X n ) and a symmetric kernel h : R 2 R a U-statistic of order 2 is defined as: ( ) n 1 U n (X) := h(x i, X j ). (1) 2 i<j Let H(t) = P (h(x i, X j ) t) be the cdf of the kernels with corresponding empirical distribution function, H n (t) := ( ) n 1 I{h(X i, X j ) t}, for t R. (2) 2 i<j For 0 < p < 1, the corresponding sample U-quantile is: H 1 n (p) := inf{t : H n (t) p}. (3)
10 Generalized L-statistics A generalized linear (GL) statistic can be defined as where T n (H n ) = I J(p)H 1 n (p)dp + d j=1 J is function for smooth weighting of Hn 1 (p) I [0, 1] is some interval a j are discrete coefficients for H 1 n (p j ) a j H 1 n (p j ). (Serfling, 1984)
11 Examples of GL-statistics Interquartile range: h(x) = x, IQR = Hn 1 (0.75) Hn 1 (0.25)
12 Examples of GL-statistics Interquartile range: h(x) = x, IQR = Hn 1 (0.75) Hn 1 (0.25) Variance: h(x, y) = 1 2 (x y)2, 1 0 Hn 1 (p)dp
13 Examples of GL-statistics Interquartile range: h(x) = x, IQR = Hn 1 (0.75) Hn 1 (0.25) Variance: h(x, y) = 1 2 (x y)2, 1 0 Hn 1 (p)dp Winsorized variance: h(x, y) = 1 2 (x y)2, Hn 1 (p)dp Hn 1 (0.75)
14 Examples of GL-statistics Interquartile range: h(x) = x, IQR = Hn 1 (0.75) Hn 1 (0.25) Variance: h(x, y) = 1 2 (x y)2, 1 0 Hn 1 (p)dp Winsorized variance: h(x, y) = 1 2 (x y)2, Hn 1 (p)dp Hn 1 (0.75) Rousseeuw and Croux s Q n : h(x, y) = x y, H 1 n (0.25)
15 Outline Introduction and motivation The robust scale estimator P n Properties of P n in finite samples Summary and key references
16 Pairwise mean scale estimator: P n Consider the set of ( n 2) pairwise means: {h(x i, X j ), 1 i < j n} where h(x 1, X 2 ) = (X 1 + X 2 )/2.
17 Pairwise mean scale estimator: P n Consider the set of ( n 2) pairwise means: {h(x i, X j ), 1 i < j n} where h(x 1, X 2 ) = (X 1 + X 2 )/2. Let H n be the corresponding empirical distribution function: ( ) n 1 H n (t) := I{h(X i, X j ) t}, for t R. 2 i<j
18 Pairwise mean scale estimator: P n Consider the set of ( n 2) pairwise means: {h(x i, X j ), 1 i < j n} where h(x 1, X 2 ) = (X 1 + X 2 )/2. Let H n be the corresponding empirical distribution function: ( ) n 1 H n (t) := I{h(X i, X j ) t}, for t R. 2 Definition P n is defined as i<j P n = c [ Hn 1 (0.75) Hn 1 (0.25) ], where c is a correction factor to make P n consistent for the standard deviation when the underlying observations are Gaussian.
19 Influence curve The influence curve for a functional T at distribution F is T ((1 ɛ)f + ɛδ x ) T (F ) IC(x; T, F ) = lim ɛ 0 ɛ where δ x has all its mass at x. Serfling (1984) outlines the IC for GL-statistics.
20 Influence curve The influence curve for a functional T at distribution F is T ((1 ɛ)f + ɛδ x ) T (F ) IC(x; T, F ) = lim ɛ 0 ɛ where δ x has all its mass at x. Serfling (1984) outlines the IC for GL-statistics. Influence curve for P n Assuming that F has derivative f > 0 on [F 1 (ɛ), F 1 (1 ɛ)] for all ɛ > 0, [ 0.75 F (2H 1 F (0.75) x) IC(x; P n, F ) = c f(2h 1 F (0.75) x)f(x)dx ] 0.25 F (2H 1 F (0.25) x) f(2h 1. F (0.25) x)f(x)dx
21 Influence curves when F = Φ P n IC(x; T, F ) x
22 Influence curves when F = Φ SD P n IC(x; T, F ) x
23 Influence curves when F = Φ SD P n IC(x; T, F ) MAD x
24 Influence curves when F = Φ SD P n IC(x; T, F ) Q n MAD x
25 Asymptotic normality The empirical U-process is ( n (Hn (t) H(t)) ) t R.
26 Asymptotic normality The empirical U-process is ( n (Hn (t) H(t)) ) t R. Silverman (1976) proved that in this context, n(hn ( ) H( )) converges weakly to an almost sure continuous zero-mean Gaussian process W with covariance function: EW (s)w (t) = 4P (h(x 1, X 2 ) s, h(x 1, X 3 ) t) 4H(s)H(t).
27 Asymptotic normality The empirical U-process is ( n (Hn (t) H(t)) ) t R. Silverman (1976) proved that in this context, n(hn ( ) H( )) converges weakly to an almost sure continuous zero-mean Gaussian process W with covariance function: EW (s)w (t) = 4P (h(x 1, X 2 ) s, h(x 1, X 3 ) t) 4H(s)H(t). For 0 < p < q < 1, if H, the derivative of H, is strictly positive on the interval [H 1 (p) ε, H 1 (q) + ε] for some ε > 0, then we can use the inverse map to show n ( H 1 n ( ) H 1 ( ) ) D W (H 1 ( )) H (H 1 ( )).
28 Asymptotic normality Recall, P n = c [ Hn 1 (3/4) Hn 1 (1/4) ].
29 Asymptotic normality Recall, P n = c [ Hn 1 (3/4) Hn 1 (1/4) ]. Hence, where n(pn θ) D N (0, V ) θ = c ( H 1 (3/4) H 1 (1/4) ) and V both depend on the underlying distribution.
30 Asymptotic variance and relative efficiency The asymptotic variance follows from the previous limit theorem or from the expected square of the influence function.
31 Asymptotic variance and relative efficiency The asymptotic variance follows from the previous limit theorem or from the expected square of the influence function. When the underlying data are Gaussian, numerical integration yields, V = IC(x, P n, Φ) 2 dφ(x) =
32 Asymptotic variance and relative efficiency The asymptotic variance follows from the previous limit theorem or from the expected square of the influence function. When the underlying data are Gaussian, numerical integration yields, V = IC(x, P n, Φ) 2 dφ(x) = This equates to an asymptotic efficiency of 0.86 as compared with 0.82 for Q n and 0.37 for the MAD at the normal.
33 Asymptotic variance and relative efficiency The asymptotic variance follows from the previous limit theorem or from the expected square of the influence function. When the underlying data are Gaussian, numerical integration yields, V = IC(x, P n, Φ) 2 dφ(x) = This equates to an asymptotic efficiency of 0.86 as compared with 0.82 for Q n and 0.37 for the MAD at the normal. Result We have a robust estimator that is asymptotically more efficient than Q n at the normal. But how does it compare at heavier tailed distributions?
34 Asymptotic relative efficiency when f = t ν for ν [1, 10] Asymptotic relative efficiency P n Degrees of freedom
35 Asymptotic relative efficiency when f = t ν for ν [1, 10] Asymptotic relative efficiency MAD Degrees of freedom P n
36 Asymptotic relative efficiency when f = t ν for ν [1, 10] Asymptotic relative efficiency P n Q n MAD Degrees of freedom
37 Discrete distributions For P n = 0, the interquartile range of the pairwise means must be equal to zero.
38 Discrete distributions For P n = 0, the interquartile range of the pairwise means must be equal to zero. I.e. more than 50% of the pairwise means must be equal.
39 Discrete distributions For P n = 0, the interquartile range of the pairwise means must be equal to zero. I.e. more than 50% of the pairwise means must be equal. For a discrete random variable with possible outcomes, x 1, x 2,..., and P(X = x j ) = p j, for j = 1, 2,..., the expected proportion of pairwise differences equal to zero is j p2 j. In particular, p 2 j > 0.25 lim P(Q n = 0) = 1. n j
40 Discrete distributions For P n = 0, the interquartile range of the pairwise means must be equal to zero. I.e. more than 50% of the pairwise means must be equal. For a discrete random variable with possible outcomes, x 1, x 2,..., and P(X = x j ) = p j, for j = 1, 2,..., the expected proportion of pairwise differences equal to zero is j p2 j. In particular, p 2 j > 0.25 lim P(Q n = 0) = 1. n j Example (X Poisson(1)) For the Poisson(1), j p2 j 0.31 and so in the limit, Q n = 0 with high probability, whereas c 1 P n = 1.
41 Discrete distributions In finite samples, apart from some trivial cases, P n = 0 = Q n = 0.
42 Discrete distributions In finite samples, apart from some trivial cases, P n = 0 = Q n = 0. Example (X Binomial(6, 0.4)) In the limit neither Q n nor P n will converge to zero. In samples of size n = 20, Q n will return a scale estimate of zero, on average 12% of the time. In contrast, P n returns zero less than 0.1% of the time.
43 Adaptive trimming: Pn (Potential) Cons with P n P n has a breakdown value of 13%. P n is not very efficient at the Cauchy.
44 Adaptive trimming: Pn (Potential) Cons with P n P n has a breakdown value of 13%. P n is not very efficient at the Cauchy. For preliminary high breakdown location and scale estimates, m(x) and s(x) respectively, an observation, X i, is trimmed if X i m(x) s(x) where d is the tuning parameter. > d, (4)
45 Adaptive trimming: Pn (Potential) Cons with P n P n has a breakdown value of 13%. P n is not very efficient at the Cauchy. For preliminary high breakdown location and scale estimates, m(x) and s(x) respectively, an observation, X i, is trimmed if X i m(x) s(x) > d, (4) where d is the tuning parameter. Achieves the best possible breakdown value for a sensible choice of tuning parameter.
46 Adaptive trimming: Pn (Potential) Cons with P n P n has a breakdown value of 13%. P n is not very efficient at the Cauchy. For preliminary high breakdown location and scale estimates, m(x) and s(x) respectively, an observation, X i, is trimmed if X i m(x) s(x) > d, (4) where d is the tuning parameter. Achieves the best possible breakdown value for a sensible choice of tuning parameter. Definition Denote P n as the adaptively trimmed P n with d = 5.
47 Outline Introduction and motivation The robust scale estimator P n Properties of P n in finite samples Summary and key references
48 Efficiency of P n in finite samples Following Randal (2008) efficiencies are estimated over m independent samples as êff(t ) = Var (ln ˆσ 1,..., ln ˆσ m ) Var (ln T (X 1 ),..., ln T (X m )). (5)
49 Efficiency of P n in finite samples Following Randal (2008) efficiencies are estimated over m independent samples as êff(t ) = For each i = 1, 2,..., m, Var (ln ˆσ 1,..., ln ˆσ m ) Var (ln T (X 1 ),..., ln T (X m )). (5) X i are independent samples of size n, ˆσ i is the ML scale estimate, and T (X i ) is the proposed scale estimate.
50 Efficiency of P n in finite samples Following Randal (2008) efficiencies are estimated over m independent samples as êff(t ) = For each i = 1, 2,..., m, Var (ln ˆσ 1,..., ln ˆσ m ) Var (ln T (X 1 ),..., ln T (X m )). (5) X i are independent samples of size n, ˆσ i is the ML scale estimate, and T (X i ) is the proposed scale estimate. Distributions considered t distributions with degrees of freedom between 1 and 10. Configural polysampling using Tukey s 3 corners: Gaussian, One-wild and Slash.
51 Relative efficiencies: f = t ν for ν [1, 10] and n = 20 Estimated relative efficiency P n Degrees of freedom
52 Relative efficiencies: f = t ν for ν [1, 10] and n = 20 Estimated relative efficiency P n Degrees of freedom P n
53 Relative efficiencies: f = t ν for ν [1, 10] and n = 20 Estimated relative efficiency Degrees of freedom P n P n Q n
54 Relative efficiencies at the Slash corner (n = 20) P n 48% P n 75% 10% trim P n Q n 95% S n MAD 87% IQR Estimated relative efficiency
55 Relative efficiencies at the One-wild corner (n = 20) P n 84% P n 72% 10% trim P n Q n 68% S n MAD 40% IQR Estimated relative efficiency
56 Relative efficiencies at the Gaussian corner (n = 20) P n P n 81% 85% 10% trim P n Q n 67% S n MAD 38% IQR Estimated relative efficiency
57 Outline Introduction and motivation The robust scale estimator P n Properties of P n in finite samples Summary and key references
58 Summary 1. Aim An efficient, robust and widely applicable scale estimator.
59 Summary 1. Aim An efficient, robust and widely applicable scale estimator. 2. Method P n scale estimator a GL-statistic.
60 Summary 1. Aim An efficient, robust and widely applicable scale estimator. 2. Method P n scale estimator a GL-statistic. 3. Results 86% asymptotic efficiency at the normal. Highly efficient even when the underlying distribution has quite heavy tails. Less likely to fail for discrete distributions than Q n.
61 References Hampel, F. (1974). The influence curve and its role in robust estimation. Journal of the American Statistical Association, 69(346): Randal, J. (2008). A reinvestigation of robust scale estimation in finite samples. Computational Statistics & Data Analysis, 52(11): Rousseeuw, P. and Croux, C. (1993). Alternatives to the median absolute deviation. Journal of the American Statistical Association, 88(424): Serfling, R. J. (1984). Generalized L-, M-, and R-statistics. The Annals of Statistics, 12(1): Silverman, B. (1976). Limit theorems for dissociated random variables. Advances in Applied Probability, 8(4):
Robust estimation of scale and covariance with P n and its application to precision matrix estimation
Robust estimation of scale and covariance with P n and its application to precision matrix estimation Garth Tarr, Samuel Müller and Neville Weber USYD 2013 School of Mathematics and Statistics THE UNIVERSITY
More informationRobust scale estimation with extensions
Robust scale estimation with extensions Garth Tarr, Samuel Müller and Neville Weber School of Mathematics and Statistics THE UNIVERSITY OF SYDNEY Outline The robust scale estimator P n Robust covariance
More informationLecture 14 October 13
STAT 383C: Statistical Modeling I Fall 2015 Lecture 14 October 13 Lecturer: Purnamrita Sarkar Scribe: Some one Disclaimer: These scribe notes have been slightly proofread and may have typos etc. Note:
More informationRobust methods and model selection. Garth Tarr September 2015
Robust methods and model selection Garth Tarr September 2015 Outline 1. The past: robust statistics 2. The present: model selection 3. The future: protein data, meat science, joint modelling, data visualisation
More informationLecture 12 Robust Estimation
Lecture 12 Robust Estimation Prof. Dr. Svetlozar Rachev Institute for Statistics and Mathematical Economics University of Karlsruhe Financial Econometrics, Summer Semester 2007 Copyright These lecture-notes
More information1 of 7 7/16/2009 6:12 AM Virtual Laboratories > 7. Point Estimation > 1 2 3 4 5 6 1. Estimators The Basic Statistical Model As usual, our starting point is a random experiment with an underlying sample
More informationMVE055/MSG Lecture 8
MVE055/MSG810 2017 Lecture 8 Petter Mostad Chalmers September 23, 2017 The Central Limit Theorem (CLT) Assume X 1,..., X n is a random sample from a distribution with expectation µ and variance σ 2. Then,
More informationGeneralized Multivariate Rank Type Test Statistics via Spatial U-Quantiles
Generalized Multivariate Rank Type Test Statistics via Spatial U-Quantiles Weihua Zhou 1 University of North Carolina at Charlotte and Robert Serfling 2 University of Texas at Dallas Final revision for
More informationA Comparison of Robust Estimators Based on Two Types of Trimming
Submitted to the Bernoulli A Comparison of Robust Estimators Based on Two Types of Trimming SUBHRA SANKAR DHAR 1, and PROBAL CHAUDHURI 1, 1 Theoretical Statistics and Mathematics Unit, Indian Statistical
More informationBootstrapping high dimensional vector: interplay between dependence and dimensionality
Bootstrapping high dimensional vector: interplay between dependence and dimensionality Xianyang Zhang Joint work with Guang Cheng University of Missouri-Columbia LDHD: Transition Workshop, 2014 Xianyang
More informationLARGE SAMPLE BEHAVIOR OF SOME WELL-KNOWN ROBUST ESTIMATORS UNDER LONG-RANGE DEPENDENCE
LARGE SAMPLE BEHAVIOR OF SOME WELL-KNOWN ROBUST ESTIMATORS UNDER LONG-RANGE DEPENDENCE C. LÉVY-LEDUC, H. BOISTARD, E. MOULINES, M. S. TAQQU, AND V. A. REISEN Abstract. The paper concerns robust location
More informationSymmetrised M-estimators of multivariate scatter
Journal of Multivariate Analysis 98 (007) 1611 169 www.elsevier.com/locate/jmva Symmetrised M-estimators of multivariate scatter Seija Sirkiä a,, Sara Taskinen a, Hannu Oja b a Department of Mathematics
More informationMIT Spring 2015
MIT 18.443 Dr. Kempthorne Spring 2015 MIT 18.443 1 Outline 1 MIT 18.443 2 Batches of data: single or multiple x 1, x 2,..., x n y 1, y 2,..., y m w 1, w 2,..., w l etc. Graphical displays Summary statistics:
More informationStatistical Data Analysis
DS-GA 0 Lecture notes 8 Fall 016 1 Descriptive statistics Statistical Data Analysis In this section we consider the problem of analyzing a set of data. We describe several techniques for visualizing the
More informationTable of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z).
Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z). For example P(X 1.04) =.8508. For z < 0 subtract the value from
More informationON THE MAXIMUM BIAS FUNCTIONS OF MM-ESTIMATES AND CONSTRAINED M-ESTIMATES OF REGRESSION
ON THE MAXIMUM BIAS FUNCTIONS OF MM-ESTIMATES AND CONSTRAINED M-ESTIMATES OF REGRESSION BY JOSE R. BERRENDERO, BEATRIZ V.M. MENDES AND DAVID E. TYLER 1 Universidad Autonoma de Madrid, Federal University
More informationHeteroskedasticity in Time Series
Heteroskedasticity in Time Series Figure: Time Series of Daily NYSE Returns. 206 / 285 Key Fact 1: Stock Returns are Approximately Serially Uncorrelated Figure: Correlogram of Daily Stock Market Returns.
More informationMedian Cross-Validation
Median Cross-Validation Chi-Wai Yu 1, and Bertrand Clarke 2 1 Department of Mathematics Hong Kong University of Science and Technology 2 Department of Medicine University of Miami IISA 2011 Outline Motivational
More informationMeasuring robustness
Measuring robustness 1 Introduction While in the classical approach to statistics one aims at estimates which have desirable properties at an exactly speci ed model, the aim of robust methods is loosely
More informationRobust Testing and Variable Selection for High-Dimensional Time Series
Robust Testing and Variable Selection for High-Dimensional Time Series Ruey S. Tsay Booth School of Business, University of Chicago May, 2017 Ruey S. Tsay HTS 1 / 36 Outline 1 Focus on high-dimensional
More informationTime Series Models for Measuring Market Risk
Time Series Models for Measuring Market Risk José Miguel Hernández Lobato Universidad Autónoma de Madrid, Computer Science Department June 28, 2007 1/ 32 Outline 1 Introduction 2 Competitive and collaborative
More informationContinuous Distributions
Chapter 3 Continuous Distributions 3.1 Continuous-Type Data In Chapter 2, we discuss random variables whose space S contains a countable number of outcomes (i.e. of discrete type). In Chapter 3, we study
More informationBayesian Semiparametric GARCH Models
Bayesian Semiparametric GARCH Models Xibin (Bill) Zhang and Maxwell L. King Department of Econometrics and Business Statistics Faculty of Business and Economics xibin.zhang@monash.edu Quantitative Methods
More informationDetecting outliers in weighted univariate survey data
Detecting outliers in weighted univariate survey data Anna Pauliina Sandqvist October 27, 21 Preliminary Version Abstract Outliers and influential observations are a frequent concern in all kind of statistics,
More informationMath 180A. Lecture 16 Friday May 7 th. Expectation. Recall the three main probability density functions so far (1) Uniform (2) Exponential.
Math 8A Lecture 6 Friday May 7 th Epectation Recall the three main probability density functions so far () Uniform () Eponential (3) Power Law e, ( ), Math 8A Lecture 6 Friday May 7 th Epectation Eample
More informationBayesian Semiparametric GARCH Models
Bayesian Semiparametric GARCH Models Xibin (Bill) Zhang and Maxwell L. King Department of Econometrics and Business Statistics Faculty of Business and Economics xibin.zhang@monash.edu Quantitative Methods
More informationLocation Multiplicative Error Model. Asymptotic Inference and Empirical Analysis
: Asymptotic Inference and Empirical Analysis Qian Li Department of Mathematics and Statistics University of Missouri-Kansas City ql35d@mail.umkc.edu October 29, 2015 Outline of Topics Introduction GARCH
More information1 General problem. 2 Terminalogy. Estimation. Estimate θ. (Pick a plausible distribution from family. ) Or estimate τ = τ(θ).
Estimation February 3, 206 Debdeep Pati General problem Model: {P θ : θ Θ}. Observe X P θ, θ Θ unknown. Estimate θ. (Pick a plausible distribution from family. ) Or estimate τ = τ(θ). Examples: θ = (µ,
More informationKriging models with Gaussian processes - covariance function estimation and impact of spatial sampling
Kriging models with Gaussian processes - covariance function estimation and impact of spatial sampling François Bachoc former PhD advisor: Josselin Garnier former CEA advisor: Jean-Marc Martinez Department
More informationAsymptotic statistics using the Functional Delta Method
Quantiles, Order Statistics and L-Statsitics TU Kaiserslautern 15. Februar 2015 Motivation Functional The delta method introduced in chapter 3 is an useful technique to turn the weak convergence of random
More informationApplying the Q n Estimator Online
Applying the Q n Estimator Online Robin Nunkesser 1 Karen Schettlinger 2 Roland Fried 2 1 Department of Computer Science, University of Dortmund 2 Department of Statistics, University of Dortmund GfKl
More informationRegression diagnostics
Regression diagnostics Kerby Shedden Department of Statistics, University of Michigan November 5, 018 1 / 6 Motivation When working with a linear model with design matrix X, the conventional linear model
More informationPublished: 26 April 2016
Electronic Journal of Applied Statistical Analysis EJASA, Electron. J. App. Stat. Anal. http://siba-ese.unisalento.it/index.php/ejasa/index e-issn: 2070-5948 DOI: 10.1285/i20705948v9n1p111 A robust dispersion
More informationInference for High Dimensional Robust Regression
Department of Statistics UC Berkeley Stanford-Berkeley Joint Colloquium, 2015 Table of Contents 1 Background 2 Main Results 3 OLS: A Motivating Example Table of Contents 1 Background 2 Main Results 3 OLS:
More informationA Brief Overview of Robust Statistics
A Brief Overview of Robust Statistics Olfa Nasraoui Department of Computer Engineering & Computer Science University of Louisville, olfa.nasraoui_at_louisville.edu Robust Statistical Estimators Robust
More informationAdditional Problems Additional Problem 1 Like the http://www.stat.umn.edu/geyer/5102/examp/rlike.html#lmax example of maximum likelihood done by computer except instead of the gamma shape model, we will
More informationConfidence Regions For The Ratio Of Two Percentiles
Confidence Regions For The Ratio Of Two Percentiles Richard Johnson Joint work with Li-Fei Huang and Songyong Sim January 28, 2009 OUTLINE Introduction Exact sampling results normal linear model case Other
More informationarxiv: v1 [math.st] 20 May 2014
ON THE EFFICIENCY OF GINI S MEAN DIFFERENCE CARINA GERSTENBERGER AND DANIEL VOGEL arxiv:405.5027v [math.st] 20 May 204 Abstract. The asymptotic relative efficiency of the mean deviation with respect to
More informationChapter 5. Statistical Models in Simulations 5.1. Prof. Dr. Mesut Güneş Ch. 5 Statistical Models in Simulations
Chapter 5 Statistical Models in Simulations 5.1 Contents Basic Probability Theory Concepts Discrete Distributions Continuous Distributions Poisson Process Empirical Distributions Useful Statistical Models
More informationarxiv: v1 [math.st] 2 Aug 2007
The Annals of Statistics 2007, Vol. 35, No. 1, 13 40 DOI: 10.1214/009053606000000975 c Institute of Mathematical Statistics, 2007 arxiv:0708.0390v1 [math.st] 2 Aug 2007 ON THE MAXIMUM BIAS FUNCTIONS OF
More informationAdvanced Statistics II: Non Parametric Tests
Advanced Statistics II: Non Parametric Tests Aurélien Garivier ParisTech February 27, 2011 Outline Fitting a distribution Rank Tests for the comparison of two samples Two unrelated samples: Mann-Whitney
More informationOn robust and efficient estimation of the center of. Symmetry.
On robust and efficient estimation of the center of symmetry Howard D. Bondell Department of Statistics, North Carolina State University Raleigh, NC 27695-8203, U.S.A (email: bondell@stat.ncsu.edu) Abstract
More informationInference on distributions and quantiles using a finite-sample Dirichlet process
Dirichlet IDEAL Theory/methods Simulations Inference on distributions and quantiles using a finite-sample Dirichlet process David M. Kaplan University of Missouri Matt Goldman UC San Diego Midwest Econometrics
More informationAsymptotic Statistics-III. Changliang Zou
Asymptotic Statistics-III Changliang Zou The multivariate central limit theorem Theorem (Multivariate CLT for iid case) Let X i be iid random p-vectors with mean µ and and covariance matrix Σ. Then n (
More informationUniversity of California, Berkeley
University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 2009 Paper 251 Nonparametric population average models: deriving the form of approximate population
More informationECE 275A Homework 7 Solutions
ECE 275A Homework 7 Solutions Solutions 1. For the same specification as in Homework Problem 6.11 we want to determine an estimator for θ using the Method of Moments (MOM). In general, the MOM estimator
More informationTable of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z).
Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z). For example P(X.04) =.8508. For z < 0 subtract the value from,
More informationAsymptotic Relative Efficiency in Estimation
Asymptotic Relative Efficiency in Estimation Robert Serfling University of Texas at Dallas October 2009 Prepared for forthcoming INTERNATIONAL ENCYCLOPEDIA OF STATISTICAL SCIENCES, to be published by Springer
More informationIMPROVING THE SMALL-SAMPLE EFFICIENCY OF A ROBUST CORRELATION MATRIX: A NOTE
IMPROVING THE SMALL-SAMPLE EFFICIENCY OF A ROBUST CORRELATION MATRIX: A NOTE Eric Blankmeyer Department of Finance and Economics McCoy College of Business Administration Texas State University San Marcos
More informationLecture 1: August 28
36-705: Intermediate Statistics Fall 2017 Lecturer: Siva Balakrishnan Lecture 1: August 28 Our broad goal for the first few lectures is to try to understand the behaviour of sums of independent random
More informationFall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.
1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n
More informationProbability Density Functions
Statistical Methods in Particle Physics / WS 13 Lecture II Probability Density Functions Niklaus Berger Physics Institute, University of Heidelberg Recap of Lecture I: Kolmogorov Axioms Ingredients: Set
More informationLinear regression for heavy tails
Linear regression for heavy tails Guus Balkema & Paul Embrechts Universiteit van Amsterdam & ETH Zürich Abstract There exist several estimators of the regression line in the simple linear regression Y
More informationTeruko Takada Department of Economics, University of Illinois. Abstract
Nonparametric density estimation: A comparative study Teruko Takada Department of Economics, University of Illinois Abstract Motivated by finance applications, the objective of this paper is to assess
More informationInference based on robust estimators Part 2
Inference based on robust estimators Part 2 Matias Salibian-Barrera 1 Department of Statistics University of British Columbia ECARES - Dec 2007 Matias Salibian-Barrera (UBC) Robust inference (2) ECARES
More informationMath 494: Mathematical Statistics
Math 494: Mathematical Statistics Instructor: Jimin Ding jmding@wustl.edu Department of Mathematics Washington University in St. Louis Class materials are available on course website (www.math.wustl.edu/
More informationPractical Bayesian Optimization of Machine Learning. Learning Algorithms
Practical Bayesian Optimization of Machine Learning Algorithms CS 294 University of California, Berkeley Tuesday, April 20, 2016 Motivation Machine Learning Algorithms (MLA s) have hyperparameters that
More informationRobustness. James H. Steiger. Department of Psychology and Human Development Vanderbilt University. James H. Steiger (Vanderbilt University) 1 / 37
Robustness James H. Steiger Department of Psychology and Human Development Vanderbilt University James H. Steiger (Vanderbilt University) 1 / 37 Robustness 1 Introduction 2 Robust Parameters and Robust
More informationBootstrap & Confidence/Prediction intervals
Bootstrap & Confidence/Prediction intervals Olivier Roustant Mines Saint-Étienne 2017/11 Olivier Roustant (EMSE) Bootstrap & Confidence/Prediction intervals 2017/11 1 / 9 Framework Consider a model with
More informationSTA 2201/442 Assignment 2
STA 2201/442 Assignment 2 1. This is about how to simulate from a continuous univariate distribution. Let the random variable X have a continuous distribution with density f X (x) and cumulative distribution
More informationRegression #3: Properties of OLS Estimator
Regression #3: Properties of OLS Estimator Econ 671 Purdue University Justin L. Tobias (Purdue) Regression #3 1 / 20 Introduction In this lecture, we establish some desirable properties associated with
More informationSmall Sample Corrections for LTS and MCD
myjournal manuscript No. (will be inserted by the editor) Small Sample Corrections for LTS and MCD G. Pison, S. Van Aelst, and G. Willems Department of Mathematics and Computer Science, Universitaire Instelling
More informationNonparametric Methods
Nonparametric Methods Michael R. Roberts Department of Finance The Wharton School University of Pennsylvania July 28, 2009 Michael R. Roberts Nonparametric Methods 1/42 Overview Great for data analysis
More informationFinancial Econometrics and Quantitative Risk Managenent Return Properties
Financial Econometrics and Quantitative Risk Managenent Return Properties Eric Zivot Updated: April 1, 2013 Lecture Outline Course introduction Return definitions Empirical properties of returns Reading
More informationStatistics: Learning models from data
DS-GA 1002 Lecture notes 5 October 19, 2015 Statistics: Learning models from data Learning models from data that are assumed to be generated probabilistically from a certain unknown distribution is a crucial
More informationGraduate Econometrics I: Asymptotic Theory
Graduate Econometrics I: Asymptotic Theory Yves Dominicy Université libre de Bruxelles Solvay Brussels School of Economics and Management ECARES Yves Dominicy Graduate Econometrics I: Asymptotic Theory
More informationStatistical Methods in Particle Physics
Statistical Methods in Particle Physics Lecture 3 October 29, 2012 Silvia Masciocchi, GSI Darmstadt s.masciocchi@gsi.de Winter Semester 2012 / 13 Outline Reminder: Probability density function Cumulative
More informationRobust Backtesting Tests for Value-at-Risk Models
Robust Backtesting Tests for Value-at-Risk Models Jose Olmo City University London (joint work with Juan Carlos Escanciano, Indiana University) Far East and South Asia Meeting of the Econometric Society
More informationChapter 1 Statistical Reasoning Why statistics? Section 1.1 Basics of Probability Theory
Chapter 1 Statistical Reasoning Why statistics? Uncertainty of nature (weather, earth movement, etc. ) Uncertainty in observation/sampling/measurement Variability of human operation/error imperfection
More informationMATH Solutions to Probability Exercises
MATH 5 9 MATH 5 9 Problem. Suppose we flip a fair coin once and observe either T for tails or H for heads. Let X denote the random variable that equals when we observe tails and equals when we observe
More informationCounting principles, including permutations and combinations.
1 Counting principles, including permutations and combinations. The binomial theorem: expansion of a + b n, n ε N. THE PRODUCT RULE If there are m different ways of performing an operation and for each
More informationWeek 2. Review of Probability, Random Variables and Univariate Distributions
Week 2 Review of Probability, Random Variables and Univariate Distributions Probability Probability Probability Motivation What use is Probability Theory? Probability models Basis for statistical inference
More informationDistributions of Functions of Random Variables. 5.1 Functions of One Random Variable
Distributions of Functions of Random Variables 5.1 Functions of One Random Variable 5.2 Transformations of Two Random Variables 5.3 Several Random Variables 5.4 The Moment-Generating Function Technique
More informationSpatial autocorrelation: robustness of measures and tests
Spatial autocorrelation: robustness of measures and tests Marie Ernst and Gentiane Haesbroeck University of Liege London, December 14, 2015 Spatial Data Spatial data : geographical positions non spatial
More informationTHE BREAKDOWN POINT EXAMPLES AND COUNTEREXAMPLES
REVSTAT Statistical Journal Volume 5, Number 1, March 2007, 1 17 THE BREAKDOWN POINT EXAMPLES AND COUNTEREXAMPLES Authors: P.L. Davies University of Duisburg-Essen, Germany, and Technical University Eindhoven,
More informationEcon 583 Final Exam Fall 2008
Econ 583 Final Exam Fall 2008 Eric Zivot December 11, 2008 Exam is due at 9:00 am in my office on Friday, December 12. 1 Maximum Likelihood Estimation and Asymptotic Theory Let X 1,...,X n be iid random
More informationContents 1. Contents
Contents 1 Contents 6 Distributions of Functions of Random Variables 2 6.1 Transformation of Discrete r.v.s............. 3 6.2 Method of Distribution Functions............. 6 6.3 Method of Transformations................
More information9. Robust regression
9. Robust regression Least squares regression........................................................ 2 Problems with LS regression..................................................... 3 Robust regression............................................................
More informationPhysics 403. Segev BenZvi. Parameter Estimation, Correlations, and Error Bars. Department of Physics and Astronomy University of Rochester
Physics 403 Parameter Estimation, Correlations, and Error Bars Segev BenZvi Department of Physics and Astronomy University of Rochester Table of Contents 1 Review of Last Class Best Estimates and Reliability
More informationAsymptotic Statistics-VI. Changliang Zou
Asymptotic Statistics-VI Changliang Zou Kolmogorov-Smirnov distance Example (Kolmogorov-Smirnov confidence intervals) We know given α (0, 1), there is a well-defined d = d α,n such that, for any continuous
More informationON THE CALCULATION OF A ROBUST S-ESTIMATOR OF A COVARIANCE MATRIX
STATISTICS IN MEDICINE Statist. Med. 17, 2685 2695 (1998) ON THE CALCULATION OF A ROBUST S-ESTIMATOR OF A COVARIANCE MATRIX N. A. CAMPBELL *, H. P. LOPUHAA AND P. J. ROUSSEEUW CSIRO Mathematical and Information
More informationRandom Variables. Random variables. A numerically valued map X of an outcome ω from a sample space Ω to the real line R
In probabilistic models, a random variable is a variable whose possible values are numerical outcomes of a random phenomenon. As a function or a map, it maps from an element (or an outcome) of a sample
More informationIntroduction Robust regression Examples Conclusion. Robust regression. Jiří Franc
Robust regression Robust estimation of regression coefficients in linear regression model Jiří Franc Czech Technical University Faculty of Nuclear Sciences and Physical Engineering Department of Mathematics
More informationMATH4427 Notebook 4 Fall Semester 2017/2018
MATH4427 Notebook 4 Fall Semester 2017/2018 prepared by Professor Jenny Baglivo c Copyright 2009-2018 by Jenny A. Baglivo. All Rights Reserved. 4 MATH4427 Notebook 4 3 4.1 K th Order Statistics and Their
More informationSome Statistics. V. Lindberg. May 16, 2007
Some Statistics V. Lindberg May 16, 2007 1 Go here for full details An excellent reference written by physicists with sample programs available is Data Reduction and Error Analysis for the Physical Sciences,
More informationHighly Robust Variogram Estimation 1. Marc G. Genton 2
Mathematical Geology, Vol. 30, No. 2, 1998 Highly Robust Variogram Estimation 1 Marc G. Genton 2 The classical variogram estimator proposed by Matheron is not robust against outliers in the data, nor is
More informationTerminology Suppose we have N observations {x(n)} N 1. Estimators as Random Variables. {x(n)} N 1
Estimation Theory Overview Properties Bias, Variance, and Mean Square Error Cramér-Rao lower bound Maximum likelihood Consistency Confidence intervals Properties of the mean estimator Properties of the
More informationNegative Association, Ordering and Convergence of Resampling Methods
Negative Association, Ordering and Convergence of Resampling Methods Nicolas Chopin ENSAE, Paristech (Joint work with Mathieu Gerber and Nick Whiteley, University of Bristol) Resampling schemes: Informal
More informationFrequency Analysis & Probability Plots
Note Packet #14 Frequency Analysis & Probability Plots CEE 3710 October 0, 017 Frequency Analysis Process by which engineers formulate magnitude of design events (i.e. 100 year flood) or assess risk associated
More informationLimiting Distributions
Limiting Distributions We introduce the mode of convergence for a sequence of random variables, and discuss the convergence in probability and in distribution. The concept of convergence leads us to the
More informationDescriptive Statistics
Descriptive Statistics DS GA 1002 Probability and Statistics for Data Science http://www.cims.nyu.edu/~cfgranda/pages/dsga1002_fall17 Carlos Fernandez-Granda Descriptive statistics Techniques to visualize
More informationWeakly dependent functional data. Piotr Kokoszka. Utah State University. Siegfried Hörmann. University of Utah
Weakly dependent functional data Piotr Kokoszka Utah State University Joint work with Siegfried Hörmann University of Utah Outline Examples of functional time series L 4 m approximability Convergence of
More informationREGRESSION ANALYSIS AND ANALYSIS OF VARIANCE
REGRESSION ANALYSIS AND ANALYSIS OF VARIANCE P. L. Davies Eindhoven, February 2007 Reading List Daniel, C. (1976) Applications of Statistics to Industrial Experimentation, Wiley. Tukey, J. W. (1977) Exploratory
More informationBTRY 4090: Spring 2009 Theory of Statistics
BTRY 4090: Spring 2009 Theory of Statistics Guozhang Wang September 25, 2010 1 Review of Probability We begin with a real example of using probability to solve computationally intensive (or infeasible)
More informationReview. DS GA 1002 Statistical and Mathematical Models. Carlos Fernandez-Granda
Review DS GA 1002 Statistical and Mathematical Models http://www.cims.nyu.edu/~cfgranda/pages/dsga1002_fall16 Carlos Fernandez-Granda Probability and statistics Probability: Framework for dealing with
More informationProbability and Measure
Part II Year 2018 2017 2016 2015 2014 2013 2012 2011 2010 2009 2008 2007 2006 2005 2018 84 Paper 4, Section II 26J Let (X, A) be a measurable space. Let T : X X be a measurable map, and µ a probability
More informationOn rank tests for shift detection in time series
Computational Statistics & Data Analysis 52 (27) 221 233 www.elsevier.com/locate/csda On rank tests for shift detection in time series Roland Fried a,, Ursula Gather a a Department of Statistics, University
More informationProblem 1 (20) Log-normal. f(x) Cauchy
ORF 245. Rigollet Date: 11/21/2008 Problem 1 (20) f(x) f(x) 0.0 0.1 0.2 0.3 0.4 0.0 0.2 0.4 0.6 0.8 4 2 0 2 4 Normal (with mean -1) 4 2 0 2 4 Negative-exponential x x f(x) f(x) 0.0 0.1 0.2 0.3 0.4 0.5
More informationWEIGHTED QUANTILE REGRESSION THEORY AND ITS APPLICATION. Abstract
Journal of Data Science,17(1). P. 145-160,2019 DOI:10.6339/JDS.201901_17(1).0007 WEIGHTED QUANTILE REGRESSION THEORY AND ITS APPLICATION Wei Xiong *, Maozai Tian 2 1 School of Statistics, University of
More informationEfficient Regressions via Optimally Combining Quantile Information
Efficient Regressions via Optimally Combining Quantile Information Zhibiao Zhao Penn State University Zhijie Xiao Boston College September 29, 2011 Abstract We study efficient estimation of regression
More information