A NORMAL SCALE MIXTURE REPRESENTATION OF THE LOGISTIC DISTRIBUTION ABSTRACT

Similar documents
NORMAL SCALE MIXTURE APPROXIMATIONS TO THE LOGISTIC DISTRIBUTION WITH APPLICATIONS ABSTRACT

Bivariate Weibull-power series class of distributions

A comparison of inverse transform and composition methods of data simulation from the Lindley distribution

On robust and efficient estimation of the center of. Symmetry.

Full likelihood inferences in the Cox model: an empirical likelihood approach

FULL LIKELIHOOD INFERENCES IN THE COX MODEL: AN EMPIRICAL LIKELIHOOD APPROACH

A comparison study of the nonparametric tests based on the empirical distributions

FULL LIKELIHOOD INFERENCES IN THE COX MODEL

arxiv: v1 [math.st] 4 Oct 2018

On the Detection of Heteroscedasticity by Using CUSUM Range Distribution

Latin Hypercube Sampling with Multidimensional Uniformity

Statistic Distribution Models for Some Nonparametric Goodness-of-Fit Tests in Testing Composite Hypotheses

H 2 : otherwise. that is simply the proportion of the sample points below level x. For any fixed point x the law of large numbers gives that

THE PAIR CHART I. Dana Quade. University of North Carolina. Institute of Statistics Mimeo Series No ~.:. July 1967

ON THE MONOTONE LIKELIHOOD RATIO OFA "FAULTY INSPECTION" DISTRIBUTION ABSTRACT

arxiv: v1 [stat.me] 2 Mar 2015

Lecture 3: Central Limit Theorem

PROBABILISTIC METHODS FOR A LINEAR REACTION-HYPERBOLIC SYSTEM WITH CONSTANT COEFFICIENTS

A NOTE ON A DISTRIBUTION OF WEIGHTED SUMS OF I.I.D. RAYLEIGH RANDOM VARIABLES

Misclassification in Logistic Regression with Discrete Covariates

University of California, Berkeley

Lecture 3: Central Limit Theorem

Multivariate Normal-Laplace Distribution and Processes

Continuous Univariate Distributions

FAILURE RATE OF THE INVERSE GAUSSlAN-WEIBULL MIXTURE MODEL

A NOTE ON ROBUST ESTIMATION IN LOGISTIC REGRESSION MODEL

Scaling limit of random planar maps Lecture 2.

Goodness-of-fit tests for randomly censored Weibull distributions with estimated parameters

A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky

A Generalization of Discounted Central Limit Theorem and its Berry-Esseen Analogue

ONE MORE TIME ABOUT R 2 MEASURES OF FIT IN LOGISTIC REGRESSION

High-dimensional test for normality

Consider Table 1 (Note connection to start-stop process).

Estimation of cumulative distribution function with spline functions

Robust covariance estimator for small-sample adjustment in the generalized estimating equations: A simulation study

Asymptotics for posterior hazards

The Laplace Approximation

The Path Integral for Relativistic Worldlines

Chapter DEs with Discontinuous Force Functions

Sensitivity and Asymptotic Error Theory

PQL Estimation Biases in Generalized Linear Mixed Models

Phenomena in high dimensions in geometric analysis, random matrices, and computational geometry Roscoff, France, June 25-29, 2012

arxiv: v1 [stat.ml] 5 Jan 2015

Bayesian Logistic Regression

Because of the special form of an alternating series, there is an simple way to determine that many such series converge:

Financial Econometrics and Quantitative Risk Managenent Return Properties

Investigation of goodness-of-fit test statistic distributions by random censored samples

On the product of independent Generalized Gamma random variables

Optimal SPRT and CUSUM Procedures using Compressed Limit Gauges

Computer simulation on homogeneity testing for weighted data sets used in HEP

11 Survival Analysis and Empirical Likelihood

The Metalog Distributions

PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA

Duration Analysis. Joan Llull

Probabilistic Machine Learning. Industrial AI Lab.

Smooth simultaneous confidence bands for cumulative distribution functions

ANALYSIS OF PANEL DATA MODELS WITH GROUPED OBSERVATIONS. 1. Introduction

Asymptotic Multivariate Kriging Using Estimated Parameters with Bayesian Prediction Methods for Non-linear Predictands

Transform or Link? Harold V. Henderson Statistics Section, Ruakura Agricultural Centre, Hamilton, New Zealand. and

13! (52 13)! 52! =

Reprinted from MONTHLY WEATHER REVIEW, Vol. 109, No. 12, December 1981 American Meteorological Society Printed in I'. S. A.

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

Bayesian Analysis of Risk for Data Mining Based on Empirical Likelihood

An Algorithm for Bayesian Variable Selection in High-dimensional Generalized Linear Models

Testing Goodness Of Fit Of The Geometric Distribution: An Application To Human Fecundability Data

Tail negative dependence and its applications for aggregate loss modeling

Measurement Error in Nonparametric Item Response Curve Estimation

CORRECTING DATA FOR MEASUREMENT ERROR IN GENERALIZED LINEAR MODELS. Leonard A. Stefanski*

On prediction and density estimation Peter McCullagh University of Chicago December 2004

Asymptotic Nonequivalence of Nonparametric Experiments When the Smoothness Index is ½

Some asymptotic expansions and distribution approximations outside a CLT context 1

LOGISTIC REGRESSION Joseph M. Hilbe

Xia Wang and Dipak K. Dey

FINITE MIXTURE DISTRIBUTIONS

PAijpam.eu ADAPTIVE K-S TESTS FOR WHITE NOISE IN THE FREQUENCY DOMAIN Hossein Arsham University of Baltimore Baltimore, MD 21201, USA

Adjusted Empirical Likelihood for Long-memory Time Series Models

10-704: Information Processing and Learning Spring Lecture 8: Feb 5

July 6, Random effect models in nonlinear regression with applications to. bioassay. Mats Rudemo

On the usefulness of wavelet-based simulation of fractional Brownian motion

Dr. Maddah ENMG 617 EM Statistics 10/15/12. Nonparametric Statistics (2) (Goodness of fit tests)

Uniform concentration inequalities, martingales, Rademacher complexity and symmetrization

Least Squares Regression

Miscellanea Kernel density estimation and marginalization consistency

Nonparametric Bayesian Methods - Lecture I

Density Estimation with Replicate Heteroscedastic Measurements

The discrepancy of permutation families

Extreme Value Theory as a Theoretical Background for Power Law Behavior

Empirical Likelihood Methods for Sample Survey Data: An Overview

On the econometrics of the Koyck model

Sums of exponentials of random walks

Least Squares Regression

Supplementary Materials for Residuals and Diagnostics for Ordinal Regression Models: A Surrogate Approach

A THREE-PARAMETER WEIGHTED LINDLEY DISTRIBUTION AND ITS APPLICATIONS TO MODEL SURVIVAL TIME

THE DISTRIBUTION AND THE EXPECTATION OF THE DISTANCE BETWEEN TWO RANDOM POINTS ON DIFFERENT FACES OF A UNIT CUBE IN THREE AND FOUR DIMENSIONS.

ASYMPTOTIC PROPERTIES OF SOME GOODNESS-OF-FIT TESTS BASED ON THE L1-NORM

J2 e-*= (27T)- 1 / 2 f V* 1 '»' 1 *»

A review of some semiparametric regression models with application to scoring

8 Basics of Hypothesis Testing

PROPERTIES OF THE EMPIRICAL CHARACTERISTIC FUNCTION AND ITS APPLICATION TO TESTING FOR INDEPENDENCE. Noboru Murata

BIAS OF MAXIMUM-LIKELIHOOD ESTIMATES IN LOGISTIC AND COX REGRESSION MODELS: A COMPARATIVE SIMULATION STUDY

Transcription:

#1070R A NORMAL SCALE MIXTURE REPRESENTATION OF THE LOGISTIC DISTRIBUTION LEONARD A. STEFANSKI Department of Statistics North Carolina State University Raleigh, NC 27695 ABSTRACT In this paper it is shown that the logistic distribution can be represented as a scale mixture of the standard normal distribution where the mixing density is related to the Kolmogorov-Smirnov distribution. Two derivations of the theorem are presented that give rise to two different representations of the Kolmogorov-Smirnov distribution. The induced identity is of independent interest and is not widely published nor easily derived directly. Key words and phrases. Kolmogorov-Smirnov distribution; logistic distribution; normal scale mixture.

1. INTRODUCTION Recently I encountered a problem in which it was advantageous to approximate the logistic distribution, F(t) = 1/(1 + e- t ), with discrete mixtures having the form where <P k Fk(t) = LPk,i<P(tsk,i)' i=l (k = 1,2,...), (1.1 ) is the standard normal cumulative distribution function. These approximations provide a solution to a problem posed by Cox (1970, p. 110) and are useful in statistical models involving convolutions of a normal distribution with F, such as logistic regression measurement error models and random effects logistic regression models; see, for example, Carroll, Spiegelman, Bailey, Lan and Abbott (1984). Monahan and Stefanski (1989) discuss applications in detail and provide tables of {pk,i' sk,ilr=l that minimize D'k = sup I F(t) - Fk(t) I, t (1.2) for k = 1,2,... The approximation for k = 3 is very good and improves significantly with increasing k. In this paper I prove the following theorem, providing explanation for the quality of the approximations and mathematical justification for the class of approximants in (1.1). THEOREM. Let F and <P denote the standard logistic and normal cumulative distribution functions respectively. Then F(t) =1 <p(t/u)q(u)du, (1.3) where q(u) = (d/du)l(u/2) and L is the Kolmogorov-Smirnov distribution, L(u) = 1-2L(_1)(n+l) exp(_2n 2 ( 2 ). (1.4) 1

In Section 2, two derivations of (1.3) are presented that give rise to two different representations of the Kolmogorov-Smirnov distribution. The induced identity is of independent interest and is not widely published nor easily derived directly. 2. THE LOGISTIC DISTRIBUTION AS A GAUSSIAN SCALE MIXTURE Consider the integral equation (2.1) when f(t) = e- t /(1 + e- t? and </> is the standard normal density. A change-of-variables v = 1/2u 2 in the integral and evaluation at t = VS, s > 0, shows that (2.2) where h(v) = q(1/.,f2;)/v81rv 2 (2.3) It follows from (2.2) that f(vi) is the Laplace transform of h. Note that for t > 0, F(t) = (1 +e-tt 1 = l::~=o(-l)n exp(-nt). Upon differentiation and appeal to symmetry it follows that and thus f(t) = L( _l)(n+l)n exp{-n It I}, (2.4) f(-/8) = L(-l)(n+l)nexp{-n-/8}. Since e- avs is the Laplace transform of ha(t) = aexp{-a 2 /4t}/V41rt 3, it follows from (2.5) (2.5) that {n 2 } h(t) = ~(_1)(n+l) n2 exp --, ~ 2'V1rt 3 4t 2

and from (2.3) that (2.6) Note that in (2.6), q((1) = (d/d(1)l((1/2) where L is given in (1.4). o An alternative method of solving (2.1) leads to an interesting identity that is not easily derived directly. In terms of moment generating functions (2.1) becomes The change-of-variables v = (12/2, evaluation at t = i-js, and a geometric series expansion of 1/{I - exp(-211"-js)} results in the identities 211"-JS exp(-1i"-js) LOO 1 } e-svg(v)dv = ( 2 -JS) = 211" VSexp{ -1I"(2n + 1)JS, O 1-exp - 11" S n=o (2.7) where g(v) = q(~)/~. (2.8) Since -JS exp(-a-js) is the Laplace transform of it follows from (2.7) that _(.!!-.) ha(t) = _ (.!!-.) [aex p {-a 2 /4t}], da da ~411"t3 Using (2.8) and integrating q term-by-term shows that q((1) = (d/d(1)l*((1/2) where L *( ) = V2i ~ {_ 1I"2(2n +1)2} (1 ~exp 8 2. (1 (1 n=o 3 (2.9)

Of course Land L* must be equal thus showing that the right-hand sides of (1.4) and (2.9) are equal. This identity is not easily established directly; see for example, Feller (1948), Smirnov (1948) and Monahan (1989). The alternative representation (2.9) is useful for computing L(er) for small er, Monahan (1989). REFERENCES Carroll, R. J., Spiegelman, C., Bailey, K., Lan, K. K. G. and Abbott, R. D. (1984). On errors-in-variables for binary regression models. Biometrika, 71, 19-26. Cox, D. R. (1970). Binary Regression. Chapman and Hall, London. Feller, W. (1948). On the Kolmogorov-Smirnov limit theorems for empirical distributions. Annals of Mathematical Statistics, 19, 177-189. Monahan, J. F. (1989). On the alternative representation of the Kolmogorov-Smirnov distribution. Preprint. Monahan, J. F. and Stefanski, L. A. (1989). Normal scale mixture approximations to the logistic distribution with applications. Institute of Statistics Mimeograph Series No. 1968. Smirnov, N. (1948). Table for estimating the goodness of fit of empirical distributions. Annals of Mathematical Statistics, 19, 279-281. 4