Randomized Quantile Residuals
|
|
- Cody Hudson
- 5 years ago
- Views:
Transcription
1 Randmized Quantile Residuals Peter K. Dunn and Grdn K. Smyth Department f Mathematics, University f Queensland, Brisbane, Q 47, Australia. 4 April 996 Abstract In this paper we give a general definitin f residuals fr regressin mdels with independent respnses. Our definitin prduces residuals which are eactly nrmal, apart frm sampling variability in the estimated parameters, by inverting the fitted distributin functin fr each respnse value and finding the equivalent standard nrmal quantile. Our definitin includes sme randmizatin t achieve cntinuus residuals when the respnse variable is discrete. Quantile residuals are easily cmputed in cmputer packages such as SAS, S-Plus, GLIM r LispStat, and allw residual analyses t be carried ut in many cmmnly ccurring situatins in which the custmary definitins f residuals fail. Quantile residuals are applied in this paper t three eample data sets. Keywrds: deviance residual; epnential regressin; generalized linear mdel; lgistic regressin; nrmal prbability plt; Pearsn residual. Intrductin Residuals, and especially plts f residuals, play a central rle in the checking f statistical mdels. In nrmal linear regressin the residuals are nrmally distributed and can be standardized t have equal variances. In nn-nrmal regressin situatins, such as lgistic regressin r lg-linear analysis, the residuals, as usually defined, may be s far frm nrmality and frm having equal variances as t be f n practical use. A particular prblem ccurs when the respnse variable is discrete and takes n a small number f distinct values, as fr Pissn data with mean nt far frm zer r binmial data with mean clse t either zer r the number f trials. In such situatins the residuals lie n nearly parallel curves crrespnding t distinct respnse values, and these spurius This preprint is nw published as: Dunn, K. P., and Smyth, G. K. (996). Randmized quantile residuals. J. Cmput. Graph. Statist., 5,
2 curves distract the eye seriusly frm any meaningful message that might be cntained in a residual plt. In this paper we give a general definitin f residuals fr regressin mdels with independent respnses. Our definitin prduces residuals which are eactly nrmal, apart frm sampling variability in the estimated parameters, by inverting the fitted distributin functin at each respnse value and finding the equivalent standard nrmal quantile. This apprach is clsely related t that f C and Snell (968), but whereas C and Snell cncentrate n mean and variance crrectins we cncentrate n the transfrmatin t nrmality. Our definitin includes sme randmizatin t achieve cntinuus residuals when the respnse variable is discrete. Quantile residuals are easily cmputed in cmputer packages such as SAS, S-Plus, GLIM r LispStat, and allw residual analyses t be carried ut in many cmmnly ccurring situatins in which the custmary definitins f residuals fail. Special cases f quantile residuals have been used by Brillinger and Preisler (983) and Brillinger (996). Fr ther wrk n residuals fr nn-nrmal regressin mdels see Pierce and Schafer (986) r McCullagh and Nelder (989) and the references therein. In the discussin at the end f the paper we briefly indicate hw quantile residuals may be etended t mdels with dependent respnses. Pearsn and Deviance Residuals Let y,..., y n be respnses and fr each i let i be a vectr f cvariates. The y i are assumed t be independent and t fllw a distributin P(µ i, φ) where µ i = E(y i ) and φ is a parameter vectr cmmn t all the y i. The µ i are assumed t depend n the i and a vectr f regressin parameters β. We have particularly in mind generalized linear mdels (McCullagh and Nelder, 989) in which the prbability density r mass functin f y i has the frm f(y; θ i, φ) = a(y, φ) ep[{yθ i κ(θ i )}/φ] where a() and κ() are knwn functins and µ i = κ (θ i ). In this mdel we have var(y i ) = φv (µ i ) where V (µ i ) = κ (θ i ). It is custmary t assume that g(µ i ) = T β where g() is a knwn link functin. The parameter φ is the prprtinality cnstant in the meanvariance relatinship and is knwn as the dispersin parameter. In the cntet f generalized linear mdels, tw definitins f residuals have been cmmnly used in practice. The Pearsn residual is defined by r p,i = y i ˆµ i V (ˆµ i ) / where ˆµ i is the fitted value fr µ i. The Pearsn residual has the advantage that its mean and variance are eactly zer and φ respectively, if sampling variability in ˆµ i is small. The deviance residuals are defined in terms f the unit deviances. Fr the abve mdel, let t(y, µ) = yθ κ(θ). Assuming that y is in the dmain f µ, the unit deviance is d(y, µ) = {t(y, y) t(y, µ)}
3 The deviance residual is r d,i = d(y i, ˆµ i ) / sign(y i ˆµ i ) Pierce and Schafer (986) have argued n theretical grunds that the deviance residuals shuld be mre nearly nrmal than the Pearsn. Indeed bth cnverge t nrmality as φ relative t the µ i, the Pearsn residuals at rate O(φ / ) by the Central Limit Therem and the deviance residuals at O(φ) by the saddle-pint apprimatin t f(y; θ i, φ). The Pearsn and deviance residuals cincide and are eactly nrmal, ignring variability in ˆµ i, fr the nrmal linear mdel. The deviance residual is als eactly nrmal when the respnse is inverse-gaussian. In ther cases and fr φ/µ large hwever, neither type f residual can be guaranteed t be clsely nrmal, and the deviance residuals d nt generally have zer means r equal variances even at the true values µ i. 3 Randmized Quantile Residuals Let F (y; µ, φ) be the cumulative distributin functin f P(µ, φ). If F is cntinuus, then the F (y i ; µ i, φ) are unifrmly distributed n the unit interval. In this case, the quantile residuals are defined by r q,i = Φ {F (y i ; ˆµ i, ˆφ)} where Φ() is the cumulative distributin functin f the standard nrmal. Apart frm sampling variability in ˆµ i and ˆφ, the r q,i are eactly standard nrmal. This implies that the distributin f r q,i cnverges t standard nrmal if β and φ are cnsistently estimated. The abve definitin is a special case f C and Snell s (968) crude residuals. Eample: Leukemia data. Feigl and Zelen (965) discuss sme data relating the survival times y i f leukemia patients t their initial white bld cell cunts i and t eistence f AG-factr. Fllwing Feigl and Zelen, we treat the survival times as epnential, y i Ep(µ i ). We wrk with a lg-linear mdel fr the means, including separate intercepts fr the tw AG-factr grups, { α + β lg lg µ i = i AG psitive α + β lg i AG negative C and Snell (968) cnsidered a subset f this data, and defined apprimately epnential crude residuals R i = y i /ˆµ i, where the ˆµ i are the estimated means. In this case the quantile residuals r q,i = Φ { ep(y i /ˆµ i )} are a simple transfrmatin f the R i. A nrmal prbability plt f the quantile residuals cnfirms the assumptin f an epnential distributin. Figure plts the quantile residuals versus the cvariate. The three residuals (cases 7, 3 and 33) in the upper right-hand crner f the plt are relatively separate frm the bdy f the ther residuals, and withut them there appears t be a marked negative trend. While the pattern is nt sufficient t cntradict the mdel assumptins, it raises the pssibility that cases 7, 3 3
4 Figure : Plt f quantile residuals versus the cvariate fr the leukemia data. Circles represent patients which are AG-psitive, crsses AG-negative Quantile Residual Lg White Bld Cell Cunt and 33 may be utliers, r that the dispersin f the residuals increases at the largest white bld cell cunts. In any case, the three cases identified appear frm the residual plt t be jintly influential. Assigning the identified cases zer weight increases ˆβ nearly three-fld, frm -.3 t -.84 cmpared with a standard errr f.4. If F is nt cntinuus, a mre general definitin f quantile residuals is required. Let a i = lim y yi F (y; ˆµ i, ˆφ) and b i = F (y i ; ˆµ i, ˆφ). We define the randmized quantile residual fr y i by r q,i = Φ (u i ) where u i is a unifrm randm variable n the interval (a i, b i ]. Again, the r q,i are eactly standard nrmal, apart frm sampling variability in ˆµ i and ˆφ. The randmizatin strategy emplyed here is similar t the strategy f jittering (Chambers et al, 983) t prevent masses f verlapping pints in plts. Whereas jittering applies a unifrm randm cmpnent t the respnse, ur unifrm randm cmpnent is n the cumulative prbability scale and is tailred t the actual prbability mass at the pint in questin. Our randmizatin is the minimum necessary s that n granularity remains in the resulting residual distributin. Eample : Simulated binmial data. A lgistic linear regressin was used t mdel 6 binmial bservatins with binmial denminatr n = 3, i.e., the respnses were assumed t be independently distributed as y i bin(n, p i ), with n = 3 and lgit(p i ) = β + β i were i is a cvariate. The first plt f Figure displays the deviance residuals versus 4
5 Figure : Deviance and quantile residuals versus the cvariate frm a lgistic regressin. The respnse is simulated bin(3, p) with lgit p depending quadratically n the cvariate. 3 3 Deviance Residuals - - Quantile Residuals Cvariate Cvariate the cvariate. The pints in this plt lie n fur parallel curves crrespnding t the fur pssible values fr the respnse. The curves make it difficult t see any ther pattern in the data. The secnd plt displays the quantile residuals versus the cvariate. In this plt is clear that the residuals fllw a quadratic pattern. The data fr this eample was in fact cmputer generated with lgit(p i ) depending quadratically n the i. Figure 3 shws the residual plts nce the quadratic term has been included in the regressin. The deviance residuals lie n prminent curves while the quantile residuals nw shw randm scatter. Eample 3: Fathers and sns ccupatins. Brwn (974) and Ktze and Hawkins (984) analyze a sparse 4 4 cntingency table shwing the crss-classificatin f ccupatins f fathers (rws) by ccupatins f sns (clumns). The data was riginally published by Pearsn (94) and appears als in Hand et al (994). Brwn, Ktze and Hawkins were interested in identifying thse cells which are utliers relative t the independence mdel. We take a similar apprach, with the difference that the quantile residual apprach allws us t lk fr utliers relative t a mre realistic mdel. Observing that there is an apriri epectatin that sns will be influenced by their father s ccupatin, we fit a lg-linear Pissn regressin mdel t the cunts with rw and clumn effects and with an effect fr equality f father s and sn s ccupatin, i.e., y ij Pis(µ ij ), with lg µ ij = µ + α i + β j + δ ij () and ij = if i = j and therwise. Figure 4 is a nrmal prbability plt f quantile residuals frm this mdel. The largest psitive residual crrespnds t the (,) cell: sns almst always cntinue t wrk in the Arts if their father did. Figure 4 shws evidence f large negative residuals as well as large psitive residuals. Althugh nne f the negative residuals are individually significant, and the actual cntingency table cells represented in the left tail f the prbability plt varies with each realizatin f the quantile residuals, the 5
6 Figure 3: Deviance and quantile residuals versus the cvariate fr a well fitting lgistic regressin. Deviance Residuals Quantile Residuals Cvariate Cvariate Figure 4: Nrmal prbability plt with identity line f the quantile residuals frm the fathers and sns ccupatin data. Sample Deviates Nrmal Deviates 6
7 verall pattern is preserved acrss realizatins. The quantile residual plt shws in this way that there are t many small cunts in the cntingency table t be cmpatible with the abve mdel. N ther methd which has been applied t this data in the literature is able t shw this aspect f the data. Althugh Figure 4 shws clear evidence f lack f fit, the mdel () and the mdels which arise frm it by deleting selected cells des give an appreciably better fit t this data than the independence mdels cnsidered by earlier authrs. 4 Discussin and Etensins In this paper quantile residuals are cmputed by finding the equivalent standard nrmal deviate fr each respnse bservatin. In principle, any reference distributin culd have been chsen fr the residuals. C and Snell (968) fr eample cmputed epnential residuals fr data f Eample and Brillinger and Preisler (983) and Brillinger (996) use unifrm residuals. Hwever asymmetry seems an unnecessary cmplicatin, and bunded distributins intrduce a spurius pattern (the bundary itself) and make it difficult t distinguish between large residuals and utright utliers. The nrmal distributin is recmmended in this paper n the basis that nrmal variatin is that which mst peple have practice interpreting graphically. Randmizatin is used t prduce cntinuusly distributed residuals when the respnse is discrete r has a discrete cmpnent. This means that the quantile residuals will vary frm ne realizatin t anther fr a given data set and fitted mdel. Fr the sake f brevity, we have given nly ne realizatin f the quantile residuals fr each eample in this paper. In practice thugh we have fund it useful t rutinely plt fur realizatins f the quantile residuals. Any pattern in the residuals which is nt cnsistent acrss the realizatins is then ignred. The idea f applying a cntinuus randm cmpnent t discrete respnses s that methds fr cntinuus variables can be applied is in fact very ld. See Pearsn (95) fr a discussin. As used in this paper, randmizatin is a device thrugh which the aggregate pattern f the residuals becmes apparent. Since decisins d nt depend n individual realizatins, the bvius bjectins t randmizatin which arise in the cntet f tests and cnfidence intervals d nt seem t apply. Quantile residuals can generalize any f the usual diagnstic methds which use residuals. Fr eample, an added variable plt (Ck and Weisberg, 98) culd be cmputed fr a generalized linear mdel by pltting the quantile residuals, fr the mdel ecluding, versus a, where a is adjusted fr the ther cvariates in the mdel. The vectr a wuld be chsen t be rthgnal t the ther cvariates, relative t the cvariance matri f the y i. It might be cmputed as the residuals frm weighted least squares regressin f n the ther cvariates, using as weights the wrking weights frm the generalized linear mdel. Independence f the respnse bservatins was assumed in this paper. The methd f quantile residuals can be etended t dependent data situatins by epressing the multivariate likelihd as a sum f univariate cnditinal likelihds. Fr eample we might define the ith cnditinal quantile residual frm the cnditinal distributin f y i 7
8 given y,..., y i instead f frm the marginal distributin f y i as in the paper. This wuld prvide independent, standard nrmal residuals. Finally we cnsider the sampling variability f the ˆµ i, which has fr simplicity been ignred thrughut this paper. Treating the ˆµ i as fied is apprpriate when gd infrmatin is available n the mdel parameters, but may be unrealistic fr eample fr designed eperiments in which the number f parameters is nt small cmpared t the number f bservatins. In nrmal linear mdels, REML estimatin f the variance structure is btained frm the marginal distributin f any set f zer mean cntrasts, Z T y say. In a similar way, independent and identically distributed residuals culd be btained by transfrming frm the y i t any rthnrmal set f zer mean cnstrasts. Etending this idea t nn-nrmal regressin is mre difficult, but culd in principle be dne using the cnditinal apprach f Smyth and Verbyla (995). In that paper, Smyth and Verbyla argue that REML estimatin fr generalized linear mdels shuld prceed by cnsidering the cnditinal distributin f the y i given ˆβ. Independent quantile residuals culd therefre be defined by cnsidering the cnditinal distributin f each y i given y,..., y i and ˆβ. Fr certain values f i this distributin wuld be degenerate; these values culd be ignred withut lss f infrmatin. References Brillinger, D. R. and Preisler, H. K. (983). Maimum likelihd estimatin in a latent variable prblem. In S. Karlin, T. Amemiya and L. A. Gdman, eds., Studies in Ecnmetrics, Time Series and Multivariate Statistics, Academic, New Yrk, pp Brillinger, D. R. (996). An analysis f an rdinal-valued time series. In P. M. Rbinsn and M. Rsenblatt, eds., Papers in Time Series Analysis: A Memrial Vlume t Edward J Hannan. Athens Cnference Vlume. Lecture Ntes in Statistics, Vlume 5, Springer, New Yrk. T appear. Brwn, M. B. (974). Identificatin f the surces f significance in tw-way cntingency tables. Appl. Statist., 3, Chambers, J. M., Cleveland, W. S., Kleiner, B., and Tukey, P. A. (983). Methds fr Data Analysis, Wadswrth, Belmnt, Califrnia. Graphical Ck, R. D. and Weisberg, S. (98). Residuals and Influence in Regressin. Chapman and Hall, New Yrk. C, D. R. and Snell, E. J. (968). A general definitin f residuals (with discussin). J. R. Statist. Sc., B, 3, Feigl, P. and Zelen, M. (965). Estimatin f epnential survival prbabilities with cncmitant bservatin. Bimetrics,,
9 Hand, D. J., Daly, F., Lunn, A. D., McCnway, K. J. and Ostrwski, E. (994). Handbk f Small Data Sets. Chapman & Hall, Lndn. Ktze, T. J. v W. and Hawkins, D. M. (984). The identificatin f utliers in tw-way cntingency tables using subtables. Appl. Statist., 33, 5 3. McCullagh, P. and Nelder, J.A. (989). Generalized linear mdels, nd ed. Chapman and Hall: Lndn. Pearsn, E. S. (95). On questins raised by the cmbinatin f tests based n discntinuus distributins. Bimetrika, 37, Pearsn, K. (94). On the thery f cntingency and its relatin t assciatin and nrmal crrelatin. Reprinted in 948 in Karl Pearsn s Early Statistical Papers, Cambridge University Press, Cambridge, Pierce, D. A., and Schafer, D. W. (986). Residuals in generalized linear mdels. J. Amer. Statist. Ass., 8, Smyth, G. K. and Verbyla, A. P. (996). A cnditinal apprach t REML in generalized linear mdels. J. Ry. Statist. Sc. B, 58,
, which yields. where z1. and z2
The Gaussian r Nrmal PDF, Page 1 The Gaussian r Nrmal Prbability Density Functin Authr: Jhn M Cimbala, Penn State University Latest revisin: 11 September 13 The Gaussian r Nrmal Prbability Density Functin
More informationA Matrix Representation of Panel Data
web Extensin 6 Appendix 6.A A Matrix Representatin f Panel Data Panel data mdels cme in tw brad varieties, distinct intercept DGPs and errr cmpnent DGPs. his appendix presents matrix algebra representatins
More information4th Indian Institute of Astrophysics - PennState Astrostatistics School July, 2013 Vainu Bappu Observatory, Kavalur. Correlation and Regression
4th Indian Institute f Astrphysics - PennState Astrstatistics Schl July, 2013 Vainu Bappu Observatry, Kavalur Crrelatin and Regressin Rahul Ry Indian Statistical Institute, Delhi. Crrelatin Cnsider a tw
More informationPattern Recognition 2014 Support Vector Machines
Pattern Recgnitin 2014 Supprt Vectr Machines Ad Feelders Universiteit Utrecht Ad Feelders ( Universiteit Utrecht ) Pattern Recgnitin 1 / 55 Overview 1 Separable Case 2 Kernel Functins 3 Allwing Errrs (Sft
More informationCOMP 551 Applied Machine Learning Lecture 5: Generative models for linear classification
COMP 551 Applied Machine Learning Lecture 5: Generative mdels fr linear classificatin Instructr: Herke van Hf (herke.vanhf@mail.mcgill.ca) Slides mstly by: Jelle Pineau Class web page: www.cs.mcgill.ca/~hvanh2/cmp551
More informationComparing Several Means: ANOVA. Group Means and Grand Mean
STAT 511 ANOVA and Regressin 1 Cmparing Several Means: ANOVA Slide 1 Blue Lake snap beans were grwn in 12 pen-tp chambers which are subject t 4 treatments 3 each with O 3 and SO 2 present/absent. The ttal
More informationResampling Methods. Cross-validation, Bootstrapping. Marek Petrik 2/21/2017
Resampling Methds Crss-validatin, Btstrapping Marek Petrik 2/21/2017 Sme f the figures in this presentatin are taken frm An Intrductin t Statistical Learning, with applicatins in R (Springer, 2013) with
More informationInternal vs. external validity. External validity. This section is based on Stock and Watson s Chapter 9.
Sectin 7 Mdel Assessment This sectin is based n Stck and Watsn s Chapter 9. Internal vs. external validity Internal validity refers t whether the analysis is valid fr the ppulatin and sample being studied.
More informationCAUSAL INFERENCE. Technical Track Session I. Phillippe Leite. The World Bank
CAUSAL INFERENCE Technical Track Sessin I Phillippe Leite The Wrld Bank These slides were develped by Christel Vermeersch and mdified by Phillippe Leite fr the purpse f this wrkshp Plicy questins are causal
More informationResampling Methods. Chapter 5. Chapter 5 1 / 52
Resampling Methds Chapter 5 Chapter 5 1 / 52 1 51 Validatin set apprach 2 52 Crss validatin 3 53 Btstrap Chapter 5 2 / 52 Abut Resampling An imprtant statistical tl Pretending the data as ppulatin and
More informationPSU GISPOPSCI June 2011 Ordinary Least Squares & Spatial Linear Regression in GeoDa
There are tw parts t this lab. The first is intended t demnstrate hw t request and interpret the spatial diagnstics f a standard OLS regressin mdel using GeDa. The diagnstics prvide infrmatin abut the
More informationDistributions, spatial statistics and a Bayesian perspective
Distributins, spatial statistics and a Bayesian perspective Dug Nychka Natinal Center fr Atmspheric Research Distributins and densities Cnditinal distributins and Bayes Thm Bivariate nrmal Spatial statistics
More informationAP Statistics Notes Unit Two: The Normal Distributions
AP Statistics Ntes Unit Tw: The Nrmal Distributins Syllabus Objectives: 1.5 The student will summarize distributins f data measuring the psitin using quartiles, percentiles, and standardized scres (z-scres).
More informationBootstrap Method > # Purpose: understand how bootstrap method works > obs=c(11.96, 5.03, 67.40, 16.07, 31.50, 7.73, 11.10, 22.38) > n=length(obs) >
Btstrap Methd > # Purpse: understand hw btstrap methd wrks > bs=c(11.96, 5.03, 67.40, 16.07, 31.50, 7.73, 11.10, 22.38) > n=length(bs) > mean(bs) [1] 21.64625 > # estimate f lambda > lambda = 1/mean(bs);
More informationx 1 Outline IAML: Logistic Regression Decision Boundaries Example Data
Outline IAML: Lgistic Regressin Charles Suttn and Victr Lavrenk Schl f Infrmatics Semester Lgistic functin Lgistic regressin Learning lgistic regressin Optimizatin The pwer f nn-linear basis functins Least-squares
More informationCHAPTER 4 DIAGNOSTICS FOR INFLUENTIAL OBSERVATIONS
CHAPTER 4 DIAGNOSTICS FOR INFLUENTIAL OBSERVATIONS 1 Influential bservatins are bservatins whse presence in the data can have a distrting effect n the parameter estimates and pssibly the entire analysis,
More informationLHS Mathematics Department Honors Pre-Calculus Final Exam 2002 Answers
LHS Mathematics Department Hnrs Pre-alculus Final Eam nswers Part Shrt Prblems The table at the right gives the ppulatin f Massachusetts ver the past several decades Using an epnential mdel, predict the
More informationPressure And Entropy Variations Across The Weak Shock Wave Due To Viscosity Effects
Pressure And Entrpy Variatins Acrss The Weak Shck Wave Due T Viscsity Effects OSTAFA A. A. AHOUD Department f athematics Faculty f Science Benha University 13518 Benha EGYPT Abstract:-The nnlinear differential
More informationHypothesis Tests for One Population Mean
Hypthesis Tests fr One Ppulatin Mean Chapter 9 Ala Abdelbaki Objective Objective: T estimate the value f ne ppulatin mean Inferential statistics using statistics in rder t estimate parameters We will be
More informationSUPPLEMENTARY MATERIAL GaGa: a simple and flexible hierarchical model for microarray data analysis
SUPPLEMENTARY MATERIAL GaGa: a simple and flexible hierarchical mdel fr micrarray data analysis David Rssell Department f Bistatistics M.D. Andersn Cancer Center, Hustn, TX 77030, USA rsselldavid@gmail.cm
More informationDetermining the Accuracy of Modal Parameter Estimation Methods
Determining the Accuracy f Mdal Parameter Estimatin Methds by Michael Lee Ph.D., P.E. & Mar Richardsn Ph.D. Structural Measurement Systems Milpitas, CA Abstract The mst cmmn type f mdal testing system
More informationThe general linear model and Statistical Parametric Mapping I: Introduction to the GLM
The general linear mdel and Statistical Parametric Mapping I: Intrductin t the GLM Alexa Mrcm and Stefan Kiebel, Rik Hensn, Andrew Hlmes & J-B J Pline Overview Intrductin Essential cncepts Mdelling Design
More informationComputational modeling techniques
Cmputatinal mdeling techniques Lecture 4: Mdel checing fr ODE mdels In Petre Department f IT, Åb Aademi http://www.users.ab.fi/ipetre/cmpmd/ Cntent Stichimetric matrix Calculating the mass cnservatin relatins
More informationDifferentiation Applications 1: Related Rates
Differentiatin Applicatins 1: Related Rates 151 Differentiatin Applicatins 1: Related Rates Mdel 1: Sliding Ladder 10 ladder y 10 ladder 10 ladder A 10 ft ladder is leaning against a wall when the bttm
More informationModelling of Clock Behaviour. Don Percival. Applied Physics Laboratory University of Washington Seattle, Washington, USA
Mdelling f Clck Behaviur Dn Percival Applied Physics Labratry University f Washingtn Seattle, Washingtn, USA verheads and paper fr talk available at http://faculty.washingtn.edu/dbp/talks.html 1 Overview
More informationINSTRUMENTAL VARIABLES
INSTRUMENTAL VARIABLES Technical Track Sessin IV Sergi Urzua University f Maryland Instrumental Variables and IE Tw main uses f IV in impact evaluatin: 1. Crrect fr difference between assignment f treatment
More informationSIZE BIAS IN LINE TRANSECT SAMPLING: A FIELD TEST. Mark C. Otto Statistics Research Division, Bureau of the Census Washington, D.C , U.S.A.
SIZE BIAS IN LINE TRANSECT SAMPLING: A FIELD TEST Mark C. Ott Statistics Research Divisin, Bureau f the Census Washingtn, D.C. 20233, U.S.A. and Kenneth H. Pllck Department f Statistics, Nrth Carlina State
More informationCHAPTER 24: INFERENCE IN REGRESSION. Chapter 24: Make inferences about the population from which the sample data came.
MATH 1342 Ch. 24 April 25 and 27, 2013 Page 1 f 5 CHAPTER 24: INFERENCE IN REGRESSION Chapters 4 and 5: Relatinships between tw quantitative variables. Be able t Make a graph (scatterplt) Summarize the
More informationIN a recent article, Geary [1972] discussed the merit of taking first differences
The Efficiency f Taking First Differences in Regressin Analysis: A Nte J. A. TILLMAN IN a recent article, Geary [1972] discussed the merit f taking first differences t deal with the prblems that trends
More informationFebruary 28, 2013 COMMENTS ON DIFFUSION, DIFFUSIVITY AND DERIVATION OF HYPERBOLIC EQUATIONS DESCRIBING THE DIFFUSION PHENOMENA
February 28, 2013 COMMENTS ON DIFFUSION, DIFFUSIVITY AND DERIVATION OF HYPERBOLIC EQUATIONS DESCRIBING THE DIFFUSION PHENOMENA Mental Experiment regarding 1D randm walk Cnsider a cntainer f gas in thermal
More informationRevision: August 19, E Main Suite D Pullman, WA (509) Voice and Fax
.7.4: Direct frequency dmain circuit analysis Revisin: August 9, 00 5 E Main Suite D Pullman, WA 9963 (509) 334 6306 ice and Fax Overview n chapter.7., we determined the steadystate respnse f electrical
More informationMATCHING TECHNIQUES. Technical Track Session VI. Emanuela Galasso. The World Bank
MATCHING TECHNIQUES Technical Track Sessin VI Emanuela Galass The Wrld Bank These slides were develped by Christel Vermeersch and mdified by Emanuela Galass fr the purpse f this wrkshp When can we use
More informationThe blessing of dimensionality for kernel methods
fr kernel methds Building classifiers in high dimensinal space Pierre Dupnt Pierre.Dupnt@ucluvain.be Classifiers define decisin surfaces in sme feature space where the data is either initially represented
More information3.4 Shrinkage Methods Prostate Cancer Data Example (Continued) Ridge Regression
3.3.4 Prstate Cancer Data Example (Cntinued) 3.4 Shrinkage Methds 61 Table 3.3 shws the cefficients frm a number f different selectin and shrinkage methds. They are best-subset selectin using an all-subsets
More informationTree Structured Classifier
Tree Structured Classifier Reference: Classificatin and Regressin Trees by L. Breiman, J. H. Friedman, R. A. Olshen, and C. J. Stne, Chapman & Hall, 98. A Medical Eample (CART): Predict high risk patients
More informationLim f (x) e. Find the largest possible domain and its discontinuity points. Why is it discontinuous at those points (if any)?
THESE ARE SAMPLE QUESTIONS FOR EACH OF THE STUDENT LEARNING OUTCOMES (SLO) SET FOR THIS COURSE. SLO 1: Understand and use the cncept f the limit f a functin i. Use prperties f limits and ther techniques,
More informationENSC Discrete Time Systems. Project Outline. Semester
ENSC 49 - iscrete Time Systems Prject Outline Semester 006-1. Objectives The gal f the prject is t design a channel fading simulatr. Upn successful cmpletin f the prject, yu will reinfrce yur understanding
More informationMath Foundations 20 Work Plan
Math Fundatins 20 Wrk Plan Units / Tpics 20.8 Demnstrate understanding f systems f linear inequalities in tw variables. Time Frame December 1-3 weeks 6-10 Majr Learning Indicatrs Identify situatins relevant
More informationAP Statistics Practice Test Unit Three Exploring Relationships Between Variables. Name Period Date
AP Statistics Practice Test Unit Three Explring Relatinships Between Variables Name Perid Date True r False: 1. Crrelatin and regressin require explanatry and respnse variables. 1. 2. Every least squares
More informationKinetic Model Completeness
5.68J/10.652J Spring 2003 Lecture Ntes Tuesday April 15, 2003 Kinetic Mdel Cmpleteness We say a chemical kinetic mdel is cmplete fr a particular reactin cnditin when it cntains all the species and reactins
More informationOF SIMPLY SUPPORTED PLYWOOD PLATES UNDER COMBINED EDGEWISE BENDING AND COMPRESSION
U. S. FOREST SERVICE RESEARCH PAPER FPL 50 DECEMBER U. S. DEPARTMENT OF AGRICULTURE FOREST SERVICE FOREST PRODUCTS LABORATORY OF SIMPLY SUPPORTED PLYWOOD PLATES UNDER COMBINED EDGEWISE BENDING AND COMPRESSION
More informationENGI 4430 Parametric Vector Functions Page 2-01
ENGI 4430 Parametric Vectr Functins Page -01. Parametric Vectr Functins (cntinued) Any nn-zer vectr r can be decmpsed int its magnitude r and its directin: r rrˆ, where r r 0 Tangent Vectr: dx dy dz dr
More informationMATCHING TECHNIQUES Technical Track Session VI Céline Ferré The World Bank
MATCHING TECHNIQUES Technical Track Sessin VI Céline Ferré The Wrld Bank When can we use matching? What if the assignment t the treatment is nt dne randmly r based n an eligibility index, but n the basis
More informationCOMP 551 Applied Machine Learning Lecture 9: Support Vector Machines (cont d)
COMP 551 Applied Machine Learning Lecture 9: Supprt Vectr Machines (cnt d) Instructr: Herke van Hf (herke.vanhf@mail.mcgill.ca) Slides mstly by: Class web page: www.cs.mcgill.ca/~hvanh2/cmp551 Unless therwise
More informationSimple Linear Regression (single variable)
Simple Linear Regressin (single variable) Intrductin t Machine Learning Marek Petrik January 31, 2017 Sme f the figures in this presentatin are taken frm An Intrductin t Statistical Learning, with applicatins
More informationOn Huntsberger Type Shrinkage Estimator for the Mean of Normal Distribution ABSTRACT INTRODUCTION
Malaysian Jurnal f Mathematical Sciences 4(): 7-4 () On Huntsberger Type Shrinkage Estimatr fr the Mean f Nrmal Distributin Department f Mathematical and Physical Sciences, University f Nizwa, Sultanate
More informationCOMP 551 Applied Machine Learning Lecture 11: Support Vector Machines
COMP 551 Applied Machine Learning Lecture 11: Supprt Vectr Machines Instructr: (jpineau@cs.mcgill.ca) Class web page: www.cs.mcgill.ca/~jpineau/cmp551 Unless therwise nted, all material psted fr this curse
More informationmaking triangle (ie same reference angle) ). This is a standard form that will allow us all to have the X= y=
Intrductin t Vectrs I 21 Intrductin t Vectrs I 22 I. Determine the hrizntal and vertical cmpnents f the resultant vectr by cunting n the grid. X= y= J. Draw a mangle with hrizntal and vertical cmpnents
More informationHomology groups of disks with holes
Hmlgy grups f disks with hles THEOREM. Let p 1,, p k } be a sequence f distinct pints in the interir unit disk D n where n 2, and suppse that fr all j the sets E j Int D n are clsed, pairwise disjint subdisks.
More informationLyapunov Stability Stability of Equilibrium Points
Lyapunv Stability Stability f Equilibrium Pints 1. Stability f Equilibrium Pints - Definitins In this sectin we cnsider n-th rder nnlinear time varying cntinuus time (C) systems f the frm x = f ( t, x),
More information22.54 Neutron Interactions and Applications (Spring 2004) Chapter 11 (3/11/04) Neutron Diffusion
.54 Neutrn Interactins and Applicatins (Spring 004) Chapter (3//04) Neutrn Diffusin References -- J. R. Lamarsh, Intrductin t Nuclear Reactr Thery (Addisn-Wesley, Reading, 966) T study neutrn diffusin
More informationModule 4: General Formulation of Electric Circuit Theory
Mdule 4: General Frmulatin f Electric Circuit Thery 4. General Frmulatin f Electric Circuit Thery All electrmagnetic phenmena are described at a fundamental level by Maxwell's equatins and the assciated
More informationMidwest Big Data Summer School: Machine Learning I: Introduction. Kris De Brabanter
Midwest Big Data Summer Schl: Machine Learning I: Intrductin Kris De Brabanter kbrabant@iastate.edu Iwa State University Department f Statistics Department f Cmputer Science June 24, 2016 1/24 Outline
More informationModule 3: Gaussian Process Parameter Estimation, Prediction Uncertainty, and Diagnostics
Mdule 3: Gaussian Prcess Parameter Estimatin, Predictin Uncertainty, and Diagnstics Jerme Sacks and William J Welch Natinal Institute f Statistical Sciences and University f British Clumbia Adapted frm
More informationUNIV1"'RSITY OF NORTH CAROLINA Department of Statistics Chapel Hill, N. C. CUMULATIVE SUM CONTROL CHARTS FOR THE FOLDED NORMAL DISTRIBUTION
UNIV1"'RSITY OF NORTH CAROLINA Department f Statistics Chapel Hill, N. C. CUMULATIVE SUM CONTROL CHARTS FOR THE FOLDED NORMAL DISTRIBUTION by N. L. Jlmsn December 1962 Grant N. AFOSR -62..148 Methds f
More informationBiplots in Practice MICHAEL GREENACRE. Professor of Statistics at the Pompeu Fabra University. Chapter 13 Offprint
Biplts in Practice MICHAEL GREENACRE Prfessr f Statistics at the Pmpeu Fabra University Chapter 13 Offprint CASE STUDY BIOMEDICINE Cmparing Cancer Types Accrding t Gene Epressin Arrays First published:
More informationInference in the Multiple-Regression
Sectin 5 Mdel Inference in the Multiple-Regressin Kinds f hypthesis tests in a multiple regressin There are several distinct kinds f hypthesis tests we can run in a multiple regressin. Suppse that amng
More informationMODULE 1. e x + c. [You can t separate a demominator, but you can divide a single denominator into each numerator term] a + b a(a + b)+1 = a + b
. REVIEW OF SOME BASIC ALGEBRA MODULE () Slving Equatins Yu shuld be able t slve fr x: a + b = c a d + e x + c and get x = e(ba +) b(c a) d(ba +) c Cmmn mistakes and strategies:. a b + c a b + a c, but
More informationCHAPTER 3 INEQUALITIES. Copyright -The Institute of Chartered Accountants of India
CHAPTER 3 INEQUALITIES Cpyright -The Institute f Chartered Accuntants f India INEQUALITIES LEARNING OBJECTIVES One f the widely used decisin making prblems, nwadays, is t decide n the ptimal mix f scarce
More informationEquilibrium of Stress
Equilibrium f Stress Cnsider tw perpendicular planes passing thrugh a pint p. The stress cmpnents acting n these planes are as shwn in ig. 3.4.1a. These stresses are usuall shwn tgether acting n a small
More informationThis section is primarily focused on tools to aid us in finding roots/zeros/ -intercepts of polynomials. Essentially, our focus turns to solving.
Sectin 3.2: Many f yu WILL need t watch the crrespnding vides fr this sectin n MyOpenMath! This sectin is primarily fcused n tls t aid us in finding rts/zers/ -intercepts f plynmials. Essentially, ur fcus
More informationPreparation work for A2 Mathematics [2017]
Preparatin wrk fr A2 Mathematics [2017] The wrk studied in Y12 after the return frm study leave is frm the Cre 3 mdule f the A2 Mathematics curse. This wrk will nly be reviewed during Year 13, it will
More informationSupport-Vector Machines
Supprt-Vectr Machines Intrductin Supprt vectr machine is a linear machine with sme very nice prperties. Haykin chapter 6. See Alpaydin chapter 13 fr similar cntent. Nte: Part f this lecture drew material
More informationLecture 17: Free Energy of Multi-phase Solutions at Equilibrium
Lecture 17: 11.07.05 Free Energy f Multi-phase Slutins at Equilibrium Tday: LAST TIME...2 FREE ENERGY DIAGRAMS OF MULTI-PHASE SOLUTIONS 1...3 The cmmn tangent cnstructin and the lever rule...3 Practical
More informationLecture 2: Supervised vs. unsupervised learning, bias-variance tradeoff
Lecture 2: Supervised vs. unsupervised learning, bias-variance tradeff Reading: Chapter 2 STATS 202: Data mining and analysis September 27, 2017 1 / 20 Supervised vs. unsupervised learning In unsupervised
More informationParticle Size Distributions from SANS Data Using the Maximum Entropy Method. By J. A. POTTON, G. J. DANIELL AND B. D. RAINFORD
3 J. Appl. Cryst. (1988). 21,3-8 Particle Size Distributins frm SANS Data Using the Maximum Entrpy Methd By J. A. PTTN, G. J. DANIELL AND B. D. RAINFRD Physics Department, The University, Suthamptn S9
More informationPerfrmance f Sensitizing Rules n Shewhart Cntrl Charts with Autcrrelated Data Key Wrds: Autregressive, Mving Average, Runs Tests, Shewhart Cntrl Chart
Perfrmance f Sensitizing Rules n Shewhart Cntrl Charts with Autcrrelated Data Sandy D. Balkin Dennis K. J. Lin y Pennsylvania State University, University Park, PA 16802 Sandy Balkin is a graduate student
More informationKinematic transformation of mechanical behavior Neville Hogan
inematic transfrmatin f mechanical behavir Neville Hgan Generalized crdinates are fundamental If we assume that a linkage may accurately be described as a cllectin f linked rigid bdies, their generalized
More informationA New Evaluation Measure. J. Joiner and L. Werner. The problems of evaluation and the needed criteria of evaluation
III-l III. A New Evaluatin Measure J. Jiner and L. Werner Abstract The prblems f evaluatin and the needed criteria f evaluatin measures in the SMART system f infrmatin retrieval are reviewed and discussed.
More informationMath 105: Review for Exam I - Solutions
1. Let f(x) = 3 + x + 5. Math 105: Review fr Exam I - Slutins (a) What is the natural dmain f f? [ 5, ), which means all reals greater than r equal t 5 (b) What is the range f f? [3, ), which means all
More information(1.1) V which contains charges. If a charge density ρ, is defined as the limit of the ratio of the charge contained. 0, and if a force density f
1.0 Review f Electrmagnetic Field Thery Selected aspects f electrmagnetic thery are reviewed in this sectin, with emphasis n cncepts which are useful in understanding magnet design. Detailed, rigrus treatments
More informationLecture 2: Supervised vs. unsupervised learning, bias-variance tradeoff
Lecture 2: Supervised vs. unsupervised learning, bias-variance tradeff Reading: Chapter 2 STATS 202: Data mining and analysis September 27, 2017 1 / 20 Supervised vs. unsupervised learning In unsupervised
More informationSAMPLING DYNAMICAL SYSTEMS
SAMPLING DYNAMICAL SYSTEMS Melvin J. Hinich Applied Research Labratries The University f Texas at Austin Austin, TX 78713-8029, USA (512) 835-3278 (Vice) 835-3259 (Fax) hinich@mail.la.utexas.edu ABSTRACT
More informationPart 3 Introduction to statistical classification techniques
Part 3 Intrductin t statistical classificatin techniques Machine Learning, Part 3, March 07 Fabi Rli Preamble ØIn Part we have seen that if we knw: Psterir prbabilities P(ω i / ) Or the equivalent terms
More informationAdmissibility Conditions and Asymptotic Behavior of Strongly Regular Graphs
Admissibility Cnditins and Asympttic Behavir f Strngly Regular Graphs VASCO MOÇO MANO Department f Mathematics University f Prt Oprt PORTUGAL vascmcman@gmailcm LUÍS ANTÓNIO DE ALMEIDA VIEIRA Department
More informationT Algorithmic methods for data mining. Slide set 6: dimensionality reduction
T-61.5060 Algrithmic methds fr data mining Slide set 6: dimensinality reductin reading assignment LRU bk: 11.1 11.3 PCA tutrial in mycurses (ptinal) ptinal: An Elementary Prf f a Therem f Jhnsn and Lindenstrauss,
More informationHow do scientists measure trees? What is DBH?
Hw d scientists measure trees? What is DBH? Purpse Students develp an understanding f tree size and hw scientists measure trees. Students bserve and measure tree ckies and explre the relatinship between
More informationLocalized Model Selection for Regression
Lcalized Mdel Selectin fr Regressin Yuhng Yang Schl f Statistics University f Minnesta Church Street S.E. Minneaplis, MN 5555 May 7, 007 Abstract Research n mdel/prcedure selectin has fcused n selecting
More informationMath 10 - Exam 1 Topics
Math 10 - Exam 1 Tpics Types and Levels f data Categrical, Discrete r Cntinuus Nminal, Ordinal, Interval r Rati Descriptive Statistics Stem and Leaf Graph Dt Plt (Interpret) Gruped Data Relative and Cumulative
More informationPure adaptive search for finite global optimization*
Mathematical Prgramming 69 (1995) 443-448 Pure adaptive search fr finite glbal ptimizatin* Z.B. Zabinskya.*, G.R. Wd b, M.A. Steel c, W.P. Baritmpa c a Industrial Engineering Prgram, FU-20. University
More informationThermodynamics and Equilibrium
Thermdynamics and Equilibrium Thermdynamics Thermdynamics is the study f the relatinship between heat and ther frms f energy in a chemical r physical prcess. We intrduced the thermdynamic prperty f enthalpy,
More informationPreparation work for A2 Mathematics [2018]
Preparatin wrk fr A Mathematics [018] The wrk studied in Y1 will frm the fundatins n which will build upn in Year 13. It will nly be reviewed during Year 13, it will nt be retaught. This is t allw time
More informationWhat is Statistical Learning?
What is Statistical Learning? Sales 5 10 15 20 25 Sales 5 10 15 20 25 Sales 5 10 15 20 25 0 50 100 200 300 TV 0 10 20 30 40 50 Radi 0 20 40 60 80 100 Newspaper Shwn are Sales vs TV, Radi and Newspaper,
More informationBuilding to Transformations on Coordinate Axis Grade 5: Geometry Graph points on the coordinate plane to solve real-world and mathematical problems.
Building t Transfrmatins n Crdinate Axis Grade 5: Gemetry Graph pints n the crdinate plane t slve real-wrld and mathematical prblems. 5.G.1. Use a pair f perpendicular number lines, called axes, t define
More informationMODULE FOUR. This module addresses functions. SC Academic Elementary Algebra Standards:
MODULE FOUR This mdule addresses functins SC Academic Standards: EA-3.1 Classify a relatinship as being either a functin r nt a functin when given data as a table, set f rdered pairs, r graph. EA-3.2 Use
More informationLinear Classification
Linear Classificatin CS 54: Machine Learning Slides adapted frm Lee Cper, Jydeep Ghsh, and Sham Kakade Review: Linear Regressin CS 54 [Spring 07] - H Regressin Given an input vectr x T = (x, x,, xp), we
More informationPhysics 2B Chapter 23 Notes - Faraday s Law & Inductors Spring 2018
Michael Faraday lived in the Lndn area frm 1791 t 1867. He was 29 years ld when Hand Oersted, in 1820, accidentally discvered that electric current creates magnetic field. Thrugh empirical bservatin and
More informationCS 477/677 Analysis of Algorithms Fall 2007 Dr. George Bebis Course Project Due Date: 11/29/2007
CS 477/677 Analysis f Algrithms Fall 2007 Dr. Gerge Bebis Curse Prject Due Date: 11/29/2007 Part1: Cmparisn f Srting Algrithms (70% f the prject grade) The bjective f the first part f the assignment is
More informationElements of Machine Intelligence - I
ECE-175A Elements f Machine Intelligence - I Ken Kreutz-Delgad Nun Vascncels ECE Department, UCSD Winter 2011 The curse The curse will cver basic, but imprtant, aspects f machine learning and pattern recgnitin
More informationMACE For Conformation Traits
MACE Fr Cnfrmatin raits L. Klei and. J. Lawlr Hlstein Assciatin USA, Inc., Brattlebr, Vermnt, USA Intrductin Multiple acrss cuntry evaluatins (MACE) fr prductin traits are nw rutinely cmputed and used
More informationContributions to the Theory of Robust Inference
Cntributins t the Thery f Rbust Inference by Matías Salibián-Barrera Licenciad en Matemáticas, Universidad de Buens Aires, Argentina, 1994 A THESIS SUBMITTED IN PARTIAL FULFILLMENT OF THE REQUIREMENTS
More informationEric Klein and Ning Sa
Week 12. Statistical Appraches t Netwrks: p1 and p* Wasserman and Faust Chapter 15: Statistical Analysis f Single Relatinal Netwrks There are fur tasks in psitinal analysis: 1) Define Equivalence 2) Measure
More informationNOTE ON THE ANALYSIS OF A RANDOMIZED BLOCK DESIGN. Junjiro Ogawa University of North Carolina
NOTE ON THE ANALYSIS OF A RANDOMIZED BLOCK DESIGN by Junjir Ogawa University f Nrth Carlina This research was supprted by the Office f Naval Research under Cntract N. Nnr-855(06) fr research in prbability
More informationLogistic Regression. John Fox. York Summer Programme in Data Analysis. Department of Sociology McMaster University May 2005.
Lgistic Regressin Yrk Summer Prgramme in Data Analysis Jhn Fx Department f Scilgy McMaster University May 2005 2005 by Jhn Fx Lgistic Regressin 1 1. Gals: T shw hw mdels similar t linear mdels can be develped
More informationBayesian nonparametric modeling approaches for quantile regression
Bayesian nnparametric mdeling appraches fr quantile regressin Athanasis Kttas Department f Applied Mathematics and Statistics University f Califrnia, Santa Cruz Department f Statistics Athens University
More informationIntroduction to Quantitative Genetics II: Resemblance Between Relatives
Intrductin t Quantitative Genetics II: Resemblance Between Relatives Bruce Walsh 8 Nvember 006 EEB 600A The heritability f a trait, a central cncept in quantitative genetics, is the prprtin f variatin
More informationIAML: Support Vector Machines
1 / 22 IAML: Supprt Vectr Machines Charles Suttn and Victr Lavrenk Schl f Infrmatics Semester 1 2 / 22 Outline Separating hyperplane with maimum margin Nn-separable training data Epanding the input int
More informationAnalysis of Designed Experiments
8 Analysis f Designed Experiments 8.1 Intrductin In this chapter we discuss experiments whse main aim is t study and cmpare the effects f treatments (diets, varieties, dses) by measuring respnse (yield,
More informationAPPLICATION OF THE BRATSETH SCHEME FOR HIGH LATITUDE INTERMITTENT DATA ASSIMILATION USING THE PSU/NCAR MM5 MESOSCALE MODEL
JP2.11 APPLICATION OF THE BRATSETH SCHEME FOR HIGH LATITUDE INTERMITTENT DATA ASSIMILATION USING THE PSU/NCAR MM5 MESOSCALE MODEL Xingang Fan * and Jeffrey S. Tilley University f Alaska Fairbanks, Fairbanks,
More informationAPPROVED J Graduate Committeei. /fajor Professor. Minor Professor. Committee Memberv* Committee Member. Dean of. Collese of Education
AN INVESTIGATION OP THE EFFECT OF VIOLATING THE ASSUMPTION OF HOMOGENEITY OF REGRESSION SLOPES IN THE ANALYSIS OF COVARIANCE MODEL UPON THE F-STATISTIC APPROVED J Graduate Cmmitteei /fajr Prfessr Minr
More information