Correlation and Regression without Sums of Squares. (Kendall's Tau) Rudy A. Gideon ABSTRACT
|
|
- Norman Oliver
- 6 years ago
- Views:
Transcription
1 Correlaton and Regson wthout Sums of Squa (Kendall's Tau) Rud A. Gdeon ABSTRACT Ths short pee provdes an ntroduton to the use of Kendall's τ n orrelaton and smple lnear regson. The error estmate also uses Kendall's τ so that sums of squa are avoded. A populaton or random varable approah usng elementar slopes demonstrates a natural wa to ntrodue Kendall's τ. Ths paper uses populaton and sample onepts to fuse orrelaton and regson together nto a orrelaton estmaton sstem (CES) that allows regson to follow dretl from orrelaton. A slghtl dfferent formulaton of τ s needed to do ths. In Hollander and Wolfe (999) as well as other publatons there are separate areas on orrelaton and regson and t s not eas to see an onneton and ths paper hopes to orret that. It was ponted out b Huber (98) that lassal regson loses ts optmalt f onl one perent of the data s questonable. Sne ths s almost alwas the ase, regson wth robust propertes (mplng that a few errant data ponts do not severel affet the nferene) suh as pented here are preferable. Ke words: rank statsts, nonparametr, elementar slopes, onordane. INTRODUCTION A few short setons set the stage. Let ( X, Y ) be a ontnuous bvarate random varable and X, ) and X, ) ( Y ( Y two ndependent outomes. Assume ntet les n both the orrelaton and the smple lnear relatonshp between X and Y. The random varable
2 Y X Y X s known as an elementar slope (ELS) as t s the slope of the lne between the two ponts X, ) and X, ) ( Y ( Y. An elementar slope s alled onordant f t s postve and dsordant otherwse. The populaton verson of Kendall's τ an be formulated as the parameter Y Y Y Y Y Y τ ( X, Y ) P( > 0) P( < 0) P ( > 0). Thus τ s smpl the eess of onordant over dsordant slopes, or ve-versa. Let Y Y p P( > 0) and pd p. Note that τ les between - and +, and f τ 0, Y Y then P ( > 0) /.. POPULATION VALUE OF τ FOR THE BIVARIATE NORMAL Assume now that ( X, Y ) s a bvarate normal random varable wth means µ, µ, varanes, and ovarane ρ. Then the ELS random varable Y X Y X has a dstrbuton equal to ρ + ρ Ro where o R has the standard Cauh dstrbuton. The denst and umulatve dstrbuton funton of R are ( π ( + )) and o artan( ) + for over the real lne. The elementar slopes and orrelaton oeffent π ρ an now be related.
3 Y Y p ( > P 0) P( ρ + ρ Ro > 0) artan( ρ ρ ) +, and π τ ( X, Y ) artan( ρ ρ ) Y Y Ths an be solved for ρ n terms of P ( 0) π > p. After a wee bt of algebra the equaton beomes ρ sn( π ( p 0.5)). Now beause τ p, ths latter equaton an also be wrtten as ρ sn(πτ / ). Beause of the smmetr of R o about 0, the random varable Y X Y X s smmetr about the slope ρ. Ths s used to dsuss a natural estmate of the slope n smple lnear regson. 3. THE REGRESSION SETTING Two more theoretal relatonshps are needed. Frst the smple lnear regson model for the bvarate normal s E ( Y X ) µ + ρ ( µ ) ; that s, the slope s β ρ and the nterept s α µ. Seond s needed the dstrbuton of ρ µ Y ρ X. From propertes of the bvarate normal and standard varable transformaton theor, ths dstrbuton s normal wth mean µ and varane ρ µ ( ρ ). Note that ths dstrbuton s also smmetr about ts mean, whh s the nterept n the smple lnear regson. 4. POPULATION REGRESSION WITH KENDALL'S TAU 3
4 The regson duals are unorrelated wth the regsor varable for a orrelaton oeffent parameterθ, f b s found suh that θ ( X, Y bx ) 0. () In CES ths equaton s alled the populaton regson equaton. Solvng for b gves the slope of the populaton lnear model. It s now shown that the soluton of τ ( X, Y bx ) 0 s b β ρ ;that s τ gves the orret slope n the populaton regson equaton. To prove ths, start wth τ ( X, Y bx ) 0 and from the defnton of ( Y bx ) ( Y bx ) Y Y Y Y τ, P ( > 0) P( b > 0) P ( > b) /. Now ( ) b substtutng the dstrbuton of an ELS and for b β ρ, P( > b) Y X Y X P( ρ + ρ R0 > b) P( ρ R0 > 0) P ( R > 0) /. 0 Agan the last step follows from the smmetr of R 0 about 0. So ndeed the populaton regson oeffent s obtaned va τ and the populaton regson equaton. 5. CORRELATION AND SAMPLE REGRESSION WITH KENDALL'S TAU 5. Estmaton of Kendall's τ 4
5 For a random sample of sze n from an absolutel ontnuous dstrbuton wth data n n olumn vetors (, ), form the elementar slopes all of them. Estmate p, the probablt of onordane, b pˆ, as the number of onordant elementar slopes dvded b n. Beause p d p, pd p ˆ ˆ. Then the sample value of Kendall's τ s τ ˆ(, ) pˆ pˆ d, that s, the dfferene between the relatve number of onordant and the relatve number of dsordant elementar slopes. It follows from the populaton deas above that ρ ˆ sn( π ( pˆ.5)) sn( π τˆ / ). 5. Estmate of Slope Note that for the bvarate normal the populaton medan of ELS s ρ as gven n Seton. It s known that the sample medan from a smmetr populaton has the populaton medan as ts pont of smmetr. In the ase of the set of elementar slopes, there s not total ndependene, but one would stll epet the sample medan to var about the populaton medan. In other words, a natural estmate of the slope, ρ, s obtaned as the medan of the sample elementar slopes. It s now shown that τ gves ths estmate b usng the sample equvalent of equaton (). To have the duals and unorrelated, n equaton () replae random varables b data and obtan the sample regson equaton τ (, b) 0. () 5
6 But also τ (, b) p ( b) pd ( b) where p (b) s the fraton of ELSs that are postve for a hosen b and p d (b) s the fraton of ELSs that are negatve for a hosen b. The ELSs for the sample wth a gven b are ( b ) ( b ) b for < n. The sample regson equaton s solved f there are an equal number of onordant and dsordant ELSs; that s, when p ( b) pd ( b) whh means #{ > b} #{ < b} or that b medan{ }. Sen (968) s a good soure for a dfferent approah to slope estmaton va Kendall's Tau. 5.3 Interept After the slope b s estmated the nterept s estmated b the medan of the set of duals b, all t a. Ths s beause b s estmatng ρ and Y ρ X s smmetr about the populaton nterept µ. Then the Kendall estmate of the ρ µ lnear relatonshp s ˆ a + b. 5.4 Inferene on the Slope For nferene on β ρ, the asmptot ult n Gdeon (008) s used; t s shown there that β ˆ τ β has an asmptot N(0, π 9( ) ) dstrbuton. (Sen 968 also n develops a dfferent nferene). The quantt the Kendall regson lne, whle s estmated from the duals around s estmated b a sample varane of the regsor 6
7 varable. In the sprt of usng ust Kendall's τ to estmate all quanttes, here s the CES approah. Let z Φ ( ( n + ) ) for,, K, n where Φ s the dstrbuton funton of a N(0,) random varable. Now the z are used wth the order statsts for the duals ) and ( ( ) ) to estmate the standard devatons. Solvng the regson equatons ( () (wth the same log as above) but now wth ordered data denoted b the supersrpt o, o o o o o o τ ( z, ˆ z ) 0 and τ ( z, ˆ z ) 0, leads to the followng (Gdeon and Rothan 007): ( ) ( ) ˆ medan and z( ) z( ) ( ) ( ) ˆ medan, for < n. z( ) z( ) 6. EXAMPLES AND SIMULATIONS The above work s net arred out for the eample n Sen (968), {,,3,4,0,,8}, and {9,5,9,0,45,55,78}. Frst, the The two estmated standard devatons are are 6.34 and., petvel. The rato of the be omputed n two was. Frst b omputng τ estmated nterept and slope are 6 and 4. ˆ 7.4 and ˆ.48; the lassal ults ˆ / ˆ n the SD of the slope above an ˆ and ˆ and takng the rato. A seond wa that avods nvolvng a dstrbuton s possble wth the CES b omputng ( ) ( ) o o o rato medan dretl; that s, solve τ (, rato* ) 0. Both ( ) ( ) π ˆ gve the same ult, The estmated SD s 9( n ) ˆ π whh s * So the appromate 93% onfdene nterval (same level as n Sen) s 4 ±.8* ± where.8 s the upper quantle of a standard normal. The lower number s 3.85 ompared to Sen s 3.7 and the upper number s 4.5 7
8 ompared to 4.8; onsderng that the sample sze s onl 7, these numbers are ver lose. In hs paper Sen ponts out wh Tau (robust) should be used so t s not neessar to repeat the reasons here. A number of smulatons wth varous parameters were run n order to further authentate the asmptot dstrbuton of the slope, but onl one of eah tpe s dsussed. Frst a smple lnear regson was onstruted wth standard normal varables for a sample of sze 5 and a onfdene oeffent of 50 %. The model was ρ + ρ ε wth and ε ndependent N(0,), and so the orrelaton ρ s also the slope of the regson lne. Of 000 smulatons, 48 % ontaned the true slope ust about what s epeted. A seond smple lnear regson was run on bnomal varables. Frst was generated as a bnomal random varable based on 5 Bernoull trals wth the probablt of suess p. Then was generated b hoosng a slope and addng error wth the same Bnomal but entered b subtratng n*p, so slope* + ( B(5, ) 7.5). Agan 5 observatons were taken and a 50 % onfdene nterval onstruted. Of 000 smulatons, 63% of the onfdene ntervals nluded the true slope. Ths, of ourse, s far more than epeted. There were man ted values on both the and data for the bnomal and so the asmptot dstrbuton ma be less aurate, but the fat that the onfdene nterval level was hgher than epeted s promsng. A further motvaton for gvng ths last eample s to show that CES wth Kendall's an be appropratel used on dsrete data, ust as normal theor tehnques. In both ases, t s known that the ults are onl appromate. 8
9 7. CONCLUSION Most prattoners of statsts fous onl on least squa and sums of squa proedu and have the mpson that these are the onl eas to use and straghtforward proedu for regson analses. The ompat snthess (of populaton, sample, orrelaton, and regson) pented heren s meant to promote the use of Kendall's Tau and the CES to show that there are alternatves that are not onl eas to use but are n addton nherentl robust. Other orrelaton oeffents, both nonparametr and ontnuous, ould be studed n a smlar manner; the author has eamned a few of these, but onl Greatest Devaton Correlaton Coeffent (Gdeon and Hollster 987) has been etensvel studed. To summarze, both the populaton and sample versons of Kendall's Tau are used to eamne orrelaton and the parameters n a smple lnear regson. It should be noted that the nvarane propertes gven n Sen (968) are easl approahed through the orrelaton notons n ths paper. What, perhaps, has held up usng Kendall's and Sen's onepts n statstal analss s that ther work n orrelaton and regson has not been vewed as related muh less as a unversal sstem; orretng ths has been one of the goals of ths paper. Addtonall the advane of omputng has made ths approah ver feasble. Ths approah also leads to a ne wa to etend Kendall's Tau nto multple lnear regson; see Gdeon (008). Customzed R or S-Plus programs are gven n the append to obtan the Kendall estmates n smple lnear regson. These use the ma-mn method for dealng wth tes found n Gdeon and Hollster (987) so that estmates are avalable no matter how man tes there are. Ths method of te breakng gves the same ults as those n pakaged S- 9
10 Plus routnes but not as those n the R routnes. However, ths ma-mn te breakng method s the onl feasble method for the Greatest Devaton Correlaton Coeffent and so s more general. APPENDIX: R OR S-PLUS ROUTINES FOR CORRELATION AND SIMPLE LINEAR REGRESSION WITH KENDALL'S TAU # omputng the two unque vetors wth tes pent: the funton s tauunq tauunq <- funton(,) { n <- length() e <- :n rr <- n+ -rank() tp <- [order(,)] tn <- [order(,rr)] rkp <- order(tp,e) rkn <- order(tn,n:) out <- bnd(rkp,rkn) out } # alulaton of Kendall's tau on unque ma-mn vetors # the funton s rtau rtau <- funton(,){ ot <- tauunq(,) rkp <- ot[,] rkn <- ot[,] dp <- 0 dn <- 0 n <- length() n <- ((n*(n-))/) n <- n- for( n :n) { <- + tempp <- rkp[]-rkp[:n] tempn <- rkn[]-rkn[:n] dp <- dp + sum( tempp<0) dn <- dn + sum( tempn<0) } out <- (dp + dn)/n - out } # output s slope and nterept, funton name tauslp n postons and tauslp <- funton(,) { rat <- (outer(,,"-")/outer(,,"-")) ratv <- rat[!s.na(rat)] slp <- medan(ratv) 0
11 <- - slp * ant <- medan() <- - ant k <- rtau(,) k <- sum() k <- medan() out <- (slp,ant,k,k,k) out } REFERENCES R.A. Gdeon, The Correlaton Coeffents, Journal of Modern Appled Statstal Methods 6 (007). R.A. Gdeon, The Relatonshp between a Correlaton Coeffent and ts Assoated Slope Estmates n Multple Lnear Regson, Sankha (008) under revew. R.A. Gdeon and R.A. Hollster, A Rank Correlaton Coeffent Resstant to Outlers, Journal of the Ameran Statstal Assoaton 8 (987) R.A. Gdeon and A. M. Rothan CSJ, Loaton and Sale Estmaton wth Correlaton Coeffents, Communatons n Statsts-Theor and Methods (008) under revew. M.G. Kendall and J.D. Gbbons, Rank Correlaton Methods, 5th ed., Oford Unverst Ps, 990, or M. G. Kendall, Rank Correlaton Methods, 3rd ed., Hafner Publ. Co., 96. M. Hollander and D.A. Wolfe, Nonparametr Statstal Methods, nd ed., John Wle & Sons, 999. P.J. Huber, Robust Statsts, John Wle & Sons, 98. P.K. Sen, Estmates of the Regson Coeffent based on Kendall's Tau, Journal of the Ameran Statstal Assoaton 63 (968) Webste ontans opes of the author's unpublshed work.
The corresponding link function is the complementary log-log link The logistic model is comparable with the probit model if
SK300 and SK400 Lnk funtons for bnomal GLMs Autumn 08 We motvate the dsusson by the beetle eample GLMs for bnomal and multnomal data Covers the followng materal from hapters 5 and 6: Seton 5.6., 5.6.3,
More informationCorrelation and Regression. Correlation 9.1. Correlation. Chapter 9
Chapter 9 Correlaton and Regresson 9. Correlaton Correlaton A correlaton s a relatonshp between two varables. The data can be represented b the ordered pars (, ) where s the ndependent (or eplanator) varable,
More informationJSM Survey Research Methods Section. Is it MAR or NMAR? Michail Sverchkov
JSM 2013 - Survey Researh Methods Seton Is t MAR or NMAR? Mhal Sverhkov Bureau of Labor Statsts 2 Massahusetts Avenue, NE, Sute 1950, Washngton, DC. 20212, Sverhkov.Mhael@bls.gov Abstrat Most methods that
More information/ n ) are compared. The logic is: if the two
STAT C141, Sprng 2005 Lecture 13 Two sample tests One sample tests: examples of goodness of ft tests, where we are testng whether our data supports predctons. Two sample tests: called as tests of ndependence
More informationSTK4900/ Lecture 4 Program. Counterfactuals and causal effects. Example (cf. practical exercise 10)
STK4900/9900 - Leture 4 Program 1. Counterfatuals and ausal effets 2. Confoundng 3. Interaton 4. More on ANOVA Setons 4.1, 4.4, 4.6 Supplementary materal on ANOVA Example (f. pratal exerse 10) How does
More informationLinear Correlation. Many research issues are pursued with nonexperimental studies that seek to establish relationships among 2 or more variables
Lnear Correlaton Many research ssues are pursued wth nonexpermental studes that seek to establsh relatonshps among or more varables E.g., correlates of ntellgence; relaton between SAT and GPA; relaton
More informationModule 2. Random Processes. Version 2 ECE IIT, Kharagpur
Module Random Processes Lesson 6 Functons of Random Varables After readng ths lesson, ou wll learn about cdf of functon of a random varable. Formula for determnng the pdf of a random varable. Let, X be
More informationChapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise.
Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where y + = β + β e for =,..., y and are observable varables e s a random error How can an estmaton rule be constructed for the
More informationChapter 14 Simple Linear Regression
Chapter 4 Smple Lnear Regresson Chapter 4 - Smple Lnear Regresson Manageral decsons often are based on the relatonshp between two or more varables. Regresson analss can be used to develop an equaton showng
More informatione i is a random error
Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where + β + β e for,..., and are observable varables e s a random error How can an estmaton rule be constructed for the unknown
More informationMachine Learning: and 15781, 2003 Assignment 4
ahne Learnng: 070 and 578, 003 Assgnment 4. VC Dmenson 30 onts Consder the spae of nstane X orrespondng to all ponts n the D x, plane. Gve the VC dmenson of the followng hpothess spaes. No explanaton requred.
More informationThe Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction
ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also
More informationStatistics for Business and Economics
Statstcs for Busness and Economcs Chapter 11 Smple Regresson Copyrght 010 Pearson Educaton, Inc. Publshng as Prentce Hall Ch. 11-1 11.1 Overvew of Lnear Models n An equaton can be ft to show the best lnear
More informationLocation and Scale Estimation with Correlation Coefficients
Unversty of Montana ScholarWorks at Unversty of Montana Mathematcal Scences Faculty Publcatons Mathematcal Scences 2-2011 Locaton and Scale Estmaton wth Correlaton Coeffcents Rudy Gdeon Unversty of Montana,
More informationStatistics for Economics & Business
Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable
More informationLecture 3: Probability Distributions
Lecture 3: Probablty Dstrbutons Random Varables Let us begn by defnng a sample space as a set of outcomes from an experment. We denote ths by S. A random varable s a functon whch maps outcomes nto the
More informationx yi In chapter 14, we want to perform inference (i.e. calculate confidence intervals and perform tests of significance) in this setting.
The Practce of Statstcs, nd ed. Chapter 14 Inference for Regresson Introducton In chapter 3 we used a least-squares regresson lne (LSRL) to represent a lnear relatonshp etween two quanttatve explanator
More informationDepartment of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6
Department of Quanttatve Methods & Informaton Systems Tme Seres and Ther Components QMIS 30 Chapter 6 Fall 00 Dr. Mohammad Zanal These sldes were modfed from ther orgnal source for educatonal purpose only.
More informationOPTIMISATION. Introduction Single Variable Unconstrained Optimisation Multivariable Unconstrained Optimisation Linear Programming
OPTIMIATION Introducton ngle Varable Unconstraned Optmsaton Multvarable Unconstraned Optmsaton Lnear Programmng Chapter Optmsaton /. Introducton In an engneerng analss, sometmes etremtes, ether mnmum or
More information1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands
Content. Inference on Regresson Parameters a. Fndng Mean, s.d and covarance amongst estmates.. Confdence Intervals and Workng Hotellng Bands 3. Cochran s Theorem 4. General Lnear Testng 5. Measures of
More informationSIMPLE LINEAR REGRESSION
Smple Lnear Regresson and Correlaton Introducton Prevousl, our attenton has been focused on one varable whch we desgnated b x. Frequentl, t s desrable to learn somethng about the relatonshp between two
More informationTHE ROYAL STATISTICAL SOCIETY 2006 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE
THE ROYAL STATISTICAL SOCIETY 6 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE PAPER I STATISTICAL THEORY The Socety provdes these solutons to assst canddates preparng for the eamnatons n future years and for
More informationBasically, if you have a dummy dependent variable you will be estimating a probability.
ECON 497: Lecture Notes 13 Page 1 of 1 Metropoltan State Unversty ECON 497: Research and Forecastng Lecture Notes 13 Dummy Dependent Varable Technques Studenmund Chapter 13 Bascally, f you have a dummy
More informationResource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis
Resource Allocaton and Decson Analss (ECON 800) Sprng 04 Foundatons of Regresson Analss Readng: Regresson Analss (ECON 800 Coursepak, Page 3) Defntons and Concepts: Regresson Analss statstcal technques
More informationAnalysis of Mixed Correlated Bivariate Negative Binomial and Continuous Responses
Avalable at http://pvamu.edu/aam Appl. Appl. Math. ISSN: 1932-9466 Vol. 8, Issue 2 (Deember 2013), pp. 404 415 Applatons and Appled Mathemats: An Internatonal Journal (AAM) Analyss of Mxed Correlated Bvarate
More informationLecture 3 Stat102, Spring 2007
Lecture 3 Stat0, Sprng 007 Chapter 3. 3.: Introducton to regresson analyss Lnear regresson as a descrptve technque The least-squares equatons Chapter 3.3 Samplng dstrbuton of b 0, b. Contnued n net lecture
More informationLogic effort and gate sizing
EEN454 Dgtal Integrated rcut Desgn Logc effort and gate szng EEN 454 Introducton hp desgners face a bewlderng arra of choces What s the best crcut topolog for a functon? How man stages of logc gve least
More informationCharged Particle in a Magnetic Field
Charged Partle n a Magnet Feld Mhael Fowler 1/16/08 Introduton Classall, the fore on a harged partle n eletr and magnet felds s gven b the Lorentz fore law: v B F = q E+ Ths velot-dependent fore s qute
More informationQ1: Calculate the mean, median, sample variance, and standard deviation of 25, 40, 05, 70, 05, 40, 70.
Q1: Calculate the mean, medan, sample varance, and standard devaton of 5, 40, 05, 70, 05, 40, 70. Q: The frequenc dstrbuton for a data set s gven below. Measurements 0 1 3 4 Frequenc 3 5 8 3 1 a) What
More informationStatistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation
Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 13 The Smple Lnear Regresson Model and Correlaton 1999 Prentce-Hall, Inc. Chap. 13-1 Chapter Topcs Types of Regresson Models Determnng the Smple Lnear
More informationSystematic Error Illustration of Bias. Sources of Systematic Errors. Effects of Systematic Errors 9/23/2009. Instrument Errors Method Errors Personal
9/3/009 Sstematc Error Illustraton of Bas Sources of Sstematc Errors Instrument Errors Method Errors Personal Prejudce Preconceved noton of true value umber bas Prefer 0/5 Small over large Even over odd
More informationRockefeller College University at Albany
Rockefeller College Unverst at Alban PAD 705 Handout: Maxmum Lkelhood Estmaton Orgnal b Davd A. Wse John F. Kenned School of Government, Harvard Unverst Modfcatons b R. Karl Rethemeer Up to ths pont n
More informationStatistical Inference. 2.3 Summary Statistics Measures of Center and Spread. parameters ( population characteristics )
Ismor Fscher, 8//008 Stat 54 / -8.3 Summary Statstcs Measures of Center and Spread Dstrbuton of dscrete contnuous POPULATION Random Varable, numercal True center =??? True spread =???? parameters ( populaton
More informationController Design for Networked Control Systems in Multiple-packet Transmission with Random Delays
Appled Mehans and Materals Onlne: 03-0- ISSN: 66-748, Vols. 78-80, pp 60-604 do:0.408/www.sentf.net/amm.78-80.60 03 rans eh Publatons, Swtzerland H Controller Desgn for Networed Control Systems n Multple-paet
More informationGenerative classification models
CS 675 Intro to Machne Learnng Lecture Generatve classfcaton models Mlos Hauskrecht mlos@cs.ptt.edu 539 Sennott Square Data: D { d, d,.., dn} d, Classfcaton represents a dscrete class value Goal: learn
More informationChapter 3 Describing Data Using Numerical Measures
Chapter 3 Student Lecture Notes 3-1 Chapter 3 Descrbng Data Usng Numercal Measures Fall 2006 Fundamentals of Busness Statstcs 1 Chapter Goals To establsh the usefulness of summary measures of data. The
More informationChapter 9: Statistical Inference and the Relationship between Two Variables
Chapter 9: Statstcal Inference and the Relatonshp between Two Varables Key Words The Regresson Model The Sample Regresson Equaton The Pearson Correlaton Coeffcent Learnng Outcomes After studyng ths chapter,
More informationExponential Type Product Estimator for Finite Population Mean with Information on Auxiliary Attribute
Avalable at http://pvamu.edu/aam Appl. Appl. Math. ISSN: 193-9466 Vol. 10, Issue 1 (June 015), pp. 106-113 Applcatons and Appled Mathematcs: An Internatonal Journal (AAM) Exponental Tpe Product Estmator
More informationEcon107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)
I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes
More informationEC3075 Mathematical Approaches to Economics
EC3075 Mathematal Aroahes to Eonoms etures 7-8: Dualt and Constraned Otmsaton Pemberton & Rau haters 7-8 Dr Gaa Garno [Astle Clarke Room 4 emal: gg44] Dualt n onsumer theor We wll exose the rmal otmsaton
More informationA Practical Look at Correlation
A Practcal Look at Correlaton Rud A. Gdeon and Carol A. Ulsafer Dept. of Mathematcal Scences, Unverst of Montana, Mssoula, MT, 598 Abstract The word correlaton n general ndcates that two quanttes are related
More informationSee Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition)
Count Data Models See Book Chapter 11 2 nd Edton (Chapter 10 1 st Edton) Count data consst of non-negatve nteger values Examples: number of drver route changes per week, the number of trp departure changes
More informationUsing the estimated penetrances to determine the range of the underlying genetic model in casecontrol
Georgetown Unversty From the SelectedWorks of Mark J Meyer 8 Usng the estmated penetrances to determne the range of the underlyng genetc model n casecontrol desgn Mark J Meyer Neal Jeffres Gang Zheng Avalable
More informationPART 8. Partial Differential Equations PDEs
he Islamc Unverst of Gaza Facult of Engneerng Cvl Engneerng Department Numercal Analss ECIV 3306 PAR 8 Partal Dfferental Equatons PDEs Chapter 9; Fnte Dfference: Ellptc Equatons Assocate Prof. Mazen Abualtaef
More informationLecture 16 Statistical Analysis in Biomaterials Research (Part II)
3.051J/0.340J 1 Lecture 16 Statstcal Analyss n Bomaterals Research (Part II) C. F Dstrbuton Allows comparson of varablty of behavor between populatons usng test of hypothess: σ x = σ x amed for Brtsh statstcan
More informationSystems of Equations (SUR, GMM, and 3SLS)
Lecture otes on Advanced Econometrcs Takash Yamano Fall Semester 4 Lecture 4: Sstems of Equatons (SUR, MM, and 3SLS) Seemngl Unrelated Regresson (SUR) Model Consder a set of lnear equatons: $ + ɛ $ + ɛ
More informationY = β 0 + β 1 X 1 + β 2 X β k X k + ε
Chapter 3 Secton 3.1 Model Assumptons: Multple Regresson Model Predcton Equaton Std. Devaton of Error Correlaton Matrx Smple Lnear Regresson: 1.) Lnearty.) Constant Varance 3.) Independent Errors 4.) Normalty
More informationβ0 + β1xi. You are interested in estimating the unknown parameters β
Revsed: v3 Ordnar Least Squares (OLS): Smple Lnear Regresson (SLR) Analtcs The SLR Setup Sample Statstcs Ordnar Least Squares (OLS): FOCs and SOCs Back to OLS and Sample Statstcs Predctons (and Resduals)
More informationNumerical Solution of Ordinary Differential Equations
Numercal Methods (CENG 00) CHAPTER-VI Numercal Soluton of Ordnar Dfferental Equatons 6 Introducton Dfferental equatons are equatons composed of an unknown functon and ts dervatves The followng are examples
More informationChapter 13: Multiple Regression
Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to
More informationMeasuring the Strength of Association
Stat 3000 Statstcs for Scentsts and Engneers Measurng the Strength of Assocaton Note that the slope s one measure of the lnear assocaton between two contnuous varables t tells ou how much the average of
More informationAssortment Optimization under MNL
Assortment Optmzaton under MNL Haotan Song Aprl 30, 2017 1 Introducton The assortment optmzaton problem ams to fnd the revenue-maxmzng assortment of products to offer when the prces of products are fxed.
More informationEcon Statistical Properties of the OLS estimator. Sanjaya DeSilva
Econ 39 - Statstcal Propertes of the OLS estmator Sanjaya DeSlva September, 008 1 Overvew Recall that the true regresson model s Y = β 0 + β 1 X + u (1) Applyng the OLS method to a sample of data, we estmate
More informationProperties of Least Squares
Week 3 3.1 Smple Lnear Regresson Model 3. Propertes of Least Squares Estmators Y Y β 1 + β X + u weekly famly expendtures X weekly famly ncome For a gven level of x, the expected level of food expendtures
More informationFormulas for the Determinant
page 224 224 CHAPTER 3 Determnants e t te t e 2t 38 A = e t 2te t e 2t e t te t 2e 2t 39 If 123 A = 345, 456 compute the matrx product A adj(a) What can you conclude about det(a)? For Problems 40 43, use
More informationComplement of an Extended Fuzzy Set
Internatonal Journal of Computer pplatons (0975 8887) Complement of an Extended Fuzzy Set Trdv Jyot Neog Researh Sholar epartment of Mathemats CMJ Unversty, Shllong, Meghalaya usmanta Kumar Sut ssstant
More informationInner Product. Euclidean Space. Orthonormal Basis. Orthogonal
Inner Product Defnton 1 () A Eucldean space s a fnte-dmensonal vector space over the reals R, wth an nner product,. Defnton 2 (Inner Product) An nner product, on a real vector space X s a symmetrc, blnear,
More informationDepartment of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution
Department of Statstcs Unversty of Toronto STA35HS / HS Desgn and Analyss of Experments Term Test - Wnter - Soluton February, Last Name: Frst Name: Student Number: Instructons: Tme: hours. Ads: a non-programmable
More informationModeling and Simulation NETW 707
Modelng and Smulaton NETW 707 Lecture 5 Tests for Random Numbers Course Instructor: Dr.-Ing. Magge Mashaly magge.ezzat@guc.edu.eg C3.220 1 Propertes of Random Numbers Random Number Generators (RNGs) must
More informationDefinition. Measures of Dispersion. Measures of Dispersion. Definition. The Range. Measures of Dispersion 3/24/2014
Measures of Dsperson Defenton Range Interquartle Range Varance and Standard Devaton Defnton Measures of dsperson are descrptve statstcs that descrbe how smlar a set of scores are to each other The more
More information9. Complex Numbers. 1. Numbers revisited. 2. Imaginary number i: General form of complex numbers. 3. Manipulation of complex numbers
9. Comple Numbers. Numbers revsted. Imagnar number : General form of comple numbers 3. Manpulaton of comple numbers 4. The Argand dagram 5. The polar form for comple numbers 9.. Numbers revsted We saw
More informationStatistical Analysis of Environmental Data - Academic Year Prof. Fernando Sansò CLUSTER ANALYSIS
Statstal Analyss o Envronmental Data - Aadem Year 008-009 Pro. Fernando Sansò EXERCISES - PAR CLUSER ANALYSIS Supervsed Unsupervsed Determnst Stohast Determnst Stohast Dsrmnant Analyss Bayesan Herarhal
More information3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X
Statstcs 1: Probablty Theory II 37 3 EPECTATION OF SEVERAL RANDOM VARIABLES As n Probablty Theory I, the nterest n most stuatons les not on the actual dstrbuton of a random vector, but rather on a number
More informationLinear Regression Analysis: Terminology and Notation
ECON 35* -- Secton : Basc Concepts of Regresson Analyss (Page ) Lnear Regresson Analyss: Termnology and Notaton Consder the generc verson of the smple (two-varable) lnear regresson model. It s represented
More informationDOAEstimationforCoherentSourcesinBeamspace UsingSpatialSmoothing
DOAEstmatonorCoherentSouresneamspae UsngSpatalSmoothng YnYang,ChunruWan,ChaoSun,QngWang ShooloEletralandEletronEngneerng NanangehnologalUnverst,Sngapore,639798 InsttuteoAoustEngneerng NorthwesternPoltehnalUnverst,X
More informationImportant Instructions to the Examiners:
Summer 0 Examnaton Subject & Code: asc Maths (70) Model Answer Page No: / Important Instructons to the Examners: ) The Answers should be examned by key words and not as word-to-word as gven n the model
More informationMaximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models
ECO 452 -- OE 4: Probt and Logt Models ECO 452 -- OE 4 Maxmum Lkelhood Estmaton of Bnary Dependent Varables Models: Probt and Logt hs note demonstrates how to formulate bnary dependent varables models
More informationCHAPTER 4. Vector Spaces
man 2007/2/16 page 234 CHAPTER 4 Vector Spaces To crtcze mathematcs for ts abstracton s to mss the pont entrel. Abstracton s what makes mathematcs work. Ian Stewart The man am of ths tet s to stud lnear
More informationCS 2750 Machine Learning. Lecture 5. Density estimation. CS 2750 Machine Learning. Announcements
CS 750 Machne Learnng Lecture 5 Densty estmaton Mlos Hauskrecht mlos@cs.ptt.edu 539 Sennott Square CS 750 Machne Learnng Announcements Homework Due on Wednesday before the class Reports: hand n before
More informatione a = 12.4 i a = 13.5i h a = xi + yj 3 a Let r a = 25cos(20) i + 25sin(20) j b = 15cos(55) i + 15sin(55) j
Vetors MC Qld-3 49 Chapter 3 Vetors Exerse 3A Revew of vetors a d e f e a x + y omponent: x a os(θ 6 os(80 + 39 6 os(9.4 omponent: y a sn(θ 6 sn(9 0. a.4 0. f a x + y omponent: x a os(θ 5 os( 5 3.6 omponent:
More informationFall 2012 Analysis of Experimental Measurements B. Eisenstein/rev. S. Errede. . For P such independent random variables (aka degrees of freedom): 1 =
Fall Analss of Epermental Measurements B. Esensten/rev. S. Errede More on : The dstrbuton s the.d.f. for a (normalzed sum of squares of ndependent random varables, each one of whch s dstrbuted as N (,.
More informationIntroduction to Regression
Introducton to Regresson Dr Tom Ilvento Department of Food and Resource Economcs Overvew The last part of the course wll focus on Regresson Analyss Ths s one of the more powerful statstcal technques Provdes
More informationA Robust Method for Calculating the Correlation Coefficient
A Robust Method for Calculatng the Correlaton Coeffcent E.B. Nven and C. V. Deutsch Relatonshps between prmary and secondary data are frequently quantfed usng the correlaton coeffcent; however, the tradtonal
More informationOne Dimensional Axial Deformations
One Dmensonal al Deformatons In ths secton, a specfc smple geometr s consdered, that of a long and thn straght component loaded n such a wa that t deforms n the aal drecton onl. The -as s taken as the
More information4.3 Poisson Regression
of teratvely reweghted least squares regressons (the IRLS algorthm). We do wthout gvng further detals, but nstead focus on the practcal applcaton. > glm(survval~log(weght)+age, famly="bnomal", data=baby)
More informationMaximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models
ECO 452 -- OE 4: Probt and Logt Models ECO 452 -- OE 4 Mamum Lkelhood Estmaton of Bnary Dependent Varables Models: Probt and Logt hs note demonstrates how to formulate bnary dependent varables models for
More informationBiostatistics. Chapter 11 Simple Linear Correlation and Regression. Jing Li
Bostatstcs Chapter 11 Smple Lnear Correlaton and Regresson Jng L jng.l@sjtu.edu.cn http://cbb.sjtu.edu.cn/~jngl/courses/2018fall/b372/ Dept of Bonformatcs & Bostatstcs, SJTU Recall eat chocolate Cell 175,
More informationsince [1-( 0+ 1x1i+ 2x2 i)] [ 0+ 1x1i+ assumed to be a reasonable approximation
Econ 388 R. Butler 204 revsons Lecture 4 Dummy Dependent Varables I. Lnear Probablty Model: the Regresson model wth a dummy varables as the dependent varable assumpton, mplcaton regular multple regresson
More informationLimited Dependent Variables and Panel Data. Tibor Hanappi
Lmted Dependent Varables and Panel Data Tbor Hanapp 30.06.2010 Lmted Dependent Varables Dscrete: Varables that can take onl a countable number of values Censored/Truncated: Data ponts n some specfc range
More informationwhere I = (n x n) diagonal identity matrix with diagonal elements = 1 and off-diagonal elements = 0; and σ 2 e = variance of (Y X).
11.4.1 Estmaton of Multple Regresson Coeffcents In multple lnear regresson, we essentally solve n equatons for the p unnown parameters. hus n must e equal to or greater than p and n practce n should e
More informationPropagation of error for multivariable function
Propagaton o error or multvarable uncton ow consder a multvarable uncton (u, v, w, ). I measurements o u, v, w,. All have uncertant u, v, w,., how wll ths aect the uncertant o the uncton? L tet) o (Equaton
More information4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA
4 Analyss of Varance (ANOVA) 5 ANOVA 51 Introducton ANOVA ANOVA s a way to estmate and test the means of multple populatons We wll start wth one-way ANOVA If the populatons ncluded n the study are selected
More informationLinear Approximation with Regularization and Moving Least Squares
Lnear Approxmaton wth Regularzaton and Movng Least Squares Igor Grešovn May 007 Revson 4.6 (Revson : March 004). 5 4 3 0.5 3 3.5 4 Contents: Lnear Fttng...4. Weghted Least Squares n Functon Approxmaton...
More informationDurban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications
Durban Watson for Testng the Lack-of-Ft of Polynomal Regresson Models wthout Replcatons Ruba A. Alyaf, Maha A. Omar, Abdullah A. Al-Shha ralyaf@ksu.edu.sa, maomar@ksu.edu.sa, aalshha@ksu.edu.sa Department
More informationStatistics II Final Exam 26/6/18
Statstcs II Fnal Exam 26/6/18 Academc Year 2017/18 Solutons Exam duraton: 2 h 30 mn 1. (3 ponts) A town hall s conductng a study to determne the amount of leftover food produced by the restaurants n the
More informationComparison of Regression Lines
STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence
More informationBiostatistics 360 F&t Tests and Intervals in Regression 1
Bostatstcs 360 F&t Tests and Intervals n Regresson ORIGIN Model: Y = X + Corrected Sums of Squares: X X bar where: s the y ntercept of the regresson lne (translaton) s the slope of the regresson lne (scalng
More informationIntroduction to Vapor/Liquid Equilibrium, part 2. Raoult s Law:
CE304, Sprng 2004 Lecture 4 Introducton to Vapor/Lqud Equlbrum, part 2 Raoult s Law: The smplest model that allows us do VLE calculatons s obtaned when we assume that the vapor phase s an deal gas, and
More informationECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics
ECOOMICS 35*-A Md-Term Exam -- Fall Term 000 Page of 3 pages QUEE'S UIVERSITY AT KIGSTO Department of Economcs ECOOMICS 35* - Secton A Introductory Econometrcs Fall Term 000 MID-TERM EAM ASWERS MG Abbott
More informationSuites of Tests. DIEHARD TESTS (Marsaglia, 1985) See
Sutes of Tests DIEHARD TESTS (Marsagla, 985 See http://stat.fsu.edu/~geo/dehard.html NIST Test sute- 6 tests on the sequences of bts http://csrc.nst.gov/rng/ Test U0 Includes the above tests. http://www.ro.umontreal.ca/~lecuyer/
More informationLecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 212. Chapters 14, 15 & 16. Professor Ahmadi, Ph.D. Department of Management
Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 1 Chapters 14, 15 & 16 Professor Ahmad, Ph.D. Department of Management Revsed August 005 Chapter 14 Formulas Smple Lnear Regresson Model: y =
More informationSampling Theory MODULE VII LECTURE - 23 VARYING PROBABILITY SAMPLING
Samplng heory MODULE VII LECURE - 3 VARYIG PROBABILIY SAMPLIG DR. SHALABH DEPARME OF MAHEMAICS AD SAISICS IDIA ISIUE OF ECHOLOGY KAPUR he smple random samplng scheme provdes a random sample where every
More informationISQS 6348 Final Open notes, no books. Points out of 100 in parentheses. Y 1 ε 2
ISQS 6348 Fnal Open notes, no books. Ponts out of 100 n parentheses. 1. The followng path dagram s gven: ε 1 Y 1 ε F Y 1.A. (10) Wrte down the usual model and assumptons that are mpled by ths dagram. Soluton:
More informationMultivariate Ratio Estimation With Known Population Proportion Of Two Auxiliary Characters For Finite Population
Multvarate Rato Estmaton Wth Knon Populaton Proporton Of To Auxlar haracters For Fnte Populaton *Raesh Sngh, *Sachn Mal, **A. A. Adeara, ***Florentn Smarandache *Department of Statstcs, Banaras Hndu Unverst,Varanas-5,
More informationJAB Chain. Long-tail claims development. ASTIN - September 2005 B.Verdier A. Klinger
JAB Chan Long-tal clams development ASTIN - September 2005 B.Verder A. Klnger Outlne Chan Ladder : comments A frst soluton: Munch Chan Ladder JAB Chan Chan Ladder: Comments Black lne: average pad to ncurred
More informationCS-433: Simulation and Modeling Modeling and Probability Review
CS-433: Smulaton and Modelng Modelng and Probablty Revew Exercse 1. (Probablty of Smple Events) Exercse 1.1 The owner of a camera shop receves a shpment of fve cameras from a camera manufacturer. Unknown
More informationIntroduction to Analysis of Variance (ANOVA) Part 1
Introducton to Analss of Varance (ANOVA) Part 1 Sngle factor The logc of Analss of Varance Is the varance explaned b the model >> than the resdual varance In regresson models Varance explaned b regresson
More informationRigid body simulation
Rgd bod smulaton Rgd bod smulaton Once we consder an object wth spacal etent, partcle sstem smulaton s no longer suffcent Problems Problems Unconstraned sstem rotatonal moton torques and angular momentum
More informationEconomics 130. Lecture 4 Simple Linear Regression Continued
Economcs 130 Lecture 4 Contnued Readngs for Week 4 Text, Chapter and 3. We contnue wth addressng our second ssue + add n how we evaluate these relatonshps: Where do we get data to do ths analyss? How do
More informationChapter 12. Ordinary Differential Equation Boundary Value (BV) Problems
Chapter. Ordnar Dfferental Equaton Boundar Value (BV) Problems In ths chapter we wll learn how to solve ODE boundar value problem. BV ODE s usuall gven wth x beng the ndependent space varable. p( x) q(
More information