CHAPTER 4 DIAGNOSTICS FOR INFLUENTIAL OBSERVATIONS
|
|
- Alexis Harris
- 6 years ago
- Views:
Transcription
1 CHAPTER 4 DIAGNOSTICS FOR INFLUENTIAL OBSERVATIONS 1
2 Influential bservatins are bservatins whse presence in the data can have a distrting effect n the parameter estimates and pssibly the entire analysis, e.g. identifying the wrng mdel. Distinctin frm utliers, thugh it is pssible fr ne bservatin t be bth influential and an utlier. Outliers: 1. data pints that cntain unusual dependent (y) values. 2. Outlying independent (x) values nt indicating lack f fit f mdel, but sme bservatins still influence the fit mre than thers. Detectin: In simple linear regressin, usually easy frm plts f data, but in multiple regressin, mre frmal measures are required. 2
3 8 A y B C x Figure 4.1. Three least squares lines fitted t sample data, where the bservatin at x = 8 is allwed t mve between the three pints A, B and C. The crrespnding least squares fits are the slid, dashed and dtted lines respectively. 3
4 The hat matrix Recall Ŷ = HY, H = X(X T X) 1 X T, s cvariance matrix f Ŷ is Var{Ŷ } = Hσ 2 Variance f ŷ i is h ii σ 2, variance f ith residual e i is (1 h ii )σ 2. Prperties f the {h ii } values include 0 h ii 1 fr all i, (1) i h ii = p. (2) Prperty (1) fllws simply frm the fact that bth h ii σ 2 and (1 h ii )σ 2 are the variances f randm quantities, and therefre are nnnegative. Fr prperty (2), nte that tr(h)=p. 4
5 Leverage A data pint with large h ii is called a pint f high leverage. Hw high is high? by (2), the average value f h ii is p n. A standard criterin is t call any data pint fr which h ii > 2p n a pint f high leverage. Nte that since h ii is a functin f X, it has n distributin, thus n frmal test. 5
6 Example: Cnsider the artificial data f Fig The twelve x values here are 0, 0.2, 0.4,..., 1.8, 2, 8. The crrespnding h ii values are.1342,.1221,...,.0869, The last bservatin, crrespnding t x = 8, is clearly highly influential. Intuitively, this is because if this pint is mved up r dwn, the least squares straight line will tend t fllw it the verall least squares fit n the ther 11 bservatins is nt much affected by mdest changes in the slpe f the fitted straight line, but the fit at x = 8 has a big influence. Nte that this has nthing t d with y 12 pssibly being an utlier, since fr any i, the actual value f y i des nt even enter int the calculatin f h ii. 6
7 Real data examples frm Chapter 3 Tree data: Highest h ii value is h 20 = (diameter=13.8, height=64), nt extreme fr either independent variable but des crrespnd t a fairly large diameter cmbined with the secnd smallest height. Next three largest values f h ii are h 3 = , h 31 = and h 2 = In this case p = 3, n = 31 s accrding t the 2p n = criterin, bservatins 3 and 20 are influential. Draws attentin t tw bservatins which wuld nt necessarily be identified as influential frm initial inspectin f the data. 7
8 diam height Figure Plt f tree diameter against height. 8
9 Real data examples frm Chapter 3 Nuclear pwer data: 2p n = = Largest values f h ii are in rws 26 (0.4126), 19 (0.3614) and 22 (0.3526). Nt especially large, but it turns ut later these bservatins are influential. > nuk.inf<lm.influence(nuk.lm) > print(nuk.inf$hat, digit=2)
10 Deletin diagnstics Recall Var(e i ) = (1 h ii )σ 2. This suggests that after estimating σ 2 by the mean squared errr s 2, we may then define e i = e i s 1 h ii (3) as a standardized frm f residual. We call (3) the internally standardized residual. (Als knwn as studentized.) 10
11 This des nt take int accunt influence f utlier n parameter estimates. Culd d that by deletin residual d i = y i ŷ i(i) (4) The subscript (i) means that the mdel is refitted withut the i th bservatin. Thus ŷ i(i) means the predicted value f y i based n the mdel fit in which bservatin i is mitted. Frmula: d i = e i 1 h ii. (5) Since Var(e i ) = σ 2 (1 h ii ), it fllws that Var(d i ) = σ 2 /(1 h ii ) and we can estimate this by s 2 (d i ) = s2 (i) 1 h ii (6) in which s 2 dentes the estimated mean squared (i) errr with the i th bservatin deleted. 11
12 Bth y i and ŷ i(i) are statistically independent f s 2 (i), s d i and s 2 are independent, and (i) This suggests that we define d i s(d i ) t n p 1. (7) d i = d i s(d i ) as an externally studentized residual. (8) Calculate s 2 (i) frm (n p)s 2 = (n p 1)s 2 (i) + e2 i 1 h ii. (9) Cmbining these frmulae leads t d i = e i [ n p 1 (1 h ii )(n p)s 2 e 2 i ]1 2. (10) 12
13 Examples Tree data: recall apparent utliers in bservatins 15, 16, 18. Internally standardized residuals are 2.109, 1.834, Externally studentized residuals are 2.258, and Since t 27,.975 = 2.052, frmal test f fit (at the 5% level f significance) wuld reject bservatins 15 and 18 as utliers. Largest psitive d i is 1.703, definitely nt significant. Culd g n t delete all three discussed in text. Nuclear pwer data: largest internally standardized residuals in magnitude are in rw 19, in rw 7, in rw 26 and in rw 12. When externally studentized these becme 2.503, 2.427, and t 25,.975 = 2.060, t 25,.995 =
14 1. DFFITS Measures f influence Detect influence n fitted values. (DF F IT S) i = ŷi ŷ i(i) = d hii i. (11) s (i) hii 1 h ii Ratinale: standardized frmula fr examining the difference between ŷ i and ŷ i(i). Hwever, the secnd equality in (11) shws that it is equivalent t a rescaled frm f the externally studentized residual, where the rescaling is dependent n the leverage f the i th data pint. Thus DF F IT S may be thught f as a cmbined measure f influence that takes int accunt the leverage f the data pint as well as the size f the residual. Rule f thumb: bservatin is influential if DF F IT S is greater than 1 in the case f small data sets, r 2 p/n fr large data sets. 14
15 Examples Tree data: 2 p/n = , nly inflential value is bservatin 18, DF F IT S = Cmbines utlier and mderate leverage (h ii = ). Nuclear pwer data: 2 p/n = is easily exceeded in magnitude by the DF F IT S values fr rws 19 (1.8830) and 26 ( ), and t a lesser extent in rw 7 ( ). Since we have already seen that these three rws have the largest residuals in magnitude, and that rws 26 and 19 are the nes with the highest leverage, these results are scarcely surprising. 15
16 2. DFBETAS Intended t measure the influence f an bservatin n the parameter estimates. If we were t test the hypthesis H 0 : β k = β k0 fr the k th parameter estimate, where β k0 is a given numerical value, then a suitable test statistic wuld be ˆβ k β k0 s c kk where c kk is the k th diagnal entry f (X T X) 1. This statistic has a t n p distributin. Mtivated by this, we define (DF BET AS) k(i) = ˆβ k ˆβ k(i) s (i) ckk. (12) Rule f thumb: DF BET AS > 1 fr a small data set r 2/ n fr a large data set. 16
17 Examples Tree data: 2/ n = , ffending values are (i = 18, k = 3), (i = 18, k = 1), (i = 17, k = 3) and (i = 17, k = 1). Rw 17 causing truble as well as rw 18? Nuclear pwer data: 2/ n = is exceeded by several values fr rws 7, 19 and 26 (largest verall value: fr the LN cefficient in rw 19). There are als three values in the range fr rw
18 3. Ck s D statistic Overall measure f the influence f the i th bservatin n all the parameter estimates. If we want t test H 0 : β = β 0 fr given vectr β 0, then when H 0 true, Ck defined (ˆβ β 0 ) T X T X(ˆβ β 0 ) p s 2 F p,n p. D i = (ˆβ ˆβ (i) ) T X T X(ˆβ ˆβ (i) ) p s 2 = e2 i ps 2 h ii (1 h ii ) 2. (13) Identify the i th bservatin as influential if D i is greater than the 10% pint f the F distributin, and highly influential if it is greater than the 50% pint. 18
19 Examples Tree data: largest value f Ck s D is in rw 18 fllwed by in rw 17. Fr the F 3,28 distributin, the 10% pint is and the 50% pint Again rw 18 stands ut. Nuclear pwer data: D =0.423 in rw 26, in rw 19. The 10% pint f F 7,25 is and the 50% pint
20 The mdified Ck statistic Frm the frm f (13) in cmparisn with (11) and (12), natural t ask why, in (13), we did nt use s (i) in place f s. In fact Atkinsn (1985) tk this pint f view t define a mdified Ck statistic which turns ut, after scaling by a cnstant, t be equivalent t DF F IT S. Thus if Atkinsn s pint f view is adpted, there seems n need t cnsider Ck s statistic as a separate diagnstic, since all the relevant infrmatin is in DF F IT S. Our examples rather supprt this pint f view, since it appears that Ck s D is dwngrading the evidence f influence in the case f sme bservatins which seemed highly influential when judged by the earlier diagnstics. 20
21 4. COVRATIO Measures effect f deletin n the variances f the parameter estimates. (COV RAT IO) i = Det[{XT (i) X (i) } 1 s 2 (i) ] Det{(X T X) 1 s 2 } (14) where Det[A] means the determinant f a matrix A. An equivalent frmula is ( ) p n p 1 (COV RAT IO) i = + d 2 i (1 h ii) 1. n p n p (15) The suggested criterin here is COV RAT IO 1 > 3p n. (16) 21
22 Examples Tree data: (16) gives the critical values f COVRATIO as and At lwer end f range we have nly (rw 15), i.e. variances are decreased by mitting this bservatin. At the upper end there are several ffenders (rws 1,2,3,20,31, with largest value 1.47 in rw 20) which seems t pint twards bservatins f high leverage as thse whse missin wuld tend t increase the variances. Nuclear pwer data: critical values f COV RAT IO are and Rw 7 (0.334) is influential at the lwer end, while rws 25 (2.011), 8 (1.788) and 28 (1.716) are the nes with high COV RAT IOs. Rw 25 has fairly high leverage (h ii = which is furth largest in the data set) but rws 8 and 28 (h ii = and ) d nt, s it is hard t see a clearcut explanatin f these. 22
23 Graphical methds f assessing influence Ideas due t Atkinsn (1985): refine previus rules f thumb using simulatin. N frmal hypthesis test is pssible fr high leverage, but fr the ther measures we have seen, it is pssible t cnstruct a frmal test that the bservatins are nrmal. In the case f single deletin residuals, we have seen the exact sampling distributin (t n p 1 ). Hwever, even this desn t easily extend t the prblem f the largest deletin residual in a sample (multiple testing prblem). Fr DFFITS etc., n exact test seems pssible. As an alternative, use simulatin. 23
24 Atkinsn s idea: use prbability plts (nrmal r halfnrmal). Halfnrmal plt: plt rdered abslute values f the deletin residuals against the n largest expected rder statistics frm a nrmal sample f size 2n + 1. As an apprximatin t the expected value f the k th largest rder statistic in a standard nrmal sample f size N, Atkinsn used Blm s apprximatin z (k 0.375)/(n+0.25) where z is the inverse f the standard nrmal c.d.f. This is slightly different frm the frmula z (k 0.5)/n which was prpsed in Sectin 2.6.1, thugh it makes very little difference in practice which frm is used. We fllw Atkinsn s usage here. 24
25 Examples The circles in Figure 4.2(a) shw a halfnrmal plt fr the deletin residuals fr the tree data, and Figure 4.3(a) shws the same thing fr the nuclear pwer data. In each case the plt is reasnably clse t a straight line, and even tends t flatten ff at the right hand end. Even with n ther means f assessment, this wuld suggest that the largest residuals are nt excessive utliers. The same idea can be tried fr the ther influence measures. The circles in Figures 4.2(b) and 4.3(b) shw a halfnrmal plt f the values f DF F IT S fr the tree and nuclear pwer data respectively. In the case f the tree data, the plt again appears clse t a straight line, but with the nuclear pwer data it is bvius that the largest tw values are behaving differently frm the rest. 25
26 (a) (b) Observed Value Observed Value Expected Value Expected Value Figure 4.2 Simulatin envelpe plts fr tree data. (a) Deletin residual, (b) DFFITS. 26
27 (a) (b) Observed Value Expected Value Expected Value Observed Value Figure 4.3 Simulatin envelpe plts fr nuclear pwer data. (a) Deletin residual, (b) DFFITS. 27
28 Assessing significance (Described fr deletin residuals, but same idea fr ther measures) 1. Generate a simulated sample f n standard nrmal randm variables, and calculate the deletin residuals based n that sample. OK t take β = 0 and σ 2 = 1 fr this simulatin. 2. Order the abslute values f the deletin residuals t btain a simulated sample f rder statistics. 3. Repeat this whle prcedure m times. Fr each i between 1 and n, find 5% largest and smallest values f the m simulatins fr that value f i. Mark these t btain apprximate cnfidence limits fr that value n the plt. 28
29 Results Fr the tree data (Fig. 4.2), neither f the plts (deletin residuals r DF F IT S) strays utside the envelpe. N serius utliers r influential pints in this data set. Fig. 4.3(a): same message fr deletin residuals with nuclear pwer data. But Fig. 4.3(b) is different. There really is a prblem with the influence f the tw largest data pints. Nte that this prcedure did nt adjust fr multiple cmparisn, which is pssible in principal but mre cmputatinally expensive (e.g., fr fixed envelpe ne can evaluate prbability that largest DF F IT S is utside the envelpe by simulatin). 29
30 Remedial measures First questin: is it a genuine errr (e.g. wrng data?). Even if nt, cnsider deleting bservatin and refitting mdel, but there is a danger f verding this. 30
31 Tree data There is a grup f three suspect bservatins cnsider deleting them all at nce. New estimates f β 2 and β 3 becme and , standard errrs.0584 and.1654 (cmpare ld estimates , , SEs.0750,.2044). Parameter estimates are nt significantly different. s is reduced.0814 t Questin whether it is valid t qute lwer value. F statistic fr the test β 2 = 2, β 3 = 1 is nw 1.14, cmpared with 0.17 last time, but this is still nwhere near significant. These calculatins shw that the three suspect bservatins d nt substantially affect the main questins f interest and there therefre seems n reasn t remve them frm the data. 31
32 Nuclear pwer data The nuclear pwer data were refitted withut the influential data pints in rws 19 and 26. Use same mdel as befre. The fitted regressin mdel then becmes LC = D LS NE CT 0.044LN 0.232P T + ɛ with s = Standard errrs are fr the intercept, and 0.047, 0.119, 0.065, 0.056, and fr the six cefficients. Mst f the cefficients are abut the same size, the largest differences relative t their standard errrs being in CT and LN. Indeed, accrding t the present mdel the cefficient f LN is nt significant and culd be drpped frm the mdel. The ther main thing t nte, as with the previus example, is that the residual standard deviatin s is nticably smaller. 32
33 We can repeat mst f the analyses tried the first time, with similar results. As an example, Figure 4.4(a) shws a plt f residuals against fitted values, and Figure 4.4(b) a nrmal prbability plt f the (internally) standardized residuals. 33
34 Residual Observed Value Fitted Value Expected Value Figure 4.4 Residual plts fr nuclear pwer data with rws 19 and 26 deleted. (a) Residuals against fitted values. (b) Nrmal prbability plt fr standardized residuals. 34
35 (a) (b) Observed Value Observed Value Expected Value Expected Value Figure 4.5 Simulatin envelpe plts fr nuclear pwer data with rws 19 and 26 deleted. (a) Deletin residual, (b) DFFITS. 35
36 Nrmal prbability plt again seems OK Residuals vs. fitted values plt seems mre randm than befre. A plt f residuals against P T (nt shwn) shws even strnger evidence that the variances are different with the tw values f P T. D we need t delete mre bservatins? when diagnstics are recmputed there are still questins abut sme bservatins, but Fig. 4.5 des nt suggest mre deletins are needed. In cnclusin, fr this data set there des indeed seem t be a case that the tw mst influential bservatins are distrting the whle analysis and shuld be mitted, but there d nt seem t be any further instances fr which interventin is needed. 36
37 Calculatins in R In R, Ck s D statistics available by pltting lm bjects: nuk.lm <lm(lc~d+ls+ne+ct+ln+pt) plt(nuk.lm, which=4) Sme diagnstics available using functin lm.influence : nuk.inf<lm.influence(nuk.lm) Fr example, nuk.inf$cefficients cntains all the regressin cefficients crrespnding t deletin f each bservatin in turn, nuk.inf$sigma gives the s (i) values, and nuk.inf$hat cntains the diagnal entries f the hat matrix. 37
38 Calculatins in R Further diagnstics can be calculated frm these. dfbetas, dffits, stanres, studres can be cmputed using functins in diagnse.r at curse website. These can als be incrprated int a simulatin t create simulatinbased diagnstics (e.g. prgram dnsim.r n curse web page). surce("diagnse.r") plt(stanres(nuk.lm)) plt(studres(nuk.lm)) plt(dffits(nuk.lm)) dfbetas(nuk.lm) surce("dnsim.r") attach(nukes) lc<lg(c) ls<lg(s) ln<lg(n) par(mfrw=c(1,2)) dnsim(lc, cbind(d,ls,ne,ct,ln,pt), nsim=1000) 38
A Matrix Representation of Panel Data
web Extensin 6 Appendix 6.A A Matrix Representatin f Panel Data Panel data mdels cme in tw brad varieties, distinct intercept DGPs and errr cmpnent DGPs. his appendix presents matrix algebra representatins
More informationCHAPTER 24: INFERENCE IN REGRESSION. Chapter 24: Make inferences about the population from which the sample data came.
MATH 1342 Ch. 24 April 25 and 27, 2013 Page 1 f 5 CHAPTER 24: INFERENCE IN REGRESSION Chapters 4 and 5: Relatinships between tw quantitative variables. Be able t Make a graph (scatterplt) Summarize the
More informationSUPPLEMENTARY MATERIAL GaGa: a simple and flexible hierarchical model for microarray data analysis
SUPPLEMENTARY MATERIAL GaGa: a simple and flexible hierarchical mdel fr micrarray data analysis David Rssell Department f Bistatistics M.D. Andersn Cancer Center, Hustn, TX 77030, USA rsselldavid@gmail.cm
More information4th Indian Institute of Astrophysics - PennState Astrostatistics School July, 2013 Vainu Bappu Observatory, Kavalur. Correlation and Regression
4th Indian Institute f Astrphysics - PennState Astrstatistics Schl July, 2013 Vainu Bappu Observatry, Kavalur Crrelatin and Regressin Rahul Ry Indian Statistical Institute, Delhi. Crrelatin Cnsider a tw
More informationBootstrap Method > # Purpose: understand how bootstrap method works > obs=c(11.96, 5.03, 67.40, 16.07, 31.50, 7.73, 11.10, 22.38) > n=length(obs) >
Btstrap Methd > # Purpse: understand hw btstrap methd wrks > bs=c(11.96, 5.03, 67.40, 16.07, 31.50, 7.73, 11.10, 22.38) > n=length(bs) > mean(bs) [1] 21.64625 > # estimate f lambda > lambda = 1/mean(bs);
More informationAP Statistics Notes Unit Two: The Normal Distributions
AP Statistics Ntes Unit Tw: The Nrmal Distributins Syllabus Objectives: 1.5 The student will summarize distributins f data measuring the psitin using quartiles, percentiles, and standardized scres (z-scres).
More informationREADING STATECHART DIAGRAMS
READING STATECHART DIAGRAMS Figure 4.48 A Statechart diagram with events The diagram in Figure 4.48 shws all states that the bject plane can be in during the curse f its life. Furthermre, it shws the pssible
More informationChapter 3: Cluster Analysis
Chapter 3: Cluster Analysis } 3.1 Basic Cncepts f Clustering 3.1.1 Cluster Analysis 3.1. Clustering Categries } 3. Partitining Methds 3..1 The principle 3.. K-Means Methd 3..3 K-Medids Methd 3..4 CLARA
More informationHypothesis Tests for One Population Mean
Hypthesis Tests fr One Ppulatin Mean Chapter 9 Ala Abdelbaki Objective Objective: T estimate the value f ne ppulatin mean Inferential statistics using statistics in rder t estimate parameters We will be
More informationWe say that y is a linear function of x if. Chapter 13: The Correlation Coefficient and the Regression Line
Chapter 13: The Crrelatin Cefficient and the Regressin Line We begin with a sme useful facts abut straight lines. Recall the x, y crdinate system, as pictured belw. 3 2 1 y = 2.5 y = 0.5x 3 2 1 1 2 3 1
More informationResampling Methods. Chapter 5. Chapter 5 1 / 52
Resampling Methds Chapter 5 Chapter 5 1 / 52 1 51 Validatin set apprach 2 52 Crss validatin 3 53 Btstrap Chapter 5 2 / 52 Abut Resampling An imprtant statistical tl Pretending the data as ppulatin and
More informationModelling of Clock Behaviour. Don Percival. Applied Physics Laboratory University of Washington Seattle, Washington, USA
Mdelling f Clck Behaviur Dn Percival Applied Physics Labratry University f Washingtn Seattle, Washingtn, USA verheads and paper fr talk available at http://faculty.washingtn.edu/dbp/talks.html 1 Overview
More information, which yields. where z1. and z2
The Gaussian r Nrmal PDF, Page 1 The Gaussian r Nrmal Prbability Density Functin Authr: Jhn M Cimbala, Penn State University Latest revisin: 11 September 13 The Gaussian r Nrmal Prbability Density Functin
More informationDifferentiation Applications 1: Related Rates
Differentiatin Applicatins 1: Related Rates 151 Differentiatin Applicatins 1: Related Rates Mdel 1: Sliding Ladder 10 ladder y 10 ladder 10 ladder A 10 ft ladder is leaning against a wall when the bttm
More informationCS 477/677 Analysis of Algorithms Fall 2007 Dr. George Bebis Course Project Due Date: 11/29/2007
CS 477/677 Analysis f Algrithms Fall 2007 Dr. Gerge Bebis Curse Prject Due Date: 11/29/2007 Part1: Cmparisn f Srting Algrithms (70% f the prject grade) The bjective f the first part f the assignment is
More informationLab 1 The Scientific Method
INTRODUCTION The fllwing labratry exercise is designed t give yu, the student, an pprtunity t explre unknwn systems, r universes, and hypthesize pssible rules which may gvern the behavir within them. Scientific
More informationThis section is primarily focused on tools to aid us in finding roots/zeros/ -intercepts of polynomials. Essentially, our focus turns to solving.
Sectin 3.2: Many f yu WILL need t watch the crrespnding vides fr this sectin n MyOpenMath! This sectin is primarily fcused n tls t aid us in finding rts/zers/ -intercepts f plynmials. Essentially, ur fcus
More informationComparing Several Means: ANOVA. Group Means and Grand Mean
STAT 511 ANOVA and Regressin 1 Cmparing Several Means: ANOVA Slide 1 Blue Lake snap beans were grwn in 12 pen-tp chambers which are subject t 4 treatments 3 each with O 3 and SO 2 present/absent. The ttal
More informationDistributions, spatial statistics and a Bayesian perspective
Distributins, spatial statistics and a Bayesian perspective Dug Nychka Natinal Center fr Atmspheric Research Distributins and densities Cnditinal distributins and Bayes Thm Bivariate nrmal Spatial statistics
More informationTHE LIFE OF AN OBJECT IT SYSTEMS
THE LIFE OF AN OBJECT IT SYSTEMS Persns, bjects, r cncepts frm the real wrld, which we mdel as bjects in the IT system, have "lives". Actually, they have tw lives; the riginal in the real wrld has a life,
More informationPSU GISPOPSCI June 2011 Ordinary Least Squares & Spatial Linear Regression in GeoDa
There are tw parts t this lab. The first is intended t demnstrate hw t request and interpret the spatial diagnstics f a standard OLS regressin mdel using GeDa. The diagnstics prvide infrmatin abut the
More informationMath Foundations 20 Work Plan
Math Fundatins 20 Wrk Plan Units / Tpics 20.8 Demnstrate understanding f systems f linear inequalities in tw variables. Time Frame December 1-3 weeks 6-10 Majr Learning Indicatrs Identify situatins relevant
More informationPhysics 2010 Motion with Constant Acceleration Experiment 1
. Physics 00 Mtin with Cnstant Acceleratin Experiment In this lab, we will study the mtin f a glider as it accelerates dwnhill n a tilted air track. The glider is supprted ver the air track by a cushin
More informationCHM112 Lab Graphing with Excel Grading Rubric
Name CHM112 Lab Graphing with Excel Grading Rubric Criteria Pints pssible Pints earned Graphs crrectly pltted and adhere t all guidelines (including descriptive title, prperly frmatted axes, trendline
More informationInternal vs. external validity. External validity. This section is based on Stock and Watson s Chapter 9.
Sectin 7 Mdel Assessment This sectin is based n Stck and Watsn s Chapter 9. Internal vs. external validity Internal validity refers t whether the analysis is valid fr the ppulatin and sample being studied.
More informationExperiment #3. Graphing with Excel
Experiment #3. Graphing with Excel Study the "Graphing with Excel" instructins that have been prvided. Additinal help with learning t use Excel can be fund n several web sites, including http://www.ncsu.edu/labwrite/res/gt/gt-
More informationMODULE 1. e x + c. [You can t separate a demominator, but you can divide a single denominator into each numerator term] a + b a(a + b)+1 = a + b
. REVIEW OF SOME BASIC ALGEBRA MODULE () Slving Equatins Yu shuld be able t slve fr x: a + b = c a d + e x + c and get x = e(ba +) b(c a) d(ba +) c Cmmn mistakes and strategies:. a b + c a b + a c, but
More informationPerfrmance f Sensitizing Rules n Shewhart Cntrl Charts with Autcrrelated Data Key Wrds: Autregressive, Mving Average, Runs Tests, Shewhart Cntrl Chart
Perfrmance f Sensitizing Rules n Shewhart Cntrl Charts with Autcrrelated Data Sandy D. Balkin Dennis K. J. Lin y Pennsylvania State University, University Park, PA 16802 Sandy Balkin is a graduate student
More informationMATHEMATICS SYLLABUS SECONDARY 5th YEAR
Eurpean Schls Office f the Secretary-General Pedaggical Develpment Unit Ref. : 011-01-D-8-en- Orig. : EN MATHEMATICS SYLLABUS SECONDARY 5th YEAR 6 perid/week curse APPROVED BY THE JOINT TEACHING COMMITTEE
More informationPattern Recognition 2014 Support Vector Machines
Pattern Recgnitin 2014 Supprt Vectr Machines Ad Feelders Universiteit Utrecht Ad Feelders ( Universiteit Utrecht ) Pattern Recgnitin 1 / 55 Overview 1 Separable Case 2 Kernel Functins 3 Allwing Errrs (Sft
More informationSPH3U1 Lesson 06 Kinematics
PROJECTILE MOTION LEARNING GOALS Students will: Describe the mtin f an bject thrwn at arbitrary angles thrugh the air. Describe the hrizntal and vertical mtins f a prjectile. Slve prjectile mtin prblems.
More informationmaking triangle (ie same reference angle) ). This is a standard form that will allow us all to have the X= y=
Intrductin t Vectrs I 21 Intrductin t Vectrs I 22 I. Determine the hrizntal and vertical cmpnents f the resultant vectr by cunting n the grid. X= y= J. Draw a mangle with hrizntal and vertical cmpnents
More informationIN a recent article, Geary [1972] discussed the merit of taking first differences
The Efficiency f Taking First Differences in Regressin Analysis: A Nte J. A. TILLMAN IN a recent article, Geary [1972] discussed the merit f taking first differences t deal with the prblems that trends
More informationT Algorithmic methods for data mining. Slide set 6: dimensionality reduction
T-61.5060 Algrithmic methds fr data mining Slide set 6: dimensinality reductin reading assignment LRU bk: 11.1 11.3 PCA tutrial in mycurses (ptinal) ptinal: An Elementary Prf f a Therem f Jhnsn and Lindenstrauss,
More informationComputational modeling techniques
Cmputatinal mdeling techniques Lecture 2: Mdeling change. In Petre Department f IT, Åb Akademi http://users.ab.fi/ipetre/cmpmd/ Cntent f the lecture Basic paradigm f mdeling change Examples Linear dynamical
More informationInference in the Multiple-Regression
Sectin 5 Mdel Inference in the Multiple-Regressin Kinds f hypthesis tests in a multiple regressin There are several distinct kinds f hypthesis tests we can run in a multiple regressin. Suppse that amng
More information(2) Even if such a value of k was possible, the neutrons multiply
CHANGE OF REACTOR Nuclear Thery - Curse 227 POWER WTH REACTVTY CHANGE n this lessn, we will cnsider hw neutrn density, neutrn flux and reactr pwer change when the multiplicatin factr, k, r the reactivity,
More informationDrought damaged area
ESTIMATE OF THE AMOUNT OF GRAVEL CO~TENT IN THE SOIL BY A I R B O'RN EMS S D A T A Y. GOMI, H. YAMAMOTO, AND S. SATO ASIA AIR SURVEY CO., l d. KANAGAWA,JAPAN S.ISHIGURO HOKKAIDO TOKACHI UBPREFECTRAl OffICE
More informationOn Huntsberger Type Shrinkage Estimator for the Mean of Normal Distribution ABSTRACT INTRODUCTION
Malaysian Jurnal f Mathematical Sciences 4(): 7-4 () On Huntsberger Type Shrinkage Estimatr fr the Mean f Nrmal Distributin Department f Mathematical and Physical Sciences, University f Nizwa, Sultanate
More informationAP Statistics Practice Test Unit Three Exploring Relationships Between Variables. Name Period Date
AP Statistics Practice Test Unit Three Explring Relatinships Between Variables Name Perid Date True r False: 1. Crrelatin and regressin require explanatry and respnse variables. 1. 2. Every least squares
More information**DO NOT ONLY RELY ON THIS STUDY GUIDE!!!**
Tpics lists: UV-Vis Absrbance Spectrscpy Lab & ChemActivity 3-6 (nly thrugh 4) I. UV-Vis Absrbance Spectrscpy Lab Beer s law Relates cncentratin f a chemical species in a slutin and the absrbance f that
More informationNAME: Prof. Ruiz. 1. [5 points] What is the difference between simple random sampling and stratified random sampling?
CS4445 ata Mining and Kwledge iscery in atabases. B Term 2014 Exam 1 Nember 24, 2014 Prf. Carlina Ruiz epartment f Cmputer Science Wrcester Plytechnic Institute NAME: Prf. Ruiz Prblem I: Prblem II: Prblem
More informationUNIV1"'RSITY OF NORTH CAROLINA Department of Statistics Chapel Hill, N. C. CUMULATIVE SUM CONTROL CHARTS FOR THE FOLDED NORMAL DISTRIBUTION
UNIV1"'RSITY OF NORTH CAROLINA Department f Statistics Chapel Hill, N. C. CUMULATIVE SUM CONTROL CHARTS FOR THE FOLDED NORMAL DISTRIBUTION by N. L. Jlmsn December 1962 Grant N. AFOSR -62..148 Methds f
More informationCONSTRUCTING STATECHART DIAGRAMS
CONSTRUCTING STATECHART DIAGRAMS The fllwing checklist shws the necessary steps fr cnstructing the statechart diagrams f a class. Subsequently, we will explain the individual steps further. Checklist 4.6
More informationLead/Lag Compensator Frequency Domain Properties and Design Methods
Lectures 6 and 7 Lead/Lag Cmpensatr Frequency Dmain Prperties and Design Methds Definitin Cnsider the cmpensatr (ie cntrller Fr, it is called a lag cmpensatr s K Fr s, it is called a lead cmpensatr Ntatin
More informationSmoothing, penalized least squares and splines
Smthing, penalized least squares and splines Duglas Nychka, www.image.ucar.edu/~nychka Lcally weighted averages Penalized least squares smthers Prperties f smthers Splines and Reprducing Kernels The interplatin
More informationI. Analytical Potential and Field of a Uniform Rod. V E d. The definition of electric potential difference is
Length L>>a,b,c Phys 232 Lab 4 Ch 17 Electric Ptential Difference Materials: whitebards & pens, cmputers with VPythn, pwer supply & cables, multimeter, crkbard, thumbtacks, individual prbes and jined prbes,
More informationECE 5318/6352 Antenna Engineering. Spring 2006 Dr. Stuart Long. Chapter 6. Part 7 Schelkunoff s Polynomial
ECE 538/635 Antenna Engineering Spring 006 Dr. Stuart Lng Chapter 6 Part 7 Schelkunff s Plynmial 7 Schelkunff s Plynmial Representatin (fr discrete arrays) AF( ψ ) N n 0 A n e jnψ N number f elements in
More informationarxiv:hep-ph/ v1 2 Jun 1995
WIS-95//May-PH The rati F n /F p frm the analysis f data using a new scaling variable S. A. Gurvitz arxiv:hep-ph/95063v1 Jun 1995 Department f Particle Physics, Weizmann Institute f Science, Rehvt 76100,
More informationDetermining the Accuracy of Modal Parameter Estimation Methods
Determining the Accuracy f Mdal Parameter Estimatin Methds by Michael Lee Ph.D., P.E. & Mar Richardsn Ph.D. Structural Measurement Systems Milpitas, CA Abstract The mst cmmn type f mdal testing system
More informationWhat is Statistical Learning?
What is Statistical Learning? Sales 5 10 15 20 25 Sales 5 10 15 20 25 Sales 5 10 15 20 25 0 50 100 200 300 TV 0 10 20 30 40 50 Radi 0 20 40 60 80 100 Newspaper Shwn are Sales vs TV, Radi and Newspaper,
More informationBASD HIGH SCHOOL FORMAL LAB REPORT
BASD HIGH SCHOOL FORMAL LAB REPORT *WARNING: After an explanatin f what t include in each sectin, there is an example f hw the sectin might lk using a sample experiment Keep in mind, the sample lab used
More informationNUMBERS, MATHEMATICS AND EQUATIONS
AUSTRALIAN CURRICULUM PHYSICS GETTING STARTED WITH PHYSICS NUMBERS, MATHEMATICS AND EQUATIONS An integral part t the understanding f ur physical wrld is the use f mathematical mdels which can be used t
More informationPerformance Bounds for Detect and Avoid Signal Sensing
Perfrmance unds fr Detect and Avid Signal Sensing Sam Reisenfeld Real-ime Infrmatin etwrks, University f echnlgy, Sydney, radway, SW 007, Australia samr@uts.edu.au Abstract Detect and Avid (DAA) is a Cgnitive
More informationCESAR Science Case The differential rotation of the Sun and its Chromosphere. Introduction. Material that is necessary during the laboratory
Teacher s guide CESAR Science Case The differential rtatin f the Sun and its Chrmsphere Material that is necessary during the labratry CESAR Astrnmical wrd list CESAR Bklet CESAR Frmula sheet CESAR Student
More informationHow do scientists measure trees? What is DBH?
Hw d scientists measure trees? What is DBH? Purpse Students develp an understanding f tree size and hw scientists measure trees. Students bserve and measure tree ckies and explre the relatinship between
More informationFall 2013 Physics 172 Recitation 3 Momentum and Springs
Fall 03 Physics 7 Recitatin 3 Mmentum and Springs Purpse: The purpse f this recitatin is t give yu experience wrking with mmentum and the mmentum update frmula. Readings: Chapter.3-.5 Learning Objectives:.3.
More informationPhys. 344 Ch 7 Lecture 8 Fri., April. 10 th,
Phys. 344 Ch 7 Lecture 8 Fri., April. 0 th, 009 Fri. 4/0 8. Ising Mdel f Ferrmagnets HW30 66, 74 Mn. 4/3 Review Sat. 4/8 3pm Exam 3 HW Mnday: Review fr est 3. See n-line practice test lecture-prep is t
More informationENSC Discrete Time Systems. Project Outline. Semester
ENSC 49 - iscrete Time Systems Prject Outline Semester 006-1. Objectives The gal f the prject is t design a channel fading simulatr. Upn successful cmpletin f the prject, yu will reinfrce yur understanding
More informationChapter Summary. Mathematical Induction Strong Induction Recursive Definitions Structural Induction Recursive Algorithms
Chapter 5 1 Chapter Summary Mathematical Inductin Strng Inductin Recursive Definitins Structural Inductin Recursive Algrithms Sectin 5.1 3 Sectin Summary Mathematical Inductin Examples f Prf by Mathematical
More informationCambridge Assessment International Education Cambridge Ordinary Level. Published
Cambridge Assessment Internatinal Educatin Cambridge Ordinary Level ADDITIONAL MATHEMATICS 4037/1 Paper 1 Octber/Nvember 017 MARK SCHEME Maximum Mark: 80 Published This mark scheme is published as an aid
More informationInterference is when two (or more) sets of waves meet and combine to produce a new pattern.
Interference Interference is when tw (r mre) sets f waves meet and cmbine t prduce a new pattern. This pattern can vary depending n the riginal wave directin, wavelength, amplitude, etc. The tw mst extreme
More informationThermodynamics and Equilibrium
Thermdynamics and Equilibrium Thermdynamics Thermdynamics is the study f the relatinship between heat and ther frms f energy in a chemical r physical prcess. We intrduced the thermdynamic prperty f enthalpy,
More informationSection 5.8 Notes Page Exponential Growth and Decay Models; Newton s Law
Sectin 5.8 Ntes Page 1 5.8 Expnential Grwth and Decay Mdels; Newtn s Law There are many applicatins t expnential functins that we will fcus n in this sectin. First let s lk at the expnential mdel. Expnential
More informationPhysics 212. Lecture 12. Today's Concept: Magnetic Force on moving charges. Physics 212 Lecture 12, Slide 1
Physics 1 Lecture 1 Tday's Cncept: Magnetic Frce n mving charges F qv Physics 1 Lecture 1, Slide 1 Music Wh is the Artist? A) The Meters ) The Neville rthers C) Trmbne Shrty D) Michael Franti E) Radiatrs
More information3.4 Shrinkage Methods Prostate Cancer Data Example (Continued) Ridge Regression
3.3.4 Prstate Cancer Data Example (Cntinued) 3.4 Shrinkage Methds 61 Table 3.3 shws the cefficients frm a number f different selectin and shrinkage methds. They are best-subset selectin using an all-subsets
More informationKinetic Model Completeness
5.68J/10.652J Spring 2003 Lecture Ntes Tuesday April 15, 2003 Kinetic Mdel Cmpleteness We say a chemical kinetic mdel is cmplete fr a particular reactin cnditin when it cntains all the species and reactins
More informationSIZE BIAS IN LINE TRANSECT SAMPLING: A FIELD TEST. Mark C. Otto Statistics Research Division, Bureau of the Census Washington, D.C , U.S.A.
SIZE BIAS IN LINE TRANSECT SAMPLING: A FIELD TEST Mark C. Ott Statistics Research Divisin, Bureau f the Census Washingtn, D.C. 20233, U.S.A. and Kenneth H. Pllck Department f Statistics, Nrth Carlina State
More information2004 AP CHEMISTRY FREE-RESPONSE QUESTIONS
2004 AP CHEMISTRY FREE-RESPONSE QUESTIONS 6. An electrchemical cell is cnstructed with an pen switch, as shwn in the diagram abve. A strip f Sn and a strip f an unknwn metal, X, are used as electrdes.
More informationChecking the resolved resonance region in EXFOR database
Checking the reslved resnance regin in EXFOR database Gttfried Bertn Sciété de Calcul Mathématique (SCM) Oscar Cabells OECD/NEA Data Bank JEFF Meetings - Sessin JEFF Experiments Nvember 0-4, 017 Bulgne-Billancurt,
More informationLecture 2: Supervised vs. unsupervised learning, bias-variance tradeoff
Lecture 2: Supervised vs. unsupervised learning, bias-variance tradeff Reading: Chapter 2 STATS 202: Data mining and analysis September 27, 2017 1 / 20 Supervised vs. unsupervised learning In unsupervised
More information1996 Engineering Systems Design and Analysis Conference, Montpellier, France, July 1-4, 1996, Vol. 7, pp
THE POWER AND LIMIT OF NEURAL NETWORKS T. Y. Lin Department f Mathematics and Cmputer Science San Jse State University San Jse, Califrnia 959-003 tylin@cs.ssu.edu and Bereley Initiative in Sft Cmputing*
More informationThermodynamics Partial Outline of Topics
Thermdynamics Partial Outline f Tpics I. The secnd law f thermdynamics addresses the issue f spntaneity and invlves a functin called entrpy (S): If a prcess is spntaneus, then Suniverse > 0 (2 nd Law!)
More informationMATCHING TECHNIQUES. Technical Track Session VI. Emanuela Galasso. The World Bank
MATCHING TECHNIQUES Technical Track Sessin VI Emanuela Galass The Wrld Bank These slides were develped by Christel Vermeersch and mdified by Emanuela Galass fr the purpse f this wrkshp When can we use
More informationLesson Plan. Recode: They will do a graphic organizer to sequence the steps of scientific method.
Lessn Plan Reach: Ask the students if they ever ppped a bag f micrwave ppcrn and nticed hw many kernels were unppped at the bttm f the bag which made yu wnder if ther brands pp better than the ne yu are
More informationLecture 2: Supervised vs. unsupervised learning, bias-variance tradeoff
Lecture 2: Supervised vs. unsupervised learning, bias-variance tradeff Reading: Chapter 2 STATS 202: Data mining and analysis September 27, 2017 1 / 20 Supervised vs. unsupervised learning In unsupervised
More informationTuring Machines. Human-aware Robotics. 2017/10/17 & 19 Chapter 3.2 & 3.3 in Sipser Ø Announcement:
Turing Machines Human-aware Rbtics 2017/10/17 & 19 Chapter 3.2 & 3.3 in Sipser Ø Annuncement: q q q q Slides fr this lecture are here: http://www.public.asu.edu/~yzhan442/teaching/cse355/lectures/tm-ii.pdf
More informationA Regression Solution to the Problem of Criterion Score Comparability
A Regressin Slutin t the Prblem f Criterin Scre Cmparability William M. Pugh Naval Health Research Center When the criterin measure in a study is the accumulatin f respnses r behavirs fr an individual
More informationChapter 2 GAUSS LAW Recommended Problems:
Chapter GAUSS LAW Recmmended Prblems: 1,4,5,6,7,9,11,13,15,18,19,1,7,9,31,35,37,39,41,43,45,47,49,51,55,57,61,6,69. LCTRIC FLUX lectric flux is a measure f the number f electric filed lines penetrating
More informationChapter 13: The Correlation Coefficient and the Regression Line. We begin with a some useful facts about straight lines.
Chapter 13: The Crrelatin Cefficient and the Regressin Line We begin with a sme useful facts abut straight lines. Recall the x, y crdinate system, as pictured belw. 3 2 y = 2.5 1 y = 0.5x 3 2 1 1 2 3 1
More information5 th grade Common Core Standards
5 th grade Cmmn Cre Standards In Grade 5, instructinal time shuld fcus n three critical areas: (1) develping fluency with additin and subtractin f fractins, and develping understanding f the multiplicatin
More informationSection 6-2: Simplex Method: Maximization with Problem Constraints of the Form ~
Sectin 6-2: Simplex Methd: Maximizatin with Prblem Cnstraints f the Frm ~ Nte: This methd was develped by Gerge B. Dantzig in 1947 while n assignment t the U.S. Department f the Air Frce. Definitin: Standard
More informationIntroduction to Smith Charts
Intrductin t Smith Charts Dr. Russell P. Jedlicka Klipsch Schl f Electrical and Cmputer Engineering New Mexic State University as Cruces, NM 88003 September 2002 EE521 ecture 3 08/22/02 Smith Chart Summary
More informationALE 21. Gibbs Free Energy. At what temperature does the spontaneity of a reaction change?
Name Chem 163 Sectin: Team Number: ALE 21. Gibbs Free Energy (Reference: 20.3 Silberberg 5 th editin) At what temperature des the spntaneity f a reactin change? The Mdel: The Definitin f Free Energy S
More informationThe Law of Total Probability, Bayes Rule, and Random Variables (Oh My!)
The Law f Ttal Prbability, Bayes Rule, and Randm Variables (Oh My!) Administrivia Hmewrk 2 is psted and is due tw Friday s frm nw If yu didn t start early last time, please d s this time. Gd Milestnes:
More informationCHAPTER 8 ANALYSIS OF DESIGNED EXPERIMENTS
CHAPTER 8 ANALYSIS OF DESIGNED EXPERIMENTS Discuss experiments whse main aim is t study and cmpare the effects f treatments (diets, varieties, dses) by measuring respnse (yield, weight gain) n plts r units
More informationHubble s Law PHYS 1301
1 PHYS 1301 Hubble s Law Why: The lab will verify Hubble s law fr the expansin f the universe which is ne f the imprtant cnsequences f general relativity. What: Frm measurements f the angular size and
More informationDepartment of Economics, University of California, Davis Ecn 200C Micro Theory Professor Giacomo Bonanno. Insurance Markets
Department f Ecnmics, University f alifrnia, Davis Ecn 200 Micr Thery Prfessr Giacm Bnann Insurance Markets nsider an individual wh has an initial wealth f. ith sme prbability p he faces a lss f x (0
More informationLCAO APPROXIMATIONS OF ORGANIC Pi MO SYSTEMS The allyl system (cation, anion or radical).
Principles f Organic Chemistry lecture 5, page LCAO APPROIMATIONS OF ORGANIC Pi MO SYSTEMS The allyl system (catin, anin r radical).. Draw mlecule and set up determinant. 2 3 0 3 C C 2 = 0 C 2 3 0 = -
More informationModule 3: Gaussian Process Parameter Estimation, Prediction Uncertainty, and Diagnostics
Mdule 3: Gaussian Prcess Parameter Estimatin, Predictin Uncertainty, and Diagnstics Jerme Sacks and William J Welch Natinal Institute f Statistical Sciences and University f British Clumbia Adapted frm
More informationMedium Scale Integrated (MSI) devices [Sections 2.9 and 2.10]
EECS 270, Winter 2017, Lecture 3 Page 1 f 6 Medium Scale Integrated (MSI) devices [Sectins 2.9 and 2.10] As we ve seen, it s smetimes nt reasnable t d all the design wrk at the gate-level smetimes we just
More informationFIELD QUALITY IN ACCELERATOR MAGNETS
FIELD QUALITY IN ACCELERATOR MAGNETS S. Russenschuck CERN, 1211 Geneva 23, Switzerland Abstract The field quality in the supercnducting magnets is expressed in terms f the cefficients f the Furier series
More informationWe can see from the graph above that the intersection is, i.e., [ ).
MTH 111 Cllege Algebra Lecture Ntes July 2, 2014 Functin Arithmetic: With nt t much difficulty, we ntice that inputs f functins are numbers, and utputs f functins are numbers. S whatever we can d with
More informationOF SIMPLY SUPPORTED PLYWOOD PLATES UNDER COMBINED EDGEWISE BENDING AND COMPRESSION
U. S. FOREST SERVICE RESEARCH PAPER FPL 50 DECEMBER U. S. DEPARTMENT OF AGRICULTURE FOREST SERVICE FOREST PRODUCTS LABORATORY OF SIMPLY SUPPORTED PLYWOOD PLATES UNDER COMBINED EDGEWISE BENDING AND COMPRESSION
More informationPipetting 101 Developed by BSU CityLab
Discver the Micrbes Within: The Wlbachia Prject Pipetting 101 Develped by BSU CityLab Clr Cmparisns Pipetting Exercise #1 STUDENT OBJECTIVES Students will be able t: Chse the crrect size micrpipette fr
More informationTree Structured Classifier
Tree Structured Classifier Reference: Classificatin and Regressin Trees by L. Breiman, J. H. Friedman, R. A. Olshen, and C. J. Stne, Chapman & Hall, 98. A Medical Eample (CART): Predict high risk patients
More information1b) =.215 1c).080/.215 =.372
Practice Exam 1 - Answers 1. / \.1/ \.9 (D+) (D-) / \ / \.8 / \.2.15/ \.85 (T+) (T-) (T+) (T-).080.020.135.765 1b).080 +.135 =.215 1c).080/.215 =.372 2. The data shwn in the scatter plt is the distance
More informationMATCHING TECHNIQUES Technical Track Session VI Céline Ferré The World Bank
MATCHING TECHNIQUES Technical Track Sessin VI Céline Ferré The Wrld Bank When can we use matching? What if the assignment t the treatment is nt dne randmly r based n an eligibility index, but n the basis
More informationSimple Linear Regression (single variable)
Simple Linear Regressin (single variable) Intrductin t Machine Learning Marek Petrik January 31, 2017 Sme f the figures in this presentatin are taken frm An Intrductin t Statistical Learning, with applicatins
More information