II. Descriptive Statistics D. Linear Correlation and Regression. 1. Linear Correlation

 Kerrie Powell
 7 months ago
 Views:
Transcription
1 II. Descriptive Statistics D. Liear Correlatio ad Regressio I this sectio Liear Correlatio Cause ad Effect Liear Regressio 1. Liear Correlatio Quatifyig Liear Correlatio The Pearso productmomet correlatio coefficiet, deoted as r, describes a liear relatioship betwee two quatitative variables. It is importat to otice that whe lookig at this value, it oly idicates the liear relatioship. You could have aother kid of relatioship preset i the data. The r value idicates both the stregth of the liear associatio ad its directio. You will ot eed to calculate a r formula is below: value by had, but i case you are iterested the SXY r SSX SSY where SSX x x SSY SXY x y y y x x y y x y xy x y You will be required to iterpret what a r directio of the relatioship. value tells you. We will start with the Positive r suggests large values of X ad Y occur together ad that small values of X ad Y occur together. This meas that the slope of the lie that best fits the poits is positive. A example would be Experiece ad Salary. People with lower levels of experiece ted to have lower salaries ad people with more experiece ted to have higher salaries. Negative r suggests large values of oe variable ted to occur with small values of the other variable. This meas that the slope of the lie that best fits the poits is egative. A example would be Weight of a car ad Gas mileage. Light cars ted to have higher gas mileage ad heavier cars ted to have lower gas mileage. 1
2 So the sig of the r value tells us the directio of the relatioship. Stregth of the relatioship is measured by the actual value of the umber. By the term stregth, we mea how close are the poits to a lie? The closer the poits are to a lie, the stroger the relatioship. Because of the setup of r, the maximum value for r (i terms of absolute value) is 1. Below are some useful thigs to keep i mid. 1 r 1 If the there is perfect positive liear correlatio all data are exactly o a lie with positive slope If the there is perfect egative liear correlatio all data are exactly o a lie with egative slope If the there is o liear relatioship (keep i mid there could be aother type of correlatio) r 1 r 1 r The stroger the liear relatioship, the larger Geerally, we will say there is a strog relatioship if r (the closer to 1 this value will be). r.75 Lookig back to our example from the last sectio whe itroducig scatterplots we had X = Dosage of Drug ad Y = Reductio i Blood Pressure, what do you thik the r value will be? Remember based o the scatterplot that the poits had a strog positive liear relatioship. Not perfect but pretty close meaig the r value should be close to 1. If you calculate this value, you will get r This should seem reasoable as it supports what we idetified i the graph. Aother measure, you will sometimes see reported is the Rsquared value. It is commo for computer software to give you a Rsquared value istead of r. This value represets the percet of variatio i Y explaied by the model. It measures the stregth of the relatioship ad i the liear case is simply calculated by squarig the r value. The higher Rsquared is, the better the model. % R 1% For the Drug example r R %
3 . Cause ad Effect Causal Research Whe the objective is to determie if a variable causes a certai behavior (whether there is a cause ad effect relatioship betwee variables) Note: it is ever possible to prove causality just based o the relatioship betwee two variables There is a strog statistical correlatio over the moths of the year betwee ice cream cosumptio ad the umber of assaults i the U.S. The r value for this data is above.9. Does this mea ice cream maufacturers are resposible for crime? No! The correlatio occurs statistically because the hot temperatures of summer icreases both ice cream cosumptio ad assaults (High values occur at the same time ad low values occur at the same time) Thus, correlatio does NOT imply causatio. This is oe of the biggest mistakes that I see i the iterpretatio of a correlatio. You should always keep i mid that other factors besides cause ad effect ca create a observed correlatio. To establish whether two variables are causally related you must establish all of the followig: 1) Time order  the cause must have occurred before the effect ) Covariatio (statistical associatio) the correlatio coefficiet ad graph must show a strog relatioship betwee the depedet ad idepedet variable 3) Ratioale  there must be a logical ad compellig explaatio for why oe variable causes the other 4) Nospuriousess  it must be established that the idepedet variable X, ad oly X, was the cause of chages i the depedet variable Y; rival explaatios must be ruled out The first three of these ca be easily established i may cases. It is the fourth criteria which is hard ad ca rarely be show. To help idetify a relatioship as cause ad effect, a study should be performed may times. The study should yield the same results every time it is coducted. Give that the outside variables will differ from situatio to situatio, this helps rule out rival explaatios. Causal research is very complex ad the researcher ca rarely be certai that other factors are ot ifluecig a relatioship. 3
4 3. Liear Regressio Determiistic View This is the idea that Y is caused by X or that oce X has happeed, Y will follow. I this situatio, the exact value of Y is kow. The determiistic view is studied i a typical algebra class. However, a determiistic view whe applied to the behavior of may variables is ot possible. Regressio A techique used to predict variables (typically difficult to measure variables) based o a set of other variables (typically easier to measure variables). Liear Regressio Used to predict the value of Y (the respose variable), based o X (the explaatory variable) usig a liear equatio. Predict reactio time based o blood alcohol level. Reactio time is difficult to measure so istead we predict it with blood alcohol level which is easy to measure. The liear regressio model expresses Y as a fuctio of X plus radom error. Radom error reflects variatio i Y values. Keep i mid we are goig to measure X, so assumig we get a good measure there is o error i the X variable. However, whe we go to use X to predict Y, the predictio will ot be exact. Therefore, there is error i the Y variable. Graphically this error is represeted by the vertical distace betwee the poits ad the lie. The liear regressio model is: b b x Y 1 where b is the yitercept b 1 is the slope The above formula is the same format as what you should be used to from a algebra class. However, the way we deote the relatioship is differet. It is importat you become familiar with this otatio. I order to use liear regressio, we must first make sure the model is reasoable. The scatter plot ad r should idicate a strog relatioship. If the model is ot reasoable, do ot fit a lie. It may still be possible to do regressio with a more complicated model. However, if there is o relatioship betwee the variables the regressio caot be used. I this class we will ot worry about more complicated models, but you should uderstad that a simple liear model is just oe of the may optios available. 4
5 Whe usig a liear regressio model, we eed the lie that is the best fit for our data. Sice our purpose will be to predict, we will wat to pick the lie that will miimize the error i the predictio. To accomplish this we will use the method of leastsquares. Method of LeastSquares says that the sum of the squares of the vertical distaces from the poits to the lie is miimized. Remember it is the vertical distace that represets the error. To calculate the best fit lie you ca use the followig formulas. You do ot have to do this by had i this class. I show you the formulas i case you are iterested. SXY SSX ( x x)( y y) ( x x) xy x x y x b1 b y b1 x At the begiig of this sectio whe lookig at correlatio for the Dosage of drug ad Reductio i blood pressure example we idetified r which idicates a high positive liear correlatio. This fact alog with the scatterplot supports the use of liear regressio i this case. With the above formulas, you ca calculate b ad b Therefore, the regressio model i this case is: y b b x y x 1 As I stated earlier, you will ot have to calculate the formula by had. Istead, I will provide computer output ad you eed to be able to aswer questios based o the output. The computer output (a regressio plot) for this example follows. 5
6 Regressio Plot Y = X RSq = 99.5 % 6 5 Pressure Drug I this output, the equatio ad the Rsquared value are give. If you look above the graph, you will see this iformatio. Notice the Rsquared value for this example is exactly what we stated previously i this sectio of material. You eed to be able to get the r value based o the Rsquared that is give i the output. All you have to do is take the square root of the Rsquared value. The thig you have to be careful of is the directio of the relatioship. Remember that if the slope of the lie is positive the r is positive ad if the slope is egative the r is egative. Therefore, you must look at the slope i order to decide if r is positive or egative. I terms of the equatio, you eed to be able to use it for predictio. This is a pretty direct process as we will always be predictig Y based o X. Therefore, you will plug i for X ad solve for Y. For our example, predict the Reductio i Blood Pressure if 5 is the Dosage of Drug. y x y (5) y 6.1 6
7 Cosider the followig data ad software output, which give the weight (i thousads of pouds), X, ad gasolie mileage (miles per gallo), Y, for te differet automobiles. X Y Regressio Plot Y = X S = 3.46 RSq = 9. % RSq(adj) = 91. % Y X Calculate r.. Based o r ad the scatterplot is liear regressio justified i this case. 3. Predict the gas mileage for a car weighig 4. (4,) pouds. Aswers 1. r R This is egative because of the egative slope. Yes, liear regressio is justified, r. 75 ad the poits are spread reasoably about the lie o the scatterplot Y X
8 Cautios with regressio There are two commo mistakes with regressio. You must be aware of the problems with extrapolatio ad extreme values. Iterpolatio predictig Y values for X values that are withi the rage of the scatter plot (this is what regressio should be used for) Extrapolatio predictig Y values for X values beyod the rage of the observatios (this should ot be doe with a basic regressio model, it is a complex problem) If our X variable rages from 1 to 5 as it does i the Dosage of drug ad Reductio i blood pressure example the it is reasoable to predict withi that rage. However, if you try ad predict for a X of 1 the you have o data idicatig that this relatioship holds at that value. It is quite possible that the relatioship chages beyod the rage of the data. There is o way to kow this without collectig data cosistet with the X values you wat to predict. The leastsquares lie ad the r value ca be affected greatly by extreme data poits. I order to illustrate this we will look at some computer output. 6 Regressio Plot Y = EX RSq =. % 5 4 C C Calculate the r value for the above data. r With a r of, we kow that there is o liear relatioship betwee X ad Y. 8
9 Regressio Plot Y = X RSq = 77.9 % 3 C1 1 1 C 3 Calculate r for the above data. r With a r of.883, which is bigger tha the criteria of.75 it seems like we have a strog relatioship. With further ivestigatio via the scatterplot, you will see that all of the data is i the bottom left of the graph except oe data poit which is extreme. What I actually did was take the data from the previous graph with a r value of ad add oe extreme value. Notice the extreme value makes the other data poits appear close together. They also appear umerically close sice the oe value is so extreme. Therefore, the r value is high because the poits are close to the lie. I this case liear regressio is ot justified. If you have a extreme value i a plot like i this case, you should remove the extreme value ad see if the relatioship still exists. I this case it does ot so liear regressio will ot work for this data. 9
1 Inferential Methods for Correlation and Regression Analysis
1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet
More information3/3/2014. CDS M Phil Econometrics. Types of Relationships. Types of Relationships. Types of Relationships. Vijayamohanan Pillai N.
3/3/04 CDS M Phil Old Least Squares (OLS) Vijayamohaa Pillai N CDS M Phil Vijayamoha CDS M Phil Vijayamoha Types of Relatioships Oly oe idepedet variable, Relatioship betwee ad is Liear relatioships Curviliear
More informationGotta Keep It Correlatin
Gotta Keep It Correlati Correlatio.2 Learig Goals I this lesso, ou will: Determie the correlatio coefficiet usig a formula. Iterpret the correlatio coefficiet for a set of data. ew Stud Liks Dark Chocolate
More informationPaired Data and Linear Correlation
Paired Data ad Liear Correlatio Example. A group of calculus studets has take two quizzes. These are their scores: Studet st Quiz Score ( data) d Quiz Score ( data) 7 5 5 0 3 0 3 4 0 5 5 5 5 6 0 8 7 0
More informationMA131  Analysis 1. Workbook 2 Sequences I
MA3  Aalysis Workbook 2 Sequeces I Autum 203 Cotets 2 Sequeces I 2. Itroductio.............................. 2.2 Icreasig ad Decreasig Sequeces................ 2 2.3 Bouded Sequeces..........................
More informationChapter Objectives. Bivariate Data. Terminology. Lurking Variable. Types of Relations. Chapter 3 Linear Regression and Correlation
Chapter Objectives Chapter 3 Liear Regressio ad Correlatio Descriptive Aalysis & Presetatio of Two Quatitative Data To be able to preset twovariables data i tabular ad graphic form Display the relatioship
More informationSummary: CORRELATION & LINEAR REGRESSION. GC. Students are advised to refer to lecture notes for the GC operations to obtain scatter diagram.
Key Cocepts: 1) Sketchig of scatter diagram The scatter diagram of bivariate (i.e. cotaiig two variables) data ca be easily obtaied usig GC. Studets are advised to refer to lecture otes for the GC operatios
More informationCorrelation and Covariance
Correlatio ad Covariace Tom Ilveto FREC 9 What is Next? Correlatio ad Regressio Regressio We specify a depedet variable as a liear fuctio of oe or more idepedet variables, based o covariace Regressio
More informationSequences I. Chapter Introduction
Chapter 2 Sequeces I 2. Itroductio A sequece is a list of umbers i a defiite order so that we kow which umber is i the first place, which umber is i the secod place ad, for ay atural umber, we kow which
More informationProperties and Hypothesis Testing
Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Crosssectioal data. 2. Time series data.
More informationChapter 4  Summarizing Numerical Data
Chapter 4  Summarizig Numerical Data 15.075 Cythia Rudi Here are some ways we ca summarize data umerically. Sample Mea: i=1 x i x :=. Note: i this class we will work with both the populatio mea µ ad the
More informationSIMPLE LINEAR REGRESSION AND CORRELATION ANALYSIS
SIMPLE LINEAR REGRESSION AND CORRELATION ANALSIS INTRODUCTION There are lot of statistical ivestigatio to kow whether there is a relatioship amog variables Two aalyses: (1) regressio aalysis; () correlatio
More informationREGRESSION (Physics 1210 Notes, Partial Modified Appendix A)
REGRESSION (Physics 0 Notes, Partial Modified Appedix A) HOW TO PERFORM A LINEAR REGRESSION Cosider the followig data poits ad their graph (Table I ad Figure ): X Y 0 3 5 3 7 4 9 5 Table : Example Data
More informationLinear Regression Models
Liear Regressio Models Dr. Joh MellorCrummey Departmet of Computer Sciece Rice Uiversity johmc@cs.rice.edu COMP 528 Lecture 9 15 February 2005 Goals for Today Uderstad how to Use scatter diagrams to ispect
More informationAssessment and Modeling of Forests. FR 4218 Spring Assignment 1 Solutions
Assessmet ad Modelig of Forests FR 48 Sprig Assigmet Solutios. The first part of the questio asked that you calculate the average, stadard deviatio, coefficiet of variatio, ad 9% cofidece iterval of the
More informationA sequence of numbers is a function whose domain is the positive integers. We can see that the sequence
Sequeces A sequece of umbers is a fuctio whose domai is the positive itegers. We ca see that the sequece,, 2, 2, 3, 3,... is a fuctio from the positive itegers whe we write the first sequece elemet as
More informationMedian and IQR The median is the value which divides the ordered data values in half.
STA 666 Fall 2007 Webbased Course Notes 4: Describig Distributios Numerically Numerical summaries for quatitative variables media ad iterquartile rage (IQR) 5umber summary mea ad stadard deviatio Media
More informationP.3 Polynomials and Special products
Precalc Fall 2016 Sectios P.3, 1.2, 1.3, P.4, 1.4, P.2 (radicals/ratioal expoets), 1.5, 1.6, 1.7, 1.8, 1.1, 2.1, 2.2 I Polyomial defiitio (p. 28) a x + a x +... + a x + a x 1 1 0 1 1 0 a x + a x +... +
More informationRecall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and nonusers, x  y.
Testig Statistical Hypotheses Recall the study where we estimated the differece betwee mea systolic blood pressure levels of users of oral cotraceptives ad ousers, x  y. Such studies are sometimes viewed
More informationAcademic. Grade 9 Assessment of Mathematics. Released assessment Questions
Academic Grade 9 Assessmet of Mathematics 2014 Released assessmet Questios Record your aswers to the multiplechoice questios o the Studet Aswer Sheet (2014, Academic). Please ote: The format of this booklet
More informationNumber of fatalities X Sunday 4 Monday 6 Tuesday 2 Wednesday 0 Thursday 3 Friday 5 Saturday 8 Total 28. Day
LECTURE # 8 Mea Deviatio, Stadard Deviatio ad Variace & Coefficiet of variatio Mea Deviatio Stadard Deviatio ad Variace Coefficiet of variatio First, we will discuss it for the case of raw data, ad the
More informationCORRELATION AND REGRESSION
the Further Mathematics etwork www.fmetwork.org.uk V 7 1 1 REVISION SHEET STATISTICS 1 (Ed) CORRELATION AND REGRESSION The mai ideas are: Scatter Diagrams ad Lies of Best Fit Pearso s Product Momet Correlatio
More informationLINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity
LINEAR REGRESSION ANALYSIS MODULE IX Lecture  9 Multicolliearity Dr Shalabh Departmet of Mathematics ad Statistics Idia Istitute of Techology Kapur Multicolliearity diagostics A importat questio that
More informationP1 Chapter 8 :: Binomial Expansion
P Chapter 8 :: Biomial Expasio jfrost@tiffi.kigsto.sch.uk www.drfrostmaths.com @DrFrostMaths Last modified: 6 th August 7 Use of DrFrostMaths for practice Register for free at: www.drfrostmaths.com/homework
More informationLesson 10: Limits and Continuity
www.scimsacademy.com Lesso 10: Limits ad Cotiuity SCIMS Academy 1 Limit of a fuctio The cocept of limit of a fuctio is cetral to all other cocepts i calculus (like cotiuity, derivative, defiite itegrals
More informationMth 95 Notes Module 1 Spring Section 4.1 Solving Systems of Linear Equations in Two Variables by Graphing, Substitution, and Elimination
Mth 9 Notes Module Sprig 4 Sectio 4. Solvig Sstems of Liear Equatios i Two Variales Graphig, Sustitutio, ad Elimiatio A Solutio to a Sstem of Two (or more) Liear Equatios is the commo poit(s) of itersectio
More informationSeries III. Chapter Alternating Series
Chapter 9 Series III With the exceptio of the Null Sequece Test, all the tests for series covergece ad divergece that we have cosidered so far have dealt oly with series of oegative terms. Series with
More informationFinal Examination Solutions 17/6/2010
The Islamic Uiversity of Gaza Faculty of Commerce epartmet of Ecoomics ad Political Scieces A Itroductio to Statistics Course (ECOE 30) Sprig Semester 00900 Fial Eamiatio Solutios 7/6/00 Name: I: Istructor:
More information, then cv V. Differential Equations Elements of Lineaer Algebra Name: Consider the differential equation. and y2 cos( kx)
Cosider the differetial equatio y '' k y 0 has particular solutios y1 si( kx) ad y cos( kx) I geeral, ay liear combiatio of y1 ad y, cy 1 1 cy where c1, c is also a solutio to the equatio above The reaso
More information7.1 Finding Rational Solutions of Polynomial Equations
Name Class Date 7.1 Fidig Ratioal Solutios of Polyomial Equatios Essetial Questio: How do you fid the ratioal roots of a polyomial equatio? Resource Locker Explore Relatig Zeros ad Coefficiets of Polyomial
More information1 Lesson 6: Measure of Variation
1 Lesso 6: Measure of Variatio 1.1 The rage As we have see, there are several viable coteders for the best measure of the cetral tedecy of data. The mea, the mode ad the media each have certai advatages
More informationSimple Linear Regression
Simple Liear Regressio 1. Model ad Parameter Estimatio (a) Suppose our data cosist of a collectio of pairs (x i, y i ), where x i is a observed value of variable X ad y i is the correspodig observatio
More informationMAT1026 Calculus II Basic Convergence Tests for Series
MAT026 Calculus II Basic Covergece Tests for Series Egi MERMUT 202.03.08 Dokuz Eylül Uiversity Faculty of Sciece Departmet of Mathematics İzmir/TURKEY Cotets Mootoe Covergece Theorem 2 2 Series of Real
More informationConfidence Intervals for the Population Proportion p
Cofidece Itervals for the Populatio Proportio p The cocept of cofidece itervals for the populatio proportio p is the same as the oe for, the samplig distributio of the mea, x. The structure is idetical:
More informationSeunghee Ye Ma 8: Week 5 Oct 28
Week 5 Summary I Sectio, we go over the Mea Value Theorem ad its applicatios. I Sectio 2, we will recap what we have covered so far this term. Topics Page Mea Value Theorem. Applicatios of the Mea Value
More informationNotes on iteration and Newton s method. Iteration
Notes o iteratio ad Newto s method Iteratio Iteratio meas doig somethig over ad over. I our cotet, a iteratio is a sequece of umbers, vectors, fuctios, etc. geerated by a iteratio rule of the type 1 f
More information71. Chapter 4. Part I. Sampling Distributions and Confidence Intervals
71 Chapter 4 Part I. Samplig Distributios ad Cofidece Itervals 1 7 Sectio 1. Samplig Distributio 73 Usig Statistics Statistical Iferece: Predict ad forecast values of populatio parameters... Test hypotheses
More informationECE 901 Lecture 12: Complexity Regularization and the Squared Loss
ECE 90 Lecture : Complexity Regularizatio ad the Squared Loss R. Nowak 5/7/009 I the previous lectures we made use of the Cheroff/Hoeffdig bouds for our aalysis of classifier errors. Hoeffdig s iequality
More informationPractice Test Problems for Test IV, with Solutions
Practice Test Problems for Test IV, with Solutios Dr. Holmes May, 2008 The exam will cover sectios 8.2 (revisited) to 8.8. The Taylor remaider formula from 8.9 will ot be o this test. The fact that sums,
More informationWHAT IS THE PROBABILITY FUNCTION FOR LARGE TSUNAMI WAVES? ABSTRACT
WHAT IS THE PROBABILITY FUNCTION FOR LARGE TSUNAMI WAVES? Harold G. Loomis Hoolulu, HI ABSTRACT Most coastal locatios have few if ay records of tsuami wave heights obtaied over various time periods. Still
More informationDefinitions and Theorems. where x are the decision variables. c, b, and a are constant coefficients.
Defiitios ad Theorems Remember the scalar form of the liear programmig problem, Miimize, Subject to, f(x) = c i x i a 1i x i = b 1 a mi x i = b m x i 0 i = 1,2,, where x are the decisio variables. c, b,
More informationLecture 9: Hierarchy Theorems
IAS/PCMI Summer Sessio 2000 Clay Mathematics Udergraduate Program Basic Course o Computatioal Complexity Lecture 9: Hierarchy Theorems David Mix Barrigto ad Alexis Maciel July 27, 2000 Most of this lecture
More informationOnce we have a sequence of numbers, the next thing to do is to sum them up. Given a sequence (a n ) n=1
. Ifiite Series Oce we have a sequece of umbers, the ext thig to do is to sum them up. Give a sequece a be a sequece: ca we give a sesible meaig to the followig expressio? a = a a a a While summig ifiitely
More informationCURRICULUM INSPIRATIONS: INNOVATIVE CURRICULUM ONLINE EXPERIENCES: TANTON TIDBITS:
CURRICULUM INSPIRATIONS: wwwmaaorg/ci MATH FOR AMERICA_DC: wwwmathforamericaorg/dc INNOVATIVE CURRICULUM ONLINE EXPERIENCES: wwwgdaymathcom TANTON TIDBITS: wwwjamestatocom TANTON S TAKE ON MEAN ad VARIATION
More informationTopic 6 Sampling, hypothesis testing, and the central limit theorem
CSE 103: Probability ad statistics Fall 2010 Topic 6 Samplig, hypothesis testig, ad the cetral limit theorem 61 The biomial distributio Let X be the umberofheadswhe acoiofbiaspistossedtimes The distributio
More informationRiemann Sums y = f (x)
Riema Sums Recall that we have previously discussed the area problem I its simplest form we ca state it this way: The Area Problem Let f be a cotiuous, oegative fuctio o the closed iterval [a, b] Fid
More informationZeros of Polynomials
Math 160 www.timetodare.com 4.5 4.6 Zeros of Polyomials I these sectios we will study polyomials algebraically. Most of our work will be cocered with fidig the solutios of polyomial equatios of ay degree
More informationFIR Filters. Lecture #7 Chapter 5. BME 310 Biomedical Computing  J.Schesser
FIR Filters Lecture #7 Chapter 5 8 What Is this Course All About? To Gai a Appreciatio of the Various Types of Sigals ad Systems To Aalyze The Various Types of Systems To Lear the Skills ad Tools eeded
More informationSection 11.8: Power Series
Sectio 11.8: Power Series 1. Power Series I this sectio, we cosider geeralizig the cocept of a series. Recall that a series is a ifiite sum of umbers a. We ca talk about whether or ot it coverges ad i
More informationRegression. Correlation vs. regression. The parameters of linear regression. Regression assumes... Random sample. Y = α + β X.
Regressio Correlatio vs. regressio Predicts Y from X Liear regressio assumes that the relatioship betwee X ad Y ca be described by a lie Regressio assumes... Radom sample Y is ormally distributed with
More informationTable 12.1: Contingency table. Feature b. 1 N 11 N 12 N 1b 2 N 21 N 22 N 2b. ... a N a1 N a2 N ab
Sectio 12 Tests of idepedece ad homogeeity I this lecture we will cosider a situatio whe our observatios are classified by two differet features ad we would like to test if these features are idepedet
More informationCentral Limit Theorem the Meaning and the Usage
Cetral Limit Theorem the Meaig ad the Usage Covetio about otatio. N, We are usig otatio X is variable with mea ad stadard deviatio. i lieu of sayig that X is a ormal radom Assume a sample of measuremets
More informationAlgebra II Notes Unit Seven: Powers, Roots, and Radicals
Syllabus Objectives: 7. The studets will use properties of ratioal epoets to simplify ad evaluate epressios. 7.8 The studet will solve equatios cotaiig radicals or ratioal epoets. b a, the b is the radical.
More information(all terms are scalars).the minimization is clearer in sum notation:
7 Multiple liear regressio: with predictors) Depedet data set: y i i = 1, oe predictad, predictors x i,k i = 1,, k = 1, ' The forecast equatio is ŷ i = b + Use matrix otatio: k =1 b k x ik Y = y 1 y 1
More informationStudents will calculate quantities that involve positive and negative rational exponents.
: Ratioal Expoets What are ad? Studet Outcomes Studets will calculate quatities that ivolve positive ad egative ratioal expoets. Lesso Notes Studets exted their uderstadig of iteger expoets to ratioal
More informationREVISION SHEET FP1 (MEI) ALGEBRA. Identities In mathematics, an identity is a statement which is true for all values of the variables it contains.
the Further Mathematics etwork wwwfmetworkorguk V 07 The mai ideas are: Idetities REVISION SHEET FP (MEI) ALGEBRA Before the exam you should kow: If a expressio is a idetity the it is true for all values
More informationChapter 7. Transformation
Chapter 7 Trasformatio 7.. Trasformatio Is liear regressio appropriate? 7.. Trasformatio The assumptio of liear relatioship does ot alwas hold We ca trasform The predictor The respose Both to achieve the
More informationNumerical Methods in Fourier Series Applications
Numerical Methods i Fourier Series Applicatios Recall that the basic relatios i usig the Trigoometric Fourier Series represetatio were give by f ( x) a o ( a x cos b x si ) () where the Fourier coefficiets
More information14.2 Simplifying Expressions with Rational Exponents and Radicals
Name Class Date 14. Simplifyig Expressios with Ratioal Expoets ad Radicals Essetial Questio: How ca you write a radical expressio as a expressio with a ratioal expoet? Resource Locker Explore Explorig
More informationMA131  Analysis 1. Workbook 9 Series III
MA3  Aalysis Workbook 9 Series III Autum 004 Cotets 4.4 Series with Positive ad Negative Terms.............. 4.5 Alteratig Series.......................... 4.6 Geeral Series.............................
More informationChapter 6 Principles of Data Reduction
Chapter 6 for BST 695: Special Topics i Statistical Theory. Kui Zhag, 0 Chapter 6 Priciples of Data Reductio Sectio 6. Itroductio Goal: To summarize or reduce the data X, X,, X to get iformatio about a
More informationComplex Numbers Solutions
Complex Numbers Solutios Joseph Zoller February 7, 06 Solutios. (009 AIME I Problem ) There is a complex umber with imagiary part 64 ad a positive iteger such that Fid. [Solutio: 697] 4i + + 4i. 4i 4i
More informationThe Sample Variance Formula: A Detailed Study of an Old Controversy
The Sample Variace Formula: A Detailed Study of a Old Cotroversy Ky M. Vu PhD. AuLac Techologies Ic. c 00 Email: kymvu@aulactechologies.com Abstract The two biased ad ubiased formulae for the sample variace
More informationPH 425 Quantum Measurement and Spin Winter SPINS Lab 1
PH 425 Quatum Measuremet ad Spi Witer 23 SPIS Lab Measure the spi projectio S z alog the zaxis This is the experimet that is ready to go whe you start the program, as show below Each atom is measured
More information5. Likelihood Ratio Tests
1 of 5 7/29/2009 3:16 PM Virtual Laboratories > 9. Hy pothesis Testig > 1 2 3 4 5 6 7 5. Likelihood Ratio Tests Prelimiaries As usual, our startig poit is a radom experimet with a uderlyig sample space,
More informationCov(aX, cy ) Var(X) Var(Y ) It is completely invariant to affine transformations: for any a, b, c, d R, ρ(ax + b, cy + d) = a.s. X i. as n.
CS 189 Itroductio to Machie Learig Sprig 218 Note 11 1 Caoical Correlatio Aalysis The Pearso Correlatio Coefficiet ρ(x, Y ) is a way to measure how liearly related (i other words, how well a liear model
More informationMeasures of Spread: Variance and Standard Deviation
Lesso 16 Measures of Spread: Variace ad Stadard Deviatio BIG IDEA Variace ad stadard deviatio deped o the mea of a set of umbers. Calculatig these measures of spread depeds o whether the set is a sample
More informationPb ( a ) = measure of the plausibility of proposition b conditional on the information stated in proposition a. & then using P2
Axioms for Probability Logic Pb ( a ) = measure of the plausibility of propositio b coditioal o the iformatio stated i propositio a For propositios a, b ad c: P: Pb ( a) 0 P2: Pb ( a& b ) = P3: Pb ( a)
More informationSection 1 of Unit 03 (Pure Mathematics 3) Algebra
Sectio 1 of Uit 0 (Pure Mathematics ) Algebra Recommeded Prior Kowledge Studets should have studied the algebraic techiques i Pure Mathematics 1. Cotet This Sectio should be studied early i the course
More informationEcon 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chisquare Distribution, Student s t distribution 1.
Eco 325/327 Notes o Sample Mea, Sample Proportio, Cetral Limit Theorem, Chisquare Distributio, Studet s t distributio 1 Sample Mea By Hiro Kasahara We cosider a radom sample from a populatio. Defiitio
More information62. Power series Definition 16. (Power series) Given a sequence {c n }, the series. c n x n = c 0 + c 1 x + c 2 x 2 + c 3 x 3 +
62. Power series Defiitio 16. (Power series) Give a sequece {c }, the series c x = c 0 + c 1 x + c 2 x 2 + c 3 x 3 + is called a power series i the variable x. The umbers c are called the coefficiets of
More informationDS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10
DS 00: Priciples ad Techiques of Data Sciece Date: April 3, 208 Name: Hypothesis Testig Discussio #0. Defie these terms below as they relate to hypothesis testig. a) Data Geeratio Model: Solutio: A set
More information10.1 Sequences. n term. We will deal a. a n or a n n. ( 1) n ( 1) n 1 2 ( 1) a =, 0 0,,,,, ln n. n an 2. n term.
0. Sequeces A sequece is a list of umbers writte i a defiite order: a, a,, a, a is called the first term, a is the secod term, ad i geeral eclusively with ifiite sequeces ad so each term Notatio: the sequece
More informationMath 140 Introductory Statistics
8.2 Testig a Proportio Math 1 Itroductory Statistics Professor B. Abrego Lecture 15 Sectios 8.2 People ofte make decisios with data by comparig the results from a sample to some predetermied stadard. These
More informationRepresenting Functions as Power Series. 3 n ...
Math Fall 7 Lab Represetig Fuctios as Power Series I. Itrouctio I sectio.8 we leare the series c c c c c... () is calle a power series. It is a uctio o whose omai is the set o all or which it coverges.
More information1 Section 2.2, Absolute value
.Math 0450 Hoors itro to aalysis Sprig, 2009 Notes #6 1 Sectio 2.2, Absolute value It is importat to uderstad iequalities ivolvig absolute value. I class we cosidered the iequality jx 1j < jxj ; ad discussed
More information4.1 Data processing inequality
ECE598: Iformatiotheoretic methods i highdimesioal statistics Sprig 206 Lecture 4: Total variatio/iequalities betwee fdivergeces Lecturer: Yihog Wu Scribe: Matthew Tsao, Feb 8, 206 [Ed. Mar 22] Recall
More informationBHW #13 1/ Cooper. ENGR 323 Probabilistic Analysis Beautiful Homework # 13
BHW # /5 ENGR Probabilistic Aalysis Beautiful Homework # Three differet roads feed ito a particular freeway etrace. Suppose that durig a fixed time period, the umber of cars comig from each road oto the
More informationSECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES
SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES Read Sectio 1.5 (pages 5 9) Overview I Sectio 1.5 we lear to work with summatio otatio ad formulas. We will also itroduce a brief overview of sequeces,
More informationSEQUENCES AND SERIES
9 SEQUENCES AND SERIES INTRODUCTION Sequeces have may importat applicatios i several spheres of huma activities Whe a collectio of objects is arraged i a defiite order such that it has a idetified first
More informationSTA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:
STA 2023 Module 10 Comparig Two Proportios Learig Objectives Upo completig this module, you should be able to: 1. Perform largesample ifereces (hypothesis test ad cofidece itervals) to compare two populatio
More informationIntroduction to Artificial Intelligence CAP 4601 Summer 2013 Midterm Exam
Itroductio to Artificial Itelligece CAP 601 Summer 013 Midterm Exam 1. Termiology (7 Poits). Give the followig task eviromets, eter their properties/characteristics. The properties/characteristics of the
More informationDiscrete probability distributions
Discrete probability distributios I the chapter o probability we used the classical method to calculate the probability of various values of a radom variable. I some cases, however, we may be able to develop
More informationMost text will write ordinary derivatives using either Leibniz notation 2 3. y + 5y= e and y y. xx tt t
Itroductio to Differetial Equatios Defiitios ad Termiolog Differetial Equatio: A equatio cotaiig the derivatives of oe or more depedet variables, with respect to oe or more idepedet variables, is said
More informationMachine Learning Theory Tübingen University, WS 2016/2017 Lecture 12
Machie Learig Theory Tübige Uiversity, WS 06/07 Lecture Tolstikhi Ilya Abstract I this lecture we derive risk bouds for kerel methods. We will start by showig that Soft Margi kerel SVM correspods to miimizig
More information11. FINITE FIELDS. Example 1: The following tables define addition and multiplication for a field of order 4.
11. FINITE FIELDS 11.1. A Field With 4 Elemets Probably the oly fiite fields which you ll kow about at this stage are the fields of itegers modulo a prime p, deoted by Z p. But there are others. Now although
More informationInstructor: Judith Canner Spring 2010 CONFIDENCE INTERVALS How do we make inferences about the population parameters?
CONFIDENCE INTERVALS How do we make ifereces about the populatio parameters? The samplig distributio allows us to quatify the variability i sample statistics icludig how they differ from the parameter
More informationPart 1 of the text covers regression analysis with crosssectional data. It builds
Regressio Aalysis with CrossSectioal Data 1 Part 1 of the text covers regressio aalysis with crosssectioal data. It builds upo a solid base of college algebra ad basic cocepts i probability ad statistics.
More informationProbability, Expectation Value and Uncertainty
Chapter 1 Probability, Expectatio Value ad Ucertaity We have see that the physically observable properties of a quatum system are represeted by Hermitea operators (also referred to as observables ) such
More informationCTL.SC0x Supply Chain Analytics
CTL.SC0x Supply Chai Aalytics Key Cocepts Documet V1.1 This documet cotais the Key Cocepts documets for week 6, lessos 1 ad 2 withi the SC0x course. These are meat to complemet, ot replace, the lesso videos
More informationAP Calculus Chapter 9: Infinite Series
AP Calculus Chapter 9: Ifiite Series 9. Sequeces a, a 2, a 3, a 4, a 5,... Sequece: A fuctio whose domai is the set of positive itegers = 2 3 4 a = a a 2 a 3 a 4 terms of the sequece Begi with the patter
More informationPROBABILITY AMPLITUDE AND INTERFERENCE
PROILITY MPLITUDE ND INTERFERENCE I. Probability amplitude Suppose that particle is placed i the ifiite square well potetial. Let the state of the particle be give by ϕ ad let the system s eergy eigestates
More informationChapter VII Measures of Correlation
Chapter VII Measures of Correlatio A researcher may be iterested i fidig out whether two variables are sigificatly related or ot. For istace, he may be iterested i kowig whether metal ability is sigificatly
More informationChapter 3. Strong convergence. 3.1 Definition of almost sure convergence
Chapter 3 Strog covergece As poited out i the Chapter 2, there are multiple ways to defie the otio of covergece of a sequece of radom variables. That chapter defied covergece i probability, covergece i
More informationA PROBABILITY PRIMER
CARLETON COLLEGE A ROBABILITY RIMER SCOTT BIERMAN (Do ot quote without permissio) A robability rimer INTRODUCTION The field of probability ad statistics provides a orgaizig framework for systematically
More informationREVIEW OF SIMPLE LINEAR REGRESSION SIMPLE LINEAR REGRESSION
REVIEW OF SIMPLE LINEAR REGRESSION SIMPLE LINEAR REGRESSION I liear regreio, we coider the frequecy ditributio of oe variable (Y) at each of everal level of a ecod variable (X). Y i kow a the depedet variable.
More informationActivity 3: Length Measurements with the FourSided Meter Stick
Activity 3: Legth Measuremets with the FourSided Meter Stick OBJECTIVE: The purpose of this experimet is to study errors ad the propagatio of errors whe experimetal data derived usig a foursided meter
More information11.1 Radical Expressions and Rational Exponents
Name Class Date 11.1 Radical Expressios ad Ratioal Expoets Essetial Questio: How are ratioal expoets related to radicals ad roots? Resource Locker Explore Defiig Ratioal Expoets i Terms of Roots Remember
More informationPROBABILITY LOGIC: Part 2
James L Bec 2 July 2005 PROBABILITY LOGIC: Part 2 Axioms for Probability Logic Based o geeral cosideratios, we derived axioms for: Pb ( a ) = measure of the plausibility of propositio b coditioal o the
More information