Cj = Column total of column j Statistical Hypotheses: Ho : O f = E f

Size: px
Start display at page:

Download "Cj = Column total of column j Statistical Hypotheses: Ho : O f = E f"

Transcription

1 Stats Review Data Aalysis as a Decisio Makig Process I Levels of Measuremet NOIR (See Whitley, 001, pp , for details) Nomial Categories with Names; Yes vs. No (do t ask sometimes vs. ever), Sex, Religio, Relatioship Status, Political Affiliatio, Experimetal Group Membership (Cotrol Group, Maipulatio Group, Compariso Group). Numbers represet groups (1 Female, Male). Order is arbitrary. Ordial Nomial Categories with a Logical Order; Class Rak, Height from tallest to shortest, Resposes o a Numerical Ratig Scale (1 strogly agree, 7 Strogly disagree). Itervals betwee umbers is ot stadard from uit to uit. Iterval Numerical scales with logically ordered uits that are equidistat, but the zero is artificial. E.g., Temperature i Cetigrade ad Fahreheit (Zero does ot represet a absece of temperature). Time of day (o 0 o clock). Caledar Dates. Caledar Years. Ratio Numerical scales with logically ordered uits that are equidistat ad have a true Zero (the zero represets a lack of that which is beig measured). E.g., Elapsed Time, Temperature i Kelvi (0 degrees Kelvi degrees celcius), Legth, Mass. Because it uses a true zero, umerical values ca be used to defie ratios: 5 iches is five times more legth tha 1 ich. 10 iches is twice as log as 5 iches. - Ratio Level Measures are Quite Rare i Psychology Cotiuous Vs. Discrete Variables Discrete Variables Mutually Exclusive/Exhaustive Numerical Categories that ca t be broke dow i to fier uits (e.g., if sex is represeted by 1:male ad :female, there is o 1.5). All Nomial ad Ordial Variables are Discrete. However, may Ordial variables will be treated as cotiuous (e.g., the Numerical ratig scales are ofte averaged to form a sigle score which is treated as cotiuous). Cotiuous Variables Numerical systems where there are a ifiite umber of possible poits betwee each uit. Also the measuremets ca be broke dow ito fier uits (e.g., elapsed time : Years, Moths, Days, Hours, Miutes, Secods, Millisecods, Naosecods, etc..). II. Choosig your Statistics Kowig which statistic to use to test the relatioship betwee each variable depeds o the type of data you have (ad sometimes the type of questio you wat to aswer) -Note: This is ofte discussed with respect to the issue of statistical validity ad there are differet camps regardig the Appropriateess of certai levels of measuremet for specific statistics (Michell, 1986). Also give specific coditios most parametric statistics ca had o-parametric data. A. Sigle Discrete Variable Goodess of Fit X Allows us to test whether the group frequecies differ from chace patters (base rate frequecies : the frequecy istaces aturally occur i the eviromet). (df k -1, where k umber of groups) χ ( O E ) i i E i where O i observed frequecy for each separate group E i expected group frequecies (based o chace)

2 Statistical Hypotheses: Ho : O f E f Ha : O f E f Example Research Questio: Does umber of people who say they like cheezy poofs (Yes 1) vs. those who do ot like cheezy poofs (No 0), differ sigificatly from the umber expected by chace aloe? B. Discrete X Discrete Pearso s X (AKA: Test of Idepedece) Allows us to test whether the cross tabulatio patter of two omial variables differs from the patters expected by chace. If oe variable is ordial the t or F are ormally used. (df (R-1)(C-1)) where R # of rows & C # of colums. χ ( O E ) ij ij E E ij ij RC i N j where i the differet groups for Variable 1 j the differet groups for Variable where Ri Row total of row i Cj Colum total of colum j Statistical Hypotheses: Ho : O f E f Ha : O f E f Example Research Questio: Does the umber of people who thik they are Eric Cartma (yes 1, o 0), relative to whether or ot they eat cheezy poofs, sigificatly differ from the frequecies that are expected by chace aloe. Note: a sigificat Pearso s chi square will ot tell you which cells are differet. The crosstabulatio matrix must be examied to determie this. Eye-ballig the stadardized, corrected residuals seems to be the most useful. Limitatios o X : 1) Resposes must be idepedet ad mutually exclusive ad exhaustive. Each case from the sample should fit ito oe ad oly oe cell of the cross tab matrix. ) Low expected Frequecies limit the validity of X. If df 1 (e.g., x matrix), the o expected frequecy ca be less tha 5. Also, If df, all expected frequecies should exceed. If df3 or greater, the all expected frequecies except oe should be 5 or greater ad the oe cell eeds to have a expected frequecy of 1 or greater. Phi Coefficiet (if X matrix) correlatio coefficiet that estimates the stregth of the relatioship betwee two dichotomous omial variables. Note: Phi ca ot estimate the directio (e.g., positive liear vs. egative liear) of the relatioship betwee omial variables because the umerical values are arbitrary (directio is meaigless). -This correlatio coefficiet ca be calculated exactly like Pearso s r (below) or ca be estimated usig the X statistic. Thus ay X ca be coverted to Phi or Phi ca be coverted to X. (sigificace should be determied usig X tables) φ χ N ad χ φ N

3 Statistical Hypotheses: Ho : Phi 0 Ha : Phi 0 Example Research Questio: What is the stregth of the relatioship betwee whether oe thiks they are Eric Cartma or ot (Yes 1, No 0) ad whether oe eats cheezy poofs (Yes 1, No 0). Cramer s Phi or V (If X3 matrix or larger) correlatio coefficiet that estimates the stregth of the relatioship betwee two discrete omial variables (correctig for the ifluece of the umber of groups). (Sigificace should be determied usig X tables). Note: Phi ca ot estimate the directio (positive vs. egative) of the relatioship betwee omial variables because the umerical values are arbitrary (directio is meaigless). φ cramer χ N( k 1) where k is the smaller of R or C (or whe r c, k r c) Statistical Hypotheses: Ho : Phi 0 Ha : Phi 0 Example Research Questio: What is the stregth of the relatioship betwee whether oe thiks they are Eric Cartma or ot (Yes 1, No 0) ad whether oe prefers cheezy poofs (1), HooHoo Dillies (), or coa-coa yum-yum s (3). - If both discrete variables are Ordial (Raked Data) Spearma s Rho r s : Correlatio coefficiet that estimates the stregth ad directio of the relatioship betwee two ordial variables. Calculated usig Pearso s r formula. However, the r table ca t be used to estimate sigificace. For N > 10 ad N < 8, covert to a Z score. If Z greater tha 1.96, the alpha <.05. If Z greater tha.85, the alpha <.01. For smaller ad larger N, respectively, special tables will eed to be obtaied. Z rs ( N + 5) 9N( N 1) Statistical Hypotheses: Ho : Rho 0 Ha : Rho 0 Example Research Questio: What is the stregth ad directio of the relatioship betwee Eric Cartma s rakigs of 0 participat s suitability as a mate (rage 1-0) ad participats rakigs with respect to how may Cheezy Poofs they eat per-week ( rage 1-0). Kedall s Tau J : This statistic provides a alterative to Rho that is easier to calculate sigificace for by had. Rather tha beig a variatio of Pearso s r, Tau is based i the umber of iversios betwee the raked items. For example, the followig data set cotais 4 iversios out of the 10 comparisos. Subject Var Var Iversio Iversio Iversio Iversio

4 τ 1 ( I) N( N 1) where I umber of iversios Ofte, Tau is used as a measure of iter-rater agreemet (though Rho ca also be used for such purposes). Sigificace (regardless of the umber of subjects) is tested usig the same Z test as for Rho, oly substitute J for r s. Use the same Critical Values of Z: If Z greater tha 1.96, the alpha <.05. If Z greater tha.85, the alpha <.01. Statistical Hypotheses: Ho : Tau 0 Ha : Tau 0 Example Research Questio: What is the stregth ad directio of the relatioship betwee Eric Cartma s rakigs of 0 participat s suitability as a mate (rage 1-0) ad participats rakigs with respect to how may Cheezy Poofs they eat per-week ( rage 1-0). C. Discrete X Cotiuous If Discrete Variable is Dichotomous (oly levels) z-test compares a sigle sample mea to the populatio mea, whe the populatio stadard deviatio is kow Z X μ σ N where : & F populatio mea ad stadard deviatio, respectively Sigificace Use the Critical Values of Z: If Z greater tha 1.96, the alpha <.05. If Z greater tha.85, the alpha <.01. Statistical Hypotheses: Ho : Group 1 mea Populatio Mea Ha : Group 1 mea Populatio Mea Example Research Questio: Does the average umber of Cheezy Poofs eate per day by studets eroled i Graduate Research Statistics (rage 0-600) differ from the average umber of Cheezy Poofs eate per day i the geeral populatio of Graduate Studets. t-test Sigificace : If t obtaied exceeds the t critical (see ay t table for critical values) for a give df at the.05 alpha level, the the groups are sigificatly differet. Sigle Sample t-test compare a sample mea to a populatio mea whe the oly the sample stadard deviatio is kow. df -1 t X μ s Statistical Hypotheses: where s sample stadard deviatio Ho : Group 1 mea Populatio Mea Ha : Group 1 mea Populatio Mea Example Research Questio: Does the average umber of Cheezy Poofs eate per day by studets eroled i Graduate Research Statistics (rage 0-600) differ from the average umber of Cheezy Poofs eate per day i the Uited States.

5 Idepedet Sample t-test compare two the meas of two urelated groups. df - t X1 X s s ( 1) + ( 1) Statistical Hypotheses: Ho : Ha : Example Research Questio: Does a radomly assiged group exposed to 37 hours of South Park (group 1) rerus report sigificatly more positive or egative attitudes toward Research Methods (measured usig a 5 item questioaire employig a 7 poit ratig scale: item averages rage from 1-7) compared to a radomly assiged compariso coditio exposed to 37 hours of Sally Struthers Feed the Childre commercials (group ). Repeated Measure / Matched Sample t-test Repeated Measure: test the sigificace of the averaged differece i scores betwee time 1 ad time. Matched Sample: compare averaged differece i scores betwee group 1 ad group whe the subjects from each group have bee matched o some variable (e.g. age, itelligece, etc.). df -1 t dif X Σ ( X X ) t1 t Xt ( Σ ( Xt Xt )) 1 t1 1 Statistical Hypotheses: Ho : 0 t1 0 t Ha : 0 t1 0 t Example Research Questio: After beig exposed to 37 hours of South Park rerus (Time ), do participats report sigificatly more positive or egative attitudes toward Research Methods (measured usig a 5 item questioaire employig a 7 poit ratig scale: item averages rage from 1-7) compared to their pre-exposure scores (Time 1). Biserial vs. Poit Biserial (Artificial Dichotomies vs. Natural Dichotomies). Poit Biserial Correlatio Coefficiet r pb Same formula as Perso s r (see below), oly oe variable is a atural dichotomy ofte desigated by 0 & 1. This correlatio coefficiet will idicate the stregth of the relatioship betwee category membership ad the cotiuous score. Note that like Phi, the directio of the relatioship (positive vs. egative) is arbitrarily based o the umerical labels assiged to the groups. Examiatio of the meas is ecessary to determie the directio of the group differeces. df - Statistical Hypotheses: Ho : r pb 0 Ha : r pb 0 Example Research Questio: What is the stregth of the relatioship betwee beig radomly assiged to a group exposed to 37 hours of South Park rerus (group 1)

6 vs. beig radomly assiged to a compariso coditio exposed to 37 hours of Sally Struthers Feed the Childre commercials (group 0) ad self-report attitudes toward Research Methods (measured usig a 5 item questioaire employig a 7 poit ratig scale: item averages rage from 1-7). Biserial Correlatio Coefficiet r b Used whe a Dichotomy is developed from a cotiuous variable (e.g. mea split, or media split methods). Groups are ofte desigated by 0 & 1. r b is estimated from a ormal Pearso s r (see below). This statistic will tell you the stregth of the associatio betwee category membership (based o a artificial dichotomy) ad a cotiuous score. Agai, directio of the relatioship will be arbitrary depedig o the umerical category labels (however sice they are based o cotiuous scores the umerical label with greater value should be give to the upper ed of the cotiuum, makig iterpretatio easier) df - r b r % % X μ pearso belowcp abovecp cp σ where X cp the raw score cut poit (raw score used to split distributio ito groups) Statistical Hypotheses: Ho : r b 0 Ha : r b 0 Example Research Questio: What is the stregth ad directio of the relatioship betwee watchig more tha 10 hrs. per week of South Park (Group 1) vs. watchig 10 or fewer hours of South Park per week (group 0) ad self-report attitudes toward Research Methods (measured usig a 5 item questioaire employig a 7 poit ratig scale: item averages rage from 1-7), where 10 hrs. per week is the mea of selfreported South Park viewig habits. If Discrete Variable has more tha Levels (more tha groups). Oe Way ANOVA : Aova tests whether or more group meas are sigificatly differet. -Whe usig groups F t. -See ANOVA hadout for details o Oe Way Aova Statistical Hypotheses : Ho : Mea grp1 Mea grp Mea grp j (for j groups) Ha : At least oe group mea sigificatly differet from oe other group mea. Example Research Questio: Are there ay sigificat differece betwee three radomly assiged groups ( 1, exposed to 37 hours of South Park Episodes;, exposed to 37 hours of Sally Struthers Feed the Childre commercials; & 3, o TV cotrol coditio) with respect to their self-report attitudes toward Research Methods (measured usig a 5 item questioaire employig a 7 poit ratig scale: item averages rage from 1-7). If Discrete X Discrete X Cotiuous (Where both discrete variables are predictors) Two Way ANOVA : If we have idepedet variables (or 1 IV ad a Blockig Variable) the Two Way Aova (Or Factoral ANOVA) is called for. Agai, this will tell us if oe group mea (or matrix cell mea) is sigificatly differet from oe other group mea (or cell mea). Also, Factoral Aova ca hadle More tha IV s

7 (ad or Blockig Variables). (See Two Way Aova Hadout for Details). Moderatio - whe we fid a sigificat iteractio betwee two predictor variables we ca say that oe predictor moderates the relatioship betwee the other predictor ad the outcome (DV). Our decisio about which predictor is the IV ad which is the Moderatig Variable is based o our theoretical perspective. Statistical Hypotheses - The two way ANOVA actually tests several hypotheses at oce. 1) Mai Effects IV1 : Ho : Mea Group 1. Mea Group i. (for i groups) Ha : At least oe group mea sigificatly differet from oe other group mea. IV : Ho : Mea Group.1 Mea Group.j (for j groups) Ha : At least oe group mea sigificatly differet from oe other group mea. ) Iteractio effects (moderatio effects) IV1 : Ho : Mea grp11 Mea grp 1 Mea grp1 Mea grp ij (for i ad j groups) Ha : At least oe group mea sigificatly differet from oe other group mea. Example Research Questio: Do males (sex 0) have sigificatly more positive or egative attitudes toward Research Methods (measured usig a 5 item questioaire employig a 7 poit ratig scale: item averages rage from 1-7) compared to females (sex 1). Also, does a radomly assiged group exposed to 37 hours of South Park (group 1) rerus report sigificatly more positive or egative attitudes toward Research Methods (measured usig a 5 item questioaire employig a 7 poit ratig scale: item averages rage from 1-7) compared to a radomly assiged compariso coditio exposed to 37 hours of Sally Struthers Feed the Childre commercials (group ). Does participat sex (m 0, f 1) moderate the relatioship betwee south park exposure (IV: radom assigmet to coditio: 1) exposed to 37 hours of South Park Episodes; ) exposed to 37 hours of Sally Struthers Feed the Childre commercials.) ad attitude toward Research Methods (measured usig a 5 item questioaire employig a 7 poit ratig scale: item averages rage from 1-7). ANCOVA : Aalysis of Covariace : Sometimes we wat to remove the effects of a third variable (Covariate: Ca be cotiuous or discrete). We may wat to remove the effects of a uisace variable that is correlated with our mai variables of iterest or we may wat to test a mediatioal hypothesis (that the 3 rd variable explais the relatioship betwee the IV ad DV. that is, the 3 rd variable accouts for all the shared variace betwee the IV & DV). (see a good stats book for details) Statistical Hypotheses: Ho : Mea Group1 Mea Groupj (for j groups) Ha : At least oe group mea sigificatly differet from oe other group mea. Example Research Questio: Do males (Sex 0) have sigificatly more positive or egative attitudes toward Cheezy Poofs (measured usig a 6 item questioaire employig a 5 poit ratig scale: item averages rage from 1-5) compared to females (Sex 1) after the effects associated with IQ (rage ) are removed. NOTE - Multiple Regressio : Aythig Aova ad Acova ca do, Multiple regressio ca do as well through the use of dummy codig, effects codig, ad cotrast codig.

8 r D. Cotiuous X Cotiuous 1. Pearso s r : Allows us to test the stregth of the associatio betwee two cotiuous variables. It represets a ratio of the Covariace (variace shared by two variables) ad the total variace (covariace + uique variace). df - ( ΣX)( ΣY) Σ XY ( Σ X ) ( Σ Y) Σ X Σ Y or r ( X X)( Y Y) ( X X) ( Y Y) Statistical Hypotheses: Ho : r 0 Ha : r 0 Example Research Questio: What is the stregth ad directio of the associatio betwee the umber of Viea Sausages that a perso eats i 4 hr period (measured i grams, rage 0-900) ad the amout of time they sped doig impersoatios of south park characters i that same 4 hr. period (measured i miutes, rage ). r : The Coefficiet of Determiatio : r represets a ratio of covariace to total variace, but if we wat to kow how much variace i the DV is explaied by variace i the IV, the we ca square r to fid out. NOTE : for all of the correlatio based statistics above (Rho, Tau, Biserial, Poit-Biserial) if the sample size exceeds 30 the the Pearso r is robust eough that it ca will provide a reasoable approximatio of ay of these statistics without sigificatly iflatig the Type I error rate. Sigificace of r : sigificace tables for r are available i most stats books. If r obtaied exceeds the critical value of r for a give df at the.05 alpha level, the there is a sigificat associatio betwee the variables of iterest. If r tables are ot available you ca use t tables, as r tables are based o t coversios. t 1 r. Partial Correlatio pr : Tests the associatio betwee two cotiuous variables with the effects of a third variable removed. We may wat to remove the effects of a uisace variable that is correlated with our mai variables of iterest or we may wat to test a mediatioal hypothesis (that the 3 rd variable explais the relatioship betwee the IV ad DV. that is, the 3 rd variable accouts for all the shared variace betwee the IV & DV). (see a good stats book for details) -Note: The covariate does ot ecessarily have to be cotiuous (especially if your N is larger tha 30). Statistical Hypotheses: Ho : pr 0 Ha : pr 0 Example Research Questio: What is the stregth ad directio of the associatio betwee the umber of Viea Sausages that a perso eats i 4 hr period (measured i grams, rage 0-900) ad the amout of time they sped doig impersoatios of south park characters i that same 4 hr. period (measured i miutes, rage ) whe the variace i attitudes toward Viea Sausages (measure usig 4 item

9 measure employig a 7 poit umerical ratig scale, item averages rag from 1 to 7) is removed. 3. Regressio: Whe you have a sigle variable, regressio is essetially the same as correlatio. Istead of r, you calculate for b (beta), where b reflects the slope of lie that best fits the data (least-squares regressio coefficiet). Stregth of associatio is represeted as the chage i Y (DV) that results from a 1 uit chage i X (IV). Equatio for a Straight Lie (Least-Squares Regressio) Y a + bx Where: Y Depedet Variable: see as a fuctio of (or predicted by) the idepedet variable (X); X Idepedet Variable: the dimesio or characteristic that is see as the determiat or cause of the depedet variable (Y); b Slope of the Lie: rise (or drop) divided by ru; a Y-itercept: where the value of X 0 ad the lie itercepts the Y axis. Formula for Calculatig a ad b b Σ XY Σ X ( ΣX)( ΣY) ( Σ X ) a y bx Oce a ad b are kow they ca be plugged ito the regressio lie formula (Y a + bx) ad the predicted value of Y ca be estimated for ay value of X. Example Research Questio- For a give slope (b) of.6300 ad a Y-itercept (a) of 3., the how much time would a perso be expected to sped impersoatig Southpark characters (measured i miutes, rage ) i 4 hour period after eatig 300 grams of Viea Sausages? E. Multiple Cotiuous Idepedet Variables ad Sigle Cotiuous Depedet Variable Multiple Regressio - Like Factorial Aova, Multiple Regressio ca deal with multiple Idepedet Variables. Also, as a form of regressio, multiple regressio ca be used for predictio (predictig Y based o the values of the IVs). R The Multiple Correlatio Coefficiet - idicates the total associatio betwee the Predictors (IVs) ad the Criterio (DV). R Squared Multiple Correlatio Coefficiet idicates the % of the variace i the DV that is accouted for by the IVs. F used to test the sigificace of R. b The multiple regressio coefficiet - idicates the icrease i Y resultig from a 1 uit icrease i Y, whe all other IVs are held at the costat of 0. That is, it tells us about the uique effect a give predictor has o the criterio $ Beta The stadardized multiple regressio coefficiet - same as b oly the uits are stadardized i Z-score uits.

10 t (b/stadard error for b) used to test the sigificace of the regressio coefficiet. - The research questios that you ca ask with Multiple regressio are quite flexible. - Sigle Step - idetifies the uique associatio of each predictor with a criterio - Hierarchical Regressio (Mulitple Steps) - idetifies the uique cotributio of a sigle variable or group of variables to the multiple correlatio coefficiet (R )) - Mediatio Aalyses - (Because) A third variable accouts for/explais the relatioship betwee X ad Y. Why is X related to Y, because of Z. -This is the ultimate goal of Sciece. - Moderatio Aalyses / Iteractio effects - (It Depeds) A third variable iflueces the stregth ad/or directio of the relatioship betwee a IV ad DV. What ifluece does X have o Y? It depeds o Z. F. Multiple Depedet Variables 1. Discrete IVs ad Multiple Cotiuous DV s MANOVA : Multivariate Aalysis of Variace - Whe you have multiple cotiuous outcome variables that reflect a related set of costructs ad you wat to test the associatio with 1 or more discrete IV s you ca use Multiple Aalysis of Variace. - Returs a sigle F that idicates whether the IV (or IVs) are sigificatly associated with the DVs as a group. - This is most useful for keepig the Type I error rate dow whe coductig multiple aalyses. - If it is sigificat the it is usually followed up with Uivariate tests assessig oe depedet variable at a time. - (see a good stats book for details) MANCOVA : Maova with a Covariate (a variable that is havig it s ifluece removed from the test). (see a good stats book for details). Oe or more Cotiuous Idepedet Variables ad Multiple Cotiuous Depedet Variables - Caoical Correlatio or Set Correlatio - Returs a sigle correlatio coefficiet that is the best fittig correlatio betwee set 1 (IVs) ad set (DVs) determied through multiple iteratios. - Path Aalysis / Structural Equatio Modelig - Theory/Model Testig Procedures - - Allows you to determie the degree to which causal relatioship predicted by theory fit with the data; Goodess of Fit, which is expressed as a chi-square ad other fit idicies. - Path Aalysis / Causal Modelig - oly cosiders Maifest Variables, which are directly measured variables.

11 - Structural Equatio Modelig / Latet Variable Models - iclude Latet Variables, which are ot measured directly. For example SES, is ot determied by a sigle idicator. It is a latet variable made up of maifest variables like icome, educatio, job prestige, ad obtaied wealth. Sample Structural Model G. Cotiuous Predictors ad Categorical Depedet Variables - Logistical Regressio - Allows you to ask a variety of research questios with miimal statistical assumptios (e.g., assumptios regardig ormal distributios). - The most importat of which would be Stregth of Associatio betwee predictors ad outcomes ad Predictio (predictig outcomes/group membership for future cases). - Logistic regressio ca hadle multiple predictors that either cotiuous or discrete ad combiatios of both.

12 G. Data Reductio - Reducig a larger umber of variables dow to more maageable set. - Selectig Items for scale developmet - Factor Aalysis / Priciple Compoet Aalysis - Exploratory - - Idetifies groups of variables that have bee respoded to i similar way whe o a priori groups have bee idetified. - Cofirmatory - - Similar to SEM ad Path Aalysis, idetifies the goodess of fit betwee priori groups of items/variables ad the data. - Cluster Aalysis - Like Factor aalysis but groups people istead of varaibles. I. There are others... May May Others... - Multi Level Modelig (for group level idepedet variables) - Hierarchical Liear Modelig (kid of a combiatio of cluster aalysis ad regressio)

Response Variable denoted by y it is the variable that is to be predicted measure of the outcome of an experiment also called the dependent variable

Response Variable denoted by y it is the variable that is to be predicted measure of the outcome of an experiment also called the dependent variable Statistics Chapter 4 Correlatio ad Regressio If we have two (or more) variables we are usually iterested i the relatioship betwee the variables. Associatio betwee Variables Two variables are associated

More information

1 Inferential Methods for Correlation and Regression Analysis

1 Inferential Methods for Correlation and Regression Analysis 1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet

More information

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised Questio 1. (Topics 1-3) A populatio cosists of all the members of a group about which you wat to draw a coclusio (Greek letters (μ, σ, Ν) are used) A sample is the portio of the populatio selected for

More information

11 Correlation and Regression

11 Correlation and Regression 11 Correlatio Regressio 11.1 Multivariate Data Ofte we look at data where several variables are recorded for the same idividuals or samplig uits. For example, at a coastal weather statio, we might record

More information

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen)

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen) Goodess-of-Fit Tests ad Categorical Data Aalysis (Devore Chapter Fourtee) MATH-252-01: Probability ad Statistics II Sprig 2019 Cotets 1 Chi-Squared Tests with Kow Probabilities 1 1.1 Chi-Squared Testig................

More information

Chapter VII Measures of Correlation

Chapter VII Measures of Correlation Chapter VII Measures of Correlatio A researcher may be iterested i fidig out whether two variables are sigificatly related or ot. For istace, he may be iterested i kowig whether metal ability is sigificatly

More information

Chapter 6 Sampling Distributions

Chapter 6 Sampling Distributions Chapter 6 Samplig Distributios 1 I most experimets, we have more tha oe measuremet for ay give variable, each measuremet beig associated with oe radomly selected a member of a populatio. Hece we eed to

More information

Chapter 12 Correlation

Chapter 12 Correlation Chapter Correlatio Correlatio is very similar to regressio with oe very importat differece. Regressio is used to explore the relatioship betwee a idepedet variable ad a depedet variable, whereas correlatio

More information

UNIVERSITY OF TORONTO Faculty of Arts and Science APRIL/MAY 2009 EXAMINATIONS ECO220Y1Y PART 1 OF 2 SOLUTIONS

UNIVERSITY OF TORONTO Faculty of Arts and Science APRIL/MAY 2009 EXAMINATIONS ECO220Y1Y PART 1 OF 2 SOLUTIONS PART of UNIVERSITY OF TORONTO Faculty of Arts ad Sciece APRIL/MAY 009 EAMINATIONS ECO0YY PART OF () The sample media is greater tha the sample mea whe there is. (B) () A radom variable is ormally distributed

More information

Correlation Regression

Correlation Regression Correlatio Regressio While correlatio methods measure the stregth of a liear relatioship betwee two variables, we might wish to go a little further: How much does oe variable chage for a give chage i aother

More information

S Y Y = ΣY 2 n. Using the above expressions, the correlation coefficient is. r = SXX S Y Y

S Y Y = ΣY 2 n. Using the above expressions, the correlation coefficient is. r = SXX S Y Y 1 Sociology 405/805 Revised February 4, 004 Summary of Formulae for Bivariate Regressio ad Correlatio Let X be a idepedet variable ad Y a depedet variable, with observatios for each of the values of these

More information

II. Descriptive Statistics D. Linear Correlation and Regression. 1. Linear Correlation

II. Descriptive Statistics D. Linear Correlation and Regression. 1. Linear Correlation II. Descriptive Statistics D. Liear Correlatio ad Regressio I this sectio Liear Correlatio Cause ad Effect Liear Regressio 1. Liear Correlatio Quatifyig Liear Correlatio The Pearso product-momet correlatio

More information

Properties and Hypothesis Testing

Properties and Hypothesis Testing Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.

More information

Chapter 1 (Definitions)

Chapter 1 (Definitions) FINAL EXAM REVIEW Chapter 1 (Defiitios) Qualitative: Nomial: Ordial: Quatitative: Ordial: Iterval: Ratio: Observatioal Study: Desiged Experimet: Samplig: Cluster: Stratified: Systematic: Coveiece: Simple

More information

Open book and notes. 120 minutes. Cover page and six pages of exam. No calculators.

Open book and notes. 120 minutes. Cover page and six pages of exam. No calculators. IE 330 Seat # Ope book ad otes 120 miutes Cover page ad six pages of exam No calculators Score Fial Exam (example) Schmeiser Ope book ad otes No calculator 120 miutes 1 True or false (for each, 2 poits

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Aalysis ad Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasii/teachig.html Suhasii Subba Rao Review of testig: Example The admistrator of a ursig home wats to do a time ad motio

More information

Important Formulas. Expectation: E (X) = Σ [X P(X)] = n p q σ = n p q. P(X) = n! X1! X 2! X 3! X k! p X. Chapter 6 The Normal Distribution.

Important Formulas. Expectation: E (X) = Σ [X P(X)] = n p q σ = n p q. P(X) = n! X1! X 2! X 3! X k! p X. Chapter 6 The Normal Distribution. Importat Formulas Chapter 3 Data Descriptio Mea for idividual data: X = _ ΣX Mea for grouped data: X= _ Σf X m Stadard deviatio for a sample: _ s = Σ(X _ X ) or s = 1 (Σ X ) (Σ X ) ( 1) Stadard deviatio

More information

Regression, Inference, and Model Building

Regression, Inference, and Model Building Regressio, Iferece, ad Model Buildig Scatter Plots ad Correlatio Correlatio coefficiet, r -1 r 1 If r is positive, the the scatter plot has a positive slope ad variables are said to have a positive relatioship

More information

ST 305: Exam 3 ( ) = P(A)P(B A) ( ) = P(A) + P(B) ( ) = 1 P( A) ( ) = P(A) P(B) σ X 2 = σ a+bx. σ ˆp. σ X +Y. σ X Y. σ X. σ Y. σ n.

ST 305: Exam 3 ( ) = P(A)P(B A) ( ) = P(A) + P(B) ( ) = 1 P( A) ( ) = P(A) P(B) σ X 2 = σ a+bx. σ ˆp. σ X +Y. σ X Y. σ X. σ Y. σ n. ST 305: Exam 3 By hadig i this completed exam, I state that I have either give or received assistace from aother perso durig the exam period. I have used o resources other tha the exam itself ad the basic

More information

Chapters 5 and 13: REGRESSION AND CORRELATION. Univariate data: x, Bivariate data (x,y).

Chapters 5 and 13: REGRESSION AND CORRELATION. Univariate data: x, Bivariate data (x,y). Chapters 5 ad 13: REGREION AND CORRELATION (ectios 5.5 ad 13.5 are omitted) Uivariate data: x, Bivariate data (x,y). Example: x: umber of years studets studied paish y: score o a proficiecy test For each

More information

Stat 139 Homework 7 Solutions, Fall 2015

Stat 139 Homework 7 Solutions, Fall 2015 Stat 139 Homework 7 Solutios, Fall 2015 Problem 1. I class we leared that the classical simple liear regressio model assumes the followig distributio of resposes: Y i = β 0 + β 1 X i + ɛ i, i = 1,...,,

More information

Inferential Statistics. Inference Process. Inferential Statistics and Probability a Holistic Approach. Inference Process.

Inferential Statistics. Inference Process. Inferential Statistics and Probability a Holistic Approach. Inference Process. Iferetial Statistics ad Probability a Holistic Approach Iferece Process Chapter 8 Poit Estimatio ad Cofidece Itervals This Course Material by Maurice Geraghty is licesed uder a Creative Commos Attributio-ShareAlike

More information

Linear Regression Models

Linear Regression Models Liear Regressio Models Dr. Joh Mellor-Crummey Departmet of Computer Sciece Rice Uiversity johmc@cs.rice.edu COMP 528 Lecture 9 15 February 2005 Goals for Today Uderstad how to Use scatter diagrams to ispect

More information

Final Examination Solutions 17/6/2010

Final Examination Solutions 17/6/2010 The Islamic Uiversity of Gaza Faculty of Commerce epartmet of Ecoomics ad Political Scieces A Itroductio to Statistics Course (ECOE 30) Sprig Semester 009-00 Fial Eamiatio Solutios 7/6/00 Name: I: Istructor:

More information

BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9

BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9 BIOS 4110: Itroductio to Biostatistics Brehey Lab #9 The Cetral Limit Theorem is very importat i the realm of statistics, ad today's lab will explore the applicatio of it i both categorical ad cotiuous

More information

3/3/2014. CDS M Phil Econometrics. Types of Relationships. Types of Relationships. Types of Relationships. Vijayamohanan Pillai N.

3/3/2014. CDS M Phil Econometrics. Types of Relationships. Types of Relationships. Types of Relationships. Vijayamohanan Pillai N. 3/3/04 CDS M Phil Old Least Squares (OLS) Vijayamohaa Pillai N CDS M Phil Vijayamoha CDS M Phil Vijayamoha Types of Relatioships Oly oe idepedet variable, Relatioship betwee ad is Liear relatioships Curviliear

More information

STP 226 EXAMPLE EXAM #1

STP 226 EXAMPLE EXAM #1 STP 226 EXAMPLE EXAM #1 Istructor: Hoor Statemet: I have either give or received iformatio regardig this exam, ad I will ot do so util all exams have bee graded ad retured. PRINTED NAME: Siged Date: DIRECTIONS:

More information

Mathematical Notation Math Introduction to Applied Statistics

Mathematical Notation Math Introduction to Applied Statistics Mathematical Notatio Math 113 - Itroductio to Applied Statistics Name : Use Word or WordPerfect to recreate the followig documets. Each article is worth 10 poits ad ca be prited ad give to the istructor

More information

Statisticians use the word population to refer the total number of (potential) observations under consideration

Statisticians use the word population to refer the total number of (potential) observations under consideration 6 Samplig Distributios Statisticias use the word populatio to refer the total umber of (potetial) observatios uder cosideratio The populatio is just the set of all possible outcomes i our sample space

More information

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics Explorig Data: Distributios Look for overall patter (shape, ceter, spread) ad deviatios (outliers). Mea (use a calculator): x = x 1 + x 2 + +

More information

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering CEE 5 Autum 005 Ucertaity Cocepts for Geotechical Egieerig Basic Termiology Set A set is a collectio of (mutually exclusive) objects or evets. The sample space is the (collectively exhaustive) collectio

More information

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference EXST30 Backgroud material Page From the textbook The Statistical Sleuth Mea [0]: I your text the word mea deotes a populatio mea (µ) while the work average deotes a sample average ( ). Variace [0]: The

More information

Agreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times

Agreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times Sigificace level vs. cofidece level Agreemet of CI ad HT Lecture 13 - Tests of Proportios Sta102 / BME102 Coli Rudel October 15, 2014 Cofidece itervals ad hypothesis tests (almost) always agree, as log

More information

MA238 Assignment 4 Solutions (part a)

MA238 Assignment 4 Solutions (part a) (i) Sigle sample tests. Questio. MA38 Assigmet 4 Solutios (part a) (a) (b) (c) H 0 : = 50 sq. ft H A : < 50 sq. ft H 0 : = 3 mpg H A : > 3 mpg H 0 : = 5 mm H A : 5mm Questio. (i) What are the ull ad alterative

More information

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE IN STATISTICS, 017 MODULE 4 : Liear models Time allowed: Oe ad a half hours Cadidates should aswer THREE questios. Each questio carries

More information

Expectation and Variance of a random variable

Expectation and Variance of a random variable Chapter 11 Expectatio ad Variace of a radom variable The aim of this lecture is to defie ad itroduce mathematical Expectatio ad variace of a fuctio of discrete & cotiuous radom variables ad the distributio

More information

Describing the Relation between Two Variables

Describing the Relation between Two Variables Copyright 010 Pearso Educatio, Ic. Tables ad Formulas for Sulliva, Statistics: Iformed Decisios Usig Data 010 Pearso Educatio, Ic Chapter Orgaizig ad Summarizig Data Relative frequecy = frequecy sum of

More information

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to: STA 2023 Module 10 Comparig Two Proportios Learig Objectives Upo completig this module, you should be able to: 1. Perform large-sample ifereces (hypothesis test ad cofidece itervals) to compare two populatio

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2016 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

DAWSON COLLEGE DEPARTMENT OF MATHEMATICS 201-BZS-05 PROBABILITY AND STATISTICS FALL 2015 FINAL EXAM

DAWSON COLLEGE DEPARTMENT OF MATHEMATICS 201-BZS-05 PROBABILITY AND STATISTICS FALL 2015 FINAL EXAM DAWSON COLLEGE DEPARTMENT OF MATHEMATICS 201-BZS-05 PROBABILITY AND STATISTICS FALL 2015 FINAL EXAM Name: Date: December 24th, 2015 Studet Number: Time: 9:30 12:30 Grade: / 116 Examier: Matthew MARCHANT

More information

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc. Chapter 22 Comparig Two Proportios Copyright 2010, 2007, 2004 Pearso Educatio, Ic. Comparig Two Proportios Read the first two paragraphs of pg 504. Comparisos betwee two percetages are much more commo

More information

INSTRUCTIONS (A) 1.22 (B) 0.74 (C) 4.93 (D) 1.18 (E) 2.43

INSTRUCTIONS (A) 1.22 (B) 0.74 (C) 4.93 (D) 1.18 (E) 2.43 PAPER NO.: 444, 445 PAGE NO.: Page 1 of 1 INSTRUCTIONS I. You have bee provided with: a) the examiatio paper i two parts (PART A ad PART B), b) a multiple choice aswer sheet (for PART A), c) selected formulae

More information

October 25, 2018 BIM 105 Probability and Statistics for Biomedical Engineers 1

October 25, 2018 BIM 105 Probability and Statistics for Biomedical Engineers 1 October 25, 2018 BIM 105 Probability ad Statistics for Biomedical Egieers 1 Populatio parameters ad Sample Statistics October 25, 2018 BIM 105 Probability ad Statistics for Biomedical Egieers 2 Ifereces

More information

GG313 GEOLOGICAL DATA ANALYSIS

GG313 GEOLOGICAL DATA ANALYSIS GG313 GEOLOGICAL DATA ANALYSIS 1 Testig Hypothesis GG313 GEOLOGICAL DATA ANALYSIS LECTURE NOTES PAUL WESSEL SECTION TESTING OF HYPOTHESES Much of statistics is cocered with testig hypothesis agaist data

More information

Lecture 22: Review for Exam 2. 1 Basic Model Assumptions (without Gaussian Noise)

Lecture 22: Review for Exam 2. 1 Basic Model Assumptions (without Gaussian Noise) Lecture 22: Review for Exam 2 Basic Model Assumptios (without Gaussia Noise) We model oe cotiuous respose variable Y, as a liear fuctio of p umerical predictors, plus oise: Y = β 0 + β X +... β p X p +

More information

Correlation and Covariance

Correlation and Covariance Correlatio ad Covariace Tom Ilveto FREC 9 What is Next? Correlatio ad Regressio Regressio We specify a depedet variable as a liear fuctio of oe or more idepedet variables, based o co-variace Regressio

More information

Assessment and Modeling of Forests. FR 4218 Spring Assignment 1 Solutions

Assessment and Modeling of Forests. FR 4218 Spring Assignment 1 Solutions Assessmet ad Modelig of Forests FR 48 Sprig Assigmet Solutios. The first part of the questio asked that you calculate the average, stadard deviatio, coefficiet of variatio, ad 9% cofidece iterval of the

More information

Power and Type II Error

Power and Type II Error Statistical Methods I (EXST 7005) Page 57 Power ad Type II Error Sice we do't actually kow the value of the true mea (or we would't be hypothesizig somethig else), we caot kow i practice the type II error

More information

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals 7-1 Chapter 4 Part I. Samplig Distributios ad Cofidece Itervals 1 7- Sectio 1. Samplig Distributio 7-3 Usig Statistics Statistical Iferece: Predict ad forecast values of populatio parameters... Test hypotheses

More information

Economics 241B Relation to Method of Moments and Maximum Likelihood OLSE as a Maximum Likelihood Estimator

Economics 241B Relation to Method of Moments and Maximum Likelihood OLSE as a Maximum Likelihood Estimator Ecoomics 24B Relatio to Method of Momets ad Maximum Likelihood OLSE as a Maximum Likelihood Estimator Uder Assumptio 5 we have speci ed the distributio of the error, so we ca estimate the model parameters

More information

Random Variables, Sampling and Estimation

Random Variables, Sampling and Estimation Chapter 1 Radom Variables, Samplig ad Estimatio 1.1 Itroductio This chapter will cover the most importat basic statistical theory you eed i order to uderstad the ecoometric material that will be comig

More information

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc. Chapter 22 Comparig Two Proportios Copyright 2010 Pearso Educatio, Ic. Comparig Two Proportios Comparisos betwee two percetages are much more commo tha questios about isolated percetages. Ad they are more

More information

Statistical inference: example 1. Inferential Statistics

Statistical inference: example 1. Inferential Statistics Statistical iferece: example 1 Iferetial Statistics POPULATION SAMPLE A clothig store chai regularly buys from a supplier large quatities of a certai piece of clothig. Each item ca be classified either

More information

1 Models for Matched Pairs

1 Models for Matched Pairs 1 Models for Matched Pairs Matched pairs occur whe we aalyse samples such that for each measuremet i oe of the samples there is a measuremet i the other sample that directly relates to the measuremet i

More information

A quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population

A quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population A quick activity - Cetral Limit Theorem ad Proportios Lecture 21: Testig Proportios Statistics 10 Coli Rudel Flip a coi 30 times this is goig to get loud! Record the umber of heads you obtaied ad calculate

More information

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING Lectures MODULE 5 STATISTICS II. Mea ad stadard error of sample data. Biomial distributio. Normal distributio 4. Samplig 5. Cofidece itervals

More information

y ij = µ + α i + ɛ ij,

y ij = µ + α i + ɛ ij, STAT 4 ANOVA -Cotrasts ad Multiple Comparisos /3/04 Plaed comparisos vs uplaed comparisos Cotrasts Cofidece Itervals Multiple Comparisos: HSD Remark Alterate form of Model I y ij = µ + α i + ɛ ij, a i

More information

6 Sample Size Calculations

6 Sample Size Calculations 6 Sample Size Calculatios Oe of the major resposibilities of a cliical trial statisticia is to aid the ivestigators i determiig the sample size required to coduct a study The most commo procedure for determiig

More information

Chapter two: Hypothesis testing

Chapter two: Hypothesis testing : Hypothesis testig - Some basic cocepts: - Data: The raw material of statistics is data. For our purposes we may defie data as umbers. The two kids of umbers that we use i statistics are umbers that result

More information

Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y.

Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y. Testig Statistical Hypotheses Recall the study where we estimated the differece betwee mea systolic blood pressure levels of users of oral cotraceptives ad o-users, x - y. Such studies are sometimes viewed

More information

Agenda: Recap. Lecture. Chapter 12. Homework. Chapt 12 #1, 2, 3 SAS Problems 3 & 4 by hand. Marquette University MATH 4740/MSCS 5740

Agenda: Recap. Lecture. Chapter 12. Homework. Chapt 12 #1, 2, 3 SAS Problems 3 & 4 by hand. Marquette University MATH 4740/MSCS 5740 Ageda: Recap. Lecture. Chapter Homework. Chapt #,, 3 SAS Problems 3 & 4 by had. Copyright 06 by D.B. Rowe Recap. 6: Statistical Iferece: Procedures for μ -μ 6. Statistical Iferece Cocerig μ -μ Recall yes

More information

Read through these prior to coming to the test and follow them when you take your test.

Read through these prior to coming to the test and follow them when you take your test. Math 143 Sprig 2012 Test 2 Iformatio 1 Test 2 will be give i class o Thursday April 5. Material Covered The test is cummulative, but will emphasize the recet material (Chapters 6 8, 10 11, ad Sectios 12.1

More information

Chapter 23: Inferences About Means

Chapter 23: Inferences About Means Chapter 23: Ifereces About Meas Eough Proportios! We ve spet the last two uits workig with proportios (or qualitative variables, at least) ow it s time to tur our attetios to quatitative variables. For

More information

1 Review of Probability & Statistics

1 Review of Probability & Statistics 1 Review of Probability & Statistics a. I a group of 000 people, it has bee reported that there are: 61 smokers 670 over 5 960 people who imbibe (drik alcohol) 86 smokers who imbibe 90 imbibers over 5

More information

This is an introductory course in Analysis of Variance and Design of Experiments.

This is an introductory course in Analysis of Variance and Design of Experiments. 1 Notes for M 384E, Wedesday, Jauary 21, 2009 (Please ote: I will ot pass out hard-copy class otes i future classes. If there are writte class otes, they will be posted o the web by the ight before class

More information

Sample Size Estimation in the Proportional Hazards Model for K-sample or Regression Settings Scott S. Emerson, M.D., Ph.D.

Sample Size Estimation in the Proportional Hazards Model for K-sample or Regression Settings Scott S. Emerson, M.D., Ph.D. ample ie Estimatio i the Proportioal Haards Model for K-sample or Regressio ettigs cott. Emerso, M.D., Ph.D. ample ie Formula for a Normally Distributed tatistic uppose a statistic is kow to be ormally

More information

Topic 10: Introduction to Estimation

Topic 10: Introduction to Estimation Topic 0: Itroductio to Estimatio Jue, 0 Itroductio I the simplest possible terms, the goal of estimatio theory is to aswer the questio: What is that umber? What is the legth, the reactio rate, the fractio

More information

STA6938-Logistic Regression Model

STA6938-Logistic Regression Model Dr. Yig Zhag STA6938-Logistic Regressio Model Topic -Simple (Uivariate) Logistic Regressio Model Outlies:. Itroductio. A Example-Does the liear regressio model always work? 3. Maximum Likelihood Curve

More information

Simple Linear Regression

Simple Linear Regression Chapter 2 Simple Liear Regressio 2.1 Simple liear model The simple liear regressio model shows how oe kow depedet variable is determied by a sigle explaatory variable (regressor). Is is writte as: Y i

More information

SIMPLE LINEAR REGRESSION AND CORRELATION ANALYSIS

SIMPLE LINEAR REGRESSION AND CORRELATION ANALYSIS SIMPLE LINEAR REGRESSION AND CORRELATION ANALSIS INTRODUCTION There are lot of statistical ivestigatio to kow whether there is a relatioship amog variables Two aalyses: (1) regressio aalysis; () correlatio

More information

Statistical Properties of OLS estimators

Statistical Properties of OLS estimators 1 Statistical Properties of OLS estimators Liear Model: Y i = β 0 + β 1 X i + u i OLS estimators: β 0 = Y β 1X β 1 = Best Liear Ubiased Estimator (BLUE) Liear Estimator: β 0 ad β 1 are liear fuctio of

More information

ECON 3150/4150, Spring term Lecture 3

ECON 3150/4150, Spring term Lecture 3 Itroductio Fidig the best fit by regressio Residuals ad R-sq Regressio ad causality Summary ad ext step ECON 3150/4150, Sprig term 2014. Lecture 3 Ragar Nymoe Uiversity of Oslo 21 Jauary 2014 1 / 30 Itroductio

More information

Common Large/Small Sample Tests 1/55

Common Large/Small Sample Tests 1/55 Commo Large/Small Sample Tests 1/55 Test of Hypothesis for the Mea (σ Kow) Covert sample result ( x) to a z value Hypothesis Tests for µ Cosider the test H :μ = μ H 1 :μ > μ σ Kow (Assume the populatio

More information

UNIT 11 MULTIPLE LINEAR REGRESSION

UNIT 11 MULTIPLE LINEAR REGRESSION UNIT MULTIPLE LINEAR REGRESSION Structure. Itroductio release relies Obectives. Multiple Liear Regressio Model.3 Estimatio of Model Parameters Use of Matrix Notatio Properties of Least Squares Estimates.4

More information

CS284A: Representations and Algorithms in Molecular Biology

CS284A: Representations and Algorithms in Molecular Biology CS284A: Represetatios ad Algorithms i Molecular Biology Scribe Notes o Lectures 3 & 4: Motif Discovery via Eumeratio & Motif Represetatio Usig Positio Weight Matrix Joshua Gervi Based o presetatios by

More information

Measures of Variation

Measures of Variation Chapter : Measures of Variatio from Statistical Aalysis i the Behavioral Scieces by James Raymodo Secod Editio 97814669676 01 Copyright Property of Kedall Hut Publishig CHAPTER Measures of Variatio Key

More information

Worksheet 23 ( ) Introduction to Simple Linear Regression (continued)

Worksheet 23 ( ) Introduction to Simple Linear Regression (continued) Worksheet 3 ( 11.5-11.8) Itroductio to Simple Liear Regressio (cotiued) This worksheet is a cotiuatio of Discussio Sheet 3; please complete that discussio sheet first if you have ot already doe so. This

More information

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics Explorig Data: Distributios Look for overall patter (shape, ceter, spread) ad deviatios (outliers). Mea (use a calculator): x = x 1 + x 2 + +

More information

General IxJ Contingency Tables

General IxJ Contingency Tables page1 Geeral x Cotigecy Tables We ow geeralize our previous results from the prospective, retrospective ad cross-sectioal studies ad the Poisso samplig case to x cotigecy tables. For such tables, the test

More information

Simple Linear Regression

Simple Linear Regression Simple Liear Regressio 1. Model ad Parameter Estimatio (a) Suppose our data cosist of a collectio of pairs (x i, y i ), where x i is a observed value of variable X ad y i is the correspodig observatio

More information

Statistics 20: Final Exam Solutions Summer Session 2007

Statistics 20: Final Exam Solutions Summer Session 2007 1. 20 poits Testig for Diabetes. Statistics 20: Fial Exam Solutios Summer Sessio 2007 (a) 3 poits Give estimates for the sesitivity of Test I ad of Test II. Solutio: 156 patiets out of total 223 patiets

More information

Lecture 8: Non-parametric Comparison of Location. GENOME 560, Spring 2016 Doug Fowler, GS

Lecture 8: Non-parametric Comparison of Location. GENOME 560, Spring 2016 Doug Fowler, GS Lecture 8: No-parametric Compariso of Locatio GENOME 560, Sprig 2016 Doug Fowler, GS (dfowler@uw.edu) 1 Review What do we mea by oparametric? What is a desirable locatio statistic for ordial data? What

More information

There is no straightforward approach for choosing the warmup period l.

There is no straightforward approach for choosing the warmup period l. B. Maddah INDE 504 Discrete-Evet Simulatio Output Aalysis () Statistical Aalysis for Steady-State Parameters I a otermiatig simulatio, the iterest is i estimatig the log ru steady state measures of performace.

More information

5. A formulae page and two tables are provided at the end of Part A of the examination PART A

5. A formulae page and two tables are provided at the end of Part A of the examination PART A Istructios: 1. You have bee provided with: (a) this questio paper (Part A ad Part B) (b) a multiple choice aswer sheet (for Part A) (c) Log Aswer Sheet(s) (for Part B) (d) a booklet of tables. (a) I PART

More information

Algebra of Least Squares

Algebra of Least Squares October 19, 2018 Algebra of Least Squares Geometry of Least Squares Recall that out data is like a table [Y X] where Y collects observatios o the depedet variable Y ad X collects observatios o the k-dimesioal

More information

MidtermII Review. Sta Fall Office Hours Wednesday 12:30-2:30pm Watch linear regression videos before lab on Thursday

MidtermII Review. Sta Fall Office Hours Wednesday 12:30-2:30pm Watch linear regression videos before lab on Thursday Aoucemets MidtermII Review Sta 101 - Fall 2016 Duke Uiversity, Departmet of Statistical Sciece Office Hours Wedesday 12:30-2:30pm Watch liear regressio videos before lab o Thursday Dr. Abrahamse Slides

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

AP Statistics Review Ch. 8

AP Statistics Review Ch. 8 AP Statistics Review Ch. 8 Name 1. Each figure below displays the samplig distributio of a statistic used to estimate a parameter. The true value of the populatio parameter is marked o each samplig distributio.

More information

Parameter, Statistic and Random Samples

Parameter, Statistic and Random Samples Parameter, Statistic ad Radom Samples A parameter is a umber that describes the populatio. It is a fixed umber, but i practice we do ot kow its value. A statistic is a fuctio of the sample data, i.e.,

More information

10-701/ Machine Learning Mid-term Exam Solution

10-701/ Machine Learning Mid-term Exam Solution 0-70/5-78 Machie Learig Mid-term Exam Solutio Your Name: Your Adrew ID: True or False (Give oe setece explaatio) (20%). (F) For a cotiuous radom variable x ad its probability distributio fuctio p(x), it

More information

Lecture 3. Properties of Summary Statistics: Sampling Distribution

Lecture 3. Properties of Summary Statistics: Sampling Distribution Lecture 3 Properties of Summary Statistics: Samplig Distributio Mai Theme How ca we use math to justify that our umerical summaries from the sample are good summaries of the populatio? Lecture Summary

More information

Class 27. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Class 27. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700 Class 7 Daiel B. Rowe, Ph.D. Departmet of Mathematics, Statistics, ad Computer Sciece Copyright 013 by D.B. Rowe 1 Ageda: Skip Recap Chapter 10.5 ad 10.6 Lecture Chapter 11.1-11. Review Chapters 9 ad 10

More information

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting Lecture 6 Chi Square Distributio (χ ) ad Least Squares Fittig Chi Square Distributio (χ ) Suppose: We have a set of measuremets {x 1, x, x }. We kow the true value of each x i (x t1, x t, x t ). We would

More information

Least-Squares Regression

Least-Squares Regression MATH 482 Least-Squares Regressio Dr. Neal, WKU As well as fidig the correlatio of paired sample data {{ x 1, y 1 }, { x 2, y 2 },..., { x, y }}, we also ca plot the data with a scatterplot ad fid the least

More information

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading Topic 15 - Two Sample Iferece I STAT 511 Professor Bruce Craig Comparig Two Populatios Research ofte ivolves the compariso of two or more samples from differet populatios Graphical summaries provide visual

More information

Categorical Data Analysis

Categorical Data Analysis Categorical Data Aalysis Refereces : Ala Agresti, Categorical Data Aalysis, Wiley Itersciece, New Jersey, 2002 Bhattacharya, G.K., Johso, R.A., Statistical Cocepts ad Methods, Wiley,1977 Outlie Categorical

More information

A proposed discrete distribution for the statistical modeling of

A proposed discrete distribution for the statistical modeling of It. Statistical Ist.: Proc. 58th World Statistical Cogress, 0, Dubli (Sessio CPS047) p.5059 A proposed discrete distributio for the statistical modelig of Likert data Kidd, Marti Cetre for Statistical

More information

6.867 Machine learning

6.867 Machine learning 6.867 Machie learig Mid-term exam October, ( poits) Your ame ad MIT ID: Problem We are iterested here i a particular -dimesioal liear regressio problem. The dataset correspodig to this problem has examples

More information

Correlation. Two variables: Which test? Relationship Between Two Numerical Variables. Two variables: Which test? Contingency table Grouped bar graph

Correlation. Two variables: Which test? Relationship Between Two Numerical Variables. Two variables: Which test? Contingency table Grouped bar graph Correlatio Y Two variables: Which test? X Explaatory variable Respose variable Categorical Numerical Categorical Cotigecy table Cotigecy Logistic Grouped bar graph aalysis regressio Mosaic plot Numerical

More information

Linear Regression Demystified

Linear Regression Demystified Liear Regressio Demystified Liear regressio is a importat subject i statistics. I elemetary statistics courses, formulae related to liear regressio are ofte stated without derivatio. This ote iteds to

More information