Pumpkin Example: Flaws in Diagnostics: Correcting Models
|
|
- Warren Lamb
- 5 years ago
- Views:
Transcription
1 Math Treibergs Pumpkin Example: Flaws in Diagnostics: Correcting Models Name: Example March, 204 From Levine Ramsey & Smidt, Applied Statistics for Engineers and Scientists, Prentice Hall, Upper Saddle River, NJ, 200. The circumference (in cms) and weight (in grams) is recorded for 22 pumpkins. Can one predict the weight from circumference? Data Set Used in this Analysis : # Math Pumpkin Data March, 204 # Treibergs # # From Levine Ramsey & Smidt, Applied Statistics for Engneers and # Scientists, Prentice Hall, Upper saddle River, NJ, 200. # The circumference (in cms) and weight (in grams) is recorded # for 22 pumpkins. Can one predict the weight from circumference? # "Circ" "Weight"
2 R Session: R version 2.3. ( ) Copyright (C) 20 The R Foundation for Statistical Computing ISBN Platform: i38-apple-darwin9.8.0/i38 (32-bit) R is free software and comes with ABSOLUTELY NO WARRANTY. You are welcome to redistribute it under certain conditions. Type license() or licence() for distribution details. Natural language support but running in an English locale R is a collaborative project with many contributors. Type contributors() for more information and citation() on how to cite R or R packages in publications. Type demo() for some demos, help() for on-line help, or help.start() for an HTML browser interface to help. Type q() to quit R. [R.app GUI.4 (5874) i38-apple-darwin9.8.0] [History restored from /Users/andrejstreibergs/.Rapp.history] > tt=read.table("m3082datapumpkin.txt",header=t) > attach(tt) > names(tt) [] "Circ" "Weight" > ############## SCATTERPLOT OF WEIGHT VS CIRC WITH REGRESSION LINE ###### > f=lm(weight~circ) > abline(f,col=2) > plot(weight~circ, main="scatter Plot of Pumpkin Weight vs. Circumference") > abline(f,col=2) > ############### DIAGNOSTIC PLOTS OF LINEAR MODEL ####################### > layout(matrix(c(,3,2,4),nrow=2)) > plot(f) > ####################### SUMMARY AND ANOVA TABLE FOR LINEAR MODEL ##### > summary(f); anova(f) Call: lm(formula = Weight ~ Circ) Residuals: Min Q Median 3Q Max Coefficients: Estimate Std. Error t value Pr(> t ) (Intercept) e-09 *** Circ e-4 *** 2
3 --- Signif. codes: 0 *** 0.00 ** 0.0 * Residual standard error: on 2 degrees of freedom Multiple R-squared: ,Adjusted R-squared: F-statistic: 33.7 on and 2 DF, p-value: 4.203e-4 Analysis of Variance Table Response: Weight Df Sum Sq Mean Sq F value Pr(>F) Circ e-4 *** Residuals Signif. codes: 0 *** 0.00 ** 0.0 * > layout() > ############## REGRESSION LOG(WEIGHT) VS LOG(CIRC) ################# > f2=lm(log(weight)~log(circ)) > summary(f2); anova(f2) Call: lm(formula = log(weight) ~ log(circ)) Residuals: Min Q Median 3Q Max Coefficients: Estimate Std. Error t value Pr(> t ) (Intercept) *** log(circ) e-4 *** --- Signif. codes: 0 *** 0.00 ** 0.0 * Residual standard error: on 2 degrees of freedom Multiple R-squared: ,Adjusted R-squared: F-statistic: 355 on and 2 DF, p-value:.23e-4 Analysis of Variance Table Response: log(weight) Df Sum Sq Mean Sq F value Pr(>F) log(circ) e-4 *** Residuals Signif. codes: 0 *** 0.00 ** 0.0 * > ########### SCATTERPLOT WITH FITTED POWER CURVE ########################## > plot(weight~circ,main="scatter Plot of Pumpkin Weight vs. Circumference") > curve(exp(f2$coefficients[]+f2$coefficients[2]*log(x)),30,8,col=4, add=t) 3
4 > > ############# DIAGNOSTIC PLOT OF LOG-LOG REGRESSION ##################### > layout(matrix(c(,3,2,4),nrow=2)) > plot(f2) > > ################ REMOVE OBSERVATIONS 5 AND FROM DATA ################## > W=Weight[-:-5]; C=Circ[-:-5] > f3=lm(log(w)~log(c)) > summary(f3); anova(f3) Call: lm(formula = log(w) ~ log(c)) Residuals: Min Q Median 3Q Max Coefficients: Estimate Std. Error t value Pr(> t ) (Intercept) e-05 *** log(c) < 2e- *** --- Signif. codes: 0 *** 0.00 ** 0.0 * Residual standard error: on 9 degrees of freedom Multiple R-squared: 0.978,Adjusted R-squared: F-statistic: on and 9 DF, p-value: < 2.2e- Analysis of Variance Table Response: log(w) Df Sum Sq Mean Sq F value Pr(>F) log(c) < 2.2e- *** Residuals Signif. codes: 0 *** 0.00 ** 0.0 * > > ############### SCATTERPLOT OF DATA WITHOUT OBS 5 AND ############## > layout() > plot(w~c,main="scatter Plot of Pumpkin Wt vs. Circ Omitting Obs 5,") > curve(exp(f3$coefficients[]+f3$coefficients[2]*log(x)),30,8,col=4, add=t) > > ############### DIAGNOSTICS FOR REGRESSION WITHOUT POINTS 5 AND #### > layout(matrix(c(,3,2,4),nrow=2)) > plot(f3) > 4
5 Scatter Plot of Pumpkin Weight vs. Circumference Weight Circ 5
6 Residuals Residuals vs Fitted Normal Q-Q Theoretical Quantiles Scale-Location Residuals vs Leverage Cook's distance Leverage
7 Scatter Plot of Pumpkin Weight vs. Circumference Weight Circ 7
8 Residuals vs Fitted Normal Q-Q Residuals Theoretical Quantiles Scale-Location Residuals vs Leverage Cook's distance Leverage 8
9 Scatter Plot of Pumpkin Wt vs. Circ Omitting Obs 5, W C 9
10 Residuals vs Fitted Normal Q-Q Residuals Theoretical Quantiles Scale-Location Residuals vs Leverage 9 9 Cook's distance Leverage 0
Beam Example: Identifying Influential Observations using the Hat Matrix
Math 3080. Treibergs Beam Example: Identifying Influential Observations using the Hat Matrix Name: Example March 22, 204 This R c program explores influential observations and their detection using the
More informationSpectrophotometer Example: One-Factor Random Effects ANOVA
Math 3080 1. Treibergs Spectrophotometer Example: One-Factor Random Effects ANOVA Name: Example Jan. 22, 2014 Today s example was motivated from problem.14.9 of Walpole, Myers, Myers and Ye, Probability
More informationStraw Example: Randomized Block ANOVA
Math 3080 1. Treibergs Straw Example: Randomized Block ANOVA Name: Example Jan. 23, 2014 Today s example was motivated from problem 13.11.9 of Walpole, Myers, Myers and Ye, Probability and Statistics for
More informationR version ( ) Copyright (C) 2009 The R Foundation for Statistical Computing ISBN
Math 3080 1. Treibergs Bread Wrapper Data: 2 4 Incomplete Block Design. Name: Example April 26, 2010 Data File Used in this Analysis: # Math 3080-1 Bread Wrapper Data April 23, 2010 # Treibergs # # From
More informationNatural language support but running in an English locale
R version 3.2.1 (2015-06-18) -- "World-Famous Astronaut" Copyright (C) 2015 The R Foundation for Statistical Computing Platform: x86_64-apple-darwin13.4.0 (64-bit) R is free software and comes with ABSOLUTELY
More information(1) (AE) (BE) + (AB) + (CE) (AC) (BE) + (ABCE) +(DE) (AD) (BD) + (ABDE) + (CD) (ACDE) (BCDE) + (ABCD)
Math 3080 1. Treibergs Peanut xample: 2 5 alf Replication Factorial xperiment Name: xample Feb. 8, 2014 We describe a 2 5 experimental design with one half replicate per cell. It comes from a paper by
More informationMath Treibergs. Turbine Blade Example: 2 6 Factorial Name: Example Experiment with 1 4 Replication February 22, 2016
Math 3080 1. Treibergs Turbine Blade Example: 2 6 actorial Name: Example Experiment with 1 4 Replication ebruary 22, 2016 An experiment was run in the VPI Mechanical Engineering epartment to assess various
More informationR version ( ) Copyright (C) 2009 The R Foundation for Statistical Computing ISBN
Math 3080 1. Treibergs Spray Data: 2 3 Factorial with Multiple Replicates Per ell ANOVA Name: Example April 7, 2010 Data File Used in this Analysis: # Math 3081-1 Spray Data April 17,2010 # Treibergs #
More informationCarapace Example: Confidence Intervals in the Wilcoxon Sign-rank Test
Math 3080 1. Treibergs Carapace Example: Confidence Intervals in the Wilcoxon Sign-rank Test Name: Example April 19, 2014 In this discussion, we look at confidence intervals in the Wilcoxon Sign-Rank Test
More informationGas Transport Example: Confidence Intervals in the Wilcoxon Rank-Sum Test
Math 3080 1. Treibergs Gas Transport Example: Confidence Intervals in the Wilcoxon Rank-Sum Test Name: Example April 19, 014 In this discussion, we look at confidence intervals in the Wilcoxon Rank-Sum
More informationHorse Kick Example: Chi-Squared Test for Goodness of Fit with Unknown Parameters
Math 3080 1. Treibergs Horse Kick Example: Chi-Squared Test for Goodness of Fit with Unknown Parameters Name: Example April 4, 014 This R c program explores a goodness of fit test where the parameter is
More informationMath Treibergs. Peanut Oil Data: 2 5 Factorial design with 1/2 Replication. Name: Example April 22, Data File Used in this Analysis:
Math 3080 1. Treibergs Peanut Oil Data: 2 5 Factorial design with 1/2 Replication. Name: Example April 22, 2010 Data File Used in this Analysis: # Math 3080-1 Peanut Oil Data April 22, 2010 # Treibergs
More informationR version ( ) Copyright (C) 2009 The R Foundation for Statistical Computing ISBN
Math 3070 1. Treibergs Extraterrestial Life Example: One-Sample CI and Test of Proportion Name: Example May 31, 2011 The existence of intelligent life on other planets has fascinated writers, such as H.
More informationMunitions Example: Negative Binomial to Predict Number of Accidents
Math 37 1. Treibergs Munitions Eample: Negative Binomial to Predict Number of Accidents Name: Eample July 17, 211 This problem comes from Bulmer, Principles of Statistics, Dover, 1979, which is a republication
More informationR version ( ) Copyright (C) 2009 The R Foundation for Statistical Computing ISBN
Math 3070 1. Tibergs Simulating the Sampling Distribution of the t-statistic Name: Eample May 20, 2011 How well does a statistic behave when the assumptions about the underlying population fail to hold?
More information(U 338-E) 2018 Nuclear Decommissioning Cost Triennial Proceeding A Workpapers
(U 338-E) 2018 Nuclear Decommissioning Cost Triennial Proceeding A.18-03-009 Workpapers 2018 SCE Trust Fund Contributions and Financial Assumptions SCE-06 Chapter II: Burial Escalation Rate Calculation
More informationProblem 4.4 and 6.16 of Devore discuss the Rayleigh Distribution, whose pdf is x, x > 0; x x 0. x 2 f(x; θ) dx x=0. pe p dp.
Math 3070 1. Treibergs Blade Stress Example: Estimate Parameter of Rayleigh Distribution. Name: Example June 15, 2011 Problem 4.4 and 6.16 of Devore discuss the Rayleigh Distribution, whose pdf is ) x
More informationf Simulation Example: Simulating p-values of Two Sample Variance Test. Name: Example June 26, 2011 Math Treibergs
Math 3070 1. Treibergs f Simulation Example: Simulating p-values of Two Sample Variance Test. Name: Example June 26, 2011 The t-test is fairly robust with regard to actual distribution of data. But the
More informationX = S 2 = 1 n. X i. i=1
Math 3070 1. Treibergs Horse Kick Example: Confidence Interval for Poisson Parameter Name: Example July 3, 2011 The Poisson Random Variable describes the number of occurences of rare events in a period
More informationBasic Data Analysis. Stephen Turnbull Business Administration and Public Policy Lecture 9: June 6, Abstract. Regression and R.
July 22, 2013 1 ohp9 Basic Data Analysis Stephen Turnbull Business Administration and Public Policy Lecture 9: June 6, 2013 Regression and R. Abstract Regression Correlation shows the statistical strength
More informationMODELS WITHOUT AN INTERCEPT
Consider the balanced two factor design MODELS WITHOUT AN INTERCEPT Factor A 3 levels, indexed j 0, 1, 2; Factor B 5 levels, indexed l 0, 1, 2, 3, 4; n jl 4 replicate observations for each factor level
More informationR version ( ) Copyright (C) 2009 The R Foundation for Statistical Computing ISBN
Math 3070 1. Treibergs Design of a Binomial Experiment: Programming Conditional Statements Name: Example May 20, 2011 How big a sample should we test in order that the Type I and Type II error probabilities
More information> modlyq <- lm(ly poly(x,2,raw=true)) > summary(modlyq) Call: lm(formula = ly poly(x, 2, raw = TRUE))
School of Mathematical Sciences MTH5120 Statistical Modelling I Tutorial 4 Solutions The first two models were looked at last week and both had flaws. The output for the third model with log y and a quadratic
More informationMultiple Predictor Variables: ANOVA
Multiple Predictor Variables: ANOVA 1/32 Linear Models with Many Predictors Multiple regression has many predictors BUT - so did 1-way ANOVA if treatments had 2 levels What if there are multiple treatment
More informationLinear Modelling: Simple Regression
Linear Modelling: Simple Regression 10 th of Ma 2018 R. Nicholls / D.-L. Couturier / M. Fernandes Introduction: ANOVA Used for testing hpotheses regarding differences between groups Considers the variation
More informationNature vs. nurture? Lecture 18 - Regression: Inference, Outliers, and Intervals. Regression Output. Conditions for inference.
Understanding regression output from software Nature vs. nurture? Lecture 18 - Regression: Inference, Outliers, and Intervals In 1966 Cyril Burt published a paper called The genetic determination of differences
More informationRecall that a measure of fit is the sum of squared residuals: where. The F-test statistic may be written as:
1 Joint hypotheses The null and alternative hypotheses can usually be interpreted as a restricted model ( ) and an model ( ). In our example: Note that if the model fits significantly better than the restricted
More informationBiostatistics 380 Multiple Regression 1. Multiple Regression
Biostatistics 0 Multiple Regression ORIGIN 0 Multiple Regression Multiple Regression is an extension of the technique of linear regression to describe the relationship between a single dependent (response)
More informationMultiple Regression: Example
Multiple Regression: Example Cobb-Douglas Production Function The Cobb-Douglas production function for observed economic data i = 1,..., n may be expressed as where O i is output l i is labour input c
More informationPsychology 405: Psychometric Theory
Psychology 405: Psychometric Theory Homework Problem Set #2 Department of Psychology Northwestern University Evanston, Illinois USA April, 2017 1 / 15 Outline The problem, part 1) The Problem, Part 2)
More informationInference for Regression
Inference for Regression Section 9.4 Cathy Poliak, Ph.D. cathy@math.uh.edu Office in Fleming 11c Department of Mathematics University of Houston Lecture 13b - 3339 Cathy Poliak, Ph.D. cathy@math.uh.edu
More informationSTAT 572 Assignment 5 - Answers Due: March 2, 2007
1. The file glue.txt contains a data set with the results of an experiment on the dry sheer strength (in pounds per square inch) of birch plywood, bonded with 5 different resin glues A, B, C, D, and E.
More informationST430 Exam 2 Solutions
ST430 Exam 2 Solutions Date: November 9, 2015 Name: Guideline: You may use one-page (front and back of a standard A4 paper) of notes. No laptop or textbook are permitted but you may use a calculator. Giving
More informationCAS MA575 Linear Models
CAS MA575 Linear Models Boston University, Fall 2013 Midterm Exam (Correction) Instructor: Cedric Ginestet Date: 22 Oct 2013. Maximal Score: 200pts. Please Note: You will only be graded on work and answers
More informationStatistiek II. John Nerbonne. March 17, Dept of Information Science incl. important reworkings by Harmut Fitz
Dept of Information Science j.nerbonne@rug.nl incl. important reworkings by Harmut Fitz March 17, 2015 Review: regression compares result on two distinct tests, e.g., geographic and phonetic distance of
More information1 Use of indicator random variables. (Chapter 8)
1 Use of indicator random variables. (Chapter 8) let I(A) = 1 if the event A occurs, and I(A) = 0 otherwise. I(A) is referred to as the indicator of the event A. The notation I A is often used. 1 2 Fitting
More informationSimple Linear Regression
Simple Linear Regression MATH 282A Introduction to Computational Statistics University of California, San Diego Instructor: Ery Arias-Castro http://math.ucsd.edu/ eariasca/math282a.html MATH 282A University
More informationIntroduction and Single Predictor Regression. Correlation
Introduction and Single Predictor Regression Dr. J. Kyle Roberts Southern Methodist University Simmons School of Education and Human Development Department of Teaching and Learning Correlation A correlation
More informationMath 2311 Written Homework 6 (Sections )
Math 2311 Written Homework 6 (Sections 5.4 5.6) Name: PeopleSoft ID: Instructions: Homework will NOT be accepted through email or in person. Homework must be submitted through CourseWare BEFORE the deadline.
More informationRegression Analysis lab 3. 1 Multiple linear regression. 1.1 Import data. 1.2 Scatterplot matrix
Regression Analysis lab 3 1 Multiple linear regression 1.1 Import data delivery
More informationChapter 3 - Linear Regression
Chapter 3 - Linear Regression Lab Solution 1 Problem 9 First we will read the Auto" data. Note that most datasets referred to in the text are in the R package the authors developed. So we just need to
More informationAnalytics 512: Homework # 2 Tim Ahn February 9, 2016
Analytics 512: Homework # 2 Tim Ahn February 9, 2016 Chapter 3 Problem 1 (# 3) Suppose we have a data set with five predictors, X 1 = GP A, X 2 = IQ, X 3 = Gender (1 for Female and 0 for Male), X 4 = Interaction
More informationLinear Model Specification in R
Linear Model Specification in R How to deal with overparameterisation? Paul Janssen 1 Luc Duchateau 2 1 Center for Statistics Hasselt University, Belgium 2 Faculty of Veterinary Medicine Ghent University,
More informationGeneral Linear Statistical Models
General Linear Statistical Models Statistics 135 Autumn 2005 Copyright c 2005 by Mark E. Irwin This framework includes General Linear Statistical Models Linear Regression Analysis of Variance (ANOVA) Analysis
More informationLecture 2. Simple linear regression
Lecture 2. Simple linear regression Jesper Rydén Department of Mathematics, Uppsala University jesper@math.uu.se Regression and Analysis of Variance autumn 2014 Overview of lecture Introduction, short
More informationIES 612/STA 4-573/STA Winter 2008 Week 1--IES 612-STA STA doc
IES 612/STA 4-573/STA 4-576 Winter 2008 Week 1--IES 612-STA 4-573-STA 4-576.doc Review Notes: [OL] = Ott & Longnecker Statistical Methods and Data Analysis, 5 th edition. [Handouts based on notes prepared
More informationIntroduction to Mixed Models in R
Introduction to Mixed Models in R Galin Jones School of Statistics University of Minnesota http://www.stat.umn.edu/ galin March 2011 Second in a Series Sponsored by Quantitative Methods Collaborative.
More informationFigure 1: The fitted line using the shipment route-number of ampules data. STAT5044: Regression and ANOVA The Solution of Homework #2 Inyoung Kim
0.0 1.0 1.5 2.0 2.5 3.0 8 10 12 14 16 18 20 22 y x Figure 1: The fitted line using the shipment route-number of ampules data STAT5044: Regression and ANOVA The Solution of Homework #2 Inyoung Kim Problem#
More informationStat 401B Final Exam Fall 2015
Stat 401B Final Exam Fall 015 I have neither given nor received unauthorized assistance on this exam. Name Signed Date Name Printed ATTENTION! Incorrect numerical answers unaccompanied by supporting reasoning
More informationVariance Decomposition in Regression James M. Murray, Ph.D. University of Wisconsin - La Crosse Updated: October 04, 2017
Variance Decomposition in Regression James M. Murray, Ph.D. University of Wisconsin - La Crosse Updated: October 04, 2017 PDF file location: http://www.murraylax.org/rtutorials/regression_anovatable.pdf
More informationVariance Decomposition and Goodness of Fit
Variance Decomposition and Goodness of Fit 1. Example: Monthly Earnings and Years of Education In this tutorial, we will focus on an example that explores the relationship between total monthly earnings
More informationStat 412/512 TWO WAY ANOVA. Charlotte Wickham. stat512.cwick.co.nz. Feb
Stat 42/52 TWO WAY ANOVA Feb 6 25 Charlotte Wickham stat52.cwick.co.nz Roadmap DONE: Understand what a multiple regression model is. Know how to do inference on single and multiple parameters. Some extra
More informationComparing Nested Models
Comparing Nested Models ST 370 Two regression models are called nested if one contains all the predictors of the other, and some additional predictors. For example, the first-order model in two independent
More informationLecture 10. Factorial experiments (2-way ANOVA etc)
Lecture 10. Factorial experiments (2-way ANOVA etc) Jesper Rydén Matematiska institutionen, Uppsala universitet jesper@math.uu.se Regression and Analysis of Variance autumn 2014 A factorial experiment
More informationST430 Exam 1 with Answers
ST430 Exam 1 with Answers Date: October 5, 2015 Name: Guideline: You may use one-page (front and back of a standard A4 paper) of notes. No laptop or textook are permitted but you may use a calculator.
More information1 Multiple Regression
1 Multiple Regression In this section, we extend the linear model to the case of several quantitative explanatory variables. There are many issues involved in this problem and this section serves only
More informationStat 411/511 ESTIMATING THE SLOPE AND INTERCEPT. Charlotte Wickham. stat511.cwick.co.nz. Nov
Stat 411/511 ESTIMATING THE SLOPE AND INTERCEPT Nov 20 2015 Charlotte Wickham stat511.cwick.co.nz Quiz #4 This weekend, don t forget. Usual format Assumptions Display 7.5 p. 180 The ideal normal, simple
More informationStat 5303 (Oehlert): Randomized Complete Blocks 1
Stat 5303 (Oehlert): Randomized Complete Blocks 1 > library(stat5303libs);library(cfcdae);library(lme4) > immer Loc Var Y1 Y2 1 UF M 81.0 80.7 2 UF S 105.4 82.3 3 UF V 119.7 80.4 4 UF T 109.7 87.2 5 UF
More informationLab 3 A Quick Introduction to Multiple Linear Regression Psychology The Multiple Linear Regression Model
Lab 3 A Quick Introduction to Multiple Linear Regression Psychology 310 Instructions.Work through the lab, saving the output as you go. You will be submitting your assignment as an R Markdown document.
More informationWorkshop 7.4a: Single factor ANOVA
-1- Workshop 7.4a: Single factor ANOVA Murray Logan November 23, 2016 Table of contents 1 Revision 1 2 Anova Parameterization 2 3 Partitioning of variance (ANOVA) 10 4 Worked Examples 13 1. Revision 1.1.
More informationTests of Linear Restrictions
Tests of Linear Restrictions 1. Linear Restricted in Regression Models In this tutorial, we consider tests on general linear restrictions on regression coefficients. In other tutorials, we examine some
More informationBooklet of Code and Output for STAC32 Final Exam
Booklet of Code and Output for STAC32 Final Exam December 7, 2017 Figure captions are below the Figures they refer to. LowCalorie LowFat LowCarbo Control 8 2 3 2 9 4 5 2 6 3 4-1 7 5 2 0 3 1 3 3 Figure
More informationUNIVERSITY OF MASSACHUSETTS. Department of Mathematics and Statistics. Basic Exam - Applied Statistics. Tuesday, January 17, 2017
UNIVERSITY OF MASSACHUSETTS Department of Mathematics and Statistics Basic Exam - Applied Statistics Tuesday, January 17, 2017 Work all problems 60 points are needed to pass at the Masters Level and 75
More informationThe Statistical Sleuth in R: Chapter 5
The Statistical Sleuth in R: Chapter 5 Linda Loi Kate Aloisio Ruobing Zhang Nicholas J. Horton January 21, 2013 Contents 1 Introduction 1 2 Diet and lifespan 2 2.1 Summary statistics and graphical display........................
More informationMotor Trend Car Road Analysis
Motor Trend Car Road Analysis Zakia Sultana February 28, 2016 Executive Summary You work for Motor Trend, a magazine about the automobile industry. Looking at a data set of a collection of cars, they are
More informationData Analysis Using R ASC & OIR
Data Analysis Using R ASC & OIR Overview } What is Statistics and the process of study design } Correlation } Simple Linear Regression } Multiple Linear Regression 2 What is Statistics? Statistics is a
More informationMS&E 226: Small Data
MS&E 226: Small Data Lecture 15: Examples of hypothesis tests (v5) Ramesh Johari ramesh.johari@stanford.edu 1 / 32 The recipe 2 / 32 The hypothesis testing recipe In this lecture we repeatedly apply the
More informationStat 5303 (Oehlert): Balanced Incomplete Block Designs 1
Stat 5303 (Oehlert): Balanced Incomplete Block Designs 1 > library(stat5303libs);library(cfcdae);library(lme4) > weardata
More informationStat 5102 Final Exam May 14, 2015
Stat 5102 Final Exam May 14, 2015 Name Student ID The exam is closed book and closed notes. You may use three 8 1 11 2 sheets of paper with formulas, etc. You may also use the handouts on brand name distributions
More informationLecture 18: Simple Linear Regression
Lecture 18: Simple Linear Regression BIOS 553 Department of Biostatistics University of Michigan Fall 2004 The Correlation Coefficient: r The correlation coefficient (r) is a number that measures the strength
More informationChapter 8 Conclusion
1 Chapter 8 Conclusion Three questions about test scores (score) and student-teacher ratio (str): a) After controlling for differences in economic characteristics of different districts, does the effect
More informationHomework 9 Sample Solution
Homework 9 Sample Solution # 1 (Ex 9.12, Ex 9.23) Ex 9.12 (a) Let p vitamin denote the probability of having cold when a person had taken vitamin C, and p placebo denote the probability of having cold
More informationGeneral Linear Statistical Models - Part III
General Linear Statistical Models - Part III Statistics 135 Autumn 2005 Copyright c 2005 by Mark E. Irwin Interaction Models Lets examine two models involving Weight and Domestic in the cars93 dataset.
More informationSCHOOL OF MATHEMATICS AND STATISTICS
RESTRICTED OPEN BOOK EXAMINATION (Not to be removed from the examination hall) Data provided: Statistics Tables by H.R. Neave MAS5052 SCHOOL OF MATHEMATICS AND STATISTICS Basic Statistics Spring Semester
More informationStat 401B Exam 2 Fall 2015
Stat 401B Exam Fall 015 I have neither given nor received unauthorized assistance on this exam. Name Signed Date Name Printed ATTENTION! Incorrect numerical answers unaccompanied by supporting reasoning
More informationSecond Midterm Exam Name: Solutions March 19, 2014
Math 3080 1. Treibergs σιι Second Midterm Exam Name: Solutions March 19, 2014 (1. The article Withdrawl Strength of Threaded Nails, in Journal of Structural Engineering, 2001, describes an experiment to
More informationBooklet of Code and Output for STAC32 Final Exam
Booklet of Code and Output for STAC32 Final Exam December 8, 2014 List of Figures in this document by page: List of Figures 1 Popcorn data............................. 2 2 MDs by city, with normal quantile
More informationRegression. Marc H. Mehlman University of New Haven
Regression Marc H. Mehlman marcmehlman@yahoo.com University of New Haven the statistician knows that in nature there never was a normal distribution, there never was a straight line, yet with normal and
More informationSTAT 3022 Spring 2007
Simple Linear Regression Example These commands reproduce what we did in class. You should enter these in R and see what they do. Start by typing > set.seed(42) to reset the random number generator so
More informationStat 401B Exam 2 Fall 2016
Stat 40B Eam Fall 06 I have neither given nor received unauthorized assistance on this eam. Name Signed Date Name Printed ATTENTION! Incorrect numerical answers unaccompanied by supporting reasoning will
More informationUNIVERSITY OF MASSACHUSETTS Department of Mathematics and Statistics Basic Exam - Applied Statistics January, 2018
UNIVERSITY OF MASSACHUSETTS Department of Mathematics and Statistics Basic Exam - Applied Statistics January, 2018 Work all problems. 60 points needed to pass at the Masters level, 75 to pass at the PhD
More informationNC Births, ANOVA & F-tests
Math 158, Spring 2018 Jo Hardin Multiple Regression II R code Decomposition of Sums of Squares (and F-tests) NC Births, ANOVA & F-tests A description of the data is given at http://pages.pomona.edu/~jsh04747/courses/math58/
More informationSTAT 215 Confidence and Prediction Intervals in Regression
STAT 215 Confidence and Prediction Intervals in Regression Colin Reimer Dawson Oberlin College 24 October 2016 Outline Regression Slope Inference Partitioning Variability Prediction Intervals Reminder:
More informationThe Statistical Sleuth in R: Chapter 13
The Statistical Sleuth in R: Chapter 13 Linda Loi Kate Aloisio Ruobing Zhang Nicholas J. Horton June 15, 2016 Contents 1 Introduction 1 2 Intertidal seaweed grazers 2 2.1 Data coding, summary statistics
More informationDealing with Heteroskedasticity
Dealing with Heteroskedasticity James H. Steiger Department of Psychology and Human Development Vanderbilt University James H. Steiger (Vanderbilt University) Dealing with Heteroskedasticity 1 / 27 Dealing
More informationMultiple Predictor Variables: ANOVA
What if you manipulate two factors? Multiple Predictor Variables: ANOVA Block 1 Block 2 Block 3 Block 4 A B C D B C D A C D A B D A B C Randomized Controlled Blocked Design: Design where each treatment
More information13 Simple Linear Regression
B.Sc./Cert./M.Sc. Qualif. - Statistics: Theory and Practice 3 Simple Linear Regression 3. An industrial example A study was undertaken to determine the effect of stirring rate on the amount of impurity
More informationRegression on Faithful with Section 9.3 content
Regression on Faithful with Section 9.3 content The faithful data frame contains 272 obervational units with variables waiting and eruptions measuring, in minutes, the amount of wait time between eruptions,
More informationThe Statistical Sleuth in R: Chapter 13
The Statistical Sleuth in R: Chapter 13 Kate Aloisio Ruobing Zhang Nicholas J. Horton June 15, 2016 Contents 1 Introduction 1 2 Intertidal seaweed grazers 2 2.1 Data coding, summary statistics and graphical
More information36-707: Regression Analysis Homework Solutions. Homework 3
36-707: Regression Analysis Homework Solutions Homework 3 Fall 2012 Problem 1 Y i = βx i + ɛ i, i {1, 2,..., n}. (a) Find the LS estimator of β: RSS = Σ n i=1(y i βx i ) 2 RSS β = Σ n i=1( 2X i )(Y i βx
More informationR Output for Linear Models using functions lm(), gls() & glm()
LM 04 lm(), gls() &glm() 1 R Output for Linear Models using functions lm(), gls() & glm() Different kinds of output related to linear models can be obtained in R using function lm() {stats} in the base
More informationAnalysis of variance. Gilles Guillot. September 30, Gilles Guillot September 30, / 29
Analysis of variance Gilles Guillot gigu@dtu.dk September 30, 2013 Gilles Guillot (gigu@dtu.dk) September 30, 2013 1 / 29 1 Introductory example 2 One-way ANOVA 3 Two-way ANOVA 4 Two-way ANOVA with interactions
More informationLecture 1: Linear Models and Applications
Lecture 1: Linear Models and Applications Claudia Czado TU München c (Claudia Czado, TU Munich) ZFS/IMS Göttingen 2004 0 Overview Introduction to linear models Exploratory data analysis (EDA) Estimation
More informationExercise 2 SISG Association Mapping
Exercise 2 SISG Association Mapping Load the bpdata.csv data file into your R session. LHON.txt data file into your R session. Can read the data directly from the website if your computer is connected
More informationThe Big Picture. Model Modifications. Example (cont.) Bacteria Count Example
The Big Picture Remedies after Model Diagnostics The Big Picture Model Modifications Bret Larget Departments of Botany and of Statistics University of Wisconsin Madison February 6, 2007 Residual plots
More informationBIOSTATS 640 Spring 2018 Unit 2. Regression and Correlation (Part 1 of 2) R Users
BIOSTATS 640 Spring 08 Unit. Regression and Correlation (Part of ) R Users Unit Regression and Correlation of - Practice Problems Solutions R Users. In this exercise, you will gain some practice doing
More informationSTATISTICS 479 Exam II (100 points)
Name STATISTICS 79 Exam II (1 points) 1. A SAS data set was created using the following input statement: Answer parts(a) to (e) below. input State $ City $ Pop199 Income Housing Electric; (a) () Give the
More informationExam 3 Practice Questions Psych , Fall 9
Vocabular Eam 3 Practice Questions Psch 3101-100, Fall 9 Rather than choosing some practice terms at random, I suggest ou go through all the terms in the vocabular lists. The real eam will ask for definitions
More informationMultiple Regression Part I STAT315, 19-20/3/2014
Multiple Regression Part I STAT315, 19-20/3/2014 Regression problem Predictors/independent variables/features Or: Error which can never be eliminated. Our task is to estimate the regression function f.
More informationRegression and Models with Multiple Factors. Ch. 17, 18
Regression and Models with Multiple Factors Ch. 17, 18 Mass 15 20 25 Scatter Plot 70 75 80 Snout-Vent Length Mass 15 20 25 Linear Regression 70 75 80 Snout-Vent Length Least-squares The method of least
More information