REGRESSION AND CORRELATION ANALYSIS

Size: px
Start display at page:

Download "REGRESSION AND CORRELATION ANALYSIS"

Transcription

1 Problem 1 Problem 2 A group of 625 students has a mean age of 15.8 years with a standard devia>on of 0.6 years. The ages are normally distributed. How many students are younger than 16.2 years?

2 REGRESSION AND CORRELATION ANALYSIS

3 Regression analysis: From experimental or theore>cal work we know the func>on- like rela>onship between one or more variables, but the constants in equa>ons are not necessarily known. Regression analysis provides techniques to determine the constants with a given reliability intervals. Correla:on analysis: This sta>s>cal method determines a constant (correla>on coefficient) that measures the strength of the rela>onship between variables; this method is the correla>on analysis.

4 It is possible that there is no lawful connec>on between the variables, moreover, the independent variable is not even con>nuous. Also in this case we can try to set up a func>on- like rela>on between the variables, which is the regression func3on or equa3on. It is important to note that a significant regression rela>on between the variables does not necessarily mean a causal rela3onship;

5 The ra>onale in this correla>on is that small shoe size (English unit of measurement) may indicate a possible birth difficulty due to a small pelvis. The analysis of data gives significant results, as there is strong evidence of a trend between the data (Figure 12.2). This rela>on is not directly causal, trend is a convenient indicator for doctors that may occasionally indicate a problem.

6

7 Linear regression A special but at the same >me important case of regression analysis occurs when there is a linear rela>on between one or more variables. The graphical image of linear rela>on in case of two variables is a straight line, or regression line, in case of more variables we talk about regression plane. The corresponding value pairs of two non- independent random variables(x, Y) should be: If we label the independent variable X, and the dependent variable Y, then the equa>on characterising the linear rela>onship is the following: where a and b are the already known constants (a is the intercept, b is the slope of straight line), E is a random variable, its mean is 0. E is the variable error that represents a part of Y, which cannot be explained by X.

8 A legkisebb négyzetek módszere The best possible fit is if we choose the posi>on when the sum of differences around the line Y is minimal, meaning that the difference that cannot be explained by regression should be the smallest possible.

9 The equa>on of the line: By choosing the a and b parameters well we are able to reach that the sum of squares of random devia>ons be minimal about the Y = a + b x line: As a result of the mathema>cal method the slope of the line is: is called the regression equa3on of variable Y regarding variable X.

10

11 Standard deviáció számítása In order to calculate the standard deviance of b regression coefficient, let s compare the dispersion of yi about with the dispersion of the points of the regression line (Yi) about

12 The standard error of regression es3mate is given by the residual sum of squares, the variability unexplained by regression:

13

14 A regressziós együ[hatóra vonatkozó hipotézisvizsgálat The devia>on unexplained by regression in most cases can be a[ributed to random error, therefore, we can assume that the y i measurement result for different x i shows normal distribu>on about the real or popula3on regression line, and the variance does not depend on x. Any func>on of a random variable is a random variable, therefore a, b and Y are also random variables. It can be proved that the regression coefficient b is a normally distributed random variable, thus the b/se(b) random variable follows a t- distribu3on with (n- 2) degrees of freedom. With the help of the null hypothesis for the popula>on regression coefficient (β) we can analyse whether the rela>on between the two variables is a real correla>on or seemingly real.

15 1. T- teszt Null hypothesis: the devia>on of the popula>on s βregression coefficient from zero is due to random effects. Alterna3ve hypothesis: theβregression coefficient refers to the real rela>on between the two variables. Since b can be both posi>ve and nega>ve, the probability has to be checked on both ends of the distribu>on (two- sided hypothesis tes3ng).

16 2. ANOVA

17 Null hypothesis: the two variances are from the same popula>on, the variance explained by regression and the residual variance may differ from each other due to random effects. Alterna3ve hypothesis: the two variances are from different popula>ons, the rela>on between the two variables is a real correla>on.

18

19

20 Null hypothesis: there is no rela>on between the dose of an>coagulant and the prothrombin >me. An equal defini>on to this: the coefficient of popula>on regression line is zero: H0: β = 0 Alterna3ve hypothesis: the prothrombin >me linearly depends on the an>coagulant concentra>on that means the coefficient of popula>on regression line is not zero: H1: β 0 We can use both the t- test and the F- test for hypothesis tes>ng. The test sta>s>c for the regression coefficient is:

21 At a P = 0.05 probability level in case of variances with f1 = 1and f2 = 10 degrees of freedom the table for the F sta>s>c: At a P = 0.05 probability level we refuse the null hypothesis and accept the alterna>ve hypothesis, meaning that the two variance cannot come from the same popula>on. It means again that there is a real correla>on between the two variables.

22 Least squares principle The method provided by the least squares principle is oeen applied in such measurement procedures where physical, chemical or biological data pairs are measured, and it is assumed that there is a known func>onal rela>onship between the observed variables. The parameters of the selected func>on can be determined by minimalizing the sum of squared devia>ons of the observed data and the data fi[ed with the help of the func>on. In the form of an equa>on it is: where n is the number of the data pairs, Y i is the value of the func>on for the(x i, y i )data pair. The summa>on includes all data pairs. The least squares principle can be applied to such y = f(x,α,β,γ) model func>ons where - (α, β, γ) are the parameters of the func>on that fulfil the following condi>ons: (1) the distribu>on of variable y approximates normal distribu>on about the observed Y value of the real model func>on for any values of variable x; (2) the variance of variable y does not depend on the selec>on of x, meaning that the precision of determining variable y does not depend on x;

23 Logis>c regression (dose- response problem) The main concern of the medical treatment of diseases is how the pa>ents suffering from the same disease react to the treatment with the same medica>on. It is obvious that due to the biological variability the response to the dose is different; some pa>ents exhibit the same response for lower doses while others for higher dose.

24 Test for checking the linearity of regression In a case where regression is considered the first thing we assume between the variables X and Y is a linear rela>onship, and then try to do the analysis based on the linear model. This model is oeen not adequate for solving the problem. A quick and simple model to check linearity is based on hypothesis tes>ng. The method analyzes the randomness of the sign sequence differences of the serialised yi Yi. If it is a random sample, then the linear approach is effec>ve.

25 The average systolic blood pressure of young females ages between 8 and 20 follows non- linear rela>on. Analysing the yi Yi differences, amongst the higher age values the differences appear to be nega>ve. A more detailed observa>on would show that the linear model is not suitable for analysis in this age group. An effec>ve choice would be the parabolic quadric surface fihng.

Data Processing Techniques

Data Processing Techniques Universitas Gadjah Mada Department of Civil and Environmental Engineering Master in Engineering in Natural Disaster Management Data Processing Techniques Hypothesis Tes,ng 1 Hypothesis Testing Mathema,cal

More information

Some Review and Hypothesis Tes4ng. Friday, March 15, 13

Some Review and Hypothesis Tes4ng. Friday, March 15, 13 Some Review and Hypothesis Tes4ng Outline Discussing the homework ques4ons from Joey and Phoebe Review of Sta4s4cal Inference Proper4es of OLS under the normality assump4on Confidence Intervals, T test,

More information

Linear Regression and Correla/on. Correla/on and Regression Analysis. Three Ques/ons 9/14/14. Chapter 13. Dr. Richard Jerz

Linear Regression and Correla/on. Correla/on and Regression Analysis. Three Ques/ons 9/14/14. Chapter 13. Dr. Richard Jerz Linear Regression and Correla/on Chapter 13 Dr. Richard Jerz 1 Correla/on and Regression Analysis Correla/on Analysis is the study of the rela/onship between variables. It is also defined as group of techniques

More information

Linear Regression and Correla/on

Linear Regression and Correla/on Linear Regression and Correla/on Chapter 13 Dr. Richard Jerz 1 Correla/on and Regression Analysis Correla/on Analysis is the study of the rela/onship between variables. It is also defined as group of techniques

More information

Correla'on. Keegan Korthauer Department of Sta's'cs UW Madison

Correla'on. Keegan Korthauer Department of Sta's'cs UW Madison Correla'on Keegan Korthauer Department of Sta's'cs UW Madison 1 Rela'onship Between Two Con'nuous Variables When we have measured two con$nuous random variables for each item in a sample, we can study

More information

Announcements. Topics: Homework: - sec0ons 1.2, 1.3, and 2.1 * Read these sec0ons and study solved examples in your textbook!

Announcements. Topics: Homework: - sec0ons 1.2, 1.3, and 2.1 * Read these sec0ons and study solved examples in your textbook! Announcements Topics: - sec0ons 1.2, 1.3, and 2.1 * Read these sec0ons and study solved examples in your textbook! Homework: - review lecture notes thoroughly - work on prac0ce problems from the textbook

More information

1. Introduc9on 2. Bivariate Data 3. Linear Analysis of Data

1. Introduc9on 2. Bivariate Data 3. Linear Analysis of Data Lecture 3: Bivariate Data & Linear Regression 1. Introduc9on 2. Bivariate Data 3. Linear Analysis of Data a) Freehand Linear Fit b) Least Squares Fit c) Interpola9on/Extrapola9on 4. Correla9on 1. Introduc9on

More information

Garvan Ins)tute Biosta)s)cal Workshop 16/6/2015. Tuan V. Nguyen. Garvan Ins)tute of Medical Research Sydney, Australia

Garvan Ins)tute Biosta)s)cal Workshop 16/6/2015. Tuan V. Nguyen. Garvan Ins)tute of Medical Research Sydney, Australia Garvan Ins)tute Biosta)s)cal Workshop 16/6/2015 Tuan V. Nguyen Tuan V. Nguyen Garvan Ins)tute of Medical Research Sydney, Australia Introduction to linear regression analysis Purposes Ideas of regression

More information

Announcements. Topics: Work On: - sec0ons 1.2 and 1.3 * Read these sec0ons and study solved examples in your textbook!

Announcements. Topics: Work On: - sec0ons 1.2 and 1.3 * Read these sec0ons and study solved examples in your textbook! Announcements Topics: - sec0ons 1.2 and 1.3 * Read these sec0ons and study solved examples in your textbook! Work On: - Prac0ce problems from the textbook and assignments from the coursepack as assigned

More information

Class Notes. Examining Repeated Measures Data on Individuals

Class Notes. Examining Repeated Measures Data on Individuals Ronald Heck Week 12: Class Notes 1 Class Notes Examining Repeated Measures Data on Individuals Generalized linear mixed models (GLMM) also provide a means of incorporang longitudinal designs with categorical

More information

CS 6140: Machine Learning Spring 2016

CS 6140: Machine Learning Spring 2016 CS 6140: Machine Learning Spring 2016 Instructor: Lu Wang College of Computer and Informa?on Science Northeastern University Webpage: www.ccs.neu.edu/home/luwang Email: luwang@ccs.neu.edu Logis?cs Assignment

More information

Two sample Test. Paired Data : Δ = 0. Lecture 3: Comparison of Means. d s d where is the sample average of the differences and is the

Two sample Test. Paired Data : Δ = 0. Lecture 3: Comparison of Means. d s d where is the sample average of the differences and is the Gene$cs 300: Sta$s$cal Analysis of Biological Data Lecture 3: Comparison of Means Two sample t test Analysis of variance Type I and Type II errors Power More R commands September 23, 2010 Two sample Test

More information

Data files for today. CourseEvalua2on2.sav pontokprediktorok.sav Happiness.sav Ca;erplot.sav

Data files for today. CourseEvalua2on2.sav pontokprediktorok.sav Happiness.sav Ca;erplot.sav Correlation Data files for today CourseEvalua2on2.sav pontokprediktorok.sav Happiness.sav Ca;erplot.sav Defining Correlation Co-variation or co-relation between two variables These variables change together

More information

CS 6140: Machine Learning Spring What We Learned Last Week. Survey 2/26/16. VS. Model

CS 6140: Machine Learning Spring What We Learned Last Week. Survey 2/26/16. VS. Model Logis@cs CS 6140: Machine Learning Spring 2016 Instructor: Lu Wang College of Computer and Informa@on Science Northeastern University Webpage: www.ccs.neu.edu/home/luwang Email: luwang@ccs.neu.edu Assignment

More information

Regression Part II. One- factor ANOVA Another dummy variable coding scheme Contrasts Mul?ple comparisons Interac?ons

Regression Part II. One- factor ANOVA Another dummy variable coding scheme Contrasts Mul?ple comparisons Interac?ons Regression Part II One- factor ANOVA Another dummy variable coding scheme Contrasts Mul?ple comparisons Interac?ons One- factor Analysis of variance Categorical Explanatory variable Quan?ta?ve Response

More information

Short introduc,on to the

Short introduc,on to the OXFORD NEUROIMAGING PRIMERS Short introduc,on to the An General Introduction Linear Model to Neuroimaging for Neuroimaging Analysis Mark Jenkinson Mark Jenkinson Janine Michael Bijsterbosch Chappell Michael

More information

Test 3 Practice Test A. NOTE: Ignore Q10 (not covered)

Test 3 Practice Test A. NOTE: Ignore Q10 (not covered) Test 3 Practice Test A NOTE: Ignore Q10 (not covered) MA 180/418 Midterm Test 3, Version A Fall 2010 Student Name (PRINT):............................................. Student Signature:...................................................

More information

One- factor ANOVA. F Ra5o. If H 0 is true. F Distribu5on. If H 1 is true 5/25/12. One- way ANOVA: A supersized independent- samples t- test

One- factor ANOVA. F Ra5o. If H 0 is true. F Distribu5on. If H 1 is true 5/25/12. One- way ANOVA: A supersized independent- samples t- test F Ra5o F = variability between groups variability within groups One- factor ANOVA If H 0 is true random error F = random error " µ F =1 If H 1 is true random error +(treatment effect)2 F = " µ F >1 random

More information

Graphical Models. Lecture 3: Local Condi6onal Probability Distribu6ons. Andrew McCallum

Graphical Models. Lecture 3: Local Condi6onal Probability Distribu6ons. Andrew McCallum Graphical Models Lecture 3: Local Condi6onal Probability Distribu6ons Andrew McCallum mccallum@cs.umass.edu Thanks to Noah Smith and Carlos Guestrin for some slide materials. 1 Condi6onal Probability Distribu6ons

More information

Last Lecture Recap UVA CS / Introduc8on to Machine Learning and Data Mining. Lecture 3: Linear Regression

Last Lecture Recap UVA CS / Introduc8on to Machine Learning and Data Mining. Lecture 3: Linear Regression UVA CS 4501-001 / 6501 007 Introduc8on to Machine Learning and Data Mining Lecture 3: Linear Regression Yanjun Qi / Jane University of Virginia Department of Computer Science 1 Last Lecture Recap q Data

More information

Overview: In addi:on to considering various summary sta:s:cs, it is also common to consider some visual display of the data Outline:

Overview: In addi:on to considering various summary sta:s:cs, it is also common to consider some visual display of the data Outline: Lecture 2: Visual Display of Data Overview: In addi:on to considering various summary sta:s:cs, it is also common to consider some visual display of the data Outline: 1. Histograms 2. ScaCer Plots 3. Assignment

More information

z-scores z-scores z-scores and the Normal Distribu4on PSYC 300A - Lecture 3 Dr. J. Nicol

z-scores z-scores z-scores and the Normal Distribu4on PSYC 300A - Lecture 3 Dr. J. Nicol z-scores and the Normal Distribu4on PSYC 300A - Lecture 3 Dr. J. Nicol z-scores Knowing a raw score does not inform us about the rela4ve loca4on of that score in the distribu4on The rela4ve loca4on of

More information

Example: Data from the Child Health and Development Study

Example: Data from the Child Health and Development Study Example: Data from the Child Health and Development Study Can we use linear regression to examine how well length of gesta:onal period predicts birth weight? First look at the sca@erplot: Does a linear

More information

Least Squares Parameter Es.ma.on

Least Squares Parameter Es.ma.on Least Squares Parameter Es.ma.on Alun L. Lloyd Department of Mathema.cs Biomathema.cs Graduate Program North Carolina State University Aims of this Lecture 1. Model fifng using least squares 2. Quan.fica.on

More information

Chapter 11: Linear Regression and Correla4on. Correla4on

Chapter 11: Linear Regression and Correla4on. Correla4on Chapter 11: Linear Regression and Correla4on Regression analysis is a sta3s3cal tool that u3lizes the rela3on between two or more quan3ta3ve variables so that one variable can be predicted from the other,

More information

A mul&scale autocorrela&on func&on for anisotropy studies

A mul&scale autocorrela&on func&on for anisotropy studies A mul&scale autocorrela&on func&on for anisotropy studies Mario Scuderi 1, M. De Domenico, H Lyberis and A. Insolia 1 Department of Physics and Astronomy & INFN Catania University ITALY DAA2011 Erice,

More information

Outline. What is Machine Learning? Why Machine Learning? 9/29/08. Machine Learning Approaches to Biological Research: Bioimage Informa>cs and Beyond

Outline. What is Machine Learning? Why Machine Learning? 9/29/08. Machine Learning Approaches to Biological Research: Bioimage Informa>cs and Beyond Outline Machine Learning Approaches to Biological Research: Bioimage Informa>cs and Beyond Robert F. Murphy External Senior Fellow, Freiburg Ins>tute for Advanced Studies Ray and Stephanie Lane Professor

More information

Causal Inference and Response Surface Modeling. Inference and Representa6on DS- GA Fall 2015 Guest lecturer: Uri Shalit

Causal Inference and Response Surface Modeling. Inference and Representa6on DS- GA Fall 2015 Guest lecturer: Uri Shalit Causal Inference and Response Surface Modeling Inference and Representa6on DS- GA- 1005 Fall 2015 Guest lecturer: Uri Shalit What is Causal Inference? source: xkcd.com/552/ 2/53 Causal ques6ons as counterfactual

More information

Garvan Ins)tute Biosta)s)cal Workshop 16/7/2015. Tuan V. Nguyen. Garvan Ins)tute of Medical Research Sydney, Australia

Garvan Ins)tute Biosta)s)cal Workshop 16/7/2015. Tuan V. Nguyen. Garvan Ins)tute of Medical Research Sydney, Australia Garvan Ins)tute Biosta)s)cal Workshop 16/7/2015 Tuan V. Nguyen Tuan V. Nguyen Garvan Ins)tute of Medical Research Sydney, Australia Analysis of variance Between- group and within- group varia)on explained

More information

Statistical Models for sequencing data: from Experimental Design to Generalized Linear Models

Statistical Models for sequencing data: from Experimental Design to Generalized Linear Models Best practices in the analysis of RNA-Seq and CHiP-Seq data 4 th -5 th May 2017 University of Cambridge, Cambridge, UK Statistical Models for sequencing data: from Experimental Design to Generalized Linear

More information

Sta$s$cal Significance Tes$ng In Theory and In Prac$ce

Sta$s$cal Significance Tes$ng In Theory and In Prac$ce Sta$s$cal Significance Tes$ng In Theory and In Prac$ce Ben Cartere8e University of Delaware h8p://ir.cis.udel.edu/ictir13tutorial Hypotheses and Experiments Hypothesis: Using an SVM for classifica$on will

More information

Structural Equa+on Models: The General Case. STA431: Spring 2013

Structural Equa+on Models: The General Case. STA431: Spring 2013 Structural Equa+on Models: The General Case STA431: Spring 2013 An Extension of Mul+ple Regression More than one regression- like equa+on Includes latent variables Variables can be explanatory in one equa+on

More information

T- test recap. Week 7. One- sample t- test. One- sample t- test 5/13/12. t = x " µ s x. One- sample t- test Paired t- test Independent samples t- test

T- test recap. Week 7. One- sample t- test. One- sample t- test 5/13/12. t = x  µ s x. One- sample t- test Paired t- test Independent samples t- test T- test recap Week 7 One- sample t- test Paired t- test Independent samples t- test T- test review Addi5onal tests of significance: correla5ons, qualita5ve data In each case, we re looking to see whether

More information

An Introduc+on to Sta+s+cs and Machine Learning for Quan+ta+ve Biology. Anirvan Sengupta Dept. of Physics and Astronomy Rutgers University

An Introduc+on to Sta+s+cs and Machine Learning for Quan+ta+ve Biology. Anirvan Sengupta Dept. of Physics and Astronomy Rutgers University An Introduc+on to Sta+s+cs and Machine Learning for Quan+ta+ve Biology Anirvan Sengupta Dept. of Physics and Astronomy Rutgers University Why Do We Care? Necessity in today s labs Principled approach:

More information

Simple Linear Regression

Simple Linear Regression Simple Linear Regression ST 430/514 Recall: A regression model describes how a dependent variable (or response) Y is affected, on average, by one or more independent variables (or factors, or covariates)

More information

Analysis of Variance and Co-variance. By Manza Ramesh

Analysis of Variance and Co-variance. By Manza Ramesh Analysis of Variance and Co-variance By Manza Ramesh Contents Analysis of Variance (ANOVA) What is ANOVA? The Basic Principle of ANOVA ANOVA Technique Setting up Analysis of Variance Table Short-cut Method

More information

Least Square Es?ma?on, Filtering, and Predic?on: ECE 5/639 Sta?s?cal Signal Processing II: Linear Es?ma?on

Least Square Es?ma?on, Filtering, and Predic?on: ECE 5/639 Sta?s?cal Signal Processing II: Linear Es?ma?on Least Square Es?ma?on, Filtering, and Predic?on: Sta?s?cal Signal Processing II: Linear Es?ma?on Eric Wan, Ph.D. Fall 2015 1 Mo?va?ons If the second-order sta?s?cs are known, the op?mum es?mator is given

More information

Common models and contrasts. Tuesday, Lecture 2 Jeane5e Mumford University of Wisconsin - Madison

Common models and contrasts. Tuesday, Lecture 2 Jeane5e Mumford University of Wisconsin - Madison ommon models and contrasts Tuesday, Lecture Jeane5e Mumford University of Wisconsin - Madison Let s set up some simple models 1-sample t-test -sample t-test With contrasts! 1-sample t-test Y = 0 + 1-sample

More information

Sample sta*s*cs and linear regression. NEU 466M Instructor: Professor Ila R. Fiete Spring 2016

Sample sta*s*cs and linear regression. NEU 466M Instructor: Professor Ila R. Fiete Spring 2016 Sample sta*s*cs and linear regression NEU 466M Instructor: Professor Ila R. Fiete Spring 2016 Mean {x 1,,x N } N samples of variable x hxi 1 N NX i=1 x i sample mean mean(x) other notation: x Binned version

More information

Experimental Designs for Planning Efficient Accelerated Life Tests

Experimental Designs for Planning Efficient Accelerated Life Tests Experimental Designs for Planning Efficient Accelerated Life Tests Kangwon Seo and Rong Pan School of Compu@ng, Informa@cs, and Decision Systems Engineering Arizona State University ASTR 2015, Sep 9-11,

More information

Descriptive Statistics. Population. Sample

Descriptive Statistics. Population. Sample Sta$s$cs Data (sing., datum) observa$ons (such as measurements, counts, survey responses) that have been collected. Sta$s$cs a collec$on of methods for planning experiments, obtaining data, and then then

More information

Valida&on of Predic&ve Classifiers

Valida&on of Predic&ve Classifiers Valida&on of Predic&ve Classifiers 1! Predic&ve Biomarker Classifiers In most posi&ve clinical trials, only a small propor&on of the eligible popula&on benefits from the new rx Many chronic diseases are

More information

PHYS1121 and MECHANICS

PHYS1121 and MECHANICS PHYS1121 and 1131 - MECHANICS Lecturer weeks 1-6: John Webb, Dept of Astrophysics, School of Physics Multimedia tutorials www.physclips.unsw.edu.au Where can I find the lecture slides? There will be a

More information

11 Correlation and Regression

11 Correlation and Regression Chapter 11 Correlation and Regression August 21, 2017 1 11 Correlation and Regression When comparing two variables, sometimes one variable (the explanatory variable) can be used to help predict the value

More information

Calculate the volume of the sphere. Give your answer correct to two decimal places. (3)

Calculate the volume of the sphere. Give your answer correct to two decimal places. (3) 1. Let m = 6.0 10 3 and n = 2.4 10 5. Express each of the following in the form a 10 k, where 1 a < 10 and k. mn; m. n (Total 4 marks) 2. The volume of a sphere is V =, where S is its surface area. 36π

More information

STAT 501 EXAM I NAME Spring 1999

STAT 501 EXAM I NAME Spring 1999 STAT 501 EXAM I NAME Spring 1999 Instructions: You may use only your calculator and the attached tables and formula sheet. You can detach the tables and formula sheet from the rest of this exam. Show your

More information

Friday, March 15, 13. Mul$ple Regression

Friday, March 15, 13. Mul$ple Regression Mul$ple Regression Mul$ple Regression I have a hypothesis about the effect of X on Y. Why might we need addi$onal variables? Confounding variables Condi$onal independence Reduce/eliminate bias in es$mates

More information

Numerical Methods in Physics

Numerical Methods in Physics Numerical Methods in Physics Numerische Methoden in der Physik, 515.421. Instructor: Ass. Prof. Dr. Lilia Boeri Room: PH 03 090 Tel: +43-316- 873 8191 Email Address: l.boeri@tugraz.at Room: TDK Seminarraum

More information

Analysis of Variance (ANOVA)

Analysis of Variance (ANOVA) Analysis of Variance (ANOVA) Used for comparing or more means an extension of the t test Independent Variable (factor) = categorical (qualita5ve) predictor should have at least levels, but can have many

More information

ES-2 Lecture: More Least-squares Fitting. Spring 2017

ES-2 Lecture: More Least-squares Fitting. Spring 2017 ES-2 Lecture: More Least-squares Fitting Spring 2017 Outline Quick review of least-squares line fitting (also called `linear regression ) How can we find the best-fit line? (Brute-force method is not efficient)

More information

Bias/variance tradeoff, Model assessment and selec+on

Bias/variance tradeoff, Model assessment and selec+on Applied induc+ve learning Bias/variance tradeoff, Model assessment and selec+on Pierre Geurts Department of Electrical Engineering and Computer Science University of Liège October 29, 2012 1 Supervised

More information

Introduc)on to RNA- Seq Data Analysis. Dr. Benilton S Carvalho Department of Medical Gene)cs Faculty of Medical Sciences State University of Campinas

Introduc)on to RNA- Seq Data Analysis. Dr. Benilton S Carvalho Department of Medical Gene)cs Faculty of Medical Sciences State University of Campinas Introduc)on to RNA- Seq Data Analysis Dr. Benilton S Carvalho Department of Medical Gene)cs Faculty of Medical Sciences State University of Campinas Material: hep://)ny.cc/rnaseq Slides: hep://)ny.cc/slidesrnaseq

More information

LECTURE 6. Introduction to Econometrics. Hypothesis testing & Goodness of fit

LECTURE 6. Introduction to Econometrics. Hypothesis testing & Goodness of fit LECTURE 6 Introduction to Econometrics Hypothesis testing & Goodness of fit October 25, 2016 1 / 23 ON TODAY S LECTURE We will explain how multiple hypotheses are tested in a regression model We will define

More information

CS 6140: Machine Learning Spring What We Learned Last Week 2/26/16

CS 6140: Machine Learning Spring What We Learned Last Week 2/26/16 Logis@cs CS 6140: Machine Learning Spring 2016 Instructor: Lu Wang College of Computer and Informa@on Science Northeastern University Webpage: www.ccs.neu.edu/home/luwang Email: luwang@ccs.neu.edu Sign

More information

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages: Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the

More information

Classifica(on and predic(on omics style. Dr Nicola Armstrong Mathema(cs and Sta(s(cs Murdoch University

Classifica(on and predic(on omics style. Dr Nicola Armstrong Mathema(cs and Sta(s(cs Murdoch University Classifica(on and predic(on omics style Dr Nicola Armstrong Mathema(cs and Sta(s(cs Murdoch University Classifica(on Learning Set Data with known classes Prediction Classification rule Data with unknown

More information

Correlation Analysis

Correlation Analysis Simple Regression Correlation Analysis Correlation analysis is used to measure strength of the association (linear relationship) between two variables Correlation is only concerned with strength of the

More information

This module focuses on the logic of ANOVA with special attention given to variance components and the relationship between ANOVA and regression.

This module focuses on the logic of ANOVA with special attention given to variance components and the relationship between ANOVA and regression. WISE ANOVA and Regression Lab Introduction to the WISE Correlation/Regression and ANOVA Applet This module focuses on the logic of ANOVA with special attention given to variance components and the relationship

More information

ME 432 Fundamentals of Modern Photovoltaics. Discussion 15: Semiconductor Carrier Sta?s?cs 3 October 2018

ME 432 Fundamentals of Modern Photovoltaics. Discussion 15: Semiconductor Carrier Sta?s?cs 3 October 2018 ME 432 Fundamentals of Modern Photovoltaics Discussion 15: Semiconductor Carrier Sta?s?cs 3 October 2018 Fundamental concepts underlying PV conversion input solar spectrum light absorp?on carrier excita?on

More information

Networks. Can (John) Bruce Keck Founda7on Biotechnology Lab Bioinforma7cs Resource

Networks. Can (John) Bruce Keck Founda7on Biotechnology Lab Bioinforma7cs Resource Networks Can (John) Bruce Keck Founda7on Biotechnology Lab Bioinforma7cs Resource Networks in biology Protein-Protein Interaction Network of Yeast Transcriptional regulatory network of E.coli Experimental

More information

Introduc)on to the Design and Analysis of Experiments. Violet R. Syro)uk School of Compu)ng, Informa)cs, and Decision Systems Engineering

Introduc)on to the Design and Analysis of Experiments. Violet R. Syro)uk School of Compu)ng, Informa)cs, and Decision Systems Engineering Introduc)on to the Design and Analysis of Experiments Violet R. Syro)uk School of Compu)ng, Informa)cs, and Decision Systems Engineering 1 Complex Engineered Systems What makes an engineered system complex?

More information

Inferences for Regression

Inferences for Regression Inferences for Regression An Example: Body Fat and Waist Size Looking at the relationship between % body fat and waist size (in inches). Here is a scatterplot of our data set: Remembering Regression In

More information

Computer Vision. Pa0ern Recogni4on Concepts Part I. Luis F. Teixeira MAP- i 2012/13

Computer Vision. Pa0ern Recogni4on Concepts Part I. Luis F. Teixeira MAP- i 2012/13 Computer Vision Pa0ern Recogni4on Concepts Part I Luis F. Teixeira MAP- i 2012/13 What is it? Pa0ern Recogni4on Many defini4ons in the literature The assignment of a physical object or event to one of

More information

Sta$s$cal sequence recogni$on

Sta$s$cal sequence recogni$on Sta$s$cal sequence recogni$on Determinis$c sequence recogni$on Last $me, temporal integra$on of local distances via DP Integrates local matches over $me Normalizes $me varia$ons For cts speech, segments

More information

Machine Learning and Data Mining. Bayes Classifiers. Prof. Alexander Ihler

Machine Learning and Data Mining. Bayes Classifiers. Prof. Alexander Ihler + Machine Learning and Data Mining Bayes Classifiers Prof. Alexander Ihler A basic classifier Training data D={x (i),y (i) }, Classifier f(x ; D) Discrete feature vector x f(x ; D) is a con@ngency table

More information

Predicate abstrac,on and interpola,on. Many pictures and examples are borrowed from The So'ware Model Checker BLAST presenta,on.

Predicate abstrac,on and interpola,on. Many pictures and examples are borrowed from The So'ware Model Checker BLAST presenta,on. Predicate abstrac,on and interpola,on Many pictures and examples are borrowed from The So'ware Model Checker BLAST presenta,on. Outline. Predicate abstrac,on the idea in pictures 2. Counter- example guided

More information

Categorical Predictor Variables

Categorical Predictor Variables Categorical Predictor Variables We often wish to use categorical (or qualitative) variables as covariates in a regression model. For binary variables (taking on only 2 values, e.g. sex), it is relatively

More information

Sociology 301. Hypothesis Testing + t-test for Comparing Means. Hypothesis Testing. Hypothesis Testing. Liying Luo 04.14

Sociology 301. Hypothesis Testing + t-test for Comparing Means. Hypothesis Testing. Hypothesis Testing. Liying Luo 04.14 Sociology 301 Hypothesis Testing + t-test for Comparing Means Liying Luo 04.14 Hypothesis Testing 5. State a technical decision and a substan;ve conclusion Hypothesis Testing A random sample of 100 UD

More information

Pseudospectral Methods For Op2mal Control. Jus2n Ruths March 27, 2009

Pseudospectral Methods For Op2mal Control. Jus2n Ruths March 27, 2009 Pseudospectral Methods For Op2mal Control Jus2n Ruths March 27, 2009 Introduc2on Pseudospectral methods arose to find solu2ons to Par2al Differen2al Equa2ons Recently adapted for Op2mal Control Key Ideas

More information

Exponen'al growth and differen'al equa'ons

Exponen'al growth and differen'al equa'ons Exponen'al growth and differen'al equa'ons But first.. Thanks for the feedback! Feedback about M102 Which of the following do you find useful? 70 60 50 40 30 20 10 0 How many resources students typically

More information

Basics of Experimental Design. Review of Statistics. Basic Study. Experimental Design. When an Experiment is Not Possible. Studying Relations

Basics of Experimental Design. Review of Statistics. Basic Study. Experimental Design. When an Experiment is Not Possible. Studying Relations Basics of Experimental Design Review of Statistics And Experimental Design Scientists study relation between variables In the context of experiments these variables are called independent and dependent

More information

(ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box.

(ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box. FINAL EXAM ** Two different ways to submit your answer sheet (i) Use MS-Word and place it in a drop-box. (ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box. Deadline: December

More information

Chapter 11. Correlation and Regression

Chapter 11. Correlation and Regression Chapter 11. Correlation and Regression The word correlation is used in everyday life to denote some form of association. We might say that we have noticed a correlation between foggy days and attacks of

More information

Quebec Adipose and Lifestyle Inves?ga?on in Youth (QUALITY) cohort: youth with a family history of obesity.

Quebec Adipose and Lifestyle Inves?ga?on in Youth (QUALITY) cohort: youth with a family history of obesity. A longitudinal study of the effect of physical ac4vity and cardiorespiratory fitness on insulin and glucose homeostasis in a cohort of children with a family history of obesity Protocol for PhD work by

More information

Introduction to Statistical Genetics (BST227) Lecture 6: Population Substructure in Association Studies

Introduction to Statistical Genetics (BST227) Lecture 6: Population Substructure in Association Studies Introduction to Statistical Genetics (BST227) Lecture 6: Population Substructure in Association Studies Confounding in gene+c associa+on studies q What is it? q What is the effect? q How to detect it?

More information

Simple linear regression

Simple linear regression Simple linear regression Prof. Giuseppe Verlato Unit of Epidemiology & Medical Statistics, Dept. of Diagnostics & Public Health, University of Verona Statistics with two variables two nominal variables:

More information

Diagnosing the Role of MHD Turbulence in Massive Star Forma:on

Diagnosing the Role of MHD Turbulence in Massive Star Forma:on Diagnosing the Role of MHD Turbulence in Massive Star Forma:on Blakesley Burkhart Einstein Fellow Harvard- Smithsonian Center for Astrophysics With Min Young- Lee, Alex Lazarian, David Collins, Jonathan

More information

Review of Statistics

Review of Statistics Review of Statistics Topics Descriptive Statistics Mean, Variance Probability Union event, joint event Random Variables Discrete and Continuous Distributions, Moments Two Random Variables Covariance and

More information

Regression Analysis. Table Relationship between muscle contractile force (mj) and stimulus intensity (mv).

Regression Analysis. Table Relationship between muscle contractile force (mj) and stimulus intensity (mv). Regression Analysis Two variables may be related in such a way that the magnitude of one, the dependent variable, is assumed to be a function of the magnitude of the second, the independent variable; however,

More information

Six Sigma Black Belt Study Guides

Six Sigma Black Belt Study Guides Six Sigma Black Belt Study Guides 1 www.pmtutor.org Powered by POeT Solvers Limited. Analyze Correlation and Regression Analysis 2 www.pmtutor.org Powered by POeT Solvers Limited. Variables and relationships

More information

Decision Trees Lecture 12

Decision Trees Lecture 12 Decision Trees Lecture 12 David Sontag New York University Slides adapted from Luke Zettlemoyer, Carlos Guestrin, and Andrew Moore Machine Learning in the ER Physician documentation Triage Information

More information

General linear model: basic

General linear model: basic General linear model: basic Introducing General Linear Model (GLM): Start with an example Proper>es of the BOLD signal Linear Time Invariant (LTI) system The hemodynamic response func>on (Briefly) Evalua>ng

More information

Differen'al Privacy with Bounded Priors: Reconciling U+lity and Privacy in Genome- Wide Associa+on Studies

Differen'al Privacy with Bounded Priors: Reconciling U+lity and Privacy in Genome- Wide Associa+on Studies Differen'al Privacy with Bounded Priors: Reconciling U+lity and Privacy in Genome- Wide Associa+on Studies Florian Tramèr, Zhicong Huang, Erman Ayday, Jean- Pierre Hubaux ACM CCS 205 Denver, Colorado,

More information

New Developments in East

New Developments in East New Developments in East MAMS: Multi-arm Multi-stage Trials Presented at the Fifth East User Group Meeting March 16, 2016 Cyrus Mehta, Ph.D. President, Cytel Inc Multi-arm Multi-stage Designs Generaliza8on

More information

Mixture Models. Michael Kuhn

Mixture Models. Michael Kuhn Mixture Models Michael Kuhn 2017-8-26 Objec

More information

WISE Regression/Correlation Interactive Lab. Introduction to the WISE Correlation/Regression Applet

WISE Regression/Correlation Interactive Lab. Introduction to the WISE Correlation/Regression Applet WISE Regression/Correlation Interactive Lab Introduction to the WISE Correlation/Regression Applet This tutorial focuses on the logic of regression analysis with special attention given to variance components.

More information

Basic Business Statistics 6 th Edition

Basic Business Statistics 6 th Edition Basic Business Statistics 6 th Edition Chapter 12 Simple Linear Regression Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value of a dependent variable based

More information

LINEAR REGRESSION ANALYSIS. MODULE XVI Lecture Exercises

LINEAR REGRESSION ANALYSIS. MODULE XVI Lecture Exercises LINEAR REGRESSION ANALYSIS MODULE XVI Lecture - 44 Exercises Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Exercise 1 The following data has been obtained on

More information

STA441: Spring Multiple Regression. This slide show is a free open source document. See the last slide for copyright information.

STA441: Spring Multiple Regression. This slide show is a free open source document. See the last slide for copyright information. STA441: Spring 2018 Multiple Regression This slide show is a free open source document. See the last slide for copyright information. 1 Least Squares Plane 2 Statistical MODEL There are p-1 explanatory

More information

9 Correlation and Regression

9 Correlation and Regression 9 Correlation and Regression SW, Chapter 12. Suppose we select n = 10 persons from the population of college seniors who plan to take the MCAT exam. Each takes the test, is coached, and then retakes the

More information

Founda'ons- The Basic Concepts of Science

Founda'ons- The Basic Concepts of Science Founda'ons- The Basic Concepts of Science The Scien'fic Method The Scien'fic Method As part of the founda/on of your house of biology knowledge, you need to understand the process of how scien/sts arrive

More information

Sample solutions. Stat 8051 Homework 8

Sample solutions. Stat 8051 Homework 8 Sample solutions Stat 8051 Homework 8 Problem 1: Faraway Exercise 3.1 A plot of the time series reveals kind of a fluctuating pattern: Trying to fit poisson regression models yields a quadratic model if

More information

Applied Time Series Analysis FISH 507. Eric Ward Mark Scheuerell Eli Holmes

Applied Time Series Analysis FISH 507. Eric Ward Mark Scheuerell Eli Holmes Applied Time Series Analysis FISH 507 Eric Ward Mark Scheuerell Eli Holmes Introduc;ons Who are we? Who & why you re here? What are you looking to get from this class? Days and Times Lectures When: Tues

More information

Business Statistics. Lecture 10: Correlation and Linear Regression

Business Statistics. Lecture 10: Correlation and Linear Regression Business Statistics Lecture 10: Correlation and Linear Regression Scatterplot A scatterplot shows the relationship between two quantitative variables measured on the same individuals. It displays the Form

More information

Econometrics. 4) Statistical inference

Econometrics. 4) Statistical inference 30C00200 Econometrics 4) Statistical inference Timo Kuosmanen Professor, Ph.D. http://nomepre.net/index.php/timokuosmanen Today s topics Confidence intervals of parameter estimates Student s t-distribution

More information

LAB 5 INSTRUCTIONS LINEAR REGRESSION AND CORRELATION

LAB 5 INSTRUCTIONS LINEAR REGRESSION AND CORRELATION LAB 5 INSTRUCTIONS LINEAR REGRESSION AND CORRELATION In this lab you will learn how to use Excel to display the relationship between two quantitative variables, measure the strength and direction of the

More information

Scatter plot of data from the study. Linear Regression

Scatter plot of data from the study. Linear Regression 1 2 Linear Regression Scatter plot of data from the study. Consider a study to relate birthweight to the estriol level of pregnant women. The data is below. i Weight (g / 100) i Weight (g / 100) 1 7 25

More information

From differen+al equa+ons to trigonometric func+ons. Introducing sine and cosine

From differen+al equa+ons to trigonometric func+ons. Introducing sine and cosine From differen+al equa+ons to trigonometric func+ons Introducing sine and cosine Calendar OSH due this Friday by 12:30 in MX 1111 Last quiz next Wedn Office hrs: Today: 11:30-12:30 OSH due by 12:30 Math

More information

Submit a written or typed note with- 1.Name 2. Group number 3. Question number 4. Short reason/explanation stating the reason for re-grading.

Submit a written or typed note with- 1.Name 2. Group number 3. Question number 4. Short reason/explanation stating the reason for re-grading. Exam 2 rebuttals due Tuesday 04.12.16 Submit a written or typed note with- 1.Name 2. Group number 3. Question number 4. Short reason/explanation stating the reason for re-grading. - Homework 7 will be

More information