SALES AND MARKETING Department MATHEMATICS. 2nd Semester. Bivariate statistics LESSONS

Size: px
Start display at page:

Download "SALES AND MARKETING Department MATHEMATICS. 2nd Semester. Bivariate statistics LESSONS"

Transcription

1 SALES AND MARKETING Departmet MATHEMATICS d Semester Bivariate statistics LESSONS Olie documet: sectio DUT Maths S. IUT de Sait-Etiee Départemet TC J.F.Ferraris Math S StatVar Lessos Rev08 page / 4

2 TABLE OF CONTENTS LESSONS 3 Itroductio, vocabulary 3 - Aims 3 - Formattig 3-3 Scatter plot 4 Chi-square idepedece testig 5 3 Fittig: Mayer s method ad movig meas 6 3- Movig meas 6 3- Purpose of liear fittig Mayer s method 7 4 Liear fittig: least square method 8 4- Parameters of a bivariate series 8 4- Least square method Liear correlatio coefficiet 0 5 No-liear fittig: variable chage 6 Statistical predictio 3 6- Poit estimate 3 6- Cofidece iterval 3 IUT de Sait-Etiee Départemet TC J.F.Ferraris Math S StatVar Lessos Rev08 page / 4

3 LESSONS Itroductio, vocabulary. Aims Two characters will be simultaeously studied o each idividual of a -sized populatio, creatig two variables (lists of values) X ad Y. Aims : * study the relatioship betwee both characters: their correlatio; * model this correlatio by a mathematical fuctio: regressio; * use this model to perform a predictio, with a associated cofidece level; * test the hypothesis that X ad Y are ot related.. Formattig From oe idividual (# i), a observatio will be writte dow as a ordered pair of values (x i ; y i ). There are two possible ways to display the data series, followig the situatio: * data series give i lists e.g.: relatioship betwee the quatity of spread fertilizer ad the harvested productio fertilizer harvest plot # X (kg.ha - ) Y (q.ha - ) example of a time series: aual advertisig expese of a compay X : year Y : expese * series frequecies: cotigecy table e.g.: relatioship betwee age ad visual acuity, data collected from 00 people X : age Y : acuity 3/ / / IUT de Sait-Etiee Départemet TC J.F.Ferraris Math S StatVar Lessos Rev08 page 3 / 4

4 .3 Scatter plot Every statistical data series ca be displayed o a graph by a poits cloud, each variable takig place o its ow axis. * series i lists: a pair (x i ; y i ) correspods to oe idividual ad to oe poit. secod example i the previous page: year : 006) * series with cotigecy: a pair (x i ; y i ) mostly correspods to more tha oe idividual (freq ) ad to a object whose size is a icreasig fuctio of the associated frequecy. third example i the previous page: acuity age IUT de Sait-Etiee Départemet TC J.F.Ferraris Math S StatVar Lessos Rev08 page 4 / 4

5 Chi-square idepedece testig The goal of a statistical testig is to decide whether we ca afford to reject (or whether we ca t) a give hypothesis made o a populatio, by the aalysis of a sample. Firstly, this hypothesis is worded ad is amed "ull hypothesis", H 0. If the coclusio/decisio of the test is a rejectio of H 0, the there is automatically a risk to be wrog, whose probability is amed "sigificace level" of the test, oted α. The special case of a Chi-square idepedece testig: Here, a survey is crossig two variables (e.g. i the ext tutorial: geder ad behaviour towards tobacco), whose level of depedece has to be estimated i a populatio, oly aalysig the distributio of people got from a sample. I case of idepedece, the ideal distributio cosists of a proportio cotigecy table: people s aswers are supposed to be distributed i proportio to the existig subtotals (margi frequecies). The test aims to compare the observed frequecies (obs) to these theoretical proportioal frequecies (th) associated to perfect idepedece, to extract a value, "χ²" (proouce Chi-square), umberig a "distace betwee observatios ad perfect idepedece" foud i the studied sample, ad fially to decide whether this gap is large or ot. Methodology: observatios are coducted : idividuals are evaluated o two variables X ad Y. The variable X shows as results r differet values, ad Y shows k differet values. The ull hypothesis H 0 is by covetio: the variables are idepedet. Le test compares reality to what would perfect idepedece have show. We ca reject H 0 i case the set of observatios is «too far» from the theoretical distributio.. Calculatio of the observed χ² * table of observatios o idividuals Y Y Y k total X X obs obs obs k total X X obs obs obs k total X X r obs r obs r obs rk total X r total Y total Y total Y total Y k * table of the theoretical distributio (idepedece) This secod table is built from the first, takig back every subtotal, the calculatig each frequecy i proportio to these subtotals ad to the geeral total. * calculatio of χ² calc (global differece betwee obs ad th): χ² calc = table ( obs th) th. Rejectio area The χ² variable expresses the ifiity of the possible values that could be obtaied from ay possible sample compared to idepedece. This variable is distributed i probability, by a law of the same ame, settled by its umber of degrees of freedom (dof). dof = (r- )(k - ) To each possible χ² value ([0; [) correspods a probability "α" that a sample - extracted from a ideal idepedece populatio would exceed it. I a exercise, i case α is give, we the read the value of χ² lim. 3. Compariso ad decisio If χ² calc > χ² lim, the we re allowed to reject H 0 (the idepedece), with a risk α to be wrog. IUT de Sait-Etiee Départemet TC J.F.Ferraris Math S StatVar Lessos Rev08 page 5 / 4

6 3 Fittig: Mayer s method ad movig meas 3. Movig meas Movig meas are mostly used with time series, the variable X represetig time ad the variable Y a value that evolves i time. Whe Y values show large oscillatios through time, a global upward or dowward tred is hard to detect. Movig meas are there to help us givig a aswer, by smoothig these oscillatios. Methodology: * Cosolidate sets of successive Y values, always of the same umber (e.g.: take two values by two or three by three, or four by four, etc.); * The ext set cosists of the previous oe, i which the first value of Y is removed ad the ext oe is joied (slippery sets); * The average value of Y is calculated i each set (providig a list of movig meas), same for the average value of X (providig a average locatio i time for each set); * The correspodig poits may be plotted. e.g.: X (trimesters) Y (thousads of tourists) Let s create the list of the 4 4 movig meas: X Y This ew list of values (doubled by its graph) suggests a very slight dowward tred. ote: * the first movig mea is the mea of the values #,, 3 ad 4. (34)/4 =.5 for X ad (58336)/4 = 3.5 for Y * the secod movig mea is the mea of the values #, 3, 4 ad 5. (345)/4 = 3.5 for X ad (33660)/4 = 3.75 for Y * ad so o IUT de Sait-Etiee Départemet TC J.F.Ferraris Math S StatVar Lessos Rev08 page 6 / 4

7 3. Purpose of liear fittig A poit cloud may show a lik betwee both variables if its poits appear ot to be scattered at radom. I some cases, this cloud's shape may be elogated, quite thi, with a "directioal axis" quite right showig a tedecy... Ca we fid a axis, a straight lie which "follows" the whole cloud "to the best"? Let s say this lie has already bee draw (D) : y = ax b. To a give value x i are associated the value y i (ordiate of the poit M i i the cloud) ad the value y ˆi = ax i b (o the lie). y i yˆi defiitio: we ame residue the umber e i = y i y ˆi The residue of a poit M i is the positive if this poit is above the lie ad egative i the opposite situatio. x i Hece, we aim to fid the lie that «miimises to the best» the residues, the lie that passes through the cloud as close as possible to the poits. This way, we perform a liear fittig, or liear regressio. Oce doe, this object is called fittig lie, tred lie or regressio lie of the series. 3.3 Mayer s method Some residues are positive, some egative. Mayer's assumptio is that the "best" lie is the oe that leads to a zero sum of residues (the egative residues offset the positive oes). defiitio: we ame Mayer s priciple the goal mathematical aalysis: e = y ax b = y a x b ( ) i i i i i e i = 0 This sum is zero iff yi a xi b = 0 iff y ax b = 0 That is to say: to obtai a cacellatio of the sum of residues, it is ecessary ad sufficiet that the G x, y. This property is't sufficiet i itself to make a straight lie cotai the midpoit of the cloud, ( ) Mayer's lie uique, sice the oly obligatio is to ow oe give poit. There are a ifiite umber of straight lies makig a zero sum of residues! Mayer s method: * Divide the cloud ito two subclouds: Both subclouds must cotai the same umber of poits: / if is eve, or ()/ o oe side ad (-)/ o the other side if is odd. The abscissas x i the first subcloud must all be less tha the abscissas x i the secod oe; * Calculate the coordiates of G ad G, mea poits (midpoits) of both subclouds; * Determie (if asked) the expressio of the lie (G G ), Mayer s lie that will be chose; draw it ote: It s bee proved that the mea poit of the whole cloud, G, belogs to the lie (G G ) i ay case, ad the that the latter meets Mayer s priciple. IUT de Sait-Etiee Départemet TC J.F.Ferraris Math S StatVar Lessos Rev08 page 7 / 4

8 4 Liear fittig: least square method 4. Parameters of a bivariate series 4.. The mea of X or of Y are: x = x = 4.. r x i ad y = i xi ad y = k y j = i y j without cotigecy (data series i lists see p.3 examples ad ); j with cotigecy (frequecies gathered ito a crossed table p.3 ex 3). The special poit G( x, y ) is amed mea poit or midpoit of the cloud. The variace of X ad the oe of Y are easily accessible (maual calculatios) by Koeig s theorem: r x i V ( X ) = x ad V( Y ) V ( X ) r i xi = x ad V( Y ) r yi = y r i yi = y without cotigecy; with cotigecy. The stadard deviatios are still the square roots of the variaces We ame covariace of the pair (X,Y) the umber : Cov( X Y ), = ( x x )( y y ) This is a «commo variace» betwee both variables, which is ecessary to aalyze their correlatio. Koeig s theorem gives a easier way to calculate the covariace: Cov ( ) xi yi X, Y = x y (without cotigecy) ad Cov(, ) r i k ij i i j= X Y = x y i x y. (with) 4..4 Usig the calculator: The meas ad stadard deviatios are give directly, i Stat mode. Ufortuately, the calculator gives either the variaces or the covariace. IUT de Sait-Etiee Départemet TC J.F.Ferraris Math S StatVar Lessos Rev08 page 8 / 4

9 4. Least square method The idea of this method is to square each residue, the to add these squares, ad fially to say that the "best" lie is the oe that miimizes this sum (obtai the smallest possible sum, cosiderig the ifiite umber of possible lies). defiitio: We ame least square priciple the oe that cosists i fidig a lie leadig to ei is miimum withi the cloud (Gauss) mathematical aalysis: we set P( a, b) = ( y ax b) There are two differet ways to expad it: i i : bivariate polyomial. (, ) = (( ) ) = ( ) ( ) P a b y ax b b b y ax y ax () i i i i i i d degree triomial, with respect to b; (, ) = ( ( i ) i ) = i ( i i i ) ( i ) P a b y b ax a x a x y b x y b () d degree triomial with respect to a. I this cotext, we ca cotiue like this: * cosider a as a costat ad b as a variable. P(a,b) () is miimum whe its derivative (/b) is zero (its st coefficiet,, is o-egative), which leads to b = y ax * cosider this latest value of b, ad a as a variable. P(a,b) () is miimum whe its derivative (/a) is xi yi x. y ( X, Y ) zero, which leads to a Cov = = x ( X ) i x V Calculus amateurs may try to fid back these results! ote: such a value of b implies that the regressio lie ows the mea poit of the cloud, G. This method coducts to a uique lie ad so is mostly employed. least square method: * Calculate the coefficiets ( X, Y ) ( X ) Cov a = ad b = y ax (you ca get them o your calculator!) V * Write the expressio of the Y o X regressio lie D Y/X : y = ax b IUT de Sait-Etiee Départemet TC J.F.Ferraris Math S StatVar Lessos Rev08 page 9 / 4

10 4.3 Liear correlatio coefficiet A scatterplot shows a more or less strog lik betwee two variables X ad Y, sometimes displayig a elogated ad almost right cloud: i this case, a liear model is relevat. The purpose of the liear correlatio coefficiet is to evaluate the stregth of a liear lik, by a umber. liear correlatio coefficiet betwee X ad Y : (, ) ( X ) σ ( Y ) Cov X Y r = σ It s bee stated that, whatever the data series, - r (the capital R or the Greek letter ρ - «rhô», are sometimes used for this coefficiet) O the calculator: A calculator geerally writes it r if it metios it! (it depeds o the model). Therefore, we will calculate it by ourselves (which implies calculatig the covariace first...). Iterpretatio of its value: The strogest the liear correlatio is (cloud lookig like a straight lie), the closest to is r. "positive correlatio" : r is positive whe Y overall icreases with X "egative correlatio" : r is egative whe Y overall decreases as X icreases 0 r 0.5 : weak liear correlatio, iappropriate liear model. 0.5 r 0.75 : mea liear correlatio, o-appropriate liear model r 0.95 : tolerable liear correlatio, the liear model may ot be the best oe r : strog liear correlatio, the liear model is oe of the most appropriate. Commets: * are X ad Y really liked? If r is close to (or -), the poits are close to be colliear. Nevertheless, that does't always mea that X ad Y are cocretely related. E.g.: i Frace, from 974 to 98, the weddig rate decreased ad i the meatime the GDP (Frech : PIB) icreased, so that the scatter plot usig both data sets is quasi-liear (fourth graph below). The liear correlatio is mathematically very strog, but facts ad studies show there's o cause to effect relatioship betwee both variables! (after 98, the followig poits are ot at all colliear with the previous oes ay more). * liear correlatio r oly shows a liear lik. A correlatio betwee X ad Y may be very strog, but ot i a liear way (curved). I that case, r is far from ad -, ad the study has to be expaded (see IV). IUT de Sait-Etiee Départemet TC J.F.Ferraris Math S StatVar Lessos Rev08 page 0 / 4

11 Examples: icome ( ) vs. duratio i a compay success rate vs. % of disadvataged SPC r = duratio r = uit margi ( /u) vs. quatity weddig rate through time r = quatity (thousads) r = IUT de Sait-Etiee Départemet TC J.F.Ferraris Math S StatVar Lessos Rev08 page / 4

12 5 No-liear fittig: the variable chage A variable chage may be performed if the poits seem to follow a curve i particular. The fuctio to cosider will always be defied by the directios of a exercise. It may be: * a logarithm or expoetial fuctio * a polyomial fuctio * a trigoometric fuctio * Oe of the variables X or Y (or both!) has to be replaced each by a ew oe, oted T for istace, followig a give formula that allows its calculatio startig from the former. e.g.: X Y As Y seems to vary as X squared, plus 5, we ca defie the variable chage T = X ². We have to build the followig table, ito which T shall replace X : T Y * We perform a liear regressio of the pair (T, Y),observig their order. e.g.: Here, the questio is to determie the expressio of their fittig lie, y = at b. If we are told to use the least square method, the coefficiets a ad b will be give by the calculator: y =.056 t * Fially, we ca deduce the expressio of a curve, fittig the o-liear relatioship betwee X ad Y, just by writig the variable chage agai; we may draw this curve, if we re told to. e.g.: Sice y =.056 t 3.856, we get: y =.056 x² (expressio of a parabola) IUT de Sait-Etiee Départemet TC J.F.Ferraris Math S StatVar Lessos Rev08 page / 4

13 6 Statistical predictio 6. Poit estimate The fittig straight lie (obtaied with or without a variable chage) makes it possible, through its expressio, to estimate a value of the variable Y o choosig a uexplored value of the variable X (geerally greater tha those collected i the geuie series). I this case, if X represets time, it is possible to make a forecast to the future. e.g.: let s set the expressio of a fittig lie: y = 0.85x. a. Poit estimate of y with x 0 = 0. y 0 = = b. Poit estimate of x with y 0 = 39. x 0 = (39 )/0.85 = Cofidece iterval We ought to step back, cosiderig the poit estimate: accordig to the oise (dispersio) of the poit cloud, it is more or less trustable it gives us a more or less precise predictio. Here, the ew idea is to give a estimate by a rage (iterval), aroud the poit estimate, rather tha a sigle value, ad to be able to associate a probability (cofidece level) for the ukow reality to be iside such a rage. Rates method (uses a liear model, estimates y from x):. For each value x i of the iitial data set: * calculate the values y' i followig the expressio of the regressio lie * calculate the rates z i = y i / y' i * calculate the mea ad stadard deviatio of the variable Z. Z is cosidered as distributed by a ormal law. Cosequetly: z.96σ ; z.96σ 95 % of Z values take place iside the iterval [ Z Z ] 99 % of Z values take place iside the iterval [ z.58σ ; z.58σ ] 3. Calculate the poit estimate y' 0, associated to the ew give value x 0, thaks to the fittig lie. Now, we ca predict the uexplored possible values y 0 by a iterval, as follows: There are 95% chaces that y 0 would be i y 0 ( z.96σ Z ) ; y 0 ( z.96σ Z ) There are 99% chaces that y 0 would be i y 0 ( z.58σ Z ) ; y 0 ( z.58σ Z ) commets: * this method is efficiet oly for r > 0 (o-egative correlatio) * the probability (95%, 99%, etc.) is amed cofidece level of the predictio. Its complemet (5%, %, etc.) is amed sigificace level. * The size of such a iterval is related to the ucertaity of the aswer. It icreases whe:. the cofidece level icreases,. r decreases,. the distace betwee x 0 ad the abscissas x i of the poit cloud icreases. Z Z IUT de Sait-Etiee Départemet TC J.F.Ferraris Math S StatVar Lessos Rev08 page 3 / 4

14 IUT TC MATHEMATICS FORM FOR BIVARIATE STATISTICS IUT de Sait-Etiee Départemet TC J.F.Ferraris Math S StatVar Lessos Rev08 page 4 / 4

1 Inferential Methods for Correlation and Regression Analysis

1 Inferential Methods for Correlation and Regression Analysis 1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet

More information

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering CEE 5 Autum 005 Ucertaity Cocepts for Geotechical Egieerig Basic Termiology Set A set is a collectio of (mutually exclusive) objects or evets. The sample space is the (collectively exhaustive) collectio

More information

ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 1 MATH00030 SEMESTER / Statistics

ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 1 MATH00030 SEMESTER / Statistics ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 1 MATH00030 SEMESTER 1 018/019 DR. ANTHONY BROWN 8. Statistics 8.1. Measures of Cetre: Mea, Media ad Mode. If we have a series of umbers the

More information

IUT of Saint-Etienne Sales and Marketing department Mr Ferraris Prom /12/2015

IUT of Saint-Etienne Sales and Marketing department Mr Ferraris Prom /12/2015 IUT of Sait-Etiee Sales ad Marketig departmet Mr Ferraris Prom 2014-2016 09/12/2015 MATHEMATICS 3 rd semester, Test 2 legth : 2 hours coefficiet 2/3 The graphic calculator is allowed. Ay persoal sheet

More information

11 Correlation and Regression

11 Correlation and Regression 11 Correlatio Regressio 11.1 Multivariate Data Ofte we look at data where several variables are recorded for the same idividuals or samplig uits. For example, at a coastal weather statio, we might record

More information

Response Variable denoted by y it is the variable that is to be predicted measure of the outcome of an experiment also called the dependent variable

Response Variable denoted by y it is the variable that is to be predicted measure of the outcome of an experiment also called the dependent variable Statistics Chapter 4 Correlatio ad Regressio If we have two (or more) variables we are usually iterested i the relatioship betwee the variables. Associatio betwee Variables Two variables are associated

More information

ST 305: Exam 3 ( ) = P(A)P(B A) ( ) = P(A) + P(B) ( ) = 1 P( A) ( ) = P(A) P(B) σ X 2 = σ a+bx. σ ˆp. σ X +Y. σ X Y. σ X. σ Y. σ n.

ST 305: Exam 3 ( ) = P(A)P(B A) ( ) = P(A) + P(B) ( ) = 1 P( A) ( ) = P(A) P(B) σ X 2 = σ a+bx. σ ˆp. σ X +Y. σ X Y. σ X. σ Y. σ n. ST 305: Exam 3 By hadig i this completed exam, I state that I have either give or received assistace from aother perso durig the exam period. I have used o resources other tha the exam itself ad the basic

More information

3/3/2014. CDS M Phil Econometrics. Types of Relationships. Types of Relationships. Types of Relationships. Vijayamohanan Pillai N.

3/3/2014. CDS M Phil Econometrics. Types of Relationships. Types of Relationships. Types of Relationships. Vijayamohanan Pillai N. 3/3/04 CDS M Phil Old Least Squares (OLS) Vijayamohaa Pillai N CDS M Phil Vijayamoha CDS M Phil Vijayamoha Types of Relatioships Oly oe idepedet variable, Relatioship betwee ad is Liear relatioships Curviliear

More information

Random Variables, Sampling and Estimation

Random Variables, Sampling and Estimation Chapter 1 Radom Variables, Samplig ad Estimatio 1.1 Itroductio This chapter will cover the most importat basic statistical theory you eed i order to uderstad the ecoometric material that will be comig

More information

Properties and Hypothesis Testing

Properties and Hypothesis Testing Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.

More information

Mathematical Notation Math Introduction to Applied Statistics

Mathematical Notation Math Introduction to Applied Statistics Mathematical Notatio Math 113 - Itroductio to Applied Statistics Name : Use Word or WordPerfect to recreate the followig documets. Each article is worth 10 poits ad ca be prited ad give to the istructor

More information

Polynomial Functions and Their Graphs

Polynomial Functions and Their Graphs Polyomial Fuctios ad Their Graphs I this sectio we begi the study of fuctios defied by polyomial expressios. Polyomial ad ratioal fuctios are the most commo fuctios used to model data, ad are used extesively

More information

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA, 016 MODULE : Statistical Iferece Time allowed: Three hours Cadidates should aswer FIVE questios. All questios carry equal marks. The umber

More information

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. XI-1 (1074) MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. R. E. D. WOOLSEY AND H. S. SWANSON XI-2 (1075) STATISTICAL DECISION MAKING Advaced

More information

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen)

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen) Goodess-of-Fit Tests ad Categorical Data Aalysis (Devore Chapter Fourtee) MATH-252-01: Probability ad Statistics II Sprig 2019 Cotets 1 Chi-Squared Tests with Kow Probabilities 1 1.1 Chi-Squared Testig................

More information

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised Questio 1. (Topics 1-3) A populatio cosists of all the members of a group about which you wat to draw a coclusio (Greek letters (μ, σ, Ν) are used) A sample is the portio of the populatio selected for

More information

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample. Statistical Iferece (Chapter 10) Statistical iferece = lear about a populatio based o the iformatio provided by a sample. Populatio: The set of all values of a radom variable X of iterest. Characterized

More information

Expectation and Variance of a random variable

Expectation and Variance of a random variable Chapter 11 Expectatio ad Variace of a radom variable The aim of this lecture is to defie ad itroduce mathematical Expectatio ad variace of a fuctio of discrete & cotiuous radom variables ad the distributio

More information

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara Poit Estimator Eco 325 Notes o Poit Estimator ad Cofidece Iterval 1 By Hiro Kasahara Parameter, Estimator, ad Estimate The ormal probability desity fuctio is fully characterized by two costats: populatio

More information

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING Lectures MODULE 5 STATISTICS II. Mea ad stadard error of sample data. Biomial distributio. Normal distributio 4. Samplig 5. Cofidece itervals

More information

ECON 3150/4150, Spring term Lecture 3

ECON 3150/4150, Spring term Lecture 3 Itroductio Fidig the best fit by regressio Residuals ad R-sq Regressio ad causality Summary ad ext step ECON 3150/4150, Sprig term 2014. Lecture 3 Ragar Nymoe Uiversity of Oslo 21 Jauary 2014 1 / 30 Itroductio

More information

REGRESSION (Physics 1210 Notes, Partial Modified Appendix A)

REGRESSION (Physics 1210 Notes, Partial Modified Appendix A) REGRESSION (Physics 0 Notes, Partial Modified Appedix A) HOW TO PERFORM A LINEAR REGRESSION Cosider the followig data poits ad their graph (Table I ad Figure ): X Y 0 3 5 3 7 4 9 5 Table : Example Data

More information

Quadratic Functions. Before we start looking at polynomials, we should know some common terminology.

Quadratic Functions. Before we start looking at polynomials, we should know some common terminology. Quadratic Fuctios I this sectio we begi the study of fuctios defied by polyomial expressios. Polyomial ad ratioal fuctios are the most commo fuctios used to model data, ad are used extesively i mathematical

More information

Statistics 511 Additional Materials

Statistics 511 Additional Materials Cofidece Itervals o mu Statistics 511 Additioal Materials This topic officially moves us from probability to statistics. We begi to discuss makig ifereces about the populatio. Oe way to differetiate probability

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2016 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

A statistical method to determine sample size to estimate characteristic value of soil parameters

A statistical method to determine sample size to estimate characteristic value of soil parameters A statistical method to determie sample size to estimate characteristic value of soil parameters Y. Hojo, B. Setiawa 2 ad M. Suzuki 3 Abstract Sample size is a importat factor to be cosidered i determiig

More information

Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y.

Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y. Testig Statistical Hypotheses Recall the study where we estimated the differece betwee mea systolic blood pressure levels of users of oral cotraceptives ad o-users, x - y. Such studies are sometimes viewed

More information

Problem Set 4 Due Oct, 12

Problem Set 4 Due Oct, 12 EE226: Radom Processes i Systems Lecturer: Jea C. Walrad Problem Set 4 Due Oct, 12 Fall 06 GSI: Assae Gueye This problem set essetially reviews detectio theory ad hypothesis testig ad some basic otios

More information

Topic 18: Composite Hypotheses

Topic 18: Composite Hypotheses Toc 18: November, 211 Simple hypotheses limit us to a decisio betwee oe of two possible states of ature. This limitatio does ot allow us, uder the procedures of hypothesis testig to address the basic questio:

More information

Correlation. Two variables: Which test? Relationship Between Two Numerical Variables. Two variables: Which test? Contingency table Grouped bar graph

Correlation. Two variables: Which test? Relationship Between Two Numerical Variables. Two variables: Which test? Contingency table Grouped bar graph Correlatio Y Two variables: Which test? X Explaatory variable Respose variable Categorical Numerical Categorical Cotigecy table Cotigecy Logistic Grouped bar graph aalysis regressio Mosaic plot Numerical

More information

Linear Regression Analysis. Analysis of paired data and using a given value of one variable to predict the value of the other

Linear Regression Analysis. Analysis of paired data and using a given value of one variable to predict the value of the other Liear Regressio Aalysis Aalysis of paired data ad usig a give value of oe variable to predict the value of the other 5 5 15 15 1 1 5 5 1 3 4 5 6 7 8 1 3 4 5 6 7 8 Liear Regressio Aalysis E: The chirp rate

More information

Final Examination Solutions 17/6/2010

Final Examination Solutions 17/6/2010 The Islamic Uiversity of Gaza Faculty of Commerce epartmet of Ecoomics ad Political Scieces A Itroductio to Statistics Course (ECOE 30) Sprig Semester 009-00 Fial Eamiatio Solutios 7/6/00 Name: I: Istructor:

More information

Linear Regression Models

Linear Regression Models Liear Regressio Models Dr. Joh Mellor-Crummey Departmet of Computer Sciece Rice Uiversity johmc@cs.rice.edu COMP 528 Lecture 9 15 February 2005 Goals for Today Uderstad how to Use scatter diagrams to ispect

More information

Stat 139 Homework 7 Solutions, Fall 2015

Stat 139 Homework 7 Solutions, Fall 2015 Stat 139 Homework 7 Solutios, Fall 2015 Problem 1. I class we leared that the classical simple liear regressio model assumes the followig distributio of resposes: Y i = β 0 + β 1 X i + ɛ i, i = 1,...,,

More information

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics Explorig Data: Distributios Look for overall patter (shape, ceter, spread) ad deviatios (outliers). Mea (use a calculator): x = x 1 + x 2 + +

More information

Worksheet 23 ( ) Introduction to Simple Linear Regression (continued)

Worksheet 23 ( ) Introduction to Simple Linear Regression (continued) Worksheet 3 ( 11.5-11.8) Itroductio to Simple Liear Regressio (cotiued) This worksheet is a cotiuatio of Discussio Sheet 3; please complete that discussio sheet first if you have ot already doe so. This

More information

Statistical inference: example 1. Inferential Statistics

Statistical inference: example 1. Inferential Statistics Statistical iferece: example 1 Iferetial Statistics POPULATION SAMPLE A clothig store chai regularly buys from a supplier large quatities of a certai piece of clothig. Each item ca be classified either

More information

INSTRUCTIONS (A) 1.22 (B) 0.74 (C) 4.93 (D) 1.18 (E) 2.43

INSTRUCTIONS (A) 1.22 (B) 0.74 (C) 4.93 (D) 1.18 (E) 2.43 PAPER NO.: 444, 445 PAGE NO.: Page 1 of 1 INSTRUCTIONS I. You have bee provided with: a) the examiatio paper i two parts (PART A ad PART B), b) a multiple choice aswer sheet (for PART A), c) selected formulae

More information

Chapter If n is odd, the median is the exact middle number If n is even, the median is the average of the two middle numbers

Chapter If n is odd, the median is the exact middle number If n is even, the median is the average of the two middle numbers Chapter 4 4-1 orth Seattle Commuity College BUS10 Busiess Statistics Chapter 4 Descriptive Statistics Summary Defiitios Cetral tedecy: The extet to which the data values group aroud a cetral value. Variatio:

More information

Correlation and Covariance

Correlation and Covariance Correlatio ad Covariace Tom Ilveto FREC 9 What is Next? Correlatio ad Regressio Regressio We specify a depedet variable as a liear fuctio of oe or more idepedet variables, based o co-variace Regressio

More information

10-701/ Machine Learning Mid-term Exam Solution

10-701/ Machine Learning Mid-term Exam Solution 0-70/5-78 Machie Learig Mid-term Exam Solutio Your Name: Your Adrew ID: True or False (Give oe setece explaatio) (20%). (F) For a cotiuous radom variable x ad its probability distributio fuctio p(x), it

More information

(all terms are scalars).the minimization is clearer in sum notation:

(all terms are scalars).the minimization is clearer in sum notation: 7 Multiple liear regressio: with predictors) Depedet data set: y i i = 1, oe predictad, predictors x i,k i = 1,, k = 1, ' The forecast equatio is ŷ i = b + Use matrix otatio: k =1 b k x ik Y = y 1 y 1

More information

Linear Regression Demystified

Linear Regression Demystified Liear Regressio Demystified Liear regressio is a importat subject i statistics. I elemetary statistics courses, formulae related to liear regressio are ofte stated without derivatio. This ote iteds to

More information

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions Chapter 9 Slide Ifereces from Two Samples 9- Overview 9- Ifereces about Two Proportios 9- Ifereces about Two Meas: Idepedet Samples 9-4 Ifereces about Matched Pairs 9-5 Comparig Variatio i Two Samples

More information

Table 12.1: Contingency table. Feature b. 1 N 11 N 12 N 1b 2 N 21 N 22 N 2b. ... a N a1 N a2 N ab

Table 12.1: Contingency table. Feature b. 1 N 11 N 12 N 1b 2 N 21 N 22 N 2b. ... a N a1 N a2 N ab Sectio 12 Tests of idepedece ad homogeeity I this lecture we will cosider a situatio whe our observatios are classified by two differet features ad we would like to test if these features are idepedet

More information

Tests of Hypotheses Based on a Single Sample (Devore Chapter Eight)

Tests of Hypotheses Based on a Single Sample (Devore Chapter Eight) Tests of Hypotheses Based o a Sigle Sample Devore Chapter Eight MATH-252-01: Probability ad Statistics II Sprig 2018 Cotets 1 Hypothesis Tests illustrated with z-tests 1 1.1 Overview of Hypothesis Testig..........

More information

Chapter 6 Principles of Data Reduction

Chapter 6 Principles of Data Reduction Chapter 6 for BST 695: Special Topics i Statistical Theory. Kui Zhag, 0 Chapter 6 Priciples of Data Reductio Sectio 6. Itroductio Goal: To summarize or reduce the data X, X,, X to get iformatio about a

More information

Direction: This test is worth 250 points. You are required to complete this test within 50 minutes.

Direction: This test is worth 250 points. You are required to complete this test within 50 minutes. Term Test October 3, 003 Name Math 56 Studet Number Directio: This test is worth 50 poits. You are required to complete this test withi 50 miutes. I order to receive full credit, aswer each problem completely

More information

Section 14. Simple linear regression.

Section 14. Simple linear regression. Sectio 14 Simple liear regressio. Let us look at the cigarette dataset from [1] (available to dowload from joural s website) ad []. The cigarette dataset cotais measuremets of tar, icotie, weight ad carbo

More information

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference EXST30 Backgroud material Page From the textbook The Statistical Sleuth Mea [0]: I your text the word mea deotes a populatio mea (µ) while the work average deotes a sample average ( ). Variace [0]: The

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Final Review. Fall 2013 Prof. Yao Xie, H. Milton Stewart School of Industrial Systems & Engineering Georgia Tech

Final Review. Fall 2013 Prof. Yao Xie, H. Milton Stewart School of Industrial Systems & Engineering Georgia Tech Fial Review Fall 2013 Prof. Yao Xie, yao.xie@isye.gatech.edu H. Milto Stewart School of Idustrial Systems & Egieerig Georgia Tech 1 Radom samplig model radom samples populatio radom samples: x 1,..., x

More information

Zeros of Polynomials

Zeros of Polynomials Math 160 www.timetodare.com 4.5 4.6 Zeros of Polyomials I these sectios we will study polyomials algebraically. Most of our work will be cocered with fidig the solutios of polyomial equatios of ay degree

More information

Summary: CORRELATION & LINEAR REGRESSION. GC. Students are advised to refer to lecture notes for the GC operations to obtain scatter diagram.

Summary: CORRELATION & LINEAR REGRESSION. GC. Students are advised to refer to lecture notes for the GC operations to obtain scatter diagram. Key Cocepts: 1) Sketchig of scatter diagram The scatter diagram of bivariate (i.e. cotaiig two variables) data ca be easily obtaied usig GC. Studets are advised to refer to lecture otes for the GC operatios

More information

Simple Linear Regression

Simple Linear Regression Simple Liear Regressio 1. Model ad Parameter Estimatio (a) Suppose our data cosist of a collectio of pairs (x i, y i ), where x i is a observed value of variable X ad y i is the correspodig observatio

More information

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals 7-1 Chapter 4 Part I. Samplig Distributios ad Cofidece Itervals 1 7- Sectio 1. Samplig Distributio 7-3 Usig Statistics Statistical Iferece: Predict ad forecast values of populatio parameters... Test hypotheses

More information

Common Large/Small Sample Tests 1/55

Common Large/Small Sample Tests 1/55 Commo Large/Small Sample Tests 1/55 Test of Hypothesis for the Mea (σ Kow) Covert sample result ( x) to a z value Hypothesis Tests for µ Cosider the test H :μ = μ H 1 :μ > μ σ Kow (Assume the populatio

More information

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE IN STATISTICS, 017 MODULE 4 : Liear models Time allowed: Oe ad a half hours Cadidates should aswer THREE questios. Each questio carries

More information

Because it tests for differences between multiple pairs of means in one test, it is called an omnibus test.

Because it tests for differences between multiple pairs of means in one test, it is called an omnibus test. Math 308 Sprig 018 Classes 19 ad 0: Aalysis of Variace (ANOVA) Page 1 of 6 Itroductio ANOVA is a statistical procedure for determiig whether three or more sample meas were draw from populatios with equal

More information

Frequentist Inference

Frequentist Inference Frequetist Iferece The topics of the ext three sectios are useful applicatios of the Cetral Limit Theorem. Without kowig aythig about the uderlyig distributio of a sequece of radom variables {X i }, for

More information

Stat 200 -Testing Summary Page 1

Stat 200 -Testing Summary Page 1 Stat 00 -Testig Summary Page 1 Mathematicias are like Frechme; whatever you say to them, they traslate it ito their ow laguage ad forthwith it is somethig etirely differet Goethe 1 Large Sample Cofidece

More information

Stat 319 Theory of Statistics (2) Exercises

Stat 319 Theory of Statistics (2) Exercises Kig Saud Uiversity College of Sciece Statistics ad Operatios Research Departmet Stat 39 Theory of Statistics () Exercises Refereces:. Itroductio to Mathematical Statistics, Sixth Editio, by R. Hogg, J.

More information

Regression, Part I. A) Correlation describes the relationship between two variables, where neither is independent or a predictor.

Regression, Part I. A) Correlation describes the relationship between two variables, where neither is independent or a predictor. Regressio, Part I I. Differece from correlatio. II. Basic idea: A) Correlatio describes the relatioship betwee two variables, where either is idepedet or a predictor. - I correlatio, it would be irrelevat

More information

WHAT IS THE PROBABILITY FUNCTION FOR LARGE TSUNAMI WAVES? ABSTRACT

WHAT IS THE PROBABILITY FUNCTION FOR LARGE TSUNAMI WAVES? ABSTRACT WHAT IS THE PROBABILITY FUNCTION FOR LARGE TSUNAMI WAVES? Harold G. Loomis Hoolulu, HI ABSTRACT Most coastal locatios have few if ay records of tsuami wave heights obtaied over various time periods. Still

More information

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to: STA 2023 Module 10 Comparig Two Proportios Learig Objectives Upo completig this module, you should be able to: 1. Perform large-sample ifereces (hypothesis test ad cofidece itervals) to compare two populatio

More information

Lecture 7: Properties of Random Samples

Lecture 7: Properties of Random Samples Lecture 7: Properties of Radom Samples 1 Cotiued From Last Class Theorem 1.1. Let X 1, X,...X be a radom sample from a populatio with mea µ ad variace σ

More information

Correlation Regression

Correlation Regression Correlatio Regressio While correlatio methods measure the stregth of a liear relatioship betwee two variables, we might wish to go a little further: How much does oe variable chage for a give chage i aother

More information

Ismor Fischer, 1/11/

Ismor Fischer, 1/11/ Ismor Fischer, //04 7.4-7.4 Problems. I Problem 4.4/9, it was show that importat relatios exist betwee populatio meas, variaces, ad covariace. Specifically, we have the formulas that appear below left.

More information

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc. Chapter 22 Comparig Two Proportios Copyright 2010, 2007, 2004 Pearso Educatio, Ic. Comparig Two Proportios Read the first two paragraphs of pg 504. Comparisos betwee two percetages are much more commo

More information

Parameter, Statistic and Random Samples

Parameter, Statistic and Random Samples Parameter, Statistic ad Radom Samples A parameter is a umber that describes the populatio. It is a fixed umber, but i practice we do ot kow its value. A statistic is a fuctio of the sample data, i.e.,

More information

Circle the single best answer for each multiple choice question. Your choice should be made clearly.

Circle the single best answer for each multiple choice question. Your choice should be made clearly. TEST #1 STA 4853 March 6, 2017 Name: Please read the followig directios. DO NOT TURN THE PAGE UNTIL INSTRUCTED TO DO SO Directios This exam is closed book ad closed otes. There are 32 multiple choice questios.

More information

Power and Type II Error

Power and Type II Error Statistical Methods I (EXST 7005) Page 57 Power ad Type II Error Sice we do't actually kow the value of the true mea (or we would't be hypothesizig somethig else), we caot kow i practice the type II error

More information

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01 ENGI 44 Cofidece Itervals (Two Samples) Page -0 Two Sample Cofidece Iterval for a Differece i Populatio Meas [Navidi sectios 5.4-5.7; Devore chapter 9] From the cetral limit theorem, we kow that, for sufficietly

More information

Topic 10: Introduction to Estimation

Topic 10: Introduction to Estimation Topic 0: Itroductio to Estimatio Jue, 0 Itroductio I the simplest possible terms, the goal of estimatio theory is to aswer the questio: What is that umber? What is the legth, the reactio rate, the fractio

More information

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting Lecture 6 Chi Square Distributio (χ ) ad Least Squares Fittig Chi Square Distributio (χ ) Suppose: We have a set of measuremets {x 1, x, x }. We kow the true value of each x i (x t1, x t, x t ). We would

More information

1 Models for Matched Pairs

1 Models for Matched Pairs 1 Models for Matched Pairs Matched pairs occur whe we aalyse samples such that for each measuremet i oe of the samples there is a measuremet i the other sample that directly relates to the measuremet i

More information

Statistics 20: Final Exam Solutions Summer Session 2007

Statistics 20: Final Exam Solutions Summer Session 2007 1. 20 poits Testig for Diabetes. Statistics 20: Fial Exam Solutios Summer Sessio 2007 (a) 3 poits Give estimates for the sesitivity of Test I ad of Test II. Solutio: 156 patiets out of total 223 patiets

More information

Lecture 5: Parametric Hypothesis Testing: Comparing Means. GENOME 560, Spring 2016 Doug Fowler, GS

Lecture 5: Parametric Hypothesis Testing: Comparing Means. GENOME 560, Spring 2016 Doug Fowler, GS Lecture 5: Parametric Hypothesis Testig: Comparig Meas GENOME 560, Sprig 2016 Doug Fowler, GS (dfowler@uw.edu) 1 Review from last week What is a cofidece iterval? 2 Review from last week What is a cofidece

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Aalysis ad Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasii/teachig.html Suhasii Subba Rao Review of testig: Example The admistrator of a ursig home wats to do a time ad motio

More information

Sample Size Determination (Two or More Samples)

Sample Size Determination (Two or More Samples) Sample Sie Determiatio (Two or More Samples) STATGRAPHICS Rev. 963 Summary... Data Iput... Aalysis Summary... 5 Power Curve... 5 Calculatios... 6 Summary This procedure determies a suitable sample sie

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc. Chapter 22 Comparig Two Proportios Copyright 2010 Pearso Educatio, Ic. Comparig Two Proportios Comparisos betwee two percetages are much more commo tha questios about isolated percetages. Ad they are more

More information

3.2 Properties of Division 3.3 Zeros of Polynomials 3.4 Complex and Rational Zeros of Polynomials

3.2 Properties of Division 3.3 Zeros of Polynomials 3.4 Complex and Rational Zeros of Polynomials Math 60 www.timetodare.com 3. Properties of Divisio 3.3 Zeros of Polyomials 3.4 Complex ad Ratioal Zeros of Polyomials I these sectios we will study polyomials algebraically. Most of our work will be cocered

More information

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics Explorig Data: Distributios Look for overall patter (shape, ceter, spread) ad deviatios (outliers). Mea (use a calculator): x = x 1 + x 2 + +

More information

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10 DS 00: Priciples ad Techiques of Data Sciece Date: April 3, 208 Name: Hypothesis Testig Discussio #0. Defie these terms below as they relate to hypothesis testig. a) Data Geeratio Model: Solutio: A set

More information

Linear regression. Daniel Hsu (COMS 4771) (y i x T i β)2 2πσ. 2 2σ 2. 1 n. (x T i β y i ) 2. 1 ˆβ arg min. β R n d

Linear regression. Daniel Hsu (COMS 4771) (y i x T i β)2 2πσ. 2 2σ 2. 1 n. (x T i β y i ) 2. 1 ˆβ arg min. β R n d Liear regressio Daiel Hsu (COMS 477) Maximum likelihood estimatio Oe of the simplest liear regressio models is the followig: (X, Y ),..., (X, Y ), (X, Y ) are iid radom pairs takig values i R d R, ad Y

More information

A quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population

A quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population A quick activity - Cetral Limit Theorem ad Proportios Lecture 21: Testig Proportios Statistics 10 Coli Rudel Flip a coi 30 times this is goig to get loud! Record the umber of heads you obtaied ad calculate

More information

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading Topic 15 - Two Sample Iferece I STAT 511 Professor Bruce Craig Comparig Two Populatios Research ofte ivolves the compariso of two or more samples from differet populatios Graphical summaries provide visual

More information

Chapter 5: Hypothesis testing

Chapter 5: Hypothesis testing Slide 5. Chapter 5: Hypothesis testig Hypothesis testig is about makig decisios Is a hypothesis true or false? Are wome paid less, o average, tha me? Barrow, Statistics for Ecoomics, Accoutig ad Busiess

More information

GG313 GEOLOGICAL DATA ANALYSIS

GG313 GEOLOGICAL DATA ANALYSIS GG313 GEOLOGICAL DATA ANALYSIS 1 Testig Hypothesis GG313 GEOLOGICAL DATA ANALYSIS LECTURE NOTES PAUL WESSEL SECTION TESTING OF HYPOTHESES Much of statistics is cocered with testig hypothesis agaist data

More information

S Y Y = ΣY 2 n. Using the above expressions, the correlation coefficient is. r = SXX S Y Y

S Y Y = ΣY 2 n. Using the above expressions, the correlation coefficient is. r = SXX S Y Y 1 Sociology 405/805 Revised February 4, 004 Summary of Formulae for Bivariate Regressio ad Correlatio Let X be a idepedet variable ad Y a depedet variable, with observatios for each of the values of these

More information

Describing the Relation between Two Variables

Describing the Relation between Two Variables Copyright 010 Pearso Educatio, Ic. Tables ad Formulas for Sulliva, Statistics: Iformed Decisios Usig Data 010 Pearso Educatio, Ic Chapter Orgaizig ad Summarizig Data Relative frequecy = frequecy sum of

More information

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting Lecture 6 Chi Square Distributio (χ ) ad Least Squares Fittig Chi Square Distributio (χ ) Suppose: We have a set of measuremets {x 1, x, x }. We kow the true value of each x i (x t1, x t, x t ). We would

More information

April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE

April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE TERRY SOO Abstract These otes are adapted from whe I taught Math 526 ad meat to give a quick itroductio to cofidece

More information

Regression and Correlation

Regression and Correlation 43 Cotets Regressio ad Correlatio 43.1 Regressio 43. Correlatio 17 Learig outcomes You will lear how to explore relatioships betwee variables ad how to measure the stregth of such relatioships. You should

More information

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

Discrete Mathematics for CS Spring 2008 David Wagner Note 22 CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 22 I.I.D. Radom Variables Estimatig the bias of a coi Questio: We wat to estimate the proportio p of Democrats i the US populatio, by takig

More information

MATH/STAT 352: Lecture 15

MATH/STAT 352: Lecture 15 MATH/STAT 352: Lecture 15 Sectios 5.2 ad 5.3. Large sample CI for a proportio ad small sample CI for a mea. 1 5.2: Cofidece Iterval for a Proportio Estimatig proportio of successes i a biomial experimet

More information

Ma 530 Introduction to Power Series

Ma 530 Introduction to Power Series Ma 530 Itroductio to Power Series Please ote that there is material o power series at Visual Calculus. Some of this material was used as part of the presetatio of the topics that follow. What is a Power

More information

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1. Eco 325/327 Notes o Sample Mea, Sample Proportio, Cetral Limit Theorem, Chi-square Distributio, Studet s t distributio 1 Sample Mea By Hiro Kasahara We cosider a radom sample from a populatio. Defiitio

More information

Analytic Theory of Probabilities

Analytic Theory of Probabilities Aalytic Theory of Probabilities PS Laplace Book II Chapter II, 4 pp 94 03 4 A lottery beig composed of umbered tickets of which r exit at each drawig, oe requires the probability that after i drawigs all

More information