Week 3 Linear Regression I

POL 200B, Spring 2014

Linear regression is the most commonly used statistical technique. A linear regression captures the relationship between two or more phenomena with a straight line. The value of linear regression in the study of social phenomena is a matter of some debate. In one of your background readings for this week, Charles Wheelan refers to regression analysis as the "miracle elixir." Others have been much less enthusiastic. In an article entitled "Econometrics - Alchemy or Science?", economist David Hendry bemoaned the rise of linear regression in his profession, saying: "Econometricians have found their Philosophers' Stone; it is called regression analysis and is used for transforming data into significant results!" [1]

More than thirty years after Hendry's article appeared, linear regression continues to be the workhorse of quantitative analysis in the social sciences. Even if you don't plan on running regressions for the rest of your career, it's probably good to know what the alchemists are up to.

1 What is it?

Let's start with the familiar scatter plot from Red State, Blue State... that you re-created as part of the first coding assignment. On the x-axis is the average income in each state in […]. On the y-axis is the vote share received by George Bush in the 2004 presidential election. As you may recall, the trend is negative: higher income is correlated with lower Bush vote share.

[1] Hendry, David F. "Econometrics - Alchemy or Science?" Economica (1980).

[Figure: scatter plot of Bush vote share (percent_bush, y-axis) against average state income (income, x-axis), with each state labeled by its postal abbreviation.]

We could come up with a line to capture the relationship between income and Republican vote share. Why do we want to make a line, you might ask? Well, imagine you were living in the 1800s and you didn't have a computer. Drawing a line and measuring its slope might seem like a reasonable way of capturing a relationship between two phenomena...or maybe it might not, but it is in any event what people chose to do. The attraction of drawing a line is that it allows you to extrapolate beyond your data. Extrapolating can be very useful, but it has some very obvious risks, as illustrated here by xkcd:

[Figure: xkcd comic]

So we want to make a line. Looking at our Red State, Blue State... data, we could probably imagine drawing a line through the points just by eyeballing it. It might look something like this:

[Figure: the same scatter plot of percent_bush against income, with a downward-sloping straight line drawn through the points (legend: Linear Regression Line).]

Once we have a line, we can measure its slope and find the place where it crosses the y-axis. If you haven't repressed all memories of high school math, you might remember seeing the equation for a straight line written like this:

    y = mx + b

In this equation, m is the slope of the line, and b is the y-intercept. We sometimes call m and b parameters to distinguish them from x and y, which are variables. The equation we get from a linear regression is very similar, but we've changed the symbols used for the parameters and put them in a different order:

    Y = β₀ + βX

In statistics jargon, Y is called the dependent variable, or sometimes the outcome variable. X is called the independent variable, or sometimes the explanatory variable or predictor variable. The y-intercept is now written at the beginning of the equation, and instead of b it is now β₀ ("beta naught"). The y-intercept is often referred to as the constant. The slope is now β, which is sometimes referred to as the regression coefficient (or just "coefficient").

As you can probably guess, eyeballing it is not how statisticians prefer to find the slope and intercept. So how do we find values for β and β₀? We can actually get some valuable insights from thinking about eyeballing it. If we wanted to do an especially good job at eyeballing it, we'd try to get the line as close as possible to as many points as possible. One formal mathematical way to accomplish this goal is to use a technique called least squares, as explained below.

2 Ordinary Least Squares

Since our line doesn't pass through each point, we can measure the vertical distance between each point and the line. This is called the point's residual. A point that has a small residual is very close to the line (a residual of zero means that the point is on the line). A point that has a large residual is farther away from the line. Here is our linear regression from above, showing the residuals for Utah and Massachusetts.

[Figure: scatter plot of percent_bush against income with the linear regression line, and the residuals for Utah and Massachusetts drawn as vertical segments (legend: Linear Regression Line, Residuals).]

The residual for Utah is 0.123. Therefore, in Utah, Bush's vote share was 12.3 percentage points higher than our line would predict for a state with that income. The residual for Massachusetts is -0.059. In Massachusetts, Bush's vote share was 5.9 percentage points lower than our line would predict for a state with that income.

Aside from eyeballing it, how do we find out which line has the smallest possible residuals? One method is to use the least squares technique. This technique is called least squares because we find the line that minimizes the sum of the squared residuals. "Why the squared residuals?" you quite reasonably ask. Well, looking at our two residuals above, we notice that Utah has a positive residual, but Massachusetts has a negative residual. We just want to minimize the magnitude of the residuals, so we square them to make them all positive. If this sounds familiar, that's because we used a similar technique when calculating standard deviations and correlation coefficients. The decision to square the residuals instead of just taking their absolute values is something that made a lot of sense in the pre-computer era, but doesn't really make that much sense now. As you might guess, squaring the residuals gives more weight to outlying points.
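The least squares idea can be sketched in a few lines of Python (used here only for illustration; these notes otherwise use STATA). The five data points are made up and are not the Red State, Blue State data, and the function names are hypothetical. The slope and intercept come from the standard closed-form solution for a one-variable regression, and the last lines check that nudging the line away from that solution makes the sum of squared residuals larger.

```python
# Made-up toy data (NOT the Red State, Blue State data):
# income in units of $10,000, vote share as a proportion.
incomes = [2.5, 3.0, 3.5, 4.0, 4.5]
vote_shares = [0.62, 0.55, 0.52, 0.44, 0.40]


def ols_fit(x, y):
    """Return (intercept, slope) of the line that minimizes the sum of squared residuals."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return intercept, slope


def residuals(x, y, intercept, slope):
    """Observed value minus the value the line predicts, for each point."""
    return [yi - (intercept + slope * xi) for xi, yi in zip(x, y)]


def sum_of_squares(values):
    return sum(v ** 2 for v in values)


beta0, beta = ols_fit(incomes, vote_shares)
ols_res = residuals(incomes, vote_shares, beta0, beta)
ssr_ols = sum_of_squares(ols_res)

# Shift the fitted line upward a little: the fit gets worse.
ssr_nudged = sum_of_squares(residuals(incomes, vote_shares, beta0 + 0.01, beta))

print("intercept:", round(beta0, 3), "slope:", round(beta, 3))
print("sum of squared residuals (least squares line):", ssr_ols)
print("sum of squared residuals (nudged line):", ssr_nudged)
```

Positive and negative residuals cancel out exactly for the least squares line, which is one more reason a plain sum of residuals cannot be used to compare lines.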

To run a regression in STATA you simply use the reg command as follows:

    reg Y-var X-var

So, for our Red State, Blue State... data, the dependent variable (Y) is Bush vote share (percent_bush) and the independent variable (X) is income. Here's what happens if we run the command:

[Figure: STATA regression output]

As usual, STATA includes a whole bunch of information that you don't really need. The value of β is the coefficient of income, which is roughly -0.000011. The value of β₀ is the constant (written as _cons for some reason), which is roughly 0.893.

3 What does it all mean?

So, now we know that we're drawing a straight line and STATA can tell us the values of the slope and the intercept for that line. But what does it mean for β to be roughly -0.000011? Isn't that tiny? It's practically zero! We'll get there, but first let's put the values of the parameters into our model:

    Y = 0.893 - 0.000011X

Okay, now we have a sort of weird-looking equation for a line. Let's remember what each of the pieces refers to:

    Predicted Bush vote share = 89.3% - 0.0011% × Income

Using this equation, we can make a guess about the vote share Bush would get for any value of state income. Notice that we can predict Bush's vote share even for nonsensical incomes. For example, our y-intercept, β₀, is actually the predicted value of Bush's vote share in a hypothetical state where the average income is 0.

In general, β is the average amount we predict Y will increase or decrease if X were increased by 1 unit. Therefore, for every $1 increase in state income, we predict an average decrease in Bush's vote share of roughly 0.0011 percentage points. The differences in income between our states are usually bigger than $1, so if we wanted to make our equation more pliable, we could rewrite β as a fraction with meaningful increments in the independent (X) and dependent (Y) variables. For example, we could rewrite -0.000011 as -0.11 / $10,000, or -11% / $10,000. Consider two states, one with an average income of $30,000 and one with an average income of $40,000. Because the difference in state income is $10,000, we predict that Bush's vote share in the second state would be 11 percentage points lower than in the first.
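The two-state comparison can be checked with a short Python sketch. It assumes the rounded values used in these notes (an intercept of 89.3% and a slope of 11 percentage points per $10,000); the constant and function names are hypothetical, and STATA's unrounded estimates would give slightly different numbers.

```python
BETA_0 = 0.893          # intercept: predicted vote share when income is 0
BETA_PER_10K = -0.11    # slope: change in vote share per $10,000 of income


def predicted_bush_share(income_dollars):
    """Predicted Bush vote share (as a proportion) for a given average state income."""
    return BETA_0 + BETA_PER_10K * (income_dollars / 10_000)


share_30k = predicted_bush_share(30_000)
share_40k = predicted_bush_share(40_000)

# Difference between the two hypothetical states, in percentage points.
gap_points = (share_30k - share_40k) * 100

print("predicted share at $30,000:", round(share_30k, 3))
print("predicted share at $40,000:", round(share_40k, 3))
print("gap in percentage points:", round(gap_points, 1))
```

Calling predicted_bush_share(0) simply returns the intercept, which is the "hypothetical state with an average income of 0" described above.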

4 Presenting the results

In our example, the regression output from STATA looked like this:

[Figure: STATA regression output]

In a paper, it would probably be reported as something like this:

Table 1: Dependent variable: Bush vote share in 2004 (a)

    Variable            Coefficient   (Std. Err.)
    State Income (b)    -0.11**       (0.022)
    Intercept            0.893**      (0.071)

    (a) Results are OLS estimates.
    (b) State income is measured per $10,000.
    Significance levels: * = p < 5%, ** = p < 1%

I want to emphasize that this is not necessarily a good way of presenting information! These tables are not usually much more helpful than the regression output from STATA and exhibit some rather unfortunate trends in the field. So why learn about them? Well, keeping with the theme of this lecture, we're going to figure out what's going on in a table like this because it's good to know what the alchemists are doing (so you can call them on their alchemy!).

So what do we have here? The values we found for β₀ and β are in the left-hand column. I divided the state incomes by $10,000 so that the coefficient on income (β) reflects the change in vote share for a $10,000 increase in income. The intercept (β₀) is also reported. So far this is not too horrible.

I'm sure you noticed my beautiful bedazzled coefficients! Don't the stars make them look important? The table shows significance levels (p-values) using stars. This is a very common practice, but I personally think it's a bad one. Presenting your results with stars next to the significant ones is often a sign of "star gazing." [2]

The table also includes the standard error of each parameter. Like a p-value, the standard error is supposed to give you some sense of the statistical uncertainty associated with the value we've found for the parameter. Unfortunately, both of these measures are quite poorly understood. In fact, you might be wondering what we are so uncertain about. Given our state vote shares and state incomes, the values we get for β and β₀ are completely determined. In a sense, they're facts. The value we have for β or β₀ is no more uncertain than the value of the mean income in our data. All of the measures of statistical uncertainty refer to the uncertainty we should take into account if we want to generalize our results.

In the next section we'll talk about three different ways that statistical uncertainty is measured: standard errors, 95% confidence intervals, and p-values. But first, let's take a stab at defining what is, and what isn't, statistical uncertainty.

5 Statistical uncertainty

In statistics, the word "uncertainty," like its cousin "significance," does not have its normal English meaning. Normally, we think of uncertainty as the result of difficulties we have in figuring out how things really are. Consider the income measure in our Red State, Blue State... example. One source of uncertainty (in the normal English sense) might be that people don't report all of their income, such as money they've made under the table or hidden in an offshore tax shelter. We can probably think of many more sources of uncertainty in our income data, not to mention sources of uncertainty in measuring Bush's vote share (recall the hanging chad fiasco). Statistical uncertainty, however, is not designed to give you information about these real-world sources of uncertainty in your data. In fact, they are completely ignored. Rather, measures of statistical uncertainty are designed to help you identify errors that arise from the fact that you are looking only at a sample.

We'll talk below about three measures of statistical uncertainty, but first let's think a bit about how they apply to the technique we just learned, namely linear regression. The measures of statistical uncertainty we'll discuss are somewhat important if you want to generalize your statistical findings. But what does "generalize" even mean? In this context, it means that you want to take the equation you got from your linear regression and use it to predict vote shares for values of income that are not currently in your data set. For example, using the Red State, Blue State... data, we might want to use the equation we found to make predictions about the 2012 election results. [3]

On the other hand, if we don't care about generalizability, these measures of uncertainty are not actually important and, worse, can be completely misleading! We could just look at the good old standard deviation. Quantitative methodologists often get all up in arms about quantifying uncertainty, but this debate assumes that we are trying to generalize our results to some other population that we don't currently have data about (i.e., that there is something to be uncertain about!). Sometimes, this kind of generalizability is not the point of your quantitative research. If this is the case, though, I'd advise you to steer clear of running linear regressions altogether. You can accomplish a lot just by presenting graphs and describing your data set.

There are cases where generalizability is actually the goal, for example when you want to use survey results gathered by interviewing a sample of the US population to say something about the population as a whole. In this case, statistical uncertainty is a very useful tool. However, even in this case, it can be misleading.

5.1 Standard Errors

Here's our (bad) table:

[2] A great book about statistical significance is The Cult of Statistical Significance: How the Standard Error Costs Us Jobs, Justice, and Lives by Deirdre McCloskey and Stephen Ziliak (University of Michigan Press, 2008). An article-length treatment can be found here: […]

[3] It's pretty obvious that this would not be the best idea. Clearly some things have changed between 2004 and 2012, not to mention the fact that there are totally different presidential candidates! For the standard errors from our 2004 sample to give us accurate information about how we could generalize to 2012, we would have to assume that our 50 states from 2004 are a random sample from some larger population that includes those same 50 states in 2012.

Table 2: Dependent variable: Bush vote share in 2004 (a)

    Variable            Coefficient   (Std. Err.)
    State Income (b)    -0.11**       (0.022)
    Intercept            0.893**      (0.071)

    (a) Results are OLS estimates.
    (b) State income is measured per $10,000.
    Significance levels: * = p < 5%, ** = p < 1%

As is often done, I've put the standard errors in parentheses next to their corresponding coefficients. So the standard error of β (-0.11) is 0.022 and the standard error of β₀ (0.893) is 0.071. A standard error is like a standard deviation, but a standard deviation of what? Well, this all goes back to assuming that your data is a random sample from some larger population. If your data were, in fact, such a sample, you could imagine taking many different samples and then running your regression on each of the samples. [4] Each time you ran the regression you could save the value of the coefficient on income. Then you'd have a giant list of different values you got for β. If β actually has some underlying average value in the population at large, then most of your different βs would be kind of close to that real value because you were taking representative (random) samples. In fact, if you had lots and lots and lots of perfectly random samples and you calculated the β from each one of them, the values of β could be described by a perfect bell curve (or normal distribution) centered on the mean of β. If this were the case, then about 68% of the values would fall within 1 standard deviation of the mean and about 95% of the values would fall within 2 standard deviations of the mean. If you've taken statistics before, you might have seen this infamous chart:

[4] Sort of like conducting many studies and recording the results you got each time.

[Figure: normal distribution curve divided into standard-deviation bands, each labeled with the proportion of values it contains. Graphic from the Wikipedia article "Standard Deviation."]

Note: µ is just another symbol one can use for the mean of a variable (what we called x̄).

What does this chart mean? Just that if the values of your variable β were perfectly normally distributed around a mean of µ, each of the sections on the graph would contain a certain proportion of your βs (the proportion in each section is shown by its label on the chart). For example, the section between µ and 1σ would include the βs that are greater than the mean (µ) and less than the mean plus one standard deviation. In theory, this would be 34.1% of your data. [5]

The standard error is just an estimate of the standard deviation of this hypothetical normal distribution that was reverse-engineered to represent some larger population that your sample could have been randomly drawn from. The information in our data that is used to reverse-engineer the hypothetical population is:

- The dependence between the variables (i.e., the strength of correlation between X and Y).
- The number of observations we have (i.e., if we have 50 states in our data set, the number of observations we have is 50).

Higher dependence will, naturally, lead to a lower standard error. If there is a strong relationship between the variables in our data, and our data is a random sample from this hypothetical population, there's probably a strong relationship between the variables in the hypothetical population. A larger sample size will also, of course, lead to smaller standard errors, since a larger sample will be more representative of the population at large. As a rule of thumb, statisticians say that if the standard error is less than half of the estimate that our equation spits out, we're in pretty good shape. Using the standard error, STATA also calculates a 95% confidence interval.

[5] In reality, unless you had a really huge sample you would probably just get something kind of close to 34.1%.

5.2 95% Confidence Intervals

I didn't show the 95% confidence intervals in the regression table, but here's where they show up in the STATA output:

[Figure: STATA regression output with the 95% confidence interval for the coefficient of income circled]

I've circled the 95% confidence interval for the coefficient of income. The 95% confidence interval is given as a range. In this case, the range goes from […] to […]. Both of these values are calculated directly from the standard error. The upper bound (top) of the 95% confidence interval is just the value of β plus 1.96 times its standard error. The lower bound (bottom) of the 95% confidence interval is just the value of β minus 1.96 times its standard error.
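The repeated-sampling story behind standard errors, and the 1.96 rule for confidence intervals, can be sketched with a small simulation in Python. Everything about the "population" here is an assumption made for illustration: the true line borrows the rounded table values (0.893 and -0.11 per $10,000), the noise level and the income range are invented, and the names are hypothetical. The last two lines apply the 1.96 rule to the table values; STATA's own interval may differ slightly from this arithmetic.

```python
import random
import statistics

random.seed(0)

# Hypothetical population line (values borrowed from the table, per $10,000).
TRUE_BETA0, TRUE_BETA = 0.893, -0.11
NOISE_SD = 0.06      # invented amount of scatter around the line
N_STATES = 50        # observations per sample
N_SAMPLES = 2000     # how many times we "re-run the study"


def ols_slope(x, y):
    """Least squares slope for a one-variable regression."""
    x_bar = sum(x) / len(x)
    y_bar = sum(y) / len(y)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    return sxy / sxx


slopes = []
for _ in range(N_SAMPLES):
    # Draw a fresh random sample and record the slope it produces.
    x = [random.uniform(2.0, 5.0) for _ in range(N_STATES)]
    y = [TRUE_BETA0 + TRUE_BETA * xi + random.gauss(0, NOISE_SD) for xi in x]
    slopes.append(ols_slope(x, y))

mean_slope = statistics.mean(slopes)
sd_slopes = statistics.stdev(slopes)   # the quantity a standard error estimates

# Share of the simulated slopes within 1.96 standard deviations of their mean.
lower = mean_slope - 1.96 * sd_slopes
upper = mean_slope + 1.96 * sd_slopes
share_inside = sum(lower <= s <= upper for s in slopes) / N_SAMPLES

# The 1.96 rule applied to the table: coefficient -0.11, standard error 0.022.
ci_lower = -0.11 - 1.96 * 0.022
ci_upper = -0.11 + 1.96 * 0.022

print("mean of simulated slopes:", round(mean_slope, 4))
print("standard deviation of simulated slopes:", round(sd_slopes, 4))
print("share within 1.96 standard deviations:", share_inside)
print("1.96-rule interval from the table:", round(ci_lower, 3), "to", round(ci_upper, 3))
```

In real research we only ever get one sample, which is why the standard error has to be estimated from that single sample rather than observed directly as it is in this simulation.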

Why 1.96 times the standard error? The 95% confidence interval just gives the values on this distribution such that 95% of the βs would fall in that region (look back up at the normal distribution above if you're confused). By definition, this is between β - 1.96 × (the standard error) and β + 1.96 × (the standard error). [6]

It's interesting to puzzle a little over what the 95% confidence interval corresponds to, especially since it's not quite what the name implies! If you were to re-run your regression over and over using new random samples, and calculate a 95% confidence interval each time, then about 95% of those intervals would contain the real value of β. This is a little different from simply saying that we are 95% sure that the real value of β is in the particular interval given.

Related to the 95% confidence interval is the p-value. We first talked about p-values when discussing group comparisons. There, p-values told us how likely we would be to observe a difference at least as large as the one in our sample containing two groups (like men and women) if the difference were actually zero in the larger population. In the regression context, the p-value is slightly different.

5.3 P-values in Regression

A p-value is a loose way of testing how good our line is at making predictions for a larger population, one about which we don't have information. Supposing that β (or β₀) in the overall population were actually zero, the p-value tells us the probability of getting an estimate at least as far from zero as the one we found in our sample. The actual calculations for p-values are fairly involved (and not particularly important), but fortunately STATA will give them to you when you run the reg command. As in the group comparison context, this measure is a fairly loose one.

6 Multiple Regression

So far we've only considered linear regression with one dependent variable (Y) and one independent variable (X). Using the same framework, we can include more independent variables. Adding more variables is often referred to as "controlling" for them. This can be a bit misleading. I think that the best way to understand multiple regression is through a simple example.

[6] Most of the time you could just use 2, but 1.96 is technically exactly where 95% would be in a very, very large sample.

Using Gallup poll data from […], I added a new variable to the Red State, Blue State... data: the percentage of state population who identified as Catholic. [7] I call this variable percent_catholic. If we make a scatter plot of Bush's vote share vs. the percent Catholic in each state, we see the following familiar pattern:

[Figure: scatter plot of Bush vote share (percent_bush, y-axis) against percent Catholic (percent_catholic, x-axis), with each state labeled by its postal abbreviation.]

As the percentage of Catholics in a state increases, the percentage of the vote share received by Bush in that state decreases. If we run a regression where our dependent variable (Y) is Bush's 2004 vote share and our independent variable (X) is the percent Catholic, we can again describe it using the following equation:

    Y = β₀ + βX

We get the following results for β₀ and β from STATA:

[7] Gallup published estimates of the percent of the population identifying as Protestant, Other Christian, Catholic, Mormon, Jewish, or None for 48 states (Alaska and Hawaii are not included). The results are based on telephone interviews with 62,744 randomly selected national adults, aged 18 and older, conducted in Gallup Polls between 2000 and […]. For details see: […]

[Figure: STATA regression output]

The value of β₀ is 0.649, meaning that in a hypothetical state which had 0 percent Catholics we would predict that Bush would get 64.9% of the vote. The value of β is roughly -0.5. We can re-write this as the following ratio:

    -5% Bush vote / 10% Catholic

Just like before, we can translate this into an equation:

    Predicted Bush vote share = 64.9% - (5% Bush vote / 10% Catholic) × Percent Catholic

What will happen if we add income? If we do a regression using both percent Catholic and income, we have two pieces of information for each state. We know that on their own each of these explanatory variables (Xs) can be used to predict vote share. Including both pieces of information should give us a better prediction.

Would our prediction necessarily be better? Well, this depends on the degree to which percent Catholic and average income are actually giving us different information. For example, if percent Catholic is strongly related to average income and average income is strongly related to vote share, then maybe including both is just like including average income twice. (This may be unintuitive to think about at first; don't worry.) Say we modified our regression equation to have two explanatory variables:

    Y = β₀ + β₁X₁ + β₂X₂
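The point about overlapping information can be sketched with synthetic data in Python. None of the numbers below come from the Red State, Blue State or Gallup data: the coefficients, noise levels, and variable names are all invented for illustration. The sketch fits a two-variable regression using the standard closed-form solution and measures how much the sum of squared residuals shrinks when a second variable is added, once when that variable nearly duplicates income and once when it carries genuinely new information.

```python
import random

random.seed(1)


def mean(values):
    return sum(values) / len(values)


def cross(a, b):
    """Sum of products of deviations from the means of a and b."""
    a_bar, b_bar = mean(a), mean(b)
    return sum((ai - a_bar) * (bi - b_bar) for ai, bi in zip(a, b))


def ssr_one(x, y):
    """Sum of squared residuals from regressing y on x alone."""
    slope = cross(x, y) / cross(x, x)
    intercept = mean(y) - slope * mean(x)
    return sum((yi - intercept - slope * xi) ** 2 for xi, yi in zip(x, y))


def ssr_two(x1, x2, y):
    """Sum of squared residuals from regressing y on x1 and x2 together."""
    s11, s22, s12 = cross(x1, x1), cross(x2, x2), cross(x1, x2)
    s1y, s2y = cross(x1, y), cross(x2, y)
    det = s11 * s22 - s12 ** 2
    b1 = (s22 * s1y - s12 * s2y) / det
    b2 = (s11 * s2y - s12 * s1y) / det
    b0 = mean(y) - b1 * mean(x1) - b2 * mean(x2)
    return sum((yi - b0 - b1 * u - b2 * v) ** 2 for u, v, yi in zip(x1, x2, y))


n = 48
income = [random.uniform(2.0, 5.0) for _ in range(n)]   # invented, in $10,000s
noise = [random.gauss(0, 0.03) for _ in range(n)]

# Case A: the second variable is almost a copy of income (no new information).
x2_copy = [xi + random.gauss(0, 0.05) for xi in income]
y_a = [0.9 - 0.11 * xi + e for xi, e in zip(income, noise)]
gain_a = 1 - ssr_two(income, x2_copy, y_a) / ssr_one(income, y_a)

# Case B: the second variable is unrelated to income and really affects Y.
x2_new = [random.uniform(0.05, 0.5) for _ in range(n)]
y_b = [0.9 - 0.11 * xi - 0.5 * zi + e for xi, zi, e in zip(income, x2_new, noise)]
gain_b = 1 - ssr_two(income, x2_new, y_b) / ssr_one(income, y_b)

print("share of squared residuals removed, near-duplicate variable:", round(gain_a, 3))
print("share of squared residuals removed, genuinely new variable:", round(gain_b, 3))
```

Adding a variable can never make the sum of squared residuals larger, so the question is only how much it shrinks; a near-duplicate shrinks it very little.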

Now X₁ is percent Catholic and X₂ is average income. They each have their own coefficient, β₁ and β₂. If we run the regression in STATA we get the following results:

[Figure: STATA regression output]

We'll talk more about this next week!

Appendix: Linear Regression Therapy?

It occurred to me as I was writing these notes that "regression" is an English word that doesn't seem particularly related to the statistical technique that bears its name. Any X-Files aficionado knows that hypnotic regression therapy is the technique special agent Fox Mulder uses to help people recover repressed memories of alien encounters! Here, and in general outside of statistics, regression means a return to a former state or condition. How did it come to be the name of the most common statistical technique?

Remember our old friend Sir Francis Galton, the naturalist, cousin of Charles Darwin, and sweet pea enthusiast? Galton developed linear regression to figure out how traits like intelligence might be passed down from parents to offspring. The kind of regression Galton was thinking about was regression to the mean, or as he called it, "reversion towards mediocrity."

In his research on the relationship between parents' and offspring's heights, Galton found that very tall or very short parents tended to have children who were less extreme in height. Galton assumed that this kind of process also applied to traits like intelligence. When combined with Galton's belief that less intelligent people were more fertile than intelligent people, it was clear to him that, if left unchecked, these forces would lead society towards intellectual mediocrity. Following this (incorrect) logic, Galton claimed that "it would be quite practicable to produce a highly-gifted race of men by judicious marriages during several consecutive generations." [8]

In order to quantify just how much of a regression to the mean one should expect for a particular trait such as height, Galton plotted the heights of parents against the heights of their children. Galton then constructed straight lines that seemed to capture the direction of the trend and calculated their slopes. It is from this checkered past that linear regression gets its name.

The kind of linear regression Galton was doing fell out of fashion, but, as often happens in statistics, the name stuck. Linear regression as we know it today was developed in the 1920s by R.A. Fisher, who combined the work Galton (and Pearson) were doing with the least-squares method developed by the mathematician Carl Friedrich Gauss. Fisher saw linear regression as a way of giving a more definite meaning to the value of a correlation coefficient. As Fisher put it in a paper written in 1925: [9]

    The idea of regression is usually introduced in connection with the theory of correlation, but it is in reality a more general, and, in some respects, a simpler idea, and the regression coefficients are of interest and scientific importance in many classes of data where the correlation coefficient, if used at all, is an artificial concept of no real utility.

[8] For the morbidly curious, you can see Galton's book, Hereditary Genius (1869), here: mugu.com/galton/books/hereditary-genius/.

[9] Fisher, Ronald Aylmer. Statistical Methods for Research Workers. Genesis Publishing.


More information

Multiple Regression Theory 2006 Samuel L. Baker

Multiple Regression Theory 2006 Samuel L. Baker MULTIPLE REGRESSION THEORY 1 Multiple Regression Theory 2006 Samuel L. Baker Multiple regression is regression with two or more independent variables on the right-hand side of the equation. Use multiple

More information

appstats27.notebook April 06, 2017

appstats27.notebook April 06, 2017 Chapter 27 Objective Students will conduct inference on regression and analyze data to write a conclusion. Inferences for Regression An Example: Body Fat and Waist Size pg 634 Our chapter example revolves

More information

HOLLOMAN S AP STATISTICS BVD CHAPTER 08, PAGE 1 OF 11. Figure 1 - Variation in the Response Variable

HOLLOMAN S AP STATISTICS BVD CHAPTER 08, PAGE 1 OF 11. Figure 1 - Variation in the Response Variable Chapter 08: Linear Regression There are lots of ways to model the relationships between variables. It is important that you not think that what we do is the way. There are many paths to the summit We are

More information

Chapter 27 Summary Inferences for Regression

Chapter 27 Summary Inferences for Regression Chapter 7 Summary Inferences for Regression What have we learned? We have now applied inference to regression models. Like in all inference situations, there are conditions that we must check. We can test

More information

Grades 7 & 8, Math Circles 10/11/12 October, Series & Polygonal Numbers

Grades 7 & 8, Math Circles 10/11/12 October, Series & Polygonal Numbers Faculty of Mathematics Waterloo, Ontario N2L G Centre for Education in Mathematics and Computing Introduction Grades 7 & 8, Math Circles 0//2 October, 207 Series & Polygonal Numbers Mathematicians are

More information

where Female = 0 for males, = 1 for females Age is measured in years (22, 23, ) GPA is measured in units on a four-point scale (0, 1.22, 3.45, etc.

where Female = 0 for males, = 1 for females Age is measured in years (22, 23, ) GPA is measured in units on a four-point scale (0, 1.22, 3.45, etc. Notes on regression analysis 1. Basics in regression analysis key concepts (actual implementation is more complicated) A. Collect data B. Plot data on graph, draw a line through the middle of the scatter

More information

Alex s Guide to Word Problems and Linear Equations Following Glencoe Algebra 1

Alex s Guide to Word Problems and Linear Equations Following Glencoe Algebra 1 Alex s Guide to Word Problems and Linear Equations Following Glencoe Algebra 1 What is a linear equation? It sounds fancy, but linear equation means the same thing as a line. In other words, it s an equation

More information

Chapter 8. Linear Regression. Copyright 2010 Pearson Education, Inc.

Chapter 8. Linear Regression. Copyright 2010 Pearson Education, Inc. Chapter 8 Linear Regression Copyright 2010 Pearson Education, Inc. Fat Versus Protein: An Example The following is a scatterplot of total fat versus protein for 30 items on the Burger King menu: Copyright

More information

Linear Regression. Linear Regression. Linear Regression. Did You Mean Association Or Correlation?

Linear Regression. Linear Regression. Linear Regression. Did You Mean Association Or Correlation? Did You Mean Association Or Correlation? AP Statistics Chapter 8 Be careful not to use the word correlation when you really mean association. Often times people will incorrectly use the word correlation

More information

Quadratic Equations Part I

Quadratic Equations Part I Quadratic Equations Part I Before proceeding with this section we should note that the topic of solving quadratic equations will be covered in two sections. This is done for the benefit of those viewing

More information

CHAPTER 1. Introduction

CHAPTER 1. Introduction CHAPTER 1 Introduction A typical Modern Geometry course will focus on some variation of a set of axioms for Euclidean geometry due to Hilbert. At the end of such a course, non-euclidean geometries (always

More information

Last few slides from last time

Last few slides from last time Last few slides from last time Example 3: What is the probability that p will fall in a certain range, given p? Flip a coin 50 times. If the coin is fair (p=0.5), what is the probability of getting an

More information

What is proof? Lesson 1

What is proof? Lesson 1 What is proof? Lesson The topic for this Math Explorer Club is mathematical proof. In this post we will go over what was covered in the first session. The word proof is a normal English word that you might

More information

Hypothesis testing I. - In particular, we are talking about statistical hypotheses. [get everyone s finger length!] n =

Hypothesis testing I. - In particular, we are talking about statistical hypotheses. [get everyone s finger length!] n = Hypothesis testing I I. What is hypothesis testing? [Note we re temporarily bouncing around in the book a lot! Things will settle down again in a week or so] - Exactly what it says. We develop a hypothesis,

More information

Communication Engineering Prof. Surendra Prasad Department of Electrical Engineering Indian Institute of Technology, Delhi

Communication Engineering Prof. Surendra Prasad Department of Electrical Engineering Indian Institute of Technology, Delhi Communication Engineering Prof. Surendra Prasad Department of Electrical Engineering Indian Institute of Technology, Delhi Lecture - 41 Pulse Code Modulation (PCM) So, if you remember we have been talking

More information

Chapter 18. Sampling Distribution Models. Copyright 2010, 2007, 2004 Pearson Education, Inc.

Chapter 18. Sampling Distribution Models. Copyright 2010, 2007, 2004 Pearson Education, Inc. Chapter 18 Sampling Distribution Models Copyright 2010, 2007, 2004 Pearson Education, Inc. Normal Model When we talk about one data value and the Normal model we used the notation: N(μ, σ) Copyright 2010,

More information

STA Why Sampling? Module 6 The Sampling Distributions. Module Objectives

STA Why Sampling? Module 6 The Sampling Distributions. Module Objectives STA 2023 Module 6 The Sampling Distributions Module Objectives In this module, we will learn the following: 1. Define sampling error and explain the need for sampling distributions. 2. Recognize that sampling

More information

Business Statistics. Lecture 9: Simple Regression

Business Statistics. Lecture 9: Simple Regression Business Statistics Lecture 9: Simple Regression 1 On to Model Building! Up to now, class was about descriptive and inferential statistics Numerical and graphical summaries of data Confidence intervals

More information

POL 681 Lecture Notes: Statistical Interactions

POL 681 Lecture Notes: Statistical Interactions POL 681 Lecture Notes: Statistical Interactions 1 Preliminaries To this point, the linear models we have considered have all been interpreted in terms of additive relationships. That is, the relationship

More information

L6: Regression II. JJ Chen. July 2, 2015

L6: Regression II. JJ Chen. July 2, 2015 L6: Regression II JJ Chen July 2, 2015 Today s Plan Review basic inference based on Sample average Difference in sample average Extrapolate the knowledge to sample regression coefficients Standard error,

More information

MA 1125 Lecture 15 - The Standard Normal Distribution. Friday, October 6, Objectives: Introduce the standard normal distribution and table.

MA 1125 Lecture 15 - The Standard Normal Distribution. Friday, October 6, Objectives: Introduce the standard normal distribution and table. MA 1125 Lecture 15 - The Standard Normal Distribution Friday, October 6, 2017. Objectives: Introduce the standard normal distribution and table. 1. The Standard Normal Distribution We ve been looking at

More information

Ordinary Least Squares Linear Regression

Ordinary Least Squares Linear Regression Ordinary Least Squares Linear Regression Ryan P. Adams COS 324 Elements of Machine Learning Princeton University Linear regression is one of the simplest and most fundamental modeling ideas in statistics

More information

Annual Performance Report: State Assessment Data

Annual Performance Report: State Assessment Data Annual Performance Report: 2005-2006 State Assessment Data Summary Prepared by: Martha Thurlow, Jason Altman, Damien Cormier, and Ross Moen National Center on Educational Outcomes (NCEO) April, 2008 The

More information

Summary of Natural Hazard Statistics for 2008 in the United States

Summary of Natural Hazard Statistics for 2008 in the United States Summary of Natural Hazard Statistics for 2008 in the United States This National Weather Service (NWS) report summarizes fatalities, injuries and damages caused by severe weather in 2008. The NWS Office

More information

A Re-Introduction to General Linear Models (GLM)

A Re-Introduction to General Linear Models (GLM) A Re-Introduction to General Linear Models (GLM) Today s Class: You do know the GLM Estimation (where the numbers in the output come from): From least squares to restricted maximum likelihood (REML) Reviewing

More information

Relationships between variables. Visualizing Bivariate Distributions: Scatter Plots

Relationships between variables. Visualizing Bivariate Distributions: Scatter Plots SFBS Course Notes Part 7: Correlation Bivariate relationships (p. 1) Linear transformations (p. 3) Pearson r : Measuring a relationship (p. 5) Interpretation of correlations (p. 10) Relationships between

More information

Simple Regression Model. January 24, 2011

Simple Regression Model. January 24, 2011 Simple Regression Model January 24, 2011 Outline Descriptive Analysis Causal Estimation Forecasting Regression Model We are actually going to derive the linear regression model in 3 very different ways

More information

Combinatorics. Problem: How to count without counting.

Combinatorics. Problem: How to count without counting. Combinatorics Problem: How to count without counting. I How do you figure out how many things there are with a certain property without actually enumerating all of them. Sometimes this requires a lot of

More information

Section 3: Simple Linear Regression

Section 3: Simple Linear Regression Section 3: Simple Linear Regression Carlos M. Carvalho The University of Texas at Austin McCombs School of Business http://faculty.mccombs.utexas.edu/carlos.carvalho/teaching/ 1 Regression: General Introduction

More information

At the start of the term, we saw the following formula for computing the sum of the first n integers:

At the start of the term, we saw the following formula for computing the sum of the first n integers: Chapter 11 Induction This chapter covers mathematical induction. 11.1 Introduction to induction At the start of the term, we saw the following formula for computing the sum of the first n integers: Claim

More information

We're in interested in Pr{three sixes when throwing a single dice 8 times}. => Y has a binomial distribution, or in official notation, Y ~ BIN(n,p).

We're in interested in Pr{three sixes when throwing a single dice 8 times}. => Y has a binomial distribution, or in official notation, Y ~ BIN(n,p). Sampling distributions and estimation. 1) A brief review of distributions: We're in interested in Pr{three sixes when throwing a single dice 8 times}. => Y has a binomial distribution, or in official notation,

More information

Lecture 5. 1 Review (Pairwise Independence and Derandomization)

Lecture 5. 1 Review (Pairwise Independence and Derandomization) 6.842 Randomness and Computation September 20, 2017 Lecture 5 Lecturer: Ronitt Rubinfeld Scribe: Tom Kolokotrones 1 Review (Pairwise Independence and Derandomization) As we discussed last time, we can

More information

Guide to Proofs on Sets

Guide to Proofs on Sets CS103 Winter 2019 Guide to Proofs on Sets Cynthia Lee Keith Schwarz I would argue that if you have a single guiding principle for how to mathematically reason about sets, it would be this one: All sets

More information

CHAPTER 8: MATRICES and DETERMINANTS

CHAPTER 8: MATRICES and DETERMINANTS (Section 8.1: Matrices and Determinants) 8.01 CHAPTER 8: MATRICES and DETERMINANTS The material in this chapter will be covered in your Linear Algebra class (Math 254 at Mesa). SECTION 8.1: MATRICES and

More information

Chapter 8. Linear Regression. The Linear Model. Fat Versus Protein: An Example. The Linear Model (cont.) Residuals

Chapter 8. Linear Regression. The Linear Model. Fat Versus Protein: An Example. The Linear Model (cont.) Residuals Chapter 8 Linear Regression Copyright 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide 8-1 Copyright 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Fat Versus

More information

Sociology Exam 2 Answer Key March 30, 2012

Sociology Exam 2 Answer Key March 30, 2012 Sociology 63993 Exam 2 Answer Key March 30, 2012 I. True-False. (20 points) Indicate whether the following statements are true or false. If false, briefly explain why. 1. A researcher has constructed scales

More information

Statistical Mechanics of Money, Income, and Wealth

Statistical Mechanics of Money, Income, and Wealth Statistical Mechanics of Money, Income, and Wealth Victor M. Yakovenko Adrian A. Dragulescu and A. Christian Silva Department of Physics, University of Maryland, College Park, USA http://www2.physics.umd.edu/~yakovenk/econophysics.html

More information

In the previous chapter, we learned how to use the method of least-squares

In the previous chapter, we learned how to use the method of least-squares 03-Kahane-45364.qxd 11/9/2007 4:40 PM Page 37 3 Model Performance and Evaluation In the previous chapter, we learned how to use the method of least-squares to find a line that best fits a scatter of points.

More information

Appendix 5 Summary of State Trademark Registration Provisions (as of July 2016)

Appendix 5 Summary of State Trademark Registration Provisions (as of July 2016) Appendix 5 Summary of State Trademark Registration Provisions (as of July 2016) App. 5-1 Registration Renewal Assignments Dates Term # of of 1st # of Use # of Form Serv. Key & State (Years) Fee Spec. Use

More information

Hi, my name is Dr. Ann Weaver of Argosy University. This WebEx is about something in statistics called z-

Hi, my name is Dr. Ann Weaver of Argosy University. This WebEx is about something in statistics called z- Hi, my name is Dr. Ann Weaver of Argosy University. This WebEx is about something in statistics called z- Scores. I have two purposes for this WebEx, one, I just want to show you how to use z-scores in

More information

Physics 6A Lab Experiment 6

Physics 6A Lab Experiment 6 Biceps Muscle Model Physics 6A Lab Experiment 6 Introduction This lab will begin with some warm-up exercises to familiarize yourself with the theory, as well as the experimental setup. Then you ll move

More information

Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras

Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras Lecture - 39 Regression Analysis Hello and welcome to the course on Biostatistics

More information

Sampling Distribution Models. Central Limit Theorem

Sampling Distribution Models. Central Limit Theorem Sampling Distribution Models Central Limit Theorem Thought Questions 1. 40% of large population disagree with new law. In parts a and b, think about role of sample size. a. If randomly sample 10 people,

More information

Review of Multiple Regression

Review of Multiple Regression Ronald H. Heck 1 Let s begin with a little review of multiple regression this week. Linear models [e.g., correlation, t-tests, analysis of variance (ANOVA), multiple regression, path analysis, multivariate

More information

Exam #2 Results (as percentages)

Exam #2 Results (as percentages) Oct. 30 Assignment: Read Chapter 19 Try exercises 1, 2, and 4 on p. 424 Exam #2 Results (as percentages) Mean: 71.4 Median: 73.3 Soda attitudes 2015 In a Gallup poll conducted Jul. 8 12, 2015, 1009 adult

More information

Math101, Sections 2 and 3, Spring 2008 Review Sheet for Exam #2:

Math101, Sections 2 and 3, Spring 2008 Review Sheet for Exam #2: Math101, Sections 2 and 3, Spring 2008 Review Sheet for Exam #2: 03 17 08 3 All about lines 3.1 The Rectangular Coordinate System Know how to plot points in the rectangular coordinate system. Know the

More information

AP Statistics. Chapter 6 Scatterplots, Association, and Correlation

AP Statistics. Chapter 6 Scatterplots, Association, and Correlation AP Statistics Chapter 6 Scatterplots, Association, and Correlation Objectives: Scatterplots Association Outliers Response Variable Explanatory Variable Correlation Correlation Coefficient Lurking Variables

More information

Chapter 1 Review of Equations and Inequalities

Chapter 1 Review of Equations and Inequalities Chapter 1 Review of Equations and Inequalities Part I Review of Basic Equations Recall that an equation is an expression with an equal sign in the middle. Also recall that, if a question asks you to solve

More information

An analogy from Calculus: limits

An analogy from Calculus: limits COMP 250 Fall 2018 35 - big O Nov. 30, 2018 We have seen several algorithms in the course, and we have loosely characterized their runtimes in terms of the size n of the input. We say that the algorithm

More information

6.080 / Great Ideas in Theoretical Computer Science Spring 2008

6.080 / Great Ideas in Theoretical Computer Science Spring 2008 MIT OpenCourseWare http://ocw.mit.edu 6.080 / 6.089 Great Ideas in Theoretical Computer Science Spring 2008 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.

More information

Chapter 10 Regression Analysis

Chapter 10 Regression Analysis Chapter 10 Regression Analysis Goal: To become familiar with how to use Excel 2007/2010 for Correlation and Regression. Instructions: You will be using CORREL, FORECAST and Regression. CORREL and FORECAST

More information

Chapter 3 ALGEBRA. Overview. Algebra. 3.1 Linear Equations and Applications 3.2 More Linear Equations 3.3 Equations with Exponents. Section 3.

Chapter 3 ALGEBRA. Overview. Algebra. 3.1 Linear Equations and Applications 3.2 More Linear Equations 3.3 Equations with Exponents. Section 3. 4 Chapter 3 ALGEBRA Overview Algebra 3.1 Linear Equations and Applications 3.2 More Linear Equations 3.3 Equations with Exponents 5 LinearEquations 3+ what = 7? If you have come through arithmetic, the

More information

1 Correlation and Inference from Regression

1 Correlation and Inference from Regression 1 Correlation and Inference from Regression Reading: Kennedy (1998) A Guide to Econometrics, Chapters 4 and 6 Maddala, G.S. (1992) Introduction to Econometrics p. 170-177 Moore and McCabe, chapter 12 is

More information

Math 138: Introduction to solving systems of equations with matrices. The Concept of Balance for Systems of Equations

Math 138: Introduction to solving systems of equations with matrices. The Concept of Balance for Systems of Equations Math 138: Introduction to solving systems of equations with matrices. Pedagogy focus: Concept of equation balance, integer arithmetic, quadratic equations. The Concept of Balance for Systems of Equations

More information

Ratios, Proportions, Unit Conversions, and the Factor-Label Method

Ratios, Proportions, Unit Conversions, and the Factor-Label Method Ratios, Proportions, Unit Conversions, and the Factor-Label Method Math 0, Littlefield I don t know why, but presentations about ratios and proportions are often confused and fragmented. The one in your

More information

LECTURE 2: SIMPLE REGRESSION I

LECTURE 2: SIMPLE REGRESSION I LECTURE 2: SIMPLE REGRESSION I 2 Introducing Simple Regression Introducing Simple Regression 3 simple regression = regression with 2 variables y dependent variable explained variable response variable

More information

Introduce Exploration! Before we go on, notice one more thing. We'll come back to the derivation if we have time.

Introduce Exploration! Before we go on, notice one more thing. We'll come back to the derivation if we have time. Introduce Exploration! Before we go on, notice one more thing. We'll come back to the derivation if we have time. Simplifying the calculation of variance Notice that we can rewrite the calculation of a

More information

2006 Supplemental Tax Information for JennisonDryden and Strategic Partners Funds

2006 Supplemental Tax Information for JennisonDryden and Strategic Partners Funds 2006 Supplemental Information for JennisonDryden and Strategic Partners s We have compiled the following information to help you prepare your 2006 federal and state tax returns: Percentage of income from

More information

THE SAMPLING DISTRIBUTION OF THE MEAN

THE SAMPLING DISTRIBUTION OF THE MEAN THE SAMPLING DISTRIBUTION OF THE MEAN COGS 14B JANUARY 26, 2017 TODAY Sampling Distributions Sampling Distribution of the Mean Central Limit Theorem INFERENTIAL STATISTICS Inferential statistics: allows

More information

Achilles: Now I know how powerful computers are going to become!

Achilles: Now I know how powerful computers are going to become! A Sigmoid Dialogue By Anders Sandberg Achilles: Now I know how powerful computers are going to become! Tortoise: How? Achilles: I did curve fitting to Moore s law. I know you are going to object that technological

More information

Warm-up Using the given data Create a scatterplot Find the regression line

Warm-up Using the given data Create a scatterplot Find the regression line Time at the lunch table Caloric intake 21.4 472 30.8 498 37.7 335 32.8 423 39.5 437 22.8 508 34.1 431 33.9 479 43.8 454 42.4 450 43.1 410 29.2 504 31.3 437 28.6 489 32.9 436 30.6 480 35.1 439 33.0 444

More information

appstats8.notebook October 11, 2016

appstats8.notebook October 11, 2016 Chapter 8 Linear Regression Objective: Students will construct and analyze a linear model for a given set of data. Fat Versus Protein: An Example pg 168 The following is a scatterplot of total fat versus

More information

Chapter 19 Sir Migo Mendoza

Chapter 19 Sir Migo Mendoza The Linear Regression Chapter 19 Sir Migo Mendoza Linear Regression and the Line of Best Fit Lesson 19.1 Sir Migo Mendoza Question: Once we have a Linear Relationship, what can we do with it? Something

More information

Note: Please use the actual date you accessed this material in your citation.

Note: Please use the actual date you accessed this material in your citation. MIT OpenCourseWare http://ocw.mit.edu 18.06 Linear Algebra, Spring 2005 Please use the following citation format: Gilbert Strang, 18.06 Linear Algebra, Spring 2005. (Massachusetts Institute of Technology:

More information

Water tank. Fortunately there are a couple of objectors. Why is it straight? Shouldn t it be a curve?

Water tank. Fortunately there are a couple of objectors. Why is it straight? Shouldn t it be a curve? Water tank (a) A cylindrical tank contains 800 ml of water. At t=0 (minutes) a hole is punched in the bottom, and water begins to flow out. It takes exactly 100 seconds for the tank to empty. Draw the

More information

Lecture 10: F -Tests, ANOVA and R 2

Lecture 10: F -Tests, ANOVA and R 2 Lecture 10: F -Tests, ANOVA and R 2 1 ANOVA We saw that we could test the null hypothesis that β 1 0 using the statistic ( β 1 0)/ŝe. (Although I also mentioned that confidence intervals are generally

More information

Discrete Structures Proofwriting Checklist

Discrete Structures Proofwriting Checklist CS103 Winter 2019 Discrete Structures Proofwriting Checklist Cynthia Lee Keith Schwarz Now that we re transitioning to writing proofs about discrete structures like binary relations, functions, and graphs,

More information

Algebra. Here are a couple of warnings to my students who may be here to get a copy of what happened on a day that you missed.

Algebra. Here are a couple of warnings to my students who may be here to get a copy of what happened on a day that you missed. This document was written and copyrighted by Paul Dawkins. Use of this document and its online version is governed by the Terms and Conditions of Use located at. The online version of this document is

More information

MITOCW ocw f99-lec30_300k

MITOCW ocw f99-lec30_300k MITOCW ocw-18.06-f99-lec30_300k OK, this is the lecture on linear transformations. Actually, linear algebra courses used to begin with this lecture, so you could say I'm beginning this course again by

More information

Electromagnetic Theory Prof. D. K. Ghosh Department of Physics Indian Institute of Technology, Bombay

Electromagnetic Theory Prof. D. K. Ghosh Department of Physics Indian Institute of Technology, Bombay Electromagnetic Theory Prof. D. K. Ghosh Department of Physics Indian Institute of Technology, Bombay Lecture -1 Element of vector calculus: Scalar Field and its Gradient This is going to be about one

More information

MIT BLOSSOMS INITIATIVE

MIT BLOSSOMS INITIATIVE MIT BLOSSOMS INITIATIVE The Broken Stick Problem Taught by Professor Richard C. Larson Mitsui Professor of Engineering Systems and of Civil and Environmental Engineering Segment 1 Hi! My name is Dick Larson

More information

Correlation and Regression

Correlation and Regression Elementary Statistics A Step by Step Approach Sixth Edition by Allan G. Bluman http://www.mhhe.com/math/stat/blumanbrief SLIDES PREPARED BY LLOYD R. JAISINGH MOREHEAD STATE UNIVERSITY MOREHEAD KY Updated

More information

Calculus II. Calculus II tends to be a very difficult course for many students. There are many reasons for this.

Calculus II. Calculus II tends to be a very difficult course for many students. There are many reasons for this. Preface Here are my online notes for my Calculus II course that I teach here at Lamar University. Despite the fact that these are my class notes they should be accessible to anyone wanting to learn Calculus

More information

Estimating Dynamic Games of Electoral Competition to Evaluate Term Limits in U.S. Gubernatorial Elections: Online Appendix

Estimating Dynamic Games of Electoral Competition to Evaluate Term Limits in U.S. Gubernatorial Elections: Online Appendix Estimating Dynamic Games of Electoral Competition to Evaluate Term Limits in U.S. Gubernatorial Elections: Online ppendix Holger Sieg University of Pennsylvania and NBER Chamna Yoon Baruch College I. States

More information

Immigration attitudes (opposes immigration or supports it) it may seriously misestimate the magnitude of the effects of IVs

Immigration attitudes (opposes immigration or supports it) it may seriously misestimate the magnitude of the effects of IVs Logistic Regression, Part I: Problems with the Linear Probability Model (LPM) Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised February 22, 2015 This handout steals

More information

Lesson 3-1: Solving Linear Systems by Graphing

Lesson 3-1: Solving Linear Systems by Graphing For the past several weeks we ve been working with linear equations. We ve learned how to graph them and the three main forms they can take. Today we re going to begin considering what happens when we

More information

determine whether or not this relationship is.

determine whether or not this relationship is. Section 9-1 Correlation A correlation is a between two. The data can be represented by ordered pairs (x,y) where x is the (or ) variable and y is the (or ) variable. There are several types of correlations

More information

Line Integrals and Path Independence

Line Integrals and Path Independence Line Integrals and Path Independence We get to talk about integrals that are the areas under a line in three (or more) dimensional space. These are called, strangely enough, line integrals. Figure 11.1

More information