5.4 Some Conceptual Underpinnings

Size: px
Start display at page:

Download "5.4 Some Conceptual Underpinnings"

Transcription

1 5.4. SOME CONCEPTUAL UNDERPINNINGS 229 What, then, is an appropriate definition of identifiability for nonparametric models? One reasonable definition is that answers to interventional queries are unique, and this is precisely how Definition interprets the identification of the causal effect P (yjdo(x)). As we have seen in Chapters 3 and 4, many aspects of nonparametric identification can be determined graphically, almost by inspection, from the diagrams that accompany the equations. These include tests for deciding whether a given interventional query is identifiable as well as formulas for estimating such queries. 5.4 Some Conceptual Underpinnings What Do Structural Parameters Really Mean? Every student of SEM has stumbled on the following paradox at some point in his or her career. If we interpret the coefficient in the equation y = x + as the change in E(Y ) per unit change of X, then, after rewriting the equation as x =(y )= we ought to interpret 1= as the change in E(X) per unit change of Y. But this conflicts both with intuition and with the prediction of the model: the change in E(X) per unit change of Y ought to be zero if Y does not appear as an independent variable in the original, structural equation for X. Teachers of SEM generally evade this dilemma via one of two escape routes. One route involves denying that has any causal reading and settling for a purely statistical interpretation, in which measures the reduction in the variance of Y explained by X (see e.g. Muthen 1987). The other route permits causal reading of only those coefficients that meet the isolation restriction (Bollen 1989; James et al. 1982): the explanatory variable must be uncorrelated with the error in the equation. Because cannot be uncorrelated with both X and Y (or so the argument goes), and 1= cannot both have causal meaning, and the paradox dissolves. The first route is self-consistent, but it compromises the founders intent that SEM function as an aid to policy making and clashes with the intuition of most SEM users. The second is vulnerable to attack logically. It is well known that every pair of bivariate normal variables, X and Y, can be expressed in two equivalent ways, y = x + 1 and x = y + 2 where cov(x 1 )=cov(y 2 )=0and = r XY = X 2 =2 Y. Thus, if the condition cov(x 1 )=0endows with causal meaning, then cov(y 2 )=0ought to endow with causal meaning as well. But this, too, conflicts with both intuition and the intentions behind SEM; the change in E(X) per unit change of Y ought to be zero, not r XY, if there is no causal path from Y to X. What then is the meaning of a structural coefficient? Or a structural equation? Or an error term? The interventional interpretation of causal effects, when coupled with

2 230CHAPTER 5. CAUSALITY AND STRUCTURAL MODELS IN SOCIAL SCIENCE AND ECONOMICS the do(x) notation, provides simple answers to these questions. The answers explicate the operational meaning of structural equations and thus should end, I hope an era of controversy and confusion regarding these entities. Structural Equations: Operational Definition Definition (Structural Equations) An equation y = x + is said to be structural if it is to be interpreted as follows: In an ideal experiment where we control X to x and any other set Z of variables (not containing X or Y ) to z, the value y of Y is given by x +, where is not a function of the settings x and z. This definition is operational because all quantities are observable, albeit under conditions of controlled manipulation. That manipulations cannot be performed in most observational studies does not negate the operationality of the definition, much as our inability to observe bacteria with the naked eye does not negate their observability under a microscope. The challenge of SEM is to extract the maximum information concerning what we wish to observe from the little we actually can observe. Note that the operational reading just given makes no claim about how X (or any other variable) will behave when we control Y. This asymmetry makes the equality signs in structural equations different from algebraic equality signs; the former act symmetrically in relating observations on X and Y (e.g., observing Y = 0 implies x = ), but they act asymmetrically when it comes to interventions (e.g., setting Y to zero tells us nothing about the relation between x and ). The arrows in path diagrams make this dual role explicit, and this may account for the insight and inferential power gained through the use of diagrams. The strongest empirical claim of the equation y = x + is made by excluding other variables from the r.h.s. of the equation, thus proclaiming X the only immediate cause of Y. This translates into a testable claim of invariance: the statistics of Y under condition do(x) should remain invariant to the manipulation of any other variable in the model (see Section 1.3.2). 16 This claim can be written symbolically as P (yjdo(x) do(z)) = P (yjdo(x)) (5.23) for all Z disjoint of fx [ Y g. 17 Note that this invariance holds relative to manipulations, not observations, of Z. The statistics of Y under condition do(x) given the measurement Z = z, written 16 The basic notion that structural equations remain invariant to certain changes in the system goes back to Marschak (1950) and Simon (1953), and it has received mathematical formulation at various levels of abstraction in Hurwicz (1962), Mesarovic (1969), Sims (1977), Cartwright (1989), Hoover (1990), and Woodward (1995). The simplicity, precision, and clarity of (5.23) is unsurpassed, however. 17 This claim is, in fact, only part of the message conveyed by the equation; the other part consists of a dynamic or counterfactual claim: If we were to control X to x 0 instead of x, then Y would attain the value x + 0. In other words, plotting the value of Y under various hypothetical controls of X, and under the same external conditions (), should result in a straight line with slope. Such deterministic dynamic claims concerning system behavior under successive control conditions can only be tested under the assumption that, representing external conditions or properties of experimental units, remains unaltered as we switch from x to x 0. Such counterfactual claims constitute the empirical content of every scientific law (see Section 7.2.2).

3 5.4. SOME CONCEPTUAL UNDERPINNINGS 231 P (yjdo(x) z), would certainly depend on z if the measurement were taken on a consequence (i.e. descendant) of Y. Note also that the ordinary conditional probability P (yjx) does not enjoy such a strong property of invariance, since P (yjx) is generally sensitive to manipulations of variables other than X in the model (unless X and are independent). Equation (5.23), in contrast, remains valid regardless of the statistical relationship between and X. Generalized to a set of several structural equations, (5.23) explicates the assumptions underlying a given causal diagram. If G is the graph associated with a set of structural equations, then the assumptions are embodied in G as follows: (1) every missing arrow say, between X and Y represents the assumption that X has no causal effect on Y once we intervene and hold the parents of Y fixed; and (2) every missing bidirected link between X and Y represents the assumption that the omitted factors that (directly) influence X are uncorrelated with those that (directly) influence Y. We shall define the operational meaning of the latter assumption in (5.25) (5.27). The Structural Parameters: Operational Definition The interpretation of a structural equation as a statement about the behavior of Y under a hypothetical intervention yields a simple definition for the structural parameters. The meaning of in the equation y = x + is simply E[Y jdo(x) ] that is, the rate of change (relative to x) of the expectation of Y in an experiment where X is held at x by external control. This interpretation holds regardless of whether and X are correlated in nonexperimental studies (e.g., via another equation x = y + ). We hardly need to add at this point that has nothing to do with the regression coefficient r YX or, equivalently, with the conditional expectation E(Y jx), as suggested in many textbooks. The conditions under which coincides with the regression coefficient are spelled out in Theorem It is important nevertheless to compare the definition of (5.24) with theories that acknowledge the invariant character of but have difficulties explicating which changes is invariant to. Cartwright (1989, p. 194), for example, characterizes as an invariant of nature that she calls capacity. She states correctly that remains constant under change but explains that, as the statistics of X changes, it is the ratio [ = E(Y X)=E(X 2 )] which remains fixed no matter how the variances shift. This characterization is imprecise on two accounts. First, may in general not be equal to the stated ratio nor to any other combination of statistical parameters. Second and this is the main point of Definition structural parameters are invariant to local interventions (i.e., changes in specific equations in the system) and not to general changes in the statistics of the variables. If we start with cov(x ) =0and the variance of X changes because we (or Nature) locally modify the process that generates X, then Cartwright is correct; the ratio = E(Y X)=E(X 2 ) will remain constant. However, if the variance of X changes for any other reason say, because we observed some evidence Z = z that depends on both X and Y or because the process generat-

4 232CHAPTER 5. CAUSALITY AND STRUCTURAL MODELS IN SOCIAL SCIENCE AND ECONOMICS ing X becomes dependent on a wider set of variables then that ratio will not remain constant. The Mystical Error Term: Operational Definition The interpretations given in Definition and (5.24) provide an operational definition for that mystical error term = y E[Y jdo(x)] (5.25) which, despite being unobserved in nonmanipulative studies, is far from being metaphysical or definitional as suggested by some researchers (e.g. Richard 1980; Holland 1988, p. 460; Hendry 1995, p. 62). Unlike errors in regression equations, measures the deviation of Y from its controlled expectation E[Y jdo(x)] and not from its conditional expectation E[Y jx]. The statistics of can therefore be measured from observations on Y once X is controlled. Alternatively, because remains the same regardless of whether X is manipulated or observed, the statistics of = y x can be measured in observational studies if we know. Likewise, correlations among errors can be estimated empirically. For any two nonadjacent variables X and Y, (5.25) yields E[ Y X ]=E[Y Xjdo(pa Y pa X )] E[Y jdo(pa Y )]E[Xjdo(pa X )]: (5.26) Once we have determined the structural coefficients, the controlled expectations E[Y jdo(pa Y )] E[Xjdo(pa X )], and E[Y Xjdo(pa Y pa X )] become known linear functions of the observed variables pa Y and pa X ; hence, the expectations on the r.h.s. of (5.26) can be estimated in observational studies. Alternatively, if the coefficients are not determined, then the expression can be assessed directly in interventional studies by holding pa X and pa Y fixed (assuming X and Y are not in parent-child relationship) and estimating the covariance of X and Y from data obtained under such conditions. Finally, we are often interested not in assessing the numerical value of E[ Y X ] but rather in determining whether Y and X can be assumed to be uncorrelated. For this determination, it suffices to test whether the equality E[Y jx do(s XY )] = E[Y jdo(x) do(s XY )] (5.27) holds true, where s XY stands for (any setting of) all variables in the model excluding X and Y. This test can be applied to any two variables in the model except when Y is a parent of X, in which case the symmetrical equation (with X and Y interchanged) is applicable. The Mystical Error Term: Conceptual Interpretation The authors of SEM textbooks usually interpret error terms as representing the influence of omitted factors. Many SEM researchers are reluctant to accept this interpretation, however, partly because unspecified omitted factors open the door to metaphysical speculations and partly because arguments based on such factors were improperly used

5 5.4. SOME CONCEPTUAL UNDERPINNINGS 233 as a generic, substance-free license to omit bidirected arcs from path diagrams (Mc- Donald 1997). Such concerns are answered by the operational interpretation of error terms, (5.25), since it prescribes how errors are measured, not how they originate. It is important to note, though, that this operational definition is no substitute for the omitted-factors conception when it comes to deciding whether pairs of error terms can be assumed to be uncorrelated. Because such decisions are needed at a stage when the model s parameters are still free, they cannot be made on the basis of numerical assessments of correlations but must rest instead on qualitative structural knowledge about how mechanisms are tied together and how variables affect each other. Such judgmental decisions are hardly aided by the operational criterion of (5.26), which instructs the investigator to assess whether two deviations taken on two different variables under complex experimental conditions would be correlated or uncorrelated. Such assessments are cognitively unfeasible. In contrast, the omitted-factors conception instructs the investigator to judge whether there could be factors that simultaneously influence several observed variables. Such judgments are cognitively manageable because they are qualitative and rest on purely structural knowledge the only knowledge available during this phase of modeling. Another source of error correlation that should be considered by investigators is selection bias. If two uncorrelated unobserved factors have a common effect that is omitted from the analysis but influences the selection of samples for the study, then the corresponding error terms will be correlated in the sampled population; hence, the expectation in (5.26) will not vanish when taken over the sampled population (see discussion of Berkson s paradox in Section 1.2.3). We should emphasize, however, that the arcs missing from the diagram, not those in the diagram, demand the most attention and careful substantive justification. Adding an extra bidirected arc can at worst compromise the identifiability of parameters, but deleting an existing bidirected arc may produce erroneous conclusions as well as a false sense of model testability. Thus, bidirected arcs should be assumed to exist, by default, between any two nodes in the diagram. They should be deleted only by well-motivated justifications, such as the unlikely existence of a common cause for the two variables and the unlikely existence of selection bias. Although we can never be cognizant of all the factors that may affect our variables, substantive knowledge sometimes permits us to state that the influence of a possible common factor is not likely to be significant. Thus, as often happens in the sciences, the way we measure physical entities does not offer the best way of thinking about them. The omitted-factor conception of errors, because it rests on structural knowledge, is a more useful guide than the operational definition when building, evaluating, and thinking about causal models Interpretation of Effect Decomposition Structural equation modeling prides itself, and rightly so, for providing principled methodology for distinguishing direct from indirect effects. We have seen in Section 4.5 that such distinction is important in many applications, ranging from process control to legal disputes, and that SEM indeed provides a coherent methodology of defining, identifying, and estimating direct and indirect effects. However, the reluctance of

6 234CHAPTER 5. CAUSALITY AND STRUCTURAL MODELS IN SOCIAL SCIENCE AND ECONOMICS most SEM researchers to admit the causal reading of structural parameters coupled with their preoccupation with algebraic manipulations has resulted in inadequate definitions of direct and indirect effects, as pointed out by Freedman (1987) and Sobel (1990). In this section we hope to correct this confusion by adhering to the operational meaning of the structural coefficients. We start with the general notion of a causal effect P (yjdo(x)), as in Definition We then specialize it to define direct effect, as in Section 4.5, and finally express the definitions in terms of structural coefficients. Definition (Total Effect) The total effect of X on Y is given by P (yjdo(x)), namely, the distribution of Y while X is held constant at x and all other variables are permitted to run their natural course. Definition (Direct Effect) The direct effect of X on Y is given by P (yjdo(x) do(s XY )), where S XY is the set of all observed variables in the system except X and Y. In linear analysis, Definitions and yield, after differentiation with respect to x, the familiar path coefficients in terms of which direct and indirect effects are usually defined. Yet they differ from conventional definitions in several important aspects. First, direct effects are defined in terms of hypothetical experiments in which intermediate variables are held constant by physical intervention, not by statistical adjustment (which is often disguised under the misleading phrase control for ). Figure 5.10 depicts a simple example where adjusting for the intermediate variables (Z and W ) would not give the correct value of zero for the direct effect of X on E(Y jdo(x y w)) does yield the correct value: (w + z) = 0. (Table 4.1) provides another such example, one that involves dichotomous variables. Second, there is no need to limit control to only intermediate variables; all variables in the system may be held constant (except for X and Y ). Hypothetically, the scientist controls for all possible conditions S XY, and measurements may commence without knowing the structure of the diagram. Finally, our definitions differ from convention by interpreting total and direct effects independently of each other, as outcomes of two different experiments. Textbook definitions (e.g. Bollen 1989, p. 376; Mueller 1996, p. 141; Kline 1998, p. 175) usually equate the total effect with a power series of path coefficient matrices. This algebraic definition coincides with the operational definition (Definition 5.4.2) in recursive (semi-markovian) systems, but it yields erroneous expressions in models with feedback. For instance, given the pair of equations fy = x + x = y + g, the total effect of X on Y is simply, not (1 ) 1 as stated in Bollen (1989, p. 379). The latter has no operational significance worthy of the phrase effect of X. 18 We end this section of effect decomposition with a few remarks that should be of interest to researchers dealing with dichotomous variables. The relations among such 18 This error was noted by Sobel (1990) but, perhaps because constancy of path coefficients was presented as a new and extraneous assumption, Sobel s correction has not brought about a shift in practice or philosophy.

7 5.4. SOME CONCEPTUAL UNDERPINNINGS 235 variables are usually nonlinear, so the results of Section 4.5 should be applicable. In particular, the direct effect of X on Y should will depend on the levels at which we hold the other parents of Y. If we wish to average over these values, we obtain the expression given in Section In standard linear analysis, an indirect effect may be defined as the difference between the total effect and the direct effects (Bollen 1989). In nonlinear analysis, differences lose their significance, and one must isolate the contribution of mediating paths in some other way. Expressions of the form P (yjdo(x) do(z)) cannot be used to isolate such contributions because there is no physical means of selectively disabling a direct causal link from X to Y by holding some variables constant. This suggests that the notion of indirect effect has no intrinsic operational meaning apart from providing a comparison between the direct and the total effects. In other words, a policy maker who asks for that part of the total effect transmitted by a particular intermediate variable or by a group Z of such variables is really asking for a comparison of the effects of two policies, one where Z is held constant versus the other where it is not. The expressions corresponding to these policies are P (yjdo(x) do(z)) and P (yjdo(x)), and this pair of distributions should be taken as the most general representation of indirect effects. Similar conclusions have been expressed by Robins (1986) and Robins and Greenland (1992) Exogeneity, Superexogeneity, and Other Frills Economics textbooks invariably warn readers that the distinction between exogenous and endogenous variables is, on the one hand, most important for model building (Darnell 1994, p. 127) and, on the other hand, a subtle and sometimes controversial complication (Greene 1997, p. 712). Economics students would naturally expect the concepts and tools developed in this chapter to shed some light on the subject, and rightly so. We next offer a simple definition of exogeneity that captures the important nuances appearing in the literature and that is both palatable and precise, It is fashionable today to distinguish three types of exogeneity: weak, strong, and super (Engle et al. 1983); the former two are statistical and the latter causal. However, the importance of exogeneity and the reason for its controversial status lies in its implications for policy interventions. Some economists believe, therefore, that only the causal aspect (i.e. superexogeneity) deserves the exogenous title and that the statistical versions are unwarranted intruders that tend to confuse issues of identification and interpretability with those of estimation efficiency (Ed Leamer, personal communication). 19 I will serve both camps by starting with a simple definition of causal exogeneity and then offering a more general definition, from which both the causal and the statistical aspects would follow as special cases. Thus, what we call exogeneity corresponds to what Engle et al. called superexogeneity, a notion that captures economists interest in the structural invariance of certain relationships under policy intervention. Suppose that we consider intervening on a set of variables X and that we wish 19 Similiar opinions have also been communicated by John Aldrich and James Heckman. See also Aldrich (1993).

8 236CHAPTER 5. CAUSALITY AND STRUCTURAL MODELS IN SOCIAL SCIENCE AND ECONOMICS to characterize the statistical behavior of a set Y of outcome variables under the intervention do(x = x). Denote the postintervention distribution of Y by the usual expression P (yjdo(x)). If we are interested in a set of parameters of that distribution, then our task is to estimate [P (yjdo(x)] from the available data. However, the data available is typically generated under a different set of conditions: X was not held constant but instead was allowed to vary with whatever economical pressures and expectations prompted decision makers to set X in the past. Denoting the process that generated data in the past by M and the probability distribution associated with M by PM (v), we ask whether [PM (yjdo(x)] can be estimated consistently from samples drawn from PM (v), given our background knowledge T (connoting theory ) about M. This is essentially the problem of identification that we have analyzed in this and previous chapters, with one important difference; we now ask whether [P (yjdo(x)] can be identified from the conditional distribution P (yjx) alone, instead of from the entire joint distribution P (v). When identification holds under this restricted condition, X is said to be exogenous relative to (Y T ). We may state this formally as follows. Definition (Exogeneity) Let X and Y be two sets of variables, and let be any set of parameters of the postintervention probability P (yjdo(x)). We say that X is exogenous relative to (Y T ) if is identifiable from the conditional distribution P (yjx), that is, if PM 1 (yjx) =PM 2 (yjx) =) [PM 1 (yjdo(x))] = [PM 2 (yjdo(x))] (5.28) for any two models, M 1 and M 2, satisfying theory T. In the special case where constitutes a complete specification of the postintervention probabilities, (5.28) reduces to the implication PM 1 (yjx) =PM 2 (yjx) =) PM 1 (yjdo(x)) = PM 2 (yjdo(x)): (5.29) If we further assume that, for every P (yjx), our theory T does not a priori exclude some model M 2 satisfying PM 2 (yjdo(x)) = PM 2 (yjx), 20 then (5.29) reduces to the equality P (yjdo(x)) = P (yjx) (5.30) a condition we recognize as no confounding (see Sections 3.3 and 6.2). Equation (5.30) follows (from (5.29)) because (5.29) must hold for all M 1 in T. Note that, since the theory T is not mentioned explicitly, (5.30) can be applied to any individual model M and can be taken as yet another definition of exogeneity albeit a stronger one than (5.28). The motivation for insisting that be identifiable from the conditional distribution P (yjx) alone, even though the marginal distribution P (x) is available, lies in its ramification for the process of estimation. As stated in (5.30), discovering that X is 20 For example, if T stands for all models possessing the same graph structure, then such M2 is not a priori excluded.

9 5.4. SOME CONCEPTUAL UNDERPINNINGS 237 exogenous permits us to predict the effect of interventions (in X) directly from passive observations, without even adjusting for confounding factors. Our analyses in Sections 3.3 and 5.3 further provide a graphical test of exogeneity: X is exogenous for Y if there is no unblocked back-door path from X to Y (Theorem 5.3.2). This test supplements the declarative definition of (5.30) with a procedural definition and thus completes the formalization of exogeneity. That the invariance properties usually attributable to superexogeneity are discernible from the topology of the causal diagram should come as no surprise, considering that each causal diagram represents a structural model and that each structural model already embodies the invariance assumptions necessary for policy predictions (see Definition 5.4.1). Leamer (1985) defined X to be exogenous if P (yjx) remains invariant to changes in the process that generates X. This definition coincides 21 with (5.30) because P (yjdo(x)) is governed by a structural model in which the equations determining X are wiped out; thus, P (yjx) must be insensitive to the nature of those equations. In contrast, Engle et al. (1983) defined exogeneity (i.e., their superexogeneity) in terms of changes in the marginal density of X; as usual, the transition from process language to statistical terminology leads to ambiguities. According to Engle et al. (1983, p. 284), exogeneity requires that all the parameters of the conditional distribution P (yjx) be invariant for any change in the distribution of the conditioning variables 22 (i.e. P (x)). This requirement of constancy under any change in P (x) is too strong changing conditions or new observations can easily alter both P (x) and P (yjx) even when X is perfectly exogenous. (To illustrate, consider a change that turns a randomized experiment, where X is indisputably exogenous, into a nonrandomized experiment; we should not insist on P (yjx) remaining invariant under such change.) The class of changes considered must be restricted to local modification of the mechanisms (or equations) that determine X, as stated by Leamer, and this restriction must be incorporated into any definition of exogeneity. In order to make this restriction precise, however, the vocabulary of SEMs must be invoked as in the definition of P (yjdo(x)); the vocabulary of marginal and conditional densities is far too coarse to properly define the changes against which P (yjx) ought to remain invariant. We are now ready to define a more general notion of exogeneity, one that includes weak and super exogeneities under the same umbrella. 23 Toward that end, we remove from Definition the restriction that must represent features of the postintervention distribution. Instead, we allow to represent any feature of the underlying model M, including structural features such as path coefficients, causal effects, and counterfactuals, and including statistical features (which could, of course, be ascertained from the joint distribution alone). With this generalization, we also obtain a simpler definition of exogeneity. Definition (General Exogeneity) Let X and Y be two sets of variables, and let be any set of parameters defined on a 21 Provided that changes are confined to modification of functions without changing the set of arguments (i.e. parents) in each function. 22 This requirement is repeated verbatim in Darnell (1994, p. 131) and Maddala (1992, p. 192). 23 We leave out discussion of strong exogeneity, which is a slightly more involved version of weak exogeneity applicable to time-series analysis.

10 238CHAPTER 5. CAUSALITY AND STRUCTURAL MODELS IN SOCIAL SCIENCE AND ECONOMICS structural model M in a theory T. We say that X is exogenous relative to (Y T ) if is identifiable from the conditional distribution P (yjx), that is, if PM 1 (yjx) =PM 2 (yjx) =) (M 1 )=(M 2 ) (5.31) for any two models, M 1 and M 2, satisfying theory T. When consists of structural parameters, such as path coefficients or causal effects, (5.31) expresses invariance to a variety of interventions, not merely do(x = x). Although the interventions themselves are not mentioned explicitly in (5.31), the equality (M 1 )= (M 2 ) reflects such interventions through the structural character of. In particular, if stands for the values of the causal effect function P (yjdo(x)) at selected points of x and y, then (5.31) reduces to the implication PM 1 (yjx) =PM 2 (yjx) =) PM 1 (yjdo(x)) = PM 2 (yjdo(x)) (5.32) which is identical to (5.29). Hence the causal properties of exogeneity follow. When consists of strictly statistical parameters such as means, modes, regression coefficients, or other distributional features the structural features of M do not enter into consideration; we have (M ) =(PM ) and so (5.31) reduces to P 1 (yjx) =P 2 (yjx) =) (P 1 )=(P 2 ) (5.33) for any two probability distributions P 1 (x y) and P 2 (x y) that are consistent with T. We have thus obtained a statistical notion of exogeneity that permits us to ignore the marginal P (x) in the estimation of and that we may call weak exogeneity. 24 Finally, if consists of causal effects among variables in Y (excluding X), we obtain a generalized definition of instrumental variables. For example, if our interest lies in the causal effect = P (wjdo(z)), where W and Z are two sets of variables in Y, then the exogeneity of X relative to this parameter ensures the identification of P (wjdo(z)) from the conditional probability P (z wjx). This is indeed the role of an instrumental variable to assist in the identification of causal effects not involving the instrument. (See Figure 5.9, with Z X Y representing X Z W, respectively.) A word of caution regarding the language used in most textbooks: exogeneity is frequently defined by asking whether parameters enter into the expressions of the conditional or the marginal density. For example, Maddala (1992, p. 392) defined weak exogeneity as the requirement that the marginal distribution P (x) does not involve. Such definitions are not unambiguous, because the question of whether a parameter enters a density or whether a density involves a parameter are syntax-dependent; different algebraic representations may make certain parameters explicit or obscure. For example, if X and Y are dichotomous, then the marginal probability P (x) certainly involves parameters such as 1 = P (x 0 y 0 )+P (x 0 y 1 ) and 2 = P (x 0 y 0 ) 24 Engle et al. (1983) further imposed a requirement called variation-free, which is satisfied by default when dealing with genuinely structural models M in which mechanisms do not constrain one another.

11 5.4. SOME CONCEPTUAL UNDERPINNINGS 239 as well as their ratio: = 2 = 1 : Therefore, writing P (x 0 )= 2 = whos that both and 2 are involved in the marginal probability P (x 0 ), and one may be tempted to conclude that X is not exogenous relative to. Yet X is in fact exogenous relative to, because the ratio = 2 = 1 is none other than P (y 0 jx 0 ); hence it is determined uniquely by P (y 0 jx 0 ) as required by (5.33). 25 The advantage of the definition given in (5.31) is that it depends not on the syntactic representation of the density function but rather on its semantical content alone. Parameters are treated as quantities computed from a model, and not as mathematical symbols that describe a model. Consequently, the definition applies to both statistical and structural parameters and, in fact, to any quantity that can be computed from a structural model M, regardless of whether it serves (or may serve) in the description of the marginal or conditional densities. The Mystical Error Term Revisited Historically, the definition of exogeneity that has evoked most controversy is the one expressed in terms of correlation between variables and errors. It reads as follows. Definition (Error-Based Exogeneity) As variable X is exogenous (relative to = P (yjdo(x))) if X is independent of all errors that influence Y, except those mediated by X. This definition, which Hendry and Morgan (1995) trace to Orcutt (1952), became standard in the econometric literature between 1950 and 1970 (e.g. Christ 1966, p. 156; Dhrymes 1970, p. 169) and still serves to guide the thoughts of most econometricians (as in the selection of instrumental variables; Bowden and Turkington 1984). However, it came under criticism in the early 1980s when the distinction between structural errors (equation (5.25)) and regression errors became obscured (Richard 1980). (Regression errors, by definition, are orthogonal to the regressors.) The Cowles Commission logic of structural equations (see Section 5.1) has not reached full mathematical maturity and by denying notational distinction between structural and regressional parameters has left all notions based on error terms suspect of ambiguity. The prospect of establishing an entirely new foundation of exogeneity seemingly free of theoretical terms such as errors and structure (Engle et al. 1983) has further dissuaded economists from tidying up the Cowles Commission logic, and criticism of the error-based definition of exogeneity has become increasingly fashionable. For example, Hendry and Morgan (1995) wrote that the concept of exogeneity rapidly evolved into a loose notion as a property of an observable variable being uncorrelated with an unobserved error, and Imbens (1997) readily agreed that this notion is inadequate Engle et al. (1983, p. 281) and Hendry (1995, pp ) attempted to overcome this ambiguity by using reparameterization an unnecessary complication. 26 Imbens prefers definitions in terms of experimental metaphors such as random assignment assumption, fearing, perhaps, that [t]ypically the researcher does not have a firm idea what these disturbances really represent (Angrist et al. p. 446).

12 240CHAPTER 5. CAUSALITY AND STRUCTURAL MODELS IN SOCIAL SCIENCE AND ECONOMICS These critics are hardly justified if we consider the precision and clarity with which structural errors can be defined when using the proper notation (e.g. (5.25)). When applied to structural errors, the standard error-based criterion of exogeneity coincides formally with that of (5.30), as can be verified using the back-door test of Theorem (with Z = ). Consequently, the standard definition conveys the same information as that embodied in more complicated and less communicable definitions of exogeneity. I am therefore convinced that the standard definition will eventually regain the acceptance and respectability that it has always deserved. Relationships between graphical and counterfactual definitions of exogeneity and instrumental variables will be discussed in Chapter 7 (Section 7.4.5). 5.5 Conclusion Today the enterprise known as structural equation modeling is increasingly under fire. The founding fathers have retired, their teachings are forgotten, and practitioners, teachers, and researchers currently find the methodology they inherited difficult to either defend or supplant. Modern SEM textbooks are preoccupied with parameter estimation and rarely explicate the role that those parameters play in causal explanations or in policy analysis; examples dealing with the effects of interventions are conspicuously absent, for instance. Research in SEM now focuses almost exclusively on model fitting, while issues pertaining to the meaning and usage of SEM s models are subjects of confusion and controversy. I am thoroughly convinced that the contemporary crisis in SEM originates in the lack of a mathematical language for handling the causal information embedded in structural equations. Graphical models have provided such a language. They have thus helped us answer many of the unsettled questions that drive the current crisis: 1. Under what conditions can we give causal interpretation to structural coefficients? 2. What are the causal assumptions underlying a given structural equation model? 3. What are the statistical implications of any given structural equation model? 4. What is the operational meaning of a given structural coefficient? 5. What are the policy-making claims of any given structural equation model? 6. When is an equation not structural? This chapter has described the conceptual developments that now resolve such foundational questions. In addition, we have presented several tools to be used in answering questions of practical importance: 1. When are two structural equation models observationally indistinguishable? 2. When do regression coefficients represent path coefficients? 3. When would the addition of a regressor introduce bias?

5.3 Graphs and Identifiability

5.3 Graphs and Identifiability 218CHAPTER 5. CAUSALIT AND STRUCTURAL MODELS IN SOCIAL SCIENCE AND ECONOMICS.83 AFFECT.65.23 BEHAVIOR COGNITION Figure 5.5: Untestable model displaying quantitative causal information derived. method elucidates

More information

From Causality, Second edition, Contents

From Causality, Second edition, Contents From Causality, Second edition, 2009. Preface to the First Edition Preface to the Second Edition page xv xix 1 Introduction to Probabilities, Graphs, and Causal Models 1 1.1 Introduction to Probability

More information

6.3 How the Associational Criterion Fails

6.3 How the Associational Criterion Fails 6.3. HOW THE ASSOCIATIONAL CRITERION FAILS 271 is randomized. We recall that this probability can be calculated from a causal model M either directly, by simulating the intervention do( = x), or (if P

More information

CAUSALITY. Models, Reasoning, and Inference 1 CAMBRIDGE UNIVERSITY PRESS. Judea Pearl. University of California, Los Angeles

CAUSALITY. Models, Reasoning, and Inference 1 CAMBRIDGE UNIVERSITY PRESS. Judea Pearl. University of California, Los Angeles CAUSALITY Models, Reasoning, and Inference Judea Pearl University of California, Los Angeles 1 CAMBRIDGE UNIVERSITY PRESS Preface page xiii 1 Introduction to Probabilities, Graphs, and Causal Models 1

More information

CAUSAL INFERENCE IN THE EMPIRICAL SCIENCES. Judea Pearl University of California Los Angeles (www.cs.ucla.edu/~judea)

CAUSAL INFERENCE IN THE EMPIRICAL SCIENCES. Judea Pearl University of California Los Angeles (www.cs.ucla.edu/~judea) CAUSAL INFERENCE IN THE EMPIRICAL SCIENCES Judea Pearl University of California Los Angeles (www.cs.ucla.edu/~judea) OUTLINE Inference: Statistical vs. Causal distinctions and mental barriers Formal semantics

More information

OF CAUSAL INFERENCE THE MATHEMATICS IN STATISTICS. Department of Computer Science. Judea Pearl UCLA

OF CAUSAL INFERENCE THE MATHEMATICS IN STATISTICS. Department of Computer Science. Judea Pearl UCLA THE MATHEMATICS OF CAUSAL INFERENCE IN STATISTICS Judea earl Department of Computer Science UCLA OUTLINE Statistical vs. Causal Modeling: distinction and mental barriers N-R vs. structural model: strengths

More information

OUTLINE CAUSAL INFERENCE: LOGICAL FOUNDATION AND NEW RESULTS. Judea Pearl University of California Los Angeles (www.cs.ucla.

OUTLINE CAUSAL INFERENCE: LOGICAL FOUNDATION AND NEW RESULTS. Judea Pearl University of California Los Angeles (www.cs.ucla. OUTLINE CAUSAL INFERENCE: LOGICAL FOUNDATION AND NEW RESULTS Judea Pearl University of California Los Angeles (www.cs.ucla.edu/~judea/) Statistical vs. Causal vs. Counterfactual inference: syntax and semantics

More information

A Distinction between Causal Effects in Structural and Rubin Causal Models

A Distinction between Causal Effects in Structural and Rubin Causal Models A istinction between Causal Effects in Structural and Rubin Causal Models ionissi Aliprantis April 28, 2017 Abstract: Unspecified mediators play different roles in the outcome equations of Structural Causal

More information

1 Motivation for Instrumental Variable (IV) Regression

1 Motivation for Instrumental Variable (IV) Regression ECON 370: IV & 2SLS 1 Instrumental Variables Estimation and Two Stage Least Squares Econometric Methods, ECON 370 Let s get back to the thiking in terms of cross sectional (or pooled cross sectional) data

More information

Causality II: How does causal inference fit into public health and what it is the role of statistics?

Causality II: How does causal inference fit into public health and what it is the role of statistics? Causality II: How does causal inference fit into public health and what it is the role of statistics? Statistics for Psychosocial Research II November 13, 2006 1 Outline Potential Outcomes / Counterfactual

More information

OUTLINE THE MATHEMATICS OF CAUSAL INFERENCE IN STATISTICS. Judea Pearl University of California Los Angeles (www.cs.ucla.

OUTLINE THE MATHEMATICS OF CAUSAL INFERENCE IN STATISTICS. Judea Pearl University of California Los Angeles (www.cs.ucla. THE MATHEMATICS OF CAUSAL INFERENCE IN STATISTICS Judea Pearl University of California Los Angeles (www.cs.ucla.edu/~judea) OUTLINE Modeling: Statistical vs. Causal Causal Models and Identifiability to

More information

Comments on The Role of Large Scale Assessments in Research on Educational Effectiveness and School Development by Eckhard Klieme, Ph.D.

Comments on The Role of Large Scale Assessments in Research on Educational Effectiveness and School Development by Eckhard Klieme, Ph.D. Comments on The Role of Large Scale Assessments in Research on Educational Effectiveness and School Development by Eckhard Klieme, Ph.D. David Kaplan Department of Educational Psychology The General Theme

More information

Chapter 2: simple regression model

Chapter 2: simple regression model Chapter 2: simple regression model Goal: understand how to estimate and more importantly interpret the simple regression Reading: chapter 2 of the textbook Advice: this chapter is foundation of econometrics.

More information

Reflections, Elaborations, and Discussions with Readers

Reflections, Elaborations, and Discussions with Readers CHAPTER ELEVEN Reflections, Elaborations, and Discussions with Readers As X-rays are to the surgeon, graphs are for causation. The author In this chapter, I reflect back on the material covered in Chapters

More information

On Objectivity and Models for Measuring. G. Rasch. Lecture notes edited by Jon Stene.

On Objectivity and Models for Measuring. G. Rasch. Lecture notes edited by Jon Stene. On Objectivity and Models for Measuring By G. Rasch Lecture notes edited by Jon Stene. On Objectivity and Models for Measuring By G. Rasch Lectures notes edited by Jon Stene. 1. The Basic Problem. Among

More information

The Effects of Interventions

The Effects of Interventions 3 The Effects of Interventions 3.1 Interventions The ultimate aim of many statistical studies is to predict the effects of interventions. When we collect data on factors associated with wildfires in the

More information

SC705: Advanced Statistics Instructor: Natasha Sarkisian Class notes: Introduction to Structural Equation Modeling (SEM)

SC705: Advanced Statistics Instructor: Natasha Sarkisian Class notes: Introduction to Structural Equation Modeling (SEM) SC705: Advanced Statistics Instructor: Natasha Sarkisian Class notes: Introduction to Structural Equation Modeling (SEM) SEM is a family of statistical techniques which builds upon multiple regression,

More information

Econometric Causality

Econometric Causality Econometric (2008) International Statistical Review, 76(1):1-27 James J. Heckman Spencer/INET Conference University of Chicago Econometric The econometric approach to causality develops explicit models

More information

CAUSAL INFERENCE. Stephen F. LeRoy. address:

CAUSAL INFERENCE. Stephen F. LeRoy.  address: CAUSAL INFERENCE Stephen F. LeRoy U C, S B ; N 27, 2018 E-mail address: leroy@ucsb.edu Contents Preface v Part 1. Theory 1 Chapter 1. Structural Models 3 1. Equality and Causation 4 2. Causation Based

More information

REVIEW OF SOCIAL ECONOMY, VOL. LXIII, NO. 1, MARCH Book Reviews

REVIEW OF SOCIAL ECONOMY, VOL. LXIII, NO. 1, MARCH Book Reviews REVIEW OF SOCIAL ECONOMY, VOL. LXIII, NO. 1, MARCH 2005 Book Reviews Causality: Models, Reasoning, and Inference. By Judea Pearl. Cambridge, Cambridge University Press, 2000, xvi, 384 pages, 30.00 (hardcover),

More information

Incompatibility Paradoxes

Incompatibility Paradoxes Chapter 22 Incompatibility Paradoxes 22.1 Simultaneous Values There is never any difficulty in supposing that a classical mechanical system possesses, at a particular instant of time, precise values of

More information

UCLA Department of Statistics Papers

UCLA Department of Statistics Papers UCLA Department of Statistics Papers Title Comment on `Causal inference, probability theory, and graphical insights' (by Stuart G. Baker) Permalink https://escholarship.org/uc/item/9863n6gg Journal Statistics

More information

Chapter 5. Introduction to Path Analysis. Overview. Correlation and causation. Specification of path models. Types of path models

Chapter 5. Introduction to Path Analysis. Overview. Correlation and causation. Specification of path models. Types of path models Chapter 5 Introduction to Path Analysis Put simply, the basic dilemma in all sciences is that of how much to oversimplify reality. Overview H. M. Blalock Correlation and causation Specification of path

More information

Confounding Causality Principles: comment on Rédei and san Pedro s Distinguishing Causality Principles

Confounding Causality Principles: comment on Rédei and san Pedro s Distinguishing Causality Principles Confounding Causality Principles: comment on Rédei and san Pedro s Distinguishing Causality Principles arxiv:1210.1463v1 [quant-ph] 4 Oct 2012 Joe Henson February 10, 2019 Abstract Rédei and san Pedro

More information

ANALYTIC COMPARISON. Pearl and Rubin CAUSAL FRAMEWORKS

ANALYTIC COMPARISON. Pearl and Rubin CAUSAL FRAMEWORKS ANALYTIC COMPARISON of Pearl and Rubin CAUSAL FRAMEWORKS Content Page Part I. General Considerations Chapter 1. What is the question? 16 Introduction 16 1. Randomization 17 1.1 An Example of Randomization

More information

CAUSALITY CORRECTIONS IMPLEMENTED IN 2nd PRINTING

CAUSALITY CORRECTIONS IMPLEMENTED IN 2nd PRINTING 1 CAUSALITY CORRECTIONS IMPLEMENTED IN 2nd PRINTING Updated 9/26/00 page v *insert* TO RUTH centered in middle of page page xv *insert* in second paragraph David Galles after Dechter page 2 *replace* 2000

More information

DECISIONS UNDER UNCERTAINTY

DECISIONS UNDER UNCERTAINTY August 18, 2003 Aanund Hylland: # DECISIONS UNDER UNCERTAINTY Standard theory and alternatives 1. Introduction Individual decision making under uncertainty can be characterized as follows: The decision

More information

Measurement Independence, Parameter Independence and Non-locality

Measurement Independence, Parameter Independence and Non-locality Measurement Independence, Parameter Independence and Non-locality Iñaki San Pedro Department of Logic and Philosophy of Science University of the Basque Country, UPV/EHU inaki.sanpedro@ehu.es Abstract

More information

Wooldridge, Introductory Econometrics, 3d ed. Chapter 16: Simultaneous equations models. An obvious reason for the endogeneity of explanatory

Wooldridge, Introductory Econometrics, 3d ed. Chapter 16: Simultaneous equations models. An obvious reason for the endogeneity of explanatory Wooldridge, Introductory Econometrics, 3d ed. Chapter 16: Simultaneous equations models An obvious reason for the endogeneity of explanatory variables in a regression model is simultaneity: that is, one

More information

Principles Underlying Evaluation Estimators

Principles Underlying Evaluation Estimators The Principles Underlying Evaluation Estimators James J. University of Chicago Econ 350, Winter 2019 The Basic Principles Underlying the Identification of the Main Econometric Evaluation Estimators Two

More information

Background on Coherent Systems

Background on Coherent Systems 2 Background on Coherent Systems 2.1 Basic Ideas We will use the term system quite freely and regularly, even though it will remain an undefined term throughout this monograph. As we all have some experience

More information

IDENTIFICATION OF TREATMENT EFFECTS WITH SELECTIVE PARTICIPATION IN A RANDOMIZED TRIAL

IDENTIFICATION OF TREATMENT EFFECTS WITH SELECTIVE PARTICIPATION IN A RANDOMIZED TRIAL IDENTIFICATION OF TREATMENT EFFECTS WITH SELECTIVE PARTICIPATION IN A RANDOMIZED TRIAL BRENDAN KLINE AND ELIE TAMER Abstract. Randomized trials (RTs) are used to learn about treatment effects. This paper

More information

Causal Inference. Prediction and causation are very different. Typical questions are:

Causal Inference. Prediction and causation are very different. Typical questions are: Causal Inference Prediction and causation are very different. Typical questions are: Prediction: Predict Y after observing X = x Causation: Predict Y after setting X = x. Causation involves predicting

More information

Delayed Choice Paradox

Delayed Choice Paradox Chapter 20 Delayed Choice Paradox 20.1 Statement of the Paradox Consider the Mach-Zehnder interferometer shown in Fig. 20.1. The second beam splitter can either be at its regular position B in where the

More information

On the Identification of a Class of Linear Models

On the Identification of a Class of Linear Models On the Identification of a Class of Linear Models Jin Tian Department of Computer Science Iowa State University Ames, IA 50011 jtian@cs.iastate.edu Abstract This paper deals with the problem of identifying

More information

Online Appendix to Yes, But What s the Mechanism? (Don t Expect an Easy Answer) John G. Bullock, Donald P. Green, and Shang E. Ha

Online Appendix to Yes, But What s the Mechanism? (Don t Expect an Easy Answer) John G. Bullock, Donald P. Green, and Shang E. Ha Online Appendix to Yes, But What s the Mechanism? (Don t Expect an Easy Answer) John G. Bullock, Donald P. Green, and Shang E. Ha January 18, 2010 A2 This appendix has six parts: 1. Proof that ab = c d

More information

Wooldridge, Introductory Econometrics, 4th ed. Chapter 2: The simple regression model

Wooldridge, Introductory Econometrics, 4th ed. Chapter 2: The simple regression model Wooldridge, Introductory Econometrics, 4th ed. Chapter 2: The simple regression model Most of this course will be concerned with use of a regression model: a structure in which one or more explanatory

More information

Hardy s Paradox. Chapter Introduction

Hardy s Paradox. Chapter Introduction Chapter 25 Hardy s Paradox 25.1 Introduction Hardy s paradox resembles the Bohm version of the Einstein-Podolsky-Rosen paradox, discussed in Chs. 23 and 24, in that it involves two correlated particles,

More information

Computational Tasks and Models

Computational Tasks and Models 1 Computational Tasks and Models Overview: We assume that the reader is familiar with computing devices but may associate the notion of computation with specific incarnations of it. Our first goal is to

More information

Abstract. Three Methods and Their Limitations. N-1 Experiments Suffice to Determine the Causal Relations Among N Variables

Abstract. Three Methods and Their Limitations. N-1 Experiments Suffice to Determine the Causal Relations Among N Variables N-1 Experiments Suffice to Determine the Causal Relations Among N Variables Frederick Eberhardt Clark Glymour 1 Richard Scheines Carnegie Mellon University Abstract By combining experimental interventions

More information

1 Correlation and Inference from Regression

1 Correlation and Inference from Regression 1 Correlation and Inference from Regression Reading: Kennedy (1998) A Guide to Econometrics, Chapters 4 and 6 Maddala, G.S. (1992) Introduction to Econometrics p. 170-177 Moore and McCabe, chapter 12 is

More information

Logic. Quantifiers. (real numbers understood). x [x is rotten in Denmark]. x<x+x 2 +1

Logic. Quantifiers. (real numbers understood). x [x is rotten in Denmark]. x<x+x 2 +1 Logic One reason for studying logic is that we need a better notation than ordinary English for expressing relationships among various assertions or hypothetical states of affairs. A solid grounding in

More information

The Foundations of Causal Inference

The Foundations of Causal Inference The Foundations of Causal Inference Judea Pearl University of California, Los Angeles Computer Science Department Los Angeles, CA, 90095-1596, USA judea@cs.ucla.edu May 12, 2010 Abstract This paper reviews

More information

Causal analysis after Haavelmo. 8th/last Lecture - Hedibert Lopes

Causal analysis after Haavelmo. 8th/last Lecture - Hedibert Lopes Causal analysis after Haavelmo 8th/last Lecture - Hedibert Lopes Insper - Institute of Education and Research December 1st, 2015 Hedibert Lopes (Insper) Heckman & Pinto (2014) December 1st, 2015 1 / 39

More information

Structure learning in human causal induction

Structure learning in human causal induction Structure learning in human causal induction Joshua B. Tenenbaum & Thomas L. Griffiths Department of Psychology Stanford University, Stanford, CA 94305 jbt,gruffydd @psych.stanford.edu Abstract We use

More information

Statistical Models for Causal Analysis

Statistical Models for Causal Analysis Statistical Models for Causal Analysis Teppei Yamamoto Keio University Introduction to Causal Inference Spring 2016 Three Modes of Statistical Inference 1. Descriptive Inference: summarizing and exploring

More information

CAUSAL INFERENCE AS COMPUTATIONAL LEARNING. Judea Pearl University of California Los Angeles (

CAUSAL INFERENCE AS COMPUTATIONAL LEARNING. Judea Pearl University of California Los Angeles ( CAUSAL INFERENCE AS COMUTATIONAL LEARNING Judea earl University of California Los Angeles www.cs.ucla.edu/~judea OUTLINE Inference: Statistical vs. Causal distinctions and mental barriers Formal semantics

More information

Coins and Counterfactuals

Coins and Counterfactuals Chapter 19 Coins and Counterfactuals 19.1 Quantum Paradoxes The next few chapters are devoted to resolving a number of quantum paradoxes in the sense of giving a reasonable explanation of a seemingly paradoxical

More information

Causal inference in statistics: An overview

Causal inference in statistics: An overview Statistics Surveys Vol. 3 (2009) 96 146 ISSN: 1935-7516 DOI: 10.1214/09-SS057 Causal inference in statistics: An overview Judea Pearl Computer Science Department University of California, Los Angeles,

More information

1 Measurement Uncertainties

1 Measurement Uncertainties 1 Measurement Uncertainties (Adapted stolen, really from work by Amin Jaziri) 1.1 Introduction No measurement can be perfectly certain. No measuring device is infinitely sensitive or infinitely precise.

More information

7.1 Significance of question: are there laws in S.S.? (Why care?) Possible answers:

7.1 Significance of question: are there laws in S.S.? (Why care?) Possible answers: I. Roberts: There are no laws of the social sciences Social sciences = sciences involving human behaviour (Economics, Psychology, Sociology, Political Science) 7.1 Significance of question: are there laws

More information

EMERGING MARKETS - Lecture 2: Methodology refresher

EMERGING MARKETS - Lecture 2: Methodology refresher EMERGING MARKETS - Lecture 2: Methodology refresher Maria Perrotta April 4, 2013 SITE http://www.hhs.se/site/pages/default.aspx My contact: maria.perrotta@hhs.se Aim of this class There are many different

More information

A Summary of Economic Methodology

A Summary of Economic Methodology A Summary of Economic Methodology I. The Methodology of Theoretical Economics All economic analysis begins with theory, based in part on intuitive insights that naturally spring from certain stylized facts,

More information

CHAPTER 1: Functions

CHAPTER 1: Functions CHAPTER 1: Functions 1.1: Functions 1.2: Graphs of Functions 1.3: Basic Graphs and Symmetry 1.4: Transformations 1.5: Piecewise-Defined Functions; Limits and Continuity in Calculus 1.6: Combining Functions

More information

Greene, Econometric Analysis (7th ed, 2012)

Greene, Econometric Analysis (7th ed, 2012) EC771: Econometrics, Spring 2012 Greene, Econometric Analysis (7th ed, 2012) Chapters 2 3: Classical Linear Regression The classical linear regression model is the single most useful tool in econometrics.

More information

Technical Track Session I: Causal Inference

Technical Track Session I: Causal Inference Impact Evaluation Technical Track Session I: Causal Inference Human Development Human Network Development Network Middle East and North Africa Region World Bank Institute Spanish Impact Evaluation Fund

More information

Wooldridge, Introductory Econometrics, 3d ed. Chapter 9: More on specification and data problems

Wooldridge, Introductory Econometrics, 3d ed. Chapter 9: More on specification and data problems Wooldridge, Introductory Econometrics, 3d ed. Chapter 9: More on specification and data problems Functional form misspecification We may have a model that is correctly specified, in terms of including

More information

Path Diagrams. James H. Steiger. Department of Psychology and Human Development Vanderbilt University

Path Diagrams. James H. Steiger. Department of Psychology and Human Development Vanderbilt University Path Diagrams James H. Steiger Department of Psychology and Human Development Vanderbilt University James H. Steiger (Vanderbilt University) Path Diagrams 1 / 24 Path Diagrams 1 Introduction 2 Path Diagram

More information

DR.RUPNATHJI( DR.RUPAK NATH )

DR.RUPNATHJI( DR.RUPAK NATH ) Contents 1 Sets 1 2 The Real Numbers 9 3 Sequences 29 4 Series 59 5 Functions 81 6 Power Series 105 7 The elementary functions 111 Chapter 1 Sets It is very convenient to introduce some notation and terminology

More information

Math 2 Unit 3 Quadratic Functions: Working with Equations

Math 2 Unit 3 Quadratic Functions: Working with Equations Approximate Time Frame: 4 5 Weeks Connections to Previous Learning: In the previous unit, students learned to represent quadratic functions algebraically, in tables, and graphically and to analyze these

More information

CAUSAL INFERENCE IN STATISTICS. A Gentle Introduction. Judea Pearl University of California Los Angeles (www.cs.ucla.

CAUSAL INFERENCE IN STATISTICS. A Gentle Introduction. Judea Pearl University of California Los Angeles (www.cs.ucla. CAUSAL INFERENCE IN STATISTICS A Gentle Introduction Judea Pearl University of California Los Angeles (www.cs.ucla.edu/~judea/jsm12) OUTLINE Inference: Statistical vs. Causal, distinctions, and mental

More information

Lecture Notes on Inductive Definitions

Lecture Notes on Inductive Definitions Lecture Notes on Inductive Definitions 15-312: Foundations of Programming Languages Frank Pfenning Lecture 2 September 2, 2004 These supplementary notes review the notion of an inductive definition and

More information

The Doubly-Modifiable Structural Model (DMSM) The Modifiable Structural Model (MSM)

The Doubly-Modifiable Structural Model (DMSM) The Modifiable Structural Model (MSM) unified theory of counterfactual reasoning hristopher G. Lucas cglucas@cmu.edu Department of Psychology arnegie Mellon University harles Kemp ckemp@cmu.edu Department of Psychology arnegie Mellon University

More information

(January 6, 2006) Paul Garrett garrett/

(January 6, 2006) Paul Garrett  garrett/ (January 6, 2006)! "$# % & '!)( *+,.-0/%&1,3234)5 * (6# Paul Garrett garrett@math.umn.edu http://www.math.umn.edu/ garrett/ To communicate clearly in mathematical writing, it is helpful to clearly express

More information

Econometrics Summary Algebraic and Statistical Preliminaries

Econometrics Summary Algebraic and Statistical Preliminaries Econometrics Summary Algebraic and Statistical Preliminaries Elasticity: The point elasticity of Y with respect to L is given by α = ( Y/ L)/(Y/L). The arc elasticity is given by ( Y/ L)/(Y/L), when L

More information

Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16)

Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16) Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16) 1 2 Model Consider a system of two regressions y 1 = β 1 y 2 + u 1 (1) y 2 = β 2 y 1 + u 2 (2) This is a simultaneous equation model

More information

35 Chapter CHAPTER 4: Mathematical Proof

35 Chapter CHAPTER 4: Mathematical Proof 35 Chapter 4 35 CHAPTER 4: Mathematical Proof Faith is different from proof; the one is human, the other is a gift of God. Justus ex fide vivit. It is this faith that God Himself puts into the heart. 21

More information

LECSS Physics 11 Introduction to Physics and Math Methods 1 Revised 8 September 2013 Don Bloomfield

LECSS Physics 11 Introduction to Physics and Math Methods 1 Revised 8 September 2013 Don Bloomfield LECSS Physics 11 Introduction to Physics and Math Methods 1 Physics 11 Introduction to Physics and Math Methods In this introduction, you will get a more in-depth overview of what Physics is, as well as

More information

Transparent Structural Estimation. Matthew Gentzkow Fisher-Schultz Lecture (from work w/ Isaiah Andrews & Jesse M. Shapiro)

Transparent Structural Estimation. Matthew Gentzkow Fisher-Schultz Lecture (from work w/ Isaiah Andrews & Jesse M. Shapiro) Transparent Structural Estimation Matthew Gentzkow Fisher-Schultz Lecture (from work w/ Isaiah Andrews & Jesse M. Shapiro) 1 A hallmark of contemporary applied microeconomics is a conceptual framework

More information

Conditional probabilities and graphical models

Conditional probabilities and graphical models Conditional probabilities and graphical models Thomas Mailund Bioinformatics Research Centre (BiRC), Aarhus University Probability theory allows us to describe uncertainty in the processes we model within

More information

THE FOUNDATIONS OF CAUSAL INFERENCE

THE FOUNDATIONS OF CAUSAL INFERENCE TECHNICAL REPORT R-355 August 2010 3 THE FOUNDATIONS OF CAUSAL INFERENCE Judea Pearl* This paper reviews recent advances in the foundations of causal inference and introduces a systematic methodology for

More information

Gov 2002: 4. Observational Studies and Confounding

Gov 2002: 4. Observational Studies and Confounding Gov 2002: 4. Observational Studies and Confounding Matthew Blackwell September 10, 2015 Where are we? Where are we going? Last two weeks: randomized experiments. From here on: observational studies. What

More information

On the learning of algebra

On the learning of algebra On the learning of algebra H. Wu Department of Mathematics #3840 University of California, Berkeley Berkeley, CA 94720-3840 USA http://www.math.berkeley.edu/ wu/ wu@math.berkeley.edu One of the main goals

More information

Checking Consistency. Chapter Introduction Support of a Consistent Family

Checking Consistency. Chapter Introduction Support of a Consistent Family Chapter 11 Checking Consistency 11.1 Introduction The conditions which define a consistent family of histories were stated in Ch. 10. The sample space must consist of a collection of mutually orthogonal

More information

Confounding Equivalence in Causal Inference

Confounding Equivalence in Causal Inference Revised and submitted, October 2013. TECHNICAL REPORT R-343w October 2013 Confounding Equivalence in Causal Inference Judea Pearl Cognitive Systems Laboratory Computer Science Department University of

More information

Lecture Notes 1: Decisions and Data. In these notes, I describe some basic ideas in decision theory. theory is constructed from

Lecture Notes 1: Decisions and Data. In these notes, I describe some basic ideas in decision theory. theory is constructed from Topics in Data Analysis Steven N. Durlauf University of Wisconsin Lecture Notes : Decisions and Data In these notes, I describe some basic ideas in decision theory. theory is constructed from The Data:

More information

Evidence and Theory in Physics. Tim Maudlin, NYU Evidence in the Natural Sciences, May 30, 2014

Evidence and Theory in Physics. Tim Maudlin, NYU Evidence in the Natural Sciences, May 30, 2014 Evidence and Theory in Physics Tim Maudlin, NYU Evidence in the Natural Sciences, May 30, 2014 Two Features of Physics Physics displays two interesting features: 1) Programmatically, it aspires to be completely

More information

DEALING WITH MULTIVARIATE OUTCOMES IN STUDIES FOR CAUSAL EFFECTS

DEALING WITH MULTIVARIATE OUTCOMES IN STUDIES FOR CAUSAL EFFECTS DEALING WITH MULTIVARIATE OUTCOMES IN STUDIES FOR CAUSAL EFFECTS Donald B. Rubin Harvard University 1 Oxford Street, 7th Floor Cambridge, MA 02138 USA Tel: 617-495-5496; Fax: 617-496-8057 email: rubin@stat.harvard.edu

More information

Slope Fields: Graphing Solutions Without the Solutions

Slope Fields: Graphing Solutions Without the Solutions 8 Slope Fields: Graphing Solutions Without the Solutions Up to now, our efforts have been directed mainly towards finding formulas or equations describing solutions to given differential equations. Then,

More information

1. Characterization and uniqueness of products

1. Characterization and uniqueness of products (March 16, 2014) Example of characterization by mapping properties: the product topology Paul Garrett garrett@math.umn.edu http://www.math.umn.edu/ garrett/ [This document is http://www.math.umn.edu/ garrett/m/mfms/notes

More information

Econ 2148, fall 2017 Instrumental variables I, origins and binary treatment case

Econ 2148, fall 2017 Instrumental variables I, origins and binary treatment case Econ 2148, fall 2017 Instrumental variables I, origins and binary treatment case Maximilian Kasy Department of Economics, Harvard University 1 / 40 Agenda instrumental variables part I Origins of instrumental

More information

Lecture Notes on Inductive Definitions

Lecture Notes on Inductive Definitions Lecture Notes on Inductive Definitions 15-312: Foundations of Programming Languages Frank Pfenning Lecture 2 August 28, 2003 These supplementary notes review the notion of an inductive definition and give

More information

Use and Abuse of Regression

Use and Abuse of Regression This article was downloaded by: [130.132.123.28] On: 16 May 2015, At: 01:35 Publisher: Taylor & Francis Informa Ltd Registered in England and Wales Registered Number: 1072954 Registered office: Mortimer

More information

CHAPTER 2 INTRODUCTION TO CLASSICAL PROPOSITIONAL LOGIC

CHAPTER 2 INTRODUCTION TO CLASSICAL PROPOSITIONAL LOGIC CHAPTER 2 INTRODUCTION TO CLASSICAL PROPOSITIONAL LOGIC 1 Motivation and History The origins of the classical propositional logic, classical propositional calculus, as it was, and still often is called,

More information

Manual of Logical Style

Manual of Logical Style Manual of Logical Style Dr. Holmes January 9, 2015 Contents 1 Introduction 2 2 Conjunction 3 2.1 Proving a conjunction...................... 3 2.2 Using a conjunction........................ 3 3 Implication

More information

Manual of Logical Style (fresh version 2018)

Manual of Logical Style (fresh version 2018) Manual of Logical Style (fresh version 2018) Randall Holmes 9/5/2018 1 Introduction This is a fresh version of a document I have been working on with my classes at various levels for years. The idea that

More information

CHEMISTRY. Overview. Choice of topic

CHEMISTRY. Overview. Choice of topic CHEMISTRY Overview An extended essay in chemistry provides students with an opportunity to investigate a particular aspect of the materials of our environment. Such extended essays must be characterized

More information

Research Note: A more powerful test statistic for reasoning about interference between units

Research Note: A more powerful test statistic for reasoning about interference between units Research Note: A more powerful test statistic for reasoning about interference between units Jake Bowers Mark Fredrickson Peter M. Aronow August 26, 2015 Abstract Bowers, Fredrickson and Panagopoulos (2012)

More information

Chapter One. The Real Number System

Chapter One. The Real Number System Chapter One. The Real Number System We shall give a quick introduction to the real number system. It is imperative that we know how the set of real numbers behaves in the way that its completeness and

More information

Logicality of Operators

Logicality of Operators Logicality of Operators Tomoya Sato Abstract Characterizing logical operators has been crucially important in the philosophy of logic. One reason for this importance is that the boundary between logically

More information

An Introduction to Causal Inference

An Introduction to Causal Inference To be published in The International Journal of Biostatistics, 2009. TECHNICAL REPORT R-354 February 2010 An Introduction to Causal Inference Judea Pearl University of California, Los Angeles Computer

More information

Stochastic Histories. Chapter Introduction

Stochastic Histories. Chapter Introduction Chapter 8 Stochastic Histories 8.1 Introduction Despite the fact that classical mechanics employs deterministic dynamical laws, random dynamical processes often arise in classical physics, as well as in

More information

Potential Outcomes Model (POM)

Potential Outcomes Model (POM) Potential Outcomes Model (POM) Relationship Between Counterfactual States Causality Empirical Strategies in Labor Economics, Angrist Krueger (1999): The most challenging empirical questions in economics

More information

On the Arbitrary Choice Regarding Which Inertial Reference Frame is "Stationary" and Which is "Moving" in the Special Theory of Relativity

On the Arbitrary Choice Regarding Which Inertial Reference Frame is Stationary and Which is Moving in the Special Theory of Relativity Regarding Which Inertial Reference Frame is "Stationary" and Which is "Moving" in the Special Theory of Relativity Douglas M. Snyder Los Angeles, CA The relativity of simultaneity is central to the special

More information

1 Teaching notes on structural VARs.

1 Teaching notes on structural VARs. Bent E. Sørensen February 22, 2007 1 Teaching notes on structural VARs. 1.1 Vector MA models: 1.1.1 Probability theory The simplest (to analyze, estimation is a different matter) time series models are

More information

Causal Analysis After Haavelmo

Causal Analysis After Haavelmo After Haavelmo University College London UCL Department Seminar September 3, 2013 Economics has forgotten its own past. (Judea Pearl, 2012) Haavemo s Contributions to Causality: Two seminal papers (1943,

More information

Pasinetti s notion of a unit of vertically integrated productive capacity : a comment

Pasinetti s notion of a unit of vertically integrated productive capacity : a comment Pasinetti s notion of a unit of vertically integrated productive capacity : a comment Jaime L. del Valle Caballero Ensayos y Monografías Numero 87 marzo 1998 Pasinetti s notion of a unit of vertically

More information

Bounding the Probability of Causation in Mediation Analysis

Bounding the Probability of Causation in Mediation Analysis arxiv:1411.2636v1 [math.st] 10 Nov 2014 Bounding the Probability of Causation in Mediation Analysis A. P. Dawid R. Murtas M. Musio February 16, 2018 Abstract Given empirical evidence for the dependence

More information

The Beach Party Problem: An informal introduction to continuity problems in group choice *

The Beach Party Problem: An informal introduction to continuity problems in group choice * The Beach Party Problem: An informal introduction to continuity problems in group choice * by Nick Baigent Institute of Public Economics, Graz University nicholas.baigent@kfunigraz.ac.at Fax: +44 316 3809530

More information

Treatment Effects with Normal Disturbances in sampleselection Package

Treatment Effects with Normal Disturbances in sampleselection Package Treatment Effects with Normal Disturbances in sampleselection Package Ott Toomet University of Washington December 7, 017 1 The Problem Recent decades have seen a surge in interest for evidence-based policy-making.

More information