Statistical Tests. Matthieu de Lapparent

Size: px
Start display at page:

Download "Statistical Tests. Matthieu de Lapparent"

Transcription

1 Statistical Tests Matthieu de Lapparent Transport and Mobility Laboratory, School of Architecture, Civil and Environmental Engineering, Ecole Polytechnique Fédérale de Lausanne Transport and Mobility Laboratory Decision-Aid Methodologies 1 / 65

2 Introduction Introduction Modeling Difficult to determine the most appropriate model specification A good fit does not imply a good model Formal testing is necessary, but not sufficient No clear-cut rules can be given Good modeling = good (subjective) judgment + good analysis Wilkinson (1999) The grammar of graphics. Springer... some researchers who use statistical methods pay more attention to goodness of fit than to the meaning of the model... Statisticians must think about what the models mean, regardless of fit, or they will promulgate nonsense. Transport and Mobility Laboratory Decision-Aid Methodologies 2 / 65

3 Introduction Introduction Hypothesis testing Four steps step 1: State the hypotheses H 0 null hypothesis H 1 alternative hypothesis step 2: Set the criteria for a decision step 3: Compute a test statistic step 4: Make a decision Step 1: Analogy with a court trial H 0 : defendant is presumed innocent until proved guilty H 0 is accepted, unless the data argue strongly to the contrary Transport and Mobility Laboratory Decision-Aid Methodologies 3 / 65

4 Introduction Introduction Step 2: Criterion for a decision Court-room: criterion is to show guilt beyond reasonable doubt Implies defining the level of significance α Step 3: Test statistic Determine the likelihood of obtaining a sample outcome if the H 0 hypothesis were true How far we accept to be from the H 0 Step 4: Decide Decide if null is retained or rejected Gives the probability value (p-value of obtaining an outcome, given that the H 0 is true) Transport and Mobility Laboratory Decision-Aid Methodologies 4 / 65

5 Introduction Introduction Possible decision outcomes Accept H 0 Reject H 0 H 0 is true Correct (1-α) Type I error (prob. α) H 0 is false Type II error (prob. β) Correct (1-β) Relations For a given sample size N, there is a trade-off between α and β. Only way to reduce both types of error probabilities is to increase N. π = 1 β is the power of the test, that is, the probability of correctly rejecting H 0. Researcher directly controls Type I errors by fixing α Transport and Mobility Laboratory Decision-Aid Methodologies 5 / 65

6 Case study Summary of case-studies Netherlands mode choice Intercity travelers Choice between train & car 228 respondents Revealed preference data with self-reported trip characteristics Swissmetro Travelers St. Gallen - Geneva Choice between train, car & swissmetro 441 respondents Stated preference (swissmetro is a non-existing mag-lev train) Transport and Mobility Laboratory Decision-Aid Methodologies 6 / 65

7 Case study Informal tests Informal tests Sign of the coefficient Do the estimated parameters have the right sign? Example: Netherlands Mode Choice Case Robust Coeff. Asympt. Parameter estimate std. error t-stat p-value ASC car β cost β time Transport and Mobility Laboratory Decision-Aid Methodologies 7 / 65

8 Case study Informal tests Informal tests Value of trade-offs Are the trade-offs reasonable? How much are we ready to pay for a marginal improvement of the level-of-service? Example: reduction of travel time The increase in cost must be exactly compensated by the reduction of travel time β cost (C + C) + β time (T T ) +... = β cost C + β time T +... Therefore, C T = β time β cost Transport and Mobility Laboratory Decision-Aid Methodologies 8 / 65

9 Case study Informal tests Informal tests Value of trade-offs: example with Netherlands data In general: Trade-off: Units: V / x V / x C 1/Hour 1/Guilder = Guilder Hour Parameter Coeff. Guilders Euros CHF ASC car β cost β time (/Hour) Transport and Mobility Laboratory Decision-Aid Methodologies 9 / 65

10 Case study t-tests t-test Question Is the parameter θ significantly different from a given value θ? Statistic H 0 : θ = θ H 1 : θ θ Under H 0, if ˆθ is normally distributed with known variance σ 2 Therefore ˆθ θ σ N(0, 1). P( 1.96 ˆθ θ σ 1.96) = 0.95 = Transport and Mobility Laboratory Decision-Aid Methodologies 10 / 65

11 Case study t-tests t-test H 0 can be rejected at the 5% level (α = 0.05) if ˆθ θ σ Comments If ˆθ asymptotically normal If variance unknown A t test should be used with n degrees of freedom. When n 30, the Student t distribution is well approximated by a N(0, 1) Transport and Mobility Laboratory Decision-Aid Methodologies 11 / 65

12 Case study t-tests t-test Swissmetro: model specification Car Train Swissmetro ASC car ASC train β cost cost cost cost β time time time time β headway 0 headway headway Transport and Mobility Laboratory Decision-Aid Methodologies 12 / 65

13 Case study t-tests t-test Swissmetro: coefficient estimates Robust Parameter Coeff. Asympt. number Description estimate std. error t-stat p-value 1 ASC car ASC train β cost β headway β time H 0 : β cost = 0: rejected at the 5% level H 0 : β headway = 0: rejected at the 5% level H 0 : β time = 0: rejected at the 5% level Transport and Mobility Laboratory Decision-Aid Methodologies 13 / 65

14 Case study t-tests t-test Comparing two coefficients H 0 : β 1 = β 2. The t statistic is given by β 1 β 2 var( β 1 β 2 ) var( β 1 β 2 ) = var( β 1 ) + var( β 2 ) 2 cov( β 1, β 2 ) Transport and Mobility Laboratory Decision-Aid Methodologies 14 / 65

15 Case study t-tests t-test Comparing two coefficients Example: alternative specific or generic coefficients? Below alternative specific time Car Train Swissmetro ASC car ASC train β cost cost cost cost β time car time 0 0 β time train 0 time 0 β time Swissmetro 0 0 time β headway 0 headway headway Transport and Mobility Laboratory Decision-Aid Methodologies 15 / 65

16 Case study t-tests t-test Swissmetro: coefficient estimates (alternative specific time) Robust Parameter Coeff. Asympt. number Description estimate std. error t-stat p-value 1 ASC car ASC train β cost β headway β time car β time Swissmetro β time train Transport and Mobility Laboratory Decision-Aid Methodologies 16 / 65

17 Case study t-tests t-test Variance-covariance matrix Parameter 1 Parameter 2 Covariance Correlation t-stat β time car β time train 7.57e β time car β time Swissmetro 1.38e β time Swissmetro β time train 1.47e H 0 : β time car = β time train var( β t.car β t.train ) = var( β t.car ) + var( β t.train ) 2 cov( β t.car, β t.train ) = = Transport and Mobility Laboratory Decision-Aid Methodologies 17 / 65

18 Case study t-tests t-test H 0 : β time car = β time train β t.car β t.train var( β t.car β = t.train ) We can reject the H 0 of parameter equality ( ) = What about β time car = β time metro and β time metro = β time train? Homework to calculate the t-ratios for these parameter differences! Transport and Mobility Laboratory Decision-Aid Methodologies 18 / 65

19 Likelihood ratio test Likelihood ratio test Comparing two models Used for nested hypotheses One model is a special case of another obtained from a set of linear restrictions on the parameters H 0 : the restricted model is the true model Statistic under H 0 2(L( ˆβR) L( ˆβU)) χ 2 (K U K R ) L( ˆβ R ) is the log likelihood of the restricted model L( ˆβ U ) is the log likelihood of the unrestricted model K R is the number of parameters in the restricted model K U is the number of parameters in the unrestricted model Transport and Mobility Laboratory Decision-Aid Methodologies 19 / 65

20 Likelihood ratio test Likelihood ratio test Test of parameters being equal to zero: Netherlands Statistic Unrestricted model: 3 parameters: β time, β cost, ASC car. Final log likelihood: Restricted model Restrictions: β time = β cost = 0 1 parameter: ASC car. Final log likelihood: Test: 2( ) = χ 2, 2 degrees of freedom, 95% quantile: 5.99 H 0 is rejected The unrestricted model is preferred. Transport and Mobility Laboratory Decision-Aid Methodologies 20 / 65

21 Likelihood ratio test Test of generic attributes Likelihood ratio test Test of generic attributes: Swissmetro Restricted model: Car Train Swissmetro ASC car ASC train β cost cost cost cost β time time time time β headway 0 headway headway Restrictions: β time car = β time train = β time Swissmetro Unrestricted model: Car Train Swissmetro ASC car ASC train β cost cost cost cost β time car time 0 0 β time train 0 time 0 β time Swissmetro 0 0 time β headway 0 headway headway Transport and Mobility Laboratory Decision-Aid Methodologies 21 / 65

22 Likelihood ratio test Test of generic attributes Likelihood ratio test Test of generic attributes: Swissmetro Statistic Restricted model: Final log likelihood: parameters Unrestricted model: Final log likelihood: parameters -2( ) = χ 2, 2 degrees of freedom, 95% quantile: 5.99 Reject the restrictions (H 0 ) The alternative specific specification is preferred Transport and Mobility Laboratory Decision-Aid Methodologies 22 / 65

23 Likelihood ratio test Test of taste variation Test of taste variations Segmentation Classify the data into G groups. Size of group g: N g. The same specification is considered for each group. A different set of parameters is estimated for each group. Restrictions: β 1 = β 2 =... = β G where β g is the vector of coefficients of market segment g. Statistic: 2 L N ( β) G L Ng ( β g ) g=1 χ 2 with G g=1 K g K degrees of freedom. In general, G g=1 K g K = (G 1)K. Transport and Mobility Laboratory Decision-Aid Methodologies 23 / 65

24 Likelihood ratio test Test of taste variation Test of taste variations Segmentation according to income: Swissmetro Unrestricted model: a different set of parameters for each income group 1: [0 50], 2: [50 100], 3:[100 ], 4: unknown (KCHF) Restricted model: same parameters across income groups Hypothesis H 0 the true parameters are the same across income classes Transport and Mobility Laboratory Decision-Aid Methodologies 24 / 65

25 Likelihood ratio test Test of taste variation Estimation results by income groups Estimation procedure Divide the sample into 4 subsets, corresponding to the income groups Estimate the restricted model on each of the samples separately Add up the log likelihoods Group Log likelihood Sample size Total Transport and Mobility Laboratory Decision-Aid Methodologies 25 / 65

26 Likelihood ratio test Test of taste variation Different taste across income groups? Test of taste variations Restricted model: 7 parameters Final log likelihood: Unrestricted model: 7 4 = 28 parameters Final log likelihood: Statistic Likelihood ratio test gives: χ 2, 21 degrees of freedom, 95% quantile: > hence H 0 is rejected There is evidence of taste variation per income group Transport and Mobility Laboratory Decision-Aid Methodologies 26 / 65

27 Test of nonlinear specifications Piecewise linear specification Nonlinear specifications Consider a variable x of the model (travel time, say) Unrestricted model: V is a nonlinear function of x Restricted model: V is a linear function of x We consider the following nonlinear specifications: Piecewise linear Power series Box-Cox transforms For each case, the linear specification is obtained using simple restrictions on the nonlinear specification Transport and Mobility Laboratory Decision-Aid Methodologies 27 / 65

28 Test of nonlinear specifications Piecewise linear specification Piecewise linear specification Model procedure Partition the range of values of x into M intervals [a m, a m+1 ], m = 1,..., M For example, the partition [0 500], [ ], [1000 ] corresponds to M = 3, a 1 = 0, a 2 = 500, a 3 = 1000, a 4 = + The slope of the utility function may vary across intervals Therefore, there will be M parameters instead of 1 The function must be continuous Transport and Mobility Laboratory Decision-Aid Methodologies 28 / 65

29 Test of nonlinear specifications Piecewise linear specification Piecewise linear specification Linear specification: V i = βx i + Piecewise linear specification M V i = β m x im + m=1 where that is x im = max(0, min(x a m, a m+1 a m )) 0 if x < a m x im = x a m if a m x < a m+1 a m+1 a m if a m+1 x Transport and Mobility Laboratory Decision-Aid Methodologies 29 / 65

30 Test of nonlinear specifications Piecewise linear specification Piecewise linear specification Example: M = 3, a 1 = 0, a 2 = 500, a 3 = 1000, a 4 = + x x 1 x 2 x Transport and Mobility Laboratory Decision-Aid Methodologies 30 / 65

31 Test of nonlinear specifications Piecewise linear specification Piecewise linear specification: restricted model Test of piecewise specification Restricted model Unrestricted model Car Train Swissmetro ASC car ASC train β cost cost cost cost β time time time time β headway 0 headway headway Car Train Swissmetro ASC car ASC train β cost cost cost cost β time,1 time 1 time 1 time 1 β time,2 time 2 time 2 time 2 β time,3 time 3 time 3 time 3 β headway 0 headway headway Transport and Mobility Laboratory Decision-Aid Methodologies 31 / 65

32 Test of nonlinear specifications Piecewise linear specification Piecewise linear specification Robust Parameter Coeff. Asympt. number Description estimate std. error t-stat p-value 1 ASC car ASC train β cost β headway β time, β time, β time, Transport and Mobility Laboratory Decision-Aid Methodologies 32 / 65

33 Test of nonlinear specifications Piecewise linear specification Piecewise linear specification 0 Piecewise linear Utility Time Transport and Mobility Laboratory Decision-Aid Methodologies 33 / 65

34 Test of nonlinear specifications Piecewise linear specification Likelihood ratio test Test of piecewise linear specification for time Restricted model: 5 parameters Final log likelihood: Unrestricted model: 7 parameters Final log likelihood: Statistic LR Test: χ 2, 2 degrees of freedom, 95% quantile: 5.99 H 0 is rejected The linear specification is rejected Transport and Mobility Laboratory Decision-Aid Methodologies 34 / 65

35 Test of nonlinear specifications Power series Power series Idea If the utility function is nonlinear in x, it can be approximated by a polynomial of degree M Linear specification: V i = βx i + Power series M V i = β m xi m + m=1 Transport and Mobility Laboratory Decision-Aid Methodologies 35 / 65

36 Test of nonlinear specifications Power series Power series: restricted model Test of power series specification for time Restricted model Unrestricted model Car Train Swissmetro ASC car ASC train β cost cost cost cost β time time time time β headway 0 headway headway Car Train Swissmetro ASC car ASC train β cost cost cost cost β time,1 time time time β time,2 time 2 /10 5 time 2 /10 5 time 2 /10 5 β time,3 time 3 /10 5 time 3 /10 5 time 3 /10 5 β headway 0 headway headway Transport and Mobility Laboratory Decision-Aid Methodologies 36 / 65

37 Test of nonlinear specifications Power series Power series: unrestricted model Robust Parameter Coeff. Asympt. number Description estimate std. error t-stat p-value 1 ASC car ASC train β cost β headway β time, β time, β time, Transport and Mobility Laboratory Decision-Aid Methodologies 37 / 65

38 Test of nonlinear specifications Power series Power series: M=3 0 Power series -1-2 Utility Time Transport and Mobility Laboratory Decision-Aid Methodologies 38 / 65

39 Test of nonlinear specifications Power series Likelihood ratio test Test of power series specification for time Restricted model: 5 parameters Final log likelihood: Unrestricted model: 7 parameters Final log likelihood: Statistic LR Test: χ 2, 2 degrees of freedom, 95% quantile: 5.99 H 0 is rejected The linear specification is rejected Transport and Mobility Laboratory Decision-Aid Methodologies 39 / 65

40 Test of nonlinear specifications Box-Cox Box-Cox transform Definition Let x > 0 be a positive variable Its Box-Cox transform is defined as x λ 1 if λ 0 B(x, λ) = λ ln x if λ = 0. where λ R is a parameter. Continuity x λ 1 lim = ln x. λ 0 λ Transport and Mobility Laboratory Decision-Aid Methodologies 40 / 65

41 Test of nonlinear specifications Box-Cox Box-Cox transform Linear specification V i = βx i + Box-Cox specification V i = βb(x, λ) + Properties Convex if λ > 1 Linear if λ = 1 Concave if λ < 1 Transport and Mobility Laboratory Decision-Aid Methodologies 41 / 65

42 Test of nonlinear specifications Box-Cox Box-Cox specification Test of Box-Cox transformation on time Restricted model Unrestricted model Car Train Swissmetro ASC car ASC train β cost cost cost cost β time time time time β headway 0 headway headway Car Train Swissmetro ASC car ASC train β cost cost cost cost β time B(time,λ) B(time,λ) B(time,λ) β headway 0 headway headway λ Note: specification tables are not designed for nonlinear specifications. Transport and Mobility Laboratory Decision-Aid Methodologies 42 / 65

43 Test of nonlinear specifications Box-Cox Box-Cox: unrestricted model Robust Parameter Coeff. Asympt. number Description estimate std. error t-stat p-value 1 ASC car ASC train β cost β headway β time λ Transport and Mobility Laboratory Decision-Aid Methodologies 43 / 65

44 Test of nonlinear specifications Box-Cox Box-Cox transform 2 Box-Cox Utility Time Transport and Mobility Laboratory Decision-Aid Methodologies 44 / 65

45 Test of nonlinear specifications Box-Cox Likelihood ratio test Test of Box-Cox specification for time Restricted model: 5 parameters Final log likelihood: Unrestricted model: 6 parameters Final log likelihood: Statistic LR Test: χ 2, 1 degree of freedom, 95% quantile: 3.84 H 0 is rejected The linear specification is rejected Also possible to employ t-test to compare Box-Cox to linear Transport and Mobility Laboratory Decision-Aid Methodologies 45 / 65

46 Test of nonlinear specifications Box-Cox Comparison of nonlinear time specifications Linear Piecewise linear Power series Box-Cox -4-6 Utility Time Transport and Mobility Laboratory Decision-Aid Methodologies 46 / 65

47 Non nested hypotheses Non nested hypotheses Nested hypotheses Restricted and unrestricted models Linear restrictions H 0 : restricted model is correct Test: likelihood ratio test Non nested hypotheses Need to compare two models None of them is a restriction of the other Likelihood ratio test cannot be used Two possible tests: Cox composite model Horowitz test ρ 2 Transport and Mobility Laboratory Decision-Aid Methodologies 47 / 65

48 Non nested hypotheses Cox test Cox test We want to test model 1 against model 2 We generate a composite model C such that both models 1 and 2 are restricted cases of model C. We test model 1 against C using the likelihood ratio test We test model 2 against C using the likelihood ratio test Possible outcomes: Only one of the two models is rejected. Keep the other. Both models are rejected. Better models should be developed. Both models are accepted. Use another test. Model C Model 1 Model 2 Transport and Mobility Laboratory Decision-Aid Methodologies 48 / 65

49 Non nested hypotheses Cox test Cox test Models M 1 : U in = + βx in + + ε (1) in M 2 : U in = + θlog(x) in + + ε (2) in M C : U in = + βx in + θlog(x) in + + ε in. Testing M 1 against M C Restrictions: θ = 0 Testing M 2 against M C Restrictions: β = 0 Transport and Mobility Laboratory Decision-Aid Methodologies 49 / 65

50 Non nested hypotheses Cox test Non nested models: estimates for model 1 Robust Parameter Coeff. Asympt. number Description estimate std. error t-stat p-value 1 ASC car ASC train β cost car β cost Swissmetro β cost train β gen. abo β headway β time car β time Swissmetro β time train Transport and Mobility Laboratory Decision-Aid Methodologies 50 / 65

51 Non nested hypotheses Cox test Non nested models: estimates for model 2 Robust Parameter Coeff. Asympt. number Description estimate std. error t-stat p-value 1 ASC car ASC train β log cost car β cost Swissmetro β cost train β gen. abo β headway β time car β time Swissmetro β time train Transport and Mobility Laboratory Decision-Aid Methodologies 51 / 65

52 Non nested hypotheses Cox test Non nested models Log likelihood # parameters Model 1 (linear car cost) Model 2 (log car cost) The fit of model 1 is better But we cannot apply a likelihood ratio test We estimate a composite model Transport and Mobility Laboratory Decision-Aid Methodologies 52 / 65

53 Non nested hypotheses Cox test Non nested models: estimates of the composite model Robust Parameter Coeff. Asympt. number Description estimate std. error t-stat p-value 1 ASC car ASC train β cost car β log cost car β cost Swissmetro β cost train β gen. abo β headway β time car β time Swissmetro β time train Transport and Mobility Laboratory Decision-Aid Methodologies 53 / 65

54 Non nested hypotheses Cox test Non nested models Test 1: model 1 vs. composite Restricted model (linear cost): 10 parameters Final log likelihood: Unrestricted model (Composite): 11 parameters Final log likelihood: Test: 1.58 χ 2, 1 degree of freedom, 95% quantile: 3.84 H 0 cannot be rejected Model 1 cannot be rejected Transport and Mobility Laboratory Decision-Aid Methodologies 54 / 65

55 Non nested hypotheses Cox test Non nested models Test 2: model 2 vs. composite Restricted model (log cost): 10 parameters Final log likelihood: Unrestricted model (Composite): 11 parameters Final log likelihood: Test: χ 2, 1 degree of freedom, 95% quantile: 3.84 H 0 can be rejected Model 2 can be rejected Overall conclusion: model 1 (linear car cost) is preferred over model 2 (log car cost). Transport and Mobility Laboratory Decision-Aid Methodologies 55 / 65

56 Non nested hypotheses Adjusted likelihood ratio index Adjusted likelihood ratio index Likelihood ratio index ρ 2 = 1 L( ˆβ) L(0) ρ 2 = 0: trivial model, equal probabilities ρ 2 = 1: perfect fit. Adjusted likelihood ratio index ρ 2 is increasing with the number of parameters. A higher fit (that is a higher ρ 2 ) does not mean a better model. An adjustment is needed. ρ 2 = 1 L( ˆβ) K L(0) Transport and Mobility Laboratory Decision-Aid Methodologies 56 / 65

57 Non nested hypotheses Adjusted likelihood ratio index ρ 2 test (Horowitz) Compare model 0 and model 1. where We expect that the best model corresponds to the best fit. We will be wrong if M 0 is the true model and M 1 produces a better fit. What is the probability that this happens? If this probability is low, M 0 can be rejected. ( P( ρ 2 1 ρ 2 0 > z M 0 ) Φ ) 2zL(0) + (K 1 K 0 ) ρ l 2 is the adjusted likelihood ratio index of model l = 0, 1 K l is the number of parameters of model l Φ is the standard normal CDF. Transport and Mobility Laboratory Decision-Aid Methodologies 57 / 65

58 Non nested hypotheses Adjusted likelihood ratio index ρ 2 test (Horowitz) Back to the example: ρ 2 # parameters Model 0 (log car cost) Model 1 (linear car cost) ( P( ρ 2 1 ρ 2 0 > z M 0 ) Φ ) 2zL(0) + (K 1 K 0 ) ( P( ρ 2 1 ρ 2 0 > M 0 ) Φ ) 2z( ) + (10 10) P( ρ 1 2 ρ 0 2 > M 0 ) Φ ( 3.73) 0 Therefore, M 0 can be rejected, and the linear specification is preferred. Transport and Mobility Laboratory Decision-Aid Methodologies 58 / 65

59 Non nested hypotheses Adjusted likelihood ratio index ρ 2 test (Horowitz) In practice, if the sample is large enough (i.e. more than 250 observations) if the values of the ρ 2 differ by 0.01 or more the model with the lower ρ 2 is almost certainly incorrect Transport and Mobility Laboratory Decision-Aid Methodologies 59 / 65

60 Further tests Outlier analysis Outlier analysis Procedure Apply the model on the sample Examine observations where the predicted probability is the smallest for the observed choice Test model sensitivity to outliers, as a small probability has a significant impact on the log likelihood Potential causes of low probability: Coding or measurement error in the data Model misspecification Unexplainable variation in choice behavior Transport and Mobility Laboratory Decision-Aid Methodologies 60 / 65

61 Further tests Outlier analysis Outlier analysis Coding or measurement error in the data Look for signs of data errors Correct or remove the observation Model misspecification Seek clues of missing variables from the observation Keep the observation and improve the model Unexplainable variation in choice behavior Keep the observation Avoid over fitting of the model to the data Transport and Mobility Laboratory Decision-Aid Methodologies 61 / 65

62 Further tests Market segments Market segments Procedure Compare predicted vs. observed shares per segment Let N g be the set of sampled individuals in segment g Observed share for alt. i and segment g S g (i) = n N g y in /N g Predicted share for alt. i and segment g Ŝ g (i) = n N g P n (i)/n g Transport and Mobility Laboratory Decision-Aid Methodologies 62 / 65

63 Further tests Market segments Market segments Note: With a full set of constants for segment g: y in = P n (i) n N g n N g Do not saturate the model with constants Transport and Mobility Laboratory Decision-Aid Methodologies 63 / 65

64 Further tests Market segments Conclusions Tests are designed to check meaningful hypotheses Do not test hypotheses that do not make sense Do not apply the tests blindly Always use your judgment. Transport and Mobility Laboratory Decision-Aid Methodologies 64 / 65

65 Appendix 90%, 95% and 99% of the χ 2 distribution with K degrees of freedom K 90% 95% 99% K 90% 95% 99% Transport and Mobility Laboratory Decision-Aid Methodologies 65 / 65

Binary choice. Michel Bierlaire

Binary choice. Michel Bierlaire Binary choice Michel Bierlaire Transport and Mobility Laboratory School of Architecture, Civil and Environmental Engineering Ecole Polytechnique Fédérale de Lausanne M. Bierlaire (TRANSP-OR ENAC EPFL)

More information

Swissmetro. March 3, Model Specification with Generic Attributes

Swissmetro. March 3, Model Specification with Generic Attributes Swissmetro March 3, 2015 Model Specification with Generic Attributes Model file: MNL_SM_generic.mod The dataset consists of survey data collected on the trains between St. Gallen and Geneva in Switzerland.

More information

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS In our work on hypothesis testing, we used the value of a sample statistic to challenge an accepted value of a population parameter. We focused only

More information

Bias Variance Trade-off

Bias Variance Trade-off Bias Variance Trade-off The mean squared error of an estimator MSE(ˆθ) = E([ˆθ θ] 2 ) Can be re-expressed MSE(ˆθ) = Var(ˆθ) + (B(ˆθ) 2 ) MSE = VAR + BIAS 2 Proof MSE(ˆθ) = E((ˆθ θ) 2 ) = E(([ˆθ E(ˆθ)]

More information

Political Science 236 Hypothesis Testing: Review and Bootstrapping

Political Science 236 Hypothesis Testing: Review and Bootstrapping Political Science 236 Hypothesis Testing: Review and Bootstrapping Rocío Titiunik Fall 2007 1 Hypothesis Testing Definition 1.1 Hypothesis. A hypothesis is a statement about a population parameter The

More information

LECTURE 5. Introduction to Econometrics. Hypothesis testing

LECTURE 5. Introduction to Econometrics. Hypothesis testing LECTURE 5 Introduction to Econometrics Hypothesis testing October 18, 2016 1 / 26 ON TODAY S LECTURE We are going to discuss how hypotheses about coefficients can be tested in regression models We will

More information

INTERVAL ESTIMATION AND HYPOTHESES TESTING

INTERVAL ESTIMATION AND HYPOTHESES TESTING INTERVAL ESTIMATION AND HYPOTHESES TESTING 1. IDEA An interval rather than a point estimate is often of interest. Confidence intervals are thus important in empirical work. To construct interval estimates,

More information

CHAPTER 6: SPECIFICATION VARIABLES

CHAPTER 6: SPECIFICATION VARIABLES Recall, we had the following six assumptions required for the Gauss-Markov Theorem: 1. The regression model is linear, correctly specified, and has an additive error term. 2. The error term has a zero

More information

Choice Theory. Matthieu de Lapparent

Choice Theory. Matthieu de Lapparent Choice Theory Matthieu de Lapparent matthieu.delapparent@epfl.ch Transport and Mobility Laboratory, School of Architecture, Civil and Environmental Engineering, Ecole Polytechnique Fédérale de Lausanne

More information

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A. 1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n

More information

TMA 4275 Lifetime Analysis June 2004 Solution

TMA 4275 Lifetime Analysis June 2004 Solution TMA 4275 Lifetime Analysis June 2004 Solution Problem 1 a) Observation of the outcome is censored, if the time of the outcome is not known exactly and only the last time when it was observed being intact,

More information

Hypothesis Testing The basic ingredients of a hypothesis test are

Hypothesis Testing The basic ingredients of a hypothesis test are Hypothesis Testing The basic ingredients of a hypothesis test are 1 the null hypothesis, denoted as H o 2 the alternative hypothesis, denoted as H a 3 the test statistic 4 the data 5 the conclusion. The

More information

ECO220Y Review and Introduction to Hypothesis Testing Readings: Chapter 12

ECO220Y Review and Introduction to Hypothesis Testing Readings: Chapter 12 ECO220Y Review and Introduction to Hypothesis Testing Readings: Chapter 12 Winter 2012 Lecture 13 (Winter 2011) Estimation Lecture 13 1 / 33 Review of Main Concepts Sampling Distribution of Sample Mean

More information

Statistics and econometrics

Statistics and econometrics 1 / 36 Slides for the course Statistics and econometrics Part 10: Asymptotic hypothesis testing European University Institute Andrea Ichino September 8, 2014 2 / 36 Outline Why do we need large sample

More information

Testing Hypothesis. Maura Mezzetti. Department of Economics and Finance Università Tor Vergata

Testing Hypothesis. Maura Mezzetti. Department of Economics and Finance Università Tor Vergata Maura Department of Economics and Finance Università Tor Vergata Hypothesis Testing Outline It is a mistake to confound strangeness with mystery Sherlock Holmes A Study in Scarlet Outline 1 The Power Function

More information

Generalized Linear Models

Generalized Linear Models Generalized Linear Models Lecture 3. Hypothesis testing. Goodness of Fit. Model diagnostics GLM (Spring, 2018) Lecture 3 1 / 34 Models Let M(X r ) be a model with design matrix X r (with r columns) r n

More information

Applied Quantitative Methods II

Applied Quantitative Methods II Applied Quantitative Methods II Lecture 4: OLS and Statistics revision Klára Kaĺıšková Klára Kaĺıšková AQM II - Lecture 4 VŠE, SS 2016/17 1 / 68 Outline 1 Econometric analysis Properties of an estimator

More information

Ch. 5 Hypothesis Testing

Ch. 5 Hypothesis Testing Ch. 5 Hypothesis Testing The current framework of hypothesis testing is largely due to the work of Neyman and Pearson in the late 1920s, early 30s, complementing Fisher s work on estimation. As in estimation,

More information

Hypothesis Testing. ECE 3530 Spring Antonio Paiva

Hypothesis Testing. ECE 3530 Spring Antonio Paiva Hypothesis Testing ECE 3530 Spring 2010 Antonio Paiva What is hypothesis testing? A statistical hypothesis is an assertion or conjecture concerning one or more populations. To prove that a hypothesis is

More information

Hypothesis Testing. 1 Definitions of test statistics. CB: chapter 8; section 10.3

Hypothesis Testing. 1 Definitions of test statistics. CB: chapter 8; section 10.3 Hypothesis Testing CB: chapter 8; section 0.3 Hypothesis: statement about an unknown population parameter Examples: The average age of males in Sweden is 7. (statement about population mean) The lowest

More information

STAT 4385 Topic 01: Introduction & Review

STAT 4385 Topic 01: Introduction & Review STAT 4385 Topic 01: Introduction & Review Xiaogang Su, Ph.D. Department of Mathematical Science University of Texas at El Paso xsu@utep.edu Spring, 2016 Outline Welcome What is Regression Analysis? Basics

More information

Statistical Inference. Part IV. Statistical Inference

Statistical Inference. Part IV. Statistical Inference Part IV Statistical Inference As of Oct 5, 2017 Sampling Distributions of the OLS Estimator 1 Statistical Inference Sampling Distributions of the OLS Estimator Testing Against One-Sided Alternatives Two-Sided

More information

Mathematical Statistics

Mathematical Statistics Mathematical Statistics MAS 713 Chapter 8 Previous lecture: 1 Bayesian Inference 2 Decision theory 3 Bayesian Vs. Frequentist 4 Loss functions 5 Conjugate priors Any questions? Mathematical Statistics

More information

Econometric Methods. Prediction / Violation of A-Assumptions. Burcu Erdogan. Universität Trier WS 2011/2012

Econometric Methods. Prediction / Violation of A-Assumptions. Burcu Erdogan. Universität Trier WS 2011/2012 Econometric Methods Prediction / Violation of A-Assumptions Burcu Erdogan Universität Trier WS 2011/2012 (Universität Trier) Econometric Methods 30.11.2011 1 / 42 Moving on to... 1 Prediction 2 Violation

More information

Introduction Large Sample Testing Composite Hypotheses. Hypothesis Testing. Daniel Schmierer Econ 312. March 30, 2007

Introduction Large Sample Testing Composite Hypotheses. Hypothesis Testing. Daniel Schmierer Econ 312. March 30, 2007 Hypothesis Testing Daniel Schmierer Econ 312 March 30, 2007 Basics Parameter of interest: θ Θ Structure of the test: H 0 : θ Θ 0 H 1 : θ Θ 1 for some sets Θ 0, Θ 1 Θ where Θ 0 Θ 1 = (often Θ 1 = Θ Θ 0

More information

Business Statistics. Lecture 10: Correlation and Linear Regression

Business Statistics. Lecture 10: Correlation and Linear Regression Business Statistics Lecture 10: Correlation and Linear Regression Scatterplot A scatterplot shows the relationship between two quantitative variables measured on the same individuals. It displays the Form

More information

Statistical Inference

Statistical Inference Statistical Inference Classical and Bayesian Methods Class 6 AMS-UCSC Thu 26, 2012 Winter 2012. Session 1 (Class 6) AMS-132/206 Thu 26, 2012 1 / 15 Topics Topics We will talk about... 1 Hypothesis testing

More information

Parameter Estimation, Sampling Distributions & Hypothesis Testing

Parameter Estimation, Sampling Distributions & Hypothesis Testing Parameter Estimation, Sampling Distributions & Hypothesis Testing Parameter Estimation & Hypothesis Testing In doing research, we are usually interested in some feature of a population distribution (which

More information

Applied Econometrics (QEM)

Applied Econometrics (QEM) Applied Econometrics (QEM) based on Prinicples of Econometrics Jakub Mućk Department of Quantitative Economics Jakub Mućk Applied Econometrics (QEM) Meeting #3 1 / 42 Outline 1 2 3 t-test P-value Linear

More information

Preliminary Statistics Lecture 5: Hypothesis Testing (Outline)

Preliminary Statistics Lecture 5: Hypothesis Testing (Outline) 1 School of Oriental and African Studies September 2015 Department of Economics Preliminary Statistics Lecture 5: Hypothesis Testing (Outline) Gujarati D. Basic Econometrics, Appendix A.8 Barrow M. Statistics

More information

Problem 1 (20) Log-normal. f(x) Cauchy

Problem 1 (20) Log-normal. f(x) Cauchy ORF 245. Rigollet Date: 11/21/2008 Problem 1 (20) f(x) f(x) 0.0 0.1 0.2 0.3 0.4 0.0 0.2 0.4 0.6 0.8 4 2 0 2 4 Normal (with mean -1) 4 2 0 2 4 Negative-exponential x x f(x) f(x) 0.0 0.1 0.2 0.3 0.4 0.5

More information

Institute of Actuaries of India

Institute of Actuaries of India Institute of Actuaries of India Subject CT3 Probability & Mathematical Statistics May 2011 Examinations INDICATIVE SOLUTION Introduction The indicative solution has been written by the Examiners with the

More information

Chapter 8 Heteroskedasticity

Chapter 8 Heteroskedasticity Chapter 8 Walter R. Paczkowski Rutgers University Page 1 Chapter Contents 8.1 The Nature of 8. Detecting 8.3 -Consistent Standard Errors 8.4 Generalized Least Squares: Known Form of Variance 8.5 Generalized

More information

EE/CpE 345. Modeling and Simulation. Fall Class 9

EE/CpE 345. Modeling and Simulation. Fall Class 9 EE/CpE 345 Modeling and Simulation Class 9 208 Input Modeling Inputs(t) Actual System Outputs(t) Parameters? Simulated System Outputs(t) The input data is the driving force for the simulation - the behavior

More information

2. What are the tradeoffs among different measures of error (e.g. probability of false alarm, probability of miss, etc.)?

2. What are the tradeoffs among different measures of error (e.g. probability of false alarm, probability of miss, etc.)? ECE 830 / CS 76 Spring 06 Instructors: R. Willett & R. Nowak Lecture 3: Likelihood ratio tests, Neyman-Pearson detectors, ROC curves, and sufficient statistics Executive summary In the last lecture we

More information

401 Review. 6. Power analysis for one/two-sample hypothesis tests and for correlation analysis.

401 Review. 6. Power analysis for one/two-sample hypothesis tests and for correlation analysis. 401 Review Major topics of the course 1. Univariate analysis 2. Bivariate analysis 3. Simple linear regression 4. Linear algebra 5. Multiple regression analysis Major analysis methods 1. Graphical analysis

More information

Section 9.1 (Part 2) (pp ) Type I and Type II Errors

Section 9.1 (Part 2) (pp ) Type I and Type II Errors Section 9.1 (Part 2) (pp. 547-551) Type I and Type II Errors Because we are basing our conclusion in a significance test on sample data, there is always a chance that our conclusions will be in error.

More information

MS&E 226: Small Data

MS&E 226: Small Data MS&E 226: Small Data Lecture 15: Examples of hypothesis tests (v5) Ramesh Johari ramesh.johari@stanford.edu 1 / 32 The recipe 2 / 32 The hypothesis testing recipe In this lecture we repeatedly apply the

More information

Calculating indicators with PythonBiogeme

Calculating indicators with PythonBiogeme Calculating indicators with PythonBiogeme Michel Bierlaire May 17, 2017 Report TRANSP-OR 170517 Transport and Mobility Laboratory School of Architecture, Civil and Environmental Engineering Ecole Polytechnique

More information

Chapter 3 Multiple Regression Complete Example

Chapter 3 Multiple Regression Complete Example Department of Quantitative Methods & Information Systems ECON 504 Chapter 3 Multiple Regression Complete Example Spring 2013 Dr. Mohammad Zainal Review Goals After completing this lecture, you should be

More information

Testing Restrictions and Comparing Models

Testing Restrictions and Comparing Models Econ. 513, Time Series Econometrics Fall 00 Chris Sims Testing Restrictions and Comparing Models 1. THE PROBLEM We consider here the problem of comparing two parametric models for the data X, defined by

More information

3. (a) (8 points) There is more than one way to correctly express the null hypothesis in matrix form. One way to state the null hypothesis is

3. (a) (8 points) There is more than one way to correctly express the null hypothesis in matrix form. One way to state the null hypothesis is Stat 501 Solutions and Comments on Exam 1 Spring 005-4 0-4 1. (a) (5 points) Y ~ N, -1-4 34 (b) (5 points) X (X,X ) = (5,8) ~ N ( 11.5, 0.9375 ) 3 1 (c) (10 points, for each part) (i), (ii), and (v) are

More information

Introductory Econometrics

Introductory Econometrics Based on the textbook by Wooldridge: : A Modern Approach Robert M. Kunst robert.kunst@univie.ac.at University of Vienna and Institute for Advanced Studies Vienna November 23, 2013 Outline Introduction

More information

20 Hypothesis Testing, Part I

20 Hypothesis Testing, Part I 20 Hypothesis Testing, Part I Bob has told Alice that the average hourly rate for a lawyer in Virginia is $200 with a standard deviation of $50, but Alice wants to test this claim. If Bob is right, she

More information

Summary of Chapters 7-9

Summary of Chapters 7-9 Summary of Chapters 7-9 Chapter 7. Interval Estimation 7.2. Confidence Intervals for Difference of Two Means Let X 1,, X n and Y 1, Y 2,, Y m be two independent random samples of sizes n and m from two

More information

EE/CpE 345. Modeling and Simulation. Fall Class 10 November 18, 2002

EE/CpE 345. Modeling and Simulation. Fall Class 10 November 18, 2002 EE/CpE 345 Modeling and Simulation Class 0 November 8, 2002 Input Modeling Inputs(t) Actual System Outputs(t) Parameters? Simulated System Outputs(t) The input data is the driving force for the simulation

More information

Review of Econometrics

Review of Econometrics Review of Econometrics Zheng Tian June 5th, 2017 1 The Essence of the OLS Estimation Multiple regression model involves the models as follows Y i = β 0 + β 1 X 1i + β 2 X 2i + + β k X ki + u i, i = 1,...,

More information

STAT 461/561- Assignments, Year 2015

STAT 461/561- Assignments, Year 2015 STAT 461/561- Assignments, Year 2015 This is the second set of assignment problems. When you hand in any problem, include the problem itself and its number. pdf are welcome. If so, use large fonts and

More information

Data analysis and Geostatistics - lecture VI

Data analysis and Geostatistics - lecture VI Data analysis and Geostatistics - lecture VI Statistical testing with population distributions Statistical testing - the steps 1. Define a hypothesis to test in statistics only a hypothesis rejection is

More information

How do we compare the relative performance among competing models?

How do we compare the relative performance among competing models? How do we compare the relative performance among competing models? 1 Comparing Data Mining Methods Frequent problem: we want to know which of the two learning techniques is better How to reliably say Model

More information

Functional Form. Econometrics. ADEi.

Functional Form. Econometrics. ADEi. Functional Form Econometrics. ADEi. 1. Introduction We have employed the linear function in our model specification. Why? It is simple and has good mathematical properties. It could be reasonable approximation,

More information

Stat 5102 Final Exam May 14, 2015

Stat 5102 Final Exam May 14, 2015 Stat 5102 Final Exam May 14, 2015 Name Student ID The exam is closed book and closed notes. You may use three 8 1 11 2 sheets of paper with formulas, etc. You may also use the handouts on brand name distributions

More information

Parameter Estimation and Fitting to Data

Parameter Estimation and Fitting to Data Parameter Estimation and Fitting to Data Parameter estimation Maximum likelihood Least squares Goodness-of-fit Examples Elton S. Smith, Jefferson Lab 1 Parameter estimation Properties of estimators 3 An

More information

MTMS Mathematical Statistics

MTMS Mathematical Statistics MTMS.01.099 Mathematical Statistics Lecture 12. Hypothesis testing. Power function. Approximation of Normal distribution and application to Binomial distribution Tõnu Kollo Fall 2016 Hypothesis Testing

More information

ˆπ(x) = exp(ˆα + ˆβ T x) 1 + exp(ˆα + ˆβ T.

ˆπ(x) = exp(ˆα + ˆβ T x) 1 + exp(ˆα + ˆβ T. Exam 3 Review Suppose that X i = x =(x 1,, x k ) T is observed and that Y i X i = x i independent Binomial(n i,π(x i )) for i =1,, N where ˆπ(x) = exp(ˆα + ˆβ T x) 1 + exp(ˆα + ˆβ T x) This is called the

More information

ECON 4160, Autumn term Lecture 1

ECON 4160, Autumn term Lecture 1 ECON 4160, Autumn term 2017. Lecture 1 a) Maximum Likelihood based inference. b) The bivariate normal model Ragnar Nymoen University of Oslo 24 August 2017 1 / 54 Principles of inference I Ordinary least

More information

Review of Statistics

Review of Statistics Review of Statistics Topics Descriptive Statistics Mean, Variance Probability Union event, joint event Random Variables Discrete and Continuous Distributions, Moments Two Random Variables Covariance and

More information

Lecture 1. Behavioral Models Multinomial Logit: Power and limitations. Cinzia Cirillo

Lecture 1. Behavioral Models Multinomial Logit: Power and limitations. Cinzia Cirillo Lecture 1 Behavioral Models Multinomial Logit: Power and limitations Cinzia Cirillo 1 Overview 1. Choice Probabilities 2. Power and Limitations of Logit 1. Taste variation 2. Substitution patterns 3. Repeated

More information

Chapter 1 Statistical Inference

Chapter 1 Statistical Inference Chapter 1 Statistical Inference causal inference To infer causality, you need a randomized experiment (or a huge observational study and lots of outside information). inference to populations Generalizations

More information

Answers to Problem Set #4

Answers to Problem Set #4 Answers to Problem Set #4 Problems. Suppose that, from a sample of 63 observations, the least squares estimates and the corresponding estimated variance covariance matrix are given by: bβ bβ 2 bβ 3 = 2

More information

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review STATS 200: Introduction to Statistical Inference Lecture 29: Course review Course review We started in Lecture 1 with a fundamental assumption: Data is a realization of a random process. The goal throughout

More information

Sociology 6Z03 Review II

Sociology 6Z03 Review II Sociology 6Z03 Review II John Fox McMaster University Fall 2016 John Fox (McMaster University) Sociology 6Z03 Review II Fall 2016 1 / 35 Outline: Review II Probability Part I Sampling Distributions Probability

More information

Interactions. Interactions. Lectures 1 & 2. Linear Relationships. y = a + bx. Slope. Intercept

Interactions. Interactions. Lectures 1 & 2. Linear Relationships. y = a + bx. Slope. Intercept Interactions Lectures 1 & Regression Sometimes two variables appear related: > smoking and lung cancers > height and weight > years of education and income > engine size and gas mileage > GMAT scores and

More information

Robustness and Distribution Assumptions

Robustness and Distribution Assumptions Chapter 1 Robustness and Distribution Assumptions 1.1 Introduction In statistics, one often works with model assumptions, i.e., one assumes that data follow a certain model. Then one makes use of methodology

More information

Are You The Beatles Or The Rolling Stones?

Are You The Beatles Or The Rolling Stones? N. Ortelli, G. Lederrey, A. Alahi, M. Bierlaire Transport and Mobility Laboratory School of Architecture, Civil, and Environmental Engineering ENAC École Polytechnique Fédérale de Lausanne 13th Workshop

More information

Econ 325: Introduction to Empirical Economics

Econ 325: Introduction to Empirical Economics Econ 325: Introduction to Empirical Economics Chapter 9 Hypothesis Testing: Single Population Ch. 9-1 9.1 What is a Hypothesis? A hypothesis is a claim (assumption) about a population parameter: population

More information

Unit 19 Formulating Hypotheses and Making Decisions

Unit 19 Formulating Hypotheses and Making Decisions Unit 19 Formulating Hypotheses and Making Decisions Objectives: To formulate a null hypothesis and an alternative hypothesis, and to choose a significance level To identify the Type I error and the Type

More information

Chapter 4: Constrained estimators and tests in the multiple linear regression model (Part III)

Chapter 4: Constrained estimators and tests in the multiple linear regression model (Part III) Chapter 4: Constrained estimators and tests in the multiple linear regression model (Part III) Florian Pelgrin HEC September-December 2010 Florian Pelgrin (HEC) Constrained estimators September-December

More information

The Analysis of Power for Some Chosen VaR Backtesting Procedures - Simulation Approach

The Analysis of Power for Some Chosen VaR Backtesting Procedures - Simulation Approach The Analysis of Power for Some Chosen VaR Backtesting Procedures - Simulation Approach Krzysztof Piontek Department of Financial Investments and Risk Management Wroclaw University of Economics ul. Komandorska

More information

Chapter 4. Theory of Tests. 4.1 Introduction

Chapter 4. Theory of Tests. 4.1 Introduction Chapter 4 Theory of Tests 4.1 Introduction Parametric model: (X, B X, P θ ), P θ P = {P θ θ Θ} where Θ = H 0 +H 1 X = K +A : K: critical region = rejection region / A: acceptance region A decision rule

More information

Nonlinear Regression Functions

Nonlinear Regression Functions Nonlinear Regression Functions (SW Chapter 8) Outline 1. Nonlinear regression functions general comments 2. Nonlinear functions of one variable 3. Nonlinear functions of two variables: interactions 4.

More information

Cherry Blossom run (1) The credit union Cherry Blossom Run is a 10 mile race that takes place every year in D.C. In 2009 there were participants

Cherry Blossom run (1) The credit union Cherry Blossom Run is a 10 mile race that takes place every year in D.C. In 2009 there were participants 18.650 Statistics for Applications Chapter 5: Parametric hypothesis testing 1/37 Cherry Blossom run (1) The credit union Cherry Blossom Run is a 10 mile race that takes place every year in D.C. In 2009

More information

Unit 10: Simple Linear Regression and Correlation

Unit 10: Simple Linear Regression and Correlation Unit 10: Simple Linear Regression and Correlation Statistics 571: Statistical Methods Ramón V. León 6/28/2004 Unit 10 - Stat 571 - Ramón V. León 1 Introductory Remarks Regression analysis is a method for

More information

Topic 10: Hypothesis Testing

Topic 10: Hypothesis Testing Topic 10: Hypothesis Testing Course 003, 2016 Page 0 The Problem of Hypothesis Testing A statistical hypothesis is an assertion or conjecture about the probability distribution of one or more random variables.

More information

8. Hypothesis Testing

8. Hypothesis Testing FE661 - Statistical Methods for Financial Engineering 8. Hypothesis Testing Jitkomut Songsiri introduction Wald test likelihood-based tests significance test for linear regression 8-1 Introduction elements

More information

What is a Hypothesis?

What is a Hypothesis? What is a Hypothesis? A hypothesis is a claim (assumption) about a population parameter: population mean Example: The mean monthly cell phone bill in this city is μ = $42 population proportion Example:

More information

Lecture 7 Introduction to Statistical Decision Theory

Lecture 7 Introduction to Statistical Decision Theory Lecture 7 Introduction to Statistical Decision Theory I-Hsiang Wang Department of Electrical Engineering National Taiwan University ihwang@ntu.edu.tw December 20, 2016 1 / 55 I-Hsiang Wang IT Lecture 7

More information

Multiple Regression Analysis. Part III. Multiple Regression Analysis

Multiple Regression Analysis. Part III. Multiple Regression Analysis Part III Multiple Regression Analysis As of Sep 26, 2017 1 Multiple Regression Analysis Estimation Matrix form Goodness-of-Fit R-square Adjusted R-square Expected values of the OLS estimators Irrelevant

More information

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018 Econometrics I KS Module 2: Multivariate Linear Regression Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: April 16, 2018 Alexander Ahammer (JKU) Module 2: Multivariate

More information

1/24/2008. Review of Statistical Inference. C.1 A Sample of Data. C.2 An Econometric Model. C.4 Estimating the Population Variance and Other Moments

1/24/2008. Review of Statistical Inference. C.1 A Sample of Data. C.2 An Econometric Model. C.4 Estimating the Population Variance and Other Moments /4/008 Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University C. A Sample of Data C. An Econometric Model C.3 Estimating the Mean of a Population C.4 Estimating the Population

More information

Master s Written Examination

Master s Written Examination Master s Written Examination Option: Statistics and Probability Spring 016 Full points may be obtained for correct answers to eight questions. Each numbered question which may have several parts is worth

More information

LECTURE 10: NEYMAN-PEARSON LEMMA AND ASYMPTOTIC TESTING. The last equality is provided so this can look like a more familiar parametric test.

LECTURE 10: NEYMAN-PEARSON LEMMA AND ASYMPTOTIC TESTING. The last equality is provided so this can look like a more familiar parametric test. Economics 52 Econometrics Professor N.M. Kiefer LECTURE 1: NEYMAN-PEARSON LEMMA AND ASYMPTOTIC TESTING NEYMAN-PEARSON LEMMA: Lesson: Good tests are based on the likelihood ratio. The proof is easy in the

More information

Preliminary Statistics. Lecture 5: Hypothesis Testing

Preliminary Statistics. Lecture 5: Hypothesis Testing Preliminary Statistics Lecture 5: Hypothesis Testing Rory Macqueen (rm43@soas.ac.uk), September 2015 Outline Elements/Terminology of Hypothesis Testing Types of Errors Procedure of Testing Significance

More information

1/34 3/ Omission of a relevant variable(s) Y i = α 1 + α 2 X 1i + α 3 X 2i + u 2i

1/34 3/ Omission of a relevant variable(s) Y i = α 1 + α 2 X 1i + α 3 X 2i + u 2i 1/34 Outline Basic Econometrics in Transportation Model Specification How does one go about finding the correct model? What are the consequences of specification errors? How does one detect specification

More information

2) For a normal distribution, the skewness and kurtosis measures are as follows: A) 1.96 and 4 B) 1 and 2 C) 0 and 3 D) 0 and 0

2) For a normal distribution, the skewness and kurtosis measures are as follows: A) 1.96 and 4 B) 1 and 2 C) 0 and 3 D) 0 and 0 Introduction to Econometrics Midterm April 26, 2011 Name Student ID MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. (5,000 credit for each correct

More information

Lecture Testing Hypotheses: The Neyman-Pearson Paradigm

Lecture Testing Hypotheses: The Neyman-Pearson Paradigm Math 408 - Mathematical Statistics Lecture 29-30. Testing Hypotheses: The Neyman-Pearson Paradigm April 12-15, 2013 Konstantin Zuev (USC) Math 408, Lecture 29-30 April 12-15, 2013 1 / 12 Agenda Example:

More information

The regression model with one fixed regressor cont d

The regression model with one fixed regressor cont d The regression model with one fixed regressor cont d 3150/4150 Lecture 4 Ragnar Nymoen 27 January 2012 The model with transformed variables Regression with transformed variables I References HGL Ch 2.8

More information

Two-stage Adaptive Randomization for Delayed Response in Clinical Trials

Two-stage Adaptive Randomization for Delayed Response in Clinical Trials Two-stage Adaptive Randomization for Delayed Response in Clinical Trials Guosheng Yin Department of Statistics and Actuarial Science The University of Hong Kong Joint work with J. Xu PSI and RSS Journal

More information

Testing Equality of Two Intercepts for the Parallel Regression Model with Non-sample Prior Information

Testing Equality of Two Intercepts for the Parallel Regression Model with Non-sample Prior Information Testing Equality of Two Intercepts for the Parallel Regression Model with Non-sample Prior Information Budi Pratikno 1 and Shahjahan Khan 2 1 Department of Mathematics and Natural Science Jenderal Soedirman

More information

9. Model Selection. statistical models. overview of model selection. information criteria. goodness-of-fit measures

9. Model Selection. statistical models. overview of model selection. information criteria. goodness-of-fit measures FE661 - Statistical Methods for Financial Engineering 9. Model Selection Jitkomut Songsiri statistical models overview of model selection information criteria goodness-of-fit measures 9-1 Statistical models

More information

8: Hypothesis Testing

8: Hypothesis Testing Some definitions 8: Hypothesis Testing. Simple, compound, null and alternative hypotheses In test theory one distinguishes between simple hypotheses and compound hypotheses. A simple hypothesis Examples:

More information

Lecture 5: Likelihood ratio tests, Neyman-Pearson detectors, ROC curves, and sufficient statistics. 1 Executive summary

Lecture 5: Likelihood ratio tests, Neyman-Pearson detectors, ROC curves, and sufficient statistics. 1 Executive summary ECE 830 Spring 207 Instructor: R. Willett Lecture 5: Likelihood ratio tests, Neyman-Pearson detectors, ROC curves, and sufficient statistics Executive summary In the last lecture we saw that the likelihood

More information

BIOS 312: Precision of Statistical Inference

BIOS 312: Precision of Statistical Inference and Power/Sample Size and Standard Errors BIOS 312: of Statistical Inference Chris Slaughter Department of Biostatistics, Vanderbilt University School of Medicine January 3, 2013 Outline Overview and Power/Sample

More information

Master s Written Examination - Solution

Master s Written Examination - Solution Master s Written Examination - Solution Spring 204 Problem Stat 40 Suppose X and X 2 have the joint pdf f X,X 2 (x, x 2 ) = 2e (x +x 2 ), 0 < x < x 2

More information

7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between

7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between 7.2 One-Sample Correlation ( = a) Introduction Correlation analysis measures the strength and direction of association between variables. In this chapter we will test whether the population correlation

More information

Outline. Nature of the Problem. Nature of the Problem. Basic Econometrics in Transportation. Autocorrelation

Outline. Nature of the Problem. Nature of the Problem. Basic Econometrics in Transportation. Autocorrelation 1/30 Outline Basic Econometrics in Transportation Autocorrelation Amir Samimi What is the nature of autocorrelation? What are the theoretical and practical consequences of autocorrelation? Since the assumption

More information

Lecture-20: Discrete Choice Modeling-I

Lecture-20: Discrete Choice Modeling-I Lecture-20: Discrete Choice Modeling-I 1 In Today s Class Introduction to discrete choice models General formulation Binary choice models Specification Model estimation Application Case Study 2 Discrete

More information

Testing and Model Selection

Testing and Model Selection Testing and Model Selection This is another digression on general statistics: see PE App C.8.4. The EViews output for least squares, probit and logit includes some statistics relevant to testing hypotheses

More information

Homework 2: Simple Linear Regression

Homework 2: Simple Linear Regression STAT 4385 Applied Regression Analysis Homework : Simple Linear Regression (Simple Linear Regression) Thirty (n = 30) College graduates who have recently entered the job market. For each student, the CGPA

More information

Hypothesis testing I. - In particular, we are talking about statistical hypotheses. [get everyone s finger length!] n =

Hypothesis testing I. - In particular, we are talking about statistical hypotheses. [get everyone s finger length!] n = Hypothesis testing I I. What is hypothesis testing? [Note we re temporarily bouncing around in the book a lot! Things will settle down again in a week or so] - Exactly what it says. We develop a hypothesis,

More information