Supplementary Materials for

Size: px
Start display at page:

Download "Supplementary Materials for"

Transcription

1 Supplementary Materials for Altered Placebo and Drug Labeling Changes the Outcome of Episodic Migraine Attacks Slavenka Kam-Hansen, Moshe Jakubowski, John M. Kelley, Irving Kirsch, David C. Hoaglin, Ted J. Kaptchuk, Rami Burstein* The PDF file includes: *Corresponding author. Published 8 January 2014, Sci. Transl. Med. 6, 218ra5 (2014) DOI: /scitranslmed Methods Scripted Information Read to Participants Randomization of Treatment and Treatment Labeling Missing Data Analyses of Primary Endpoint Fitting the Main Model Analyses of Secondary Endpoint Table S1. Reasons for excluding 19 of the prescreened subjects. Table S2. Background characteristics of participants and dropouts. Table S3. Selected quantiles of nondichotomous background characteristics of the 66 participants. Table S4. Incidence of missing pain scores and data imputation. Table S5. Structure of the eight treatment sequences and assignment of subjects to treatment sequences. Table S6. Distribution of attacks with imputed pain scores and pain freedom at 2.5 hours. Table S7. Estimates of parameters in the main model for the pain scores including the imputed pain scores at 2.5 hours. Table S8. Percentage decrease in the estimated average pain score from 30 min to 2.5 hours under the seven experimental conditions (from an analysis that included imputed pain scores at 2.5 hours). Table S9. Estimates of parameters in the covariate model for the pain scores including the imputed pain scores at 2.5 hours. Table S10. Estimates of difference (2.5 hours minus 30 min) on the primary endpoint for key contrasts involving treatment and labeling. Table S11. Sensitivity analysis of the main model for the pain scores. Table S12. Estimates of parameters in the main model for pain freedom at 2.5 hours.

2 Table S13. Estimated probability of being pain-free 2 hours after treatment under the seven experimental conditions (from an analysis that included imputed pain scores at 2.5 hours). Table S14. Estimates of difference (2.5 hours minus 30 min) on the secondary endpoint for key contrasts involving treatment and labeling.

3 Supplementary Materials for Altered Placebo and Drug Labeling Changes the Outcome of Episodic Migraine Attacks Slavenka Kam-Hansen, Moshe Jakubowski, John M. Kelley, Irving Kirsch, David C. Hoaglin, Ted J. Kaptchuk, Rami Burstein *Corresponding author. METHODS Scripted Information Read to Participants. You are invited to take part in a research study for the purpose of understanding the effects of repeated administration of Maxalt for the treatment of acute migraine attacks, and why placebo rates are so high in migraine therapy. Our first goal is to understand why Maxalt makes you pain-free in one attack but not in another. Our second goal is to understand why placebo pills can also make you pain-free. Our third goal is to understand why Maxalt works differently when given in double-blind study vs. real-life experience when you take it at home. These goals are scientifically important for developing new therapies for migraine. To reach these goals, we would like to follow 7 of your migraine attacks. Two of these attacks will be treated in the traditional double-blind placebo-controlled way. You will not know whether the pill you take is a Maxalt or a placebo. Four of the attacks will be treated in an open way. We will give you Maxalt in 2 attacks and you will know you take Maxalt and we will give you placebo pills in 2 attacks and you will know you take a placebo pill. At the end of the study we will compare the results of the double-blind placebo treatment to the results of the open treatment attacks. As a control, we will ask you to report your pain in an additional attack (#7) in which you withheld treatment for at least 2 hours. 1

4 Randomization of Treatment and Treatment Labeling. The study-drug envelopes for the 6 treatment conditions combined one of the 3 labels and one of the 2 pills (Fig. 1). Of 720 (= 6!) possible permutations, we chose 8 sequences for delivering the 6 treatment conditions (table S5). The key for the abbreviation system for the 6 treatment conditions is described in the footnote of table S5. In each sequence, maxalt and placebo treatments alternated from one migraine attack to the next, and each labeling was repeated for two consecutive attacks. Within each type of labeling, the treatment pills were given in one order in half of the sequences (maxalt placebo) and in the reverse order in the other half (placebo maxalt). The labeling would have been balanced on all 6 permutations of P ( Placebo ), U (unspecified Maxalt or Placebo ), and M ( Maxalt ) labeling if we had included 4 additional sequences, based on the permutations P U M and M U P (each combined with the two permutations of placebo pill and maxalt pill). We chose to limit the number of sequences to 8 to ensure that a reasonable number of participants would be randomized to each sequence. With our final sample size of 66 analyzable participants, the 8 sequences had an average of 8.25 (range 7 to 10) participants. Had we used 12 sequences, the average would have been 5.5 participants per sequence, with some sequences perhaps having as few as 3 or 4 participants. We chose to omit those 4 sequences because the resulting lack of balance occurs only for the neutral condition ( Maxalt or Placebo ), and the remaining neutral conditions are equally divided between the initial two attacks and the final two attacks, thus maintaining partial balance. Choosing to omit some other subset of sequences would have led to an imbalance for either the Maxalt or Placebo labels, and we assumed that any imbalance for these non-neutral labels would be more likely to affect outcomes. 2

5 The treatment sequences were listed in a spreadsheet ahead of the study. Blocks of the 8 sequences were listed repeatedly, and the order of the sequences was randomized within each block. Each row in the spreadsheet contained one treatment sequence, and the rows were numbered consecutively. Each subject received an identification number according to the order of her/his recruitment and was assigned the corresponding treatment sequence in the spreadsheet. Table S5 shows the distribution of the recruited subjects among the 8 sequences. Missing Data. Sixty-six participants reporting 2 pain scores (30 min and 2.5 h) for each of the 7 attacks would have yielded a total of 924 observations. However, the available data comprised 894 observations because 15 subjects had a total of 30 unreported or unusable pain scores. The analysis took into account the reasons for missingness. The reasons and the numbers of missing and imputed values of pain score and pain freedom outcomes at 2.5 h are listed in tables S4 and S6. In 18 of the attacks with missing data, subjects took rescue medications before the designated time, implying that they did not attain pain freedom at 2.5 h. In those attacks we imputed the missing 2.5-h pain score, according to the following rule. If the subject did not report a 2.5-h pain score or reported a score that was lower than or equal to the 30-min score, we carried over the 30-min pain score. If the subject reported a 2.5-h pain score that was greater than the 30-min score, we used that higher score. Although no single approach is likely to be ideal, the reason for the missingness makes this procedure sensible. Our approach has the limitations that the imputed value may entail some bias and is less variable than the 2.5-h pain score would have been had the subject not resorted to rescue medication prematurely. As a form of sensitivity analysis, we analyzed the available data. Analyses of Primary Endpoint Fitting the Main Model. To describe the analysis of the pain scores, we denote the subject by i (i = 1,, 66), experimental condition by j (j = 1,, 7; table 3

6 S6), and time by t (t = 1, 2; t = 1 corresponds to 30 min, and t = 2 corresponds to 2.5 h). The pain score for subject i at time t under experimental condition j is denoted by y ijt. The analyses used generalized linear mixed models (GLMM). The linear component describes the structure of g(µ ijt ), where g is a link function and µ ijt denotes the expected value of y ijt, and one also specifies a family of distributions for the random variation in y ijt. Our analyses used the logarithm as the link function and the normal distributions as the family for the random component. For the main analysis, the linear component (or linear predictor) had the following form: g(µ ijt ) = β 1 + β 2 Cond2 ijt + β 3 Cond3 ijt + β 4 Cond4 ijt + β 5 Cond5 ijt + β 6 Cond6 ijt + β 7 Cond7 ijt + β 8 Time2 ijt + β 9 Cond2 ijt Time2 ijt + β 10 Cond3 ijt Time2 ijt + β 11 Cond4 ijt Time2 ijt + β 12 Cond5 ijt Time2 ijt + β 13 Cond6 ijt Time2 ijt + β 14 Cond7 ijt Time2 ijt + b i The variables Cond2, Cond3, Cond4, Cond5, Cond6, Cond7 (experimental conditions), and Time2 (2.5 h) are indicator variables; they have the value 1 for certain values of their ijt subscripts and 0 otherwise. NT (no treatment) serves as the reference category for the experimental conditions, and t = 1 (30 min) serves as the reference category for time. A difference between t = 2 and t = 1 on the log scale corresponds to a ratio of pain scores at 2.5 h to pain scores at 30 min. β 1 represents the logarithm of the average of the subjects pain scores at 30 min under the NT condition; it is part of the linear predictor for each experimental condition. Thus, the coefficients for the experimental Cond indicators express the logarithm of the corresponding average 30-min pain score as a difference from the logarithm of the average 30- min pain score under NT. For example, the logarithm of the average 30-min score under the M M condition (j = 4) is β 1 + β 4. Similarly, the coefficient of Time2 expresses the logarithm of the average 2.5-h pain score under the NT condition as a difference from the logarithm of the 4

7 average 30-min pain score under NT; that is, the logarithm of the average pain score at 2.5 h under NT is β 1 + β 8. The coefficients β 9 through β 14 come into play only at 2.5 h under the corresponding condition, because the variables to which they are attached have the value 1 only when both Condj and Time2 are 1. Each expresses the logarithm of its corresponding average pain score as a difference from the sum of the parameter for NT at 30 min ( β 1 ), the parameter for that condition at 30 min ( β j ), and the parameter for NT at 2.5 h ( β 8 ). For example, the logarithm of the average pain score at 2.5 h under the P P condition (j = 7) is β 1 + β 7 + β 8 + β 14. That is, β 9 through β 14 reflect the interaction between experimental condition and time. In the equation displayed above, each of the indicator variables has all three subscripts, i, j, and t, because those variables are part of the model for each combination of subject, experimental condition, and time, even when they have the value 0. We could simplify the notation, however, because the indicator variables have the same value for all subjects, the Cond variables have the same value at 30 min and 2.5 h, and Time2 has the same value for all experimental conditions. That is, we could omit the subscript i from all the indicator variables, we could omit the subscript t from each of the Cond variables, and we could omit the subscript j from Time2. The last term in the displayed equation, b i, is a random effect for subject i. In a mixed model, such random effects account for variation among subjects, which is the main source of correlation among the repeated measures on individual subjects. The parameter to be estimated is the variance of the (normal) distribution of the random effects. We considered including a second random effect, for attack j within subject i, but decided not to do so, as discussed further below. 5

8 To fit the various models, we used Release 12.1 of Stata (Stata Corp LP, College Station, TX, The analyses of the pain scores by generalized linear mixed models used gllamm, a user-written Stata command ( The calculations in gllamm and similar programs involve numerical integration (quadrature), and gllamm has options that allow users to make that process more efficient and more accurate. By default the program uses nonadaptive quadrature with 8 integration points. To assess accuracy, we used the main model and increased the number of integration points to 12, 16, 20, 24, 30, 40, and 50 (and experienced corresponding increases in execution time). The estimates of the coefficients with 50 integration points differed from those with 40 integration points by at most two digits in the fourth decimal place (most of the estimates agreed to four decimal places). We got the same estimates (and smaller execution time) with 20 integration points and adaptive quadrature, so we settled on adaptive quadrature with 20 integration points. Table S7 shows parameter estimates, standard errors, and 95% CIs for the parameters in the main model. The estimates reported in the article are based on this model. In particular, table S8 shows the numerical values of the estimates and 95% CIs plotted in Fig. 3. Sequence of experimental conditions. To investigate the possibility of effects associated with the sequence in which the subjects encountered the experimental conditions (table S5), we expanded the linear model shown above by including effects for the sequence (a main effect), the combination of sequence and experimental condition (a 2-factor interaction), the combination of sequence and time (a 2-factor interaction), and the combination of sequence and experimental condition and time (a 3-factor interaction). A likelihood-ratio test indicated that the expanded model was not a significant improvement (chi-squared = , df = 98, P = 0.116). Thus, the models for the primary endpoint did not include effects for the sequences. 6

9 Covariates. We investigated the influence of 7 background characteristics: age, years since first attack, number of attacks per month, migraine days per month, sex, triptan history, and family history. Adding these 7 covariates to the main model produced a non-significant likelihood-ratio statistic (chi-squared = 9.19, df = 7, P = 0.239). Also, the P value for each covariate was > 0.05 (table S9). Thus, the main model did not include any covariates. Random effect for attack within subject. Adding to the main model a random effect for attack within subject yielded a log-likelihood that was virtually identical to the one obtained without this random effect. The estimates of the parameters were essentially the same as those in table S7, and the estimate of the variance of the additional random effect was with a standard error of In view of these negligible contributions we did not retain this random effect in the main model. Gamma distributions for the random component. We also considered using the gamma family instead of the normal family for the random component in the main model. gllamm gave the error message: initial values not feasible. The initial values for the parameters in the linear model were appropriate, so the difficulty apparently lay with the initial values of the parameters for the gamma distribution, which gllamm did not reveal. It did not seem worthwhile to pursue this problem further, since the direction of the skewness in the gamma distributions is opposite to that in the pain scores. Interaction of labeling and treatment. The main model is suitable for testing the main effect for treatment (the difference between M and P treatments, df = 1), but not for assessing, in the aggregate, the main effects for labeling (df = 2) and the interactions between labeling and treatment. Likelihood-ratio tests of the main effects and interactions are straightforward when one has a sequence of nested models that allows for removal of variables associated only with 7

10 interactions between labeling and treatment (leaving only main effects for labeling and main effects for treatment), followed by removal of variables associated only with main effects of labeling, or variables associated only with main effects of treatment, or removal of both sets of main effects. The main model does not have such a nested structure, because it takes into account the data from the untreated attack (NT). Thus, we modified the main model by replacing the 6 product variables with indicator variables (all as differences from the change associated with NT, as in the main model). The indicator variables were the overall effect of the six treatment conditions and the incremental effects of M labeling, P labeling, P treatment, the M P combination, and the P P combination. Omission of the M P and P P indicator variables yielded a model with no interaction between labeling and treatment (likelihood-ratio test; chisquared = 1.96, df = 2, P = 0.37). To produce a model that contained main effects for treatment and no main effects for labeling, we removed the indicator variables for M labeling and P labeling, and replaced the overall indicator with an indicator for M treatment. The resulting likelihood-ratio test (against the model that contained main effects for both labeling and treatment) had a chi-squared statistic of 7.38, on 2 df (P = 0.025). In the absence of statistically significant interaction between labeling and treatment, it was appropriate to assess, separately, the effect of treatment and the effects of labeling. Table S10 shows the estimate, 95% CI, and P-value for each of these four contrasts. The difference between the two treatments was statistically significant (P <.001). Similarly, the difference between maxalt labeling and placebo labeling was significant (P =.013), and so was the difference between uncertain labeling and placebo labeling (P =.010). The analysis did not find the difference between uncertain and maxalt labeling significant (P =.923). 8

11 Analysis of available data. The available data set (i.e., no imputed values) comprised 894 observations on 66 subjects. The available-data estimates for the Constant and Cond2 through Cond7 are fairly close to the imputed-data estimates (table S11), as one would expect because the imputed values were all at 2.5 h. The slightly smaller imputed-data estimate for Time2 may reflect the lack of increase in 2 of the 3 imputed pain scores for NT. Conversely, the imputeddata estimates for the 6 product variables are higher (i.e., closer to 0) than the available-data estimates, most likely because 12 of the 15 imputed pain scores for the treated attacks were equal to the pain score at 30 min. (Part of the difference between these imputed-data and availabledata estimates offsets the difference in the estimates for Time2.) Biases may not be absent, but the small number of missing observations (a total of 30 out of 924, 3.2%) and the informative nature of most of the missingness suggest that they are not an important concern. Analyses of Secondary Endpoint. In analyzing pain-freedom status at 2.5 h, the outcome variable takes the values 1 (pain-free) and 0 (not pain-free). The analyses used mixed-effects logit models (the xtmelogit command in Stata). For this type of GLMM, the analysis is on the log-odds scale, the link function is the logit, g(µ) = log e [µ / (1 µ)], and the family of distributions for the random component is the binomial distributions. For the main analysis the linear model had the following form: g(µ ij ) = β 1 + β 2 Cond2 ij + β 3 Cond3 ij + β 4 Cond4 ij + β 5 Cond5 ij + β 6 Cond6 ij + β 7 Cond7 ij + b i The definitions of the subscripts and the indicator variables are the same as in the model for the pain scores. The coefficients, β 1 through β 7, and the random effect, b i, have no direct connection with their counterparts in the main model for the pain scores. We have simply followed the common practice of using the same family of symbols for the coefficients in all 9

12 models. It is helpful to keep in mind, however, that in any regression-like model the definition of each coefficient includes the list of all other explanatory variables in the model. Using the same procedure as in the analysis of the primary endpoint, we found no need to include any background characteristics as covariates in the model. Table S12 shows parameter estimates, standard errors, and 95% confidence intervals for the parameters in the main model. The estimates reported in the article are based on this model. In particular, table S13 shows the numerical values of the estimates and 95% CIs plotted in Fig. 4. As in the analysis of the primary endpoint, the aggregate effects of the interactions between labeling and treatment and the aggregate main effects of labeling were tested using a modified model. The likelihood-ratio test for the labeling/treatment interaction was not significant (chisquared = 1.51, df = 2, P = 0.47), and the likelihood-ratio test for the main effects of labeling (against the model that contained main effects for both labeling and treatment) was almost significant (chi-squared = 5.84, df = 2, P = 0.054). In the absence of statistically significant interaction between labeling and treatment, it was appropriate to assess, separately, the effect of treatment and the effects of labeling. Table S14 shows the estimate, 95% CI, and P-value for each of these four contrasts. As with the primary endpoint, the difference between the two treatments was statistically significant (P <.001). However, none of the pairwise differences between the types of labeling were significant. 10

13 Table S1. Reasons for excluding 19 of the prescreened subjects Subject A B C D E F G H I J K L M N O P Q R S Comment CM CM CDH, CM, self-administered demerol CM, had a pacemaker due to PFO (counter indication for triptans) CM, TTH CM, TTH CDH, fibromyalgia CDH, TTH, medication-overuse headache, chronic back pain, epilepsy CDH, fibromyalgia CDH CDH, was not interested after learning more details about the study CDH CDH, daily opioids CDH, daily opioids CDH, medication-overuse headache Non-migraine headache CM, counter indication to triptans EM plus fibromyalgia EM plus chronic back and knee pain, hypertension, impaired memory/comprehension CDH = chronic daily headache, CM = chronic migraine, EM = episodic migraine TTH = tension-type headache. 11

14 Table S2. Background characteristics of participants and dropouts Variable Participants (n = 66) Dropouts (n = 10) P value Age (years) 40.6 (12.7) 40.5 (11.5) 0.98 (1.00) a Age at first attack 18.2 (10.4) 15.7 (8.3) 0.40 (0.45) a Years since first attack 22.2 (12.9) 24.8 (10.0) 0.47 (0.50) a Attacks per month 5.4 (4.0) 4.8 (2.5) 0.56 (0.93) a Days per attack 1.2 (0.7) 1.9 (1.7) 0.22 (0.22) a Migraine days per month 5.6 (3.9) 7.9 (7.1) 0.33 (0.47) a Female/male (percentage ratio) 85/15 90/ b Visual aura (%) b Other pains (%) b Family history (%) b Triptan history (%) b Continuous variables are Mean (SD). parentheses are from rank-sum test. b Fisher s exact test. a Two-sample t-test, not assuming equal variances; P values in 12

15 Table S3. Selected quantiles of nondichotomous background characteristics of the 66 participants Variable Minimum 25th Percentile Median 75th Percentile Maximum Age (years) Age at first attack Years since first attack Attacks per month Days per attack Days per month

16 Table S4. Incidence of missing pain scores and data imputation Reason for missingness Number of subjects Missing 30 min Missing 2.5 h Total missing values Incomplete attacks Imputed pain scores/pain freedom Reported neither the 30-min nor the 2.5-h pain score Reported the 30-min pain score but not the 2.5-h pain score Fell asleep before and beyond the 2.5 h time Took rescue medications before the 2.5 h time a / 18 Totals 15* / 18 a The missing 2.5-h pain score was imputed as described in Materials and Methods. *Two of these subjects had missing data for two reasons. 14

17 Table S5. Structure of the eight treatment sequences and assignment of subjects to treatment sequences Treatment sequence a Treatment conditions Number of subjects Attack 1 Attack 2 Attack 3 Attack 4 Attack 5 Attack 6 Recruited Dropped out Analyzed 5 M M M P P M P P U M U P P M P P M M M P U M U P U M U P M M M P P M P P U M U P P M P P M M M P U P U M M P M M P P P M U P U M P P P M M P M M M P M M P P P M U P U M P P P M M P M M U P U M Totals The 6 pill/label combinations are abbreviated as follows: the first letter (in italic) denotes the label (M for Maxalt, P for Placebo, U for the unspecified Maxalt or Placebo ); the second letter (in color) denotes the actual pill (M for maxalt, P for placebo). a Sequence numbers correspond to the order they were entered in the GLMM analyses (cf. table S6). 15

18 Table S6. Distribution of attacks with imputed pain scores and pain freedom at 2.5 hours Treatment condition* (number of attacks with imputed pain scores / pain freedom) NT U M U P M M M P P M P P Imputed pain scores/pain freedom 3 / 3 1 / 1 3 / / 3 2 / 2 6 / 6 Experimental condition (j) j = 1 j = 2 j = 3 j = 4 j = 5 j = 6 j = 7 *NT, No treatment. For abbreviation of the 6 treatment conditions see footnote of table S5. 16

19 Table S7. Estimates of parameters in the main model for the pain scores including the imputed pain scores at 2.5 hours Variable Estimate Standard error 95% Confidence Interval Constant (NT) Cond2 (U M) Cond3 (U P) Cond4 (M M) Cond5 (M P) Cond6 (P M) Cond7 (P P) Time2 (2.5 h) Cond2 Time Cond3 Time Cond4 Time Cond5 Time Cond6 Time Cond7 Time Var(b i ) Residual variance NT, No treatment. For abbreviation of the 6 treatment conditions see footnote of table S5. 17

20 Table S8. Percentage decrease in the estimated average pain score from 30 min to 2.5 hours under the seven experimental conditions (from an analysis that included imputed pain scores at 2.5 hours) Attack Estimate 95% Confidence Interval Untreated P P U P M P P M U M M M For abbreviation of the 6 treatment conditions see footnote of table S5. 18

21 Table S9. Estimates of parameters in the covariate model for the pain scores including the imputed pain scores at 2.5 hours Variable Estimate Standard error 95% Confidence interval Constant (NT) Cond2 (U M) Cond3 (U P) Cond4 (M M) Cond5 (M P) Cond6 (P M) Cond7 (P P) Time2 (2.5 h) Cond2 Time Cond3 Time Cond4 Time Cond5 Time Cond6 Time Cond7 Time Age (years) Sex Migraine years Attacks/month Migraine days/mo Triptan history Family history Var(b i ) Residual variance NT, No treatment. For abbreviation of the 6 treatment conditions see footnote of table S5. 19

22 Table S10. Estimates of difference (2.5 hours minus 30 min) on the primary endpoint for key contrasts involving treatment and labeling Contrast Estimate 95% Confidence Interval P value Pill: Maxalt vs. placebo < Label: M vs. P Label: U vs. M Label: U vs. P Estimates and 95% CIs are in logarithmic units (base e). Labels: M, Maxalt; P, placebo; U, unspecified. 20

23 Table S11. Sensitivity analysis of the main model for the pain scores* Variable Estimate Standard error 95% Confidence interval Constant (NT) / / / / Cond2 (U M) / / / / Cond3 (U P) / / / / Cond4 (M M) / / / / Cond5 (M P) / / / / Cond6 (P M) / / / / Cond7 (P P) / / / / Time2 (2.5 h) / / / / Cond2 Time / / / / Cond3 Time / / / / Cond4 Time / / / / Cond5 Time / / / / Cond6 Time / / / / Cond7 Time / / / / Var(b i ) / / Residual variance / / *Values are based on the data without/with the imputed pain scores at 2.5 h. NT, No treatment. For abbreviation of the 6 treatment conditions see footnote of table S5. 21

24 Table S12. Estimates of parameters in the main model for pain freedom at 2.5 hours Variable Estimate Standard error 95% Confidence interval Constant (NT) Cond2 (U M) Cond3 (U P) Cond4 (M M) Cond5 (M P) Cond6 (P M) Cond7 (P P) Var(b i ) NT, No treatment. For abbreviation of the 6 treatment conditions see footnote of table S5. 22

25 Table S13. Estimated probability of being pain-free 2 hours after treatment under the seven experimental conditions (from an analysis that included imputed pain scores at 2.5 hours) Attack Estimate (%) 95% Confidence Interval Untreated P P U P M P P M U M M M For abbreviation of the 6 treatment conditions see footnote of table S5. 23

26 Table S14. Estimates of difference (2.5 hours minus 30 min) on the secondary endpoint for key contrasts involving treatment and labeling Contrast Estimate 95% Confidence Interval P value Pill: Maxalt vs. placebo < Label: M vs. P Label: U vs. M Label: U vs. P Estimates and 95% CIs are in units of log-odds (base e). Labels: M, Maxalt; P, placebo; U, unspecified. 24

Ron Heck, Fall Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October 20, 2011)

Ron Heck, Fall Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October 20, 2011) Ron Heck, Fall 2011 1 EDEP 768E: Seminar in Multilevel Modeling rev. January 3, 2012 (see footnote) Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October

More information

Chapter 19: Logistic regression

Chapter 19: Logistic regression Chapter 19: Logistic regression Self-test answers SELF-TEST Rerun this analysis using a stepwise method (Forward: LR) entry method of analysis. The main analysis To open the main Logistic Regression dialog

More information

Introducing Generalized Linear Models: Logistic Regression

Introducing Generalized Linear Models: Logistic Regression Ron Heck, Summer 2012 Seminars 1 Multilevel Regression Models and Their Applications Seminar Introducing Generalized Linear Models: Logistic Regression The generalized linear model (GLM) represents and

More information

UNIVERSITY OF TORONTO Faculty of Arts and Science

UNIVERSITY OF TORONTO Faculty of Arts and Science UNIVERSITY OF TORONTO Faculty of Arts and Science December 2013 Final Examination STA442H1F/2101HF Methods of Applied Statistics Jerry Brunner Duration - 3 hours Aids: Calculator Model(s): Any calculator

More information

Chapter 1 Statistical Inference

Chapter 1 Statistical Inference Chapter 1 Statistical Inference causal inference To infer causality, you need a randomized experiment (or a huge observational study and lots of outside information). inference to populations Generalizations

More information

McGill University. Faculty of Science MATH 204 PRINCIPLES OF STATISTICS II. Final Examination

McGill University. Faculty of Science MATH 204 PRINCIPLES OF STATISTICS II. Final Examination McGill University Faculty of Science MATH 204 PRINCIPLES OF STATISTICS II Final Examination Date: 20th April 2009 Time: 9am-2pm Examiner: Dr David A Stephens Associate Examiner: Dr Russell Steele Please

More information

Fractional Polynomial Regression

Fractional Polynomial Regression Chapter 382 Fractional Polynomial Regression Introduction This program fits fractional polynomial models in situations in which there is one dependent (Y) variable and one independent (X) variable. It

More information

8 Nominal and Ordinal Logistic Regression

8 Nominal and Ordinal Logistic Regression 8 Nominal and Ordinal Logistic Regression 8.1 Introduction If the response variable is categorical, with more then two categories, then there are two options for generalized linear models. One relies on

More information

Name: Biostatistics 1 st year Comprehensive Examination: Applied in-class exam. June 8 th, 2016: 9am to 1pm

Name: Biostatistics 1 st year Comprehensive Examination: Applied in-class exam. June 8 th, 2016: 9am to 1pm Name: Biostatistics 1 st year Comprehensive Examination: Applied in-class exam June 8 th, 2016: 9am to 1pm Instructions: 1. This is exam is to be completed independently. Do not discuss your work with

More information

Faculty of Health Sciences. Regression models. Counts, Poisson regression, Lene Theil Skovgaard. Dept. of Biostatistics

Faculty of Health Sciences. Regression models. Counts, Poisson regression, Lene Theil Skovgaard. Dept. of Biostatistics Faculty of Health Sciences Regression models Counts, Poisson regression, 27-5-2013 Lene Theil Skovgaard Dept. of Biostatistics 1 / 36 Count outcome PKA & LTS, Sect. 7.2 Poisson regression The Binomial

More information

Model Based Statistics in Biology. Part V. The Generalized Linear Model. Chapter 18.1 Logistic Regression (Dose - Response)

Model Based Statistics in Biology. Part V. The Generalized Linear Model. Chapter 18.1 Logistic Regression (Dose - Response) Model Based Statistics in Biology. Part V. The Generalized Linear Model. Logistic Regression ( - Response) ReCap. Part I (Chapters 1,2,3,4), Part II (Ch 5, 6, 7) ReCap Part III (Ch 9, 10, 11), Part IV

More information

Turning a research question into a statistical question.

Turning a research question into a statistical question. Turning a research question into a statistical question. IGINAL QUESTION: Concept Concept Concept ABOUT ONE CONCEPT ABOUT RELATIONSHIPS BETWEEN CONCEPTS TYPE OF QUESTION: DESCRIBE what s going on? DECIDE

More information

Statistical tables are attached Two Hours UNIVERSITY OF MANCHESTER. May 2007 Final Draft

Statistical tables are attached Two Hours UNIVERSITY OF MANCHESTER. May 2007 Final Draft Statistical tables are attached wo Hours UNIVERSIY OF MNHESER Ma 7 Final Draft Medical Statistics M377 Electronic calculators ma be used provided that the cannot store tet nswer LL si questions in SEION

More information

Item Reliability Analysis

Item Reliability Analysis Item Reliability Analysis Revised: 10/11/2017 Summary... 1 Data Input... 4 Analysis Options... 5 Tables and Graphs... 5 Analysis Summary... 6 Matrix Plot... 8 Alpha Plot... 10 Correlation Matrix... 11

More information

1 Introduction to Minitab

1 Introduction to Minitab 1 Introduction to Minitab Minitab is a statistical analysis software package. The software is freely available to all students and is downloadable through the Technology Tab at my.calpoly.edu. When you

More information

Recent Developments in Multilevel Modeling

Recent Developments in Multilevel Modeling Recent Developments in Multilevel Modeling Roberto G. Gutierrez Director of Statistics StataCorp LP 2007 North American Stata Users Group Meeting, Boston R. Gutierrez (StataCorp) Multilevel Modeling August

More information

Generalized linear models

Generalized linear models Generalized linear models Douglas Bates November 01, 2010 Contents 1 Definition 1 2 Links 2 3 Estimating parameters 5 4 Example 6 5 Model building 8 6 Conclusions 8 7 Summary 9 1 Generalized Linear Models

More information

STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis. 1. Indicate whether each of the following is true (T) or false (F).

STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis. 1. Indicate whether each of the following is true (T) or false (F). STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis 1. Indicate whether each of the following is true (T) or false (F). (a) T In 2 2 tables, statistical independence is equivalent to a population

More information

Multinomial Logistic Regression Models

Multinomial Logistic Regression Models Stat 544, Lecture 19 1 Multinomial Logistic Regression Models Polytomous responses. Logistic regression can be extended to handle responses that are polytomous, i.e. taking r>2 categories. (Note: The word

More information

1 A Review of Correlation and Regression

1 A Review of Correlation and Regression 1 A Review of Correlation and Regression SW, Chapter 12 Suppose we select n = 10 persons from the population of college seniors who plan to take the MCAT exam. Each takes the test, is coached, and then

More information

Sleep data, two drugs Ch13.xls

Sleep data, two drugs Ch13.xls Model Based Statistics in Biology. Part IV. The General Linear Mixed Model.. Chapter 13.3 Fixed*Random Effects (Paired t-test) ReCap. Part I (Chapters 1,2,3,4), Part II (Ch 5, 6, 7) ReCap Part III (Ch

More information

Investigating Models with Two or Three Categories

Investigating Models with Two or Three Categories Ronald H. Heck and Lynn N. Tabata 1 Investigating Models with Two or Three Categories For the past few weeks we have been working with discriminant analysis. Let s now see what the same sort of model might

More information

STAT 501 Assignment 2 NAME Spring Chapter 5, and Sections in Johnson & Wichern.

STAT 501 Assignment 2 NAME Spring Chapter 5, and Sections in Johnson & Wichern. STAT 01 Assignment NAME Spring 00 Reading Assignment: Written Assignment: Chapter, and Sections 6.1-6.3 in Johnson & Wichern. Due Monday, February 1, in class. You should be able to do the first four problems

More information

Review of One-way Tables and SAS

Review of One-way Tables and SAS Stat 504, Lecture 7 1 Review of One-way Tables and SAS In-class exercises: Ex1, Ex2, and Ex3 from http://v8doc.sas.com/sashtml/proc/z0146708.htm To calculate p-value for a X 2 or G 2 in SAS: http://v8doc.sas.com/sashtml/lgref/z0245929.htmz0845409

More information

STAT 7030: Categorical Data Analysis

STAT 7030: Categorical Data Analysis STAT 7030: Categorical Data Analysis 5. Logistic Regression Peng Zeng Department of Mathematics and Statistics Auburn University Fall 2012 Peng Zeng (Auburn University) STAT 7030 Lecture Notes Fall 2012

More information

Simple Linear Regression: One Quantitative IV

Simple Linear Regression: One Quantitative IV Simple Linear Regression: One Quantitative IV Linear regression is frequently used to explain variation observed in a dependent variable (DV) with theoretically linked independent variables (IV). For example,

More information

McGill University. Faculty of Science MATH 204 PRINCIPLES OF STATISTICS II. Final Examination

McGill University. Faculty of Science MATH 204 PRINCIPLES OF STATISTICS II. Final Examination McGill University Faculty of Science MATH 204 PRINCIPLES OF STATISTICS II Final Examination Date: 23rd April 2007 Time: 2pm-5pm Examiner: Dr. David A. Stephens Associate Examiner: Dr. Russell Steele Please

More information

STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis. 1. Indicate whether each of the following is true (T) or false (F).

STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis. 1. Indicate whether each of the following is true (T) or false (F). STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis 1. Indicate whether each of the following is true (T) or false (F). (a) (b) (c) (d) (e) In 2 2 tables, statistical independence is equivalent

More information

MULTILEVEL IMPUTATION 1

MULTILEVEL IMPUTATION 1 MULTILEVEL IMPUTATION 1 Supplement B: MCMC Sampling Steps and Distributions for Two-Level Imputation This document gives technical details of the full conditional distributions used to draw regression

More information

BINF 702 SPRING Chapter 8 Hypothesis Testing: Two-Sample Inference. BINF702 SPRING 2014 Chapter 8 Hypothesis Testing: Two- Sample Inference 1

BINF 702 SPRING Chapter 8 Hypothesis Testing: Two-Sample Inference. BINF702 SPRING 2014 Chapter 8 Hypothesis Testing: Two- Sample Inference 1 BINF 702 SPRING 2014 Chapter 8 Hypothesis Testing: Two-Sample Inference Two- Sample Inference 1 A Poster Child for two-sample hypothesis testing Ex 8.1 Obstetrics In the birthweight data in Example 7.2,

More information

One-Way ANOVA. Some examples of when ANOVA would be appropriate include:

One-Way ANOVA. Some examples of when ANOVA would be appropriate include: One-Way ANOVA 1. Purpose Analysis of variance (ANOVA) is used when one wishes to determine whether two or more groups (e.g., classes A, B, and C) differ on some outcome of interest (e.g., an achievement

More information

Analysis of 2x2 Cross-Over Designs using T-Tests

Analysis of 2x2 Cross-Over Designs using T-Tests Chapter 234 Analysis of 2x2 Cross-Over Designs using T-Tests Introduction This procedure analyzes data from a two-treatment, two-period (2x2) cross-over design. The response is assumed to be a continuous

More information

Investigation into the use of confidence indicators with calibration

Investigation into the use of confidence indicators with calibration WORKSHOP ON FRONTIERS IN BENCHMARKING TECHNIQUES AND THEIR APPLICATION TO OFFICIAL STATISTICS 7 8 APRIL 2005 Investigation into the use of confidence indicators with calibration Gerard Keogh and Dave Jennings

More information

Review of Statistics 101

Review of Statistics 101 Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods

More information

MATH Notebook 3 Spring 2018

MATH Notebook 3 Spring 2018 MATH448001 Notebook 3 Spring 2018 prepared by Professor Jenny Baglivo c Copyright 2010 2018 by Jenny A. Baglivo. All Rights Reserved. 3 MATH448001 Notebook 3 3 3.1 One Way Layout........................................

More information

A multivariate multilevel model for the analysis of TIMMS & PIRLS data

A multivariate multilevel model for the analysis of TIMMS & PIRLS data A multivariate multilevel model for the analysis of TIMMS & PIRLS data European Congress of Methodology July 23-25, 2014 - Utrecht Leonardo Grilli 1, Fulvia Pennoni 2, Carla Rampichini 1, Isabella Romeo

More information

Section 4.6 Simple Linear Regression

Section 4.6 Simple Linear Regression Section 4.6 Simple Linear Regression Objectives ˆ Basic philosophy of SLR and the regression assumptions ˆ Point & interval estimation of the model parameters, and how to make predictions ˆ Point and interval

More information

LAB 5 INSTRUCTIONS LINEAR REGRESSION AND CORRELATION

LAB 5 INSTRUCTIONS LINEAR REGRESSION AND CORRELATION LAB 5 INSTRUCTIONS LINEAR REGRESSION AND CORRELATION In this lab you will learn how to use Excel to display the relationship between two quantitative variables, measure the strength and direction of the

More information

Additional Problems Additional Problem 1 Like the http://www.stat.umn.edu/geyer/5102/examp/rlike.html#lmax example of maximum likelihood done by computer except instead of the gamma shape model, we will

More information

Class Notes: Week 8. Probit versus Logit Link Functions and Count Data

Class Notes: Week 8. Probit versus Logit Link Functions and Count Data Ronald Heck Class Notes: Week 8 1 Class Notes: Week 8 Probit versus Logit Link Functions and Count Data This week we ll take up a couple of issues. The first is working with a probit link function. While

More information

y response variable x 1, x 2,, x k -- a set of explanatory variables

y response variable x 1, x 2,, x k -- a set of explanatory variables 11. Multiple Regression and Correlation y response variable x 1, x 2,, x k -- a set of explanatory variables In this chapter, all variables are assumed to be quantitative. Chapters 12-14 show how to incorporate

More information

Introduction to GSEM in Stata

Introduction to GSEM in Stata Introduction to GSEM in Stata Christopher F Baum ECON 8823: Applied Econometrics Boston College, Spring 2016 Christopher F Baum (BC / DIW) Introduction to GSEM in Stata Boston College, Spring 2016 1 /

More information

McGill University. Faculty of Science. Department of Mathematics and Statistics. Statistics Part A Comprehensive Exam Methodology Paper

McGill University. Faculty of Science. Department of Mathematics and Statistics. Statistics Part A Comprehensive Exam Methodology Paper Student Name: ID: McGill University Faculty of Science Department of Mathematics and Statistics Statistics Part A Comprehensive Exam Methodology Paper Date: Friday, May 13, 2016 Time: 13:00 17:00 Instructions

More information

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages: Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the

More information

Confidence intervals

Confidence intervals Confidence intervals We now want to take what we ve learned about sampling distributions and standard errors and construct confidence intervals. What are confidence intervals? Simply an interval for which

More information

MMWS Software Program Manual

MMWS Software Program Manual MMWS Software Program Manual 1 Software Development The MMWS program is regularly updated. The latest beta version can be downloaded from http://hlmsoft.net/ghong/ MMWS Click here to get MMWS. For a sample

More information

Lab 3: Two levels Poisson models (taken from Multilevel and Longitudinal Modeling Using Stata, p )

Lab 3: Two levels Poisson models (taken from Multilevel and Longitudinal Modeling Using Stata, p ) Lab 3: Two levels Poisson models (taken from Multilevel and Longitudinal Modeling Using Stata, p. 376-390) BIO656 2009 Goal: To see if a major health-care reform which took place in 1997 in Germany was

More information

B. Weaver (24-Mar-2005) Multiple Regression Chapter 5: Multiple Regression Y ) (5.1) Deviation score = (Y i

B. Weaver (24-Mar-2005) Multiple Regression Chapter 5: Multiple Regression Y ) (5.1) Deviation score = (Y i B. Weaver (24-Mar-2005) Multiple Regression... 1 Chapter 5: Multiple Regression 5.1 Partial and semi-partial correlation Before starting on multiple regression per se, we need to consider the concepts

More information

Ronald Heck Week 14 1 EDEP 768E: Seminar in Categorical Data Modeling (F2012) Nov. 17, 2012

Ronald Heck Week 14 1 EDEP 768E: Seminar in Categorical Data Modeling (F2012) Nov. 17, 2012 Ronald Heck Week 14 1 From Single Level to Multilevel Categorical Models This week we develop a two-level model to examine the event probability for an ordinal response variable with three categories (persist

More information

Subject CS1 Actuarial Statistics 1 Core Principles

Subject CS1 Actuarial Statistics 1 Core Principles Institute of Actuaries of India Subject CS1 Actuarial Statistics 1 Core Principles For 2019 Examinations Aim The aim of the Actuarial Statistics 1 subject is to provide a grounding in mathematical and

More information

Outline. Mixed models in R using the lme4 package Part 5: Generalized linear mixed models. Parts of LMMs carried over to GLMMs

Outline. Mixed models in R using the lme4 package Part 5: Generalized linear mixed models. Parts of LMMs carried over to GLMMs Outline Mixed models in R using the lme4 package Part 5: Generalized linear mixed models Douglas Bates University of Wisconsin - Madison and R Development Core Team UseR!2009,

More information

Estimating and Testing the US Model 8.1 Introduction

Estimating and Testing the US Model 8.1 Introduction 8 Estimating and Testing the US Model 8.1 Introduction The previous chapter discussed techniques for estimating and testing complete models, and this chapter applies these techniques to the US model. For

More information

Ordinal Independent Variables Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised April 9, 2017

Ordinal Independent Variables Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised April 9, 2017 Ordinal Independent Variables Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised April 9, 2017 References: Paper 248 2009, Learning When to Be Discrete: Continuous

More information

Lecture Outline. Biost 518 Applied Biostatistics II. Choice of Model for Analysis. Choice of Model. Choice of Model. Lecture 10: Multiple Regression:

Lecture Outline. Biost 518 Applied Biostatistics II. Choice of Model for Analysis. Choice of Model. Choice of Model. Lecture 10: Multiple Regression: Biost 518 Applied Biostatistics II Scott S. Emerson, M.D., Ph.D. Professor of Biostatistics University of Washington Lecture utline Choice of Model Alternative Models Effect of data driven selection of

More information

SPSS Guide For MMI 409

SPSS Guide For MMI 409 SPSS Guide For MMI 409 by John Wong March 2012 Preface Hopefully, this document can provide some guidance to MMI 409 students on how to use SPSS to solve many of the problems covered in the D Agostino

More information

SCHOOL OF MATHEMATICS AND STATISTICS. Linear and Generalised Linear Models

SCHOOL OF MATHEMATICS AND STATISTICS. Linear and Generalised Linear Models SCHOOL OF MATHEMATICS AND STATISTICS Linear and Generalised Linear Models Autumn Semester 2017 18 2 hours Attempt all the questions. The allocation of marks is shown in brackets. RESTRICTED OPEN BOOK EXAMINATION

More information

NCSS Statistical Software. Harmonic Regression. This section provides the technical details of the model that is fit by this procedure.

NCSS Statistical Software. Harmonic Regression. This section provides the technical details of the model that is fit by this procedure. Chapter 460 Introduction This program calculates the harmonic regression of a time series. That is, it fits designated harmonics (sinusoidal terms of different wavelengths) using our nonlinear regression

More information

9 Correlation and Regression

9 Correlation and Regression 9 Correlation and Regression SW, Chapter 12. Suppose we select n = 10 persons from the population of college seniors who plan to take the MCAT exam. Each takes the test, is coached, and then retakes the

More information

Hotelling s One- Sample T2

Hotelling s One- Sample T2 Chapter 405 Hotelling s One- Sample T2 Introduction The one-sample Hotelling s T2 is the multivariate extension of the common one-sample or paired Student s t-test. In a one-sample t-test, the mean response

More information

Structural Equation Modeling Lab 5 In Class Modification Indices Example

Structural Equation Modeling Lab 5 In Class Modification Indices Example Structural Equation Modeling Lab 5 In Class Modification Indices Example. Model specifications sntax TI Modification Indices DA NI=0 NO=0 MA=CM RA FI='E:\Teaching\SEM S09\Lab 5\jsp6.psf' SE 7 6 5 / MO

More information

Keppel, G. & Wickens, T. D. Design and Analysis Chapter 4: Analytical Comparisons Among Treatment Means

Keppel, G. & Wickens, T. D. Design and Analysis Chapter 4: Analytical Comparisons Among Treatment Means Keppel, G. & Wickens, T. D. Design and Analysis Chapter 4: Analytical Comparisons Among Treatment Means 4.1 The Need for Analytical Comparisons...the between-groups sum of squares averages the differences

More information

SPSS LAB FILE 1

SPSS LAB FILE  1 SPSS LAB FILE www.mcdtu.wordpress.com 1 www.mcdtu.wordpress.com 2 www.mcdtu.wordpress.com 3 OBJECTIVE 1: Transporation of Data Set to SPSS Editor INPUTS: Files: group1.xlsx, group1.txt PROCEDURE FOLLOWED:

More information

Advanced Quantitative Data Analysis

Advanced Quantitative Data Analysis Chapter 24 Advanced Quantitative Data Analysis Daniel Muijs Doing Regression Analysis in SPSS When we want to do regression analysis in SPSS, we have to go through the following steps: 1 As usual, we choose

More information

Ron Heck, Fall Week 3: Notes Building a Two-Level Model

Ron Heck, Fall Week 3: Notes Building a Two-Level Model Ron Heck, Fall 2011 1 EDEP 768E: Seminar on Multilevel Modeling rev. 9/6/2011@11:27pm Week 3: Notes Building a Two-Level Model We will build a model to explain student math achievement using student-level

More information

Logistic Regression. James H. Steiger. Department of Psychology and Human Development Vanderbilt University

Logistic Regression. James H. Steiger. Department of Psychology and Human Development Vanderbilt University Logistic Regression James H. Steiger Department of Psychology and Human Development Vanderbilt University James H. Steiger (Vanderbilt University) Logistic Regression 1 / 38 Logistic Regression 1 Introduction

More information

AP Statistics Cumulative AP Exam Study Guide

AP Statistics Cumulative AP Exam Study Guide AP Statistics Cumulative AP Eam Study Guide Chapters & 3 - Graphs Statistics the science of collecting, analyzing, and drawing conclusions from data. Descriptive methods of organizing and summarizing statistics

More information

Logistic Regression Models to Integrate Actuarial and Psychological Risk Factors For predicting 5- and 10-Year Sexual and Violent Recidivism Rates

Logistic Regression Models to Integrate Actuarial and Psychological Risk Factors For predicting 5- and 10-Year Sexual and Violent Recidivism Rates Logistic Regression Models to Integrate Actuarial and Psychological Risk Factors For predicting 5- and 10-Year Sexual and Violent Recidivism Rates WI-ATSA June 2-3, 2016 Overview Brief description of logistic

More information

Mixed Models for Longitudinal Ordinal and Nominal Outcomes

Mixed Models for Longitudinal Ordinal and Nominal Outcomes Mixed Models for Longitudinal Ordinal and Nominal Outcomes Don Hedeker Department of Public Health Sciences Biological Sciences Division University of Chicago hedeker@uchicago.edu Hedeker, D. (2008). Multilevel

More information

Black White Total Observed Expected χ 2 = (f observed f expected ) 2 f expected (83 126) 2 ( )2 126

Black White Total Observed Expected χ 2 = (f observed f expected ) 2 f expected (83 126) 2 ( )2 126 Psychology 60 Fall 2013 Practice Final Actual Exam: This Wednesday. Good luck! Name: To view the solutions, check the link at the end of the document. This practice final should supplement your studying;

More information

Mixed models in R using the lme4 package Part 7: Generalized linear mixed models

Mixed models in R using the lme4 package Part 7: Generalized linear mixed models Mixed models in R using the lme4 package Part 7: Generalized linear mixed models Douglas Bates University of Wisconsin - Madison and R Development Core Team University of

More information

Generalized Linear. Mixed Models. Methods and Applications. Modern Concepts, Walter W. Stroup. Texts in Statistical Science.

Generalized Linear. Mixed Models. Methods and Applications. Modern Concepts, Walter W. Stroup. Texts in Statistical Science. Texts in Statistical Science Generalized Linear Mixed Models Modern Concepts, Methods and Applications Walter W. Stroup CRC Press Taylor & Francis Croup Boca Raton London New York CRC Press is an imprint

More information

Chapter Seven: Multi-Sample Methods 1/52

Chapter Seven: Multi-Sample Methods 1/52 Chapter Seven: Multi-Sample Methods 1/52 7.1 Introduction 2/52 Introduction The independent samples t test and the independent samples Z test for a difference between proportions are designed to analyze

More information

STAT 525 Fall Final exam. Tuesday December 14, 2010

STAT 525 Fall Final exam. Tuesday December 14, 2010 STAT 525 Fall 2010 Final exam Tuesday December 14, 2010 Time: 2 hours Name (please print): Show all your work and calculations. Partial credit will be given for work that is partially correct. Points will

More information

Lecture 12: Effect modification, and confounding in logistic regression

Lecture 12: Effect modification, and confounding in logistic regression Lecture 12: Effect modification, and confounding in logistic regression Ani Manichaikul amanicha@jhsph.edu 4 May 2007 Today Categorical predictor create dummy variables just like for linear regression

More information

THE ROYAL STATISTICAL SOCIETY 2015 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE MODULE 3

THE ROYAL STATISTICAL SOCIETY 2015 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE MODULE 3 THE ROYAL STATISTICAL SOCIETY 015 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE MODULE 3 The Society is providing these solutions to assist candidates preparing for the examinations in 017. The solutions are

More information

" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2

 M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2 Notation and Equations for Final Exam Symbol Definition X The variable we measure in a scientific study n The size of the sample N The size of the population M The mean of the sample µ The mean of the

More information

An Approximate Test for Homogeneity of Correlated Correlation Coefficients

An Approximate Test for Homogeneity of Correlated Correlation Coefficients Quality & Quantity 37: 99 110, 2003. 2003 Kluwer Academic Publishers. Printed in the Netherlands. 99 Research Note An Approximate Test for Homogeneity of Correlated Correlation Coefficients TRIVELLORE

More information

Additional Notes: Investigating a Random Slope. When we have fixed level-1 predictors at level 2 we show them like this:

Additional Notes: Investigating a Random Slope. When we have fixed level-1 predictors at level 2 we show them like this: Ron Heck, Summer 01 Seminars 1 Multilevel Regression Models and Their Applications Seminar Additional Notes: Investigating a Random Slope We can begin with Model 3 and add a Random slope parameter. If

More information

PAPER 218 STATISTICAL LEARNING IN PRACTICE

PAPER 218 STATISTICAL LEARNING IN PRACTICE MATHEMATICAL TRIPOS Part III Thursday, 7 June, 2018 9:00 am to 12:00 pm PAPER 218 STATISTICAL LEARNING IN PRACTICE Attempt no more than FOUR questions. There are SIX questions in total. The questions carry

More information

Mixed models in R using the lme4 package Part 5: Generalized linear mixed models

Mixed models in R using the lme4 package Part 5: Generalized linear mixed models Mixed models in R using the lme4 package Part 5: Generalized linear mixed models Douglas Bates Madison January 11, 2011 Contents 1 Definition 1 2 Links 2 3 Example 7 4 Model building 9 5 Conclusions 14

More information

Logistic regression: Why we often can do what we think we can do. Maarten Buis 19 th UK Stata Users Group meeting, 10 Sept. 2015

Logistic regression: Why we often can do what we think we can do. Maarten Buis 19 th UK Stata Users Group meeting, 10 Sept. 2015 Logistic regression: Why we often can do what we think we can do Maarten Buis 19 th UK Stata Users Group meeting, 10 Sept. 2015 1 Introduction Introduction - In 2010 Carina Mood published an overview article

More information

CHL 5225 H Crossover Trials. CHL 5225 H Crossover Trials

CHL 5225 H Crossover Trials. CHL 5225 H Crossover Trials CHL 55 H Crossover Trials The Two-sequence, Two-Treatment, Two-period Crossover Trial Definition A trial in which patients are randomly allocated to one of two sequences of treatments (either 1 then, or

More information

Compare Predicted Counts between Groups of Zero Truncated Poisson Regression Model based on Recycled Predictions Method

Compare Predicted Counts between Groups of Zero Truncated Poisson Regression Model based on Recycled Predictions Method Compare Predicted Counts between Groups of Zero Truncated Poisson Regression Model based on Recycled Predictions Method Yan Wang 1, Michael Ong 2, Honghu Liu 1,2,3 1 Department of Biostatistics, UCLA School

More information

INTRODUCTION TO ANALYSIS OF VARIANCE

INTRODUCTION TO ANALYSIS OF VARIANCE CHAPTER 22 INTRODUCTION TO ANALYSIS OF VARIANCE Chapter 18 on inferences about population means illustrated two hypothesis testing situations: for one population mean and for the difference between two

More information

Exam Applied Statistical Regression. Good Luck!

Exam Applied Statistical Regression. Good Luck! Dr. M. Dettling Summer 2011 Exam Applied Statistical Regression Approved: Tables: Note: Any written material, calculator (without communication facility). Attached. All tests have to be done at the 5%-level.

More information

Lecture 14: Introduction to Poisson Regression

Lecture 14: Introduction to Poisson Regression Lecture 14: Introduction to Poisson Regression Ani Manichaikul amanicha@jhsph.edu 8 May 2007 1 / 52 Overview Modelling counts Contingency tables Poisson regression models 2 / 52 Modelling counts I Why

More information

Modelling counts. Lecture 14: Introduction to Poisson Regression. Overview

Modelling counts. Lecture 14: Introduction to Poisson Regression. Overview Modelling counts I Lecture 14: Introduction to Poisson Regression Ani Manichaikul amanicha@jhsph.edu Why count data? Number of traffic accidents per day Mortality counts in a given neighborhood, per week

More information

Multilevel Statistical Models: 3 rd edition, 2003 Contents

Multilevel Statistical Models: 3 rd edition, 2003 Contents Multilevel Statistical Models: 3 rd edition, 2003 Contents Preface Acknowledgements Notation Two and three level models. A general classification notation and diagram Glossary Chapter 1 An introduction

More information

Business Analytics and Data Mining Modeling Using R Prof. Gaurav Dixit Department of Management Studies Indian Institute of Technology, Roorkee

Business Analytics and Data Mining Modeling Using R Prof. Gaurav Dixit Department of Management Studies Indian Institute of Technology, Roorkee Business Analytics and Data Mining Modeling Using R Prof. Gaurav Dixit Department of Management Studies Indian Institute of Technology, Roorkee Lecture - 04 Basic Statistics Part-1 (Refer Slide Time: 00:33)

More information

Exercise 5.4 Solution

Exercise 5.4 Solution Exercise 5.4 Solution Niels Richard Hansen University of Copenhagen May 7, 2010 1 5.4(a) > leukemia

More information

Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing

Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing 1 In most statistics problems, we assume that the data have been generated from some unknown probability distribution. We desire

More information

Lecture 26: Chapter 10, Section 2 Inference for Quantitative Variable Confidence Interval with t

Lecture 26: Chapter 10, Section 2 Inference for Quantitative Variable Confidence Interval with t Lecture 26: Chapter 10, Section 2 Inference for Quantitative Variable Confidence Interval with t t Confidence Interval for Population Mean Comparing z and t Confidence Intervals When neither z nor t Applies

More information

Tentative solutions TMA4255 Applied Statistics 16 May, 2015

Tentative solutions TMA4255 Applied Statistics 16 May, 2015 Norwegian University of Science and Technology Department of Mathematical Sciences Page of 9 Tentative solutions TMA455 Applied Statistics 6 May, 05 Problem Manufacturer of fertilizers a) Are these independent

More information

Final Exam. Question 1 (20 points) 2 (25 points) 3 (30 points) 4 (25 points) 5 (10 points) 6 (40 points) Total (150 points) Bonus question (10)

Final Exam. Question 1 (20 points) 2 (25 points) 3 (30 points) 4 (25 points) 5 (10 points) 6 (40 points) Total (150 points) Bonus question (10) Name Economics 170 Spring 2004 Honor pledge: I have neither given nor received aid on this exam including the preparation of my one page formula list and the preparation of the Stata assignment for the

More information

This gives us an upper and lower bound that capture our population mean.

This gives us an upper and lower bound that capture our population mean. Confidence Intervals Critical Values Practice Problems 1 Estimation 1.1 Confidence Intervals Definition 1.1 Margin of error. The margin of error of a distribution is the amount of error we predict when

More information

Design and Analysis of Experiments. David Yanez Department of Biostatistics University of Washington

Design and Analysis of Experiments. David Yanez Department of Biostatistics University of Washington Design and Analysis of Experiments David Yanez Department of Biostatistics University of Washington Outline Basic Ideas Definitions Structures of an Experimental Design Design Structure Treatment Structure

More information

Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis"

Ninth ARTNeT Capacity Building Workshop for Trade Research Trade Flows and Trade Policy Analysis Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis" June 2013 Bangkok, Thailand Cosimo Beverelli and Rainer Lanz (World Trade Organization) 1 Selected econometric

More information

Non-Gaussian Response Variables

Non-Gaussian Response Variables Non-Gaussian Response Variables What is the Generalized Model Doing? The fixed effects are like the factors in a traditional analysis of variance or linear model The random effects are different A generalized

More information

MIXED MODELS FOR REPEATED (LONGITUDINAL) DATA PART 2 DAVID C. HOWELL 4/1/2010

MIXED MODELS FOR REPEATED (LONGITUDINAL) DATA PART 2 DAVID C. HOWELL 4/1/2010 MIXED MODELS FOR REPEATED (LONGITUDINAL) DATA PART 2 DAVID C. HOWELL 4/1/2010 Part 1 of this document can be found at http://www.uvm.edu/~dhowell/methods/supplements/mixed Models for Repeated Measures1.pdf

More information

Extensions of One-Way ANOVA.

Extensions of One-Way ANOVA. Extensions of One-Way ANOVA http://www.pelagicos.net/classes_biometry_fa18.htm What do I want You to Know What are two main limitations of ANOVA? What two approaches can follow a significant ANOVA? How

More information