Performance of AIC-Selected Spatial Covariance Structures for f MRI Data

Size: px
Start display at page:

Download "Performance of AIC-Selected Spatial Covariance Structures for f MRI Data"

Transcription

1 Brigham Young University BYU ScholarsArchive All Theses and Dissertations Performance of AIC-Selected Spatial Covariance Structures for f MRI Data David A. Stromberg Brigham Young University - Provo Follow this and additional works at: Part of the Statistics and Probability Commons BYU ScholarsArchive Citation Stromberg, David A., "Performance of AIC-Selected Spatial Covariance Structures for fmri Data" (2005). All Theses and Dissertations This Selected Project is brought to you for free and open access by BYU ScholarsArchive. It has been accepted for inclusion in All Theses and Dissertations by an authorized administrator of BYU ScholarsArchive. For more information, please contact scholarsarchive@byu.edu, ellen_amatangelo@byu.edu.

2 PERFORMANCE OF AIC-SELECTED SPATIAL COVARIANCE STRUCTURES FOR FMRI DATA by David A. Stromberg A Project submitted to the faculty of Brigham Young University in partial fulfillment of the requirements for the degree of Master of Science Department of Statistics Brigham Young University July 2005

3

4 BRIGHAM YOUNG UNIVERSITY GRADUATE COMMITTEE APPROVAL of a Project submitted by David A. Stromberg This Project has been read by each member of the following graduate committee and by majority vote has been found to be satisfactory. Date William F. Christensen, Chair Date Gilbert W. Fellingham Date G. Bruce Schaalje

5

6 BRIGHAM YOUNG UNIVERSITY As chair of the candidate s graduate committee, I have read the Project of David A. Stromberg in its final form and have found that (1) its format, citations, and bibliographical style are consistent and acceptable and fulfill university and department style requirements; (2) its illustrative materials including figures, tables, and charts are in place; and (3) the final manuscript is satisfactory to the graduate committee and is ready for submission to the university library. Date William F. Christensen Chair, Graduate Committee Accepted for the Department G. Bruce Schaalje Graduate Coordinator Accepted for the College G. Rex Bryce Associate Dean, College of Physical and Mathematical Sciences

7

8 ABSTRACT PERFORMANCE OF AIC-SELECTED SPATIAL COVARIANCE STRUCTURES FOR FMRI DATA David A. Stromberg Department of Statistics Master of Science FMRI datasets allow scientists to assess functionality of the brain by measuring the response of blood flow to a stimulus. Since the responses from neighboring locations within the brain are correlated, simple linear models that assume independence of measurements across locations are inadequate. Mixed models can be used to model the spatial correlation between observations, however selecting the correct covariance structure is difficult. Information criteria, such as AIC are often used to choose among covariance structures. Once the covariance structure is selected, significance tests can be used to determine if a region of interest within the brain is significantly active. Through the use of simulations, this project explores the performance of AIC in selecting the covariance structure. Type I error rates are presented for the fixed effects using the the AIC chosen covariance structure. Power of the fixed effects are also discussed.

9

10 Acknowledgements I want to express my thanks to Dr. William Christensen for the many hours of work and consultation he invested with me on this project which ultimately lead to the attainment of my Master s degree. I would also like to thank all those who contributed to the progression of the project. I want to thank my family for their constant support and reassurance. I would have never made it thus far without the care and guidance of a loving mother, and never succeeded in my Master s program without a loving, and supportive wife. Most of all I want to express my gratitude to my Heavenly Father for His abundant goodness in allowing me so many wonderful opportunities such as the privilege of studying and this university.

11

12 Contents Chapter 1 Introduction 1 2 Review of Literature fmri Spatial Statistics Spatial Modeling Stationarity Isotropy Linear Mixed Models Selection of Covariance Structure Type I Error Rates Methods Data Generation Modeling the Data xi

13 4 Results Performance of AIC Type I Error Rates Power Summary Data Analysis Covariance Structure Selection Test of Significance Further Considerations Independence of Responses Across Treatments Estimating Covariance Parameters from Stationary Responses Appendix A Code Appendix 38 A.1 SAS Code for Simulations A.2 S-Plus Code for Estimation and Graphing A.3 SAS Code for Tabulating Results A.4 SAS Code for Actual fmri Analysis Bibliography 63 xii

14 Tables Table 3.1 Selected Spatial Covariance Structures. d i,j is the distance between two points, and 1 (di,j =0) is an indicator function for distance zero Covariance Parameters used to Generate Data Percents of AIC-Selected Covariance Structures Rejection Rates Based on AIC-Selected Structure Rejection Rates for Power Analysis Based on AIC-Selected Structure Least-Squares means for treatment by region levels Covariance parameter estimates from SAS xiii

15 xiv

16 Figures Figure 2.1 Hemodynamic response curve for an active voxel when using the sfmri method Generated data voxel locations within a slice. The auditory cortex is the shaded region Semi-Variograms used to generate the simulated data. Dots represent the empirical Semi-Variogram Log-Eigenvalues of the Covariance Matrices xv

17 xvi

18 Chapter 1 Introduction Functional magnetic resonance imaging (fmri) allows scientists to explore functionality of the brain by evaluating hemodynamic activity or blood flow. A primary objective is to determine whether a region of the brain becomes active when stimulated. To accomplish this, periods of stimulus or tasks are alternated with periods without stimulus, and the hemodynamic activity is recorded during the two periods. Through this method it can be determined if a specific region has a different hemodynamic response during the stimulated state than in the non-stimulated, thus indicating whether the region is associated with the stimulus or task. One region of specific interest to neuroscientists is the auditory cortex, the region of the brain responsible for hearing. To examine the functionality of the auditory cortex, an audible tone stimulus of specific intensity and frequency is presented to the subject. For subjects whose hearing is not impaired, this tone stimulus should invoke a response in the auditory cortex that can be measured by a scanner. However, the neuronal response due to the stimulus is confounded 1

19 with the response due to the noise of the scanner that records the response. Silent fmri (sfmri) has been used to overcome this difficulty of confounded responses. In sfmri, responses are recorded while the stimulus is given and then again at a resting or non-stimulated state just like in the fmri, except that the scanner is turned on and a measurement quickly recorded before there is an increase in hemodynamic activity due to the scanner, thus avoiding confounding due to the noise of the scanner. The measurement is taken after the auditory cortex has responded to the stimulus but before it responds to the scanner noise. The response is recorded for several tiny brain locations called voxels. Due to the nature of blood flow in the brain, the responses obtained from neighboring voxels cannot be considered independent but are assumed to be spatially correlated. In order to correctly identify if a region of the brain (such as the auditory cortex) is statistically active, scientists must correctly account for this dependency in the data (Christensen and Yetkin 2005; Grady and Helms 1995). To correctly account for the spatial dependency of the data, a spatial covariance structure can be included in the model. This can be implemented in SAS R with the use of the MIXED procedure. However, SAS R offers many types of spatial covariance structures and the choice of the correct structure is not an easy one. Researchers often use some criterion such as AIC or SBC to select the covariance structure that best fits the data, however these criteria do not always select the correct structure. (Keselman et al. 1998; Ferron et al. 2002; Gomez et al. 2005) Once a covariance structure has been selected, the fixed effects can 2

20 then be tested. In the case of the fmri data, the region of the auditory cortex can be tested for significant activation. This project explores, through the use of simulations, the performance of AIC in selecting the correct spatial covariance structure. Also, Type I error rates are reported for testing a region for significant activation using the AIC-selected covariance structure. Finally an actual fmri data set is analyzed. A literature review is given in Chapter 2. The literature review discusses fmri data, fundamentals of spatial data analysis and the selection of covariance structures in a mixed model setting. Finally, repercussions of covariance selection on testing a fixed effect are discussed. Chapter 3 discusses the methods used for the simulations, including how the data are generated and analyzed. Chapter 4 presents the results of the simulations, as well as some general conclusions. An example analysis of an actual data set is covered in Chapter 5. Chapter 6 briefly explores some potential directions for future research. 3

21 Chapter 2 Review of Literature 2.1 fmri Magnetic Resonance Imaging (MRI) is a tool used to view structural images of the inside of the body. In an MRI a patient is placed in a strong magnetic field which is used to magnetize the tissue. Genovese (1998) states that, carefully modulating the magnetic field affects a specific type of atomic nucleus, such as hydrogen, in the patient s tissue and produces a measurable signal in a nearby receiver coil. The signal that is recorded by the receiver differs for differing densities of tissue, thus an MRI is a device that essentially measures the hydrogen content in a given volume element in the body. These volume elements, or voxels, can be placed together to create a high-resolution, three-dimensional image of the tissue volume inside the body. Thus, MRI s are very valuable for obtaining anatomical structure images of the inside of the body. In functional Magnetic Resonance Imaging (fmri), the technology of an MRI is applied not to obtain a structural image of the body, but in an attempt to create a functional image of the body. Instead of being interested in the actual 4

22 image, the researcher is interested in the activity of structures inside the body. An activity image can be obtained by exploring the subtle changes in the measured signal across time. These changes are caused by physiological effects related to neural activity (Genovese 1998). This neural activity may give indications of regions that are responsible for different physical or mental tasks or other stimuli. In an fmri experiment a patient is placed in the scanner and then given some sequence of tasks or stimuli while the images of the subject s brain are acquired at regular time intervals. The data set that results from this type of experiment is a time series of three dimensional images. Many of these experimental designs consist of two conditions: one in which the subject is given a stimulus or task and then a control or resting period. It is assumed that the control task involves all aspects of the stimulus task except for those stimuli being specifically investigated. An example of an fmri experiment would be playing an audible tone for a few seconds into the ear of the subject during the MRI scan and then turning the tone off. By repeating this process many times the scanner would produce images of the brain that could be associated with the time of the stimulus or control. This time-series image could then be used to classify voxels as active or inactive via a statistical hypothesis test (Genovese 1998; Lange 1996). One might use the t-test to compare the average signal levels in the two conditions. This classification technique provides a good map of which regions of the brain are more active when the stimulus is given. One major problem in assessing the effects of a audible tone given during 5

23 the fmri is that the scanner itself is very noisy. The neuronal response due to tone stimulus is confounded with the response to the noise of the scanner. The silent fmri (sfmri) technique is designed to decrease the problem of the scanner noise (Yang et al. 2000). When a stimulus is given, there is a brief lag between the neuronal activity and the resulting hemodynamic response; this lag is exploited in the sfmri (see Figure 2.1 (Christensen and Yetkin 2005)). The pure tone response is given and then the scanner is turned on. A measure of the hemodynamic activity is quickly recorded. Since the hemodynamic response due to the noise of the scanner will occur after a brief amount of time, the response recorded by the scanner immediately after turning the scanner on represents only the hemodynamic response due to the pure-tone stimulus. After a resting period in silence, the scanner is once again turned on and a measurement is quickly recorded before the brain has time to react to the noise of the scanner. This response represents the control response (Yetkin et al. 2003). The data obtained from the sfmri can be analyzed with a statistical model to determine if a region of interest is statistically active. Since the data represent a three dimensional map of activation the responses are probably spatially correlated and the use of spatial statistics is recommended. 2.2 Spatial Statistics Cressie (1993, pg. 2-3) states that beginning classes in Statistics (and many of the more advanced ones) always assume that observations on a phe- 6

24 Figure 2.1: Hemodynamic response curve for an active voxel when using the sfmri method. 7

25 nomenon are taken under identical conditions and that each observation is taken independently of any other. The data then form a random sample [i.e., are independent and identically distributed (i.i.d)]; standard statistical techniques can be applied to build a statistical model and to estimate the model s parameters. He goes on to say, models that involve statistical dependence are often more realistic; two classes of models that have commonly been used involve intraclasscorrelation structures and serial-correlation structures. These offer little scope for spatial data, where dependence is present in all directions and becomes weaker as data locations become more dispersed. The complex spatial nature of fmri data makes the modeling of covariance structures an important part of the data analysis process Spatial Modeling Spatial data can be thought of as a realization from a random process that varies continuously throughout a region. Symbolically, Cressie (1993) refers to this as {Z(s) : s D}, where D is a subset of IR d with positive d-dimensional volume, and s is the continuous spatial index of the process Z. It is of interest to model this random process given the data {Z(s 1 ),..., Z(s n )} at know spatial locations {s 1,..., s n }. One of the key tools in modeling this process is the variogram. Matheron (1962) defines the 8

26 variogram, 2γ(.), as 2γ(s 1 s 2 ) = var(z(s 1 ) Z(s 2 )), for all s 1, s 2 D. It should be noted that γ(.) is called the semivariogram and is commonly used in place of the variogram. From the variogram definition it is easy to see that 2γ(0) = 0; however, if the observed value of the variogram does not approach zero but approaches some other value c 0 > 0 as the distance between two points approaches zero, then c 0 is called the nugget effect (Matheron 1962). Notationally, this can be written lim h 0 γ(h) = c 0 > 0. The nugget effect can be considered microscale variation in the process and can be used to explain the discontinuity in the variogram as the distance approaches zero. This nugget can also be due to measurement error in the process Stationarity In order to model the random process Z, assumptions must be made about the process. If no assumptions are made the data would represent an incomplete sampling of a single realization of the process, making any kind of statistical inference impossible. One assumption often made is that the mean of the overall process is the same regardless of the spatial location s, that is E(Z(s)) = µ for all s D. Assumptions about the covariance must also be made in order to make predictions. One such assumption is cov(z(s 1 ), Z(s 2 )) = C(s 1 s 2 ), for all s 1, s 2 D. 9

27 The function C(.) is called a covariogram and a process satisfying both the constant mean and the constant covariogram function assumptions is called secondorder stationary. The stationarity assumption allows for estimation and prediction within the space (Cressie 1993) Isotropy The value of the variogram function is found using the distance between two spatial locations s 1 and s 2. If the variogram value is the same for all s 1, s 2 pairs for which the distance between s 1 and s 2 is constant, regardless of their locations, then the variogram is called isotropic (Journel and Huijbregts 1979). Isotropy is a simplifying assumption that is often made in the analysis of spatial data. If a process is stationary and the variogram is only a function of s 1 s 2, it is isotropic. If the value of the variogram is not only a function of the distance between s 1 and s 2 but also a function of their direction then the process is said to be anisotropic. The MIXED procedure in SAS R can be used to analyze isotropic and certain anisotropic spatial covariance functions (SAS 1999). 2.3 Linear Mixed Models Linear mixed models can be used for the analysis of correlated data, such as fmri data. With the use of the MIXED procedure in SAS R, spatial covariance structures can be fitted to the correlated data with the use of the repeated 10

28 statement. The repeated statement is used because the observed measurements are recorded on the same subject over different locations. The linear mixed model can be written as: y = Xβ + Zu + ɛ where X and Z are design matrices, β and u are vectors of coefficients for the fixed and random effects respectively, and ɛ is a vector of random errors. The vector u is assumed to be normally distributed with mean vector zero and variancecovariance matrix G. ɛ is also assumed to be normally distributed with mean 0 and variance-covariance matrix R. Also u and ɛ are assumed independent; consequently, Cov(u, ɛ) = 0. In general, V ar(y) for linear mixed models is ZGZ + R = V. Estimation of the parameters in mixed models is more difficult than in fixed linear models (Littell et al. 2002). Not only must β be estimated but also the unknown parameters in V. Also, the correlated nature of the data must be taken into account. In such a case, least-squares is not usually the best method for finding ˆβ. Instead, we use generalized least-squares (GLS), which minimizes (y Xβ) V 1 (y Xβ). Thus the (GLS) estimate for β is ˆβ GLS = (X V 1 X) 1 X V 1 y. However, since V is unknown, estimated generalized least-squares (EGLS) is used by inserting a reasonable estimate of V into the estimating equation. This esti- 11

29 mate of V can be obtained through likelihood-based approaches such as maximum likelihood (ML) or restricted maximum likelihood (REML) (Littell et al. 2002). Now with the estimate ˆV, the estimate for β becomes ˆβ EGLS = (X ˆV 1 X) 1 X ˆV 1 y. It should be noted that although REML is a sensible method for parameter estimation, and is the default method in SAS R, Warnes and Ripley (1987) claim that maximum likelihood estimators are of limited value for spatial problems. They offer no better method of parameter estimation but simply suggest that the estimates be interpreted with caution. If G and R (which affect V) were known, ˆβ would be the best linear unbiased estimator (BLUE) of β. However, since they are not known, ˆβ is no longer BLUE but is referred to as being the empirical best linear unbiased estimator (EBLUE), where best indicates the least squared error. (Littell et al. 2002; Searle 1971) Selection of Covariance Structure The modeling of a covariance structure refers to representing V (Y) as a function of a relatively small number of parameters (Littell et al. 2000). Since observations on different subjects are assumed independent, but repeated measurements from the same subject cannot be considered independent, the covariance structure of the data can be modeled by placing a functional specification on R. In their analysis of repeated measures data over time, Littell et al. (2000) use 12

30 a model in which the covariance between two observations on the same subject depends only on the length of the time interval between measurements (called the lag), and the variance is constant over time. In spatial analysis the covariance is a function of distance rather than time, and in more complex spatial modeling it is also a function of direction and space (Cressie 1993). Since the observations obtained from an fmri cannot be assumed to be independent, the correlation between measurements must be modeled. Grady and Helms (1995) discuss how an unstructured covariance structure is useful for exploring trends in the covariance matrix. Littell et al. (2000) made use of the correlogram (Cressie 1993) as a graphical tool to explore the nature of the correlation and Dawson et al. (1997) offer other graphical techniques. Often the researcher s prior knowledge can be a useful tool in exploring the covariance structure in an effort to obtain the correct structure. Littell et al. (2000) observed that if the covariance structure is incorrectly modeled the estimates of the effects and inferences made may be invalid (Ferron et al. 2002). In the analysis of an fmri dataset, since the measurements are correlated over space it seems practical to use a covariance structure specifically designed for spatial data, but it may be difficult to choose which structure is correct. Many researchers use information criteria such as the Akaike Information Criteria (AIC) (Akaike 1973) or Schwarz s Bayesian criteria (SBC) (Schwarz 1978) as tools for choosing between different covariance structures. Marx and Stroup (1993) state that AIC are SBC are goodness-of-fit indices that may be used to 13

31 compare models with the same fixed effects but different covariance structures. AIC and SBC in SAS R are computed by taking -2 times the minimized restricted log-likelihood (REML) and then penalizing for the number of covariance parameter estimates. SBC imposes a greater penalty for covariance parameter estimates. If two different covariance structures contained the same number of parameters, such as compound symmetry and auto-regressive(1) or spatial Gaussian and spatial spherical, then AIC and SBC would simply indicate the better fit to the data since both structures receive the same penalty. Grady and Helms (1995) also note that if covariance structures have different numbers of parameters, and if one is a special case of the other, likelihood ratio tests can be used to choose between the covariance structures. Covariance structure modeling offers many benefits in a data analysis process. Grady and Helms (1995) suggest that by modeling the covariance structure of the data a more complete understanding of the process is obtained. Modeling a covariance structure allows for unbalanced and missing data without having to delete the subject (Littell et al. 2000). By modeling the covariance structure the number of parameters that must be estimated can be drastically reduced. This is advantageous when the selected covariance structure is adequate, that is, it produces good estimates and test statistics. 14

32 2.3.2 Type I Error Rates Several simulation studies have been performed to investigate Type I error rates of F-tests in mixed linear models (Schaalje et al. 2002). Most of these studies include covariance structures such as: auto-regressive of order 1 (AR(1)), compound symmetry (CS), toeplitz (TOEP), and similar structures with heterogeneity of variance. The influence of the complexity of the covariance structure on the error rates were explored. Different ways of computing degrees of freedom have also be investigated (Schaalje et al. 2002; Kenward and Roger 1997). In an fmri it is of interest to find regions of activity in the brain. If these regions are previously identified then the location can be considered a fixed effect and F-tests can be used to evaluate if the responses in that region significantly differ from the other responses. However, as previously stated, if the covariance structure is not adequately modeled, the tests on fixed effects may be invalid. 15

33 Chapter 3 Methods This chapter describes the methods used for the simulation study. The simulation study entails the generation of data that resemble fmri data, where observations taken at spatial locations are correlated according to some known spatial structure. Then mixed models with several spatially correlated error structures are fitted to the data. A spatial error structure is chosen using AIC as a measure of fit after which statistical properties of the model parameter estimates will be observed. An explanation of the data generation process is outlined in the first section. The second section discusses the use of simulations to investigate the performance of the covariance structure selection process, the Type I error rates, and power. 3.1 Data Generation The data generation was designed to represent an fmri data set. Responses were generated for each voxel within a single slice of an fmri scan. Since the 390 observed locations on a slice are all taken from the same subject, they are spatially 16

34 correlated repeated measures. No fixed effects were added to the simulated data set. The data consisted of 390 observations from a grid similar to half of an fmri scan (Figure 3.1). Two sets of generated data were created for the same slice, one to represent the responses from the control fmri scan and one to represent the responses from the stimulated scan. The region intended to represent the auditory cortex region contained 34 of the 390 observed locations. The region of the auditory cortex was identified in the grid of points but no effect was added to the responses in the region (Figure 3.1). Due to the absence of a fixed effect associated with the region of interest it is possible to consider Type I error rates associated with the test for region activation. The four covariance structures used to generate the repeated measures were exponential, Gaussian, power, and spherical. Details of these four covariance structures are given in Table 3.1 (SAS 1999; Journel and Huijbregts 1979). Each of these covariance structures has three parameters; the variance (σ 2 ), the range of dependency (θ) or correlation coefficient (ρ), and the nugget (η). Note that d i,j represents the euclidean distance between the i th and j th elements in the grid of observed data locations. The nugget (η) is multiplied by an indicator variable where 1 (di,j =0) is equal to one when d i,j = 0 and is equal to zero otherwise. The parameters for each of the four structures were chosen from the estimates of an actual data set. The data set was slice 10 from an fmri scan. The observations were trimmed in order to only have responses for regions that actually contain brain tissue. The estimates were obtained by fitting a semi-variogram 17

35 y x Figure 3.1: Generated data voxel locations within a slice. The auditory cortex is the shaded region. Structure (i, j) th element Exponential Gaussian σ 2 [exp( d i,j /θ Exp )] + η1 (di,j =0) σ [ 2 exp( d 2 i,j/θgau 2 )] + η1 (di,j =0) Power σ 2 ρ d i,j [ P ow + ] η1 (d i,j =0) Spherical σ 2 1 3d ij 2θ Sph + d3 i,j 1 2θSph 3 (di,j θ Sph ) + η1 (di,j =0) Table 3.1: Selected Spatial Covariance Structures. d i,j is the distance between two points, and 1 (di,j =0) is an indicator function for distance zero. 18

36 Parameter Structure σ 2 θ or ρ η Exponential Gaussian Power Spherical Table 3.2: Covariance Parameters used to Generate Data. to the data with each of the four structures. Weighted least squares was used to fit the semi-variograms and obtain estimates of the parameters. The fitted semivariograms are shown in Figure 3.2 and Table 3.2 gives the values of the covariance parameters for each structure. To ensure that the data from each of the four structures were similar in terms of generalized variance, the eigenvalues for each of the four structures were computed and the log eigenvalues are shown in Figure 3.3. Since the eigenvalues of the covariance matrices are similar for the four structures, the generalized variances are also similar. This is important to insure that the results of the AIC selection of the covariance structure is not simply confounded with the fact that one structure yields a much larger generalized variance than another. Once the data generation parameters were chosen, ten thousand datasets for each structure were generated following a two-step process outlined by Ripley (1987). The first step of this process is to generate a random multivariate normal vector y with mean vector zero and variance-covariance matrix as an identity, y N n (0, I). In the second step the random vector y is multiplied by the Cholesky decomposition of the covariance matrix of the desired structure. The response vec- 19

37 gamma Exponential Gaussian Power Spherical objective = distance Figure 3.2: Semi-Variograms used to generate the simulated data. Dots represent the empirical Semi-Variogram. 20

38 Log(Eigenvalues) Exponential Gaussian Power Spherical Index Figure 3.3: Log-Eigenvalues of the Covariance Matrices 21

39 tor that results has a zero mean and variance-covariance matrix which follows the structure of R as in Table 3.1. Recall that R is the variance-covariance matrix for repeated measurements that are correlated so V ar(y) = R. Since the actual data had an overall mean of 11, this amount was added to every generated response; thus the response vector has a mean vector of 11 and a variance-covariance matrix of R. Since the entire region has the same expected value and the covariance depends only on the distance between two voxels, the data are stationary and isotropic. However, for the power study, a fixed amount of 2 was added to all of the responses from the stimulated state in the predefined region. All of the data generation was done using Proc IML of SAS. Due to memory problems, the data were generated in small portions and then added together to create the total 40,000 data sets. Proc MIXED was used to analyze all of the data sets following a similar pattern as the data generation, in small portions. Generating the data, fitting the model, and evaluating Type I error rates for the 40,000 data sets took about 1000 hours on a Penguin Computing Dual Opteron Altus 1000E Linux machine with SAS v. 9. By default, Proc MIXED uses the iterative process of REML to estimate the variance-covariance parameters of the mixed model (Littell et al. 2002). 3.2 Modeling the Data The model was fitted in two steps. Since it is assumed that the repeated measures are stationary, the control or non-stimulated set of responses was used 22

40 to fit the covariance structure with no fixed effects in the model. Recall that stationarity implies that the mean and the covariance of the process do not differ for different locations but are constant. The stimulated set of responses are assumed to not be stationary since it is believed that the responses in the auditory cortex differ from the others. So the MODEL statement of SAS Proc Mixed for the control was y=. The four covariance structures were fitted to each of the data sets by changing the REPEATED statement (Listing 3.1 for example code). The DDFM option on the MODEL statement was changed to KENWARDROGER. This was used instead of the default BETWITHIN (between-within) because it involves modifying the estimated variance-covariance matrix of the fixed effects which should result in better Type I error rates (Kenward and Roger 1997; SAS 1999). Listing 3.1: Example of Pseudo SAS Mixed Code 1 Mixed code f o r s t a t i o n a r y p r o c e s s to f i n d c o v a r i a n c e s t r u c t r e and parameters ; 2 Proc mixed data=fmri ; 3 class treatment r e g i o n ; 4 model y= / ddfm=kenwardroger; 5 repeated / subject=treatment type=sp(exp) ( x y ) local ; 6 where treatment =1; 7 8 Mixed code f o r a n a l y s i s o f treatment by r e g i o n e f f e c t, using parameter v a l u e s from s t a t i o n a r y n o n s t i u l a t e d r e s p o n s e s ; 9 Proc mixed data=frmi n o p r o f i l e ; 10 class treatment r e g i o n ; 11 model y= treatment r e g i o n / ddfm=kenwardroger; 12 repeated / subject=i n t e r c e p t type=sp(exp) ( x y ) local ; 13 parms / n o i t e r ; 14 AIC was recorded for each fitted model and then used to determine which covariance structure best fit the data. Recall that AIC is computed by taking 23

41 -2 log likelihood and then penalizing by adding twice the number of parameter estimates. However, since the four structures in question all have three parameters (the variance, the range, and the nugget), AIC is being penalized equally for all structures and simply represents the best fit to the data with the largest likelihood. The lowest AIC model for each data set was tabulated to examine the success rates of AIC in choosing the correct covariance structure. Chapter 4 will show the results of the AIC selection in a two-way table, with data structure on one margin and AIC chosen structure on the other. If the iterative process of REML were performing perfectly in finding the true parameter values, all of the AIC chosen structures would be the same as the actual data structure. Once the best covariance structure was chosen for each data set, the model was re-fit with the covariance structure and covariance parameter estimates obtained from only fitting the control responses. This time the MODEL statement was changed to y=treatment region treatment*region, where the treatment represents the control or stimulated responses, region is a indicator variable of locations previously identified as inside the auditory cortex, and treatment*region represents the interaction between treatment and region. In the second call of the MIXED procedure the F-test on the treatment by region interaction is of special interest to determine if the auditory cortex is becoming active during the stimulated state. So, the p-values for the treatment by region interaction were recorded for the AIC chosen model, and then for simulation purposes p-values were also recorded for the correct covariance structure with the 24

42 true covariance parameters. The proportion of times that the p-values were less than or equal to α = 0.05 was calculated, which should be close to.05 since no fixed effects were added. The rejection rates for this fixed effect are shown in Chapter 4. The table shows the structure with which the data was generated, the AIC chosen structure(model used to fit the data) and the rejection rates. Since the rejection rates are not exact with only 10,000 simulations, ranges for the rates were computed. The standard deviation of this proportion is given by the equation p(1 p) n, which for the 10,000 observed datasets is A 95% probability region for the proportion is ( ). Therefore, if the Kenward-Roger method were performing correctly for these spatial structures, the Type I error rates should fall within the probability region in 95% of cases. 25

43 Chapter 4 Results This chapter presents the results of the simulation study. First the performance of AIC in selecting a spatial covariance structure is discussed. Then type I error rates and power are addressed. 4.1 Performance of AIC For each of the 40,000 total data sets, AIC was recorded for all four modeled covariance structures. The structure that returned the lowest AIC was selected as the structure that best fit the data. The results of the AIC selection are shown in Table 4.1. Each row in the table represents the covariance structure from which the data was generated and the columns represent the AIC-selected structure. If AIC were performing perfectly at selecting the correct covariance structure then this table would have 100% down the diagonal and zero everywhere else. It is clear to see from Table 4.1 that AIC performs very poorly at selecting the correct covariance structure. The Gaussian covariance structure is the most commonly selected structure, chosen for 33.61% of the 40,000 total data 26

44 AIC Selected Structure Data Structure Exponential Gaussian Power Spherical Total(Counts) Exponential 19.73% 33.02% 19.26% 27.99% Gaussian 4.18% 48.19% 43.68% 3.95% Power 20.41% 31.88% 19.30% 28.41% Spherical 24.08% 21.36% 44.05% 10.51% Total 17.10% 33.61% 31.57% 17.72% Table 4.1: Percents of AIC-Selected Covariance Structures. sets. Power is the next most common structure followed by spherical and exponential. Further exploration of the table reveals some interesting patterns. For the exponential, Gaussian, and power data structures, the structure most commonly chosen by AIC was the Gaussian structure. For the spherical data, the power structure was most commonly selected by AIC. However, even when the actual data structure was Gaussian, AIC chose the Gaussian structure less than half the time. In general, AIC is a very poor selector of the correct spatial covariance structures. 4.2 Type I Error Rates To evaluate type I error rates for the treatment by region interaction, the p-values from the fixed-effects tests were recorded. These tests were done using the AIC-selected covariance structure. Recall that the treatment by region interaction is of primary interest because it indicates if the region identified as the auditory cortex becomes active during the stimulated state. Since no fixed effect was added to the responses in the region of interest, the type I error rates should be equal 27

45 AIC Selected Structure Actual Data Structure Exponential Gaussian Power Spherical Overall Exponential * * * Gaussian * * * * * Power * * * Spherical * * 0.076* * * Overall * * * * * Table 4.2: Rejection Rates Based on AIC-Selected Structure. to α = The type I error rates are reported in Table 4.2. The row labeled Overall represents the marginal type I error rates for each modeled covariance structure regardless of the actual data structure. The Overall column represents the marginal type I error rates for each data structure. The type I error rates marked with an asterisk (*) are significantly different than 0.05 based on the number of data types modeled with a particular structure. For example, 1973 of the 10,000 exponentially generated data sets were modeled with an exponential covariance structure so the type I error rates should be between and (0.05 ± (1 0.05) 1973 ). Of the 16 different combinations of actual and selected data structures, 12 of them have type I error rates that are significantly different than expected. Gaussian structures seem to yield the highest inflation to the type I error rates. The exponential model fitted to the exponential and power data have reasonable type I error rates, as well as the spherical model to the exponential data and power model to the power data. The type I error rate for the power model of the exponential data is significantly lower than expected. Overall the type I error 28

46 rates are inflated even when using a spatial covariance structure to account for the dependency in the data. During the data analysis process, researchers are unaware of the correct structure from which their data came and can only select the structure used to model the data. From the information presented in Table 4.2 it can be clear to the researcher that regardless of the structure chosen to model the data, the type I error rates will be inflated. If the Gaussian structure is selected by AIC, it also yields the highest type I error rates, so the results should be reported cautiously. 4.3 Power In the fmri analysis it is of primary interest to determine if a region of the brain is being activated when stimulated. In the results discussed to this point no fixed-effects where added to the responses so the responses of all regions would have an equal mean. In order to address the goal of evaluating region activity, a power analysis was also run. The simulations of the power analysis follow a similar pattern as the rest of the simulations. The parameters for all of the structures were the same as before however, a small constant was added to all of the responses identified as being in the region of interest in the stimulated state. The amount added was chosen so that the rejection rate of the fixed-effects test would be about When the data were generated, the overall mean of the responses was 11 and for the power analysis 2 was added to the responses in the region of interest. Thus, the mean of 29

47 AIC Selected Structure Actual Data Structure Exponential Gaussian Power Spherical Overall Exponential Gaussian Power Spherical Overall Table 4.3: Rejection Rates for Power Analysis Based on AIC-Selected Structure. the responses in the region of interest during the stimulated state is approximately 13, while the mean of the rest of the responses are approximately 11. Ignoring the spatial correlation, the standard deviation of the 390 responses in the simulated data is around 2.5, so less than one standard deviation has been added to the 34 responses. The rejection rates for the treatment by region interaction are presented in Table 4.3, the rejection rates are from the AIC-selected covariance structure. The rejection rates seem to be similar for all structures of data and models. The data structure that results in the highest overall power is the power data structure. The modeled structure that yields the highest overall power is Gaussian. For the exponential data structure the Gaussian models had the highest power with 48.75% of the models rejecting the null hypothesis; of the 16 combinations of data and modeled structures this is the one with the highest power. Since all four covariance structures have similar power, the structure with an optimal type I error rate would be prefered over the others, indicating that Gaussian would be a poor choice. However, since the only results reported in this study are from AIC-selected structures, it is unclear if the same pattern would hold for all data 30

48 sets and structures. 4.4 Summary AIC appears to be a very poor selector of spatial covariance structures. Incorrect structures are generally selected more frequently than the correct one. The Gaussian covariance structure is most commonly selected by AIC. Type I error rates are generally inflated for testing fixed effects when using the AIC-selected structure, which is sometimes the correct structure. 31

49 Chapter 5 Data Analysis In Chapter 4, the overall performance of SAS R in analyzing spatial data resembling an actual fmri was investigated. We now consider an analysis of the fmri data set from which this project was motivated. From the results of the simulations, we know that Gaussian covariance structures are most commonly selected by AIC regardless of the true structure of the data. Type I error rates are inflated, and a difference in the mean responses in the auditory cortex of about 2 results in concluding significant activation in the region of interest about half the time. 5.1 Covariance Structure Selection In order to select which covariance structure should be used to model the data, all four structures where fit to the data and AIC recorded for each one. The model with the lowest AIC is chosen as the one that should be used. The AIC values are as follows, 541.0(Exp), 517.1(Gau), and 916.7(Sph). The power model did not converge so the value of AIC is not comparable. As expected, the Gaussian 32

50 structure returned the lowest value of AIC. However, from the results of the simulation study it is unclear whether the data really come from a Gaussian process or if it is just an artifact of the poor performance of AIC. Regardless of the lack of confidence in the AIC selection process, a structure must be chosen and since we have no better method, the AIC chosen structure will be used, but we also consider the results associated with the exponential and spherical models. The covariance parameter estimates obtained for the Gaussian model are nugget(η)=0.0023, range(θ Gaussian )=1.316, and variance(σ 2 )= These estimates along with the Gaussian structure should correctly account for the dependency in the data so that accurate inference can be made. 5.2 Test of Significance Now that a covariance structure has been chosen and covariance parameters estimated, the tests on the fixed-effects can be examined. The F-statistic for the region by treatment interaction is with 1 numerator and 387 denominator degrees of freedom. This statistics yields a p-value of <0.0001, thus we would conclude that the mean of the defined region is different during the two stages. Printing out the least-squares means reveals that even though the difference in the responses is small it is sufficient enough to be significantly detected. The leastsquares means and standard errors associated with the Gaussian model are given in Table 5.1. Even thought AIC selected Gaussian as the best fitting structure, it is of 33

51 Treatment Region Estimate Standard Error Stimulated Auditory Cortex Stimulated Non-Auditory Cortex Non-Stimulated Auditory Cortex Non-Stimulated Non-Auditory Cortex Table 5.1: Least-Squares means for treatment by region levels interest to examine the results from the other structures. Since the power structure did not converge it is not meaningful to explore the results of that model. The F-statistics for the fixed-effect test of the treatment by region interaction were 17.57, 17.56, and for exponential, Gaussian and spherical, respectively. The parameter estimates for the other three structures are shown in Table 5.2. Despite yielding unrealistic parameter estimates for σ 2 and θ, the F-statistic associated with the exponential model is similar to the F-statistic associated with the more realistic-looking Gaussian model. The spherical model concludes that the range of spatial dependency is essentially zero, yet it too has an F-statistic very close to the Gaussian model F-statistic. All three F-statistics would result in the same conclusion so it appears that adjusting for the spatial correlation, whether with the correct structure or just a good one, is beneficial. Parameter Structure σ 2 θ η Exponential Gaussian Spherical E Table 5.2: Covariance parameter estimates from SAS. 34

52 Chapter 6 Further Considerations During the course of this project many different ideas have been discussed and this chapter presents a few of those ideas which were not incorporated into this project but which may be useful in further research. 6.1 Independence of Responses Across Treatments In Chapter 3 the method for generating the data sets for the simulation study was explained. There were two sets of responses created with the spatial covariance structure, these two sets represented the stimulated and non-stimulated responses for an individual. In the data generation, these two sets were created independent of one another which in actuality is not reasonable because the responses are for the same subject so even across treatments the responses should be correlated for a given voxel. For future studies, the responses from the two treatments should not be generated independent of one another, but should be generated so that for a given voxel, the responses are correlated. This can be incorporated by creating two sets of responses, Z 1 and Z 3 such that they are indepen- 35

53 dent. The set of correlated responses (Z 2 ) can be obtained from Z 2 = az 1 + bz 3, where a and b are chosen to induce correlation. When the simulated data was analyzed, the treatments were considered independent but that the responses from both treatments had the same covariance structure and covariance parameters. The non-stimulated set of data was used to find the lowest AIC model structure and also to obtain estimates for the covariance parameters. These covariance parameter estimates were used for both treatments when evaluating the type I error rates and for the power analysis. This was specified in SAS R in the REPEATED statement by specifying SUBJECT=treatment. Recall that the repeated statement specifies the variance-covariance structure of ɛ, which is called R, and that the variance-covariance matrix of the response vector V is equal to R only if a REPEATED statement is used and there are no random effects. However, in specifying the SUBJECT=treatment option of the REPEATED statement, the variance-covariance structure R was placed on both sets of treatment responses so that V was really a block diagonal matrix with two blocks of the same R matrix. Both blocks of the R matrix were the same since the covariance parameters were estimated from the non-stimulated responses and then forced into the model with both treatments, not allowing SAS R to further iterate in finding new covariance parameter estimates. Before the actual fmri data set was analyzed, this inconsistency of considering the two treatments independent of one another was discovered and corrected for the actual fmri analysis. Simply by changing the SUBJECT= option to SUB- 36

A Comparative Simulation Study of Robust Estimators of Standard Errors

A Comparative Simulation Study of Robust Estimators of Standard Errors Brigham Young University BYU ScholarsArchive All Theses and Dissertations 2007-07-10 A Comparative Simulation Study of Robust Estimators of Standard Errors Natalie Johnson Brigham Young University - Provo

More information

POWER ANALYSIS TO DETERMINE THE IMPORTANCE OF COVARIANCE STRUCTURE CHOICE IN MIXED MODEL REPEATED MEASURES ANOVA

POWER ANALYSIS TO DETERMINE THE IMPORTANCE OF COVARIANCE STRUCTURE CHOICE IN MIXED MODEL REPEATED MEASURES ANOVA POWER ANALYSIS TO DETERMINE THE IMPORTANCE OF COVARIANCE STRUCTURE CHOICE IN MIXED MODEL REPEATED MEASURES ANOVA A Thesis Submitted to the Graduate Faculty of the North Dakota State University of Agriculture

More information

Analysis of Longitudinal Data: Comparison between PROC GLM and PROC MIXED.

Analysis of Longitudinal Data: Comparison between PROC GLM and PROC MIXED. Analysis of Longitudinal Data: Comparison between PROC GLM and PROC MIXED. Maribeth Johnson, Medical College of Georgia, Augusta, GA ABSTRACT Longitudinal data refers to datasets with multiple measurements

More information

Chapter 4 - Fundamentals of spatial processes Lecture notes

Chapter 4 - Fundamentals of spatial processes Lecture notes TK4150 - Intro 1 Chapter 4 - Fundamentals of spatial processes Lecture notes Odd Kolbjørnsen and Geir Storvik January 30, 2017 STK4150 - Intro 2 Spatial processes Typically correlation between nearby sites

More information

Introduction to Spatial Data and Models

Introduction to Spatial Data and Models Introduction to Spatial Data and Models Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. 2 Department of Forestry

More information

Circle the single best answer for each multiple choice question. Your choice should be made clearly.

Circle the single best answer for each multiple choice question. Your choice should be made clearly. TEST #1 STA 4853 March 6, 2017 Name: Please read the following directions. DO NOT TURN THE PAGE UNTIL INSTRUCTED TO DO SO Directions This exam is closed book and closed notes. There are 32 multiple choice

More information

Approximations to Distributions of Test Statistics in Complex Mixed Linear Models Using SAS Proc MIXED

Approximations to Distributions of Test Statistics in Complex Mixed Linear Models Using SAS Proc MIXED Paper 6-6 Approximations to Distributions of Test Statistics in Complex Mixed Linear Models Using SAS Proc MIXED G. Bruce Schaalje, Department of Statistics, Brigham Young University, Provo, UT Justin

More information

Analysis of Longitudinal Data: Comparison Between PROC GLM and PROC MIXED. Maribeth Johnson Medical College of Georgia Augusta, GA

Analysis of Longitudinal Data: Comparison Between PROC GLM and PROC MIXED. Maribeth Johnson Medical College of Georgia Augusta, GA Analysis of Longitudinal Data: Comparison Between PROC GLM and PROC MIXED Maribeth Johnson Medical College of Georgia Augusta, GA Overview Introduction to longitudinal data Describe the data for examples

More information

Comparing Non-informative Priors for Estimation and Prediction in Spatial Models

Comparing Non-informative Priors for Estimation and Prediction in Spatial Models Environmentrics 00, 1 12 DOI: 10.1002/env.XXXX Comparing Non-informative Priors for Estimation and Prediction in Spatial Models Regina Wu a and Cari G. Kaufman a Summary: Fitting a Bayesian model to spatial

More information

HST 583 FUNCTIONAL MAGNETIC RESONANCE IMAGING DATA ANALYSIS AND ACQUISITION A REVIEW OF STATISTICS FOR FMRI DATA ANALYSIS

HST 583 FUNCTIONAL MAGNETIC RESONANCE IMAGING DATA ANALYSIS AND ACQUISITION A REVIEW OF STATISTICS FOR FMRI DATA ANALYSIS HST 583 FUNCTIONAL MAGNETIC RESONANCE IMAGING DATA ANALYSIS AND ACQUISITION A REVIEW OF STATISTICS FOR FMRI DATA ANALYSIS EMERY N. BROWN AND CHRIS LONG NEUROSCIENCE STATISTICS RESEARCH LABORATORY DEPARTMENT

More information

Introduction to Spatial Data and Models

Introduction to Spatial Data and Models Introduction to Spatial Data and Models Sudipto Banerjee 1 and Andrew O. Finley 2 1 Department of Forestry & Department of Geography, Michigan State University, Lansing Michigan, U.S.A. 2 Biostatistics,

More information

Experimental designs for multiple responses with different models

Experimental designs for multiple responses with different models Graduate Theses and Dissertations Graduate College 2015 Experimental designs for multiple responses with different models Wilmina Mary Marget Iowa State University Follow this and additional works at:

More information

Overview of Spatial Statistics with Applications to fmri

Overview of Spatial Statistics with Applications to fmri with Applications to fmri School of Mathematics & Statistics Newcastle University April 8 th, 2016 Outline Why spatial statistics? Basic results Nonstationary models Inference for large data sets An example

More information

RANDOM and REPEATED statements - How to Use Them to Model the Covariance Structure in Proc Mixed. Charlie Liu, Dachuang Cao, Peiqi Chen, Tony Zagar

RANDOM and REPEATED statements - How to Use Them to Model the Covariance Structure in Proc Mixed. Charlie Liu, Dachuang Cao, Peiqi Chen, Tony Zagar Paper S02-2007 RANDOM and REPEATED statements - How to Use Them to Model the Covariance Structure in Proc Mixed Charlie Liu, Dachuang Cao, Peiqi Chen, Tony Zagar Eli Lilly & Company, Indianapolis, IN ABSTRACT

More information

Introduction to Geostatistics

Introduction to Geostatistics Introduction to Geostatistics Abhi Datta 1, Sudipto Banerjee 2 and Andrew O. Finley 3 July 31, 2017 1 Department of Biostatistics, Bloomberg School of Public Health, Johns Hopkins University, Baltimore,

More information

Repeated Measures Data

Repeated Measures Data Repeated Measures Data Mixed Models Lecture Notes By Dr. Hanford page 1 Data where subjects are measured repeatedly over time - predetermined intervals (weekly) - uncontrolled variable intervals between

More information

Biostatistics 301A. Repeated measurement analysis (mixed models)

Biostatistics 301A. Repeated measurement analysis (mixed models) B a s i c S t a t i s t i c s F o r D o c t o r s Singapore Med J 2004 Vol 45(10) : 456 CME Article Biostatistics 301A. Repeated measurement analysis (mixed models) Y H Chan Faculty of Medicine National

More information

Inferences about Parameters of Trivariate Normal Distribution with Missing Data

Inferences about Parameters of Trivariate Normal Distribution with Missing Data Florida International University FIU Digital Commons FIU Electronic Theses and Dissertations University Graduate School 7-5-3 Inferences about Parameters of Trivariate Normal Distribution with Missing

More information

Basics of Point-Referenced Data Models

Basics of Point-Referenced Data Models Basics of Point-Referenced Data Models Basic tool is a spatial process, {Y (s), s D}, where D R r Chapter 2: Basics of Point-Referenced Data Models p. 1/45 Basics of Point-Referenced Data Models Basic

More information

Introduction. Semivariogram Cloud

Introduction. Semivariogram Cloud Introduction Data: set of n attribute measurements {z(s i ), i = 1,, n}, available at n sample locations {s i, i = 1,, n} Objectives: Slide 1 quantify spatial auto-correlation, or attribute dissimilarity

More information

Modeling the Covariance

Modeling the Covariance Modeling the Covariance Jamie Monogan University of Georgia February 3, 2016 Jamie Monogan (UGA) Modeling the Covariance February 3, 2016 1 / 16 Objectives By the end of this meeting, participants should

More information

Testing for Unit Roots with Cointegrated Data

Testing for Unit Roots with Cointegrated Data Discussion Paper No. 2015-57 August 19, 2015 http://www.economics-ejournal.org/economics/discussionpapers/2015-57 Testing for Unit Roots with Cointegrated Data W. Robert Reed Abstract This paper demonstrates

More information

Modelling the Covariance

Modelling the Covariance Modelling the Covariance Jamie Monogan Washington University in St Louis February 9, 2010 Jamie Monogan (WUStL) Modelling the Covariance February 9, 2010 1 / 13 Objectives By the end of this meeting, participants

More information

MIXED MODELS FOR REPEATED (LONGITUDINAL) DATA PART 2 DAVID C. HOWELL 4/1/2010

MIXED MODELS FOR REPEATED (LONGITUDINAL) DATA PART 2 DAVID C. HOWELL 4/1/2010 MIXED MODELS FOR REPEATED (LONGITUDINAL) DATA PART 2 DAVID C. HOWELL 4/1/2010 Part 1 of this document can be found at http://www.uvm.edu/~dhowell/methods/supplements/mixed Models for Repeated Measures1.pdf

More information

1 Mixed effect models and longitudinal data analysis

1 Mixed effect models and longitudinal data analysis 1 Mixed effect models and longitudinal data analysis Mixed effects models provide a flexible approach to any situation where data have a grouping structure which introduces some kind of correlation between

More information

LATVIAN GDP: TIME SERIES FORECASTING USING VECTOR AUTO REGRESSION

LATVIAN GDP: TIME SERIES FORECASTING USING VECTOR AUTO REGRESSION LATVIAN GDP: TIME SERIES FORECASTING USING VECTOR AUTO REGRESSION BEZRUCKO Aleksandrs, (LV) Abstract: The target goal of this work is to develop a methodology of forecasting Latvian GDP using ARMA (AutoRegressive-Moving-Average)

More information

Linear Models 1. Isfahan University of Technology Fall Semester, 2014

Linear Models 1. Isfahan University of Technology Fall Semester, 2014 Linear Models 1 Isfahan University of Technology Fall Semester, 2014 References: [1] G. A. F., Seber and A. J. Lee (2003). Linear Regression Analysis (2nd ed.). Hoboken, NJ: Wiley. [2] A. C. Rencher and

More information

Multivariate Regression Generalized Likelihood Ratio Tests for FMRI Activation

Multivariate Regression Generalized Likelihood Ratio Tests for FMRI Activation Multivariate Regression Generalized Likelihood Ratio Tests for FMRI Activation Daniel B Rowe Division of Biostatistics Medical College of Wisconsin Technical Report 40 November 00 Division of Biostatistics

More information

Elements of Multivariate Time Series Analysis

Elements of Multivariate Time Series Analysis Gregory C. Reinsel Elements of Multivariate Time Series Analysis Second Edition With 14 Figures Springer Contents Preface to the Second Edition Preface to the First Edition vii ix 1. Vector Time Series

More information

MIXED EFFECTS MODELS FOR TIME SERIES

MIXED EFFECTS MODELS FOR TIME SERIES Outline MIXED EFFECTS MODELS FOR TIME SERIES Cristina Gorrostieta Hakmook Kang Hernando Ombao Brown University Biostatistics Section February 16, 2011 Outline OUTLINE OF TALK 1 SCIENTIFIC MOTIVATION 2

More information

Index. Geostatistics for Environmental Scientists, 2nd Edition R. Webster and M. A. Oliver 2007 John Wiley & Sons, Ltd. ISBN:

Index. Geostatistics for Environmental Scientists, 2nd Edition R. Webster and M. A. Oliver 2007 John Wiley & Sons, Ltd. ISBN: Index Akaike information criterion (AIC) 105, 290 analysis of variance 35, 44, 127 132 angular transformation 22 anisotropy 59, 99 affine or geometric 59, 100 101 anisotropy ratio 101 exploring and displaying

More information

Point-Referenced Data Models

Point-Referenced Data Models Point-Referenced Data Models Jamie Monogan University of Georgia Spring 2013 Jamie Monogan (UGA) Point-Referenced Data Models Spring 2013 1 / 19 Objectives By the end of these meetings, participants should

More information

The General Linear Model (GLM)

The General Linear Model (GLM) he General Linear Model (GLM) Klaas Enno Stephan ranslational Neuromodeling Unit (NU) Institute for Biomedical Engineering University of Zurich & EH Zurich Wellcome rust Centre for Neuroimaging Institute

More information

The exam is closed book, closed notes except your one-page (two sides) or two-page (one side) crib sheet.

The exam is closed book, closed notes except your one-page (two sides) or two-page (one side) crib sheet. CS 189 Spring 013 Introduction to Machine Learning Final You have 3 hours for the exam. The exam is closed book, closed notes except your one-page (two sides) or two-page (one side) crib sheet. Please

More information

UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Final, Fall 2013

UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Final, Fall 2013 UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Final, Fall 2013 Exam policy: This exam allows two one-page, two-sided cheat sheets; No other materials. Time: 2 hours. Be sure to write your name and

More information

Dynamic Determination of Mixed Model Covariance Structures. in Double-blind Clinical Trials. Matthew Davis - Omnicare Clinical Research

Dynamic Determination of Mixed Model Covariance Structures. in Double-blind Clinical Trials. Matthew Davis - Omnicare Clinical Research PharmaSUG2010 - Paper SP12 Dynamic Determination of Mixed Model Covariance Structures in Double-blind Clinical Trials Matthew Davis - Omnicare Clinical Research Abstract With the computing power of SAS

More information

CS Homework 3. October 15, 2009

CS Homework 3. October 15, 2009 CS 294 - Homework 3 October 15, 2009 If you have questions, contact Alexandre Bouchard (bouchard@cs.berkeley.edu) for part 1 and Alex Simma (asimma@eecs.berkeley.edu) for part 2. Also check the class website

More information

Applied Statistics Preliminary Examination Theory of Linear Models August 2017

Applied Statistics Preliminary Examination Theory of Linear Models August 2017 Applied Statistics Preliminary Examination Theory of Linear Models August 2017 Instructions: Do all 3 Problems. Neither calculators nor electronic devices of any kind are allowed. Show all your work, clearly

More information

Computing FMRI Activations: Coefficients and t-statistics by Detrending and Multiple Regression

Computing FMRI Activations: Coefficients and t-statistics by Detrending and Multiple Regression Computing FMRI Activations: Coefficients and t-statistics by Detrending and Multiple Regression Daniel B. Rowe and Steven W. Morgan Division of Biostatistics Medical College of Wisconsin Technical Report

More information

Wiley. Methods and Applications of Linear Models. Regression and the Analysis. of Variance. Third Edition. Ishpeming, Michigan RONALD R.

Wiley. Methods and Applications of Linear Models. Regression and the Analysis. of Variance. Third Edition. Ishpeming, Michigan RONALD R. Methods and Applications of Linear Models Regression and the Analysis of Variance Third Edition RONALD R. HOCKING PenHock Statistical Consultants Ishpeming, Michigan Wiley Contents Preface to the Third

More information

Lecture 2: Univariate Time Series

Lecture 2: Univariate Time Series Lecture 2: Univariate Time Series Analysis: Conditional and Unconditional Densities, Stationarity, ARMA Processes Prof. Massimo Guidolin 20192 Financial Econometrics Spring/Winter 2017 Overview Motivation:

More information

Covariance function estimation in Gaussian process regression

Covariance function estimation in Gaussian process regression Covariance function estimation in Gaussian process regression François Bachoc Department of Statistics and Operations Research, University of Vienna WU Research Seminar - May 2015 François Bachoc Gaussian

More information

TIME SERIES ANALYSIS AND FORECASTING USING THE STATISTICAL MODEL ARIMA

TIME SERIES ANALYSIS AND FORECASTING USING THE STATISTICAL MODEL ARIMA CHAPTER 6 TIME SERIES ANALYSIS AND FORECASTING USING THE STATISTICAL MODEL ARIMA 6.1. Introduction A time series is a sequence of observations ordered in time. A basic assumption in the time series analysis

More information

UNIVERSITY OF TORONTO Faculty of Arts and Science

UNIVERSITY OF TORONTO Faculty of Arts and Science UNIVERSITY OF TORONTO Faculty of Arts and Science December 2013 Final Examination STA442H1F/2101HF Methods of Applied Statistics Jerry Brunner Duration - 3 hours Aids: Calculator Model(s): Any calculator

More information

PAPER 206 APPLIED STATISTICS

PAPER 206 APPLIED STATISTICS MATHEMATICAL TRIPOS Part III Thursday, 1 June, 2017 9:00 am to 12:00 pm PAPER 206 APPLIED STATISTICS Attempt no more than FOUR questions. There are SIX questions in total. The questions carry equal weight.

More information

Mixed Models No Repeated Measures

Mixed Models No Repeated Measures Chapter 221 Mixed Models No Repeated Measures Introduction This specialized Mixed Models procedure analyzes data from fixed effects, factorial designs. These designs classify subjects into one or more

More information

TIME SERIES DATA ANALYSIS USING EVIEWS

TIME SERIES DATA ANALYSIS USING EVIEWS TIME SERIES DATA ANALYSIS USING EVIEWS I Gusti Ngurah Agung Graduate School Of Management Faculty Of Economics University Of Indonesia Ph.D. in Biostatistics and MSc. in Mathematical Statistics from University

More information

Chapter 12: An introduction to Time Series Analysis. Chapter 12: An introduction to Time Series Analysis

Chapter 12: An introduction to Time Series Analysis. Chapter 12: An introduction to Time Series Analysis Chapter 12: An introduction to Time Series Analysis Introduction In this chapter, we will discuss forecasting with single-series (univariate) Box-Jenkins models. The common name of the models is Auto-Regressive

More information

LECTURE NOTE #3 PROF. ALAN YUILLE

LECTURE NOTE #3 PROF. ALAN YUILLE LECTURE NOTE #3 PROF. ALAN YUILLE 1. Three Topics (1) Precision and Recall Curves. Receiver Operating Characteristic Curves (ROC). What to do if we do not fix the loss function? (2) The Curse of Dimensionality.

More information

DEPARTMENT OF ECONOMICS AND FINANCE COLLEGE OF BUSINESS AND ECONOMICS UNIVERSITY OF CANTERBURY CHRISTCHURCH, NEW ZEALAND

DEPARTMENT OF ECONOMICS AND FINANCE COLLEGE OF BUSINESS AND ECONOMICS UNIVERSITY OF CANTERBURY CHRISTCHURCH, NEW ZEALAND DEPARTMENT OF ECONOMICS AND FINANCE COLLEGE OF BUSINESS AND ECONOMICS UNIVERSITY OF CANTERBURY CHRISTCHURCH, NEW ZEALAND Testing For Unit Roots With Cointegrated Data NOTE: This paper is a revision of

More information

Circle a single answer for each multiple choice question. Your choice should be made clearly.

Circle a single answer for each multiple choice question. Your choice should be made clearly. TEST #1 STA 4853 March 4, 215 Name: Please read the following directions. DO NOT TURN THE PAGE UNTIL INSTRUCTED TO DO SO Directions This exam is closed book and closed notes. There are 31 questions. Circle

More information

MISCELLANEOUS TOPICS RELATED TO LIKELIHOOD. Copyright c 2012 (Iowa State University) Statistics / 30

MISCELLANEOUS TOPICS RELATED TO LIKELIHOOD. Copyright c 2012 (Iowa State University) Statistics / 30 MISCELLANEOUS TOPICS RELATED TO LIKELIHOOD Copyright c 2012 (Iowa State University) Statistics 511 1 / 30 INFORMATION CRITERIA Akaike s Information criterion is given by AIC = 2l(ˆθ) + 2k, where l(ˆθ)

More information

Outline Introduction OLS Design of experiments Regression. Metamodeling. ME598/494 Lecture. Max Yi Ren

Outline Introduction OLS Design of experiments Regression. Metamodeling. ME598/494 Lecture. Max Yi Ren 1 / 34 Metamodeling ME598/494 Lecture Max Yi Ren Department of Mechanical Engineering, Arizona State University March 1, 2015 2 / 34 1. preliminaries 1.1 motivation 1.2 ordinary least square 1.3 information

More information

Building Blocks for Direct Sequential Simulation on Unstructured Grids

Building Blocks for Direct Sequential Simulation on Unstructured Grids Building Blocks for Direct Sequential Simulation on Unstructured Grids Abstract M. J. Pyrcz (mpyrcz@ualberta.ca) and C. V. Deutsch (cdeutsch@ualberta.ca) University of Alberta, Edmonton, Alberta, CANADA

More information

Spatial Backfitting of Roller Measurement Values from a Florida Test Bed

Spatial Backfitting of Roller Measurement Values from a Florida Test Bed Spatial Backfitting of Roller Measurement Values from a Florida Test Bed Daniel K. Heersink 1, Reinhard Furrer 1, and Mike A. Mooney 2 1 Institute of Mathematics, University of Zurich, CH-8057 Zurich 2

More information

Part 2: Multivariate fmri analysis using a sparsifying spatio-temporal prior

Part 2: Multivariate fmri analysis using a sparsifying spatio-temporal prior Chalmers Machine Learning Summer School Approximate message passing and biomedicine Part 2: Multivariate fmri analysis using a sparsifying spatio-temporal prior Tom Heskes joint work with Marcel van Gerven

More information

Gauge Plots. Gauge Plots JAPANESE BEETLE DATA MAXIMUM LIKELIHOOD FOR SPATIALLY CORRELATED DISCRETE DATA JAPANESE BEETLE DATA

Gauge Plots. Gauge Plots JAPANESE BEETLE DATA MAXIMUM LIKELIHOOD FOR SPATIALLY CORRELATED DISCRETE DATA JAPANESE BEETLE DATA JAPANESE BEETLE DATA 6 MAXIMUM LIKELIHOOD FOR SPATIALLY CORRELATED DISCRETE DATA Gauge Plots TuscaroraLisa Central Madsen Fairways, 996 January 9, 7 Grubs Adult Activity Grub Counts 6 8 Organic Matter

More information

Statistics 262: Intermediate Biostatistics Model selection

Statistics 262: Intermediate Biostatistics Model selection Statistics 262: Intermediate Biostatistics Model selection Jonathan Taylor & Kristin Cobb Statistics 262: Intermediate Biostatistics p.1/?? Today s class Model selection. Strategies for model selection.

More information

Using Estimating Equations for Spatially Correlated A

Using Estimating Equations for Spatially Correlated A Using Estimating Equations for Spatially Correlated Areal Data December 8, 2009 Introduction GEEs Spatial Estimating Equations Implementation Simulation Conclusion Typical Problem Assess the relationship

More information

SIMULATION STUDY OF SPATIAL- POISSON DATA ASSESSING INCLUSION OF SPATIAL CORRELATION AND NON- NORMALITY IN THE ANALYSIS

SIMULATION STUDY OF SPATIAL- POISSON DATA ASSESSING INCLUSION OF SPATIAL CORRELATION AND NON- NORMALITY IN THE ANALYSIS Libraries Conference on Applied Statistics in Agriculture 2001-13th Annual Conference Proceedings SIMULATION STUDY OF SPATIAL- POISSON DATA ASSESSING INCLUSION OF SPATIAL CORRELATION AND NON- NORMALITY

More information

Determining the Optimum Number of Increments in Composite Sampling

Determining the Optimum Number of Increments in Composite Sampling Brigham Young University BYU ScholarsArchive All Theses and Dissertations 2005-05-20 Determining the Optimum Number of Increments in Composite Sampling John Ellis Hathaway Brigham Young University - Provo

More information

Mark your answers ON THE EXAM ITSELF. If you are not sure of your answer you may wish to provide a brief explanation.

Mark your answers ON THE EXAM ITSELF. If you are not sure of your answer you may wish to provide a brief explanation. CS 189 Spring 2015 Introduction to Machine Learning Midterm You have 80 minutes for the exam. The exam is closed book, closed notes except your one-page crib sheet. No calculators or electronic items.

More information

Piotr Majer Risk Patterns and Correlated Brain Activities

Piotr Majer Risk Patterns and Correlated Brain Activities Alena My²i ková Piotr Majer Song Song Alena Myšičková Peter N. C. Mohr Peter N. C. Mohr Wolfgang K. Härdle Song Song Hauke R. Heekeren Wolfgang K. Härdle Hauke R. Heekeren C.A.S.E. Centre C.A.S.E. for

More information

TABLE OF CONTENTS INTRODUCTION TO MIXED-EFFECTS MODELS...3

TABLE OF CONTENTS INTRODUCTION TO MIXED-EFFECTS MODELS...3 Table of contents TABLE OF CONTENTS...1 1 INTRODUCTION TO MIXED-EFFECTS MODELS...3 Fixed-effects regression ignoring data clustering...5 Fixed-effects regression including data clustering...1 Fixed-effects

More information

Mixed models in R using the lme4 package Part 4: Theory of linear mixed models

Mixed models in R using the lme4 package Part 4: Theory of linear mixed models Mixed models in R using the lme4 package Part 4: Theory of linear mixed models Douglas Bates 8 th International Amsterdam Conference on Multilevel Analysis 2011-03-16 Douglas Bates

More information

Modelling temporal structure (in noise and signal)

Modelling temporal structure (in noise and signal) Modelling temporal structure (in noise and signal) Mark Woolrich, Christian Beckmann*, Salima Makni & Steve Smith FMRIB, Oxford *Imperial/FMRIB temporal noise: modelling temporal autocorrelation temporal

More information

over Time line for the means). Specifically, & covariances) just a fixed variance instead. PROC MIXED: to 1000 is default) list models with TYPE=VC */

over Time line for the means). Specifically, & covariances) just a fixed variance instead. PROC MIXED: to 1000 is default) list models with TYPE=VC */ CLP 944 Example 4 page 1 Within-Personn Fluctuation in Symptom Severity over Time These data come from a study of weekly fluctuation in psoriasis severity. There was no intervention and no real reason

More information

Machine Learning, Fall 2009: Midterm

Machine Learning, Fall 2009: Midterm 10-601 Machine Learning, Fall 009: Midterm Monday, November nd hours 1. Personal info: Name: Andrew account: E-mail address:. You are permitted two pages of notes and a calculator. Please turn off all

More information

CS 195-5: Machine Learning Problem Set 1

CS 195-5: Machine Learning Problem Set 1 CS 95-5: Machine Learning Problem Set Douglas Lanman dlanman@brown.edu 7 September Regression Problem Show that the prediction errors y f(x; ŵ) are necessarily uncorrelated with any linear function of

More information

Homework 1 Solutions Probability, Maximum Likelihood Estimation (MLE), Bayes Rule, knn

Homework 1 Solutions Probability, Maximum Likelihood Estimation (MLE), Bayes Rule, knn Homework 1 Solutions Probability, Maximum Likelihood Estimation (MLE), Bayes Rule, knn CMU 10-701: Machine Learning (Fall 2016) https://piazza.com/class/is95mzbrvpn63d OUT: September 13th DUE: September

More information

Comparing Group Means When Nonresponse Rates Differ

Comparing Group Means When Nonresponse Rates Differ UNF Digital Commons UNF Theses and Dissertations Student Scholarship 2015 Comparing Group Means When Nonresponse Rates Differ Gabriela M. Stegmann University of North Florida Suggested Citation Stegmann,

More information

Examining the accuracy of the normal approximation to the poisson random variable

Examining the accuracy of the normal approximation to the poisson random variable Eastern Michigan University DigitalCommons@EMU Master's Theses and Doctoral Dissertations Master's Theses, and Doctoral Dissertations, and Graduate Capstone Projects 2009 Examining the accuracy of the

More information

Stat/F&W Ecol/Hort 572 Review Points Ané, Spring 2010

Stat/F&W Ecol/Hort 572 Review Points Ané, Spring 2010 1 Linear models Y = Xβ + ɛ with ɛ N (0, σ 2 e) or Y N (Xβ, σ 2 e) where the model matrix X contains the information on predictors and β includes all coefficients (intercept, slope(s) etc.). 1. Number of

More information

Chapter 9: The Perceptron

Chapter 9: The Perceptron Chapter 9: The Perceptron 9.1 INTRODUCTION At this point in the book, we have completed all of the exercises that we are going to do with the James program. These exercises have shown that distributed

More information

Univariate ARIMA Models

Univariate ARIMA Models Univariate ARIMA Models ARIMA Model Building Steps: Identification: Using graphs, statistics, ACFs and PACFs, transformations, etc. to achieve stationary and tentatively identify patterns and model components.

More information

Topic 3: Hypothesis Testing

Topic 3: Hypothesis Testing CS 8850: Advanced Machine Learning Fall 07 Topic 3: Hypothesis Testing Instructor: Daniel L. Pimentel-Alarcón c Copyright 07 3. Introduction One of the simplest inference problems is that of deciding between

More information

Serial Correlation. Edps/Psych/Stat 587. Carolyn J. Anderson. Fall Department of Educational Psychology

Serial Correlation. Edps/Psych/Stat 587. Carolyn J. Anderson. Fall Department of Educational Psychology Serial Correlation Edps/Psych/Stat 587 Carolyn J. Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Fall 017 Model for Level 1 Residuals There are three sources

More information

Generalized Linear. Mixed Models. Methods and Applications. Modern Concepts, Walter W. Stroup. Texts in Statistical Science.

Generalized Linear. Mixed Models. Methods and Applications. Modern Concepts, Walter W. Stroup. Texts in Statistical Science. Texts in Statistical Science Generalized Linear Mixed Models Modern Concepts, Methods and Applications Walter W. Stroup CRC Press Taylor & Francis Croup Boca Raton London New York CRC Press is an imprint

More information

Geostatistics in Hydrology: Kriging interpolation

Geostatistics in Hydrology: Kriging interpolation Chapter Geostatistics in Hydrology: Kriging interpolation Hydrologic properties, such as rainfall, aquifer characteristics (porosity, hydraulic conductivity, transmissivity, storage coefficient, etc.),

More information

Introduction to Machine Learning Midterm Exam

Introduction to Machine Learning Midterm Exam 10-701 Introduction to Machine Learning Midterm Exam Instructors: Eric Xing, Ziv Bar-Joseph 17 November, 2015 There are 11 questions, for a total of 100 points. This exam is open book, open notes, but

More information

Time-Invariant Predictors in Longitudinal Models

Time-Invariant Predictors in Longitudinal Models Time-Invariant Predictors in Longitudinal Models Today s Class (or 3): Summary of steps in building unconditional models for time What happens to missing predictors Effects of time-invariant predictors

More information

Linear Regression. September 27, Chapter 3. Chapter 3 September 27, / 77

Linear Regression. September 27, Chapter 3. Chapter 3 September 27, / 77 Linear Regression Chapter 3 September 27, 2016 Chapter 3 September 27, 2016 1 / 77 1 3.1. Simple linear regression 2 3.2 Multiple linear regression 3 3.3. The least squares estimation 4 3.4. The statistical

More information

Final Review. Yang Feng. Yang Feng (Columbia University) Final Review 1 / 58

Final Review. Yang Feng.   Yang Feng (Columbia University) Final Review 1 / 58 Final Review Yang Feng http://www.stat.columbia.edu/~yangfeng Yang Feng (Columbia University) Final Review 1 / 58 Outline 1 Multiple Linear Regression (Estimation, Inference) 2 Special Topics for Multiple

More information

COMS 4721: Machine Learning for Data Science Lecture 10, 2/21/2017

COMS 4721: Machine Learning for Data Science Lecture 10, 2/21/2017 COMS 4721: Machine Learning for Data Science Lecture 10, 2/21/2017 Prof. John Paisley Department of Electrical Engineering & Data Science Institute Columbia University FEATURE EXPANSIONS FEATURE EXPANSIONS

More information

FINANCIAL ECONOMETRICS AND EMPIRICAL FINANCE -MODULE2 Midterm Exam Solutions - March 2015

FINANCIAL ECONOMETRICS AND EMPIRICAL FINANCE -MODULE2 Midterm Exam Solutions - March 2015 FINANCIAL ECONOMETRICS AND EMPIRICAL FINANCE -MODULE2 Midterm Exam Solutions - March 205 Time Allowed: 60 minutes Family Name (Surname) First Name Student Number (Matr.) Please answer all questions by

More information

HANDBOOK OF APPLICABLE MATHEMATICS

HANDBOOK OF APPLICABLE MATHEMATICS HANDBOOK OF APPLICABLE MATHEMATICS Chief Editor: Walter Ledermann Volume VI: Statistics PART A Edited by Emlyn Lloyd University of Lancaster A Wiley-Interscience Publication JOHN WILEY & SONS Chichester

More information

Graphical Procedures, SAS' PROC MIXED, and Tests of Repeated Measures Effects. H.J. Keselman University of Manitoba

Graphical Procedures, SAS' PROC MIXED, and Tests of Repeated Measures Effects. H.J. Keselman University of Manitoba 1 Graphical Procedures, SAS' PROC MIXED, and Tests of Repeated Measures Effects by H.J. Keselman University of Manitoba James Algina University of Florida and Rhonda K. Kowalchuk University of Manitoba

More information

Repeated Measures Modeling With PROC MIXED E. Barry Moser, Louisiana State University, Baton Rouge, LA

Repeated Measures Modeling With PROC MIXED E. Barry Moser, Louisiana State University, Baton Rouge, LA Paper 188-29 Repeated Measures Modeling With PROC MIXED E. Barry Moser, Louisiana State University, Baton Rouge, LA ABSTRACT PROC MIXED provides a very flexible environment in which to model many types

More information

Ch 6. Model Specification. Time Series Analysis

Ch 6. Model Specification. Time Series Analysis We start to build ARIMA(p,d,q) models. The subjects include: 1 how to determine p, d, q for a given series (Chapter 6); 2 how to estimate the parameters (φ s and θ s) of a specific ARIMA(p,d,q) model (Chapter

More information

An Introduction to Spatial Statistics. Chunfeng Huang Department of Statistics, Indiana University

An Introduction to Spatial Statistics. Chunfeng Huang Department of Statistics, Indiana University An Introduction to Spatial Statistics Chunfeng Huang Department of Statistics, Indiana University Microwave Sounding Unit (MSU) Anomalies (Monthly): 1979-2006. Iron Ore (Cressie, 1986) Raw percent data

More information

Chapter 4 - Fundamentals of spatial processes Lecture notes

Chapter 4 - Fundamentals of spatial processes Lecture notes Chapter 4 - Fundamentals of spatial processes Lecture notes Geir Storvik January 21, 2013 STK4150 - Intro 2 Spatial processes Typically correlation between nearby sites Mostly positive correlation Negative

More information

Modeling Transition Probabilities for Loan States Using a Bayesian Hierarchical Model

Modeling Transition Probabilities for Loan States Using a Bayesian Hierarchical Model Brigham Young University BYU ScholarsArchive All Theses and Dissertations 2007-11-30 Modeling Transition Probabilities for Loan States Using a Bayesian Hierarchical Model Rebecca Lee Monson Brigham Young

More information

Restricted Maximum Likelihood in Linear Regression and Linear Mixed-Effects Model

Restricted Maximum Likelihood in Linear Regression and Linear Mixed-Effects Model Restricted Maximum Likelihood in Linear Regression and Linear Mixed-Effects Model Xiuming Zhang zhangxiuming@u.nus.edu A*STAR-NUS Clinical Imaging Research Center October, 015 Summary This report derives

More information

Appendix A: Review of the General Linear Model

Appendix A: Review of the General Linear Model Appendix A: Review of the General Linear Model The generallinear modelis an important toolin many fmri data analyses. As the name general suggests, this model can be used for many different types of analyses,

More information

Chapter 1 Statistical Inference

Chapter 1 Statistical Inference Chapter 1 Statistical Inference causal inference To infer causality, you need a randomized experiment (or a huge observational study and lots of outside information). inference to populations Generalizations

More information

Pattern Recognition and Machine Learning

Pattern Recognition and Machine Learning Christopher M. Bishop Pattern Recognition and Machine Learning ÖSpri inger Contents Preface Mathematical notation Contents vii xi xiii 1 Introduction 1 1.1 Example: Polynomial Curve Fitting 4 1.2 Probability

More information

Practicum : Spatial Regression

Practicum : Spatial Regression : Alexandra M. Schmidt Instituto de Matemática UFRJ - www.dme.ufrj.br/ alex 2014 Búzios, RJ, www.dme.ufrj.br Exploratory (Spatial) Data Analysis 1. Non-spatial summaries Numerical summaries: Mean, median,

More information

Algorithmisches Lernen/Machine Learning

Algorithmisches Lernen/Machine Learning Algorithmisches Lernen/Machine Learning Part 1: Stefan Wermter Introduction Connectionist Learning (e.g. Neural Networks) Decision-Trees, Genetic Algorithms Part 2: Norman Hendrich Support-Vector Machines

More information

Combining Regressive and Auto-Regressive Models for Spatial-Temporal Prediction

Combining Regressive and Auto-Regressive Models for Spatial-Temporal Prediction Combining Regressive and Auto-Regressive Models for Spatial-Temporal Prediction Dragoljub Pokrajac DPOKRAJA@EECS.WSU.EDU Zoran Obradovic ZORAN@EECS.WSU.EDU School of Electrical Engineering and Computer

More information

Time-Invariant Predictors in Longitudinal Models

Time-Invariant Predictors in Longitudinal Models Time-Invariant Predictors in Longitudinal Models Topics: What happens to missing predictors Effects of time-invariant predictors Fixed vs. systematically varying vs. random effects Model building strategies

More information