Chapter 3 ANALYSIS OF RESPONSE PROFILES

Similar documents
Modeling the Mean: Response Profiles v. Parametric Curves

GAMINGRE 8/1/ of 7

= observed volume on day l for bin j = base volume in jth bin, and = residual error, assumed independent with mean zero.

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model

BUSI 460 Suggested Answers to Selected Review and Discussion Questions Lesson 7

Mr. XYZ. Stock Market Trading and Investment Astrology Report. Report Duration: 12 months. Type: Both Stocks and Option. Date: Apr 12, 2011

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model

Regression With a Categorical Independent Variable

Power and Sample Size for the Most Common Hypotheses in Mixed Models

Corn Basis Information By Tennessee Crop Reporting District

TIME SERIES ANALYSIS AND FORECASTING USING THE STATISTICAL MODEL ARIMA

Review of the General Linear Model

STA441: Spring Multiple Regression. This slide show is a free open source document. See the last slide for copyright information.

Repeated Measures ANOVA Multivariate ANOVA and Their Relationship to Linear Mixed Models

The science of learning from data.

Multiple Regression Analysis

Tutorial 6: Tutorial on Translating between GLIMMPSE Power Analysis and Data Analysis. Acknowledgements:

Regression With a Categorical Independent Variable

Experimental Design and Data Analysis for Biologists

DAILY QUESTIONS 28 TH JUNE 18 REASONING - CALENDAR

Technical note on seasonal adjustment for M0

Introduction to Statistical Analysis

2019 Settlement Calendar for ASX Cash Market Products. ASX Settlement

Technical note on seasonal adjustment for Capital goods imports

Rejection regions for the bivariate case

Chapter 6. Logistic Regression. 6.1 A linear model for the log odds

Mean Vector Inferences

Statistical Distribution Assumptions of General Linear Models

BIOSTATISTICAL METHODS

NATCOR Regression Modelling for Time Series

Analysis of variance, multivariate (MANOVA)

Statistics for IT Managers

Analysis of Incomplete Non-Normal Longitudinal Lipid Data

Multilevel Models in Matrix Form. Lecture 7 July 27, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2

UNIT-IV CORRELATION AND REGRESSION

Jayalath Ekanayake Jonas Tappolet Harald Gall Abraham Bernstein. Time variance and defect prediction in software projects: additional figures

Restricted Maximum Likelihood in Linear Regression and Linear Mixed-Effects Model

Lecture Prepared By: Mohammad Kamrul Arefin Lecturer, School of Business, North South University

Longitudinal Modeling with Logistic Regression

Lecture Prepared By: Mohammad Kamrul Arefin Lecturer, School of Business, North South University

Chapter 3 Multiple Regression Complete Example

2017 Settlement Calendar for ASX Cash Market Products ASX SETTLEMENT

Published by ASX Settlement Pty Limited A.B.N Settlement Calendar for ASX Cash Market Products

Lectures 5 & 6: Hypothesis Testing

Determine the trend for time series data

The Wealth Creator. Theme: 6 Reasons why the Phased SIP has worked. An equity SIP approach to Investing. Jul 09th 2016

Dates and Prices ICAEW - Manchester In Centre Programme Prices

Monitoring Platelet Issues - a novel approach CUSUM. Clive Hyam Blood Stocks Management Scheme London

Forecasting the Canadian Dollar Exchange Rate Wissam Saleh & Pablo Navarro

Semiparametric Regression

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

2 Naïve Methods. 2.1 Complete or available case analysis

General Principles Within-Cases Factors Only Within and Between. Within Cases ANOVA. Part One

Suan Sunandha Rajabhat University

Bayesian Linear Regression

YEAR 10 GENERAL MATHEMATICS 2017 STRAND: BIVARIATE DATA PART II CHAPTER 12 RESIDUAL ANALYSIS, LINEARITY AND TIME SERIES

Sample Size and Power Considerations for Longitudinal Studies

Chapter 15 Panel Data Models. Pooling Time-Series and Cross-Section Data

Multivariate Regression Model Results

Lecture 25. Ingo Ruczinski. November 24, Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University

CIMA Professional 2018

Discussion of Missing Data Methods in Longitudinal Studies: A Review by Ibrahim and Molenberghs

Time-Invariant Predictors in Longitudinal Models

CIMA Professional

Average 175, , , , , , ,046 YTD Total 1,098,649 1,509,593 1,868,795 1,418, ,169 1,977,225 2,065,321

Average 175, , , , , , ,940 YTD Total 944,460 1,284,944 1,635,177 1,183, ,954 1,744,134 1,565,640

CIMA Professional

Logistic Regression Analysis

Ron Heck, Fall Week 3: Notes Building a Two-Level Model

Introduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017

CIMA Professional 2018

A Re-Introduction to General Linear Models (GLM)

In Centre, Online Classroom Live and Online Classroom Programme Prices

DESIGNING EXPERIMENTS AND ANALYZING DATA A Model Comparison Perspective

One-Way ANOVA. Some examples of when ANOVA would be appropriate include:

Time Series Analysis

Analysis of Repeated Measures and Longitudinal Data in Health Services Research

Stat 579: Generalized Linear Models and Extensions

Monthly Trading Report Trading Date: Dec Monthly Trading Report December 2017

MANOVA is an extension of the univariate ANOVA as it involves more than one Dependent Variable (DV). The following are assumptions for using MANOVA:

Biostatistics Workshop Longitudinal Data Analysis. Session 4 GARRETT FITZMAURICE

(ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box.

Lecture 4: Statistical Hypothesis Testing

Testing Independence

R = µ + Bf Arbitrage Pricing Model, APM

Lecture 7 Time-dependent Covariates in Cox Regression

Chapter 8 Heteroskedasticity

Profile Analysis Multivariate Regression

General Linear Models. with General Linear Hypothesis Tests and Likelihood Ratio Tests

FinQuiz Notes

Virginia Basis Tables for Corn, Soybeans, and Wheat

Model Estimation Example

Process Behavior Analysis Understanding Variation

Improve Forecasts: Use Defect Signals

Multivariate Regression (Chapter 10)

Appendix A: Review of the General Linear Model

6 Pattern Mixture Models

Statistical Models for Rainfall with Applications to Index Insura

Chapter 14: Repeated-measures designs

Linear Models and Estimation by Least Squares

Transcription:

Chapter 3 ANALYSIS OF RESPONSE PROFILES 78

31 Introduction In this chapter we present a method for analysing longitudinal data that imposes minimal structure or restrictions on the mean responses over time and on the covariance among the repeated measures The method focusses on analysing response profiles and can be applied to longitudinal data when the design is balanced, with the timing of the repeat measures common to all individuals in the study However, the method can also handle incomplete longitudinal studies with balanced designs The method is appealing when there is a single categorical covariate (denoting exposure groups) and when no specific a priori pattern of the difference in response profiles between groups can be specified If E(Y ij ) is expressed exclusively in terms of the p explanatory variables X ij1, X ij2,, X ijp, the model is said to be unconditional If among these, time is simply interpreted as a discrete variable indicating the order in which the response is observed within each sample unit in a balanced design the corresponding models are known as profile models and are equivalent to those usually considered in ANOVA or ANCOVA problems The profile model may be expressed as Y = XM + E, where Y = (Y 1, Y 2,, Y N ), X = (X 1, X 2,, X p ), M is a p n matrix of unknown parameters and E = (e 1, e 2,, e n ) Typically, the elements of M denote expected responses for units under the different treatments at the different instants 79

At any given level of the group factor, the sequence of means over time is referred to as the mean response profile For example, the mean response profiles for the two groups randomized to a drug and its placebo The main goal in the analysis of response profiles is to characterise the patterns of change in the mean response over time in the groups and to determine whether the shapes of the mean response profiles differ for the two groups The methods for analysing response profiles can be extended in a straightforward way to handle the case where there is more than a single group factor and when there are baseline covariances that need to be adjusted for For example, in an observational study the groups might be defined by characteristics of the study subjects, such as age, gender or exposure level Rowel and Walters (1976) and Bryant and Gillings (1985) discuss the analysis strategies using standard parametric or non-parametric ANOVA or regression procedures on some kind of univariate or bivariate summary measure for the individual response profiles The article by Rowell and Walters (1976) is widely cited for popularising the analysis of summary measures of growth in many different disciplines Potthoff and Roy (1964) gives an extension of the profile models, which in the literature is often referred to as growth curves models The paper gives a typical example that involves the linear relation between an anatomic distance and age in orthodontical study 80

A detailed discussion of issues surrounding adjustment for baseline response in the analysis of change can be found in Lord (1967), Laird (1983) and Fitzmaurice (2001) MANOVA models may also be employed to analyse problems under the growth curve model Khatri (1966) show that the estimators proposed in Potthoff and Roy s (1964) paper are ML estimators Strengths and Weaknesses of Analysing Response Profiles The analysis of response profiles is a straightforward way to analyse data from a longitudinal study when the design is balanced, with the timings of the repeated measures common to all individuals in the study, and when all the covariates are discrete (eg, representing different treatments, interventions, or characteristics of the study subjects) It allows arbitrary patterns in the mean response over time and in the covariance of the responses As a result, this method has certain robustness since the potential risk of bias due to misspecification of the models for mean and covariance are minimal Although the method requires that data arise from a balanced design, it can be applied when the data are incomplete due to missing response data The analysis of response profiles can be extended in a straightforward way to handle the case where individuals can be grouped according to more than a single factor For example, if there are two covariates that are discrete (eg, treatment group and gender), the analysis will include tests of the 3-way and 2-way interactions among 81

these two factors and time (in addition to their main effect) The analysis of response profiles does have a number of potential drawbacks First, the requirement that the longitudinal design be balanced implies that the method cannot be applied when the vectors of repeated measures are obtained at different sequences of time except by moving an observation to the nearest planned measurement time As a result, the method is not well suited to handle mistimed measurements, a common problem in many longitudinal studies Note, however, that the method can handle unbalanced patterns of observations due to missing response data Second, the analysis ignores the time ordering of the repeated measures Third, because the analysis of response profiles produces an overall omnibus test of effects, it may have low power to detect group differences in specific trends in the mean response over time (eg, linear trends in the mean response) Single degree of freedom tests of specific time trends are more powerful Finally, in the analysis of response profiles, the number of estimated parameters (G n mean parameters and n(n+1) 2 covariance parameters) grows rapidly with the number of measurement occasions 32 Hypotheses Concerning Response Profiles Initially we focus on two group designs Generalizations to more than two groups are straightforward Given a sequence of n repeated measures on a number of distinct groups of individuals, three main questions concerning response profiles can be posed 82

1 Are the two mean response profiles similar in the two groups, in the sense that the mean response profiles are parallel? That is whether the patterns of changes in the response over time are same across the groups This question concerns the group time interaction effects 2 Assuming that the population mean responses are parallel, are the means constant over time, in the sense that the mean response profiles are flat This question concerns the time effect 3 Assuming that the population mean response profiles are parallel, are they also same at the same level in the sense that mean response profiles for the two groups coincide? The first question is of main scientific interest This question is the raison d être of a longitudinal study Whether the second and third questions have scientific interest depends upon the longitudinal study design An affirmative answer to the first question is assumed while considering the second and third questions, consistent with the general principle that the main effects are not of interest when there is an interaction among them The choice of appropriate hypothesis deserves considerations regarding whether the longitudinal data arise from a randomised trial or from an observational study In the former case, the study participants are randomised to treatment groups and the baseline values of the response are obtained prior to scientific interventions, ie, the mean response at occasion 1 is independent of treatment assignment That is, by 83

design, the group means are equal at baseline In contrast, in an observational study, there is no a priori reason to assume that the groups have the same mean at baseline unless the groups are selected by matching on baseline response In a randomised longitudinal clinical trial the only question of scientific interest is the first because it addresses whether changes in the mean response over time are the same in all groups The second question is of less importance as it does not involve a direct comparison of groups The second question concerns the time effect, where focus is on the comparison of the mean response at each occasion averaged over the groups Thus hypothesis concerning the main effect of time translate into the question whether overall (ie, averaged over groups) mean response has changed from baseline In a randomised trial, the third question is of no interest Actually, the test of group effect is subsumed within the test of group time interaction, since the absence of group time interaction implies that groups have same pattern of change over time and their mean response profiles must necessarily coincide In an observational study, the first question is usually of primary interest It addresses whether the patterns of change over time in the mean response vary by group In contrast to randomised trial, however, the second and third questions may also be of substantive interest For example, in a longitudinal study of growth or ageing, there may be interest in the pattern of change in the mean response over time, even when the pattern of change is same in all the groups This concerns the main effect of time and is addressed by the second question Ordinarily, when time effect is of interest, the trend in mean response is described with a parametric curve 84

In an observational study, there may also be interest in group comparisons of the mean response averaged over time This concerns the group effect and is addressed in the third question To highlight the main features of analysis of response profiles, consider an example from a two-group study comparing a novel treatment and a control Assume that the two groups have repeated measurements at the same set of n occasions The analysis of response profiles is based on comparing the mean response profiles in the two groups Let µ(t ) = {µ 1 (T ), µ 2 (T ),, µ n (T )} denote the mean response profile for the treatment group and µ(c) = {µ 1 (C), µ 2 (C), µ n (C)} denote that for the control group Let j = µ j (T ) µ j (C), j = 1, 2,, n The comparison can be made by considering the null hypothesis of no group time interaction, that is, the hypothesis that the mean response profiles are parallel, for which the requirement is that the differences j s are constant over time Thus, in terms of j s, the null hypothesis is H 01 : 1 = 2 = = n The test has n 1 degrees of freedom as the number of constraints on the mean responses under this hypothesis is n 1 When there are G groups with repeated measurements at the same set of n occasions, let µ(g) = {µ 1 (g), µ 2 (g),, µ n (g)} denote the mean response profile for the g th group (g = 1, 2,, G) With G groups, there are G 1 redundant comparisons Define j (g) = µ j (g) µ j (G), (for j = 1, 2,,n; g = 1, 2,, G 1) ie, j (g) is 85

the contrast or comparison of the mean response at the j th occasion for the g th group (g = 1, 2,, G 1) with the mean response at the j th occasion for the group G Then, the null hypothesis that the mean response profiles are parallel is H 01 : 1 (g) = 2 (g) = = n (g), for g = 1, 2,, G 1 The test has (G 1) (n 1) degrees of freedom However, unless the test of group time interaction has only one degree of freedom, this test does not help in discerning in what manner the pattern of change over time differ across groups In the next section we describe a general linear model formulation of the analysis of response profiles 33 General Linear Model Formulation Consider the general linear regression model E(Y i /X i ) = µ i = X i β, (331) for appropriate choices of X i The hypothesis of no group time interaction can be expressed in terms of β Let n be the number of repeated measures and N be the number of subjects To express the model for a design with G groups and n occasions of measurement, we require G n parameters for the G mean response profiles We illustrate the main idea with the help of a numerical example with two 86

groups measured at three occasions For the first group, let the design matrix be 1 0 0 0 0 0 X i = 0 1 0 0 0 0, 0 0 1 0 0 0 while for the second group the design matrix is 0 0 0 1 0 0 X i = 0 0 0 0 1 0 0 0 0 0 0 1 Then in terms of the model (331) where β = (β 1, β 2,, β 6 ) is a 6 1 vector of regression coefficients, Similarly µ(1) = µ(2) = µ 1 (1) µ 2 (1) µ 3 (1) µ 1 (2) µ 2 (2) µ 3 (2) = = The hypothesis about the mean response profiles in two groups in terms of µ(1) and µ(2) can now be expressed in terms of hypotheses about the components of β Specifically, the hypothesis of no group time interaction effect can be expressed as β 1 β 2 β 3 β 4 β 5 β 6 H 01 : (β 1 β 4 ) = (β 2 β 5 ) = (β 3 β 6 ) (332) In this parameterisation, hypothesis about the group time interaction cannot be expressed in terms of certain components of β being zero; instead, these hypotheses can be expressed in terms of Lβ = 0, for particular choices of vectors or matrices L For example (332) may now be expressed as H 01 : Lβ = 0, where L = ( 1 1 0 1 1 0 1 0 1 1 0 1 ) 87

An attractive feature of the general linear model (331) is that it can handle settings where the data for some subjects are missing For example, suppose that the i th subject belongs to the first group and is missing the response at the third occasion Then the appropriate design matrix for the subject is obtained by removing the third row of the full data design matrix for the subjects from the first group as X i = ( 1 0 0 0 0 0 0 1 0 0 0 0 In general, the appropriate design matrix for the i th subject is simply obtained by removing rows of the full data design matrix corresponding to the missing responses This allows analysis of response profiles to be based on all available observations of the subjects ) The general linear model (331) for two groups measured at three occasions could have also been expressed in terms of the following matrices 1 0 0 1 0 0 X i = 1 1 0 1 1 0, 1 0 1 1 0 1 for the first group, and X i = 1 0 0 0 0 0 1 1 0 0 0 0 1 0 1 0 0 0, for the second group In that case µ(2) = and, µ(1) = µ 1 (2) µ 2 (2) µ 3 (2) µ 1 (1) µ 2 (1) µ 3 (1) = = 88 β 1 β 1 + β 2 β 1 + β 3 ; β 1 + β 4 (β 1 + β 4 ) + (β 2 + β 5 ) (β 1 + β 4 ) + (β 3 + β 6 )

The choice of the reference group (here the second group) is arbitrary With this choice of design matrices for the two groups, the interpretation of the regression coefficients has changed The hypothesis of no group time interaction, now, is H 01 : β 5 = β 6 = 0 This parameterisation, often called the reference parameterisation, is more convenient since the hypothesis H 01 is represented by the vanishing of certain components of β, and is the one that is commonly adopted by many statistical packages If H 01 cannot be rejected, the hypotheses concerning mean effects of time and/or group may be of secondary interest Hypotheses concerning main effects can similarly be represented by the vanishing of certain components of β For example hypothesis of no time effect is H 02 : β 2 = β 3 = 0, and the hypothesis of no group effect is H 03 : β 4 = 0 For the general case with G groups measured at n occasions, the number of constraints under H 02 is n-1 and is the same regardless of the number of groups and the test of H 02 has n-1 degrees of freedom Similarly the number of constraints under H 03 is G-1 and is the same regardless of the number of occasions and the test of H 03 has G-1 degrees of freedom The tests of main effects are obtained from the reduced models that excludes the group time interaction 89

Finally, Given that the analysis of response profiles can be expressed in terms of the linear model (331), where β = (β 1, β 2, β p ) is a p 1 vector of regression coefficients (with p = G n), maximum likelihood estimation of β, and the construction of tests of the group time interaction (and the main effects of time and group), are possible once the covariance of Y i has been specified In the analysis of response profiles, the covariance of Y i is usually assumed to be unstructured with no constraints on n(n+1) 2 covariance parameters other than the requirement that they yield a symmetric matrix that is positive definite The condition of positive definiteness of the covariance matrix ensures that while repeated measures can be highly correlated, there must be no redundancy in the sense that one of the repeated measures can be expressed as a linear combination of the others The condition also ensures that no linear combination of responses can have negative variance Given REML (or ML) estimates of β, and their standard errors (and the estimated covariance of β, tests of group time interaction (and main effects of time and group), can be conducted using multivariate Wald tests Alternatively likelihood ratio tests can be constructed, but requires that the model be fit to the data with and without the constraints under the null hypothesis (ie, fitting the reduced and the full models respectively) Before illustrating the analysis of response profiles, we present a brief review of the reference group parametrisation 90

34 Reference Group Parameterisation Consider a group factor with G levels To present this factor in a linear model we can define a set of dummy or indicator variables, { 1, if the i Z ig = th subject belongs to group g; 0, otherwise Letting X i = (Z i1, Z i2,, Z ig ), the mean response in G groups, denoted by µ i (1), µ i (2),, µ i (G), can be expressed in terms of the linear model E(Y i /X i ) = µ i = X i β In this parameterisation µ i (1) µ i (2) µ i (G) = If we wish to introduce an intercept, say β 1, by setting the elements of the first column of X i to 1 (for all i = 1, 2,, N) then there is redundancy in X i, if all G indicator variables Z i1, Z i2,, Z ig are also included in the design vector X i To avoid this over specification, one of the indicator variables must be excluded from X i Arbitrarily we can drop Z ig Then with X i = (1, Z i1, Z i2,, Z i,g 1 ), the mean response in G β 1 β 2 β G groups can be expressed in terms of the linear model E(Y i /X i ) = µ i = X i β, where β = (β 1, β 2,, β G ) In this parameterisation, µ i (1) β 1 + β 2 µ i (2) β 1 + β 3 µ i (G 1) µ i (G) = β 1 + β G β 1 91

Because the intercept term, β 1, is also the mean of group G, and all the remaining components of β represent deviations from the mean of group G, this parameterisation is often referred to as the reference group parameterisation Here, the last level of the group factor (ie, group G) is the reference group and it is no coincidence that this is the same group whose indicator variable was excluded from X i 35 One Degree of freedom Tests for Group Time Interaction The tests of group time interaction is quite general and it posits no specific pattern for the difference in response profiles between groups This lack of specificity becomes a problem in studies with a large number of occasions of measurement because the general test for group time interaction with (G 1) (n 1) degrees of freedom, becomes less sensitive to an interaction with a specific pattern as n increases In the typical randomised trial of interventions, subjects are randomised to the intervention groups at baseline and the investigator seeks to determine whether the pattern of response after intervention differs between groups Randomisation implies that the mean at baseline is independent of treatment group, that is, by design, the groups have the same mean response at baseline In that setting, analysts frequently specify a single contrast believed to best represent the direction in which the pattern of response will differ most markedly For example, if we assume the first parameter- 92

isation described in Section 33 with two groups and wish to test for the equality of the difference between the average response at occasion 2 through n and the baseline value in the two groups, we can choose the contrast L = ( L 1, L 1 ), where L 1 = ( 1, ) 1 n 1, 1 n 1,, 1 n 1 Here, L 1 computes the mean response from occasion 2 through n and subtracts the mean response at baseline for a single group The latter can be thought of as the average change over the interval for a single group Thus L is the group contrast of this average change in the two groups A variant of this approach, known as Area Under the Curve Minus Baseline, or sometimes simply AUC, corresponds to calculation of the area under the trapezoidal curve created by connecting the responses plotted at the respective time points and subtracting y 1 (t n t 1 ), the area of the rectangle of height y 1 and width t n t 1 The AUC (minus baseline) is negative because the responses after intervention begins are smaller than baseline value The AUC (minus baseline) can be constructed by subtracting the baseline mean µ 1, from each of the means µ 1 through µ n and calculating area under the trapezoid constructed by connecting these differences To test for the equality of AUC in two groups, one employs the contrast L = ( L 2, L 2 ), 93

where L 2 = 1 2 (t 1 + t 2 2t n, t 3 t 1,, t j+1 t j 1,, t n t n 1 ) and 1(t 2 j+1 t j 1 ) is the value of the contrast vector for time points 1 (baseline) or n (the last occasion) These contrast weights are not intuitively obvious, but can be derived from the formula for the area of a trapezoid Although the curve in the above figure suggests that L is applied to the individual observations, it must be emphasized that the contrast weights are applied to the estimated means, not the individual observations A third popular method for constructing a single-degree-of freedom test corresponds to a test of the hypothesis that the trend over time is the same in several treatment groups This method is a special case of the growth curve analysis In many applications, the one-degree-of-freedom test will be statistically significant when the overall test for group time interaction is not For valid application of conventional significant levels, the form of the contrast must be specified prior to data analysis Otherwise, one would be at risk of seeking the best contrast and testing its significance as if it had been chosen in advance To guard against this criticism, the protocols for randomised trials usually specify the form of the contrast This requirement highlights a hazard of the one-degree-of-freedom tests The added sensitivity comes at the price of reduced generality If the difference between the treatment groups takes a form quite different from the pattern anticipated by the contrast, one can fail to obtain statistically significant result for a one-degree-of-freedom test even 94

when the overall test for group time interaction is statistically significant Thus one-degree-of-freedom test should be employed only when there is sufficient prior information to specify the contrast with confidence 36 A Numerical Illustration We shall illustrate the analysis of mean response using recent stock market data Stock prices of Indian IT major Infosys Technologies for the period 26 th December 2003 to 31 st December 2008 obtained from official web site of National Stock Exchange (NSE) of India Bombay Stock Exchange (BSE)indices for the corresponding period obtained from the BSE itself We arrived at the stock prices of Infosys Technologies by considering bonus share issue and stock spit for the last five years On 1 st july2004 the company issued bonus shares in the ratio 3:1 (3 shares for every 1 share in demat account before the book closure) Evidently every investor will have four shares instead of one in his demat account Therefore Net Asset Value (NAV) of one share held previously to 1 st july2004 will be 4 close price Similarly the company made a stock split in the ratio 2:1 on 13 th july2006 (one share with face value Rs10/- becomes 2 shares with face value Rs5/- each) Hence NAV of one share held previously to 1 st july2004 will be 8 close price The relevant figures shown in bold type and with * in table-31 The NAV taken as effective stock price for the share 95

Table- 31 Stock prices of Infosys Technologies with traded volume and NAV Date Prev Close Open Price Close Price Total Traded Net Asset Quantity Value 28-Jun-04 548390 553000 559790 490516 559790 29-Jun-04 559790 560500 558670 687773 558670 30-Jun-04 558670 557000 552410 611316 552410 1-Jul-04 552410 139500 140880* 1767215 @ 563520* 2-Jul-04 140880 136525 142335 1558128 569340 5-Jul-04 142335 142630 142945 969096 571780 6-Jul-04 142945 142700 142650 1201123 570600 7-Jul-04 142650 140415 139245 1776746 556980 8-Jul-04 139245 140000 135225 2366130 @ 540900 9-Jul-04 135225 130000 138250 1424235 553000 10-Jul-06 310500 318900 318805 764033 1275220 11-Jul-06 318805 319900 314820 810381 1259280 12-Jul-06 314820 330000 338565 4251043 1354260 13-Jul-06 338565 174800* 168195* 1423289 @ 1345560* 14-Jul-06 168195 165000 164875 1737712 1319000 17-Jul-06 164875 164100 160525 1552356 1284200 18-Jul-06 160525 161035 162955 1323057 1303640 19-Jul-06 162955 166100 160455 1448682 1283640 20-Jul-06 160455 169870 164785 1503490 @ 1318280 21-Jul-06 164785 163480 160210 1140542 1281680 @ figures considered for data set Nowadays, Cash Market of stock exchanges are highly influenced by Derivative Market (Futures & Options Market) Life of monthly contracts of derivatives end on last Thursday (critical day) of the month Stock prices will have a new lease of life after the critical day Thus we have considered stock prices on first Thursday (base line), on second Thursday (Week 1), on third Thursday (Week 2)and on the critical day (Week 4), for data set To make independent entities for the months, We have 96

taken the deviation of stock price from stock price on critical day of previous month Table- 32A Increase in Stock Prices and BSE Indices Stock Price BSE Index Month Base line Week 1 Week 2 Week 4 Base line Week 1 Week 2 Week 4 2003 Jan -2600 4995-13855 -48915-1758 156-912 -16276 Feb 25860-8515 -780-15395 8556 2763 8334 5746 Mar -755-4005 25555 18440-8699 -16910-8441 -16055 Apr 510-124000 -132635-139190 3437-8146 -13229-17956 May -11125 7795-1425 -12825 2256 2437 7574 22702 Jun 9020 20030 27360 46940 9795 17305 29035 38815 Jul -12040 38780 22440 36320 8749 12723 11651 24021 Aug -9385-18555 11225 29660 1422 12859 30278 41968 Sep 31085 32575 34235 60870 9822 18084-7814 8486 Oct 6640-6040 12315 10910 15793 40153 59017 48337 Nov 45880 13055 525 31175 26702 16864-929 20852 Dec 13340 4945 20050 49550 23686 31092 46596 65288 2004 Jan 21250 42425 13760-17075 27355 46662 42199 16083 Feb 18635 10005-14680 -36800-8212 13421 5235-23563 Mar 39990-2950 5210 28190 24875 8274-15218 -15268 Apr -5600 14510 16940-12825 32641 42401 42953 25399 May 25850 15500-11080 25900 8887-26896 -73632-60988 Jun -5630-2460 -9820 11140-24056 -11391-21867 -350 Jul 22605-015 36465 73325 16550 13529 17964 4119 Aug 8880-14780 460 3980 13233 1932 32 15 Sep 22840 30840 59760 59980 6327 16278 34223 44816 Oct 4000 47600 33700 101320 19005 12949 5745 13201 Nov 15560 27320 45600 32320 11726 23869 30985 31941 Dec 20120 5040 36300 7420 29340 26924 38535 48751 2005 Jan -1760-31920 -33360-38420 -15515-30148 -3393-28311 Feb 59700 36920 88740 84880 38054 33840 34986 33478 Mar 20060 27100 6720 37160 21051 33344 9531-8139 Apr -27560-62280 -124960-140640 5282-2490 -19362-20862 May 35820 62220 76460 119840 7545 17262 19474 38658 Jun -1540 12620 30100 61220-1522 16175 22963 52307 97

Table- 32B Increase in Stock Prices and BSE Indices Stock Price BSE Index Month Base line Week 1 Week 2 Week 4 Base line Week 1 Week 2 Week 4 Jul -15340-88960 -59700-51060 -4872-615 11047 41118 Aug 33160 25120 27220 16480 19205 21148 2063 5539 Sep 61540 60240 71500 101760 21573 39214 62334 98975 Oct 33160 38580 4960 24120-12147 -27327-71505 -57742 Nov 000 26980 48220 45780 000 23618 57677 67129 Dec 26780 48140 86680 114600 20074 16227 42636 57921 2006 Jan 26980-59860 -66620-66200 29449 5763 12659 22667 Feb 2460 32280-16340 -10960 29395 49490 57438 69413 Mar 23900 19380 50900 99380 38273 32949 63469 106299 Apr 28700-9660 93440 35880 43986-6981 73251 52798 May 31440 49340-39500 -121660 51261 60039-44359 -11687 Jun 520-49900 -40600 66040-59490 -137051-112126 -50416 Jul 68080 148100 120820 129220 60581 69634 19078 57943 Aug 11680 40200 79400 118440 18157 40758 73589 95746 Sep 400 19920 25360 41400 15480 27397 57522 68169 Oct 5640 129520 168280 172160 867 15724 34285 31767 Nov 9120 51040 101280 85080 39271 43908 80748 9979 Dec 35280 13120-12800 54800 27572-20915 -31145 15003 2007 Jan 29680-52160 -19240-5760 2537-21563 37141 43638 Feb 22360 106160 113560 37200-1554 36937 7283-26141 Mar -102680-121400 -166480-237440 -86176-97196 -147746-104165 Apr 1280 44120 39360 22520-12358 13415 64004 124922 May 50240-35720 -28280-76280 -15067-45765 7083 31558 Jun 26920 76560 27960 1240-35828 -34074-4522 -3989 Jul -6600-2280 57160 87440 35732 58747 104556 127174 Aug -108120-80080 -97000-135320 -79061-67616 -14181-65457 Sep 30880-24760 -51840 40520 49457 49270 122621 202882 Oct 66440 48800-20680 -63040 62658 166351 84783 162033 Nov 29520-96560 -151800-215840 95346 28804 1014 23237 Dec 54480 74160 102640 188120 79261 110113 15931 121346 98

Table- 33: Variance-Covariance matrix Ĉov( β) Stock Price BSE Index Base Week Week Week Base Week Week Week Group line 1 2 4 line 1 2 4 Stock Price Base line 157222 137529 150592 192448 108181 112491 124192 113025 Week 1 137529 459829 481844 523734 88084 193376 135510 140120 Week 2 150592 481844 698350 791594 96076 188283 236046 260312 Week 4 192448 523734 791594 1141654 114526 182603 255810 391719 BSE Index Base line 108181 88084 96076 114526 162529 179004 190168 165575 Week 1 112491 193376 188283 182603 179004 308012 264477 266490 Week 2 124192 135510 236046 255810 190168 264477 455341 432616 Week 4 113025 140120 260312 391719 165575 266490 432616 580713 Table-34: Mean increase and(std deviation) at Base line, Week 1, Week 2, Week 4 Group Base line Week 1 Week 2 Week 4 stock Price 14126 10153 11820 16118 ( 30714 ) ( 52526 ) ( 64731 ) ( 82764 ) BSE Index 12432 13595 16004 28057 ( 31228 ) ( 42989 ) ( 52269 ) ( 59028 ) A popular method for constructing a single-degree-of freedom test corresponds to a test of the hypothesis that the trend over time is the same in several groups This method is a special case of the growth curve analysis The test of group time interaction is based on multivariate Wald Test The test provides a simultaneous test of H 0 : Lβ = 0 vs H A : Lβ 0, for a suitable 99

choice of L and the test statistic can be constructed as W 2 = ( L β ) {LĈov( β)l } 1 ( L β ), (361) and compared to a χ 2 -distribution with degrees of freedom equal to the number of rows of L The increase in stock prices are computed for four time points in a month, the vector representing the contrast based on the mean response at time 2 through n minus baseline is given by L = ( L 1, L 1 ), = ( 1, 1 3, 1 3, 1 3, 1, 1 3, 1 3 ) 1 3 The Mean increase and standard deviation at base line, Week 1, Week 2, Week 4 for both the groups are shown in Its Variance-Covariance matrix shown in table (33) Thus from the table (33), we can easily determine that the average value of the mean response minus baseline as 144167 in the stock price group and 676753 in the BSE index group Thus if we assume the first parameterisation assumed in Section 34, then the L β = 820920 and the value of the Wald test statistics is Z = 12441 (or W 2 = 15478, with one degree of freedom), indicating no significant difference in the response pattern between the two groups The contrast for comparing the AUC (minus baseline) in the two groups is given by L = ( L 2, L 2 ), = ( 35, 1, 15, 1, 35, 1, 15, 1) 100

From Table (34), the estimated mean AUC is 543996 in the stock price group and 2214614 in the BSE index group Thus if we assume the first parameterisation assumed in Section 34, then the L β = 2778610, yielding a Wald statistics of Z = 1895 (or W 2 = 14149, with one degree of freedom), again not a significant statistic Thus both the methods of analysis provide a clear signal that the response profile does not differ in the two groups 101