EXTENDED NEYMAN SMOOTH GOODNESS-OF-FIT TESTS, APPLIED TO COMPETING HEAVY-TAILED DISTRIBUTIONS. J. Huston McCulloch and E. Richard Percy, Jr.

Size: px
Start display at page:

Download "EXTENDED NEYMAN SMOOTH GOODNESS-OF-FIT TESTS, APPLIED TO COMPETING HEAVY-TAILED DISTRIBUTIONS. J. Huston McCulloch and E. Richard Percy, Jr."

Transcription

1 EXTENDED NEYMAN SMOOTH GOODNESS-OF-FIT TESTS, APPLIED TO COMPETING HEAVY-TAILED DISTRIBUTIONS J. Huston McCulloch and E. Richard Percy, Jr.* J. Econoetrics Special Issue Subission: Sept. 4, 200 First Revision: Feb. 5, 20 Second Revision: April 2, 20 Third Revision: May 6, 20 Fourth Revision: June 2, 20 J. Huston McCulloch E. Richard Percy, Jr. Departent of Econoics 3503 Treehouse Lane Ohio State University Canal Winchester, OH 430 JEL Classification Codes: C2 (Hypothesis Testing), C6 (Specific Distributions) Keywords: Stable distribution, Student t distribution, Generalized Error Distribution, Lagrange Multiplier Test * The authors are grateful to two anonyous referees for their very helpful and patient coents and suggestions.

2 ABSTRACT A siplified version of the Neyan (937) Sooth goodness-of-fit test is extended to account for the presence of estiated odel paraeters, thereby reoving overfitting bias. Using a Lagrange Multiplier approach rather than the Likelihood Ratio statistic proposed by Neyan greatly siplifies the calculations. Polynoials, splines, and the step function of Pearson s test are copared as alternative perturbations to the theoretical unifor distribution. The extended tests have negligible size distortion and ore power than standard tests. The tests are applied to copeting syetric leptokurtic distributions with US stock return data. These are generally reected, priarily because of the presence of skewness.

3 . Introduction A siplified version of the Neyan (937) Sooth goodness-of-fit test (GFT) is extended to adust for the presence of estiated odel paraeters. The effect is to reduce size distortion and to increase the power of the test. As is well known (e.g. Bai 2003), standard tests that do not account for the estiation of odel paraeters tend to under-reect the distribution assued by the null hypothesis. The heavy tails of ost financial return series often lead to an easy reection of the Gaussian distribution. A paraetric distribution that is consistent with the data would allow the ean to be estiated ore efficiently than would a nonparaetric procedure. Furtherore, paraetric distributions allow out-of-saple extree tail probabilities to be estiated. These tail probabilities are iportant for the pricing of options and contingent clais, for Value at Risk calculations, and for non-financial applications such as the risk of floods and other extree events. When heavy tails are present, Mandelbrot (963), Faa (965), Saorodnitsky and Taqqu (994), and McCulloch (996) have suggested the use of stable distributions. Blattberg and Gonedes (974), Hageran (978), Perry (983), and Boothe and Glassan (987) propose the Student t distributions. Nelson (99) investigates the generalized error distribution (GED), while Praetz (972) and Clark (973) ipleent a ixture of noral distributions. This paper is aied at developing an appropriate GFT that can work well with all the above distributions. Although this paper assues identically and independently distributed (i.i.d.) errors, Percy (2006) addresses other odels including ARCH and GARCH odels of volatility clustering.

4 Section 2 introduces a siplified Lagrange Multiplier (LM) version of the Neyan Sooth GFT. Section 3 extends this test to accoodate estiated paraeters. Section 4 illustrates the extended test for the proposed heavy-tailed distributions with stock return data. Section 5 reports on the power of the test to discriinate between the candidate distributions, and Section 6 concludes. This paper draws heavily upon Percy (2006), which provides ore theory and details than are available herein. 2. A siplified Neyan Sooth GFT with known paraeters We can develop an LM test for any distribution with known or unknown paraeters by first constructing an LM test for the unifor distribution. 2.. An LM test for unifority For perturbation functions φ ( z),, that are linearly independent, bounded, and, α,,, define integrate to zero on the unit interval and a vector of coefficients g z; : φ z α, z [0, ]. For α in a sufficiently sall neighborhood N of the origin in, g ;α is a probability density function, with cuulative distribution function denoted by G ;α. This distribution nests the standard unifor distribution U(0,) when α 0. u u,, Consider a rando saple n H : u i ~ i.i.d. ;α u n. We wish to test H 0: u i ~ i.i.d. U(0,) vs. G with α 0. The log likelihood function of interest is Such tests are also called efficient score tests, score tests, or soeties Rao score tests in honor of C.R. Rao, who first proposed this type of test. 2

5 3 n i i u φ log ) ; L( log u α, and the score vector of first derivatives with respect to the s is n i h i h h i u u φ log L/ : ) ; ( u α s. When evaluated at the null, this siplifies to n i u i φ 0;u s. () Letting ;α G ~ Z, the Fisher inforation atrix with respect to the perturbation paraeters is,, ' I α α I where dz z z z Z Z I h h h 0 φ φ φ ; log g ; log g E : α α α. At the null hypothesis, this siplifies to dz z z, 0 φ φ 0 I. (2) The LM statistic is then n LM / ; ; u 0 s 0 I u 0 s. (3) Since the null hypothesis is a single internal point in the set N, the asyptotic distribution of this statistic is 2 with degrees of freedo under the null.

6 2.2. Functional for and basis Although any set of linearly independent bounded functions integrating to 0 ay be used in theory, there are nuerical considerations in choosing a basis so that tractable results can be obtained. We investigated polynoials, orthogonal polynoials, splines, and B-splines (Judd 999, Percy 2006). For sall values of, the deeaned standard polynoials φ ( z) z /( ) are an obvious choice. For larger values of, however, these standard polynoials are ill-conditioned for the atrix inversion required to copute the Fisher inforation atrix. Neyan-Pearson orthogonal polynoials are atheatically equivalent but coputationally better conditioned in this respect. However, rounding errors still ay arise in the evaluation of the score as there is a large disparity in the orders of agnitude between coefficients, even for idsize values of : Experience with polynoials derived by truncating [Taylor] series, [especially in their use with estiating transcendental functions,] ay islead one into thinking that the use of high order polynoials does not lead to coputational difficulties. However it ust be appreciated that truncated [Taylor] series are not typical of polynoials in general. [Truncated Taylor series] have the special feature that the ters decrease rapidly in size for values of x in the range for which they are appropriate. A tendency to underestiate the difficulties involved in working with general polynoials is perhaps a consequence of one s experience in classical analysis. There it is natural to regard a polynoial as a very desirable function since it is bounded in any finite region and has derivatives of all orders. In nuerical work, however, polynoials having coefficients which are ore or less arbitrary are tiresoe to deal with by entirely autoatic procedures. (Wilkinson 963, p. 38) Polynoials have the further drawback that fitting one region better ay require fitting other distant regions ore poorly. Cubic splines, which are piecewise cubic functions with discontinuities in the third derivative at selected knotpoints, are ore pliable and better able to fit a particular interval without affecting ore distant intervals as uch. Siple spline basis functions can create siilar nuerical probles with the inversion of the inforation atrix but the so-called B-spline basis solves this 4

7 issue, while generating the sae space of functions. Thus, for uch of the analysis, we used B- splines as a basis but also investigated orthogonal polynoials with a oderate nuber of paraeters. s have soe nuerical properties that are ore desirable than polynoials, while the tradeoff in the other properties is not severe. The classic Pearson GFT arises as a special case of the LM test when each perturbation function is a step function integrating to zero, with discontinuities at selected points. Piecewise linear and quadratic spline perturbation functions are also considered below. The proposed LM GFT is closely related to the Neyan (937) Sooth test, so naed because the alternative distributions vary soothly away fro the null hypothesized distribution rather than with discontinuities as in the Pearson test. However, Neyan s test, which he called the 2 test (as contrasted with Pearson s 2 test), was based on a likelihood ratio statistic rather than an LM statistic and hence required estiating the odel both under the null and the perturbed alternative. Furtherore, in order to prevent negative densities, his perturbations were exponentiated polynoials, nuerically constrained to integrate to zero. As noted already by Rayner and Best (989), the LM siplification of the Neyan test eployed here only needs to be estiated under the null, a uch sipler calculation. Furtherore, since the alternative only needs to be a proper density in a neighborhood of the null hypothesis, polynoial or spline perturbations, which are easily constrained to integrate to zero, ay be used directly, without exponentiation LM test for a copletely specified distribution The LM test for any continuous distribution with known paraeters is essentially the sae as that for the unifor distribution developed above. 5

8 Consider a rando saple n y y,, y n. One would like to test: H 0: y i ~ i.i.d. F(), where F() is a copletely specified continuous distribution with density f()=f (), vs. H : y i ~ i.i.d. F G ;α with 0 α. Define u F i y i. Then, as is well known, y i ~ F() if and only if u i ~ U(0,). Under the alternative, the density is f g F ; α. The log-likelihood function is then n ; y log gu i ; α log f y i n log L α. i i Since the second suation does not depend on α, the derivatives necessary to calculate the LM statistic are identical to those for the test for the unifor distribution. Thus one can siply use the transfored observations u i with the test for the unifor distribution (3). All tables and critical values that are suitable for the test of unifority are also suitable for a general distribution Finite saple properties with a copletely specified distribution As noted, the LM statistic has a liiting asyptotic distribution that is chi-squared with degrees of freedo equal to, the nuber of perturbation paraeters. Siulations reported in Percy (2006) indicate that for saple size n 30, 5, and test size = 0.05, the convergence to the liiting distribution is quite rapid. At the indicated values of, n, and test size, the 95 th percentile of the siulated distributions was generally within sapling error of the 95 th percentile of a chi-squared rando variable. 6

9 Difference 0.05 Cubic Size Distortion (=6) ` Cuulative Probability n=30 n=00 n=300 n=000 95% CI Figure. Size distortion of cubic spline test ( = 6). Figure shows results of the siulations for 9999 replications with a cubic spline perturbation and = 6. The solid lines indicate the difference between the theoretical chi-squared distribution and the epirical distribution of the siulations. The dashed lines indicate 95% confidence intervals for the quantiles of an epirical CDF of rando draws fro the chi-squared distribution. Except for very sall saples with n = 30, the correspondence is as close as could be expected at all quantiles. See Percy (2006) for illustrations of size distortion for other paraeter sizes and basis functions. 7

10 3. The LM test with estiated odel paraeters We now extend the LM test to the typical situation in which the values of the odel paraeters are not known and so ust be estiated. Our null hypothesis is now H 0 : y i ~ i.i.d. F ;θ, where F is a continuous distribution with density ;θ f that depends on the k vector of paraeters θ. This vector lies in the interior of a convex set Θ, but is otherwise unknown. The Fisher inforation atrix is assued to be positive definite and finite for all interior points of this set. Let the axiu likelihood (ML) estiate of θ under the null hypothesis be θˆ and set u ˆ uˆ,, uˆ where ˆ F y ;θˆ n u. i i For ost of the copeting heavy-tailed distributions considered here, naely the syetric stable, Student t, and generalized error distributions, the unknowns consist of a location, scale, and shape paraeter, so that k = 3. For a ixture of two Gaussian distributions with a coon ean, two variances, and an unknown ixing probability, k = 4. More generally, the location of each y i could be deterined by a possibly non-linear function h(x i ;), where x i is a vector of known constants or of exogenous rando variables and is a vector of unknown coefficients to be estiated by ML. However, the present paper illustrates the extended test only in cases involving a coon location paraeter. Our alternative hypothesis is now H : y i ~ i.i.d. F defined as above. The log likelihood is now n, α; y log gfy i ; θ; α log f y ; θ i n log L θ. The full score vector i i G ;θ ;α, with 0 α and G ;α s θ, α; y s s θ α θ, α; y θ, α; y log L/ log L/ h hk 8

11 is now (k+), but θ ˆ, 0; y 0 by the ML first order conditions, while θ, 0; y s 0; uˆ (), since g ( ; α) 0 when α 0. The full Fisher inforation atrix I θ, α I I θθ θα s θ θ, α Iθαθ, α θ, α I θ, α is now of diension (k+) (k+), with I I I αα θ ˆ, α I θθθ ˆ, αhh h, hk θθ, θα θ ˆ, α I θαθ ˆ, αh hk, θ ˆ, α I ααθ ˆ, α, αα. Letting Z ~ F I I I θθ θα αα G ;θ ;α, we have log g F Z; θ θˆ, 0 : E hh' f ; αf Z; θ log gfz ; θ; αf Z; θ z; θ f z; θ h h h log g F Z; θ θˆ, 0 : E h f z; θ / f z; θ dz, θθˆ ; αf Z; θ log gfz ; θ; αf Z; θ h φ log g F Z; θ θˆ, 0 : E φ so that θ, 0 I0 I αα 0 h Fz; θ dz, θθˆ ; αf Z; θ log gfz ; θ; αf Z; θ zφ z dz, ˆ as in (2). h s α θθˆ, α0 θθˆ, α0 θθˆ, α0 ˆ as in 9

12 LM s s Using the partitioned atrix inverse forula, the extended LM test statistic becoes θˆ, 0; yi θˆ, 0sθˆ, 0; y/ n 0, uˆ I0 I θˆ, 0I θˆ, 0I θˆ, 0 s0, uˆ / n. θα θθ θα By standard theory, the asyptotic distribution of this statistic under the null is again 2 with degrees of freedo. The subtraction of I I I inside the inverse increases the extended test θα θθ θα statistic by ust enough to prevent overacceptance of the null hypothesis when û is used in place of u. Although I 0 can readily be coputed in closed for for polynoial, spline, or step perturbation functions, calculation of the reaining inforation coponents of the extended LM statistic requires nuerical integrations that ay depend on the distribution in question, its paraeters, and the for and nuber of the perturbation functions. DuMouchel (975) has tabulated I θθ θ, 0for the stable distributions. See Percy (2006) for coputational details and tables for the distributions and perturbation functions considered here. In the extended test, finite saple critical values depend on the specific odel and true paraeter values. Siulations reported in Percy (2006) indicate ore size distortion than with known paraeters. However, size distortion can still largely be ignored, even for relatively sall saple sizes. 4. The extended test on stock arket returns The extended LM test is illustrated in this section using continuously copounded percent real onthly returns, including dividends, on the CRSP value-weighted stock arket index, during the 40-year period fro /53-2/92 (480 observations), as eployed in McCulloch (997). Percy (2006) also considers the 50-year period ending in 2/2002. However, volatility clustering is ore 0

13 apparent in the longer series. Although estiated ARCH or GARCH paraeters ay be incorporated into the extended test, the present paper assues i.i.d. errors, and hence this section uses only the shorter period. The extended test was perfored with the following syetric distributions: () Gaussian, (2) syetric stable, (3) Student t with a ean and scale paraeter, and (4) the generalized error distribution. We also used various perturbation functions: a step function that extends the traditional Pearson test, Neyan-Legendre polynoials, and linear, quadratic, and cubic B-splines. All the splines, as well as the extended Pearson test, used equidistant knotpoints. Table reports axiu likelihood estiates under a Gaussian null hypothesis. Table 2 reports the extended LM test statistics and asyptotic p values for the hypothesis that these returns are Gaussian, with up to twelve perturbation paraeters in the alternative hypotheses. Gaussian ML Estiates Estiate Standard Error z-score Probability Std. Deviation Mean Log L Table. Gaussian axiu likelihood estiates (n = 480).

14 Pearson Lagrange Multiplier Test Statistics Neyan- Legendre Linear Quadratic Cubic Copleent of chi square inverse of test statistic Pearson Neyan- Legendre Linear Quadratic e-07.8e-05.3e-07 Cubic e e e e e-0 5.7e-08.2e e e-0.6e e-0 3.5e-0 6.2e-06.2e-0.4e-08.6e e e-06.7e-2.5e-08.4e e- 8 3.e e-3 4.4e e-0 7.e e-5 4.6e e- 2.0e e-7 6.8e e-2 2.6e e-7 5.3e e-2 2.8e e-7 6.6e-0.6e-2.3e-4 Table 2. LM Test Statistics and p values for Gaussian null hypothesis. The Jarque-Bera (987) statistic for i.i.d. norality is 89.9 with this data. The p value for this statistic with the appropriate 2 degrees of freedo is 0-4. Many of the p values in Table 2 are zero to several places, but none is as sall as for the Jarque-Bera statistic. Jarque-Bera is designed to be sensitive to departures of skewness fro 0 and of kurtosis fro 3, and is unquestionably the best test to use for departures fro norality that show up strongly in these oents. The 2

15 extended Neyan LM statistic, on the other hand, will be sensitive to other departures fro norality, depending on the nuber and type of basis functions, and, with sufficiently large saple size and sufficiently large values of, could eventually detect distributions that were not Gaussian even if they had zero skewness and kurtosis equal to 3. Drawing attention to the Pearson statistics oentarily, one can see that the Pearson statistics with low paraeter nubers are not particularly adept at identifying the non-gaussian nature of this data set. For =, the significance level is and for = 2, the significance level is Since Pearson takes no account of where an error resides within the bin to which it is assigned, it will often not be very sensitive to departures fro the posited distribution, so we recoend against use of this statistic whenever the null is a continuous distribution, even when it is corrected, as here, for estiated paraeters. In Table 3, we estiate the syetric stable distribution paraeters by axiu likelihood using the density approxiation of McCulloch (998). As in McCulloch (997), the estiated stable characteristic exponent (not to be confused with the perturbation coefficients of the preceding sections) is.845. The algorith fits the natural logarith of the scale c, so that the asyptotic standard errors apply to log c rather than c itself. Although.845 is 2.63 asyptotic standard errors fro the value of 2 corresponding to a Gaussian distribution, and the likelihood ratio (LR) statistic for the hypothesis of norality is 26.66, these statistics do not have their usual N(0,) and 2 () distributions, because 2 is on the boundary of the perissible paraeter space. Nevertheless, the siulations of McCulloch (997) deonstrate that the 5% critical value for the LR statistic is less than.2 for this saple size, and that norality can be reected with p <<.004. Table 4 reports the extended LM test of the null that these real returns are syetric stable, with the sae alternatives as considered in Table 2. 3

16 Syetric Stable ML Estiates Estiate Standard Error z-score Probability Log scale Scale c 2.7 Stable a Mean Log L Table 3. Syetric stable axiu likelihood estiates (n = 480). Pearson Lagrange Multiplier Test Statistics Neyan- Legendre Linear Quadratic Cubic Copleent of chi square inverse of test statistic Pearson Neyan- Legendre Linear Quadratic Cubic Table 4. LM Test Statistics and p values for syetric stable null hypothesis. 4

17 Turning our attention to the Neyan-Legendre polynoial tests, the first three tests do not reect the syetric stable distribution at the 5% test size. However, when we add the fourth basis function, we begin to obtain significant reections. The first three Neyan-Legendre statistics are necessarily identical to the first statistic for the linear, quadratic, and cubic splines, respectively. For the cubic spline with ore than 3 paraeters and therefore at least one internal knotpoint, the results are siilar to those for the Neyan-Legendre statistics. The step-function Pearson results are far ore dependent on than are the sooth alternatives. Tables 5 and 6 report the sae tests for the Student t distribution. Infinite Student t degrees of freedo (DOF) correspond to a proper, Gaussian distribution, so the search was paraeterized in ters of reciprocal DOF rather than DOF directly. Tables 7 and 8 report these tests for the GED. An infinite GED power paraeter leads to a proper, U(-, ) liit, so this search was also paraeterized in ters of the reciprocal of the shape paraeter. In order for the Fisher inforation atrix to be finite, it is necessary to restrict the power to be strictly greater than, so as to ust exclude the Laplace distribution. However, this restriction was not binding. Percy (2006) reports siilar test results for a ixture of two Gaussian distributions. 5

18 Student t ML Estiates Estiate Standard Error z-score Probability Log Scale Scale c 3.53 Reciprocal DOF Deg. of Freedo Mean e-5 Log L Table 5. Student t axiu likelihood estiates (n = 480) Pearson Lagrange Multiplier Test Statistics Neyan- Legendre Linear Quadratic Cubic Copleent of chi square inverse of test statistic Pearson Neyan- Legendre Linear Quadratic Cubic Table 6. LM Test Statistics and p values for Student t null hypothesis. 6

19 Generalized Error Distribution (GED) ML Estiates Estiate Standard Error z-score Probability Log Scale Scale c 4.65 Recip. power e-4 GED power.404 Mean e-5 Log L Table 7. GED axiu likelihood estiates (n = 480) Pearson Lagrange Multiplier Test Statistics Neyan- Legendre Linear Quadratic Cubic Copleent of chi square inverse of test statistic Pearson Neyan- Legendre 7 Linear Quadratic Cubic Table 8. LM Test Statistics and p values for GED null hypothesis.

20 The ost easily reected of these three distributions is the GED. These distributions have relatively thin tails copared to the others. As a general rule, the presence of ore than the expected value of outliers (or even one extree outlier) often allows for the reection of null hypotheses of thin-tailed distributions, while the absence of outliers does not allow for as easy of a reection of null hypotheses of heavy-tailed distributions. Of course as the saple size grows, eventually a test for a heavy-tailed distribution will decrease its p value if outliers do not eventually appear. For purely coputational reasons, the present paper considers only syetric distributions. Although all three syetric distributions considered in the present section encounter frequent reections (with the notable exception of the Student t for between 6 and 9), it is likely that any of these reections would be reversed if syetry were not iposed. The nuerical approxiation of Nolan (997) perits skew-stable distributions to be fit by ML, while the Student t distribution can be generalized to incorporate skewness in a natural way (Ki and McCulloch 2007). It is also possible to skew the GED, albeit in an ad hoc anner, siply by expanding the horizontal axis on one side of the origin while copressing it on the other. With liited data, densities that are siilar over uch of their support cannot be distinguished fro one other very easily. This does not allow one to ake very strong stateents about the extree tail probabilities, where the densities ay differ considerably. Although this data set has only 480 onthly returns, 40 years of daily returns would yield about 0,000 observations. When daily data is used, however, the returns often becoe less independent and less identically distributed since there is ore apparent volatility clustering, day-of-the-week effects in both ean and scale, holiday effects, end-of-year effects, and other coplications. However, the extended 8

21 Neyan Sooth GFT developed here allows these additional effects to be estiated without size distortion. 5. Size distortion and power with financial data paraeter values This section investigates criteria for deterining the best test to use in ters of size distortion and power against the syetric leptokurtic alternatives considered above, and also a ixture of two norals with a coon ean. With the extended test, size distortion and power ay depend on both the distribution and true paraeter values in question. For this purpose, following Percy (2006), we use paraeter estiates for percent log real onthly returns for the 50- year period January, 953, through Deceber, The ML paraeter estiates for this longer period were as follows: Syetric stable: =.862, log scale =.024, Student t: degrees of freedo = 6.864, log scale =.293, ean = 0.64 Generalized error distribution: power =.49, log scale =.568, ean = Mixture of two Gaussians: probability of saller standard deviation = 0.906, log(saller st. dev.) =.308, log(larger st. dev.) = 2.25, ean = These values were used in siulations for both null hypotheses and data-generating processes. It is iportant to note that the conclusions to be drawn using these paraeter values ay not be applicable with other values. The size distortion and power against each of the other distributions considered were investigated for each of the four leptokurtic distributions. All sizes and powers are based on siulations using 000 saples, described in ore detail in Percy (2006). All data is generated by one of the four hypothesized distributions with the indicated paraeters. There are two test sizes 9

22 investigated for each scenario, 0.0 and Fro to 20 basis paraeters are tested, or 3 to 20 for the cubic splines. Six saple sizes were considered: n = 32, 00, 36, 000, 362 and 0,000 (0 k, k = 3 / 2, 2, 5 / 2, 3, 7 / 2, 4). We tested the size distortion for each extended goodness-of-fit test and did the sae using conventional tests without the adustent for estiated paraeters. This produces 8 power tests per null hypothesis, based on 6 possible saple sizes with 3 possible alternative hypotheses. For each category there is a size-adusted power for the corrected tests and non-adusted powers for both the corrected tests and the conventional tests. Preliinary tests indicated that the Neyan-Legendre polynoial basis and the Cubic basis generally outperfored the other bases investigated (Pearson, the Quadratic and the Linear ). Accordingly, only those two bases are copared here. 5.. Size distortion There is treendous size distortion with the uncorrected conventional tests in every instance for every value of. This distortion does not go away as the saple size increases fro 32 to 0,000. It diinishes soewhat as the nuber of basis paraeters increases, but this is still not very helpful. The distortion is, as expected, in the direction of over-acceptance. Figure 2 illustrates the size iproveent fro the extended test for the syetric stable distribution with saple size 36 and test size 0.0. It shows that the size of the extended test lies ostly within 95% confidence liits of the intended size, while that of the conventional tests lies copletely outside this interval. This exaple is typical of the full set of figures reported in Percy (2006). For the corrected tests, there is soe size distortion for saller saple sizes, but generally uch saller than with the conventional tests. This size distortion vanishes, within sapling error, for oderate saple sizes. 20

23 Percent Reect Null 0.4 Actual Size and 95% Conf Liits around 0.0 Stable, 36 observations, 0.0 test Ext. Poly Ext. Polynoial 95% CI Paraeters Figure 2. Size Iproveent due to Extended Test, syetric stable null, n = 36, test size.0, with 95% CI bracketing noinal test size. For the ixture distribution and as few as 00 observations, with up to = 6 perturbation paraeters, the extended test size distortion is undetectable. For 36 observations and the level 0.0 test, the results are ust barely in the low end of the confidence interval for ost values of. For 000 and ore observations, there is no significant distortion. Distortion gradually disappears for the stable null fro n = 36 to 000 as well, although it is sall for saller saple sizes with sall nubers of basis paraeters as well. For Student nulls, distortion dies out at only 00 observations. GED nulls require around 36 observations Power and recoendations The over-acceptance caused by the size distortion in the conventional tests contributes to poor power against the chosen nulls. With the conventional tests, fitting the odel paraeters 2

24 biases against reecting even false hypotheses. Hence, practitioners ay all too often istakenly conclude that, since their test does not reect an alternative hypothesis, they are ustified in accepting the validity of the assuptions in their study. The uncorrected tests have power even less than the test size for the largest saple sizes! The corrected tests do have ore power than the conventional tests. However, it is quite difficult to tell the leptokurtic distributions under consideration fro one another when the saple size is sall, even with the corrected tests. When the saple size is high enough for there to be reasonable levels of power (4 to 5 ties the test size), there is negligible size distortion. However, there are often significant power gains to be ade by adapting the functional for and nuber of perturbation paraeters to the specific null in question. Accordingly, recoendations are ade below that are specific to the null, yet robust to the actual data-generating distribution, using paraeters as fit above to stock arket data. With a stable null: a GED distribution can start to be detected with as few as 36 observations. A pattern eerges which has a power peak at only = 2 polynoial test paraeters, or 3 for the cubic spline. This power peak also works well for detecting a Student distribution. The ixture of norals, with its extra paraeter, is quite difficult to identify, requiring ore than 3000 observations even to get odest 30% power levels. At 0,000 observations the ost power coes with a large nuber of paraeters. But at 0,000 observations, any value of (ore than 2) will have the sae (00%) power levels for the GED and Student distributions. At this saple size, it is recoended to use the cubic spline, perhaps with = 5 test paraeters, because it has fewer nuerical difficulties than the polynoial as increases. See Percy (2006) for graphs illustrating these findings. 22

25 Under a Student t null hypothesis, a stable distribution can be seen alost half the tie with 36 observations. Alost any value of greater than 2 will work equally effectively. GED and noral ixtures are still concealed for the ost part at this saple size, but a test with 2 or 3 paraeters has the best chance of finding the. So, if Student t is the null, one ay proceed with 2 or 3 test paraeters, regardless of saple size. With a GED null hypothesis, the advisable nuber is 4 paraeters. Using a basis with only three paraeters has very low power but there is a treendous increase at 4, with sall increases after that. With a ixture of two Gaussians as the null hypothesis, there is little power to detect the other three distributions, unless one has at least 3000 observations. The pattern is a bit unusual, however, in that higher values of generally yield ore power. For this reason, the cubic spline is recoended here to avoid nuerical inaccuracy, with about 5 paraeters. At 0,000 observations, even with the possible inaccuracies, the test with 8 paraeters has fairly low distortion levels, so it would be safe to use against the chosen alternatives. For further discussion, see Percy (2006). 6. Conclusion The extended Neyan Sooth test can be used to analyze econoic and financial data to probe the distribution underlying the generation of the data. Soe parsionious paraetric distributions ay be found that will aid inferences about levels of and relationships between econoic variables. Thus, asyptotically consistent estiates of paraeters are possible without either presuing norality of error ters or using solely nonparaetric techniques. In that regard, these new procedures can offer new answers to old questions. 23

26 Unlike any goodness-of-fit tests, unknown odel paraeters can be estiated with the extended Neyan tests without preudicing the tests. Since these tests rely on axiu likelihood techniques, they asyptotically eet the conditions of the Neyan-Pearson lea against any siple alternative hypothesis in its paraeter space. Tests with one-sided alternatives that eet these criteria qualify as Uniforly Most Powerful (UMP) tests for arbitrary significance levels. odels are ore tractable than polynoial odels with existing double precision software, and it does not appear that this tractability is obtained at the cost of lower power in tests of interest. Size distortion that is present in the conventional tests is lowered considerably with the extended tests, even for odest saple sizes. A related benefit is increased test power. REFERENCES Bai, J., 2003, Testing paraetric conditional distributions of dynaic odels. Review of Econoics and Statistics 85, Blattberg, R., and N. Gonedes, 974, A coparison of the stable and Student distributions as statistical odels for stock prices. Journal of Business 47, Boothe, P., and D. Glassan, 987, The statistical distribution of exchange rates. Journal of International Econoics 22, Clark, P.K., 973, A subordinated stochastic process odel with finite variance for speculative prices. Econoetrica 4, DuMouchel, W.H., 975, Stable distributions in statistical inference: 2. Inforation fro stably distributed saples. Journal of the Aerican Statistical Association 70, Faa, E.F., 965, The behavior of stock prices. Journal of Business 38, Hageran, R.L., 978, More evidence on the distribution of security returns. Journal of Finance 33,

27 Jarque, C.M., and A.K. Bera, 987, A test for norality of observations and regression residuals. International Statistical Review 55, Judd, K.L., 999, Nuerical ethods in econoics. MIT Press, Cabridge. Ki, Y.I., and J.H. McCulloch, 2007, The skew-student distribution with application to U.S. stock arket returns and the equity preiu. Ohio State University. Mandelbrot, B., 963, The variation of certain speculative prices. Journal of Business 36, McCulloch, J.H., 996, Financial applications of stable distributions, in: G.S. Maddala and C.R. Rao, (Eds.), Handbook of statistics, Vol. 4: Statistical ethods in finance, Elsevier Science, Asterda. McCulloch, J.H., 997, Measuring tail thickness to estiate the stable index A critique. Journal of Business & Econoic Statistics 5, McCulloch, J.H., 998, Nuerical approxiation of the syetric stable distribution and density, in: R. Adler, R. Feldan, and M.S. Taqqu (Eds.), A practical guide to heavy tails, Burkhäuser, Boston. Nelson, D.B., 99, Conditional heteroskedasticity in asset returns: A new approach. Econoetrica 59, Neyan, J., 937, Sooth test for goodness of fit. Skandinavisk Aktuarietidskrift 20, Nolan, J.P., 997, Nuerical calculation of stable densities and distribution functions. Counications in Statistics Stochastic Models 3, Percy, E.R., Jr., 2006, Corrected LM goodness-of-fit tests with application to stock returns, < Perry, P.R., 983, More evidence on the nature of the distribution of security returns. Journal of Financial and Quantitative Analysis 8, Praetz, P.D., 972, The distribution of share price changes. Journal of Business 45, Rayner, J.C.W., and D.J. Best, 989, Sooth tests of goodness of fit. Oxford University Press, Oxford. Saorodnitsky, G., and M.S. Taqqu, 994, Stable non-gaussian rando processes: Stochastic processes with infinite variance. Chapan and Hall, New York. Wilkinson, J.H., 963, Rounding Errors in algebraic processes. Prentice-Hall, Englewood Cliffs, NJ. 25

28 TYPOGRAPHIC NOTES The JE ay wish to use a double stroke font such as LaTeX Bbold or Blackboard Bold for the expectations operator E in the equation following (), as well as on p. 9. This is unfortunately not available in MS Word Equation Editor. Likewise, the JE ay wish to use a cursive font such as Script MT Bold (I) for the inforation atrix I in the equation following (), as well as in (2), (3), and several places on p. 9. However, since the identity atrix is not used in this paper, no confusion arises fro using I. Again, this font is not available in MS Word Equation Editor. Every attept has been ade to use bold face for vectors and atrices. If there is any abiguity, please contact the authors. Vectors and atrices, when defined in ters of their eleents, are consistently defined using large parentheses, as in Sion and Blue, Matheatics for Econoists. A subscript defines the range of the subscripts, as requested by Referee. If the JE would prefer a different notation, such as square brackets as in Greene s Econoetric Analysis, the authors will be happy to work with the production editor on this. 26

Keywords: Estimator, Bias, Mean-squared error, normality, generalized Pareto distribution

Keywords: Estimator, Bias, Mean-squared error, normality, generalized Pareto distribution Testing approxiate norality of an estiator using the estiated MSE and bias with an application to the shape paraeter of the generalized Pareto distribution J. Martin van Zyl Abstract In this work the norality

More information

Testing equality of variances for multiple univariate normal populations

Testing equality of variances for multiple univariate normal populations University of Wollongong Research Online Centre for Statistical & Survey Methodology Working Paper Series Faculty of Engineering and Inforation Sciences 0 esting equality of variances for ultiple univariate

More information

ESTIMATING AND FORMING CONFIDENCE INTERVALS FOR EXTREMA OF RANDOM POLYNOMIALS. A Thesis. Presented to. The Faculty of the Department of Mathematics

ESTIMATING AND FORMING CONFIDENCE INTERVALS FOR EXTREMA OF RANDOM POLYNOMIALS. A Thesis. Presented to. The Faculty of the Department of Mathematics ESTIMATING AND FORMING CONFIDENCE INTERVALS FOR EXTREMA OF RANDOM POLYNOMIALS A Thesis Presented to The Faculty of the Departent of Matheatics San Jose State University In Partial Fulfillent of the Requireents

More information

Model Fitting. CURM Background Material, Fall 2014 Dr. Doreen De Leon

Model Fitting. CURM Background Material, Fall 2014 Dr. Doreen De Leon Model Fitting CURM Background Material, Fall 014 Dr. Doreen De Leon 1 Introduction Given a set of data points, we often want to fit a selected odel or type to the data (e.g., we suspect an exponential

More information

The proofs of Theorem 1-3 are along the lines of Wied and Galeano (2013).

The proofs of Theorem 1-3 are along the lines of Wied and Galeano (2013). A Appendix: Proofs The proofs of Theore 1-3 are along the lines of Wied and Galeano (2013) Proof of Theore 1 Let D[d 1, d 2 ] be the space of càdlàg functions on the interval [d 1, d 2 ] equipped with

More information

TEST OF HOMOGENEITY OF PARALLEL SAMPLES FROM LOGNORMAL POPULATIONS WITH UNEQUAL VARIANCES

TEST OF HOMOGENEITY OF PARALLEL SAMPLES FROM LOGNORMAL POPULATIONS WITH UNEQUAL VARIANCES TEST OF HOMOGENEITY OF PARALLEL SAMPLES FROM LOGNORMAL POPULATIONS WITH UNEQUAL VARIANCES S. E. Ahed, R. J. Tokins and A. I. Volodin Departent of Matheatics and Statistics University of Regina Regina,

More information

Biostatistics Department Technical Report

Biostatistics Department Technical Report Biostatistics Departent Technical Report BST006-00 Estiation of Prevalence by Pool Screening With Equal Sized Pools and a egative Binoial Sapling Model Charles R. Katholi, Ph.D. Eeritus Professor Departent

More information

Non-Parametric Non-Line-of-Sight Identification 1

Non-Parametric Non-Line-of-Sight Identification 1 Non-Paraetric Non-Line-of-Sight Identification Sinan Gezici, Hisashi Kobayashi and H. Vincent Poor Departent of Electrical Engineering School of Engineering and Applied Science Princeton University, Princeton,

More information

Extension of CSRSM for the Parametric Study of the Face Stability of Pressurized Tunnels

Extension of CSRSM for the Parametric Study of the Face Stability of Pressurized Tunnels Extension of CSRSM for the Paraetric Study of the Face Stability of Pressurized Tunnels Guilhe Mollon 1, Daniel Dias 2, and Abdul-Haid Soubra 3, M.ASCE 1 LGCIE, INSA Lyon, Université de Lyon, Doaine scientifique

More information

Using EM To Estimate A Probablity Density With A Mixture Of Gaussians

Using EM To Estimate A Probablity Density With A Mixture Of Gaussians Using EM To Estiate A Probablity Density With A Mixture Of Gaussians Aaron A. D Souza adsouza@usc.edu Introduction The proble we are trying to address in this note is siple. Given a set of data points

More information

COS 424: Interacting with Data. Written Exercises

COS 424: Interacting with Data. Written Exercises COS 424: Interacting with Data Hoework #4 Spring 2007 Regression Due: Wednesday, April 18 Written Exercises See the course website for iportant inforation about collaboration and late policies, as well

More information

DEPARTMENT OF ECONOMETRICS AND BUSINESS STATISTICS

DEPARTMENT OF ECONOMETRICS AND BUSINESS STATISTICS ISSN 1440-771X AUSTRALIA DEPARTMENT OF ECONOMETRICS AND BUSINESS STATISTICS An Iproved Method for Bandwidth Selection When Estiating ROC Curves Peter G Hall and Rob J Hyndan Working Paper 11/00 An iproved

More information

Probability Distributions

Probability Distributions Probability Distributions In Chapter, we ephasized the central role played by probability theory in the solution of pattern recognition probles. We turn now to an exploration of soe particular exaples

More information

OBJECTIVES INTRODUCTION

OBJECTIVES INTRODUCTION M7 Chapter 3 Section 1 OBJECTIVES Suarize data using easures of central tendency, such as the ean, edian, ode, and idrange. Describe data using the easures of variation, such as the range, variance, and

More information

Machine Learning Basics: Estimators, Bias and Variance

Machine Learning Basics: Estimators, Bias and Variance Machine Learning Basics: Estiators, Bias and Variance Sargur N. srihari@cedar.buffalo.edu This is part of lecture slides on Deep Learning: http://www.cedar.buffalo.edu/~srihari/cse676 1 Topics in Basics

More information

Modeling the Structural Shifts in Real Exchange Rate with Cubic Spline Regression (CSR). Turkey

Modeling the Structural Shifts in Real Exchange Rate with Cubic Spline Regression (CSR). Turkey International Journal of Business and Social Science Vol. 2 No. 17 www.ijbssnet.co Modeling the Structural Shifts in Real Exchange Rate with Cubic Spline Regression (CSR). Turkey 1987-2008 Dr. Bahar BERBEROĞLU

More information

PORTMANTEAU TESTS FOR ARMA MODELS WITH INFINITE VARIANCE

PORTMANTEAU TESTS FOR ARMA MODELS WITH INFINITE VARIANCE doi:10.1111/j.1467-9892.2007.00572.x PORTMANTEAU TESTS FOR ARMA MODELS WITH INFINITE VARIANCE By J.-W. Lin and A. I. McLeod The University of Western Ontario First Version received Septeber 2006 Abstract.

More information

A Note on the Applied Use of MDL Approximations

A Note on the Applied Use of MDL Approximations A Note on the Applied Use of MDL Approxiations Daniel J. Navarro Departent of Psychology Ohio State University Abstract An applied proble is discussed in which two nested psychological odels of retention

More information

A Jackknife Correction to a Test for Cointegration Rank

A Jackknife Correction to a Test for Cointegration Rank Econoetrics 205, 3, 355-375; doi:0.3390/econoetrics3020355 OPEN ACCESS econoetrics ISSN 2225-46 www.dpi.co/journal/econoetrics Article A Jackknife Correction to a Test for Cointegration Rank Marcus J.

More information

Support Vector Machine Classification of Uncertain and Imbalanced data using Robust Optimization

Support Vector Machine Classification of Uncertain and Imbalanced data using Robust Optimization Recent Researches in Coputer Science Support Vector Machine Classification of Uncertain and Ibalanced data using Robust Optiization RAGHAV PAT, THEODORE B. TRAFALIS, KASH BARKER School of Industrial Engineering

More information

A Simple Regression Problem

A Simple Regression Problem A Siple Regression Proble R. M. Castro March 23, 2 In this brief note a siple regression proble will be introduced, illustrating clearly the bias-variance tradeoff. Let Y i f(x i ) + W i, i,..., n, where

More information

Symbolic Analysis as Universal Tool for Deriving Properties of Non-linear Algorithms Case study of EM Algorithm

Symbolic Analysis as Universal Tool for Deriving Properties of Non-linear Algorithms Case study of EM Algorithm Acta Polytechnica Hungarica Vol., No., 04 Sybolic Analysis as Universal Tool for Deriving Properties of Non-linear Algoriths Case study of EM Algorith Vladiir Mladenović, Miroslav Lutovac, Dana Porrat

More information

Ph 20.3 Numerical Solution of Ordinary Differential Equations

Ph 20.3 Numerical Solution of Ordinary Differential Equations Ph 20.3 Nuerical Solution of Ordinary Differential Equations Due: Week 5 -v20170314- This Assignent So far, your assignents have tried to failiarize you with the hardware and software in the Physics Coputing

More information

Meta-Analytic Interval Estimation for Bivariate Correlations

Meta-Analytic Interval Estimation for Bivariate Correlations Psychological Methods 2008, Vol. 13, No. 3, 173 181 Copyright 2008 by the Aerican Psychological Association 1082-989X/08/$12.00 DOI: 10.1037/a0012868 Meta-Analytic Interval Estiation for Bivariate Correlations

More information

Analyzing Simulation Results

Analyzing Simulation Results Analyzing Siulation Results Dr. John Mellor-Cruey Departent of Coputer Science Rice University johnc@cs.rice.edu COMP 528 Lecture 20 31 March 2005 Topics for Today Model verification Model validation Transient

More information

Polygonal Designs: Existence and Construction

Polygonal Designs: Existence and Construction Polygonal Designs: Existence and Construction John Hegean Departent of Matheatics, Stanford University, Stanford, CA 9405 Jeff Langford Departent of Matheatics, Drake University, Des Moines, IA 5011 G

More information

Block designs and statistics

Block designs and statistics Bloc designs and statistics Notes for Math 447 May 3, 2011 The ain paraeters of a bloc design are nuber of varieties v, bloc size, nuber of blocs b. A design is built on a set of v eleents. Each eleent

More information

E0 370 Statistical Learning Theory Lecture 6 (Aug 30, 2011) Margin Analysis

E0 370 Statistical Learning Theory Lecture 6 (Aug 30, 2011) Margin Analysis E0 370 tatistical Learning Theory Lecture 6 (Aug 30, 20) Margin Analysis Lecturer: hivani Agarwal cribe: Narasihan R Introduction In the last few lectures we have seen how to obtain high confidence bounds

More information

AN OPTIMAL SHRINKAGE FACTOR IN PREDICTION OF ORDERED RANDOM EFFECTS

AN OPTIMAL SHRINKAGE FACTOR IN PREDICTION OF ORDERED RANDOM EFFECTS Statistica Sinica 6 016, 1709-178 doi:http://dx.doi.org/10.5705/ss.0014.0034 AN OPTIMAL SHRINKAGE FACTOR IN PREDICTION OF ORDERED RANDOM EFFECTS Nilabja Guha 1, Anindya Roy, Yaakov Malinovsky and Gauri

More information

Physically Based Modeling CS Notes Spring 1997 Particle Collision and Contact

Physically Based Modeling CS Notes Spring 1997 Particle Collision and Contact Physically Based Modeling CS 15-863 Notes Spring 1997 Particle Collision and Contact 1 Collisions with Springs Suppose we wanted to ipleent a particle siulator with a floor : a solid horizontal plane which

More information

Computable Shell Decomposition Bounds

Computable Shell Decomposition Bounds Coputable Shell Decoposition Bounds John Langford TTI-Chicago jcl@cs.cu.edu David McAllester TTI-Chicago dac@autoreason.co Editor: Leslie Pack Kaelbling and David Cohn Abstract Haussler, Kearns, Seung

More information

lecture 36: Linear Multistep Mehods: Zero Stability

lecture 36: Linear Multistep Mehods: Zero Stability 95 lecture 36: Linear Multistep Mehods: Zero Stability 5.6 Linear ultistep ethods: zero stability Does consistency iply convergence for linear ultistep ethods? This is always the case for one-step ethods,

More information

Soft Computing Techniques Help Assign Weights to Different Factors in Vulnerability Analysis

Soft Computing Techniques Help Assign Weights to Different Factors in Vulnerability Analysis Soft Coputing Techniques Help Assign Weights to Different Factors in Vulnerability Analysis Beverly Rivera 1,2, Irbis Gallegos 1, and Vladik Kreinovich 2 1 Regional Cyber and Energy Security Center RCES

More information

Smooth Test for Testing Equality of Two Densities 1

Smooth Test for Testing Equality of Two Densities 1 Sooth Test for Testing Equality of Two Densities Anil K. Bera University of Illinois. E-ail:abera@uiuc.edu. Phone: 7-333-4596. Aurobindo Ghosh Singapore Manageent University. E-ail:aurobindo@su.edu.sg.

More information

Feature Extraction Techniques

Feature Extraction Techniques Feature Extraction Techniques Unsupervised Learning II Feature Extraction Unsupervised ethods can also be used to find features which can be useful for categorization. There are unsupervised ethods that

More information

13.2 Fully Polynomial Randomized Approximation Scheme for Permanent of Random 0-1 Matrices

13.2 Fully Polynomial Randomized Approximation Scheme for Permanent of Random 0-1 Matrices CS71 Randoness & Coputation Spring 018 Instructor: Alistair Sinclair Lecture 13: February 7 Disclaier: These notes have not been subjected to the usual scrutiny accorded to foral publications. They ay

More information

This model assumes that the probability of a gap has size i is proportional to 1/i. i.e., i log m e. j=1. E[gap size] = i P r(i) = N f t.

This model assumes that the probability of a gap has size i is proportional to 1/i. i.e., i log m e. j=1. E[gap size] = i P r(i) = N f t. CS 493: Algoriths for Massive Data Sets Feb 2, 2002 Local Models, Bloo Filter Scribe: Qin Lv Local Models In global odels, every inverted file entry is copressed with the sae odel. This work wells when

More information

GEE ESTIMATORS IN MIXTURE MODEL WITH VARYING CONCENTRATIONS

GEE ESTIMATORS IN MIXTURE MODEL WITH VARYING CONCENTRATIONS ACTA UIVERSITATIS LODZIESIS FOLIA OECOOMICA 3(3142015 http://dx.doi.org/10.18778/0208-6018.314.03 Olesii Doronin *, Rostislav Maiboroda ** GEE ESTIMATORS I MIXTURE MODEL WITH VARYIG COCETRATIOS Abstract.

More information

Ufuk Demirci* and Feza Kerestecioglu**

Ufuk Demirci* and Feza Kerestecioglu** 1 INDIRECT ADAPTIVE CONTROL OF MISSILES Ufuk Deirci* and Feza Kerestecioglu** *Turkish Navy Guided Missile Test Station, Beykoz, Istanbul, TURKEY **Departent of Electrical and Electronics Engineering,

More information

In this chapter, we consider several graph-theoretic and probabilistic models

In this chapter, we consider several graph-theoretic and probabilistic models THREE ONE GRAPH-THEORETIC AND STATISTICAL MODELS 3.1 INTRODUCTION In this chapter, we consider several graph-theoretic and probabilistic odels for a social network, which we do under different assuptions

More information

Proc. of the IEEE/OES Seventh Working Conference on Current Measurement Technology UNCERTAINTIES IN SEASONDE CURRENT VELOCITIES

Proc. of the IEEE/OES Seventh Working Conference on Current Measurement Technology UNCERTAINTIES IN SEASONDE CURRENT VELOCITIES Proc. of the IEEE/OES Seventh Working Conference on Current Measureent Technology UNCERTAINTIES IN SEASONDE CURRENT VELOCITIES Belinda Lipa Codar Ocean Sensors 15 La Sandra Way, Portola Valley, CA 98 blipa@pogo.co

More information

Estimation of the Mean of the Exponential Distribution Using Maximum Ranked Set Sampling with Unequal Samples

Estimation of the Mean of the Exponential Distribution Using Maximum Ranked Set Sampling with Unequal Samples Open Journal of Statistics, 4, 4, 64-649 Published Online Septeber 4 in SciRes http//wwwscirporg/ournal/os http//ddoiorg/436/os4486 Estiation of the Mean of the Eponential Distribution Using Maiu Ranked

More information

This article appeared in a journal published by Elsevier. The attached copy is furnished to the author for internal non-commercial research and

This article appeared in a journal published by Elsevier. The attached copy is furnished to the author for internal non-commercial research and This article appeared in a ournal published by Elsevier. The attached copy is furnished to the author for internal non-coercial research and education use, including for instruction at the authors institution

More information

Computable Shell Decomposition Bounds

Computable Shell Decomposition Bounds Journal of Machine Learning Research 5 (2004) 529-547 Subitted 1/03; Revised 8/03; Published 5/04 Coputable Shell Decoposition Bounds John Langford David McAllester Toyota Technology Institute at Chicago

More information

Experimental Design For Model Discrimination And Precise Parameter Estimation In WDS Analysis

Experimental Design For Model Discrimination And Precise Parameter Estimation In WDS Analysis City University of New York (CUNY) CUNY Acadeic Works International Conference on Hydroinforatics 8-1-2014 Experiental Design For Model Discriination And Precise Paraeter Estiation In WDS Analysis Giovanna

More information

Pattern Recognition and Machine Learning. Artificial Neural networks

Pattern Recognition and Machine Learning. Artificial Neural networks Pattern Recognition and Machine Learning Jaes L. Crowley ENSIMAG 3 - MMIS Fall Seester 2017 Lessons 7 20 Dec 2017 Outline Artificial Neural networks Notation...2 Introduction...3 Key Equations... 3 Artificial

More information

An Approximate Model for the Theoretical Prediction of the Velocity Increase in the Intermediate Ballistics Period

An Approximate Model for the Theoretical Prediction of the Velocity Increase in the Intermediate Ballistics Period An Approxiate Model for the Theoretical Prediction of the Velocity... 77 Central European Journal of Energetic Materials, 205, 2(), 77-88 ISSN 2353-843 An Approxiate Model for the Theoretical Prediction

More information

The Distribution of the Covariance Matrix for a Subset of Elliptical Distributions with Extension to Two Kurtosis Parameters

The Distribution of the Covariance Matrix for a Subset of Elliptical Distributions with Extension to Two Kurtosis Parameters journal of ultivariate analysis 58, 96106 (1996) article no. 0041 The Distribution of the Covariance Matrix for a Subset of Elliptical Distributions with Extension to Two Kurtosis Paraeters H. S. Steyn

More information

Ensemble Based on Data Envelopment Analysis

Ensemble Based on Data Envelopment Analysis Enseble Based on Data Envelopent Analysis So Young Sohn & Hong Choi Departent of Coputer Science & Industrial Systes Engineering, Yonsei University, Seoul, Korea Tel) 82-2-223-404, Fax) 82-2- 364-7807

More information

e-companion ONLY AVAILABLE IN ELECTRONIC FORM

e-companion ONLY AVAILABLE IN ELECTRONIC FORM OPERATIONS RESEARCH doi 10.1287/opre.1070.0427ec pp. ec1 ec5 e-copanion ONLY AVAILABLE IN ELECTRONIC FORM infors 07 INFORMS Electronic Copanion A Learning Approach for Interactive Marketing to a Custoer

More information

Nonlinear Log-Periodogram Regression for Perturbed Fractional Processes

Nonlinear Log-Periodogram Regression for Perturbed Fractional Processes Nonlinear Log-Periodogra Regression for Perturbed Fractional Processes Yixiao Sun Departent of Econoics Yale University Peter C. B. Phillips Cowles Foundation for Research in Econoics Yale University First

More information

Kernel-Based Nonparametric Anomaly Detection

Kernel-Based Nonparametric Anomaly Detection Kernel-Based Nonparaetric Anoaly Detection Shaofeng Zou Dept of EECS Syracuse University Eail: szou@syr.edu Yingbin Liang Dept of EECS Syracuse University Eail: yliang6@syr.edu H. Vincent Poor Dept of

More information

Sharp Time Data Tradeoffs for Linear Inverse Problems

Sharp Time Data Tradeoffs for Linear Inverse Problems Sharp Tie Data Tradeoffs for Linear Inverse Probles Saet Oyak Benjain Recht Mahdi Soltanolkotabi January 016 Abstract In this paper we characterize sharp tie-data tradeoffs for optiization probles used

More information

Solving initial value problems by residual power series method

Solving initial value problems by residual power series method Theoretical Matheatics & Applications, vol.3, no.1, 13, 199-1 ISSN: 179-9687 (print), 179-979 (online) Scienpress Ltd, 13 Solving initial value probles by residual power series ethod Mohaed H. Al-Sadi

More information

a a a a a a a m a b a b

a a a a a a a m a b a b Algebra / Trig Final Exa Study Guide (Fall Seester) Moncada/Dunphy Inforation About the Final Exa The final exa is cuulative, covering Appendix A (A.1-A.5) and Chapter 1. All probles will be ultiple choice

More information

Intelligent Systems: Reasoning and Recognition. Perceptrons and Support Vector Machines

Intelligent Systems: Reasoning and Recognition. Perceptrons and Support Vector Machines Intelligent Systes: Reasoning and Recognition Jaes L. Crowley osig 1 Winter Seester 2018 Lesson 6 27 February 2018 Outline Perceptrons and Support Vector achines Notation...2 Linear odels...3 Lines, Planes

More information

Fast Montgomery-like Square Root Computation over GF(2 m ) for All Trinomials

Fast Montgomery-like Square Root Computation over GF(2 m ) for All Trinomials Fast Montgoery-like Square Root Coputation over GF( ) for All Trinoials Yin Li a, Yu Zhang a, a Departent of Coputer Science and Technology, Xinyang Noral University, Henan, P.R.China Abstract This letter

More information

Research in Area of Longevity of Sylphon Scraies

Research in Area of Longevity of Sylphon Scraies IOP Conference Series: Earth and Environental Science PAPER OPEN ACCESS Research in Area of Longevity of Sylphon Scraies To cite this article: Natalia Y Golovina and Svetlana Y Krivosheeva 2018 IOP Conf.

More information

Bootstrapping Dependent Data

Bootstrapping Dependent Data Bootstrapping Dependent Data One of the key issues confronting bootstrap resapling approxiations is how to deal with dependent data. Consider a sequence fx t g n t= of dependent rando variables. Clearly

More information

Using a De-Convolution Window for Operating Modal Analysis

Using a De-Convolution Window for Operating Modal Analysis Using a De-Convolution Window for Operating Modal Analysis Brian Schwarz Vibrant Technology, Inc. Scotts Valley, CA Mark Richardson Vibrant Technology, Inc. Scotts Valley, CA Abstract Operating Modal Analysis

More information

Deflation of the I-O Series Some Technical Aspects. Giorgio Rampa University of Genoa April 2007

Deflation of the I-O Series Some Technical Aspects. Giorgio Rampa University of Genoa April 2007 Deflation of the I-O Series 1959-2. Soe Technical Aspects Giorgio Rapa University of Genoa g.rapa@unige.it April 27 1. Introduction The nuber of sectors is 42 for the period 1965-2 and 38 for the initial

More information

A Poisson process reparameterisation for Bayesian inference for extremes

A Poisson process reparameterisation for Bayesian inference for extremes A Poisson process reparaeterisation for Bayesian inference for extrees Paul Sharkey and Jonathan A. Tawn Eail: p.sharkey1@lancs.ac.uk, j.tawn@lancs.ac.uk STOR-i Centre for Doctoral Training, Departent

More information

arxiv: v1 [cs.ds] 3 Feb 2014

arxiv: v1 [cs.ds] 3 Feb 2014 arxiv:40.043v [cs.ds] 3 Feb 04 A Bound on the Expected Optiality of Rando Feasible Solutions to Cobinatorial Optiization Probles Evan A. Sultani The Johns Hopins University APL evan@sultani.co http://www.sultani.co/

More information

Research Article Rapidly-Converging Series Representations of a Mutual-Information Integral

Research Article Rapidly-Converging Series Representations of a Mutual-Information Integral International Scholarly Research Network ISRN Counications and Networking Volue 11, Article ID 5465, 6 pages doi:1.54/11/5465 Research Article Rapidly-Converging Series Representations of a Mutual-Inforation

More information

International Journal of Pure and Applied Mathematics Volume 37 No , IMPROVED DATA DRIVEN CONTROL CHARTS

International Journal of Pure and Applied Mathematics Volume 37 No , IMPROVED DATA DRIVEN CONTROL CHARTS International Journal of Pure and Applied Matheatics Volue 37 No. 3 2007, 423-438 IMPROVED DATA DRIVEN CONTROL CHARTS Wille Albers 1, Wilbert C.M. Kallenberg 2 1,2 Departent of Applied Matheatics Faculty

More information

On Conditions for Linearity of Optimal Estimation

On Conditions for Linearity of Optimal Estimation On Conditions for Linearity of Optial Estiation Erah Akyol, Kuar Viswanatha and Kenneth Rose {eakyol, kuar, rose}@ece.ucsb.edu Departent of Electrical and Coputer Engineering University of California at

More information

An Introduction to Meta-Analysis

An Introduction to Meta-Analysis An Introduction to Meta-Analysis Douglas G. Bonett University of California, Santa Cruz How to cite this work: Bonett, D.G. (2016) An Introduction to Meta-analysis. Retrieved fro http://people.ucsc.edu/~dgbonett/eta.htl

More information

Interactive Markov Models of Evolutionary Algorithms

Interactive Markov Models of Evolutionary Algorithms Cleveland State University EngagedScholarship@CSU Electrical Engineering & Coputer Science Faculty Publications Electrical Engineering & Coputer Science Departent 2015 Interactive Markov Models of Evolutionary

More information

Online Publication Date: 19 April 2012 Publisher: Asian Economic and Social Society

Online Publication Date: 19 April 2012 Publisher: Asian Economic and Social Society Online Publication Date: April Publisher: Asian Econoic and Social Society Using Entropy Working Correlation Matrix in Generalized Estiating Equation for Stock Price Change Model Serpilılıç (Faculty of

More information

Testing the lag length of vector autoregressive models: A power comparison between portmanteau and Lagrange multiplier tests

Testing the lag length of vector autoregressive models: A power comparison between portmanteau and Lagrange multiplier tests Working Papers 2017-03 Testing the lag length of vector autoregressive odels: A power coparison between portanteau and Lagrange ultiplier tests Raja Ben Hajria National Engineering School, University of

More information

A note on the multiplication of sparse matrices

A note on the multiplication of sparse matrices Cent. Eur. J. Cop. Sci. 41) 2014 1-11 DOI: 10.2478/s13537-014-0201-x Central European Journal of Coputer Science A note on the ultiplication of sparse atrices Research Article Keivan Borna 12, Sohrab Aboozarkhani

More information

1 Generalization bounds based on Rademacher complexity

1 Generalization bounds based on Rademacher complexity COS 5: Theoretical Machine Learning Lecturer: Rob Schapire Lecture #0 Scribe: Suqi Liu March 07, 08 Last tie we started proving this very general result about how quickly the epirical average converges

More information

The Weierstrass Approximation Theorem

The Weierstrass Approximation Theorem 36 The Weierstrass Approxiation Theore Recall that the fundaental idea underlying the construction of the real nubers is approxiation by the sipler rational nubers. Firstly, nubers are often deterined

More information

2 Q 10. Likewise, in case of multiple particles, the corresponding density in 2 must be averaged over all

2 Q 10. Likewise, in case of multiple particles, the corresponding density in 2 must be averaged over all Lecture 6 Introduction to kinetic theory of plasa waves Introduction to kinetic theory So far we have been odeling plasa dynaics using fluid equations. The assuption has been that the pressure can be either

More information

Constructing Locally Best Invariant Tests of the Linear Regression Model Using the Density Function of a Maximal Invariant

Constructing Locally Best Invariant Tests of the Linear Regression Model Using the Density Function of a Maximal Invariant Aerican Journal of Matheatics and Statistics 03, 3(): 45-5 DOI: 0.593/j.ajs.03030.07 Constructing Locally Best Invariant Tests of the Linear Regression Model Using the Density Function of a Maxial Invariant

More information

C na (1) a=l. c = CO + Clm + CZ TWO-STAGE SAMPLE DESIGN WITH SMALL CLUSTERS. 1. Introduction

C na (1) a=l. c = CO + Clm + CZ TWO-STAGE SAMPLE DESIGN WITH SMALL CLUSTERS. 1. Introduction TWO-STGE SMPLE DESIGN WITH SMLL CLUSTERS Robert G. Clark and David G. Steel School of Matheatics and pplied Statistics, University of Wollongong, NSW 5 ustralia. (robert.clark@abs.gov.au) Key Words: saple

More information

Spine Fin Efficiency A Three Sided Pyramidal Fin of Equilateral Triangular Cross-Sectional Area

Spine Fin Efficiency A Three Sided Pyramidal Fin of Equilateral Triangular Cross-Sectional Area Proceedings of the 006 WSEAS/IASME International Conference on Heat and Mass Transfer, Miai, Florida, USA, January 18-0, 006 (pp13-18) Spine Fin Efficiency A Three Sided Pyraidal Fin of Equilateral Triangular

More information

Data-Driven Imaging in Anisotropic Media

Data-Driven Imaging in Anisotropic Media 18 th World Conference on Non destructive Testing, 16- April 1, Durban, South Africa Data-Driven Iaging in Anisotropic Media Arno VOLKER 1 and Alan HUNTER 1 TNO Stieltjesweg 1, 6 AD, Delft, The Netherlands

More information

Chapter 6: Economic Inequality

Chapter 6: Economic Inequality Chapter 6: Econoic Inequality We are interested in inequality ainly for two reasons: First, there are philosophical and ethical grounds for aversion to inequality per se. Second, even if we are not interested

More information

AN EFFICIENT CLASS OF CHAIN ESTIMATORS OF POPULATION VARIANCE UNDER SUB-SAMPLING SCHEME

AN EFFICIENT CLASS OF CHAIN ESTIMATORS OF POPULATION VARIANCE UNDER SUB-SAMPLING SCHEME J. Japan Statist. Soc. Vol. 35 No. 005 73 86 AN EFFICIENT CLASS OF CHAIN ESTIMATORS OF POPULATION VARIANCE UNDER SUB-SAMPLING SCHEME H. S. Jhajj*, M. K. Shara* and Lovleen Kuar Grover** For estiating the

More information

Chapter 6 1-D Continuous Groups

Chapter 6 1-D Continuous Groups Chapter 6 1-D Continuous Groups Continuous groups consist of group eleents labelled by one or ore continuous variables, say a 1, a 2,, a r, where each variable has a well- defined range. This chapter explores:

More information

ANALYTICAL INVESTIGATION AND PARAMETRIC STUDY OF LATERAL IMPACT BEHAVIOR OF PRESSURIZED PIPELINES AND INFLUENCE OF INTERNAL PRESSURE

ANALYTICAL INVESTIGATION AND PARAMETRIC STUDY OF LATERAL IMPACT BEHAVIOR OF PRESSURIZED PIPELINES AND INFLUENCE OF INTERNAL PRESSURE DRAFT Proceedings of the ASME 014 International Mechanical Engineering Congress & Exposition IMECE014 Noveber 14-0, 014, Montreal, Quebec, Canada IMECE014-36371 ANALYTICAL INVESTIGATION AND PARAMETRIC

More information

Lower Bounds for Quantized Matrix Completion

Lower Bounds for Quantized Matrix Completion Lower Bounds for Quantized Matrix Copletion Mary Wootters and Yaniv Plan Departent of Matheatics University of Michigan Ann Arbor, MI Eail: wootters, yplan}@uich.edu Mark A. Davenport School of Elec. &

More information

Covariance-based orthogonality tests for regressors with unknown persistence

Covariance-based orthogonality tests for regressors with unknown persistence Covariance-based orthogonality tests for regressors with unknown persistence Alex Maynard and Katsui Shiotsu Departent of Econoics, University of Toronto and Departent of Econoics, Queen s University April

More information

About the definition of parameters and regimes of active two-port networks with variable loads on the basis of projective geometry

About the definition of parameters and regimes of active two-port networks with variable loads on the basis of projective geometry About the definition of paraeters and regies of active two-port networks with variable loads on the basis of projective geoetry PENN ALEXANDR nstitute of Electronic Engineering and Nanotechnologies "D

More information

DESIGN OF THE DIE PROFILE FOR THE INCREMENTAL RADIAL FORGING PROCESS *

DESIGN OF THE DIE PROFILE FOR THE INCREMENTAL RADIAL FORGING PROCESS * IJST, Transactions of Mechanical Engineering, Vol. 39, No. M1, pp 89-100 Printed in The Islaic Republic of Iran, 2015 Shira University DESIGN OF THE DIE PROFILE FOR THE INCREMENTAL RADIAL FORGING PROCESS

More information

A Simplified Analytical Approach for Efficiency Evaluation of the Weaving Machines with Automatic Filling Repair

A Simplified Analytical Approach for Efficiency Evaluation of the Weaving Machines with Automatic Filling Repair Proceedings of the 6th SEAS International Conference on Siulation, Modelling and Optiization, Lisbon, Portugal, Septeber -4, 006 0 A Siplified Analytical Approach for Efficiency Evaluation of the eaving

More information

A method to determine relative stroke detection efficiencies from multiplicity distributions

A method to determine relative stroke detection efficiencies from multiplicity distributions A ethod to deterine relative stroke detection eiciencies ro ultiplicity distributions Schulz W. and Cuins K. 2. Austrian Lightning Detection and Inoration Syste (ALDIS), Kahlenberger Str.2A, 90 Vienna,

More information

Some Proofs: This section provides proofs of some theoretical results in section 3.

Some Proofs: This section provides proofs of some theoretical results in section 3. Testing Jups via False Discovery Rate Control Yu-Min Yen. Institute of Econoics, Acadeia Sinica, Taipei, Taiwan. E-ail: YMYEN@econ.sinica.edu.tw. SUPPLEMENTARY MATERIALS Suppleentary Materials contain

More information

IN modern society that various systems have become more

IN modern society that various systems have become more Developent of Reliability Function in -Coponent Standby Redundant Syste with Priority Based on Maxiu Entropy Principle Ryosuke Hirata, Ikuo Arizono, Ryosuke Toohiro, Satoshi Oigawa, and Yasuhiko Takeoto

More information

Topic 5a Introduction to Curve Fitting & Linear Regression

Topic 5a Introduction to Curve Fitting & Linear Regression /7/08 Course Instructor Dr. Rayond C. Rup Oice: A 337 Phone: (95) 747 6958 E ail: rcrup@utep.edu opic 5a Introduction to Curve Fitting & Linear Regression EE 4386/530 Coputational ethods in EE Outline

More information

Randomized Accuracy-Aware Program Transformations For Efficient Approximate Computations

Randomized Accuracy-Aware Program Transformations For Efficient Approximate Computations Randoized Accuracy-Aware Progra Transforations For Efficient Approxiate Coputations Zeyuan Allen Zhu Sasa Misailovic Jonathan A. Kelner Martin Rinard MIT CSAIL zeyuan@csail.it.edu isailo@it.edu kelner@it.edu

More information

The Wilson Model of Cortical Neurons Richard B. Wells

The Wilson Model of Cortical Neurons Richard B. Wells The Wilson Model of Cortical Neurons Richard B. Wells I. Refineents on the odgkin-uxley Model The years since odgkin s and uxley s pioneering work have produced a nuber of derivative odgkin-uxley-like

More information

RAFIA(MBA) TUTOR S UPLOADED FILE Course STA301: Statistics and Probability Lecture No 1 to 5

RAFIA(MBA) TUTOR S UPLOADED FILE Course STA301: Statistics and Probability Lecture No 1 to 5 Course STA0: Statistics and Probability Lecture No to 5 Multiple Choice Questions:. Statistics deals with: a) Observations b) Aggregates of facts*** c) Individuals d) Isolated ites. A nuber of students

More information

Synchronization in large directed networks of coupled phase oscillators

Synchronization in large directed networks of coupled phase oscillators CHAOS 16, 015107 2005 Synchronization in large directed networks of coupled phase oscillators Juan G. Restrepo a Institute for Research in Electronics and Applied Physics, University of Maryland, College

More information

CS Lecture 13. More Maximum Likelihood

CS Lecture 13. More Maximum Likelihood CS 6347 Lecture 13 More Maxiu Likelihood Recap Last tie: Introduction to axiu likelihood estiation MLE for Bayesian networks Optial CPTs correspond to epirical counts Today: MLE for CRFs 2 Maxiu Likelihood

More information

Best Procedures For Sample-Free Item Analysis

Best Procedures For Sample-Free Item Analysis Best Procedures For Saple-Free Ite Analysis Benjain D. Wright University of Chicago Graha A. Douglas University of Western Australia Wright s (1969) widely used "unconditional" procedure for Rasch saple-free

More information

When Short Runs Beat Long Runs

When Short Runs Beat Long Runs When Short Runs Beat Long Runs Sean Luke George Mason University http://www.cs.gu.edu/ sean/ Abstract What will yield the best results: doing one run n generations long or doing runs n/ generations long

More information

Statistical Logic Cell Delay Analysis Using a Current-based Model

Statistical Logic Cell Delay Analysis Using a Current-based Model Statistical Logic Cell Delay Analysis Using a Current-based Model Hanif Fatei Shahin Nazarian Massoud Pedra Dept. of EE-Systes, University of Southern California, Los Angeles, CA 90089 {fatei, shahin,

More information

Course Notes for EE227C (Spring 2018): Convex Optimization and Approximation

Course Notes for EE227C (Spring 2018): Convex Optimization and Approximation Course Notes for EE227C (Spring 2018): Convex Optiization and Approxiation Instructor: Moritz Hardt Eail: hardt+ee227c@berkeley.edu Graduate Instructor: Max Sichowitz Eail: sichow+ee227c@berkeley.edu October

More information