Using copulas to model time dependence in stochastic frontier models

Size: px
Start display at page:

Download "Using copulas to model time dependence in stochastic frontier models"

Transcription

1 Using copulas to model time dependence in stochastic frontier models Christine Amsler Michigan State University Artem Prokhorov Concordia University May 2011 Peter Schmidt Michigan State University Abstract We consider stochastic frontier models in the setting where there is dependence between cross sections. This setting is typical for panel data. Current methods of modeling time dependence in this setting are either unduly restrictive or computationally infeasible. Some impose restrictive assumptions on the nature of dependence such as the scaling property. Others involve T - dimensional integration, where T is the number of cross-sections, which may be large. Moreover, no known multivariate distribution has the property of having the commonly used, convenient marginals such as normal/half-normal. We show how to use copulas to resolve these issues. The range of dependence we allow for is unrestricted and the computational task involved is easy compared to the alternatives. Also, the resulting estimators are relatively more efficient. We propose two alternative specifications. One applies a copula function to the distribution of the composed error term. This permits the use of FMLE and GMM. The other applies a copula to the distribution of the one-sided error term. This allow for a simulated FMLE and a semiparametric estimation of inefficiencies. A well-known application demonstrates usefulness of the two specifications. JEL Classification: C13 Keywords: Copula, Stochastic Frontier, MLE, GMM, MSLE, MSM Helpful comments from participants of the Canadian Econometrics Study Group Meeting in Montreal, the International Conference on Panel Data in Bonn and the XI European Workshop on Efficiency and Productivity Analysis in Pisa as well as seminar participants at University of New South Wales and Ewha University are gratefully acknowledged.

2 1 Introduction We consider the stochastic frontier model of Aigner et al. (1977) and Meeusen and van Den Broeck (1977): y it = x itβ + v it u it (1) In this model, i denotes individual firms, countries, production units, etc, and t denotes time (when there is no ambiguity we will omit the subscripts). We assume that v it s are independent over i and t and distributed as N(0, σ 2 v). We also assume that the distribution of u it is N(µ it, σ 2 it )+, i.e. it is obtained by truncation to the left at zero of the normal distribution with mean µ it and variance σ 2 it. This distribution is known as half-normal. The distribution parameters (µ it, σit 2 ) may be constant or they may depend on explanatory variables z it. For simplicity, we will assume in what follows that µ it = 0 and σ it = σ u, for any i and t, so that u it N(0, σ 2 u) +. The truncated error term u it represents technical inefficiency. Typically we assume it is independent over i but independence over t is not an attractive assumption. One would expect that inefficiency correlates positively over time firms that are relatively inefficient in one time period will probably also be inefficient in other time periods. There are basically four approaches currently available in the literature to model dependence in inefficiency over t. First, there is the QMLE approach which basically assumes independence of u it over t (see, e.g., Battese and Coelli, 1995). A variation of this approach is to assume that u it s are independent conditional on some explanatory variables z it (see, e.g., Wang, 2002). Second, one may assume that u it is time invariant (see, e.g., Pitt and Lee, 1981). Then, u it = u i for all t. Third, one may assume a multivariate truncated distribution for (u i1,..., u it ), e.g., multivariate truncated Normal (see, e.g., Pitt and Lee, 1981). Then, the likelihood becomes a highly complicated function with T -dimensional integrals. Moreover, Horrace (2005) showed that such a joint distribution function will not have truncated normal marginals. So if we wish to have normal/halfnormal marginals of the composed error but follow this approach to allow for dependence between them over t, it will not provide the right marginals. 2

3 Finally, one may assume scaling. The scaling property assumes that u it has the following multiplicative representation: u it = h(z it ; δ)u it, where u it is a random variable, h(z it; δ) is a known function, δ is a vector of unknown parameters and z it contains variables that affect technical inefficiency. The variables z it may include functions of inputs x it or various characteristics of the environment, in which the firm operates. There are many versions of the stochastic frontier model that satisfy the scaling property (see Alvarez et al., 2006, for a survey of such models). In some versions, u it is assumed to be time invariant and z it includes a time trend (see, e.g., Battese and Coelli, 1992; Kumbhakar, 1990). Other models assume u it is distributed as N(0, σ 2 it )+, where σ 2 it depends on a set of relevant variables z it (e.g., Caudill et al., 1995). But the key feature of the assumption is that u it does not depend on z it, and so changes in z it only affect the scale of the distribution of u it (through the nonrandom function h(z it, δ)) but not its shape (determined by the distribution of u it ). The four approaches discussed above have major drawbacks. The scaling assumption that firms with different z it do not differ in the shape of the distribution of technical inefficiency (differ only in the scale) is hardly realistic. The assumptions that inefficiencies are not correlated over time or that they are time invariant are even less realistic. Moreover, even if the scaling property holds there is still the question of how to allow for dependence in u it. Unless u it s are independent or time invariant, we are faced with the same difficulty as with u it s. There is no tractable joint distribution with convenient marginals. In this paper we develop a tractable method of allowing for dependence without scaling, independence or time-invariance. We use copulas to do that. A distinctive feature of copulas is that they allow to model marginal distributions separately from their dependence structure. As a result we have a flexible joint distribution function, whose marginals are specified by the researcher. The joint distribution can accommodate any degree of dependence. No simplifying assumptions such as scaling or time-invariance are used in constructing the likelihood. Meanwhile, the task of maximizing the joint likelihood remains simple because it involves no T -dimensional integration. The plan of the paper is as follows. Section 2 reviews the traditional QMLE and introduces a score-based GMM estimator. In Section 3, we describe how to use copulas to allow for dependence over time. We use the GMM to motivate two new estimators that arise in this setting. Section 3

4 4.2 addresses the computational issues associated with the new copula-based estimators, in which time dependence is unrestricted. Section 4.4 discusses a test for copula validity, especially relevant for models with given marginals. Section 5.2 proposes a copula based method for estimating inefficiencies. Section 6 provides an empirical illustration of the new copula-based estimators proposed in the paper. Section 7 concludes. 2 Quasi-MLE and score-based GMM Let f v (v) and f u (u) denote the densities of v and u, respectively. Then, f v (v) = 1 e v 2πσv 2 2σ 2 v f u (u) = 2 e u πσu 2 2σ 2 u (2) and the density of the composed error ε = v u can be obtained by convolution as follows f(ɛ) = f v (ɛ + u)f u (u)du (3) = 0 ( ɛ ) ( σ φ Φ ɛλ ), σ σ (4) where σ 2 = σ 2 v + σ 2 u, λ = σu σ v, and φ and Φ are standard normal density and distribution functions, respectively (see, e.g., Aigner et al., 1977; Greene, 2003, p.502). The QMLE is based on this marginal (single t) density of the composed error. Let θ = (β, σ 2, λ) and let f it denote the density of the composed error term evaluated at ε it = v it u it. Then, f it (θ) = f(v it u it ) = f(y it x itβ) (5) The QMLE maximizes the sample likelihood obtained under the assumption of independence over i and t. That is, ˆθ QMLE = arg max θ ln f it (θ) (6) It is well known that, under suitable regularity conditions, the QMLE is consistent even if there is no independence over t. The sandwich estimator of the QMLE asymptotic variance matrix is used to obtain the correct standard errors in this case (see, e.g., Hayashi, 2000, Section 8.7). However, generally the QMLE is inefficient. There is a more precise estimator, which uses the same information as the QMLE. 4 i t

5 To motivate this improved estimator, we rewrite the QMLE as a generalized method of moments (GMM) estimator. Let s it (θ) denote the score of the density function f it (θ), i.e. s it (θ) = θ ln f it (θ), (7) where θ denotes the gradient with respect to θ. Note that s it (θ) is a zero mean function when evaluated at the true parameter value θ o, i.e. Es it (θ o ) = 0, for any i and t (8) Note that the QMLE of θ solves s it (ˆθ QMLE ) = 0, i t which is also the first order condition solved by the GMM estimator based on the moment condition E t s it (θ) = 0. (9) Therefore, the QMLE and the GMM estimator based on (9) are identical. The key to improving efficiency of the QMLE is to notice that the GMM estimator based on (9) uses a suboptimal weighting matrix. The moment condition in (9) sums the zero-mean score functions s it over t while the theory of optimal GMM suggests using a different, data driven weighting mechanism. To be more precise, define the vector of T score functions for each individual i: s i1 (θ) s i (θ) =.. s it (θ) Note that each element of this vector is zero-mean. Thus, Es i (θ o ) = 0 (10) Under appropriate regularity conditions, applying the optimal GMM machinery to these moment conditions is known to produce an estimator whose asymptotic variance matrix is smaller than that of any other estimator using the same moment conditions, including the QMLE (see, e.g., Hansen, 1982). 5

6 In the setting of a general panel data model, Prokhorov and Schmidt (2009) call the optimal GMM estimator based on (10) the improved QMLE (IQMLE). The IQMLE is the solution to following optimization problem where s (θ) = 1 N ˆθ IQMLE = arg min s (θ) W s (θ), (11) θ i s i (θ) and W is the inverse of the estimated covariance matrix of s i (θ o). This estimator is consistent so long as the QMLE is but it is in general more efficient. However, neither of the two estimators allow to model time dependence in u it. 3 Copulas and full MLE Let the bold letters ε, u, v denote vectors of dimension T that contain entire histories of respective error terms, so, e.g., u i = (u i1,..., u it ), and let f ε, f u, f v denote the respective joint (over all t) densities. Explicit modelling of time dependence between u it s requires a model for the multivariate density f ε or f u. We would like this density to accommodate arbitrary dependence and to have the correct marginal densities. That is, if we use f u we would like it to have half-normal marginals, if we use f ε we would like it to accommodate normal/half-normal marginal densities of the composed error term. Copulas can do this. Briefly defined, a copula is a multivariate distribution function with uniform marginals (a rigorous treatment of copulas can be found, e.g., in Nelsen, 2006). If we let C(w 1,..., w T ) denote the copula cdf and c(w 1,..., w T ) denote the copula density then each margin w t, t = 1,..., T, is assumed to come from a uniform distribution on [0, 1]. Copula functions usually have at least one parameter that models dependence between w s. Let ρ denote a generic vector of such parameters. Each parametric copula function is known as a copula family. We provide several examples of copula families in the Appendix A. An important feature of copula functions is that they differ in the range of dependence they can cover. Some copulas can cover the entire range of dependence they are called comprehensive copulas while others can only accommodate a restricted range of dependence. For example, suppose T = 2 and we measure dependence by Pearson s correlation coefficient. Then, the Gaussian, Frank and Plackett copulas are comprehensive, while the Farlie-Gumbel-Morganstern copula can model correlations only between about 0.3 and This will be important in empirical applications 6

7 because the copula-based likelihood we construct should be able to capture a fairly high degree of dependence present in the data. Clearly, we would like to use comprehensive copulas since we are looking to model an arbitrary degree of dependence. By a theorem due to Sklar (1959), given the marginals and a continuous joint distribution H(ξ 1,..., ξ T ), there exists a unique copula such that H(ξ 1,..., ξ T ) = C(F 1 (ξ 1 ),..., F T (ξ T )) or h(ξ 1,..., ξ T ) = c(f 1 (ξ 1 ),..., F T (ξ T )) f 1 (ξ 1 )... f T (ξ T ). We now have a flexible joint distribution function, whose marginals we can specify in advance. As mentioned above, there are two ways of using copulas to model dependence over t, corresponding to whether we apply a copula to form f ε or to form f u. We now consider each of these possibilities. 4 Application of copula to (ε i1,..., ε it ) We start by using a copula to form f ε, the joint (over t) density of the composed error. Given the normal/half-normal marginals of ε s and a copula family C(w 1,..., w t ; ρ), the joint density can be written as follows f ε (ε i ; θ, ρ) = c(f i1 (θ),..., F it (θ); ρ) f i1 (θ)... f it (θ), where, as before, f it (θ) f(ε it ) = f(y it x itβ) is the pdf of the composed error term evaluated at ε it and F it (θ) ε it f(s)ds is the corresponding cdf. This approach gives rise to two estimators and several computational issues, which we will discuss next. 4.1 FMLE and GMM Given a sample of size N, the joint density f ε (ε i ; θ, ρ) permits construction of the sample loglikelihood as follows ln L(θ, ρ) = N (ln c(f i1 (θ),..., F it (θ); ρ) + ln f i1 (θ) ln f it (θ)) (12) i=1 7

8 Because the likelihood is based on the joint distribution of ε s, we will call the estimator that maximizes the likelihood, the full maximum likelihood estimator (FMLE), That is, (ˆθ FMLE, ˆρ FMLE ) = arg max ln L(θ, ρ) (13) θ,ρ The first term in the summation is what distinguishes the full log-likelihood from the quasi-loglikelihood the term reflects the dependence between the composed errors at different t. When viewed from the GMM perspective, the FMLE solves the first order conditions that are identical to the first order conditions solved by the GMM estimator based on the moment conditions E θ ln c i (θ o, ρ o ) + θ ln f i1 (θ o ) θ ln f it (θ o ) = 0, (14) ρ ln c i (θ o, ρ o ) where c i (θ, ρ) = c(f i1 (θ),..., F it (θ); ρ). Again, the key to developing a new estimator is the observation that the moment conditions in (14) are obtained by applying a specific (not necessarily optimal) weighting scheme. θ ln f i1 (θ)... Let ψ i (θ, ρ) θ ln f it (θ) and let (ˆθ GMM, ˆρ GMM ) be the optimal GMM estimator based θ ln c i (θ, ρ) ρ ln c i (θ, ρ) on the moment conditions Eψ i (θ o, ρ o ) = 0 (15) That is, (ˆθ GMM, ˆρ GMM ) is the solution to the following optimization problem where φ(θ, ρ) = 1 N (ˆθ GMM, ˆρ GMM ) = arg min θ,ρ ψ(θ, ρ) W ψ(θ, ρ), (16) i ψ i(θ, ρ) and W is the inverse of the estimated covariance matrix of ψ i (θ o, ρ o ). Prokhorov and Schmidt (2009) consider this estimator in the setting of a general panel data model and show that it is consistent so long as the FMLE above is consistent. Unlike the QMLE and IQMLE, both FMLE and GMM estimators allow for arbitrary time dependence in panel stochastic frontier models while offering important computational advantages over the available alternatives. We discuss these advantages in the next section. 8

9 4.2 Evaluation of integrals In this section we show that by applying a copula to (ε i1,..., ε it ), we circumvent the task of a T -dimensional integration imposed by traditional methods. In effect, we are replacing that task with the task of T evaluations of an one-dimensional integral, which is a much simpler task. To demonstrate the advantages of our approach, we first consider what happens if we use a multivariate truncated distribution to form the full log-likelihood. Suppose, for example, that the joint distribution of u is multivariate truncated normal with parameter matrix Σ (see Pitt and Lee, 1981). Then, { } 1 f u (u) = (2π) T/2 Σ 1/2 exp 2 u Σ 1 u /P 0, (17) where P 0 is the probability that u 0, which can be expressed as the following T -dimensional integral: P 0 = 0 0 (2π) T/2 Σ 1/2 exp { } 1 2 u Σ 1 u du (18) Given f u (u), the joint density of the composed error vector ε can be obtained as yet another T -dimensional integral as follows: f ε (ε) = 0 0 T t=1 (ε 1 t +u t ) 2 e 2σv 2 f u (u) du. (19) 2πσv Note that inside this integral there is the joint density f u (u), which is itself calculated using T - dimensional integration. The resulting likelihood is intractable. Another difficulty with using the multivariate truncated normal distribution is that its marginals are not truncated normal distributions unless the marginals are independent (see Horrace, 2005, Theorems 4 and 6). So if we want to maintain the assumption of truncated normal marginals and allow for dependence by assuming a truncated normal joint distribution, that approach is not acceptable. Moreover, to our knowledge, no known multivariate distribution, aside from those constructed using copulas, has truncated normal marginals. Instead of T -dimensional integrations, the FMLE and GMM estimators require repeated evaluation of F (ε) ε which serves as an argument of the copula function. f(s) ds Evaluation of this integral is a standard task in one-dimensional integration that can be accomplished in standard econometric software by 9

10 quadrature or by simulation. 1 Moreover, Tsay et al. (2009) propose an approximation that can be used specifically in the setting of a stochatic frontier model to speed up the evaluations. Wang et al. (2008) tabulate several quantiles of F (ε) using an alternative parametrization, in which σ is replaced with σ v 1 + λ 2. They obtain their quantiles by simulation. We chose the parametrization (σ, λ) rather than (σ v, λ) because it bounds the variance of each error component by the value of σ 2. This is convenient because given σ the quantiles do not become large negative numbers as λ increases (the cdf does not flatten out). Our cdf is effectively on the domain [ 4, 4] for all λ. Also when applying copulas to ε, we prefer using numerical integration rather than simulation MSLE and GMSM Integral evaluations by simulation generate an approximation error which need be accounted for. This section does that. The basic idea of simulation-based methods is that under suitable regularity conditions a simulator will converge to the object it simulates. estimators of f(ε) and F (ε), respectively. Let ˆf(ε) and ˆF (ε) denote the simulation-based We are interested in the properties of the MLE and GMM estimators that use the estimators instead of the true values of the two functions. Such estimation methods fall into the class of Maximum Simulated Likelihood (MSLE) and Generalized Method of Simulated Moments (GMSM) estimators. Statistical properties of MSLE and GMSM are well-studied (see, e.g., McFadden, 1989; Gouriéroux and Monfort, 1991; McFadden and Ruud, 1994). For example, it is well known that the so called direct simulator ˆf(ε) = 1 S S φ(ε + u s ), s=1 1 The normal/half-normal error structure permits an additional shortcut when evaluating F (ε), the composed error cdf. By the algebraic properties of the cdf, for a fixed λ, F σ(ε) = F 1(ε/σ), where the subscript denotes the value of σ used in evaluting F. Therefore we can express quantiles of F (ε) for any σ as σ times the corresponding quantile for σ = 1. This means that we need to evaluate the integral only at σ = 1. Then, we can obtain its value at any other σ by calculating the product. 2 Our experience suggests that numeral integration by quadrature is somewhat faster than simulation for the same level of precision. A GAUSS code, which can be used to obtain any number of quantiles at desired precision using both methods, is available from the authors. 10

11 where φ is the density of v and u s are draws from N(0, σu) 2 +, s = 1,..., S, converges to f(ɛ) as S, and ˆF (ε) = 1 ˆf(e), N converges to F (ε) as N. Moreover, by a well-known result from Gouriéroux and Monfort (1991), the estimator obtained by maximizing the simulated log-likelihood (ln c( ˆF i1 (θ),..., ˆF it (θ); ρ) + ln ˆf i1 (θ) ln ˆf it (θ)) (20) i has the same asymptotic distribution as the FMLE provided that S N N/S e ε So the simulation error is ignorable asymptotically. A similar result is available for the GMSM estimator. McFadden (1989) shows that, under usual regularity conditions, the GMM estimator based on moment conditions (15), in which f it is replaced with ˆf it and F it is replaced with ˆF it, is consistent for θ even for fixed S. However, the asymptotic variance matrix of the GMSM estimator has to be adjusted to reflect the simulation error unless S. Compared to the MSLE, the GMSM estimator is more complicated and is known to be numerically unstable (see, e.g., McFadden and Ruud, 1994). Moreover, in settings with large T and relatively small N as in our application, the GMSM may generate too many moment conditions to handle given a sample size. Therefore, the MSLE appears more practical. 4.4 Choosing Copula There are many tests that can help choose a copula (see, e.g., Genest et al., 2009, for a review of copula goodness of fit tests). This section proposes a variation of the test of Prokhorov and Schmidt (2009), which is particularly well suited for the case when we apply a copula to (ε i1,..., ε it ). The test is focused on the copula based moment conditions which we are doubtful about, while maintaining the validity of the marginals. 3 The test proceeds as follows. In the first step, we obtain 3 We modify the original test of Prokhorov and Schmidt (2009) to make it suitable for use with the standard QMLE in the first step rather IQMLE. 11

12 ˆθ QMLE. In the second step, we apply the GMM with a specific form of the weighting matrix to the copula based moment conditions E θ ln c i (ˆθ QMLE, ρ) ρ ln c i (ˆθ QMLE, ρ) = 0, (21) where the required form of the weighting matrix is given in Appendix B. This produces an estimate of ρ denote it by ˆρ and the optimized value of the GMM criterion function denote it by Q(ˆθ QMLE, ˆρ). To constuct the test statistic, we look at the produce of the sample size and Q(ˆθ QMLE, ˆρ). The claim is, under the null of a correct copula, N Q(ˆθ QMLE, ˆρ) is asymptotically distributed according to the χ 2 distribution with the degrees of freedom equal to the dimension of θ. We will reject the null hypothesis that the chosen copula is valid for large values of the test statistic. The proof of this claim is given in Appendix B. 5 Application of copula to (u i1,..., u it ) We now consider the alternative copula specification. We apply a copula to form the joint distribution of (u i1,..., u it ) rather than that of (ε i1,..., ε it ). As we mentioned before, a T -dimensional integration will be needed to form the likelihood in this case. This will be similar to the example of multivariate truncated normal in previous section but the use of a copula will permit to avoid some of the difficulties associated with that approach. As before, let f u (u; θ, ρ) denote the copula-based joint density of the one-sided error vector and let f u (u; θ) denote the marginal density of an individual one-sided error term. Then, f u (u; θ, ρ) = c(f u (u 1 ; θ),..., F u (u T ; θ); ρ) f u (u 1 ; θ) f u (u T ; θ), (22) where F u (u; θ) u 0 f u(s; θ) ds is the cdf of the half-normal error term. The joint density of the composed error vector ε can be obtained as follows f ε (ε; θ, ρ) = 0 0 f v (ε + u; θ)f u (u; θ, ρ)du 1 du T = E u(θ,ρ) φ(ε + u), (23) where E u(θ,ρ) denotes the expectation with respect to f u (u; θ, ρ) and f v (v; θ) = φ(v) is the multivariate normal pdf of v, where all v s are independent and have equal variance σ 2 v. As in the multivariate truncated normal example, this is a T -dimensional integral which has no analytical 12

13 form and is not easy to evaluate using numerical methods. Unlike that example, however, it involves only one T -dimensional integral and it has the form of an expectation over a distribution we can easily sample from. We discuss the estimators that are available in this setting, and the issues that arise implementing them, in the next section. 5.1 Full MSLE The key benefit of using a copula is that we can simulate from the distribution of u and estimate f ε (ε) by averaging φ(ε + u) over draws from that distribution. Let S denote the number of simulations. The direct simulator of f ε (ε) can be written as follows ˆf ε (ε; θ, ρ) = 1 S S φ(ε + u s (θ, ρ)), s=1 where u s (θ, ρ) is a draw from f u (u; θ, ρ) constructed in (22). Then, a full simulated log-likelihood can be obtained as follows ln L s (θ, ρ) = i ln ˆf ε (ε i ; θ, ρ), where, as before, ε i = (ε i1,..., ε it ) and ε it = y it x itβ. The full maximum simulated likelihood estimator (FSMLE) is then given by (ˆθ FMSL, ˆρ FMSL ) = arg max θ,ρ ln Ls (θ, ρ). (24) This method is analogous to the simulation-based estimation of univariate densities discussed in a previous section. This is a multivariate extension of that method. Similar asymptotic arguments suggest that the asymptotic distribution of (ˆθ FMSL, ˆρ FMSL ) is identical to the FMLE based on (23), provided that S, N, N/S and suitable regularity conditions hold (see, e.g., Gouriéroux and Monfort, 1991). Note that we do not have a GMSM estimator here because we simulate the joint density f ε directly. However, we have an estimator of inefficiencies. 5.2 Estimation of Technical Inefficiencies The use of a copula to form f u permits the construction of a new and improved estimator of technical inefficiencies u i, i = 1,..., N. 13

14 In a single cross section (single t) setting, Jondrow et al. (1982) propose estimating u it by the conditional expectation Eu it ε it, where Eu it ε it = σλ 1 + λ 2 ( ) φ εi λ 1 Φ σ ( εit λ σ ) ε itλ σ We have defined above four consistent estimators of θ. Using each of them, we can easily construct an analoguous estimator of Eu it ε it (see, e.g., Greene, 2005). However, this estimator does not explicitly incorporate the dependence information. We propose estimating technical inefficiencies by E(u ε) the expected technical inefficiency conditional on the composed error vector. The conditional expectation E(u ε) is with respect to the conditional density f(u ε), which we do not know. However, we now know how to form the joint distribution of (u 1,..., u T ) using a copula and we can draw from that distribution. As before, we omit the subscript i and write u = u 1. u T = ε 1. ε T v 1. v T = ε v where u t N(0, σ 2 u) +, v N(0, σ 2 vi T ), and u and v are independent. Assume we applied a copula to u as discussed above and we now have f u (u). Suppose for simplicity that we use a one-parameter copula. Then, f u (u; σ 2 u, ρ) = c [ F u (u 1 ; σ 2 u),..., F u (u T ; σ 2 u); ρ ] T f u (u t ; σu) 2 Note that f u and F u denote the pdf and cdf, respectively, of the half-normal distribution N(0, σ 2 u) +. Assume for now that (σ 2 u, σ 2 v, ρ) are known. The key observation is that we can draw from the joint distribution of u and v. First, we draw from the copula-based distribution of u. This is done as follows: draw {w 1,..., w T } from copula C generate u t = F 1 (w t ), t = 1,..., T, where F 1 (w) is the inverse cdf of F u (u). We obtain S such draws u s. t=1 14

15 Second, we draw v s N(0, σ 2 vi T ) and obtain ε s = v s u s, s = 1,..., S. We now have draws from the joint distribution of (u, ε) and can estimate E(u ε) by any nonparametric method, e.g., by the multivariate version of the Nadaraya-Watson estimator S s=1 Ê(u ε) = u sk ( ε s ε ) h S s=1 K ( ), ε s ε h where K ( ε s ε h is a standard result that ) = T t=1 K ( ) ε ts ε t h, K( ) is a kernel function and h is the bandwidth parameter. It Ê(u ε) E(u ε) as S, h 0 and Sh T Obviously, this procedure is not feasible unless we know the values of (σu, 2 σv, 2 ρ). But these values are available from the FMSLE. Similarly, to obtain the estimated inefficiencies, we evaluated the estimated conditional expectation at the residuals ˆε it = y it x ˆβ, it where ˆβ also comes from FMSLE. As alternatives, one may use the (I)QMLE, FMLE or GMM estimates of (β, σu, 2 σv). 2 However, these estimators do not produce an estimate of the copula parameter for u. 4 A way around this difficulty is to draw from the joint distribution of (u, ε) for different values of ρ until a certain property of the data, e.g., sample correlation of ˆε it over time, is matched by the simulated sample. Because of the various criteria that can be used in matching (especially if T is large and if ρ is not a scalar), this estimator is less practical then the one using the FMSLE. 6 Empirical Application: Indonesian Rice Farm Production We demonstrate the use of copulas with a well-known data set from Indonesian rice farms (see, e.g., Lee and Schmidt, 1993; Horrace and Schmidt, 1996, 2000). The data set is a panel of 171 Indonesian rice farmers for whom we have 6 annual observations of rice output (in kg), amount of seeds (kg), amount of urea (in kg), labor (in hours), land (in hectares), whether pesticides were used, which of 3 rice varieties was planted and in which of 6 regions the farmer s village is located. Therefore, there are 13 explanatory variables besides the constant. 4 The FMLE and GMM produce estimates of the dependence parameter in the copula for ε but this is not the parameter we need. 15

16 Table 1: Indonesian rice production: QMLE and Gaussian copula-based FMLE and FMSLE HS(2000) QMLE FMLE FMSLE Est. Est. St.Err Rob.St.Err. Est. St.Err. Est. St.Err. CONST SEED UREA TSP LABOR LAND DP DV DV DR DR DR DR DR σ λ ln L We start by stating the results from the QMLE defined in (6). They are given in the first four columns of Table 1. For reference, we give the ML estimates obtained by Horrace and Schmidt (2000) using just the first cross section of the data. Our results are based on all six cross sections. It is important to account for potential correlation between the cross sections. The robust standard errors we report for the QMLE do this. They are obtained using the sandwich formula, while the first set of standard errors are obtained using the estimated Hessian. Unfortunately Horrace and Schmidt (2000) do not report standard errors of their ML estimation or the log-likelihood value. The second set of results are the FMLE defined in (13). We use the Gaussian copula to construct 16

17 a multivariate distribution of ε. The Gaussian copula is obtained from the multivariate Normal distribution with the correlation matrix R by plugging in the inverse of the univariate standard normal distribution function Φ(x)(see Appendix A). It is easy to show (see, e.g., Cherubini et al., 2004, p. 148) that the resulting copula density can be written as follows c(w 1,..., w T ; R) = 1 R e 1 2 ζ (R 1 I)ζ where ζ = (Φ 1 (w 1 ),..., Φ 1 (w T )). In the Gaussian copula, the dependence structure between the marginals is represented by the correlation matrix R. So our FMLE results contain estimates of a 6 6 correlation matrix, which we do not report. The last set of estimates are the FMSLE defined in (24), where we use the Gaussian copula to construct and sample from a multivariate distribution of u. We use S = 1000 draws in evaluating ˆf ε (ε) by simulation. As in FMLE, the results include 15 estimated correlations. Now, these are correlations over t of Φ 1 (F u (u t )), not of Φ 1 (F ε (ε t )), though the two estimated correlation matrices (not reported here) turn out very similar. Next we consider the two moment based estimators, namely the IQMLE defined in (11) and the GMM estimator defined in (16). These estimators have some theoretical advantages such as relatively higher asymptotic efficiency but, in practice, they may be infeasible. In this application, for example, we have 16 parameters, 6 time periods and only 171 cross sectional observations. The vector of moment conditions on which the IQMLE is based is Moreover, the additional vector of copula-based moment conditions employed by the GMM estimator is 15 1 for the parameters of R and 16 1 for the parameters of the marginals. So we have as many as 127 moment conditions. Obtaining stable GMM estimates for this problem with only 171 observations proved impossible. Therefore, instead of estimating the full vector of parameters we considered a simplified problem of estimating only two parameters, the constant and one slope coefficient, while keeping the other parameters fixed at their QMLE values. We therefore have 12 and 14 moment conditions to use in IQMLE and GMM, respectively. This is a more appropriate setting for a moment based estimation using a sample of size 171. Table 2 reports our estimates along with the test statistic of Section 4.4 for the Gaussian copula. The moment based estimates in Table 2 have two versions. Version 1 is the estimates obtained by the two-step GMM. In the first step, we obtain preliminary parameter estimates using the 17

18 Table 2: Indonesian rice production: restricted QMLE, IQMLE and Gaussian copula-based GMM QMLE IQMLE-1 IQMLE-2 GMM-1 GMM-2 Est. St.Err. Est. St.Err. Est. St.Err. Est. St.Err. Est. St.Err. CONST SEED Copula validity test statistic 41.6 identity matrix for weighting. In the second step, we estimate the covariance matrix of the moment conditions and update the weighting by the inverse of the covariance matrix. Version 2 is the iterated GMM estimation, in which we carry on the updating for 5 more iterations. The iterated GMM estimates are more efficient relative to the QMLE and the two-step GMM, which is evidenced by the slightly smaller standard errors of this estimate. On the other hand, the GMM estimators are known to have substantial small sample biases which are coupled with the simulation biases discussed above. The dispersion of the parameter estimates reported in the table suggests that these biases may be an issue here. Moreover, the copula based estimators are subject to the bias caused by copula misspecification. In the table, we provide the copula validity test statistic we discussed in Section 4.4. The test strongly rejects the null that the Gaussian copula is valid. This indicates that the copula based estimates GMM-1 and GMM-2, though most efficient relative to the other reported estimates, may suffer from a copula misspecification bias. This can also be seen from the distance of the GMM parameter estimates from the other estimates that do not use a copula and so, are robust to copula misspecification. Finally, we use the procedure outlined in Section 5.2 to estimate technical inefficiencies. As a benchmark we report inefficiency estimates obtained using the analytical expression for Eu it ε it derived by Jondrow et al. (1982). These estimates are reported in Table 3. The estimates obtained using a copula are reported in Table 4. The copula based estimates employ the Gaussian copula to draw from the joint distribution of u and ε with the number of draws S = 10, 000. Given the simulated sample, we use the Gaussian kernel with a common bandwidth parameter chosen by cross-validation to obtain the improved estimates of inefficiencies. For each method, we report 18

19 Table 3: Indonesian rice production: inefficiency estimates based on JLMS formula Rank Fractile ˆσ û t = 1 t = 2 t = 3 t = 4 t = 5 t = Average estimated inefficiencies in each time period for ten farms, namely those that are at the.05,.15,...,.95 fractiles of the sample distribution of inefficiencies for t = 1. The first two columns report the rank and the quantile of the selected farms. The third and the fourth columns contain the sample variations and the sample means over t, respectively, of the inefficiency estimates, which are reported in columns 5 through 10. The last row of the tables reports the correponding column averages over the ten farms. A few important observations can be made from the tables. First, there is a substantial reduction in variability of inefficiencies over t when we use the copula based estimator. The sample variation ˆσ of the inefficiency estimates over time obtained using the traditional method is several times higher for all farms. The improved estimates of inefficiencies û it vary less over t reflecting the implied correlation between them. On the other hand, high and low inefficiencies obtained using both methods tend to agree. Neither method produces consistently higher or consistently lower inefficiencies for any of the selected farms. However, overall the copula based estimator seems to produces a slightly lower average estimated inefficiency. This may be caused for the copula misspecification bias. Finally, previous authors who used this data set have reported an increase 19

20 Table 4: Indonesian rice production: inefficiency estimates based on a copula Rank Fractile ˆσ û t = 1 t = 2 t = 3 t = 4 t = 5 t = Average in technical inefficiency in the six year period and attributed this to the fact that the first three years correspond to a wet growing season while the last three to a dry. Using our new inefficiency estimates we tend to confirm this result. 7 Concluding Remarks In this paper we have proposed a new, copula-based, method of accounting for dependence between cross sections in panel stochastic frontier models. Compared to available alternatives, this method is simple and flexible. It allows modeling arbitrary dependence and does not involve much computational cost. We propose several estimators based on this method, illustrate how one can test for copula validity and suggest a simulation-based method of estimating technical inefficiencies. Advantages of the proposed approach are not only computational. By assuming a copula we assume a joint distribution. This is important for estimation precision. We use the setting of the Generalized Method of Moments (GMM) to point out that copulas contain information that is not in general redundant. So besides the computational advantages, copulas offer improvement in precision of estimation. 20

21 An interesting extension of this paper would be to compare robustness of the copula-based estimators we propose, with that of the alternative estimators based on the scaling property. After all, both approaches potentially involve an incorrectly specified joint likelihood function. We leave this question for future research. References Aigner, D., C. A. K. Lovell, and P. Schmidt (1977): Formulation and estimation of stochastic frontier production function models, Journal of Econometrics, 6, Alvarez, A., C. Amsler, L. Orea, and P. Schmidt (2006): Interpreting and Testing the Scaling Property in Models where Inefficiency Depends on Firm Characteristics, Journal of Productivity Analysis, 25, Battese, G. and T. Coelli (1995): A model for technical inefficiency effects in a stochastic frontier production function for panel data, Empirical Economics, 20, Battese, G. E. and T. J. Coelli (1992): Frontier production functions, technical efficiency and panel data: With application to paddy farmers in India, Journal of Productivity Analysis, 3. Caudill, S. B., J. M. Ford, and D. M. Gropper (1995): Frontier Estimation and Firm-Specific Inefficiency Measures in the Presence of Heteroscedasticity, Journal of Business and Economic Statistics, 13, Cherubini, U., E. Luciano, and W. Vecchiato (2004): Copula Methods in Finance, Wiley Finance Series, John Wiley and Sons Ltd. Genest, C., B. Rémillard, and D. Beaudoin (2009): Goodness-of-fit tests for copulas: A review and a power study, Insurance: Mathematics and Economics, 44, Gouriéroux, C. and A. Monfort (1991): Simulation Based Inference in Models with Heterogeneity, Annales d Économie et de Statistique, Greene, W. (2005): Reconsidering heterogeneity in panel data estimators of the stochastic frontier model, Journal of Econometrics, 126, , doi: DOI: /j.jeconom Greene, W. H. (2003): Econometric Analysis, Upper Saddle River, NJ : Prentice Hall, 5th ed. Hansen, L. (1982): Large sample properties of generalized method of moments estimators, Econometrica, 50, Hayashi, F. (2000): Econometrics, Princeton University Press. Horrace, W. C. (2005): Some results on the multivariate truncated normal distribution, Journal of Multivariate Analysis, 94,

22 Horrace, W. C. and P. Schmidt (1996): Confidence statements for efficiency estimates from stochastic frontier models, Journal of Productivity Analysis, 7, (2000): Multiple Comparisons with the Best, with Economic Applications, Journal of Applied Econometrics, 15, Jondrow, J., C. A. K. Lovell, I. S. Materov, and P. Schmidt (1982): On the estimation of technical inefficiency in the stochastic frontier production function model, Journal of Econometrics, 19, , doi: DOI: / (82) Kumbhakar, S. C. (1990): Production frontiers, panel data, and time-varying technical inefficiency, Journal of Econometrics, 46, Lee, Y. H. and P. Schmidt (1993): A Production Frontier Model with Flexible Temporal Variation in Technical Inefficiency, in Measurement of Productive Efficiency: Techniques and Applications, ed. by H. Fried, C. Lovell, and P. Schmidt, Oxford University Press. McFadden, D. (1989): A Method of Simulated Moments for Estimation of Discrete Response Models Without Numerical Integration, Econometrica, 57, McFadden, D. and P. A. Ruud (1994): Estimation by Simulation, The Review of Economics and Statistics, 76, Meeusen, W. and J. van Den Broeck (1977): Efficiency Estimation from Cobb-Douglas Production Functions with Composed Error, International Economic Review, 18, Nelsen, R. B. (2006): An Introduction to Copulas, vol. 139 of Springer Series in Statistics, Springer, 2 ed. Pitt, M. M. and L.-F. Lee (1981): The measurement and sources of technical inefficiency in the Indonesian weaving industry, Journal of Development Economics, 9, Prokhorov, A. and P. Schmidt (2009): Likelihood-based estimation in a panel setting: robustness, redundancy and validity of copulas, Journal of Econometrics, 153, Sklar, A. (1959): Fonctions de répartition à n dimensions et leurs marges, Publications de l Institut de Statistique de l Université de Paris, 8, Tsay, W.-J., J. Huang, Cliff, T.-T. Fu, and I. L. A. Ho (2009): Maximum Likelihood Estimation of Censored Stochastic Frontier Models: An Application to the Three-Stage DEA Method, IEAS Working Paper : academic research. Wang, H.-J. (2002): Heteroscedasticity and Non-Monotonic Efficiency Effects of a Stochastic Frontier Model, Journal of Productivity Analysis, 18,

23 Wang, W. S., C. Amsler, and P. Schmidt (2008): Goodness of fit tests in stochastic frontier models, mimeo, Michigan State University. A Some copula families - Independence or product copula: C(w 1,..., w T ) = w 1... w T - Gaussian or Normal copula: C(w 1,..., w T ; R) = Φ T (Φ 1 (w 1),..., Φ 1 (w T ); R) where Φ denotes the standard normal cdf and R denotes the correlation matrix. - Gumbel copula: [ C(w 1,..., w T ; ρ) = exp (( ln w 1) ρ ( ln w T ) ρ ) 1/ρ], ρ [1, ) - Clayton copula: [ ] C(w 1,..., w T ; ρ) = max (w ρ w ρ T ) 1/ρ, 0, ρ [ 1, ) {0} - Farlie-Gumbel-Morganstern (FGM) copula: where ρ j [ 1, 1] - General copula by inversion T T C(w 1,..., w T ; ρ) = w t(1 + t=1 t=2 1 j 1 <...<j k T start with cdf s K(x 1,..., x T ), w 1 = F 1(x 1),...,w T = F T (x T ) obtain x 1 = F 1 1 (w 1),...,x T = F 1 (wt ) and - Archimedean copulas T ρ j1...j k (1 w j1 )... (1 w jk )) C(w 1,..., w T ) = K(F 1 1 (w 1),..., F 1 T (w T )) start with a generator function ϕ : (0, 1) [0, ], ϕ < 0 and ϕ > 0 obtain C(w 1,..., w T ) = ϕ 1 (ϕ(w 1) ϕ(w T )) multivariate copulas can often be obtained recursively from a bivariate copula (not generally true for other families) e.g., Gumbel copula is Archimedean with ϕ(t) = ( log t) ρ, Frank copula with ϕ(t) = ln 1 e ρ 1 e ρt 23

24 B Copula validity test The test is similar to the test in Theorem 9 of Prokhorov and Schmidt (2009). We adapt the test to the first step QMLE. Also, let For simplicity, we consider T = 2. For t = 1, 2 and i = 1,..., N, let f ti(θ) = f(ε ti; θ), c i(θ, ρ) = c(f (ε 1i; θ), F (ε 2i; θ); ρ), [ g i(θ) = θ ln f 1i(θ) + θ ln f 2i(θ) ḡ(θ) 1 N r i(θ, ρ) = N g i(θ), i=1 θ ln c i(θ, ρ) ρ ln c i(θ, ρ) r(θ, ρ) 1 N. ], N r i(θ, ρ). Let (θ o, ρ o) denote the true parameter values and define the following matrices V o 11 Eg(θ o)g(θ o), i=1 V o 22 Er(θ o, ρ o)r(θ o, ρ o), V12 o = V21 o Eg(θ o)r(θ o, ρ o), D o 11 E θ g(θ o), D o 21 E θ r(θ o, ρ o), D o 22 E ρr(θ o, ρ o), where expectations are with respect to the true joint density h(ε 1, ε 2). Proposition 1 Let ˆθ QMLE be the QMLE of θ and let ˆρ be obtained by minimizing r(ˆθ QMLE, ρ) B 1 o r(ˆθ QMLE, ρ), where B o = V22 o D o 21D o 11 1 V o 12 V21D o o 11 1 D o 21 +D o 21(D o 11V o 11 1 D o 11) 1 D o 21. Then, N r(ˆθ QMLE, ˆρ) B 1 o r(ˆθ QMLE, ˆρ) a χ 2 p, (25) where p is the dimension of θ. First note that, by standard QMLE results, ˆθ QMLE satisfies N(ˆθQMLE θ o) = (D o 11) 1 Nḡ(θ o) + o p(1). (26) The first order condition for ˆρ can be written as [ ρ r(ˆθ QMLE, ˆρ)] B o 1 r(ˆθ QMLE, ˆρ) = 0, D o 22 B o 1 N r(ˆθ QMLE, ˆρ) = o p(1). (27) 24

25 Now, by the mean-value theorem, we have N r(ˆθqmle, ˆρ) = N r(θ o, ρ o) + D o 21 N(ˆθQMLE θ o) + D o 22 N(ˆρ ρo) + o p(1). (28) Substituting (26) into (28), pre-multiplying by D o 22 B o 1, and solving for N(ˆρ ρ o) using (27) yields N(ˆρ ρo) = (D o 22 B 1 o D o 22) 1 D o 22 Substituting (29) and (26) into (28) and simplifying results in B 1 o N r(θo, ρ o) +(D o 22 B 1 o D o 22) 1 D o 22 B 1 o D o 21D o 11 1 Nḡ(θ o) + o p(1). (29) N r(ˆθqmle, ˆρ) = R o N φ(θo, ρ o) + o p(1), (30) where R o = I D o 22(D o 22 B 1 o D o 22) 1 D o 22 B 1 o φ(θ o, ρ o) = r(θ o, ρ o) D o 21D o 11 1ḡ(θ o). B 1 2 Note that N φ(θ o, ρ o) N(0, B o), and thus B 1/2 o N φ(θo, ρ o) N(0, I). o [I B 2 1 o D o 22(D o 22 B 1 o D o 22) 1 D o 22 B 1 2 o ]B 2 1 o. Thus, the test statistic in (25) can be written as Also, note that R ob 1 o R o = i.e. as a quadratic form in standard normals with the coefficient matrix N φ(ˆθ, ˆρ) P φ(ˆθ, ˆρ), (31) P = I p+q B 1/2 o D o 22(D o 22 B 1 o D o 22) 1 D o 22 B 1/2 o. (32) This matrix is idempotent: it is the projection matrix orthogonal to B 1 2 o D o 22. The χ 2 -test in (25) follows immediately because tr(p) = p + q rank(d o 22) = p. 25

Using copulas to model time dependence in stochastic frontier models

Using copulas to model time dependence in stochastic frontier models Using copulas to model time dependence in stochastic frontier models Christine Amsler Michigan State University Artem Prokhorov Concordia University November 2008 Peter Schmidt Michigan State University

More information

Using copulas to model time dependence in stochastic frontier models Preliminary and Incomplete Please do not cite

Using copulas to model time dependence in stochastic frontier models Preliminary and Incomplete Please do not cite Using copulas to model time dependence in stochastic frontier models Preliminary and Incomplete Please do not cite Christine Amsler Michigan State University Artem Prokhorov Concordia University December

More information

Using Copulas to Model Time Dependence in Stochastic Frontier Models

Using Copulas to Model Time Dependence in Stochastic Frontier Models Department of Economics Working Paper Series Using Copulas to Model Time Dependence in Stochastic Frontier Models 11-002 Christine Amsler Michigan State University Artem Prokhorov Concordia University

More information

A Goodness-of-fit Test for Copulas

A Goodness-of-fit Test for Copulas A Goodness-of-fit Test for Copulas Artem Prokhorov August 2008 Abstract A new goodness-of-fit test for copulas is proposed. It is based on restrictions on certain elements of the information matrix and

More information

On the Accuracy of Bootstrap Confidence Intervals for Efficiency Levels in Stochastic Frontier Models with Panel Data

On the Accuracy of Bootstrap Confidence Intervals for Efficiency Levels in Stochastic Frontier Models with Panel Data On the Accuracy of Bootstrap Confidence Intervals for Efficiency Levels in Stochastic Frontier Models with Panel Data Myungsup Kim University of North Texas Yangseon Kim East-West Center Peter Schmidt

More information

Robustness of a semiparametric estimator of a copula

Robustness of a semiparametric estimator of a copula Robustness of a semiparametric estimator of a copula Gunky Kim a, Mervyn J. Silvapulle b and Paramsothy Silvapulle c a Department of Econometrics and Business Statistics, Monash University, c Caulfield

More information

Calibration Estimation of Semiparametric Copula Models with Data Missing at Random

Calibration Estimation of Semiparametric Copula Models with Data Missing at Random Calibration Estimation of Semiparametric Copula Models with Data Missing at Random Shigeyuki Hamori 1 Kaiji Motegi 1 Zheng Zhang 2 1 Kobe University 2 Renmin University of China Econometrics Workshop UNC

More information

Cost Efficiency, Asymmetry and Dependence in US electricity industry.

Cost Efficiency, Asymmetry and Dependence in US electricity industry. Cost Efficiency, Asymmetry and Dependence in US electricity industry. Graziella Bonanno bonanno@diag.uniroma1.it Department of Computer, Control, and Management Engineering Antonio Ruberti - Sapienza University

More information

A Survey of Stochastic Frontier Models and Likely Future Developments

A Survey of Stochastic Frontier Models and Likely Future Developments A Survey of Stochastic Frontier Models and Likely Future Developments Christine Amsler, Young Hoon Lee, and Peter Schmidt* 1 This paper summarizes the literature on stochastic frontier production function

More information

Calibration Estimation of Semiparametric Copula Models with Data Missing at Random

Calibration Estimation of Semiparametric Copula Models with Data Missing at Random Calibration Estimation of Semiparametric Copula Models with Data Missing at Random Shigeyuki Hamori 1 Kaiji Motegi 1 Zheng Zhang 2 1 Kobe University 2 Renmin University of China Institute of Statistics

More information

One-Step and Two-Step Estimation of the Effects of Exogenous Variables on Technical Efficiency Levels

One-Step and Two-Step Estimation of the Effects of Exogenous Variables on Technical Efficiency Levels C Journal of Productivity Analysis, 18, 129 144, 2002 2002 Kluwer Academic Publishers. Manufactured in The Netherlands. One-Step and Two-Step Estimation of the Effects of Exogenous Variables on Technical

More information

An Alternative Specification for Technical Efficiency Effects in a Stochastic Frontier Production Function

An Alternative Specification for Technical Efficiency Effects in a Stochastic Frontier Production Function Crawford School of Public Policy Crawford School working papers An Alternative Specification for Technical Efficiency Effects in a Stochastic Frontier Production Function Crawford School Working Paper

More information

GOODNESS OF FIT TESTS IN STOCHASTIC FRONTIER MODELS. Christine Amsler Michigan State University

GOODNESS OF FIT TESTS IN STOCHASTIC FRONTIER MODELS. Christine Amsler Michigan State University GOODNESS OF FIT TESTS IN STOCHASTIC FRONTIER MODELS Wei Siang Wang Nanyang Technological University Christine Amsler Michigan State University Peter Schmidt Michigan State University Yonsei University

More information

CONSISTENT ESTIMATION OF THE FIXED EFFECTS STOCHASTIC FRONTIER MODEL. Peter Schmidt Michigan State University Yonsei University

CONSISTENT ESTIMATION OF THE FIXED EFFECTS STOCHASTIC FRONTIER MODEL. Peter Schmidt Michigan State University Yonsei University CONSISTENT ESTIMATION OF THE FIXED EFFECTS STOCHASTIC FRONTIER MODEL Yi-Yi Chen Tamkang University Peter Schmidt Michigan State University Yonsei University Hung-Jen Wang National Taiwan University August

More information

A measure of radial asymmetry for bivariate copulas based on Sobolev norm

A measure of radial asymmetry for bivariate copulas based on Sobolev norm A measure of radial asymmetry for bivariate copulas based on Sobolev norm Ahmad Alikhani-Vafa Ali Dolati Abstract The modified Sobolev norm is used to construct an index for measuring the degree of radial

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrics of Panel Data Jakub Mućk Meeting # 6 Jakub Mućk Econometrics of Panel Data Meeting # 6 1 / 36 Outline 1 The First-Difference (FD) estimator 2 Dynamic panel data models 3 The Anderson and Hsiao

More information

Probability Distributions and Estimation of Ali-Mikhail-Haq Copula

Probability Distributions and Estimation of Ali-Mikhail-Haq Copula Applied Mathematical Sciences, Vol. 4, 2010, no. 14, 657-666 Probability Distributions and Estimation of Ali-Mikhail-Haq Copula Pranesh Kumar Mathematics Department University of Northern British Columbia

More information

Generalized Method of Moment

Generalized Method of Moment Generalized Method of Moment CHUNG-MING KUAN Department of Finance & CRETA National Taiwan University June 16, 2010 C.-M. Kuan (Finance & CRETA, NTU Generalized Method of Moment June 16, 2010 1 / 32 Lecture

More information

Estimating Production Uncertainty in Stochastic Frontier Production Function Models

Estimating Production Uncertainty in Stochastic Frontier Production Function Models Journal of Productivity Analysis, 12, 187 210 1999) c 1999 Kluwer Academic Publishers, Boston. Manufactured in The Netherlands. Estimating Production Uncertainty in Stochastic Frontier Production Function

More information

Copula-Based Random Effects Models

Copula-Based Random Effects Models Copula-Based Random Effects Models Santiago Pereda-Fernández Banca d Italia September 7, 25 PRELIMINARY, INCOMPLETE, PLEASE DO NOT CITE Abstract In a binary choice panel data framework, standard estimators

More information

Introduction to Estimation Methods for Time Series models Lecture 2

Introduction to Estimation Methods for Time Series models Lecture 2 Introduction to Estimation Methods for Time Series models Lecture 2 Fulvio Corsi SNS Pisa Fulvio Corsi Introduction to Estimation () Methods for Time Series models Lecture 2 SNS Pisa 1 / 21 Estimators:

More information

Semi-parametric predictive inference for bivariate data using copulas

Semi-parametric predictive inference for bivariate data using copulas Semi-parametric predictive inference for bivariate data using copulas Tahani Coolen-Maturi a, Frank P.A. Coolen b,, Noryanti Muhammad b a Durham University Business School, Durham University, Durham, DH1

More information

Calibration Estimation for Semiparametric Copula Models under Missing Data

Calibration Estimation for Semiparametric Copula Models under Missing Data Calibration Estimation for Semiparametric Copula Models under Missing Data Shigeyuki Hamori 1 Kaiji Motegi 1 Zheng Zhang 2 1 Kobe University 2 Renmin University of China Economics and Economic Growth Centre

More information

Imputation Algorithm Using Copulas

Imputation Algorithm Using Copulas Metodološki zvezki, Vol. 3, No. 1, 2006, 109-120 Imputation Algorithm Using Copulas Ene Käärik 1 Abstract In this paper the author demonstrates how the copulas approach can be used to find algorithms for

More information

Non-linear panel data modeling

Non-linear panel data modeling Non-linear panel data modeling Laura Magazzini University of Verona laura.magazzini@univr.it http://dse.univr.it/magazzini May 2010 Laura Magazzini (@univr.it) Non-linear panel data modeling May 2010 1

More information

A FLEXIBLE TIME-VARYING SPECIFICATION OF THE TECHNICAL INEFFICIENCY EFFECTS MODEL

A FLEXIBLE TIME-VARYING SPECIFICATION OF THE TECHNICAL INEFFICIENCY EFFECTS MODEL A FLEXIBLE TIME-VARYING SPECIFICATION OF THE TECHNICAL INEFFICIENCY EFFECTS MODEL Giannis Karagiannis Dept of International and European Economic and Political Studies, University of Macedonia - Greece

More information

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018 Econometrics I KS Module 2: Multivariate Linear Regression Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: April 16, 2018 Alexander Ahammer (JKU) Module 2: Multivariate

More information

Practical Econometrics. for. Finance and Economics. (Econometrics 2)

Practical Econometrics. for. Finance and Economics. (Econometrics 2) Practical Econometrics for Finance and Economics (Econometrics 2) Seppo Pynnönen and Bernd Pape Department of Mathematics and Statistics, University of Vaasa 1. Introduction 1.1 Econometrics Econometrics

More information

(θ θ ), θ θ = 2 L(θ ) θ θ θ θ θ (θ )= H θθ (θ ) 1 d θ (θ )

(θ θ ), θ θ = 2 L(θ ) θ θ θ θ θ (θ )= H θθ (θ ) 1 d θ (θ ) Setting RHS to be zero, 0= (θ )+ 2 L(θ ) (θ θ ), θ θ = 2 L(θ ) 1 (θ )= H θθ (θ ) 1 d θ (θ ) O =0 θ 1 θ 3 θ 2 θ Figure 1: The Newton-Raphson Algorithm where H is the Hessian matrix, d θ is the derivative

More information

Nonparametric Identification of a Binary Random Factor in Cross Section Data - Supplemental Appendix

Nonparametric Identification of a Binary Random Factor in Cross Section Data - Supplemental Appendix Nonparametric Identification of a Binary Random Factor in Cross Section Data - Supplemental Appendix Yingying Dong and Arthur Lewbel California State University Fullerton and Boston College July 2010 Abstract

More information

Robust Stochastic Frontier Analysis: a Minimum Density Power Divergence Approach

Robust Stochastic Frontier Analysis: a Minimum Density Power Divergence Approach Robust Stochastic Frontier Analysis: a Minimum Density Power Divergence Approach Federico Belotti Giuseppe Ilardi CEIS, University of Rome Tor Vergata Bank of Italy Workshop on the Econometrics and Statistics

More information

ESTIMATING FARM EFFICIENCY IN THE PRESENCE OF DOUBLE HETEROSCEDASTICITY USING PANEL DATA K. HADRI *

ESTIMATING FARM EFFICIENCY IN THE PRESENCE OF DOUBLE HETEROSCEDASTICITY USING PANEL DATA K. HADRI * Journal of Applied Economics, Vol. VI, No. 2 (Nov 2003), 255-268 ESTIMATING FARM EFFICIENCY 255 ESTIMATING FARM EFFICIENCY IN THE PRESENCE OF DOUBLE HETEROSCEDASTICITY USING PANEL DATA K. HADRI * Universy

More information

Estimation of Panel Data Stochastic Frontier Models with. Nonparametric Time Varying Inefficiencies 1

Estimation of Panel Data Stochastic Frontier Models with. Nonparametric Time Varying Inefficiencies 1 Estimation of Panel Data Stochastic Frontier Models with Nonparametric Time Varying Inefficiencies Gholamreza Hajargasht Shiraz University Shiraz, Iran D.S. Prasada Rao Centre for Efficiency and Productivity

More information

Advanced Econometrics

Advanced Econometrics Advanced Econometrics Dr. Andrea Beccarini Center for Quantitative Economics Winter 2013/2014 Andrea Beccarini (CQE) Econometrics Winter 2013/2014 1 / 156 General information Aims and prerequisites Objective:

More information

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A. 1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n

More information

Statistics and econometrics

Statistics and econometrics 1 / 36 Slides for the course Statistics and econometrics Part 10: Asymptotic hypothesis testing European University Institute Andrea Ichino September 8, 2014 2 / 36 Outline Why do we need large sample

More information

Birkbeck Working Papers in Economics & Finance

Birkbeck Working Papers in Economics & Finance ISSN 1745-8587 Birkbeck Working Papers in Economics & Finance Department of Economics, Mathematics and Statistics BWPEF 1809 A Note on Specification Testing in Some Structural Regression Models Walter

More information

Functional Form. Econometrics. ADEi.

Functional Form. Econometrics. ADEi. Functional Form Econometrics. ADEi. 1. Introduction We have employed the linear function in our model specification. Why? It is simple and has good mathematical properties. It could be reasonable approximation,

More information

Statistics: Learning models from data

Statistics: Learning models from data DS-GA 1002 Lecture notes 5 October 19, 2015 Statistics: Learning models from data Learning models from data that are assumed to be generated probabilistically from a certain unknown distribution is a crucial

More information

What s New in Econometrics. Lecture 15

What s New in Econometrics. Lecture 15 What s New in Econometrics Lecture 15 Generalized Method of Moments and Empirical Likelihood Guido Imbens NBER Summer Institute, 2007 Outline 1. Introduction 2. Generalized Method of Moments Estimation

More information

Economic modelling and forecasting

Economic modelling and forecasting Economic modelling and forecasting 2-6 February 2015 Bank of England he generalised method of moments Ole Rummel Adviser, CCBS at the Bank of England ole.rummel@bankofengland.co.uk Outline Classical estimation

More information

The outline for Unit 3

The outline for Unit 3 The outline for Unit 3 Unit 1. Introduction: The regression model. Unit 2. Estimation principles. Unit 3: Hypothesis testing principles. 3.1 Wald test. 3.2 Lagrange Multiplier. 3.3 Likelihood Ratio Test.

More information

Missing dependent variables in panel data models

Missing dependent variables in panel data models Missing dependent variables in panel data models Jason Abrevaya Abstract This paper considers estimation of a fixed-effects model in which the dependent variable may be missing. For cross-sectional units

More information

Quaderni di Dipartimento. Small Sample Properties of Copula-GARCH Modelling: A Monte Carlo Study. Carluccio Bianchi (Università di Pavia)

Quaderni di Dipartimento. Small Sample Properties of Copula-GARCH Modelling: A Monte Carlo Study. Carluccio Bianchi (Università di Pavia) Quaderni di Dipartimento Small Sample Properties of Copula-GARCH Modelling: A Monte Carlo Study Carluccio Bianchi (Università di Pavia) Maria Elena De Giuli (Università di Pavia) Dean Fantazzini (Moscow

More information

Lasso Maximum Likelihood Estimation of Parametric Models with Singular Information Matrices

Lasso Maximum Likelihood Estimation of Parametric Models with Singular Information Matrices Article Lasso Maximum Likelihood Estimation of Parametric Models with Singular Information Matrices Fei Jin 1,2 and Lung-fei Lee 3, * 1 School of Economics, Shanghai University of Finance and Economics,

More information

Review of Classical Least Squares. James L. Powell Department of Economics University of California, Berkeley

Review of Classical Least Squares. James L. Powell Department of Economics University of California, Berkeley Review of Classical Least Squares James L. Powell Department of Economics University of California, Berkeley The Classical Linear Model The object of least squares regression methods is to model and estimate

More information

ECON 4160, Autumn term Lecture 1

ECON 4160, Autumn term Lecture 1 ECON 4160, Autumn term 2017. Lecture 1 a) Maximum Likelihood based inference. b) The bivariate normal model Ragnar Nymoen University of Oslo 24 August 2017 1 / 54 Principles of inference I Ordinary least

More information

Math 494: Mathematical Statistics

Math 494: Mathematical Statistics Math 494: Mathematical Statistics Instructor: Jimin Ding jmding@wustl.edu Department of Mathematics Washington University in St. Louis Class materials are available on course website (www.math.wustl.edu/

More information

Financial Econometrics and Volatility Models Copulas

Financial Econometrics and Volatility Models Copulas Financial Econometrics and Volatility Models Copulas Eric Zivot Updated: May 10, 2010 Reading MFTS, chapter 19 FMUND, chapters 6 and 7 Introduction Capturing co-movement between financial asset returns

More information

Binary choice 3.3 Maximum likelihood estimation

Binary choice 3.3 Maximum likelihood estimation Binary choice 3.3 Maximum likelihood estimation Michel Bierlaire Output of the estimation We explain here the various outputs from the maximum likelihood estimation procedure. Solution of the maximum likelihood

More information

A New Generalized Gumbel Copula for Multivariate Distributions

A New Generalized Gumbel Copula for Multivariate Distributions A New Generalized Gumbel Copula for Multivariate Distributions Chandra R. Bhat* The University of Texas at Austin Department of Civil, Architectural & Environmental Engineering University Station, C76,

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrics of Panel Data Jakub Mućk Meeting # 2 Jakub Mućk Econometrics of Panel Data Meeting # 2 1 / 26 Outline 1 Fixed effects model The Least Squares Dummy Variable Estimator The Fixed Effect (Within

More information

Semiparametric Gaussian Copula Models: Progress and Problems

Semiparametric Gaussian Copula Models: Progress and Problems Semiparametric Gaussian Copula Models: Progress and Problems Jon A. Wellner University of Washington, Seattle European Meeting of Statisticians, Amsterdam July 6-10, 2015 EMS Meeting, Amsterdam Based on

More information

Greene, Econometric Analysis (6th ed, 2008)

Greene, Econometric Analysis (6th ed, 2008) EC771: Econometrics, Spring 2010 Greene, Econometric Analysis (6th ed, 2008) Chapter 17: Maximum Likelihood Estimation The preferred estimator in a wide variety of econometric settings is that derived

More information

Max. Likelihood Estimation. Outline. Econometrics II. Ricardo Mora. Notes. Notes

Max. Likelihood Estimation. Outline. Econometrics II. Ricardo Mora. Notes. Notes Maximum Likelihood Estimation Econometrics II Department of Economics Universidad Carlos III de Madrid Máster Universitario en Desarrollo y Crecimiento Económico Outline 1 3 4 General Approaches to Parameter

More information

Improving GMM efficiency in dynamic models for panel data with mean stationarity

Improving GMM efficiency in dynamic models for panel data with mean stationarity Working Paper Series Department of Economics University of Verona Improving GMM efficiency in dynamic models for panel data with mean stationarity Giorgio Calzolari, Laura Magazzini WP Number: 12 July

More information

Parametric Techniques Lecture 3

Parametric Techniques Lecture 3 Parametric Techniques Lecture 3 Jason Corso SUNY at Buffalo 22 January 2009 J. Corso (SUNY at Buffalo) Parametric Techniques Lecture 3 22 January 2009 1 / 39 Introduction In Lecture 2, we learned how to

More information

Semiparametric Gaussian Copula Models: Progress and Problems

Semiparametric Gaussian Copula Models: Progress and Problems Semiparametric Gaussian Copula Models: Progress and Problems Jon A. Wellner University of Washington, Seattle 2015 IMS China, Kunming July 1-4, 2015 2015 IMS China Meeting, Kunming Based on joint work

More information

Multiple Testing of One-Sided Hypotheses: Combining Bonferroni and the Bootstrap

Multiple Testing of One-Sided Hypotheses: Combining Bonferroni and the Bootstrap University of Zurich Department of Economics Working Paper Series ISSN 1664-7041 (print) ISSN 1664-705X (online) Working Paper No. 254 Multiple Testing of One-Sided Hypotheses: Combining Bonferroni and

More information

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008 A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. Linear-in-Parameters Models: IV versus Control Functions 2. Correlated

More information

INFERENCE APPROACHES FOR INSTRUMENTAL VARIABLE QUANTILE REGRESSION. 1. Introduction

INFERENCE APPROACHES FOR INSTRUMENTAL VARIABLE QUANTILE REGRESSION. 1. Introduction INFERENCE APPROACHES FOR INSTRUMENTAL VARIABLE QUANTILE REGRESSION VICTOR CHERNOZHUKOV CHRISTIAN HANSEN MICHAEL JANSSON Abstract. We consider asymptotic and finite-sample confidence bounds in instrumental

More information

Parametric Techniques

Parametric Techniques Parametric Techniques Jason J. Corso SUNY at Buffalo J. Corso (SUNY at Buffalo) Parametric Techniques 1 / 39 Introduction When covering Bayesian Decision Theory, we assumed the full probabilistic structure

More information

MAXIMUM ENTROPIES COPULAS

MAXIMUM ENTROPIES COPULAS MAXIMUM ENTROPIES COPULAS Doriano-Boris Pougaza & Ali Mohammad-Djafari Groupe Problèmes Inverses Laboratoire des Signaux et Systèmes (UMR 8506 CNRS - SUPELEC - UNIV PARIS SUD) Supélec, Plateau de Moulon,

More information

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Spring 2013 Instructor: Victor Aguirregabiria

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Spring 2013 Instructor: Victor Aguirregabiria ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Spring 2013 Instructor: Victor Aguirregabiria SOLUTION TO FINAL EXAM Friday, April 12, 2013. From 9:00-12:00 (3 hours) INSTRUCTIONS:

More information

Specification Test for Instrumental Variables Regression with Many Instruments

Specification Test for Instrumental Variables Regression with Many Instruments Specification Test for Instrumental Variables Regression with Many Instruments Yoonseok Lee and Ryo Okui April 009 Preliminary; comments are welcome Abstract This paper considers specification testing

More information

The properties of L p -GMM estimators

The properties of L p -GMM estimators The properties of L p -GMM estimators Robert de Jong and Chirok Han Michigan State University February 2000 Abstract This paper considers Generalized Method of Moment-type estimators for which a criterion

More information

Estimation, Inference, and Hypothesis Testing

Estimation, Inference, and Hypothesis Testing Chapter 2 Estimation, Inference, and Hypothesis Testing Note: The primary reference for these notes is Ch. 7 and 8 of Casella & Berger 2. This text may be challenging if new to this topic and Ch. 7 of

More information

Research Article Optimal Portfolio Estimation for Dependent Financial Returns with Generalized Empirical Likelihood

Research Article Optimal Portfolio Estimation for Dependent Financial Returns with Generalized Empirical Likelihood Advances in Decision Sciences Volume 2012, Article ID 973173, 8 pages doi:10.1155/2012/973173 Research Article Optimal Portfolio Estimation for Dependent Financial Returns with Generalized Empirical Likelihood

More information

The profit function system with output- and input- specific technical efficiency

The profit function system with output- and input- specific technical efficiency The profit function system with output- and input- specific technical efficiency Mike G. Tsionas December 19, 2016 Abstract In a recent paper Kumbhakar and Lai (2016) proposed an output-oriented non-radial

More information

Trivariate copulas for characterisation of droughts

Trivariate copulas for characterisation of droughts ANZIAM J. 49 (EMAC2007) pp.c306 C323, 2008 C306 Trivariate copulas for characterisation of droughts G. Wong 1 M. F. Lambert 2 A. V. Metcalfe 3 (Received 3 August 2007; revised 4 January 2008) Abstract

More information

Advanced Econometrics

Advanced Econometrics Based on the textbook by Verbeek: A Guide to Modern Econometrics Robert M. Kunst robert.kunst@univie.ac.at University of Vienna and Institute for Advanced Studies Vienna May 16, 2013 Outline Univariate

More information

Cointegration Lecture I: Introduction

Cointegration Lecture I: Introduction 1 Cointegration Lecture I: Introduction Julia Giese Nuffield College julia.giese@economics.ox.ac.uk Hilary Term 2008 2 Outline Introduction Estimation of unrestricted VAR Non-stationarity Deterministic

More information

Model Specification Testing in Nonparametric and Semiparametric Time Series Econometrics. Jiti Gao

Model Specification Testing in Nonparametric and Semiparametric Time Series Econometrics. Jiti Gao Model Specification Testing in Nonparametric and Semiparametric Time Series Econometrics Jiti Gao Department of Statistics School of Mathematics and Statistics The University of Western Australia Crawley

More information

ASSESSING A VECTOR PARAMETER

ASSESSING A VECTOR PARAMETER SUMMARY ASSESSING A VECTOR PARAMETER By D.A.S. Fraser and N. Reid Department of Statistics, University of Toronto St. George Street, Toronto, Canada M5S 3G3 dfraser@utstat.toronto.edu Some key words. Ancillary;

More information

Confidence Statements for Efficiency Estimates from Stochastic Frontier Models

Confidence Statements for Efficiency Estimates from Stochastic Frontier Models Syracuse University SURFACE Economics Faculty Scholarship Maxwell School of Citizenship and Public Affairs 6-20-2002 Confidence Statements for Efficiency Estimates from Stochastic Frontier Models William

More information

Spring 2017 Econ 574 Roger Koenker. Lecture 14 GEE-GMM

Spring 2017 Econ 574 Roger Koenker. Lecture 14 GEE-GMM University of Illinois Department of Economics Spring 2017 Econ 574 Roger Koenker Lecture 14 GEE-GMM Throughout the course we have emphasized methods of estimation and inference based on the principle

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrics of Panel Data Jakub Mućk Meeting # 3 Jakub Mućk Econometrics of Panel Data Meeting # 3 1 / 21 Outline 1 Fixed or Random Hausman Test 2 Between Estimator 3 Coefficient of determination (R 2

More information

MS&E 226: Small Data. Lecture 11: Maximum likelihood (v2) Ramesh Johari

MS&E 226: Small Data. Lecture 11: Maximum likelihood (v2) Ramesh Johari MS&E 226: Small Data Lecture 11: Maximum likelihood (v2) Ramesh Johari ramesh.johari@stanford.edu 1 / 18 The likelihood function 2 / 18 Estimating the parameter This lecture develops the methodology behind

More information

Convolution Based Unit Root Processes: a Simulation Approach

Convolution Based Unit Root Processes: a Simulation Approach International Journal of Statistics and Probability; Vol., No. 6; November 26 ISSN 927-732 E-ISSN 927-74 Published by Canadian Center of Science and Education Convolution Based Unit Root Processes: a Simulation

More information

8. Hypothesis Testing

8. Hypothesis Testing FE661 - Statistical Methods for Financial Engineering 8. Hypothesis Testing Jitkomut Songsiri introduction Wald test likelihood-based tests significance test for linear regression 8-1 Introduction elements

More information

An Information Theoretic Approach to Flexible Stochastic Frontier Models

An Information Theoretic Approach to Flexible Stochastic Frontier Models An Information Theoretic Approach to Flexible Stochastic Frontier Models Douglas J. Miller July 16, 007 Abstract Parametric stochastic frontier models have a long history in applied production economics,

More information

Estimation of growth convergence using a stochastic production frontier approach

Estimation of growth convergence using a stochastic production frontier approach Economics Letters 88 (2005) 300 305 www.elsevier.com/locate/econbase Estimation of growth convergence using a stochastic production frontier approach Subal C. Kumbhakar a, Hung-Jen Wang b, T a Department

More information

Lecture 3 September 1

Lecture 3 September 1 STAT 383C: Statistical Modeling I Fall 2016 Lecture 3 September 1 Lecturer: Purnamrita Sarkar Scribe: Giorgio Paulon, Carlos Zanini Disclaimer: These scribe notes have been slightly proofread and may have

More information

How to select a good vine

How to select a good vine Universitetet i Oslo ingrihaf@math.uio.no International FocuStat Workshop on Focused Information Criteria and Related Themes, May 9-11, 2016 Copulae Regular vines Model selection and reduction Limitations

More information

Likelihood and p-value functions in the composite likelihood context

Likelihood and p-value functions in the composite likelihood context Likelihood and p-value functions in the composite likelihood context D.A.S. Fraser and N. Reid Department of Statistical Sciences University of Toronto November 19, 2016 Abstract The need for combining

More information

Conditional Least Squares and Copulae in Claims Reserving for a Single Line of Business

Conditional Least Squares and Copulae in Claims Reserving for a Single Line of Business Conditional Least Squares and Copulae in Claims Reserving for a Single Line of Business Michal Pešta Charles University in Prague Faculty of Mathematics and Physics Ostap Okhrin Dresden University of Technology

More information

Do Markov-Switching Models Capture Nonlinearities in the Data? Tests using Nonparametric Methods

Do Markov-Switching Models Capture Nonlinearities in the Data? Tests using Nonparametric Methods Do Markov-Switching Models Capture Nonlinearities in the Data? Tests using Nonparametric Methods Robert V. Breunig Centre for Economic Policy Research, Research School of Social Sciences and School of

More information

Minimum Hellinger Distance Estimation in a. Semiparametric Mixture Model

Minimum Hellinger Distance Estimation in a. Semiparametric Mixture Model Minimum Hellinger Distance Estimation in a Semiparametric Mixture Model Sijia Xiang 1, Weixin Yao 1, and Jingjing Wu 2 1 Department of Statistics, Kansas State University, Manhattan, Kansas, USA 66506-0802.

More information

Statistics 3858 : Maximum Likelihood Estimators

Statistics 3858 : Maximum Likelihood Estimators Statistics 3858 : Maximum Likelihood Estimators 1 Method of Maximum Likelihood In this method we construct the so called likelihood function, that is L(θ) = L(θ; X 1, X 2,..., X n ) = f n (X 1, X 2,...,

More information

CS 195-5: Machine Learning Problem Set 1

CS 195-5: Machine Learning Problem Set 1 CS 95-5: Machine Learning Problem Set Douglas Lanman dlanman@brown.edu 7 September Regression Problem Show that the prediction errors y f(x; ŵ) are necessarily uncorrelated with any linear function of

More information

Brief Review on Estimation Theory

Brief Review on Estimation Theory Brief Review on Estimation Theory K. Abed-Meraim ENST PARIS, Signal and Image Processing Dept. abed@tsi.enst.fr This presentation is essentially based on the course BASTA by E. Moulines Brief review on

More information

Evaluating the Importance of Multiple Imputations of Missing Data on Stochastic Frontier Analysis Efficiency Measures

Evaluating the Importance of Multiple Imputations of Missing Data on Stochastic Frontier Analysis Efficiency Measures Evaluating the Importance of Multiple Imputations of Missing Data on Stochastic Frontier Analysis Efficiency Measures Saleem Shaik 1 and Oleksiy Tokovenko 2 Selected Paper prepared for presentation at

More information

Testing Homogeneity Of A Large Data Set By Bootstrapping

Testing Homogeneity Of A Large Data Set By Bootstrapping Testing Homogeneity Of A Large Data Set By Bootstrapping 1 Morimune, K and 2 Hoshino, Y 1 Graduate School of Economics, Kyoto University Yoshida Honcho Sakyo Kyoto 606-8501, Japan. E-Mail: morimune@econ.kyoto-u.ac.jp

More information

Quantile methods. Class Notes Manuel Arellano December 1, Let F (r) =Pr(Y r). Forτ (0, 1), theτth population quantile of Y is defined to be

Quantile methods. Class Notes Manuel Arellano December 1, Let F (r) =Pr(Y r). Forτ (0, 1), theτth population quantile of Y is defined to be Quantile methods Class Notes Manuel Arellano December 1, 2009 1 Unconditional quantiles Let F (r) =Pr(Y r). Forτ (0, 1), theτth population quantile of Y is defined to be Q τ (Y ) q τ F 1 (τ) =inf{r : F

More information

Does k-th Moment Exist?

Does k-th Moment Exist? Does k-th Moment Exist? Hitomi, K. 1 and Y. Nishiyama 2 1 Kyoto Institute of Technology, Japan 2 Institute of Economic Research, Kyoto University, Japan Email: hitomi@kit.ac.jp Keywords: Existence of moments,

More information

GMM and SMM. 1. Hansen, L Large Sample Properties of Generalized Method of Moments Estimators, Econometrica, 50, p

GMM and SMM. 1. Hansen, L Large Sample Properties of Generalized Method of Moments Estimators, Econometrica, 50, p GMM and SMM Some useful references: 1. Hansen, L. 1982. Large Sample Properties of Generalized Method of Moments Estimators, Econometrica, 50, p. 1029-54. 2. Lee, B.S. and B. Ingram. 1991 Simulation estimation

More information

Nonparametric Econometrics

Nonparametric Econometrics Applied Microeconometrics with Stata Nonparametric Econometrics Spring Term 2011 1 / 37 Contents Introduction The histogram estimator The kernel density estimator Nonparametric regression estimators Semi-

More information

Regression. Oscar García

Regression. Oscar García Regression Oscar García Regression methods are fundamental in Forest Mensuration For a more concise and general presentation, we shall first review some matrix concepts 1 Matrices An order n m matrix is

More information

Maximum Likelihood Tests and Quasi-Maximum-Likelihood

Maximum Likelihood Tests and Quasi-Maximum-Likelihood Maximum Likelihood Tests and Quasi-Maximum-Likelihood Wendelin Schnedler Department of Economics University of Heidelberg 10. Dezember 2007 Wendelin Schnedler (AWI) Maximum Likelihood Tests and Quasi-Maximum-Likelihood10.

More information

Markov Switching Regular Vine Copulas

Markov Switching Regular Vine Copulas Int. Statistical Inst.: Proc. 58th World Statistical Congress, 2011, Dublin (Session CPS057) p.5304 Markov Switching Regular Vine Copulas Stöber, Jakob and Czado, Claudia Lehrstuhl für Mathematische Statistik,

More information