EFFICIENCY OF MODEL-ASSISTED REGRESSION ESTIMATORS IN SAMPLE SURVEYS

Size: px
Start display at page:

Download "EFFICIENCY OF MODEL-ASSISTED REGRESSION ESTIMATORS IN SAMPLE SURVEYS"

Transcription

1 Statistica Sinica , doi:ttp://dx.doi.org/ /ss EFFICIENCY OF MODEL-ASSISTED REGRESSION ESTIMATORS IN SAMPLE SURVEYS Jun Sao 1,2 and Seng Wang 3 1 East Cina Normal University, 2 University of Wisconsin-Madison and 3 Matematica Policy Researc Abstract: Model-assisted regression estimators are popular in sample surveys for making use of auxiliary information and improving te Horvitz-Tompson estimators of population totals. In te presence of strata and unequal probability sampling, owever, tere are several ways to form model-assisted regression estimators: regression witin eac stratum or regression by combining all strata, and a separate ratio adjustment for population size, or a combined ratio adjustment, or no adjustment. In te literature, tere is no compreensive teoretical comparison of tese regression estimators. We compare te asymptotic efficiencies of six model-assisted regression estimators under two asymptotic settings. Wen tere are a fixed number of strata wit large stratum sample sizes, our result sows tat one of te six regression estimators is a clear winner in terms of asymptotic efficiency. Wen tere are a large number of strata wit small stratum sample sizes, owever, te story is different. Some comparisons in special cases are also made. Some simulation results are presented to examine finite sample performances of regression estimators and teir variance estimators. Key words and prases: Asymptotic efficiency, bootstrap, combined regression estimators, separate regression estimators, unequal probability witout replacement sampling, variance estimation. 1. Introduction Auxiliary information from sources suc as administrative records is often available in sample surveys. Model-assisted regression estimators are te most popular estimators tat utilize auxiliary information to gain efficiency in estimating population totals. Altoug regression estimators are constructed using a regression model between te variables of interest and some covariates, tey are consistent and asymptotically normal under te traditional design-based framework in wic te randomness of survey estimators is from repeated sampling. Tus tey are robust against violation of te assumed regression model. But, wen te regression model is correct, tey are more efficient tan te estimators tat do not use te auxiliary information. Let U denote te finite population of interest wit N units stratified into H strata, U = U 1 U H, were U contains N units and N 1 + +

2 396 JUN SHAO AND SHENG WANG N H = N. For unit i U, let y i be a nonconstant variable of interest and x i be a nonconstant auxiliary variable associated wit y i. Altoug we consider a univariate x i trougout, all results can be generalized to te case of multivariate x i in a straigtforward manner. Consider te estimation of te population total Y = y i based on a sample S = S 1 S H from U, were eac S is a sample from U according to some probability sampling plan and S 1,..., S H are independently selected. Let n be te size of S and n = n n H. For i S, y i is observed. Te Horvitz-Tompson estimator of te total Y is Ŷ = Ŷ, Ŷ = i S w i y i, were wi 1 is te first-order inclusion probability of unit i in S, a known quantity from te sampling design. Under te traditional design-based framework, y i s are fixed values and te only randomness is from te repeated selection of S from U. Let E s be te expectation wit respect to repeated sampling. Ten, it is well known tat E s Ŷ = Y. For te auxiliary variable, te value of x i is observed for i S, and te value of X = x i is known e.g., from administrative records for eac. To utilize te auxiliary variable x i, te model-assisted approac e.g., Särndal, Swensson, and Wretman 1992 can be adopted to form regression estimators. In te presence of strata, owever, tere are two ways to apply te regression: regression witin eac stratum or regression by combining all strata. Also, one can apply a separate ratio adjustment for population size, or a combined ratio adjustment, or no adjustment. Tis leads to six regression estimators of Y : Ŷ C0 = [Ŷ + ˆβX ˆX ] = Ŷ + ˆβX ˆX, 1.1 Ŷ S0 = [Ŷ + ˆβ X ˆX ] = Ŷ + ˆβ X ˆX, 1.2 Ŷ CC = [ N Ŷ + ˆβ X N ˆX ] = NŶ + ˆβ X N ˆX, 1.3 Ŷ SC = [ N Ŷ + ˆβ X N ˆX ] = NŶ + ˆβ X N ˆX, 1.4 Ŷ CS = [ N Ŷ + ˆβ X N ˆX ] = N Ŷ + ˆβ X N ˆX, 1.5 Ŷ SS = [ N Ŷ + ˆβ X N ˆX ], 1.6

3 MODEL-ASSISTED REGRESSION ESTIMATORS 397 were X = x i, ˆX = i S w ix i, ˆX = i S w i x i, = i S w i, = i S w i, i S ˆβ = w i y i x i ˆX / i S w i x i ˆX / is te slope estimator assuming a single regression line over all strata, and i S ˆβ = w i y i x i ˆX / i S w i x i ˆX / is te slope estimator for te regression line witin stratum. Te first subscript u in Ŷuv given by indicates weter a combined regression line over all strata is assumed u = C or a separate regression line for eac stratum is considered u = S. Te second subscript v in Ŷuv indicates weter no ratio adjustment for population size is applied v = 0, a combined over strata ratio adjustment N/ is applied v = C, or a separate ratio adjustment N / witin eac stratum is applied v = S. Some of te estimators in are te same or similar to tose proposed by previous researcers e.g., Isaki and Fuller 1982, Wrigt 1983, Montanari 1987, Särndal, Swensson, and Wretman 1992 and Fuller 2009 and are frequently used in practice; te oters are included for comparison. Wen combined regression is applied, knowing X, instead of eac X, is enoug for estimators ŶC0, ŶCC, and ŶCS. Intuitively, regression by combining strata is motivated by te tinking tat it produces more efficient estimators wen all regression models across strata are te same wic is not always true as we sow in Section 2. Altoug some discussions on te relative efficiencies among te six estimators in can be found in te literature e.g., Cocran 1977, Särndal, Swensson, and Wretman 1992 and Fuller 2009, a compreensive teoretical comparison of tese estimators is not available. Te purpose of tis paper is to study te asymptotic relative efficiencies among tese six estimators under te model-assisted approac tat imposes a regression model between y i and x i in eac stratum. We consider two types of sampling designs, one wit a fixed number of strata and a large number of sampled units witin eac stratum and te oter one wit a large number of strata and a few sampled units witin eac stratum. Under te first type of design, we are able to draw some definite conclusions on te relative performance of estimators provided te regression models are correct. Note tat if we do not use any model, all six estimators are consistent and asymptotically normal under te design-based framework, but teir asymptotic relative efficiencies cannot be precisely assessed unless some special designs are considered, for example, stratified simple random sampling. An example is given in Section 2.

4 398 JUN SHAO AND SHENG WANG Under te model-assisted framework, some asymptotic results for two types of designs are given in Sections 2 and 3, respectively. Some simulation results are presented in Section 4. Te last section contains some discussion. All tecnical proofs are given in te Appendix. 2. Asymptotic Results for Large n s and Fixed H To consider asymptotics, te population U is viewed as a member of a sequence of populations {U k, k = 1, 2,...}, were te number of units in U k increases to infinity as k. All quantities, suc as y i, N, etc., depend on te index k but k is omitted for simplicity. All limiting processes are wit respect to k. In tis section, we assume tat, in eac stratum, te population size N and te sample size n as k. Te number of strata, H, is fixed. Te sampling design wit a fixed H and large n s is commonly used in many surveys, for example, business surveys. We can establis te asymptotic normality of estimators in wen te y i s and x i s are fixed values and te randomness is from te sample selection of S, = 1,..., H. A simple example, owever, indicates tat, under te design-based approac, we cannot generally tell wic estimator in is te most efficient one unless a special design is considered. Consider tat H = 1, eac i U as probability p i to be selected, and sampling is wit replacement. We compare te estimators Ŷ1 = i S y i/np i and Ŷ2 = NŶ1/, = i S 1/np i, tat are special cases of estimators in 1.1 and 1.2, respectively. Suppose tere exists a positive constant M suc tat M 1 Np i M for all i U and N 1 y2 i M. Ten, by te Central Limit Teorem, nŷ1 Y /N d d v 1 N0, 1, were denotes convergence in distribution and v 1 = 1 2 yi N 2 p i Y. p i Applying te delta metod, we obtain tat nŷ2 Y /N v 2 d N0, 1, were v 2 = v 1 2Y 1 yi N 3 p i N Y + Y p i p i N 4 p i N. p i Ten, Ŷ2 is asymptotically more efficient tan Ŷ1 if and only if Ȳ 2 N 2 1 < 2Ȳ p i N 2 y i p i Ȳ 2, 2.1

5 MODEL-ASSISTED REGRESSION ESTIMATORS 399 were Ȳ = Y/N. Under te design-based framework, owever, y i s are fixed values and we cannot tell weter 2.1 olds except for a few special cases. Tis is similar to te discussion in Section 6.6 of Cocran If we assume te y i s are iid wit mean µ y, ten 2.1 olds for sufficiently large N unless µ y = 0 or p i = N 1 simple random sampling. Tis can be seen as follows. Suppose µ y = 0 and p i is not a constant. By te Law of Large Numbers and te condition M 1 Np i M for all i U, y i Ȳ µ y and p i N 2 µ y p i N 2 0 a.s.. Tis togeter wit Jensen s inequality > N p i N 1 p i = N implies tat 2.1 olds for large enoug N. We consider te model-assisted approac tat views y i, x i, i U, as random vectors following a model. To utilize te auxiliary variable x i, we consider te regression model y i = α + β x i +, i U, = 1,..., H, 2.2 were α and β are, respectively, te unknown intercept and slope for te regression witin stratum U,, i U, are iid wit mean 0 and an unknown variance, x i, i U, are iid wit an unknown variance, and s are independent of x i s. Te estimator ˆβ in 1.8 is te least squares estimator of β, = 1,..., H. If we combine all regression lines into a single line, ten te estimator ˆβ in 1.7 is te least squares estimator of te common slope. Altoug te combined regression is wrong wen β s are unequal, estimators in are still consistent and asymptotically normal since tey are model-assisted estimators. Tis is even true wen model 2.2 is incorrect. A teorem establises te asymptotic normality of estimators , wit respect to te probability under regression model 2.2 and repeated sampling. Let P s, E s, and V s be te probability, expectation, and variance wit respect to repeated sampling, and let P m, E m, and V m be te probability, expectation, and variance wit respect to model 2.2. We require some conditions. C1 For any, n /N 0 and tere exists φ suc tat N /N φ. C2 For any, σx 2 = V mx i <, σ 2 = V m <, i U, and tere exists δ > 0 suc tat as k, E m R w i 1x i µ x 2+δ + E m x i µ x 2+δ i S /S 0, [ ]1+δ/2 V m R i S w i x i x i

6 400 JUN SHAO AND SHENG WANG E m R w i 1 2+δ + E m 2+δ i S /S 0, [ ]1+δ/2 V m R i S w i were R = 1, /N, or /N, and µ x = E m x i for i U. C3 Tere exists M > 1 suc tat for any, N /n M < w i < N M/n. Teorem 1. If 2.2 and C1 C3 old, ten Ŷ uv Y N ϕ uv d N0, 1, were u = C or S, v = 0, C, or S, [ ] ϕ SS = N 2 σ2 w i N, ϕ CS = ϕ SS + 1 N 2 σ2 x β β 2 w i N, ϕ SC = ϕ SS + V s α N N, ϕ CC = ϕ CS + V s [α + µ x β β] N N, ϕ S0 = ϕ SS + 1 and ϕ C0 = ϕ CS + 1 N 2 V s V N 2 s α N, [ ] α + µ x β β N, β = N β σx 2 N σx 2. Te proof is given in te Appendix. Te condition n /N 0 is not needed wen sampling is wit replacement or simple random sampling witout replacement. Te following are some conclusions for te relative efficiencies among estimators in For two estimators Ŷu and Ŷv, Ŷu Ŷv denotes tat Ŷu is asymptotically as efficient as Ŷv, Ŷ u Ŷv denotes tat Ŷu is asymptotically more efficient tan Ŷv, and Ŷu = Ŷv denotes tat Ŷu and Ŷv are asymptotically equivalent. 1. Asymptotically, no oter estimator is more efficient tan ŶSS, even wen regression lines across strata are te same. Some oter estimators may be asymptotically as efficient as ŶSS under some special situations.

7 MODEL-ASSISTED REGRESSION ESTIMATORS Wen α = α 0 and β = β for all, ŶCC = ŶCS = ŶSC = ŶSS ŶC0 = Ŷ S0. Wen α = 0 and β = β for all, all estimators in are asymptotically equivalent. 3. Wen β = β for all, ŶC0 = ŶS0, ŶCC = ŶSC, ŶCS = ŶSS, ŶSS ŶS0, and Ŷ SS ŶSC. Wen = N, ŶS0 = ŶSC. No general conclusion can be made about ŶS0 and ŶSC. 4. Wen tere exist 1 and 2 suc tat β 1 β 2, Ŷ S0 ŶC0, Ŷ SC ŶCC and ŶSS ŶCS. Tus, applying te same ratio, te estimators using separate lines are more efficient tan tose using a single line. 5. Wen α = α 0 for all, Ŷ SC = Ŷ SS ŶS0. Wen α = 0 for all, Ŷ SC = Ŷ SS = Ŷ S0. 6. Wen tere exist 1 and 2 suc tat α 1 α 2, ŶSS ŶSC and ŶSS ŶS0 unless 1 = N 1 and 2 = N 2. Tere is a definite conclusion. Asymptotically, ŶSS is te winner among all estimators in even wen te regression lines across strata are te same. Tis penomenon as been noticed by previous researcers see, e.g., Fuller One explanation is tat, wen all n s are large, te information about te equality of regression lines does not elp to improve model-assisted estimators of te forms tat are robust against model violation. Some model-based estimators may be more efficient tan ŶSS wen all regression lines are te same, but tey are not robust against model violation. Furtermore, te situation is quite different if some n s are small, as te results in te next section indicate. Anoter point is tat te result in Teorem 1 still olds if ˆβ in 1.8 is replaced by a different consistent estimator of β. Fuller 2009 indicates tat, given a sampling design, an optimal slope estimator can be derived e.g., in Fuller 2009 tat is asymptotically more efficient tan ˆβ in 1.8 in terms of estimating β. For te estimation of Y, owever, te efficiency of ˆβ is a second-order asymptotic effect. It does not improve te asymptotic efficiency of Ŷ SS. 3. Asymptotic Results for Large H and Small n s To increase efficiency, many surveys involve a large number of strata. To reduce te cost, a few units are sampled from eac stratum. See, for example, Krewski and Rao We consider a different asymptotic setting: all n s are bounded by a fixed constant and H. For estimators based on separate regressions, Ŷ SS, Ŷ SC, and ŶS0, eac n as to be sufficiently large so tat a regression witin stratum can be fitted. Since model 2.2 involves a two term regression function, we require n > 2 for

8 402 JUN SHAO AND SHENG WANG Ŷ SS, ŶSC, and ŶS0. If x i is multivariate wit dimension p, ten n > p + 1 for eac is required. Wen n is small, it is not possible to obtain a consistent estimator for a parameter related only wit stratum U. However, quantities suc as totals over all strata can be consistently estimated. Consider ŶSS as an example. Let E m x be te expectation under 2.2, conditional on te x i s. Ten E m x y i = α + β x i, i U, and [ N E m x Ŷ + E m x ˆβ X N ˆX ] E m x ŶSS = = [ N α + N β ˆX + β X N ˆX ] = = N α + β X E m x Y = E m x Y. Tus, ŶSS is unbiased wit respect to model 2.2. Since ŶSS is a sum over of independent random variables aving finite means and variances under some conditions, we can apply Liapounov s Central Limit Teorem to establis te asymptotic normality of ŶSS. Tis is te approac considered by Krewski and Rao We need some conditions. D1 Tere exists M 1 suc tat α + β µ x < M 1 for any. D2 Tere exists M 2 suc tat 1/M 2 < σ 2 x = V mx i < M 2 and 1/M 2 < σ 2 = V m < M 2 for any, and tere exists δ > 0 suc tat, as H, H R E m Ŷ + Q X R ˆX E m [R Ŷ + Q X R ˆX ] =1 { H =1 2+δ } 0, 2+δ V m [R Ŷ + Q X R ˆX ] were R = 1, /N, or /N, and Q = ˆβ or ˆβ. D3 As H, β i S w i x i ˆX / 2 i S w i x i ˆX / p β, 2 E s { n N H N } [α + β βµ x ] N 0. =1

9 MODEL-ASSISTED REGRESSION ESTIMATORS 403 Teorem 2. If model 2.2, C3, and D1 D3 old, ten were u = C or S, v = 0, C, or S, and Ŷ uv Y N ϕ uv d N0, 1, ϕ C0 = 1 N 2 V s Λ + 1 [ N 2 Γ ϕ S0 = 1 [ N 2 E se m ϕ CC = V s σ 2 i S Λ ϕ SC = 1 [ N 2 E se m ϕ CS = 1 N 2 E s ϕ SS = 1 N 2 [ + 1 i S σ 2 N 2 Γ 2 [ σ 2 E se m ζ i +w i 1 [ N 2 Γ ζ i +w i 1 i S ] w i N, 2 ] + 1 N 2 σ 2 N n + 1 N 2 V s α, ] w i N, 2 ] + 1 N 2 σ 2 N n +V s ], wi 2 N i S ψ i + N w i 1 2 ] + 1 N 2 σ 2 N n, wit Λ = α + β βµ x, Γ = σ 2 + β β 2 σ 2 x, x = ˆX /, ζ i = α, x i x w i ˆX x i x 2 X, and ψ i = x i x w i N ˆX w i x i x 2 X. w i i S i S Te proof is given in te Appendix. Note tat i S ψ i = 0 and i S ζ i = 0 for any. If i S x i x w 2 i = 0 for any, wic occurs for example wen w i is a constant witin stratum, ten ϕ SS = 1 { N 2 σ 2 E } se m ψi 2 N β β 2 σ 2 2 x E s w i S 2 i 2 N +ϕ CS. i S 3.1 As for te relative efficiencies among estimators in , we ave te following discussion. 1. Wit simple random sampling SRS witin eac stratum, = N, = N, Ŷ C0 = ŶCC = ŶCS, and ŶS0 = ŶSC = ŶSS. Tus, we compare ŶCS and ŶSS. Under SRS witin eac stratum, i S ψ i w i = 0 for any. If β = β for

10 404 JUN SHAO AND SHENG WANG any, ten it follows from 3.1 tat ϕ SS = 1 { N 2 σ 2 E se m i S ψ 2 i } + ϕ CS > ϕ CS 3.2 and, ence, ŶCS ŶSS. However, it also follows from 3.1 tat, wen β β 2 increases to, te ratio ϕ SS /ϕ CS increases to infinity, so ŶSS ŶCS if β β 2 is sufficiently large for some. 2. Under a general sampling design, tere is no definite conclusion about te relative efficiency among ŶC0, Ŷ CC, and ŶCS, because ϕ C0, ϕ CC, and ϕ CS involve expectations and variances wit respect to sampling. For example, tere is no definite conclusion on wic of 1 N 2 V s Λ and V s Λ is larger. Similarly, we are not able to draw a definite conclusion about te relative efficiency among ŶS0, ŶSC, and ŶSS. 3. If β = β and i S ψ i w i = 0 for any, ten ŶCS ŶSS. Tis is because 3.1 olds and, if β = β, ten 3.2 also olds. Similarly, if i S ζ i w i = 0 for any, ten ŶCC ŶSC and ŶC0 ŶS0. If we furter assume tat α = α 0 for any, ten ŶCC ŶC0 and ŶSC ŶS0. Wen α = 0 for any, ŶCC = ŶC0 and ŶSC = ŶS0. 4. If te β s are not all te same, β β 2 increases to for some, ϕ C0, ϕ CC, and ϕ CS all diverge to infinity. On te oter and, none of ϕ C0, ϕ CC, and ϕ CS depends on β β 2. Tis means tat ŶS0 ŶC0, ŶSC ŶCC, and Ŷ SS ŶCS wen β s are different and te differences are large enoug. Te result is quite different from tat in Section 2 were all n s are large and H is fixed. In Section 2 we found a clear winner in terms of asymptotic efficiency, Ŷ SS, regardless of te scenario, wereas for small n s, tere is no winner even if we use SRS witin eac stratum. Te asymptotic relative efficiency between Ŷ CS and ŶSS depends on ow te β s differ. 4. Empirical Results In tis section, we present some empirical results using simulated data sets. We also discuss variance estimation and examine related confidence intervals Simulation results for large n s and fixed H We considered a population wit H = 4 strata. In stratum, we generated independent x i Γϑ, θ, i = 1,..., N = 2000, were Γϑ, θ is te gamma

11 MODEL-ASSISTED REGRESSION ESTIMATORS 405 distribution wit sape parameter ϑ and scale parameter θ, y i = α + β x i + and z i = 30 + x i + ν i, were s and ν i s are independent and normally distributed wit mean 0 and variance 50, and tey are independent of x i s. Values from different strata were independently generated. Te values of te parameters of te gamma distribution in eac stratum were as follows. ϑ 1 θ 1 ϑ 2 θ 2 ϑ 3 θ 3 ϑ 4 θ For eac, we adopted te Rao-Hartley-Cocran RHC sampling sceme to obtain a sample of size n = 200 witout replacement from U using z i s as te weigts see, e.g., Sampford Divide U into n subgroups U 1,..., U n, were eac group consists of k = N /n units. 2. Select unit j from eac subgroup U q using {z i : i U q } as weigts. Under tis sampling sceme, wj 1 = z j / q z i. 3. Samples across strata are independently sampled. Table 1 reports te value of α, β and, based on 2000 simulations, te biases and standard deviations SD of six estimators, two estimated SD, ŜD S obtained by substituting asymptotic variance formulas derived in Section 3 and ŜD B based on te bootstrap variance estimator described in Bickel and Freedman 1984, Sao and Tu 1995, and Antal and Tillé 2011 wit B = 300, togeter wit te coverage probability, CP u, of te approximate 95% confidence interval for te population total Y : [estimated total 1.96ŜD u, estimated total+1.96ŝd u], were u = S or B. Te following observations summarize te results in Table All estimators ave negligible biases. 2. Te simulation results support te asymptotic teory in Teorem 1. Wen β s and α s are different, ŶSS as te best performance. Wen α s and/or β s are te same, oter estimators are comparable wit ŶSS. If ŶSS is not te best, ten te difference between te SD of ŶSS and te SD of te best estimator is negligible. 3. Substitution and bootstrap variance estimators perform reasonably well. Wen te estimator of SD performs well, so does te 95% approximate confidence interval. In a few cases te substitution estimator ŜD S overestimates substantially, wic results in a too large CP. Te bootstrap estimator ŜD B is more stable, but always overestimates in our simulation.

12 406 JUN SHAO AND SHENG WANG Table 1. Simulation Results for Six Estimators Large n s and Fixed H. α β Y Ŷ C0 Ŷ S0 Ŷ CC Ŷ SC Ŷ CS Ŷ SS Bias SD ŜD S ŜD B CP S 93.7% 93.2% 93.4% 93.3% 93.5% 93.1% CP B 94.9% 94.8% 94.8% 94.6% 94.9% 94.6% Bias SD ŜD S ŜD B CP S 97.1% 95.1% 97.0% 94.9% 95.6% 95.1% CP B 96.2% 96.1% 96.4% 96.1% 96.4% 96.1% Bias SD ŜD S ŜD B CP S 95.0% 95.0% 94.8% 95.0% 94.5% 94.8% CP B 95.2% 95.4% 95.6% 96.1% 96.0% 95.7% Bias SD ŜD S ŜD B CP S 97.0% 95.0% 99.9% 94.3% 96.1% 93.9% CP B 96.8% 95.2% 97.2% 94.7% 96.4% 95.1% 4.2. Simulation results for large H and small n s We considered a population wit H = 600 strata. In stratum, we independently generated x i Γϑ, θ, i = 1,..., N = 60, were ϑ and θ were independently generated from te uniform U6, 8. In stratum, y i s and z i s were generated as in Section 4.2. Values from different strata were independently generated. In stratum, a sample of size n = 4 was taken witout replacement witin U, using SRS or RHC as described in Section 4.1. Samples across strata were independently sampled. Te regression parameters were cosen as follows: α, β = a 1, b 1, = 1,..., H/2, and α, β = a 2, b 2, = H/2 + 1,..., H, were values of a k and b k, k = 1, 2, are sown in Table 2. Table 2 reports te same quantities as tose in Table 1 for SRS or RHC, except tat tere is only one estimated SD and ence one CP for eac estimator. For ŶC0, ŶCC, and ŶCS, te estimated SD, ŜD, is based on te bootstrap wit B = 300 and bootstrap sample size n 1 see, for example, McCarty and Snowden 1985, for te reason of using n 1. Tis bootstrap metod, owever,

13 MODEL-ASSISTED REGRESSION ESTIMATORS 407 Table 2. Simulation Results for Six Estimators Large H and Small n s. k a k b k Y Sampling Ŷ C0 Ŷ S0 Ŷ CC Ŷ SC Ŷ CS Ŷ SS SRS Bias SD ŜD CP 95.6% 95.8% RHC Bias SD ŜD CP 95.2% 95.0% 94.8% 95.4% 95.0% 94.4% SRS Bias SD ŜD CP 96.8% 94.8% RHC Bias SD ŜD CP 94.8% 95.6% 95.2% 97.2% 95.4% 94.6% SRS Bias SD ŜD CP 95.2% 96.2% RHC Bias SD ŜD CP 95.4% 94.8% 94.6% 96.8% 94.0% 95.2% SRS Bias SD ŜD CP 96% 95.1% RHC Bias SD ŜD CP 96.8% 95.0% 96.8% 96.3% 96.5% 94.2% does not work for ŶS0, Ŷ SC, and ŶSS, because it is igly possible tat te bootstrap sample in stratum contains only one or two points y i, x i from S wen n is small, wic prevents us from regression fitting. Tus, we consider substitution estimators based on te result in Teorem 2: ˆϕ S0 = 1 N 2 i S ˆσ 2 ζ i+w i N 2 ˆσ 2 N n + 1 N 2 [ ˆα N ] 2,

14 408 JUN SHAO AND SHENG WANG ˆϕ SC = 1 N 2 i S ˆσ 2 ζ i+w i [ N 2 ˆσ 2 N n + ˆα N ] 2, N ˆϕ SS = 1 N 2 ˆσ 2 ψ i + N 2 1 w i 1 + N 2 ˆσ 2 N n, i S were ζ i and ψ i are given in Teorem 2. Under SRS, ŶC0 = ŶCC = ŶCS and ŶS0 = ŶSC = ŶSS. Tus, te results are sown for ŶCS and ŶSS only. Te following is a summary of te results in Table Te biases of all estimators are negligible, altoug tey are relatively large compared wit tose in Table In terms of SD, ŶSS is no longer always te best. Wen regression lines are all te same, te estimators using combined regression are muc more efficient tan tose using separate regression lines, wic is quite different from te situation wit large n s. On te oter and, wen te slopes of regression lines are different, te estimators using separate regression lines are more efficient tan tose using combined regression. Under RHC, among te tree estimators using separate regression lines, ŶSS is no longer te best, ŶSC is. 3. Te substitution variance estimators for ŶS0, ŶSC, and ŶSS and te bootstrap variance estimator for ŶC0, ŶCC, and ŶCS perform well and result in good CP of confidence intervals. 5. Discussion We study te asymptotic efficiencies of estimators in two different asymptotic settings under stratified sampling wit H strata and model 2.2. In te case were H is fixed and all stratum sample sizes tend to infinity, te estimator ŶSS in 1.6 is asymptotically te most efficient estimator, regardless of weter regression models in different strata are te same or not. Wen all stratum sample sizes are small and H tends to infinity, owever, no general conclusion can be made. Wen H is fixed and all stratum sample sizes are large, it is not difficult to derive design-based consistent variance estimators for all six estimators in In fact, it can be sown tat te bootstrap variance estimators used in Section 4.1 are design-based consistent. For te case were all stratum sample sizes are small and H is large, it can still be sown tat te bootstrap variance estimators for ŶC0, Ŷ CC and ŶCS described in Section 4.2 are designbased consistent.

15 MODEL-ASSISTED REGRESSION ESTIMATORS 409 Tere are estimators of Y oter tan tose in For example, an anonymous referee commented tat we could consider regression adjustments instead of sample size adjustments, wic lead to an estimator of te form H [N Ŷ ˆβ X ˆX ˆα N ] =1 wit some estimators ˆα and ˆβ. Asymptotic results can be similarly derived. Acknowledgements We would like to tank tree referees and an associate editor for providing elpful comments and suggestions. Te researc was partially supported by an NSF grant. Appendix Proof of Teorem 1. Let P, E, and V be te te probability, expectation and variance under bot model 2.2 and design. By te fact tat y i = α +β x i +, i S and te definition of ŶSS, we ave Ŷ SS Y N = First, notice tat [ β ˆβ ˆX X + w i N 1 i S N [ ˆX E X ] [ ˆX = E s E m X ] [ = 0 and E w i N N 1 i S By C2 and Liapounov s Central Limit Teorem, ] N N. A.1 ] = 0. N were 1 ψ1 [ ˆX X N ] d N0, 1, ˆX ψ 1 = V X N ˆX = V s E m X ˆX + E s V m X N N ˆX = E s V m X N [ = σ 2 x E s i S wi 1 N N /S 2 ]

16 410 JUN SHAO AND SHENG WANG = σ 2 x [E s w2 i i S 2 1 N ], and te tird equality follows since E m ˆX / X /N = 0. By C3, we ave i S wi 2/ 2 < 1/n M. It is easy to sow tat N / p 1. By te Dominated Convergence Teorem and Liapounov s Central Limit Teorem, we ave 1 [ ˆX X ] d N0, 1, A.2 ϕ1 N were ϕ 1 = σ2 ] x [E N 2 s wi 2 N i S and σ 2 x = E[x i], i U. Similarly, we ave 1 [ ϕ2 1 i S w i = σ2 x N 2 w i N, N ] d N0, 1, were ϕ 2 = σ 2 w i N /N 2. Terefore, by A.1, Ŷ SS Y N ϕ SS = ϕ2 [ ϕ 1 β ˆβ ˆX X + 1 w i ϕss ϕ 2 ϕ1 N ϕ2 i S were ϕ SS = ϕ 2N 2 /N 2 = σ x /σ, A.2, and ˆβ β p 0, we obtain Ŷ SS Y N ϕ SS = ϕ2 N ] N N, [ σ2 w i N ] /N 2. Using ϕ 1 /ϕ 2 = [ 1 ϕss ϕ2 1 Terefore, by Slutsky s Teorem, i S w i Ŷ SS Y N ϕ SS d N0, 1. From te conditions, tere exists b suc tat N i S w i x i ˆX / 2 i S w i x i ˆX / 2 p b. ] N N + o p1. According to te definition of ˆβ, we ave ˆβ p β, were β = b β. Similarly, we can obtain Ŷ CS Y N ϕ CS d N0, 1,

17 MODEL-ASSISTED REGRESSION ESTIMATORS 411 were ϕ CS = τ N 2 /N 2 and τ = β β 2 ϕ 1 + ϕ 2. Terefore, ϕ CS = 1 N 2 σx 2 β β 2 w i N + ϕ SS. For ŶSC, we ave Ŷ SC Y N = α N + N ˆX β ˆβ X + N w i ϵ i. N i S Similar to te previous proof, by te fact tat ˆβ p β, te asymptotic distribution of ŶSC Y /N is as same as T, were T = α N + N 1 i S w i 1 N. Terefore, under C1 C3, T E[T ] ϕsc d N0, 1, were E[T ] = E s [ α / N /N] and ϕ SC = ϕ SS +V s [ α / N /N]. Under C3, N /M < < N M, N 2/Mn < i S wi 2 < MN2 /n. Terefore N/M < < NM and N 2/n M < i S wi 2 < M N 2/n. Ten N α 1 < α M N A.3 Since n /N 0 ten, for any fixed ϵ 0 > 0, wen n is large enoug, σ 2 N n M 1 > ϵ 0, Ten E[T ] ϕsc ϕ SC ϕ SS > 1 N N 2 σ 2 2 n M N > 0. [ E s α N / N ] 1 1 [ E s α N ] 1 0, ϵ 0 N σ2 N /n M 1

18 412 JUN SHAO AND SHENG WANG were te last inequality olds from /N p 1, A.3, and te Dominated Convergence Teorem. Terefore, T/ d ϕ SC N0, 1, wic implies Ŷ SC Y N ϕ SC d N0, 1. Similarly, we can get te asymptotic properties of ŶS0, ŶCC, and ŶC0. Proof of Teorem 2. By te Law of Large Numbers, we ave ˆβ β. Ten, similar to te proof of Teorem 1, we ave Ŷ CS Y = [ β N ˆβ ˆX X + w i ] N N 1 N N. A.4 i S By D2, it is easy to sow tat ŶCS Y /N as te same distribution as T 2, were T 2 = [ ˆX β β X + w i ] N N 1 N N. i S For T 2, E[T 2 ] = E s E m [T 2 ] = 0. Terefore, by Liapounov s Central Limit Teorem, we ave T 2 ϕcs d N0, 1, were [ 1 N ϕ CS = E s N 2 β β 2 σx ] σ2 w 2 i 2 N. i S For ŶCC, by Liapounov s Central Limit Teorem, we ave ŶCC Y /N E s [α + β βµ x / ] N /N d ϕ N0, 1, CC were [ ϕ CC = V s α + β βµ x N ] + N [ ] σ 2 + β β 2 σx 2 i S E wi 2 s 2 i S w i + N 2 N N 2. It can be sown tat n /N 1 = O p 1 and N 2/ nn 2 = o1. Based on tese facts, and similar to te proof of Teorem 1, we get ŶCC Y /N ] E s [α + β βµ x N N d N0, 1. ϕcc

19 MODEL-ASSISTED REGRESSION ESTIMATORS 413 Similar to te proof of Teorem 1, by D1 and D3, ] E s [α + β βµ x N N 0. ϕcc Terefore, For ŶSS, we ave were Ŷ SS Y N Terefore, were = = ϕ SS = Ŷ CC Y N ϕ CC d N0, 1. [ β ˆβ ˆX X [ i S ψ i = N ψ i + w i 1 N + w i 1 i S /S x i x w i ˆX i S x i x 2 X. w i N Ŷ SS Y N ϕ SS d N0, 1, N 1 N ] N N, N 2 [ N 2 σ2 E se m σ 2 ψ i + w i 1 2 N n ] + i S N N 2. ] N N, Similarly, we can get te asymptotic properties of ŶC0, ŶSC, and ŶS0. References Antal, E. and Tillé, Y A direct bootstrap metod for complex sampling designs from a finite population. J. Amer. Statist. Assoc. 106, Bickel, P. J. and Freedman, D. A Asymptotic normality and te bootstrap in stratified sampling. Ann. Statist. 12, Cocran, W. G Sampling Tecniques. Tird edition. Wiley, New York. Fuller, W. A Sampling Statistics. Wiley, Hoboken, New Jersey. Isaki, C. T. and Fuller, W. A Survey design under te regression superpopulation model. J. Amer. Statist. Assoc. 77, Krewski, D. and Rao, J. N. K Inference from stratified samples: properties of te linearization, jackknife and balanced repeated replication metods. Ann. Statist. 9,

20 414 JUN SHAO AND SHENG WANG McCarty, P. J. and Snowden, C. B Te bootstrap and finite population sampling. Vital and Healt Statistics, 2-95, Public Healt Service Publication U.S. Government Printing Office, Wasington, D.C. Montanari, G. E Post-sampling efficient Q-R prediction in large-sample surveys. Internat. Statist. Rev. 55, Sampford, M. R On sampling witout replacement wit unequal probability of selections. Bimetrika 54, Särndal, C. E., Swensson, B. and Wretman, J Model Assisted Survey Sampling. Springer- Verlag, New York. Sao, J. and Tu, D Te Jackknife and Bootstrap. Springer-Verlag, New York. Wrigt, R. L Finite population sampling wit multivariate auxiliary information. J. Amer. Statist. Assoc. 78, Scool of Finance and Statistics, East Cina Normal University, Sangai , Cina. Department of Statistics, University of Wisconsin-Madison, Madison, WI 53706, U.S.A. Matematica Policy Researc, Princeton, NJ 08540, U.S.A. Received February 2012; accepted December 2012

EFFICIENT REPLICATION VARIANCE ESTIMATION FOR TWO-PHASE SAMPLING

EFFICIENT REPLICATION VARIANCE ESTIMATION FOR TWO-PHASE SAMPLING Statistica Sinica 13(2003), 641-653 EFFICIENT REPLICATION VARIANCE ESTIMATION FOR TWO-PHASE SAMPLING J. K. Kim and R. R. Sitter Hankuk University of Foreign Studies and Simon Fraser University Abstract:

More information

VARIANCE ESTIMATION FOR COMBINED RATIO ESTIMATOR

VARIANCE ESTIMATION FOR COMBINED RATIO ESTIMATOR Sankyā : Te Indian Journal of Statistics 1995, Volume 57, Series B, Pt. 1, pp. 85-92 VARIANCE ESTIMATION FOR COMBINED RATIO ESTIMATOR By SANJAY KUMAR SAXENA Central Soil and Water Conservation Researc

More information

A = h w (1) Error Analysis Physics 141

A = h w (1) Error Analysis Physics 141 Introduction In all brances of pysical science and engineering one deals constantly wit numbers wic results more or less directly from experimental observations. Experimental observations always ave inaccuracies.

More information

Bootstrap confidence intervals in nonparametric regression without an additive model

Bootstrap confidence intervals in nonparametric regression without an additive model Bootstrap confidence intervals in nonparametric regression witout an additive model Dimitris N. Politis Abstract Te problem of confidence interval construction in nonparametric regression via te bootstrap

More information

Department of Statistics & Operations Research, Aligarh Muslim University, Aligarh, India

Department of Statistics & Operations Research, Aligarh Muslim University, Aligarh, India Open Journal of Optimization, 04, 3, 68-78 Publised Online December 04 in SciRes. ttp://www.scirp.org/ournal/oop ttp://dx.doi.org/0.436/oop.04.34007 Compromise Allocation for Combined Ratio Estimates of

More information

Artificial Neural Network Model Based Estimation of Finite Population Total

Artificial Neural Network Model Based Estimation of Finite Population Total International Journal of Science and Researc (IJSR), India Online ISSN: 2319-7064 Artificial Neural Network Model Based Estimation of Finite Population Total Robert Kasisi 1, Romanus O. Odiambo 2, Antony

More information

Strati cation by Size Revisited

Strati cation by Size Revisited Journal of Of cial Statistics, Vol. 16, No. 2, 2000, pp. 139±154 Strati cation by Size Revisited Alan H. Dorfman 1 and Ricard Valliant 2 Strati cation by size is used in nite population sampling as a means

More information

On Local Linear Regression Estimation of Finite Population Totals in Model Based Surveys

On Local Linear Regression Estimation of Finite Population Totals in Model Based Surveys American Journal of Teoretical and Applied Statistics 2018; 7(3): 92-101 ttp://www.sciencepublisinggroup.com/j/ajtas doi: 10.11648/j.ajtas.20180703.11 ISSN: 2326-8999 (Print); ISSN: 2326-9006 (Online)

More information

Kernel Density Based Linear Regression Estimate

Kernel Density Based Linear Regression Estimate Kernel Density Based Linear Regression Estimate Weixin Yao and Zibiao Zao Abstract For linear regression models wit non-normally distributed errors, te least squares estimate (LSE will lose some efficiency

More information

New Distribution Theory for the Estimation of Structural Break Point in Mean

New Distribution Theory for the Estimation of Structural Break Point in Mean New Distribution Teory for te Estimation of Structural Break Point in Mean Liang Jiang Singapore Management University Xiaou Wang Te Cinese University of Hong Kong Jun Yu Singapore Management University

More information

Polynomial Interpolation

Polynomial Interpolation Capter 4 Polynomial Interpolation In tis capter, we consider te important problem of approximatinga function fx, wose values at a set of distinct points x, x, x,, x n are known, by a polynomial P x suc

More information

A MONTE CARLO ANALYSIS OF THE EFFECTS OF COVARIANCE ON PROPAGATED UNCERTAINTIES

A MONTE CARLO ANALYSIS OF THE EFFECTS OF COVARIANCE ON PROPAGATED UNCERTAINTIES A MONTE CARLO ANALYSIS OF THE EFFECTS OF COVARIANCE ON PROPAGATED UNCERTAINTIES Ronald Ainswort Hart Scientific, American Fork UT, USA ABSTRACT Reports of calibration typically provide total combined uncertainties

More information

Handling Missing Data on Asymmetric Distribution

Handling Missing Data on Asymmetric Distribution International Matematical Forum, Vol. 8, 03, no. 4, 53-65 Handling Missing Data on Asymmetric Distribution Amad M. H. Al-Kazale Department of Matematics, Faculty of Science Al-albayt University, Al-Mafraq-Jordan

More information

Order of Accuracy. ũ h u Ch p, (1)

Order of Accuracy. ũ h u Ch p, (1) Order of Accuracy 1 Terminology We consider a numerical approximation of an exact value u. Te approximation depends on a small parameter, wic can be for instance te grid size or time step in a numerical

More information

How to Find the Derivative of a Function: Calculus 1

How to Find the Derivative of a Function: Calculus 1 Introduction How to Find te Derivative of a Function: Calculus 1 Calculus is not an easy matematics course Te fact tat you ave enrolled in suc a difficult subject indicates tat you are interested in te

More information

232 Calculus and Structures

232 Calculus and Structures 3 Calculus and Structures CHAPTER 17 JUSTIFICATION OF THE AREA AND SLOPE METHODS FOR EVALUATING BEAMS Calculus and Structures 33 Copyrigt Capter 17 JUSTIFICATION OF THE AREA AND SLOPE METHODS 17.1 THE

More information

New families of estimators and test statistics in log-linear models

New families of estimators and test statistics in log-linear models Journal of Multivariate Analysis 99 008 1590 1609 www.elsevier.com/locate/jmva ew families of estimators and test statistics in log-linear models irian Martín a,, Leandro Pardo b a Department of Statistics

More information

Numerical Experiments Using MATLAB: Superconvergence of Nonconforming Finite Element Approximation for Second-Order Elliptic Problems

Numerical Experiments Using MATLAB: Superconvergence of Nonconforming Finite Element Approximation for Second-Order Elliptic Problems Applied Matematics, 06, 7, 74-8 ttp://wwwscirporg/journal/am ISSN Online: 5-7393 ISSN Print: 5-7385 Numerical Experiments Using MATLAB: Superconvergence of Nonconforming Finite Element Approximation for

More information

More on generalized inverses of partitioned matrices with Banachiewicz-Schur forms

More on generalized inverses of partitioned matrices with Banachiewicz-Schur forms More on generalized inverses of partitioned matrices wit anaciewicz-scur forms Yongge Tian a,, Yosio Takane b a Cina Economics and Management cademy, Central University of Finance and Economics, eijing,

More information

Volume 29, Issue 3. Existence of competitive equilibrium in economies with multi-member households

Volume 29, Issue 3. Existence of competitive equilibrium in economies with multi-member households Volume 29, Issue 3 Existence of competitive equilibrium in economies wit multi-member ouseolds Noriisa Sato Graduate Scool of Economics, Waseda University Abstract Tis paper focuses on te existence of

More information

Bootstrap prediction intervals for Markov processes

Bootstrap prediction intervals for Markov processes arxiv: arxiv:0000.0000 Bootstrap prediction intervals for Markov processes Li Pan and Dimitris N. Politis Li Pan Department of Matematics University of California San Diego La Jolla, CA 92093-0112, USA

More information

Financial Econometrics Prof. Massimo Guidolin

Financial Econometrics Prof. Massimo Guidolin CLEFIN A.A. 2010/2011 Financial Econometrics Prof. Massimo Guidolin A Quick Review of Basic Estimation Metods 1. Were te OLS World Ends... Consider two time series 1: = { 1 2 } and 1: = { 1 2 }. At tis

More information

Combining functions: algebraic methods

Combining functions: algebraic methods Combining functions: algebraic metods Functions can be added, subtracted, multiplied, divided, and raised to a power, just like numbers or algebra expressions. If f(x) = x 2 and g(x) = x + 2, clearly f(x)

More information

Mathematics 5 Worksheet 11 Geometry, Tangency, and the Derivative

Mathematics 5 Worksheet 11 Geometry, Tangency, and the Derivative Matematics 5 Workseet 11 Geometry, Tangency, and te Derivative Problem 1. Find te equation of a line wit slope m tat intersects te point (3, 9). Solution. Te equation for a line passing troug a point (x

More information

4. The slope of the line 2x 7y = 8 is (a) 2/7 (b) 7/2 (c) 2 (d) 2/7 (e) None of these.

4. The slope of the line 2x 7y = 8 is (a) 2/7 (b) 7/2 (c) 2 (d) 2/7 (e) None of these. Mat 11. Test Form N Fall 016 Name. Instructions. Te first eleven problems are wort points eac. Te last six problems are wort 5 points eac. For te last six problems, you must use relevant metods of algebra

More information

THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA EXAMINATION MODULE 5

THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA EXAMINATION MODULE 5 THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA EXAMINATION NEW MODULAR SCHEME introduced from te examinations in 009 MODULE 5 SOLUTIONS FOR SPECIMEN PAPER B THE QUESTIONS ARE CONTAINED IN A SEPARATE FILE

More information

Material for Difference Quotient

Material for Difference Quotient Material for Difference Quotient Prepared by Stepanie Quintal, graduate student and Marvin Stick, professor Dept. of Matematical Sciences, UMass Lowell Summer 05 Preface Te following difference quotient

More information

Numerical Differentiation

Numerical Differentiation Numerical Differentiation Finite Difference Formulas for te first derivative (Using Taylor Expansion tecnique) (section 8.3.) Suppose tat f() = g() is a function of te variable, and tat as 0 te function

More information

Bandwidth Selection in Nonparametric Kernel Testing

Bandwidth Selection in Nonparametric Kernel Testing Te University of Adelaide Scool of Economics Researc Paper No. 2009-0 January 2009 Bandwidt Selection in Nonparametric ernel Testing Jiti Gao and Irene Gijbels Bandwidt Selection in Nonparametric ernel

More information

Polynomial Interpolation

Polynomial Interpolation Capter 4 Polynomial Interpolation In tis capter, we consider te important problem of approximating a function f(x, wose values at a set of distinct points x, x, x 2,,x n are known, by a polynomial P (x

More information

Ratio estimation using stratified ranked set sample

Ratio estimation using stratified ranked set sample METRON - International Journal of Statistics 003, vol. LXI, n. 1, pp. 75-90 HANI M. SAMAWI MAHMOUD I. SIAM Ratio estimation using stratified ranked set sample Summary - Ratio estimation metod is used to

More information

A Simple Matching Method for Estimating Sample Selection Models Using Experimental Data

A Simple Matching Method for Estimating Sample Selection Models Using Experimental Data ANNALS OF ECONOMICS AND FINANCE 6, 155 167 (2005) A Simple Matcing Metod for Estimating Sample Selection Models Using Experimental Data Songnian Cen Te Hong Kong University of Science and Tecnology and

More information

IEOR 165 Lecture 10 Distribution Estimation

IEOR 165 Lecture 10 Distribution Estimation IEOR 165 Lecture 10 Distribution Estimation 1 Motivating Problem Consider a situation were we ave iid data x i from some unknown distribution. One problem of interest is estimating te distribution tat

More information

Uniform Convergence Rates for Nonparametric Estimation

Uniform Convergence Rates for Nonparametric Estimation Uniform Convergence Rates for Nonparametric Estimation Bruce E. Hansen University of Wisconsin www.ssc.wisc.edu/~bansen October 2004 Preliminary and Incomplete Abstract Tis paper presents a set of rate

More information

1 The concept of limits (p.217 p.229, p.242 p.249, p.255 p.256) 1.1 Limits Consider the function determined by the formula 3. x since at this point

1 The concept of limits (p.217 p.229, p.242 p.249, p.255 p.256) 1.1 Limits Consider the function determined by the formula 3. x since at this point MA00 Capter 6 Calculus and Basic Linear Algebra I Limits, Continuity and Differentiability Te concept of its (p.7 p.9, p.4 p.49, p.55 p.56). Limits Consider te function determined by te formula f Note

More information

Symmetry Labeling of Molecular Energies

Symmetry Labeling of Molecular Energies Capter 7. Symmetry Labeling of Molecular Energies Notes: Most of te material presented in tis capter is taken from Bunker and Jensen 1998, Cap. 6, and Bunker and Jensen 2005, Cap. 7. 7.1 Hamiltonian Symmetry

More information

Model Specification Testing in Nonparametric and Semiparametric Time Series Econometrics 1

Model Specification Testing in Nonparametric and Semiparametric Time Series Econometrics 1 Model Specification Testing in Nonparametric and Semiparametric Time Series Econometrics 1 By Jiti Gao 2 and Maxwell King 3 Abstract We propose a simultaneous model specification procedure for te conditional

More information

LECTURE 14 NUMERICAL INTEGRATION. Find

LECTURE 14 NUMERICAL INTEGRATION. Find LECTURE 14 NUMERCAL NTEGRATON Find b a fxdx or b a vx ux fx ydy dx Often integration is required. However te form of fx may be suc tat analytical integration would be very difficult or impossible. Use

More information

Chapter 5 FINITE DIFFERENCE METHOD (FDM)

Chapter 5 FINITE DIFFERENCE METHOD (FDM) MEE7 Computer Modeling Tecniques in Engineering Capter 5 FINITE DIFFERENCE METHOD (FDM) 5. Introduction to FDM Te finite difference tecniques are based upon approximations wic permit replacing differential

More information

Function Composition and Chain Rules

Function Composition and Chain Rules Function Composition and s James K. Peterson Department of Biological Sciences and Department of Matematical Sciences Clemson University Marc 8, 2017 Outline 1 Function Composition and Continuity 2 Function

More information

Differential Calculus (The basics) Prepared by Mr. C. Hull

Differential Calculus (The basics) Prepared by Mr. C. Hull Differential Calculus Te basics) A : Limits In tis work on limits, we will deal only wit functions i.e. tose relationsips in wic an input variable ) defines a unique output variable y). Wen we work wit

More information

The Complexity of Computing the MCD-Estimator

The Complexity of Computing the MCD-Estimator Te Complexity of Computing te MCD-Estimator Torsten Bernolt Lerstul Informatik 2 Universität Dortmund, Germany torstenbernolt@uni-dortmundde Paul Fiscer IMM, Danisc Tecnical University Kongens Lyngby,

More information

Analytic Functions. Differentiable Functions of a Complex Variable

Analytic Functions. Differentiable Functions of a Complex Variable Analytic Functions Differentiable Functions of a Complex Variable In tis capter, we sall generalize te ideas for polynomials power series of a complex variable we developed in te previous capter to general

More information

Continuity and Differentiability of the Trigonometric Functions

Continuity and Differentiability of the Trigonometric Functions [Te basis for te following work will be te definition of te trigonometric functions as ratios of te sides of a triangle inscribed in a circle; in particular, te sine of an angle will be defined to be te

More information

Adaptive Neural Filters with Fixed Weights

Adaptive Neural Filters with Fixed Weights Adaptive Neural Filters wit Fixed Weigts James T. Lo and Justin Nave Department of Matematics and Statistics University of Maryland Baltimore County Baltimore, MD 150, U.S.A. e-mail: jameslo@umbc.edu Abstract

More information

AMS 147 Computational Methods and Applications Lecture 09 Copyright by Hongyun Wang, UCSC. Exact value. Effect of round-off error.

AMS 147 Computational Methods and Applications Lecture 09 Copyright by Hongyun Wang, UCSC. Exact value. Effect of round-off error. Lecture 09 Copyrigt by Hongyun Wang, UCSC Recap: Te total error in numerical differentiation fl( f ( x + fl( f ( x E T ( = f ( x Numerical result from a computer Exact value = e + f x+ Discretization error

More information

SECTION 1.10: DIFFERENCE QUOTIENTS LEARNING OBJECTIVES

SECTION 1.10: DIFFERENCE QUOTIENTS LEARNING OBJECTIVES (Section.0: Difference Quotients).0. SECTION.0: DIFFERENCE QUOTIENTS LEARNING OBJECTIVES Define average rate of cange (and average velocity) algebraically and grapically. Be able to identify, construct,

More information

Applied Linear Statistical Models. Simultaneous Inference Topics. Simultaneous Estimation of β 0 and β 1 Issues. Simultaneous Inference. Dr.

Applied Linear Statistical Models. Simultaneous Inference Topics. Simultaneous Estimation of β 0 and β 1 Issues. Simultaneous Inference. Dr. Applied Linear Statistical Models Simultaneous Inference Dr. DH Jones Simultaneous Inference Topics Simultaneous estimation of β 0 and β 1 Bonferroni Metod Simultaneous estimation of several mean responses

More information

1 Proving the Fundamental Theorem of Statistical Learning

1 Proving the Fundamental Theorem of Statistical Learning THEORETICAL MACHINE LEARNING COS 5 LECTURE #7 APRIL 5, 6 LECTURER: ELAD HAZAN NAME: FERMI MA ANDDANIEL SUO oving te Fundaental Teore of Statistical Learning In tis section, we prove te following: Teore.

More information

2.8 The Derivative as a Function

2.8 The Derivative as a Function .8 Te Derivative as a Function Typically, we can find te derivative of a function f at many points of its domain: Definition. Suppose tat f is a function wic is differentiable at every point of an open

More information

Lecture 15. Interpolation II. 2 Piecewise polynomial interpolation Hermite splines

Lecture 15. Interpolation II. 2 Piecewise polynomial interpolation Hermite splines Lecture 5 Interpolation II Introduction In te previous lecture we focused primarily on polynomial interpolation of a set of n points. A difficulty we observed is tat wen n is large, our polynomial as to

More information

REVIEW LAB ANSWER KEY

REVIEW LAB ANSWER KEY REVIEW LAB ANSWER KEY. Witout using SN, find te derivative of eac of te following (you do not need to simplify your answers): a. f x 3x 3 5x x 6 f x 3 3x 5 x 0 b. g x 4 x x x notice te trick ere! x x g

More information

Pre-Calculus Review Preemptive Strike

Pre-Calculus Review Preemptive Strike Pre-Calculus Review Preemptive Strike Attaced are some notes and one assignment wit tree parts. Tese are due on te day tat we start te pre-calculus review. I strongly suggest reading troug te notes torougly

More information

Robotic manipulation project

Robotic manipulation project Robotic manipulation project Bin Nguyen December 5, 2006 Abstract Tis is te draft report for Robotic Manipulation s class project. Te cosen project aims to understand and implement Kevin Egan s non-convex

More information

LIMITS AND DERIVATIVES CONDITIONS FOR THE EXISTENCE OF A LIMIT

LIMITS AND DERIVATIVES CONDITIONS FOR THE EXISTENCE OF A LIMIT LIMITS AND DERIVATIVES Te limit of a function is defined as te value of y tat te curve approaces, as x approaces a particular value. Te limit of f (x) as x approaces a is written as f (x) approaces, as

More information

Copyright c 2008 Kevin Long

Copyright c 2008 Kevin Long Lecture 4 Numerical solution of initial value problems Te metods you ve learned so far ave obtained closed-form solutions to initial value problems. A closedform solution is an explicit algebriac formula

More information

Taylor Series and the Mean Value Theorem of Derivatives

Taylor Series and the Mean Value Theorem of Derivatives 1 - Taylor Series and te Mean Value Teorem o Derivatives Te numerical solution o engineering and scientiic problems described by matematical models oten requires solving dierential equations. Dierential

More information

Lab 6 Derivatives and Mutant Bacteria

Lab 6 Derivatives and Mutant Bacteria Lab 6 Derivatives and Mutant Bacteria Date: September 27, 20 Assignment Due Date: October 4, 20 Goal: In tis lab you will furter explore te concept of a derivative using R. You will use your knowledge

More information

Spatial models with spatially lagged dependent variables and incomplete data

Spatial models with spatially lagged dependent variables and incomplete data J Geogr Syst (2010) 12:241 257 DOI 10.1007/s10109-010-0109-5 ORIGINAL ARTICLE Spatial models wit spatially lagged dependent variables and incomplete data Harry H. Kelejian Ingmar R. Pruca Received: 23

More information

A New Diagnostic Test for Cross Section Independence in Nonparametric Panel Data Model

A New Diagnostic Test for Cross Section Independence in Nonparametric Panel Data Model e University of Adelaide Scool of Economics Researc Paper No. 2009-6 October 2009 A New Diagnostic est for Cross Section Independence in Nonparametric Panel Data Model Jia Cen, Jiti Gao and Degui Li e

More information

Poisson Equation in Sobolev Spaces

Poisson Equation in Sobolev Spaces Poisson Equation in Sobolev Spaces OcMountain Dayligt Time. 6, 011 Today we discuss te Poisson equation in Sobolev spaces. It s existence, uniqueness, and regularity. Weak Solution. u = f in, u = g on

More information

The derivative function

The derivative function Roberto s Notes on Differential Calculus Capter : Definition of derivative Section Te derivative function Wat you need to know already: f is at a point on its grap and ow to compute it. Wat te derivative

More information

Basic Nonparametric Estimation Spring 2002

Basic Nonparametric Estimation Spring 2002 Basic Nonparametric Estimation Spring 2002 Te following topics are covered today: Basic Nonparametric Regression. Tere are four books tat you can find reference: Silverman986, Wand and Jones995, Hardle990,

More information

Quasiperiodic phenomena in the Van der Pol - Mathieu equation

Quasiperiodic phenomena in the Van der Pol - Mathieu equation Quasiperiodic penomena in te Van der Pol - Matieu equation F. Veerman and F. Verulst Department of Matematics, Utrect University P.O. Box 80.010, 3508 TA Utrect Te Neterlands April 8, 009 Abstract Te Van

More information

Section 15.6 Directional Derivatives and the Gradient Vector

Section 15.6 Directional Derivatives and the Gradient Vector Section 15.6 Directional Derivatives and te Gradient Vector Finding rates of cange in different directions Recall tat wen we first started considering derivatives of functions of more tan one variable,

More information

Math 102 TEST CHAPTERS 3 & 4 Solutions & Comments Fall 2006

Math 102 TEST CHAPTERS 3 & 4 Solutions & Comments Fall 2006 Mat 102 TEST CHAPTERS 3 & 4 Solutions & Comments Fall 2006 f(x+) f(x) 10 1. For f(x) = x 2 + 2x 5, find ))))))))) and simplify completely. NOTE: **f(x+) is NOT f(x)+! f(x+) f(x) (x+) 2 + 2(x+) 5 ( x 2

More information

Convergence and Descent Properties for a Class of Multilevel Optimization Algorithms

Convergence and Descent Properties for a Class of Multilevel Optimization Algorithms Convergence and Descent Properties for a Class of Multilevel Optimization Algoritms Stepen G. Nas April 28, 2010 Abstract I present a multilevel optimization approac (termed MG/Opt) for te solution of

More information

(a) At what number x = a does f have a removable discontinuity? What value f(a) should be assigned to f at x = a in order to make f continuous at a?

(a) At what number x = a does f have a removable discontinuity? What value f(a) should be assigned to f at x = a in order to make f continuous at a? Solutions to Test 1 Fall 016 1pt 1. Te grap of a function f(x) is sown at rigt below. Part I. State te value of eac limit. If a limit is infinite, state weter it is or. If a limit does not exist (but is

More information

Estimating Peak Bone Mineral Density in Osteoporosis Diagnosis by Maximum Distribution

Estimating Peak Bone Mineral Density in Osteoporosis Diagnosis by Maximum Distribution International Journal of Clinical Medicine Researc 2016; 3(5): 76-80 ttp://www.aascit.org/journal/ijcmr ISSN: 2375-3838 Estimating Peak Bone Mineral Density in Osteoporosis Diagnosis by Maximum Distribution

More information

Quantum Mechanics Chapter 1.5: An illustration using measurements of particle spin.

Quantum Mechanics Chapter 1.5: An illustration using measurements of particle spin. I Introduction. Quantum Mecanics Capter.5: An illustration using measurements of particle spin. Quantum mecanics is a teory of pysics tat as been very successful in explaining and predicting many pysical

More information

Variance Estimation in Stratified Random Sampling in the Presence of Two Auxiliary Random Variables

Variance Estimation in Stratified Random Sampling in the Presence of Two Auxiliary Random Variables International Journal of Science and Researc (IJSR) ISSN (Online): 39-7064 Impact Factor (0): 3.358 Variance Estimation in Stratified Random Sampling in te Presence of Two Auxiliary Random Variables Esubalew

More information

Te comparison of dierent models M i is based on teir relative probabilities, wic can be expressed, again using Bayes' teorem, in terms of prior probab

Te comparison of dierent models M i is based on teir relative probabilities, wic can be expressed, again using Bayes' teorem, in terms of prior probab To appear in: Advances in Neural Information Processing Systems 9, eds. M. C. Mozer, M. I. Jordan and T. Petsce. MIT Press, 997 Bayesian Model Comparison by Monte Carlo Caining David Barber D.Barber@aston.ac.uk

More information

THE IDEA OF DIFFERENTIABILITY FOR FUNCTIONS OF SEVERAL VARIABLES Math 225

THE IDEA OF DIFFERENTIABILITY FOR FUNCTIONS OF SEVERAL VARIABLES Math 225 THE IDEA OF DIFFERENTIABILITY FOR FUNCTIONS OF SEVERAL VARIABLES Mat 225 As we ave seen, te definition of derivative for a Mat 111 function g : R R and for acurveγ : R E n are te same, except for interpretation:

More information

Effect of the Dependent Paths in Linear Hull

Effect of the Dependent Paths in Linear Hull 1 Effect of te Dependent Pats in Linear Hull Zenli Dai, Meiqin Wang, Yue Sun Scool of Matematics, Sandong University, Jinan, 250100, Cina Key Laboratory of Cryptologic Tecnology and Information Security,

More information

Stationary Gaussian Markov Processes As Limits of Stationary Autoregressive Time Series

Stationary Gaussian Markov Processes As Limits of Stationary Autoregressive Time Series Stationary Gaussian Markov Processes As Limits of Stationary Autoregressive Time Series Lawrence D. Brown, Pilip A. Ernst, Larry Sepp, and Robert Wolpert August 27, 2015 Abstract We consider te class,

More information

ch (for some fixed positive number c) reaching c

ch (for some fixed positive number c) reaching c GSTF Journal of Matematics Statistics and Operations Researc (JMSOR) Vol. No. September 05 DOI 0.60/s4086-05-000-z Nonlinear Piecewise-defined Difference Equations wit Reciprocal and Cubic Terms Ramadan

More information

Dedicated to the 70th birthday of Professor Lin Qun

Dedicated to the 70th birthday of Professor Lin Qun Journal of Computational Matematics, Vol.4, No.3, 6, 4 44. ACCELERATION METHODS OF NONLINEAR ITERATION FOR NONLINEAR PARABOLIC EQUATIONS Guang-wei Yuan Xu-deng Hang Laboratory of Computational Pysics,

More information

Math Spring 2013 Solutions to Assignment # 3 Completion Date: Wednesday May 15, (1/z) 2 (1/z 1) 2 = lim

Math Spring 2013 Solutions to Assignment # 3 Completion Date: Wednesday May 15, (1/z) 2 (1/z 1) 2 = lim Mat 311 - Spring 013 Solutions to Assignment # 3 Completion Date: Wednesday May 15, 013 Question 1. [p 56, #10 (a)] 4z Use te teorem of Sec. 17 to sow tat z (z 1) = 4. We ave z 4z (z 1) = z 0 4 (1/z) (1/z

More information

158 Calculus and Structures

158 Calculus and Structures 58 Calculus and Structures CHAPTER PROPERTIES OF DERIVATIVES AND DIFFERENTIATION BY THE EASY WAY. Calculus and Structures 59 Copyrigt Capter PROPERTIES OF DERIVATIVES. INTRODUCTION In te last capter you

More information

School of Geomatics and Urban Information, Beijing University of Civil Engineering and Architecture, Beijing, China 2

School of Geomatics and Urban Information, Beijing University of Civil Engineering and Architecture, Beijing, China 2 Examination Metod and Implementation for Field Survey Data of Crop Types Based on Multi-resolution Satellite Images Yang Liu, Mingyi Du, Wenquan Zu, Scool of Geomatics and Urban Information, Beijing University

More information

Variational Localizations of the Dual Weighted Residual Estimator

Variational Localizations of the Dual Weighted Residual Estimator Publised in Journal for Computational and Applied Matematics, pp. 192-208, 2015 Variational Localizations of te Dual Weigted Residual Estimator Tomas Ricter Tomas Wick Te dual weigted residual metod (DWR)

More information

Differentiation in higher dimensions

Differentiation in higher dimensions Capter 2 Differentiation in iger dimensions 2.1 Te Total Derivative Recall tat if f : R R is a 1-variable function, and a R, we say tat f is differentiable at x = a if and only if te ratio f(a+) f(a) tends

More information

7 Semiparametric Methods and Partially Linear Regression

7 Semiparametric Methods and Partially Linear Regression 7 Semiparametric Metods and Partially Linear Regression 7. Overview A model is called semiparametric if it is described by and were is nite-dimensional (e.g. parametric) and is in nite-dimensional (nonparametric).

More information

The Priestley-Chao Estimator

The Priestley-Chao Estimator Te Priestley-Cao Estimator In tis section we will consider te Pristley-Cao estimator of te unknown regression function. It is assumed tat we ave a sample of observations (Y i, x i ), i = 1,..., n wic are

More information

Journal of Computational and Applied Mathematics

Journal of Computational and Applied Mathematics Journal of Computational and Applied Matematics 94 (6) 75 96 Contents lists available at ScienceDirect Journal of Computational and Applied Matematics journal omepage: www.elsevier.com/locate/cam Smootness-Increasing

More information

Investigating Euler s Method and Differential Equations to Approximate π. Lindsay Crowl August 2, 2001

Investigating Euler s Method and Differential Equations to Approximate π. Lindsay Crowl August 2, 2001 Investigating Euler s Metod and Differential Equations to Approximate π Lindsa Crowl August 2, 2001 Tis researc paper focuses on finding a more efficient and accurate wa to approximate π. Suppose tat x

More information

Exam 1 Solutions. x(x 2) (x + 1)(x 2) = x

Exam 1 Solutions. x(x 2) (x + 1)(x 2) = x Eam Solutions Question (0%) Consider f() = 2 2 2 2. (a) By calculating relevant its, determine te equations of all vertical asymptotes of te grap of f(). If tere are none, say so. f() = ( 2) ( + )( 2)

More information

2.11 That s So Derivative

2.11 That s So Derivative 2.11 Tat s So Derivative Introduction to Differential Calculus Just as one defines instantaneous velocity in terms of average velocity, we now define te instantaneous rate of cange of a function at a point

More information

New Streamfunction Approach for Magnetohydrodynamics

New Streamfunction Approach for Magnetohydrodynamics New Streamfunction Approac for Magnetoydrodynamics Kab Seo Kang Brooaven National Laboratory, Computational Science Center, Building 63, Room, Upton NY 973, USA. sang@bnl.gov Summary. We apply te finite

More information

Analysis of Solar Generation and Weather Data in Smart Grid with Simultaneous Inference of Nonlinear Time Series

Analysis of Solar Generation and Weather Data in Smart Grid with Simultaneous Inference of Nonlinear Time Series Te First International Worksop on Smart Cities and Urban Informatics 215 Analysis of Solar Generation and Weater Data in Smart Grid wit Simultaneous Inference of Nonlinear Time Series Yu Wang, Guanqun

More information

1 Calculus. 1.1 Gradients and the Derivative. Q f(x+h) f(x)

1 Calculus. 1.1 Gradients and the Derivative. Q f(x+h) f(x) Calculus. Gradients and te Derivative Q f(x+) δy P T δx R f(x) 0 x x+ Let P (x, f(x)) and Q(x+, f(x+)) denote two points on te curve of te function y = f(x) and let R denote te point of intersection of

More information

HOW TO DEAL WITH FFT SAMPLING INFLUENCES ON ADEV CALCULATIONS

HOW TO DEAL WITH FFT SAMPLING INFLUENCES ON ADEV CALCULATIONS HOW TO DEAL WITH FFT SAMPLING INFLUENCES ON ADEV CALCULATIONS Po-Ceng Cang National Standard Time & Frequency Lab., TL, Taiwan 1, Lane 551, Min-Tsu Road, Sec. 5, Yang-Mei, Taoyuan, Taiwan 36 Tel: 886 3

More information

OSCILLATION OF SOLUTIONS TO NON-LINEAR DIFFERENCE EQUATIONS WITH SEVERAL ADVANCED ARGUMENTS. Sandra Pinelas and Julio G. Dix

OSCILLATION OF SOLUTIONS TO NON-LINEAR DIFFERENCE EQUATIONS WITH SEVERAL ADVANCED ARGUMENTS. Sandra Pinelas and Julio G. Dix Opuscula Mat. 37, no. 6 (2017), 887 898 ttp://dx.doi.org/10.7494/opmat.2017.37.6.887 Opuscula Matematica OSCILLATION OF SOLUTIONS TO NON-LINEAR DIFFERENCE EQUATIONS WITH SEVERAL ADVANCED ARGUMENTS Sandra

More information

Online Learning: Bandit Setting

Online Learning: Bandit Setting Online Learning: Bandit Setting Daniel asabi Summer 04 Last Update: October 0, 06 Introduction [TODO Bandits. Stocastic setting Suppose tere exists unknown distributions ν,..., ν, suc tat te loss at eac

More information

3.4 Worksheet: Proof of the Chain Rule NAME

3.4 Worksheet: Proof of the Chain Rule NAME Mat 1170 3.4 Workseet: Proof of te Cain Rule NAME Te Cain Rule So far we are able to differentiate all types of functions. For example: polynomials, rational, root, and trigonometric functions. We are

More information

Distribution of reynolds shear stress in steady and unsteady flows

Distribution of reynolds shear stress in steady and unsteady flows University of Wollongong Researc Online Faculty of Engineering and Information Sciences - Papers: Part A Faculty of Engineering and Information Sciences 13 Distribution of reynolds sear stress in steady

More information

MATH1151 Calculus Test S1 v2a

MATH1151 Calculus Test S1 v2a MATH5 Calculus Test 8 S va January 8, 5 Tese solutions were written and typed up by Brendan Trin Please be etical wit tis resource It is for te use of MatSOC members, so do not repost it on oter forums

More information

arxiv: v1 [math.dg] 4 Feb 2015

arxiv: v1 [math.dg] 4 Feb 2015 CENTROID OF TRIANGLES ASSOCIATED WITH A CURVE arxiv:1502.01205v1 [mat.dg] 4 Feb 2015 Dong-Soo Kim and Dong Seo Kim Abstract. Arcimedes sowed tat te area between a parabola and any cord AB on te parabola

More information

Superconvergence of energy-conserving discontinuous Galerkin methods for. linear hyperbolic equations. Abstract

Superconvergence of energy-conserving discontinuous Galerkin methods for. linear hyperbolic equations. Abstract Superconvergence of energy-conserving discontinuous Galerkin metods for linear yperbolic equations Yong Liu, Ci-Wang Su and Mengping Zang 3 Abstract In tis paper, we study superconvergence properties of

More information

Error estimates for a semi-implicit fully discrete finite element scheme for the mean curvature flow of graphs

Error estimates for a semi-implicit fully discrete finite element scheme for the mean curvature flow of graphs Interfaces and Free Boundaries 2, 2000 34 359 Error estimates for a semi-implicit fully discrete finite element sceme for te mean curvature flow of graps KLAUS DECKELNICK Scool of Matematical Sciences,

More information