VALIDATION OF TRACE-DRIVEN SIMULATION MODELS: MORE ON BOOTSTRAP TESTS. Russell C.H. Cheng

Size: px
Start display at page:

Download "VALIDATION OF TRACE-DRIVEN SIMULATION MODELS: MORE ON BOOTSTRAP TESTS. Russell C.H. Cheng"

Transcription

1 Proceedngs of the 2000 Wnter Smulaton Conference J A Jones, R R Barton, K Kang, and P A Fshwck, eds VALIDATION OF TRACE-DRIVEN SIMULATION MODELS: MORE ON BOOTSTRAP TESTS Jack PC Klejnen Department of Informaton Systems (BIK)/ Center for Economc Research (CentER) Tlburg Unversty (KUB) 5000 LE Tlburg, Netherlands Russell CH Cheng Department of Mathematcal Scences Unversty of Southampton, Hghfeld, Southampton SO17 1BJ, UK Bert Bettonvl Department of Informaton Systems (BIK)/ Center for Economc Research (CentER) Tlburg Unversty (KUB) 5000 LE Tlburg, Netherlands ABSTRACT Trace-drven or correlated nspecton smulaton means that the smulated and the real systems have some common nputs (say, arrval tmes) so the two systems outputs are cross-correlated To valdate such smulaton models, ths paper formulates sx valdaton statstcs, whch are nspred by practce and statstcal analyss; for example, the smplest statstc s the dfference between the average smulated and real responses To evaluate these valdaton statstcs, the paper develops novel types of bootstrappng based on subruns Three basc bootstrap procedures are devsed, dependng on the number of smulaton replcates: one, two, or more replcates Moreover, for the case of more than two replcates the paper consders condtonal versus uncondtonal resamplng These sx valdaton statstcs and four bootstrap procedures are evaluated n extensve Monte Carlo experments wth sngle-server queueng systems The man concluson s that bootstrappng of the smplest valdaton statstc gves the correct type I error probablty, and has relatvely hgh power 1 INTRODUCTION Valdaton has many aspects; for a recent revew and references see Klejnen (1999) In ths paper, however, we lmt ourselves to statstcal testng of the valdty of tracedrven smulatons Consder the followng trace-drven smulaton; also see Table 1 The smulated and the real systems have some common nputs (say) A; for example, the same hstorcal sequence of arrval tmes (we use captal letters for random varables, lower-case letters for realzed values, and bold letters for matrces ncludng vectors) The real system generates a tme seres of outputs W ; t whereas the smulaton generates outputs wth = 1,, n and t = 1, V ; t 882 2,, k; for example, sojourn tme of job t on day To evaluate the real system, ts manager characterzes the output tme seres by a sngle performance measure (response) X ; for example, average sojourn tme on day To valdate the smulaton statstcally, ths real performance X s compared wth the smulated performance (say) Y - for the same stuaton (same crcumstances, same scenaro) characterzed by the trace A But how should we compare X and Y? Some solutons are presented n Moors and Strjbosch (1998), but we focus on Klejnen, Bettonvl, and Van Groenendaal (1998), abbrevated here to KLEIJ Lke KLEIJ we assume that all smulaton responses Y are dentcally and ndependently dstrbuted (d) More specfcally, each subrun starts n the empty state, and stops after a fxed number k of jobs The real responses X are also d Unlke KLEIJ we do not assume that ( X, Y ) are bvarate normal Indeed, n case of short subruns (say, k = 10) the responses are serously nonnormal Ths nonnormalty - together wth a small n (number of subruns) - s not well handled by conventonal non-bootstrap technques (Obvously, tracedrven smulaton mples that the two members of the par ( X, Y ) are cross-correlated) We suppose that the smulaton model has at least one more nput varable (eg, servce tme) not recorded on the trace, so ths nput s sampled usng a pseudorandom number stream R There are s smulaton replcatons (usng the same Y (r) trace A ), whch yeld wth r = 1,, s We dstngush three cases for s, namely 1, 2, or more - namely, fve or ten To solve ths problem, we use bootstrappng, whch n general samples - randomly wth replacement - d observatons; see the semnal book on bootstrappng (outsde smulaton), Efron and Tbshran (1993), here abbrevated to EFRON (Other monographs on bootstrappng are Davson and Hnkley (19 ), Mooney and Duval (1993), and Shao and Tu (1995))

2 Table 1: Trace-drven Smulaton Subrun number Trace: A 1 A A n Real performance: X 1 X X n Smulated performance: replcate 1 Y (1) 1 Y (1) Y (1) n replcate r Y (r) 1 Y (r) replcate s Y (s) 1 Y (s) Y (r) n Y (s) n We wsh to test the hypothess that the smulaton model s vald For hypothess testng through bootstrappng outsde smulaton we refer to EFRON and also Shao and Tu (1995, pp 176, 189) Our man dscovery wll be: one smulaton replcate s certanly a vald model for another smulaton replcate So f s $ 2 we can obtan the bootstrap dstrbuton of any valdaton statstc under the null-hypothess of a vald trace-drven smulaton model! Note that - nstead of generatng responses through bootstrappng - we may generate more smulaton responses In practce, however, replcatng a smulaton generally requres much more computer tme than bootstrappng a smulaton We assume that the number of smulaton replcates (symbol s) s gven, and s small compared wth the bootstrap sample sze b (Breman 1992, p 750 also dscusses bootstrappng versus replcatng, but not n a smulaton context) To provde some background of our research, we now summarze the lterature on bootstrappng n smulaton Fredman and Fredman (1995) provde two academc examples Km, Wlleman, Haddock, and Runger (1993) formulate ther so-called threshold bootstrap for the analyss of autocorrelated smulaton outputs Several authors nvestgate bootstrappng of emprcal nput dstrbutons n smulaton: Barton and Schruben (1993), Cheng (1995), Cheng and Holland (1997), and Prtsker (1998) Bootstrappng for valdaton of metamodels s done by Klejnen, Feelders, and Cheng (1998) A summary of the present paper s Klejnen, Cheng, and Bettonvl (2000) Our man concluson wll be: f a trace-drven smulaton model s run more than twce (s > 2), then bootstrappng any statstc gves acceptable (albet conservatve) type I error probablty; the smplest statstc (the average devaton) has good power compared wth the more complcated statstcs The remander of ths artcle s organzed as follows 2 summarzes KLEIJ s F-statstc based on regresson analyss, and proposes fve more valdaton statstcs 3 recaptulates EFRON s bootstrappng of tme seres; EFRON uses blocks, whch we nterpret as termnatng subruns 4 derves three bootstrap procedures for trace-drven smulatons, usng one, two, or more than two smulaton Klejnen, Cheng, and Bettonvl 883 replcatons per subrun; moreover, n case of more than two replcates the resamplng may be ether condtonal or uncondtonal To evaluate these sx valdaton statstcs and four bootstrap technques, 5 desgns a Monte Carlo experment wth queueng models that generate 'real' and smulated sojourn tmes 6 nterprets the results of ths extensve Monte Carlo experment 7 presents conclusons and topcs for future research 2 SIX TESTS FOR VALIDATION The bootstrap enables estmatng the dstrbuton of any statstc, provded the statstc s a contnuous functon of the observatons (eg, the medan s not a contnuous functon) For the valdaton of trace-drven smulatons we nvestgate sx statstcs, denoted as T 1 through T 6 KLEIJ calls a smulaton model vald f the real and the smulated systems have () dentcal means (say) µ x = µ y and () dentcal varances σ 2 x = σ 2 y To test ths composte hypothess, KLEIJ computes the dfferences D = X - Y and the sums Q = X + Y, and regresses D on Q: E(D *Q ' q) ' γ 0 % γ 1 q The null-hypothess then becomes H 0 : γ 0 = 0 and γ 1 = 0 To test ths H 0, KLEIJ computes the two Sums of Squared Errors or SSEs that correspond wth the full and the reduced regresson model: SSE full ' j (D & ˆD ) 2 wth ˆD ' C 0 % C 1 Q where C 0 and C 1 are the Ordnary Least Squares (OLS) estmators of γ 0 and γ 1 ; and SSE reduced ' j D 2 These two SSEs gve the frst valdaton statstc: Y T 1 ' [SSE reduced & SSE full ]/2 SSE full /(n & 2) If X and are nd (see 1), the statstc n Equaton (1) has an F-dstrbuton wth 2 and n - 2 degrees of freedom (df) If ths statstc s sgnfcantly hgh, then KLEIJ concludes that the smulaton model s not vald We propose another valdaton statstc wth ntutve appeal to smulaton practtoners, namely the average absolute predcton error, T 2 = j *D */n (also see Klejnen and Sargent, 1999) A thrd statstc related to the two precedng statstcs s the mean squared devaton (MSE), T 3 = j D 2 /n A fourth statstc s the average devaton, T 4 = j D /n = X & Ȳ A dsadvantage of ths statstc s that postve model errors may compensate negatve errors, and vce versa (Ths phenomenon may be gnored f a wrong smulaton model always underestmates - or always overestmates - the real response whatever the trace s; moreover, ths statstc allows bootstrappng n case of a sngle smulaton run; see 41) (1)

3 Klejnen, Cheng, and Bettonvl The next statstc s the average relatve error, T 5 = j (Y /X )/n, whch s often used n practce Obvously ths statstc assumes that no X s zero; actually, the event X = 0 may occur wth non-neglgble probablty n queueng applcatons wth empty startng states, no excessvely saturated traffc rates, and short subruns (see 6) Fnally, T 6 compares ˆF x and ˆF y, the estmated dstrbuton functon (EDF) computed from the n observatons on X and Y respectvely: 4 T 6 ' * ˆF m x (z) & Ĝ y (z)*dz (2) &4 Note that n Equaton (2) we use the L 1 norm, not the L 2 or the L 4 norms KLEIJ s statstc T 1 also tests equalty of varances, whereas T 2 through T 5 consder only equalty of means More crtera or measures for model selecton are examned n detal n the monograph by Lnhart and Zucchn (1986) 3 EFRON s BOOTSTRAP FOR TIME SERIES EFRON (p 91) assumes a sample of n d observatons Z wth = 1,, n (Hence, n our case we defne Z = (X, Y); see 1) EFRON summarzes the sample data through a statstc T = s ( Z 1,, Z n ) (In our case: T j = s j ( Z 1,, Z n ) wth j = 1,, 6) Bootstrappng means that the orgnal values z are randomly resampled wth replacement, n tmes So, f the superscrpt * ndcates bootstrappng, then the bootstrap observatons are Z ( Ths bootstrap sample gves one observaton on the bootstrap statstc T ( ' s(z ( 1,, Z( n ) To estmate the dstrbuton of ths statstc, the whole bootstrap procedure s repeated b tmes Sortng these b observatons on T ( gves the order statstcs T (,,, and the estmated α quantle (1) T ( (b) T ( (lbαm) of ts dstrbuton, Ths procedure gves a two-sded 1- α confdence nterval for the orgnal statstc T, rangng from the lower estmated α/2 quantle to the upper 1 - α/2 quantle (Alternatve confdence ntervals are dscussed n EFRON and Shao and Tu 1995) Ths nterval can be used for hypothess testng, as we shall see; also see EFRON (p 169) We saw that ths bootstrap assumes d sample observatons Z, but EFRON (pp ) also presents a bootstrap for tme seres, called movng blocks (also see Shao and Tu 1995, pp , ) In our smulaton context we nterpret these blocks as subruns So we have n non-overlappng subruns, each startng n the empty state and each of length k; we do not elmnate the transent phase We shall elaborate our approach n the next secton 4 BOOTSTRAP OF VALIDATION TESTS IN TRACE-DRIVEN SIMULATION We assume a reasonable number of d subruns; more specfcally, we use the same numbers as KLEIJ (p 815): n s ether 10 or 25 We dstngush three stuatons for the number of smulaton runs, for whch we develop dfferent bootstrappng technques: s s 1, 2, or more 41 A Sngle Smulaton Run: s = 1 By assumpton, the n pars ( X, ) are mutually ndependent (as the n subruns are assumed ndependent) Moreover, these pars are dentcally dstrbuted f we do not condton on the trace varable A ; we assumed the latter varable to be d So we bootstrap the n orgnal pars, whch gves the n bootstrap pars ( X (, Y ( ) These bootstrap pars result n the bootstrap valdaton statstcs T ( 1 through T ( 6, albet not necessarly under the null-hypothess of a vald trace-drven smulaton model We repeat ths bootstrappng b tmes, to obtan an estmated 1 - α confdence nterval for each valdaton statstc We have an ntutve target or hypotheszed value for T 4 (= j D /n ), namely zero The two-sded confdence nterval ranges from the lower α /2 quantle to the upper 1 - α /2 quantle of the bootstrap dstrbuton If the confdence nterval does no cover ths target value, then we reject the smulaton model We follow a smlar approach for T 5 (= j (Y /X )/n ), but now wth a target value of one We have no target values for the other four statstcs However, we may compare the frst statstc, wth the tabulated 1 - α quantle of the F-statstc wth 2 and n - 2 degrees of freedom, F 1 & α 2, n & 2 (no bootstrappng) Moreover, for ths statstc we frst apply the normalzng logarthmc transformaton: replace x by log(x) and y by log(y) provded x and are not zero (also see KLEIJ) y 42 Only Two Smulaton Replcates: s = 2 When s = 2 we bootstrap the two replcates of the smulaton model: we replace the par ( X, Y ) by ( Y (1), Y (2) ) Ths yelds a bootstrap confdence nterval per statstc T, under the null-hypothess of a vald trace-drven smulaton model We also have two observatons on each orgnal valdaton statstc under the alternatve hypothess, namely T ' s((x 1, Y (r) 1 ),, (X n, Y (r) n )) wth r = 1, 2 We reject the smulaton model f any of these two observatons on T falls outsde the 1 - α/2 bootstrap confdence nterval: we use α/2 nstead of α because of Bonferron s nequalty (obvously we may also replace any by the maxmum ) Y 884

4 Klejnen, Cheng, and Bettonvl 43 More than Two Smulaton Replcates: s > 2 When s > 2 we proceed smlarly to the case s = 2 However, we now dstngush two approaches: (a) condton on the trace; (b) do not condton on the trace (a) Condtonng: From each column of Table 1 we sample two observatons Y (r) and Y (r ) ) wth r r ) (n the orgnal sample the probablty of a par wth dentcal values s zero n case of contnuous X and Y, so we requre r r ) ) From these n bootstrap pars we compute the valdaton statstc T ( After b repettons we compute a 1 - α confdence nterval for ths T (, as n the case s = 2 (b) No condtonng: Ths approach assumes that the traced varables A are d So now we resample n pars from the whole table More precsely, frst we sample one value from the s n values of Y; next we sample wthout replacement a second value from the remanng sn - 1 values, gvng one bootstrap par; the next par s sampled after replacng the precedng par, etc Let us compare approaches (a) and (b), focusng on the smplest valdaton statstc T 4 Then we see that the expected values of all dfferences between replcated smulaton responses are zero, n both approaches Ther varances, however, are smaller n approach (a): blockng s a well-known varance reducton technque n the desgn of experments So we expect condtonal resamplng to yeld more powerful tests (ths wll turn out to be true: see 6) Analogous to the s = 2 case, we agan compare one real response X wth each of the s smulated responses Y (r) We reject the smulaton model f any of these s values falls outsde the 1 - α/s confdence nterval (Bonferron) 44 Asymptotc Results: Large n In the Appendx we derve asymptotc results for the smplest bootstrap valdaton statstc T ( 4 (ths statstc wll turn out to have the greatest practcal relevance; see 6) We can prove that as n tends to nfnty, the EDF of T ( 4 tends unformly to the EDF of the orgnal statstc T 4, for all four bootstrap methods defned n 41 through 43 Ths unform convergence s mportant f confdence ntervals wth the correct coverage are to be constructed Of course, ths convergence s only asymptotc; our Monte Carlo experments n 6 estmate small-sample performance 45 Mnmal Bootstrap Sample Sze A classc value for b s 1,000; see EFRON (p275), Andrews and Buchnsky (1996), and also Barton and Schruben (1993) and Shao and Tu (1995, pp ) We shall use ths classc value, but also a much smaller value Actually, we are not nterested n the whole dstrbuton functon (say) g of the bootstrapped statstc T (, but only n ts α/2 and 1 - α/2 quantles (we reject the null-hypothess f the value of the orgnal statstc T does not fall between these two quantles) To estmate ths dstrbuton functon g, we sort the b observatons on T (, whch gves T ( (1),, T ( (b) Hence, g(t ( (1) ),, g(t ( () ),, g(t ( (b) ) s an ordered sample from a unform dstrbuton on [0, 1) The expected value of g(t ( () ) s /(b + 1) Consequently, f we take the mnmal bootstrap sample sze, then our estmator of the lower α/2 quantle s the smallest order statstc, namely T ( (1) Lkewse the largest order statstc T ( (b) estmates the upper 1 - α/2 quantle It s easy to prove that the mnmum value for b s b mn ' (2/α) & 1 (3) For example, α = 01 gves b = 19; we shall use ths value (besdes the classc value of 1,000; see 5) However, when we have more than one smulaton replcate (s > 1), then we apply Bonferron s nequalty so α s replaced by α/s For example, for α = 01 and s =10 Equaton (3) gves 199 (stll much smaller than 1,000) Actually, we shall report on b = 19 even when s > 1: we then avod Bonferron s nequalty by randomly selectng a sngle value from the s values for the valdaton statstc computed from the orgnal (non-bootstrapped) observatons on X and Y (r) We reject the smulaton model f ths one value les outsde the bootstrap confdence nterval 5 DESIGN OF QEUEING EXPERIMENTS For the type I error rate of the valdaton tests we use an α of 001, 005, and 010 respectvely These values determne whch quantles of the bootstrap dstrbuton should be used as thresholds (Of course, the hgher α s, the hgher the power s) We focus on α = 010 because t gves the smallest relatve varance for our Monte Carlo results (see 6); besdes, ths value s the only one that we can use for b = 19 Followng KLEIJ, we start wth M/M/1 smulaton models, whch generate 'real' and smulated ndvdual sojourn tmes W and V So these models have Posson arrval and servce parameters (say) λ a ' 1/µ a and λ s ' 1/µ s where µ a and µ s denote the means of the nterarrval and servce tmes We use a tlde to denote a parameter of the smulaton model; for example, λ s refers to the smulaton model, whereas λ s denotes the real parameter 885

5 To study the type I error of the valdaton tests, we use a smulaton model and a real system wth equal servce rates (arrval tmes are on the trace, so smulated and real arrval tmes are the same); hence smulated and real traffc rates are the same: ρ = ρ We use an mperfect smulaton model: the real and the smulated servce tmes use dfferent pseudorandom numbers We examne the followng three factors - followng KLEIJ (p 815) - n a 2 3 desgn: () number of jobs per subrun, k: 10 and 1,000 (affects the degree of nonnormalty); () number of subruns, n: 10 and 25 (affects the convergence of the bootstrap dstrbuton); () real traffc load, ρ : 05 and 10 (affects the cross-correlaton caused by the common trace) To study the type II error, we use unequal smulated and real rates For real load ρ = 05 and number of jobs per subrun k = 1,000 we use ρ = 046, 048, 052, and 054; for k = 10 we use 03, 04, 06, and 07 For ρ = 1 and k = 1,000 we use 096, 098, 102, and 104; for k = 10 we use 08, 09, 12, and 14 (For more extreme values of ρ the estmated power reaches 1) Stll followng KLEIJ (p 815), we use 1,000 macroreplcatons; by defnton, each macro-replcaton ether rejects or accepts a specfc smulaton model (Each macroreplcaton requres b bootstraps; each bootstrap requres kn observatons on the real and the smulated ndvdual outputs) Because we use many pseudorandom numbers, we select our generator wth some care: we use a generator proposed by L Ecuyer (1999), called MRG32k3a wth a cycle length of the order We select seeds randomly All sx valdaton tests use the same data ( X, Y (r) ), whch mproves the comparson of these tests The three values for α also gve postvely correlated results To obtan more general results, we extend KLEIJ: we also use M/G/1 smulaton models where we let G stand for servce tmes wth a gamma dstrbuton (Cheng 1998 gves generators for ths dstrbuton famly; the exponental dstrbuton belongs to ths famly) The real system remans M/M/1 We lmt the desgn to a sngle combnaton of the three factors: traffc load 10, number of jobs per subrun 1,000, number of subruns 10 Fnally, we extend our Monte Carlo study to smulatons wth other prorty rules, namely shortest processng tme (SPT) and longest processng tme (LPT) We use the same factor combnaton as for M/G/1 6 MONTE CARLO RESULTS Our Monte Carlo experments wth varous sngle-server queues result n estmated type I and II error probabltes of our sx valdaton statstcs for fve bootstrap procedures Klejnen, Cheng, and Bettonvl 886 If ths type I error probablty equals the prespecfed (nomnal) value α, we call the valdaton test acceptable: H 0 : E(Â) ' α (4) where  denotes the Monte Carlo estmator of that probablty - wth values ˆα If no statstc satsfes ths condton, we accept a conservatve valdaton procedure (Bonferron s nequalty mples such conservatsm): n Equaton (4) we replace = by # Gven Equaton (4), ths error probablty has a bnomal dstrbuton wth varance α( 1 - α)/1000 (we have 1,000 macro-replcatons) For example, α = 010 gves a standard devaton of We use the normal approxmaton s factor 196 (95% confdence nterval) to test the sgnfcance of the devaton between observed and nomnal type I error probablty: we reject H 0 f *ˆα & α* > In case of a conservatve, one-sded test we accept an ˆα smaller than 01156; see the results prnted n bold n the tables below (There s no need for multple comparsons or jont nferences, whch mght use Bonferron) If several statstcs have acceptable type I error probabltes, then we compare ther estmated type II error probabltes (power complement) How to nterpret the massve amount of data generated by our Monte Carlo experments? We thnk that the prmary user queston s: whch valdaton statstc should be used, gven that t s known how many smulaton replcates are avalable? Remember that when s = 1 we should bootstrap only those two statstcs that have ntutve target values, namely T 4 and T 5 (for T 1 we use the F table) The answer may also depend on other known characterstcs of the gven smulaton, namely the number of d subruns, n If the smulaton represents a queueng system, then another known characterstc mght be the number of customers per subrun (k), the traffc load (ρ), and the queueng dscplne (FIFO, LPT, etc) Some queueng smulatons, however, may be much more complcated than the sngle-server systems that we study, so these characterstcs are of secondary nterest We start our analyss of all these Monte Carlo results by studyng ˆα (type I error) Though we have 2 3 combnatons of ρ, k, and n (see 5), we present data only for the hgh ρ and the low n; see Table 2 We do gve results for both k values, because ths factor may exclude the use of certan valdaton statstcs (namely, T 5 ) and strongly affect nonnormalty of the performance measures X and Y Further, for s = 1 we also present the statstc T 1 as appled by KLEIJ usng the F-table (nstead of bootstrappng) after the normalzng transformaton log(x) and log (Y) Fnally, for s > 2 we may condton on the trace or not, but Table 2 shows

6 Klejnen, Cheng, and Bettonvl results for condtonng only: we found that condtonng does ndeed mprove the power whle mantanng the type I error Part A gves results for short subruns (k = 10) Case s = 1: Not applcable (N/A) holds for T 2, T 3, and T 6 because they have no practcal thresholds; T 5 has a denomnator X = 0 wth hgh probablty so t s also N/A The table look-up of T 1 gves a worse error probablty than bootstrappng the smple statstc T 4 ; nevertheless, even the latter statstc gves sgnfcantly hgh ˆα Case s = 2: Acceptable - though conservatve - results are gven by bootstrappng T 6 Case s = 5: Our bootstrappng gves acceptable - but conservatve - ˆα, except for T 2 and T 3 Case s = 10: Bootstrappng any statstc gves acceptable ˆα Ths case gves results more conservatve than s = 5: Bonferron becomes more conservatve as s ncreases We can prove that as s ncreases for fxed n, then the EDFs of the orgnal statstc T and the bootstrap statstc T ( converge Because ths proof s rather techncal we do not gve t here Part B gves results for long subruns (k = 1,000) Case s = 1: KLEIJ s procedure gves an acceptable result; n long runs the nonnormalty dsappears after the log transformaton Case s = 2: Acceptable but conservatve results are agan gven by bootstrappng T 6 Case s = 5: Bootstrappng the smple statstc T 4 gves acceptable ˆα Case s = 10: Our bootstrap gves acceptable - but conservatve - ˆα for any statstc except T 2 and T 3 Altogether Table 2 suggests the followng conclusons Case s = 1: All valdaton statstcs gve observed type I error probabltes sgnfcantly hgher than the nomnal α, except for KLEIJ s procedure when long subruns are used Case s = 2: Bootstrappng T 6 gves best conservatve results Case s = 5: Bootstrappng the smple statstc T 4 gves acceptable ˆα Case s = 10: Bootstrappng any statstc - except for T 2 and T 3 - gves acceptable ˆα, albet rather conservatve for short subruns The next queston s: whch of the acceptable valdaton statstcs has the hghest power? Table 3 shows the estmated power for these statstcs, for a gven combnaton of s and k We select four smulated traffc rates ρ that dffer from the real rate ρ = 1 (see the four rows) Obvously, any statstc has more power as the smulated load devates more from the real load (read wthn columns) Further, any statstc can detect smaller devatons between real and smulated traffc rates when k s larger (10 versus 1,000) For s > 2 the bootstrapped smple statstc T 4 has good power compared wth the more complcated statstcs We also obtan results for other systems than M/M/1/FIFO (see 5) However, gven the conclusons so far, we focus on T 4 when nterpretng these results Then t suffces to state that the above conclusons also hold for these systems! Table 4 gves estmated type I error probabltes n case of the mnmum bootstrap sample sze (b = 19) These probabltes are smlar to Table 2, though less conservatve when k = 10 Our results (not dsplayed to save space) further show that the power s smaller than n case of a large bootstrap sample sze (for s > 1 we use Bonferron s nequalty n Table 3, whereas we now randomly select one of the s values; whch confounds the effects of small b and usng only one of the s values) 7 CONCLUSIONS AND FUTURE RESEARCH In general, bootstrappng s a versatle tool, as t allows the estmaton of the dstrbuton of any statstc T(Z) for any type of nput dstrbuton for Z However, ths tool requres masterng the art of modelng: the researchers stll have to nterpret ther problems Indeed, EFRON (pp 115, 383) states bootstrappng s not a unquely defned concept alternatve bootstrap methods may coexst More specfcally, for valdaton n smulaton we focused on statstcal tests for the valdaton of trace-drven termnatng smulatons wth d response Y Gven the d real response X, we proposed sx valdaton statstcs T j (X, Y)(j = 1,, 6) The pars (X, Y) are correlated, and may be non-normally dstrbuted We developed dfferent bootstrap methods that vary wth the number of smulaton replcates (symbol s) All these methods use subruns When we have more than two replcates (s > 2), we ether condton or we do not condton on the trace To evaluated and llustrate the resultng tests, we appled them to sngle-server queueng smulaton models wth dfferent prorty rules Whether these Monte Carlo results Further, these conclusons suggest that - for bootstrapped valdaton - a trace-drven smulaton model be run more than twce (usng dfferent random numbers) 887

7 Klejnen, Cheng, and Bettonvl hold for other applcatons, requres further research; the current results mght be seen as rules of thumb These rules are as follows Case s = 1: Most valdaton statstcs gve type I error probabltes hgher than the nomnal α If a normalzng transformaton can be found, then follow KLEIJ; that s, use the F-table wthout bootstrappng Case s = 2: Bootstrappng T 6 gves acceptable - but conservatve - results Case s > 2: Many statstcs gve acceptable - possbly conservatve - ˆα So we recommend to run a trace-drven smulaton model more than twce The smplest statstc, namely the average devaton T 4 = j D /n, has good power compared wth the more complcated statstcs A surprsngly small bootstrap sample sze mght suffce to quckly decde on the valdty of a smulaton model Then, lttle extra computer tme s needed for bootstrappng Nevertheless, f the small bootstrap sample results n a borderlne value for the valdaton statstc, then we recommend a larger bootstrap sample - especally snce n practce bootstrappng requres far less computer tme than smulaton does In future research we mght extend our analyss to other termnatng smulatons (eg, queueng networks), and to steady-state and non-statonary smulatons For example, f the trace does not reman statonary over subruns, then we may condton and resample one response from each subrun (column n Table 1; see 43) Whereas we use subruns, EFRON uses overlappng blocks; also see Shao and Tu (1995, pp ) Such a samplng procedure has also been explored n nontermnatng, statonary smulaton: see Sherman (1995) We mght also study a complcaton that KLEIJ mentoned but dd not solve: a more general null-hypothess states that the dfference between the real and the smulated systems expected values s smaller than some postve constant δ, not necessarly zero:: *E(X) & E(Y)* < δ Snce bootstrappng uses smulaton (Monte Carlo for resamplng the orgnal values z), typcal smulaton problems may be further explored n a bootstrappng context For example, the determnaton of the sample sze n quantle estmaton s a standard problem n smulaton; see Alexopoulos and Sela (1998) We add that computer tme may be saved by not takng a fxed sample sze b for the bootstrap Instead, we may use Wald s sequental probablty rato test (SPRT); see Ghosh and Sen (1991) Varance reducton technques may also be appled to bootstrappng Indeed, Shao and Tu (1995, pp ) dscuss antthetc and mportance samplng n bootstrappng We assumed that the number of replcates s s so small that bootstrappng s needed If, however, (say) s = 100, then we can use classc tests such as Student s t test, a dstrbuton-free test (eg, sgn test, rank test), or goodnessof-ft tests (see D'Agostno and Stephens (1986) and Vncent (1998)) APPENDIX: CONVERGENCE OF EDFs OF T ( 4 AND AS n INCREASE T 4 We gve a theoretcal backng for the condtonal samplng bootstrap method descrbed n 43: for T 4 (the statstc we recommend) we show that T ( 4 & E(T( 4 ) has the same asymptotc dstrbuton as T 4 & E(T 4 ), as n tends to nfnty Condtonal samplng s both the most nterestng and the most dffcult case Here a bootstrap sample has the form {Z ( ' Y U() E(Z ( ) ' 1 s(s & 1) j (u, v)0c (Y u & Y v ), E(Z (2 ) ' & Y V() ; ' 1, þ, n} (A-1) where (U(), V()) are d pars of random values selected from the s(s - 1) dstnct pars C = { r, r ) ; r, r ) = 1,, s, r r ) }, wth all pars beng equally lkely to be selected Ths gves 1 s(s & 1) j (Y u & Y v )2 (u, v)0c (A-2) Elementary consderatons show that E(T ( 4 ) and Var(T ( 4 ) are exactly the same as n the uncondtonal case; moreover wth probablty 1, E(T ( 4 ) 6 E(T 4 ) and Var(T ( 4 ) 6 Var(T 4 ) However the form of the moments n Equaton (A-2) shows that the Z ( are not dentcally dstrbuted Thus we need an addtonal assumpton to guarantee that T ( 4 s asymptotcally normal Theorem: Let T ( 4 be calculated from the condtonal bootstrap sample n Equaton (A-1) where s > 2 Let and τ ' E Y [Z ( & E(Z ( )] 2 < 4 κ ' E Y Z ( & E(Z ( ) 3 < 4, 888

8 Klejnen, Cheng, and Bettonvl where the outer expectatons are taken wth respect to Y ' (Y (1),, Y (s) ), the s observatons smulated Further, let c(z) ' Pr[ n (T 4 & E(T 4 )) # z] c ( (z) ' Pr[ n (T ( 4 & T 4 ) # z] Then wth probablty 1 we have Proof: Let sup z c(z) & c ( (z) 6 0 B n ' j Var(Z ( ), C n ' j E( Z ( & E(Z ( ) 3 ) Then by the strong law of large numbers n 1/2 B &3/2 n C n 6 τ &3/2 κ wth probablty 1 as n 6 4 Thus B &3/2 n C n 6 0 (A-3) wth probablty 1 as n 6 4 It follows by Lyapunov s Theorem (gven n eg Petrov (1995) as Theorem 49) that T ( 4 s asymptotcally normally dstrbuted wth probablty 1 Wth probablty 1 we have E(T ( 4 ) 6 E(T 4 ) and Var(T ( 4 ) 6 Var(T 4 ) so we can apply Theorem 67 n Hjorth (1994) (see also Sngh (1981) and Bckel and Freedman (1981), to show that Equaton (A-3) holds REFERENCES Alexopoulos, C and AF Sela 1998, Output Data Analyss Handbook of Smulaton, edted by Jerry Banks, Wley, New York Andrews, DWK and M Buchnsky 1996, On the number of bootstrap repettons for bootstrap standard error estmates Cowles Foundaton Dscusson Paper no 1141, Yale Unversty, PO Box , New Haven, Connectcut Barton, RR and LW Schruben 1993, Unform and bootstrap resamplng of emprcal dstrbutons In Proceedngs of the 1993 Wnter Smulaton Conference, ed GW Evans et al, IEEE, Pscataway, NJ Bckel, PJ and Freedman, DA 1981 Some asymptotc theory for the bootstrap Annals of Statstcs, 9, Breman, L 1992, The lttle bootstrap and other methods for dmensonalty selecton n regresson: x-fxed predcton error Journal Amercan Statstcal Assocaton, 87, no 419, pp Cheng, RCH 1995, Bootstrap methods for computer smulaton experments Proceedngs of the 1995 Wnter Smulaton Conference, ed C Alexopoulos, K Kang, WR Llegdon, and D Goldsman , Random varate generaton Handbook of Smulaton, edted by J Banks, Wley, New York --- and W Holland 1997, Senstvty of computer smulaton experments to errors n nput data Journal Statstcal Computaton and Smulaton, 57(1-4): D' Agostno, RD and HA Stephens, edtors 1986, Goodness-of-ft dstrbutons Marcel Dekker, New York Davson, AC and DVHnkley, Bootstrap methods and ther applcaton, CUP Efron, B and RJ Tbshran (1993), Introducton to the Bootstrap Chapman & Hall, New York Fredman, LW and HH Fredman (1995), Analyzng smulaton output usng the bootstrap method Smulaton, 64(2): Ghosh, BK and PK Sen (1991), Handbook of Sequental Analyss Marcel Dekker, New York Hjorth, JSU (1994) Computer ntensve statstcal methods, Chapman & Hall, London Km, YB, TR Wlleman, J Haddock, and GC Runger (1993), The threshold bootstrap: a new approach to smulaton output analyss In Proceedngs of the 1993 Wnter Smulaton Conference, ed GW Evans, M Mollaghasem, EC Russell, and WE Bles Klejnen (1999) Valdaton of models: statstcal technques and data avalablty Proceedngs of the 1999 Wnter Smulaton Conference, (ed by PA Farrngton, HB Nembhard, DTSturrock, and GWEvans) ---, B Bettonvl, and W Van Groenendaal (1998), Valdaton of trace drven smulaton models: a novel regresson test Management Scence, 44: , RCH Cheng, and B Bettonvl (2000), Valdaton of trace-drven smulaton models: bootstrapped tests Management Scence(under revew) ---, AJ Feelders, and RCH Cheng (1998)Bootstrappng and valdaton of metamodels n smulaton Proceedngs of the 1998 Wnter Smulaton Conference --- and RG Sargent (1999), A methodology for fttng and valdatng metamodels n smulaton European Journal Operatonal Research (accepted) L Ecuyer, PL (1999), Good parameter sets for combned multple recursve random number generators Operatons Research, 47(1) Lnhart, H and W Zucchn (1986), Model selecton Wley, New York 889

9 Klejnen, Cheng, and Bettonvl Mooney, CZ and RD Duval (1993), Bootstrappng: a nonparametrc approach to statstcal nference Sage Publcatons, Newbury Park, Calforna Moors, JJA and LWG Strjbosch (1998), New proposals for the valdaton of trace-drven smulatons Communcatons n Statstcs: Smulaton and Computaton, 27(4): Petrov, VV (1995), Lmt theorems of probablty theory, Oxford Unversty Press, Oxford Prtsker, AA (1998), Lfe & death decsons OR/MS Today, 25(4): Shao, J and D Tu (1995), The jackknfe and bootstrap Sprnger-Verlag, New York Sherman, M (1995), On batch means n the smulaton and statstcs communtes In Proceedngs of the 1995 Wnter Smulaton Conference, ed C Alexopoulos, K Kang, WR Llegdon, and D Goldsman Sngh, K (1981) On the asymptotc accuracy of Efron's bootstrap Annals of Statstcs, 9: Vncent, S (1998), Input data analyss Handbook of smulaton, edted by J Banks, Wley, New York ACKNOWLEDGMENT Cheng and Klejnen acknowledge the NATO Collaboratve Research Grants Programme s fnancal support for ther project on 'Senstvty analyss for mproved smulaton modelng' Table 2: Estmated Type I Error Probablty of Valdaton Statstc (T) for Varyng Number of Smulaton Replcates (s) of M/M/1/FIFO wth Number of Customers per Subrun k, Traffc Rate ρ = 1; Number of Subruns n = 10; Nomnal α = 010; Bootstrap Sample Sze b = 1,000; Bold Numbers Denote Acceptable Results 1) F-table used (nstead of bootstrap) after normalzng transformaton log(x) and log (Y) (A) Number of Customers per Subrun k = 10 s T 1 T 2 T 3 T 4 T 5 T ) N/A N/A 174 N/A N/A (B) Number of Customers per Subrun k = 1,000 s T 1 T 2 T 3 T 4 T 5 T ) N/A N/A N/A

10 Klejnen, Cheng, and Bettonvl Table 3: Estmated Power of Acceptable Statstcs T for Varyng Smulated Traffc Rates ρ and Fxed Real Traffc Rate ρ = 1 (for Remanng Symbols See Table 2) s = 1; k = 1,000 s = 2; k = 10 s = 2; k = 1,000 ρ T 1 ρ T 1 T 6 ρ T 1 T s = 5; k = 10 s = 5; k = 1,000 ρ T 1 T 4 T 5 T 6 ρ T s = 10; k = 10 ρ T 1 T 2 T 3 T 4 T 5 T s = 10; k = 1,000 ρ T 1 T 4 T 5 T

11 Klejnen, Cheng, and Bettonvl Table 4: Estmated Type I Error Probablty Usng Small Bootstrap Sample Sze b = 19 (Remanng Symbols Defned n Table 2) k = 10 s T 1 T 2 T 3 T 4 T 5 T ) N/A N/A 160 N/A N/A k = 1,000 s T 1 T 2 T 3 T 4 T 5 T ) N/A N/A N/A AUTHOR BIOGRAPHIES JACK PC KLEIJNEN s a Professor of Smulaton and Informaton Systems Hs research concerns smulaton, mathematcal statstcs, nformaton systems, and logstcs; ths research resulted n sx books and nearly 160 artcles He has been a consultant for several organzatons n the USA and Europe, and has served on many nternatonal edtoral boards and scentfc commttees He spent several years n the USA, at both unverstes and companes, and receved a number of nternatonal fellowshps and awards More nformaton s provded on hs web page: < centerkubnl/staff/klejnen> RUSSELL CH CHENG s Professor of Operatonal Research at the Unversty of Southampton He has an MA and the Dploma n Mathematcal Statstcs from Cambrdge Unversty, England He obtaned hs PhD from Bath Unversty He s Charman of the UK Smulaton Socety, a Fellow of the Royal Statstcal Socety, Member of the Operatonal Research Socety Hs research nterests nclude: varance reducton methods and parametrc estmaton methods He s Jont Edtor of the IMA Journal on Mathematcs Appled to Busness and Industry BERT BETTONVIL s Assocate Professor at the Department of Informaton Systems of Tlburg Unversty Educated as a mathematcal statstcan, hs research nterests are n the feld of statstcal aspects of smulaton and n research methodology 892

Assignment 5. Simulation for Logistics. Monti, N.E. Yunita, T.

Assignment 5. Simulation for Logistics. Monti, N.E. Yunita, T. Assgnment 5 Smulaton for Logstcs Mont, N.E. Yunta, T. November 26, 2007 1. Smulaton Desgn The frst objectve of ths assgnment s to derve a 90% two-sded Confdence Interval (CI) for the average watng tme

More information

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests Smulated of the Cramér-von Mses Goodness-of-Ft Tests Steele, M., Chaselng, J. and 3 Hurst, C. School of Mathematcal and Physcal Scences, James Cook Unversty, Australan School of Envronmental Studes, Grffth

More information

Chapter 13: Multiple Regression

Chapter 13: Multiple Regression Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to

More information

x = , so that calculated

x = , so that calculated Stat 4, secton Sngle Factor ANOVA notes by Tm Plachowsk n chapter 8 we conducted hypothess tests n whch we compared a sngle sample s mean or proporton to some hypotheszed value Chapter 9 expanded ths to

More information

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands Content. Inference on Regresson Parameters a. Fndng Mean, s.d and covarance amongst estmates.. Confdence Intervals and Workng Hotellng Bands 3. Cochran s Theorem 4. General Lnear Testng 5. Measures of

More information

BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS. M. Krishna Reddy, B. Naveen Kumar and Y. Ramu

BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS. M. Krishna Reddy, B. Naveen Kumar and Y. Ramu BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS M. Krshna Reddy, B. Naveen Kumar and Y. Ramu Department of Statstcs, Osmana Unversty, Hyderabad -500 007, Inda. nanbyrozu@gmal.com, ramu0@gmal.com

More information

Comparison of Regression Lines

Comparison of Regression Lines STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence

More information

Chapter 5 Multilevel Models

Chapter 5 Multilevel Models Chapter 5 Multlevel Models 5.1 Cross-sectonal multlevel models 5.1.1 Two-level models 5.1.2 Multple level models 5.1.3 Multple level modelng n other felds 5.2 Longtudnal multlevel models 5.2.1 Two-level

More information

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA 4 Analyss of Varance (ANOVA) 5 ANOVA 51 Introducton ANOVA ANOVA s a way to estmate and test the means of multple populatons We wll start wth one-way ANOVA If the populatons ncluded n the study are selected

More information

STAT 3008 Applied Regression Analysis

STAT 3008 Applied Regression Analysis STAT 3008 Appled Regresson Analyss Tutoral : Smple Lnear Regresson LAI Chun He Department of Statstcs, The Chnese Unversty of Hong Kong 1 Model Assumpton To quantfy the relatonshp between two factors,

More information

Department of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution

Department of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution Department of Statstcs Unversty of Toronto STA35HS / HS Desgn and Analyss of Experments Term Test - Wnter - Soluton February, Last Name: Frst Name: Student Number: Instructons: Tme: hours. Ads: a non-programmable

More information

First Year Examination Department of Statistics, University of Florida

First Year Examination Department of Statistics, University of Florida Frst Year Examnaton Department of Statstcs, Unversty of Florda May 7, 010, 8:00 am - 1:00 noon Instructons: 1. You have four hours to answer questons n ths examnaton.. You must show your work to receve

More information

Statistics II Final Exam 26/6/18

Statistics II Final Exam 26/6/18 Statstcs II Fnal Exam 26/6/18 Academc Year 2017/18 Solutons Exam duraton: 2 h 30 mn 1. (3 ponts) A town hall s conductng a study to determne the amount of leftover food produced by the restaurants n the

More information

Statistics for Economics & Business

Statistics for Economics & Business Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable

More information

Joint Statistical Meetings - Biopharmaceutical Section

Joint Statistical Meetings - Biopharmaceutical Section Iteratve Ch-Square Test for Equvalence of Multple Treatment Groups Te-Hua Ng*, U.S. Food and Drug Admnstraton 1401 Rockvlle Pke, #200S, HFM-217, Rockvlle, MD 20852-1448 Key Words: Equvalence Testng; Actve

More information

Chapter 11: Simple Linear Regression and Correlation

Chapter 11: Simple Linear Regression and Correlation Chapter 11: Smple Lnear Regresson and Correlaton 11-1 Emprcal Models 11-2 Smple Lnear Regresson 11-3 Propertes of the Least Squares Estmators 11-4 Hypothess Test n Smple Lnear Regresson 11-4.1 Use of t-tests

More information

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4) I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes

More information

Economics 130. Lecture 4 Simple Linear Regression Continued

Economics 130. Lecture 4 Simple Linear Regression Continued Economcs 130 Lecture 4 Contnued Readngs for Week 4 Text, Chapter and 3. We contnue wth addressng our second ssue + add n how we evaluate these relatonshps: Where do we get data to do ths analyss? How do

More information

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore Sesson Outlne Introducton to classfcaton problems and dscrete choce models. Introducton to Logstcs Regresson. Logstc functon and Logt functon. Maxmum Lkelhood Estmator (MLE) for estmaton of LR parameters.

More information

x i1 =1 for all i (the constant ).

x i1 =1 for all i (the constant ). Chapter 5 The Multple Regresson Model Consder an economc model where the dependent varable s a functon of K explanatory varables. The economc model has the form: y = f ( x,x,..., ) xk Approxmate ths by

More information

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X Statstcs 1: Probablty Theory II 37 3 EPECTATION OF SEVERAL RANDOM VARIABLES As n Probablty Theory I, the nterest n most stuatons les not on the actual dstrbuton of a random vector, but rather on a number

More information

Modeling and Simulation NETW 707

Modeling and Simulation NETW 707 Modelng and Smulaton NETW 707 Lecture 5 Tests for Random Numbers Course Instructor: Dr.-Ing. Magge Mashaly magge.ezzat@guc.edu.eg C3.220 1 Propertes of Random Numbers Random Number Generators (RNGs) must

More information

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U)

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U) Econ 413 Exam 13 H ANSWERS Settet er nndelt 9 deloppgaver, A,B,C, som alle anbefales å telle lkt for å gøre det ltt lettere å stå. Svar er gtt . Unfortunately, there s a prntng error n the hnt of

More information

Negative Binomial Regression

Negative Binomial Regression STATGRAPHICS Rev. 9/16/2013 Negatve Bnomal Regresson Summary... 1 Data Input... 3 Statstcal Model... 3 Analyss Summary... 4 Analyss Optons... 7 Plot of Ftted Model... 8 Observed Versus Predcted... 10 Predctons...

More information

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6 Department of Quanttatve Methods & Informaton Systems Tme Seres and Ther Components QMIS 30 Chapter 6 Fall 00 Dr. Mohammad Zanal These sldes were modfed from ther orgnal source for educatonal purpose only.

More information

j) = 1 (note sigma notation) ii. Continuous random variable (e.g. Normal distribution) 1. density function: f ( x) 0 and f ( x) dx = 1

j) = 1 (note sigma notation) ii. Continuous random variable (e.g. Normal distribution) 1. density function: f ( x) 0 and f ( x) dx = 1 Random varables Measure of central tendences and varablty (means and varances) Jont densty functons and ndependence Measures of assocaton (covarance and correlaton) Interestng result Condtonal dstrbutons

More information

Computation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models

Computation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models Computaton of Hgher Order Moments from Two Multnomal Overdsperson Lkelhood Models BY J. T. NEWCOMER, N. K. NEERCHAL Department of Mathematcs and Statstcs, Unversty of Maryland, Baltmore County, Baltmore,

More information

Topic 23 - Randomized Complete Block Designs (RCBD)

Topic 23 - Randomized Complete Block Designs (RCBD) Topc 3 ANOVA (III) 3-1 Topc 3 - Randomzed Complete Block Desgns (RCBD) Defn: A Randomzed Complete Block Desgn s a varant of the completely randomzed desgn (CRD) that we recently learned. In ths desgn,

More information

Chapter 8 Indicator Variables

Chapter 8 Indicator Variables Chapter 8 Indcator Varables In general, e explanatory varables n any regresson analyss are assumed to be quanttatve n nature. For example, e varables lke temperature, dstance, age etc. are quanttatve n

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Experment-I MODULE VIII LECTURE - 34 ANALYSIS OF VARIANCE IN RANDOM-EFFECTS MODEL AND MIXED-EFFECTS EFFECTS MODEL Dr Shalabh Department of Mathematcs and Statstcs Indan

More information

Polynomial Regression Models

Polynomial Regression Models LINEAR REGRESSION ANALYSIS MODULE XII Lecture - 6 Polynomal Regresson Models Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Test of sgnfcance To test the sgnfcance

More information

LINEAR REGRESSION ANALYSIS. MODULE VIII Lecture Indicator Variables

LINEAR REGRESSION ANALYSIS. MODULE VIII Lecture Indicator Variables LINEAR REGRESSION ANALYSIS MODULE VIII Lecture - 7 Indcator Varables Dr. Shalabh Department of Maematcs and Statstcs Indan Insttute of Technology Kanpur Indcator varables versus quanttatve explanatory

More information

UCLA STAT 13 Introduction to Statistical Methods for the Life and Health Sciences. Chapter 11 Analysis of Variance - ANOVA. Instructor: Ivo Dinov,

UCLA STAT 13 Introduction to Statistical Methods for the Life and Health Sciences. Chapter 11 Analysis of Variance - ANOVA. Instructor: Ivo Dinov, UCLA STAT 3 ntroducton to Statstcal Methods for the Lfe and Health Scences nstructor: vo Dnov, Asst. Prof. of Statstcs and Neurology Chapter Analyss of Varance - ANOVA Teachng Assstants: Fred Phoa, Anwer

More information

A New Method for Estimating Overdispersion. David Fletcher and Peter Green Department of Mathematics and Statistics

A New Method for Estimating Overdispersion. David Fletcher and Peter Green Department of Mathematics and Statistics A New Method for Estmatng Overdsperson Davd Fletcher and Peter Green Department of Mathematcs and Statstcs Byron Morgan Insttute of Mathematcs, Statstcs and Actuaral Scence Unversty of Kent, England Overvew

More information

Comparison of the Population Variance Estimators. of 2-Parameter Exponential Distribution Based on. Multiple Criteria Decision Making Method

Comparison of the Population Variance Estimators. of 2-Parameter Exponential Distribution Based on. Multiple Criteria Decision Making Method Appled Mathematcal Scences, Vol. 7, 0, no. 47, 07-0 HIARI Ltd, www.m-hkar.com Comparson of the Populaton Varance Estmators of -Parameter Exponental Dstrbuton Based on Multple Crtera Decson Makng Method

More information

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers Psychology 282 Lecture #24 Outlne Regresson Dagnostcs: Outlers In an earler lecture we studed the statstcal assumptons underlyng the regresson model, ncludng the followng ponts: Formal statement of assumptons.

More information

Introduction to Regression

Introduction to Regression Introducton to Regresson Dr Tom Ilvento Department of Food and Resource Economcs Overvew The last part of the course wll focus on Regresson Analyss Ths s one of the more powerful statstcal technques Provdes

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 30 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 2 Remedes for multcollnearty Varous technques have

More information

A Comparative Study for Estimation Parameters in Panel Data Model

A Comparative Study for Estimation Parameters in Panel Data Model A Comparatve Study for Estmaton Parameters n Panel Data Model Ahmed H. Youssef and Mohamed R. Abonazel hs paper examnes the panel data models when the regresson coeffcents are fxed random and mxed and

More information

Answers Problem Set 2 Chem 314A Williamsen Spring 2000

Answers Problem Set 2 Chem 314A Williamsen Spring 2000 Answers Problem Set Chem 314A Wllamsen Sprng 000 1) Gve me the followng crtcal values from the statstcal tables. a) z-statstc,-sded test, 99.7% confdence lmt ±3 b) t-statstc (Case I), 1-sded test, 95%

More information

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also

More information

Testing for seasonal unit roots in heterogeneous panels

Testing for seasonal unit roots in heterogeneous panels Testng for seasonal unt roots n heterogeneous panels Jesus Otero * Facultad de Economía Unversdad del Rosaro, Colomba Jeremy Smth Department of Economcs Unversty of arwck Monca Gulett Aston Busness School

More information

ANSWERS CHAPTER 9. TIO 9.2: If the values are the same, the difference is 0, therefore the null hypothesis cannot be rejected.

ANSWERS CHAPTER 9. TIO 9.2: If the values are the same, the difference is 0, therefore the null hypothesis cannot be rejected. ANSWERS CHAPTER 9 THINK IT OVER thnk t over TIO 9.: χ 2 k = ( f e ) = 0 e Breakng the equaton down: the test statstc for the ch-squared dstrbuton s equal to the sum over all categores of the expected frequency

More information

Foundations of Arithmetic

Foundations of Arithmetic Foundatons of Arthmetc Notaton We shall denote the sum and product of numbers n the usual notaton as a 2 + a 2 + a 3 + + a = a, a 1 a 2 a 3 a = a The notaton a b means a dvdes b,.e. ac = b where c s an

More information

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y)

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y) Secton 1.5 Correlaton In the prevous sectons, we looked at regresson and the value r was a measurement of how much of the varaton n y can be attrbuted to the lnear relatonshp between y and x. In ths secton,

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 14 Multiple Regression Models

Statistics for Managers Using Microsoft Excel/SPSS Chapter 14 Multiple Regression Models Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 14 Multple Regresson Models 1999 Prentce-Hall, Inc. Chap. 14-1 Chapter Topcs The Multple Regresson Model Contrbuton of Indvdual Independent Varables

More information

CS-433: Simulation and Modeling Modeling and Probability Review

CS-433: Simulation and Modeling Modeling and Probability Review CS-433: Smulaton and Modelng Modelng and Probablty Revew Exercse 1. (Probablty of Smple Events) Exercse 1.1 The owner of a camera shop receves a shpment of fve cameras from a camera manufacturer. Unknown

More information

A Robust Method for Calculating the Correlation Coefficient

A Robust Method for Calculating the Correlation Coefficient A Robust Method for Calculatng the Correlaton Coeffcent E.B. Nven and C. V. Deutsch Relatonshps between prmary and secondary data are frequently quantfed usng the correlaton coeffcent; however, the tradtonal

More information

4.3 Poisson Regression

4.3 Poisson Regression of teratvely reweghted least squares regressons (the IRLS algorthm). We do wthout gvng further detals, but nstead focus on the practcal applcaton. > glm(survval~log(weght)+age, famly="bnomal", data=baby)

More information

Andreas C. Drichoutis Agriculural University of Athens. Abstract

Andreas C. Drichoutis Agriculural University of Athens. Abstract Heteroskedastcty, the sngle crossng property and ordered response models Andreas C. Drchouts Agrculural Unversty of Athens Panagots Lazards Agrculural Unversty of Athens Rodolfo M. Nayga, Jr. Texas AMUnversty

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 13 The Smple Lnear Regresson Model and Correlaton 1999 Prentce-Hall, Inc. Chap. 13-1 Chapter Topcs Types of Regresson Models Determnng the Smple Lnear

More information

CHAPTER 6 GOODNESS OF FIT AND CONTINGENCY TABLE PREPARED BY: DR SITI ZANARIAH SATARI & FARAHANIM MISNI

CHAPTER 6 GOODNESS OF FIT AND CONTINGENCY TABLE PREPARED BY: DR SITI ZANARIAH SATARI & FARAHANIM MISNI CHAPTER 6 GOODNESS OF FIT AND CONTINGENCY TABLE Expected Outcomes Able to test the goodness of ft for categorcal data. Able to test whether the categorcal data ft to the certan dstrbuton such as Bnomal,

More information

Notes on Frequency Estimation in Data Streams

Notes on Frequency Estimation in Data Streams Notes on Frequency Estmaton n Data Streams In (one of) the data streamng model(s), the data s a sequence of arrvals a 1, a 2,..., a m of the form a j = (, v) where s the dentty of the tem and belongs to

More information

Structure and Drive Paul A. Jensen Copyright July 20, 2003

Structure and Drive Paul A. Jensen Copyright July 20, 2003 Structure and Drve Paul A. Jensen Copyrght July 20, 2003 A system s made up of several operatons wth flow passng between them. The structure of the system descrbes the flow paths from nputs to outputs.

More information

Chapter 6. Supplemental Text Material

Chapter 6. Supplemental Text Material Chapter 6. Supplemental Text Materal S6-. actor Effect Estmates are Least Squares Estmates We have gven heurstc or ntutve explanatons of how the estmates of the factor effects are obtaned n the textboo.

More information

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition)

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition) Count Data Models See Book Chapter 11 2 nd Edton (Chapter 10 1 st Edton) Count data consst of non-negatve nteger values Examples: number of drver route changes per week, the number of trp departure changes

More information

Exercises of Chapter 2

Exercises of Chapter 2 Exercses of Chapter Chuang-Cheh Ln Department of Computer Scence and Informaton Engneerng, Natonal Chung Cheng Unversty, Mng-Hsung, Chay 61, Tawan. Exercse.6. Suppose that we ndependently roll two standard

More information

STATISTICS QUESTIONS. Step by Step Solutions.

STATISTICS QUESTIONS. Step by Step Solutions. STATISTICS QUESTIONS Step by Step Solutons www.mathcracker.com 9//016 Problem 1: A researcher s nterested n the effects of famly sze on delnquency for a group of offenders and examnes famles wth one to

More information

On the correction of the h-index for career length

On the correction of the h-index for career length 1 On the correcton of the h-ndex for career length by L. Egghe Unverstet Hasselt (UHasselt), Campus Depenbeek, Agoralaan, B-3590 Depenbeek, Belgum 1 and Unverstet Antwerpen (UA), IBW, Stadscampus, Venusstraat

More information

Global Sensitivity. Tuesday 20 th February, 2018

Global Sensitivity. Tuesday 20 th February, 2018 Global Senstvty Tuesday 2 th February, 28 ) Local Senstvty Most senstvty analyses [] are based on local estmates of senstvty, typcally by expandng the response n a Taylor seres about some specfc values

More information

Basic Business Statistics, 10/e

Basic Business Statistics, 10/e Chapter 13 13-1 Basc Busness Statstcs 11 th Edton Chapter 13 Smple Lnear Regresson Basc Busness Statstcs, 11e 009 Prentce-Hall, Inc. Chap 13-1 Learnng Objectves In ths chapter, you learn: How to use regresson

More information

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9 Chapter 9 Correlaton and Regresson 9. Correlaton Correlaton A correlaton s a relatonshp between two varables. The data can be represented b the ordered pars (, ) where s the ndependent (or eplanator) varable,

More information

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications Durban Watson for Testng the Lack-of-Ft of Polynomal Regresson Models wthout Replcatons Ruba A. Alyaf, Maha A. Omar, Abdullah A. Al-Shha ralyaf@ksu.edu.sa, maomar@ksu.edu.sa, aalshha@ksu.edu.sa Department

More information

Lecture 4 Hypothesis Testing

Lecture 4 Hypothesis Testing Lecture 4 Hypothess Testng We may wsh to test pror hypotheses about the coeffcents we estmate. We can use the estmates to test whether the data rejects our hypothess. An example mght be that we wsh to

More information

COMPARISON OF SOME RELIABILITY CHARACTERISTICS BETWEEN REDUNDANT SYSTEMS REQUIRING SUPPORTING UNITS FOR THEIR OPERATIONS

COMPARISON OF SOME RELIABILITY CHARACTERISTICS BETWEEN REDUNDANT SYSTEMS REQUIRING SUPPORTING UNITS FOR THEIR OPERATIONS Avalable onlne at http://sck.org J. Math. Comput. Sc. 3 (3), No., 6-3 ISSN: 97-537 COMPARISON OF SOME RELIABILITY CHARACTERISTICS BETWEEN REDUNDANT SYSTEMS REQUIRING SUPPORTING UNITS FOR THEIR OPERATIONS

More information

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics ECOOMICS 35*-A Md-Term Exam -- Fall Term 000 Page of 3 pages QUEE'S UIVERSITY AT KIGSTO Department of Economcs ECOOMICS 35* - Secton A Introductory Econometrcs Fall Term 000 MID-TERM EAM ASWERS MG Abbott

More information

Uncertainty in measurements of power and energy on power networks

Uncertainty in measurements of power and energy on power networks Uncertanty n measurements of power and energy on power networks E. Manov, N. Kolev Department of Measurement and Instrumentaton, Techncal Unversty Sofa, bul. Klment Ohrdsk No8, bl., 000 Sofa, Bulgara Tel./fax:

More information

STAT 511 FINAL EXAM NAME Spring 2001

STAT 511 FINAL EXAM NAME Spring 2001 STAT 5 FINAL EXAM NAME Sprng Instructons: Ths s a closed book exam. No notes or books are allowed. ou may use a calculator but you are not allowed to store notes or formulas n the calculator. Please wrte

More information

A Monte Carlo Study for Swamy s Estimate of Random Coefficient Panel Data Model

A Monte Carlo Study for Swamy s Estimate of Random Coefficient Panel Data Model A Monte Carlo Study for Swamy s Estmate of Random Coeffcent Panel Data Model Aman Mousa, Ahmed H. Youssef and Mohamed R. Abonazel Department of Appled Statstcs and Econometrcs, Instute of Statstcal Studes

More information

Lena Boneva and Oliver Linton. January 2017

Lena Boneva and Oliver Linton. January 2017 Appendx to Staff Workng Paper No. 640 A dscrete choce model for large heterogeneous panels wth nteractve fxed effects wth an applcaton to the determnants of corporate bond ssuance Lena Boneva and Olver

More information

2016 Wiley. Study Session 2: Ethical and Professional Standards Application

2016 Wiley. Study Session 2: Ethical and Professional Standards Application 6 Wley Study Sesson : Ethcal and Professonal Standards Applcaton LESSON : CORRECTION ANALYSIS Readng 9: Correlaton and Regresson LOS 9a: Calculate and nterpret a sample covarance and a sample correlaton

More information

Lecture 6 More on Complete Randomized Block Design (RBD)

Lecture 6 More on Complete Randomized Block Design (RBD) Lecture 6 More on Complete Randomzed Block Desgn (RBD) Multple test Multple test The multple comparsons or multple testng problem occurs when one consders a set of statstcal nferences smultaneously. For

More information

Basically, if you have a dummy dependent variable you will be estimating a probability.

Basically, if you have a dummy dependent variable you will be estimating a probability. ECON 497: Lecture Notes 13 Page 1 of 1 Metropoltan State Unversty ECON 497: Research and Forecastng Lecture Notes 13 Dummy Dependent Varable Technques Studenmund Chapter 13 Bascally, f you have a dummy

More information

18.1 Introduction and Recap

18.1 Introduction and Recap CS787: Advanced Algorthms Scrbe: Pryananda Shenoy and Shjn Kong Lecturer: Shuch Chawla Topc: Streamng Algorthmscontnued) Date: 0/26/2007 We contnue talng about streamng algorthms n ths lecture, ncludng

More information

The Order Relation and Trace Inequalities for. Hermitian Operators

The Order Relation and Trace Inequalities for. Hermitian Operators Internatonal Mathematcal Forum, Vol 3, 08, no, 507-57 HIKARI Ltd, wwwm-hkarcom https://doorg/0988/mf088055 The Order Relaton and Trace Inequaltes for Hermtan Operators Y Huang School of Informaton Scence

More information

Lecture 3: Probability Distributions

Lecture 3: Probability Distributions Lecture 3: Probablty Dstrbutons Random Varables Let us begn by defnng a sample space as a set of outcomes from an experment. We denote ths by S. A random varable s a functon whch maps outcomes nto the

More information

Homework Assignment 3 Due in class, Thursday October 15

Homework Assignment 3 Due in class, Thursday October 15 Homework Assgnment 3 Due n class, Thursday October 15 SDS 383C Statstcal Modelng I 1 Rdge regresson and Lasso 1. Get the Prostrate cancer data from http://statweb.stanford.edu/~tbs/elemstatlearn/ datasets/prostate.data.

More information

Chapter 15 Student Lecture Notes 15-1

Chapter 15 Student Lecture Notes 15-1 Chapter 15 Student Lecture Notes 15-1 Basc Busness Statstcs (9 th Edton) Chapter 15 Multple Regresson Model Buldng 004 Prentce-Hall, Inc. Chap 15-1 Chapter Topcs The Quadratc Regresson Model Usng Transformatons

More information

Convergence of random processes

Convergence of random processes DS-GA 12 Lecture notes 6 Fall 216 Convergence of random processes 1 Introducton In these notes we study convergence of dscrete random processes. Ths allows to characterze phenomena such as the law of large

More information

U-Pb Geochronology Practical: Background

U-Pb Geochronology Practical: Background U-Pb Geochronology Practcal: Background Basc Concepts: accuracy: measure of the dfference between an expermental measurement and the true value precson: measure of the reproducblty of the expermental result

More information

Cathy Walker March 5, 2010

Cathy Walker March 5, 2010 Cathy Walker March 5, 010 Part : Problem Set 1. What s the level of measurement for the followng varables? a) SAT scores b) Number of tests or quzzes n statstcal course c) Acres of land devoted to corn

More information

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur Module 3 LOSSY IMAGE COMPRESSION SYSTEMS Verson ECE IIT, Kharagpur Lesson 6 Theory of Quantzaton Verson ECE IIT, Kharagpur Instructonal Objectves At the end of ths lesson, the students should be able to:

More information

A note on almost sure behavior of randomly weighted sums of φ-mixing random variables with φ-mixing weights

A note on almost sure behavior of randomly weighted sums of φ-mixing random variables with φ-mixing weights ACTA ET COMMENTATIONES UNIVERSITATIS TARTUENSIS DE MATHEMATICA Volume 7, Number 2, December 203 Avalable onlne at http://acutm.math.ut.ee A note on almost sure behavor of randomly weghted sums of φ-mxng

More information

Lecture 16 Statistical Analysis in Biomaterials Research (Part II)

Lecture 16 Statistical Analysis in Biomaterials Research (Part II) 3.051J/0.340J 1 Lecture 16 Statstcal Analyss n Bomaterals Research (Part II) C. F Dstrbuton Allows comparson of varablty of behavor between populatons usng test of hypothess: σ x = σ x amed for Brtsh statstcan

More information

Chapter 3 Describing Data Using Numerical Measures

Chapter 3 Describing Data Using Numerical Measures Chapter 3 Student Lecture Notes 3-1 Chapter 3 Descrbng Data Usng Numercal Measures Fall 2006 Fundamentals of Busness Statstcs 1 Chapter Goals To establsh the usefulness of summary measures of data. The

More information

Uncertainty as the Overlap of Alternate Conditional Distributions

Uncertainty as the Overlap of Alternate Conditional Distributions Uncertanty as the Overlap of Alternate Condtonal Dstrbutons Olena Babak and Clayton V. Deutsch Centre for Computatonal Geostatstcs Department of Cvl & Envronmental Engneerng Unversty of Alberta An mportant

More information

Non-Mixture Cure Model for Interval Censored Data: Simulation Study ABSTRACT

Non-Mixture Cure Model for Interval Censored Data: Simulation Study ABSTRACT Malaysan Journal of Mathematcal Scences 8(S): 37-44 (2014) Specal Issue: Internatonal Conference on Mathematcal Scences and Statstcs 2013 (ICMSS2013) MALAYSIAN JOURNAL OF MATHEMATICAL SCIENCES Journal

More information

ANOMALIES OF THE MAGNITUDE OF THE BIAS OF THE MAXIMUM LIKELIHOOD ESTIMATOR OF THE REGRESSION SLOPE

ANOMALIES OF THE MAGNITUDE OF THE BIAS OF THE MAXIMUM LIKELIHOOD ESTIMATOR OF THE REGRESSION SLOPE P a g e ANOMALIES OF THE MAGNITUDE OF THE BIAS OF THE MAXIMUM LIKELIHOOD ESTIMATOR OF THE REGRESSION SLOPE Darmud O Drscoll ¹, Donald E. Ramrez ² ¹ Head of Department of Mathematcs and Computer Studes

More information

/ n ) are compared. The logic is: if the two

/ n ) are compared. The logic is: if the two STAT C141, Sprng 2005 Lecture 13 Two sample tests One sample tests: examples of goodness of ft tests, where we are testng whether our data supports predctons. Two sample tests: called as tests of ndependence

More information

DETERMINATION OF UNCERTAINTY ASSOCIATED WITH QUANTIZATION ERRORS USING THE BAYESIAN APPROACH

DETERMINATION OF UNCERTAINTY ASSOCIATED WITH QUANTIZATION ERRORS USING THE BAYESIAN APPROACH Proceedngs, XVII IMEKO World Congress, June 7, 3, Dubrovn, Croata Proceedngs, XVII IMEKO World Congress, June 7, 3, Dubrovn, Croata TC XVII IMEKO World Congress Metrology n the 3rd Mllennum June 7, 3,

More information

THE ROYAL STATISTICAL SOCIETY 2006 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE

THE ROYAL STATISTICAL SOCIETY 2006 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE THE ROYAL STATISTICAL SOCIETY 6 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE PAPER I STATISTICAL THEORY The Socety provdes these solutons to assst canddates preparng for the eamnatons n future years and for

More information

Statistics for Business and Economics

Statistics for Business and Economics Statstcs for Busness and Economcs Chapter 11 Smple Regresson Copyrght 010 Pearson Educaton, Inc. Publshng as Prentce Hall Ch. 11-1 11.1 Overvew of Lnear Models n An equaton can be ft to show the best lnear

More information

TAIL BOUNDS FOR SUMS OF GEOMETRIC AND EXPONENTIAL VARIABLES

TAIL BOUNDS FOR SUMS OF GEOMETRIC AND EXPONENTIAL VARIABLES TAIL BOUNDS FOR SUMS OF GEOMETRIC AND EXPONENTIAL VARIABLES SVANTE JANSON Abstract. We gve explct bounds for the tal probabltes for sums of ndependent geometrc or exponental varables, possbly wth dfferent

More information

UNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours

UNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours UNIVERSITY OF TORONTO Faculty of Arts and Scence December 005 Examnatons STA47HF/STA005HF Duraton - hours AIDS ALLOWED: (to be suppled by the student) Non-programmable calculator One handwrtten 8.5'' x

More information

This column is a continuation of our previous column

This column is a continuation of our previous column Comparson of Goodness of Ft Statstcs for Lnear Regresson, Part II The authors contnue ther dscusson of the correlaton coeffcent n developng a calbraton for quanttatve analyss. Jerome Workman Jr. and Howard

More information

Linear Approximation with Regularization and Moving Least Squares

Linear Approximation with Regularization and Moving Least Squares Lnear Approxmaton wth Regularzaton and Movng Least Squares Igor Grešovn May 007 Revson 4.6 (Revson : March 004). 5 4 3 0.5 3 3.5 4 Contents: Lnear Fttng...4. Weghted Least Squares n Functon Approxmaton...

More information

Statistics Chapter 4

Statistics Chapter 4 Statstcs Chapter 4 "There are three knds of les: les, damned les, and statstcs." Benjamn Dsrael, 1895 (Brtsh statesman) Gaussan Dstrbuton, 4-1 If a measurement s repeated many tmes a statstcal treatment

More information

Midterm Examination. Regression and Forecasting Models

Midterm Examination. Regression and Forecasting Models IOMS Department Regresson and Forecastng Models Professor Wllam Greene Phone: 22.998.0876 Offce: KMC 7-90 Home page: people.stern.nyu.edu/wgreene Emal: wgreene@stern.nyu.edu Course web page: people.stern.nyu.edu/wgreene/regresson/outlne.htm

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Experment-I MODULE VII LECTURE - 3 ANALYSIS OF COVARIANCE Dr Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Any scentfc experment s performed

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 31 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 6. Rdge regresson The OLSE s the best lnear unbased

More information