Two-Stage Controlled Fractional Factorial Screening for Simulation Experiments
|
|
- Dwayne Stevens
- 6 years ago
- Views:
Transcription
1 Two-Stage Cotrolled Fractioal Factorial Screeig for Simulatio Experimets Hog Wa Assistat Professor School of Idustrial Egieerig Purdue Uiversity 35 N. Grat Street West Lafayette, IN (765) Fax: (765) Bruce E. Akema Associate Professor Departmet of Idustrial Egieerig ad Maagemet Scieces Northwester Uiversity 245 Sherida Rd. Evasto, IL. 628 (847) Fax (847) url: JUNE 23, 26 ABSTRACT Factor screeig with statistical cotrol makes sese i the cotext of simulatio experimets that have radom error, but ca be ru automatically o a computer ad thus ca accommodate a large umber of replicatios. The discrete-evet simulatios commo i the operatios research field are well suited to cotrolled screeig. I this paper two methods of factor screeig with cotrol of Type I error ad power are compared. The two screeig methods are both robust with respect to two-factor iteractios ad o-costat variace. The first method is the sequetial method itroduced by Wa, Akema, ad Nelso (25a) called CSB-X. The secod method uses a fractioal factorial desig i combiatio with a two-stage procedure for cotrollig power that has bee adapted from Bishop ad Dudewicz (978). The two-stage cotrolled fractioal factorial (TCFF) method requires less prior iformatio ad is more efficiet whe the percetage of importat factors is 5% or higher.
2 Key Words Computer experimets, Discrete-evet simulatio, Desig of Experimets, Factor Screeig, Sequetial Bifurcatio Bruce E. Akema is a Associate Professor i the Departmet of Idustrial Egieerig ad Maagemet Scieces at the McCormick School of Egieerig at Northwester Uiversity. His curret research iterests iclude respose surface methodology, desig of experimets, robust desig, experimets ivolvig variace compoets ad dispersio effects, ad desig for simulatio experimets. He is a past chair of the Quality Statistics ad Reliability Sectio of INFORMS, is a Associate Editor for Naval Research Logistics ad is a Departmet Editor for IIE Trasactios: Quality ad Reliability Egieerig. He is a member of ASA ad ASQ. akema@orthwester.edu URL: Hog Wa is a Assistat Professor i the School of Idustrial Egieerig at Purdue Uiversity. Her research iterests iclude desig ad aalysis of simulatio experimets; simulatio optimizatio; simulatio of maufacturig, healthcare ad fiacial systems; quality cotrol; ad applied statistics. She has taught a variety of courses ad is a member of INFORMS ad ASA. Her ad web addresses are hwa@purdue.edu ad 2
3 Itroductio The term computer experimet is commo i the statistical literature (see Sater, Willliams, ad Notz, 23) ad refers to a experimet o a computer simulatio of a physical system such as a material or a process. Typically, computer experimets are thought to be determiistic (havig o radom error other tha roudig error). This is because may egieerig simulatios use fiite elemet or other differetial equatio-based simulatio methods that have o radom compoet. However, i the operatios research commuity, the term simulatio is most commoly used to refer to discrete-evet simulatios of maufacturig or service systems (see Baks, Carso, Nelso ad Nicol, 25). These simulatios are typically complex queueig systems that simulate products or customers flowig through a system. The simulatio is drive by radom umber geerators that draw radom arrival times for the products or customers ad radom service ad failure times for the machies or statios. The iput distributios for these radom umber geerators are based o the distributios observed for each type of machie or statio i the system. Typical software packages for this type of discrete-evet simulatio iclude Witess, Area, Simul8, ad Factory Explorer. May other software packages have bee developed for more specific scearios, for example, MedModel for healthcare simulatio. Although they do have radom oise like physical experimets, discrete-evet simulatio experimets share some attributes with computer experimets: ) I may cases, the whole experimet ca be programmed to ru automatically. 2) Compared to physical experimets, it is usually cheaper ad easier to switch factor settigs i a simulatio experimet. Therefore, the desig of the simulatio experimet is ofte sequetial, usig a desig criterio to choose the ext ru or to stop the experimet. 3
4 3) I may cases, replicatios are ot as expesive as they are i physical experimets. This fact allows for the experimet size to be determied by a desired precisio (or correctess) istead of a preset experimetal budget. I this article, we will compare two approaches for factor screeig i the cotext of discrete-evet simulatio experimets. Factor screeig refers to a experimet that is desiged to sort a large set of factors ito those that have a importat effect o the system i questio ad those that do ot substatially affect the system respose (see Dea ad Lewis, 25). Factor screeig is used for may differet purposes. It is ofte used as a first stage of a optimizatio or ivestigatio procedure iteded to reduce the umber of factors that must be icluded i more detailed ivestigatio. However, there are may other reasos for performig factor screeig icludig () gaiig a better uderstadig of a system by fidig the factors that affect the output ad equally importat fidig those factors which have a egligible effect o the output over the rage of iterest, (2) preparig for future adjustmet or cotrol of a system by fidig a set of factors that ca affect the output, ad (3) simplifyig the simulatio model by removig portios of the model that have little effect o the output of iterest. Typically i factor screeig, a hierarchical argumet is used so factors with importat mai effects are first screeed out ad the additioal experimets are doe to determie if those importat factors iteract with each other or have higher order effects. Sice the cost of replicatio is so high, the emphasis i factor screeig for physical experimets has bee to allow for the estimatio of as may mai effects as possible i as few rus as possible. This has lead to approaches such as supersaturated desigs (see Lu ad Xu, 24) ad group screeig (see Lewis ad Dea, 2). I view of the three characteristics of discrete-evet simulatio experimets listed above, the screeig methods that we will compare i this article will be drive ot by a specific umber of experimetal rus, but istead by a desired level of Type I 4
5 error (the probability of declarig a factor importat whe it is ot) ad power (the probability of correctly declarig a factor importat). Due to the statistical cotrol that we impose o this screeig experimet, we will call this cotrolled screeig. Of course, give that the desired level of statistical cotrol (power ad level of Type I error) is achieved, it is desirable to reduce the umber of replicatios ad thereby reduce the total amout of computer time eeded for the screeig experimet. Thus, the umber of replicatios eeded to achieve the desired level of cotrol will be the criterio for comparig the two screeig methods. I discrete-evet simulatios, the resposes of iterest are ofte system performace measures like average cycle time for a product to pass through the system. For complicated systems, there are ofte hudreds of factors. For example, there may be may statios or machies i the maufacturig system ad each statio or machie may have several factors associated with it. I may cases, these factors have a kow directio of effect o the respose. For example, icreasig the speed of a trasporter will likely decrease the average cycle time if it has ay effect at all. Usig the assumptio that the directios of the effects of the factors are kow ad the hierarchical assumptio that liear effects are a good idicator factor importace, Wa, Akema, ad Nelso (25a) proposed Cotrolled Sequetial Bifurcatio (CSB), a cotrolled factor screeig method for simulatio experimets that uses a form of biary search called sequetial bifurcatio. Sequetial bifurcatio was origially proposed for determiistic screeig by Bettovil ad Kleije (997). CSB was exteded to provide error cotrol for screeig mai effects eve i the presece of two-factor iteractios ad secod order effects of the factors (Wa, Akema, ad Nelso, 25b). The ew method is called CSB-X. A iterestig fidig i Wa, et al. (25b) is that if iteractios are ot preset, CSB-X is usually as efficiet as CSB i idetifyig mai effects (this depeds o the variace structure at differet levels of the factors). Whe two-factor iteractios are preset, CSB is uable to correctly 5
6 cotrol the idetificatio of the mai effects due to cofoudig with the iteractios. Thus, CSB-X is recommeded for cotrolled screeig i almost all cases. The primary purpose of this paper is to compare CSB-X with aother method of screeig, amely a fractioal factorial desig that is used as part of a two-stage method for cotrollig Type I error ad power. I the ext sectio, we briefly describe the objectives of cotrolled screeig. This structure was first itroduced i Wa, et al. (25a). Sectio 3 briefly describes CSB-X ad the itroduces a versio of fractioal factorial screeig that cotrols Type I error ad power usig a two-stage samplig procedure similar to the oe used i Bishop ad Dudewicz (978) for a twoway ANOVA layout. Sectio 4 compares the two methods uder a variety of scearios ad shows the circumstaces uder which each method is preferred. Coclusios are the draw. Objective of the Cotrolled Screeig The purpose of this paper is to compare the performace of two methods of cotrolled screeig. I order to compare the methods fairly, some structure o the screeig problem is ow established. We begi with the model. Suppose there are K factors ad a simulatio output respose represeted by Y, the we assume the followig full secod order model: K K K, () Y = β + β x + β x x + ε k k km k m k= k= m= k where x k represets the level of factor k ad ε is a ormally distributed error term whose 2 variace is ukow ad may deped o the levels of the factors, i.e., ( ) ( x x x ) ε ~Nor(, σ x ), where x =, 2,, K. I this paper, we assume that the factor levels are coded o a scale from - to. Later, we will focus o Resolutio IV, two-level desigs which ofte used i mai effects screeig with o-egligible iteractios. I these desigs, the liear mai effects estimators 6
7 are ot cofouded by ay other terms; however, may of the secod order iteractios may be cofouded together ad the pure quadratic terms will cofouded with the estimate of β. Other desigs could i priciple be adapted to the proposed method, but we leave this for future research. To ease the discussio, we also assume that larger values of the respose, Y, are more desirable. The case of a smaller is better respose is easily hadled by makig the respose egative. The goal of the cotrolled screeig is to provide a experimetal desig ad aalysis strategy to idetify which factors have large liear mai effects, the β k s i (). Two thresholds are established:. is the threshold of importace used to establish the level of Type I error. Specifically, if β <, the the probability of declarig factor k importat should k be less tha α, where α is selected by the experimeter. Typically α = is the critical threshold used to cotrol the power. Specifically, if β, the the probability of declarig factor k importat should be greater tha γ, where γ is selected by the experimeter Typically γ.8. k The scalig of the factors ca be doe i ay way that the experimeter fids useful; however, sice all factor effects are to be compared to the same thresholds, they should be comparable i some sese. Oe method would be to have the respose be dollars of profit produced ad thus all costs for chagig factor levels would be icorporated ito the profit calculatio. Aother scalig method, itroduced i Wa, et al. (25a), assures that all the factor effects are o the same cost scale. For purposes of discussio, we will follow the cost-based scalig method of Wa, et al. (25a). The method assumes that the lower the factor settig, the 7
8 lower the cost. The scalig is such that the cost to chage each factor from the level () to the highest cost level () is a fixed cost, say c*. Wa, et al. (25a) shows how the levels of discrete factors (e.g. the umber of machies or umber of operators) ca be icorporated ito this costbased scalig system by itroducig a weightig for each factor. The beefit of this cost-based scalig is that each liear effect, β k, ca be compared directly to the thresholds ad to the other liear effects sice β k represets the expected chage i the respose whe spedig c* to chage factor k. The two thresholds ow also become easily expressed cocepts: The threshold of importace,, is the miimum amout by which the respose must rise i order to justify a expediture of c* dollars. The critical threshold,, is a icrease i the respose that should ot be missed if it ca be achieved for a expediture of c* dollars. Sice the respose is to be maximized, the it is atural to assume that icreasig the cost of ay factor will either have o effect or icrease the respose. Thus i Wa, et al (25a), it is assumed that the directios of all mai effects are kow ad the levels of the factors are set to make all the mai effects positive, i.e., βk, < k K. Methodology I this sectio, we will first review CSB-X, the method proposed i Wa, et al. (25b) that provides cotrolled factor screeig i the presece of two-factor iteractios. We will the preset a method of cotrolled screeig that uses a two-stage hypothesis testig procedure i a stadard fractioal factorial desig. The secod method will be called TCFF for Two-stage Cotrolled Fractioal Factorial. 8
9 CSB-X, Cotrolled Sequetial Bifurcatio with Iteractios CSB-X is a cotrolled versio of sequetial bifurcatio. Sequetial bifurcatio is a series of steps, where withi each step a group of factors is tested. Withi the group, all factors are completely cofouded together (as i group screeig, see Lewis ad Dea, 2). Therefore, it is critical to CSB-X that the directio of all the factor effects be positive (i.e. βk, < k K ) to prevet cacellatio of effects. If a group of factors is tested ad the group effect (the sum of all the mai effects of the factors i the group) is ot foud to be sigificatly greater tha (the threshold of importace) the all the factors i the group are declared uimportat ad are dropped from further screeig. If the group effect is foud to be greater tha ad there is oly oe factor i the group, the that factor is declared to be importat. If the group effect is foud to be greater tha ad there is more tha oe factor i the group, the group is split ito two subgroups that are to be tested later. This procedure is followed util all factors are classified as either importat or uimportat. Kleije ad Bettovil (997) suggest various practical methods for splittig ad testig the groups. Specifically, splittig groups ito subgroups whose size is a power of 2 ca improve the efficiecy of the procedure. Also, if oe selects for testig groups that have large group effects before groups that have small group effects, the procedure ca sequetially reduce the upper boud o the greatest effect that has ot yet bee idetified. This upper boud is very useful if the procedure is stopped early. Sice we are comparig the statistical cotrol of the procedures, we pla to complete the screeig procedure i all cases ad thus the sequetial upper boud is ot of iterest. Also, we foud i empirical testig, that splittig groups ito subgroups whose size is a power of 2 did ot always improve efficiecy ad ever led to icreases i efficiecy that substatially chaged our coclusios. Thus, for better compariso 9
10 to Wa, et al. (25b), we will use the procedure that simply splits each importat group ito two equally sized subgroups. I the case of a odd group size, the subgroup sizes differ by oe. We will also follow the testig procedure of Wa, et al. (25b). To defie the desig poits of their experimet, the desig poit at level k is defied as x i ( k), i =, 2,3, k =, i = k +, k + 2, K. Thus, level k has all factors with idices greater tha k set to o the to scale ad all factors with idices less tha or equal to k, set to. Similarly, a desig poit at level k is defied as x i ( k), i =, 2,3, k =, i = k +, k + 2, K. To test a group of factors, say factors with idices i the set { k +, k + 2,, k2}, four desig poits are used, level k, level k, level k 2, level k2. To follow the otatio i Wa, et al. (25b), let Z ( k ) represet the observed simulatio output for the th replicatio at level k. The procedure esures that all four desig poits have the same umber of replicatios ad the test statistic, D ( k, k ) 2, is calculated as ( ) ( ) = ( ) ( ) ( ) ( ) D k k Z k Z k Z k Z k, /2. They show that give (), the expected value of D ( k, k ) from the group { k +, k + 2,, k2}, i.e., 2 (, 2) = 2 is the sum of the factors mai effects k k. k= k + E D k k β
11 The group is declared importat if the ull hypothesis, H : k2 β, is rejected at k k= k + level α. The group is declared uimportat if the ull hypothesis is ot rejected, provided that the test used has power γ for the alterative hypothesis, H A : k2 k= k + β. To guaratee this k statistical cotrol, Wa, et al. (25b) provide a fully sequetial test that first takes a iitial umber,, of replicatios at each of the four desig poits ad the adds replicatios oe-at-atime (to all four desig poits) util the group effect ca be declared either importat or uimportat. Eve uder o-costat variace coditios as i (), CSB-X with the fully sequetial test guaratees that the probability of Type I error is α for each factor idividually ad the power is at least γ for each step of the algorithm. TCFF, Two-stage Cotrolled Fractioal Factorial The TCFF method begis by selectig a fractioal factorial desig that is capable of achievig the desired resolutio for the umber of factors beig screeed. Recall that a resolutio III desig ca estimate all mai effects idepedet of each other, but does cofoud mai effects with certai two-factor iteractios (this is similar to CSB). A resolutio IV desig also allows for idepedet estimatio of mai effects, ad i additio removes ay bias due to two-factor iteractios betwee the factors. I order to match the capabilities of CSB-X, we will use resolutio IV desigs for the compariso i this paper. May books (e.g. Wu ad Hamada, 2) ad software packages provide recommeded fractioal factorial desigs for various resolutios ad various values of K, the umber of factors to be screeed. If a recommeded desig caot be easily foud for a particular value of K, a resolutio IV desig ca always be easily costructed by takig a orthogoal array large eough to accommodate K two-level factors ad the foldig it over accordig to the procedure
12 described o page 72 of Wu ad Hamada (2). Sice the foldover procedure doubles the umber of rus i the orthogoal array, this procedure produces a resolutio IV desig for K factors that has at least 2 (K+) desig poits. A very extesive list of orthogoal arrays for differet values of K has recetly bee published by Kuhfield ad Tobias (25). The method described below for cotrollig the power does ot require ay particular resolutio desig ad i fact ca be used with higher resolutio desigs if oe wats to scree for factors with importat two-factor iteractios or higher order effects. Of course higher resolutio desigs will require more desig poits ad potetially may more observatios. Oce the fractioal factorial desig (with N desig poits) is selected, the ext step is to cotrol the power ad Type I error of the mai effect estimates. To accomplish this, we adapt a two-stage method developed to cotrol power for the two-way ANOVA layout by Bishop ad Dudewicz (978). The first stage of the method requires a iitial umber, say, of replicatios of the fractioal factorial desig, i.e., observatios at each desig poit. Sice the fractioal factorial is likely to be quite large, we recommed that be relatively small such as 3 or 5. The secod stage of the method adds replicatios to certai rows of the factorial desig to get better estimates of the mea respose of those rows for which the resposes have high variace. We deote the total umber of replicatios for row i after both stages as i. Thus, the umber of additioal replicatios take for row i i the secod stage is i. The TCFF procedure is represeted graphically i Figure. The keys to the cotrol of Type error ad power for this method are: () the calculatio for i ad (2) careful selectio of the estimators for each factor effect. 2
13 The TCFF Method The Desig Matrix First Stage Secod Stage K factors replicatios i - replicatios N rows Figure. A graphical view of the TCFF Method. Before describig the calculatios for i, a brief overview of the aalysis will be helpful. Oce all the replicatios for stage two are made for each row, a sigle pseudo-observatio, called Y i, will be calculated for each row. Each Y i will be a weighted average of all the observatios from row i. Give the ormal error structure show i (), careful selectio of the umber of rus i each row ad the weights i the weighted averages will result i each of these Y i s havig a distributio which is proportioal to a o-cetral t-distributio with - degrees of freedom. Oce the psuedo-observatios are calculated for each row, the desig is treated like a ureplicated fractioal factorial desig ad the mai effects of each factor are calculated i the stadard way (either with cotrasts as described o page 228 of Motgomery, 2 or through regressio aalysis, see page 3 of Wu ad Hamada, 2). Sice each of the Y i s follows a t- distributio, the ull distributio of each effect estimate ca be foud through Mote Carlo simulatio or through a ormal approximatio. Thus, a α-level test ca be used to test the 3
14 hypothesis: H : βk or H : β k >. The calculatio of the umber of replicatios i the secod stage of the method esures that if the effect of factor k is greater tha, the probability that factor k will be declared importat is greater tha γ. The steps for determiig i are ow give (The formulas are adapted from Bishop ad Dudewicz, 978.):. Let the cdf of the distributio of the average of N idepedet stadard t- distributed radom variables each with degrees of freedom be called (,, ) t N x, where x is the argumet of the cdf. Let c be the -α quatile of this distributio ad c be the -γ quatile of this distributio. These quatiles ca be easily obtaied through simulatio or through a ormal approximatio (see the Appedix). Values for c ad c for some commo values of ad N are give i the tables i the Appedix. 2. Let z = c c 2 ad let s i be the sample stadard deviatio from the replicates of row i from the first stage. The total umber of replicatios eeded for row i after the secod stage of the method is the i 2 s i = max +, +, z where x deotes the greatest iteger less tha x. Thus, i the secod stage, i ew observatios are take for each row i. After all observatios are take, each factor effect, β k, is tested as follows: 4
15 . For each row i the fractioal factorial desig, a weight for each replicatio must be calculated. The jth replicatio i the ith row will be called y ij ad the associated weight will be called a ij. 2. A prelimiary value called b i is first calculated for each row as: b i 2 ( ) iz si = + ( ) s 2 i i i, where is the umber of iitial replicatios i the first stage, i is the total umber of replicatios of row i after the secod stage, z, ad s i is = c c 2 the sample stadard deviatio from the iitial replicates of row i from the first stage. 3. The weight for the jth replicatio i the ith row is a ij ( ) i bi = for j =, 2,,, ad a ij b j = +, + 2,, i. = i for 4. These weights are the used to calculate a sigle weighted average respose for each row, called Y i, where Y. Give (), each Y i has a scaled ocetral t-distributio such that i = a y i ij ij j= K K K Y ~ zt + β + β x + β x x, (2) i i k ik km ik im k= k= m= k where each t i is a idepedet radom variable that has a cetral t-distributio with degrees of freedom. 5
16 5. The desig ca ow be viewed as a ureplicated fractioal factorial desig with observatios Y i. Let the level of the kth factor i the ith row of the desig matrix be x ik. The estimator for the coefficiet of the mai effect for the kth factor is: N ˆ β = x Y. (3) k ik i N i= 6. Each x ik i the fractioal factorial desig will be either or, ad each colum, x k, is orthogoal to the mea, all the other mai effects, ad all two-factor iteractio colums. Sice the t-distributio is symmetric (i.e. t i has the same distributio as t i ), it follows from (2) ad (3) that N ˆ ti βk ~ βk + z, (4) N i= where each t i is a idepedet radom variable that has a cetral t-distributio with degrees of freedom. 7. We ow test the hypothesis: H : βk or H : β k >. The ull hypothesis is rejected oly if ˆk β > + c z sice it follows from (4) that ˆ β > + c z β < < α, Pr k k where c is the -α quatile of the distributio with cdf t ( N,, x) Bishop ad Dudewicz (978) use a theorem from Stei (945) to show that for a two-way ANOVA layout, this procedure provides exact cotrol of Type I error eve if the variace chages across treatmet combiatios. They also exted the results to a multi-way ANOVA i Bishop ad Dudewicz (98). Sice the fractioal factorial desig ca be viewed as a multi-way ANOVA with various mai effects fully cofouded with high order iteractios whose effects 6.
17 are assumed to be zero, the procedure guaratees exact cotrol of Type I error give the orthogoal desig ad the model i (). Power cotrol is achieved by choosig the value for z. The ull hypothesis is rejected oly if ˆk β > + c z. By (4), ˆ Pr βk > + c z βk γ. To cotrol power, z must be the set such that + c z = + c z so that if β, the probability of rejectig the ull hypothesis is greater tha γ (i.e. ˆ Pr βk > + c z βk γ ). Therefore z = c c This cocludes the methodology for TCFF. I the ext sectio, a simple umerical example with a small umber of factors is provided to help clarify the computatios. 2. Numerical Example Suppose that a small maufacturig system has 2 statios ad at each statio there are three factors: the umber of machies (coded M ad M2), the umber of operators (coded O ad O2), ad the frequecy of prevetative maiteace (coded F ad F2). The goal is to use a 6-ru resolutio IV fractioal factorial to scree for factors that have a large effect o the daily throughput of the system. Table shows the desig matrix ad the first stage observatios. We have chose = 3, =, α =.5, γ =.95, ad = 4. 7
18 The Desig Matrix Row M M2 O O2 F F2 i The First Stage Replicatios y y y i2 y i3 i Table. Data from the first stage of the umerical example. s i I order to calculate i for each row, we begi by calculatig z. We ca approximate c ad c, the α ad γ quatiles of the t distributio by applyig the cetral limit theorem ad usig a ormal approximatio. Sice the variace of a t-distributio with 3 degrees of freedom is 3, the variace of the average of 6 idepedet t-distributed radom variables should be approximately ormal with variace 3/6. Thus, c φ ( ) 3.95 =.722 ad 6 c 3 6 φ (.5) =.722, where φ ( x) is the iverse fuctio for the stadard ormal distributio. Usig, Mote Carlo simulatios, we foud values of c =.675 ad c =.675 (see Appedix Table A2). We will use the Mote Carlo simulatio results to 8
19 calculate z as follows: z 2 = = c c 35,66. The calculated values for i are (5, 5, 5, 5, 5, 5, 5, 7, 9, 5, 5, 5, 5, 5, 5, 2) for rows -6 respectively. The i observatios for the secod stage, the b i s, ad Y i s are give i Table 2. Row i5 The Secod Stage Replicatios ad Computatios y y i6 y i7 y i8 y i9 y i y i y i2 b i Table 2. Data from the secod stage of the umerical example. The estimates for the factor effect coefficiets are show i Table 3. Sice + c z = 7, the we see that M ad F2 have importat effects. Coefficiet Mea M M2 O O2 F F2 Estimate Table 3. The estimates for the coefficiets for the mai effect of each factor. Compariso of Methods With ormally distributed error, both CSB-X ad TCFF have guarateed performace for Type I error ad power, eve whe there is uequal variace ad two-factor iteractios preset i the model. Thus, the primary measure for compariso betwee the two methods is the umber of replicatios that it takes to gai the required performace. We set up scearios uder which to test the two methods. Each of these scearios is used for 2 ad 5 factors with %, 9 Y i
20 5%, ad % of the factors importat. The TCFF method uses a 52-ru, resolutio IV desig for the 2 factor cases ad a 24-ru, resolutio IV desig for the 5 factor cases. TCFF uses =3 replicatios for the first stage. CSB-X uses =5. I each sceario, the size of the importat effects is 5 ad the size of uimportat effects is. We set to 2, ad to 4 for all scearios ad the error was ormally distributed. The first 3 scearios have equal variace. I the ext 5 scearios, each of the importat factors also has a dispersio effect, which meas that whe oe of those factors chages level, the variability of the error either icreases or decreases. Half the importat factors icrease the variability ad the other half decrease the variability. Each dispersio effect icreases or decreases the stadard deviatio of the error by 2% i the cases with 2 factors ad by 8% i the cases with 5 factors. I the fial 3 scearios, the stadard deviatio of the error is proportioal to the expected value of the respose. The costat of proportioality was. ad.4 for the 2 ad 5 factor cases, respectively. These are very importat scearios sice for both simulatio experimets ad physical experimets, it is very commo that variability ad average respose value are related. Each of the scearios is described below.. The importat effects are clustered together at the begiig of the factor set ad the stadard deviatio for the radom error is set to 3 for all observatios. 2. The importat effects are distributed at regular itervals throughout the factor set ad the stadard deviatio for the radom error is set to 3 for all observatios. 3. The importat effects are radomly distributed throughout the factor set ad the stadard deviatio for the radom error is set to 3 for all observatios. 4. The importat effects are clustered together at the begiig of the factor set. Each of the importat factors also has a dispersio effect. I this sceario all the positive dispersio effects are clustered at the begiig of the factor set. 2
21 5. The importat effects are clustered together at the begiig of the factor set. Each of the importat factors also has a dispersio effect. I this sceario, the positive dispersio effects are distributed throughout the set of importat factors. 6. The importat effects are distributed throughout the factor set. Each of the importat factors also has a dispersio effect. I this sceario all the positive dispersio effects are clustered at the begiig of the set of importat factors. 7. The importat effects are distributed throughout the factor set. Each of the importat factors also has a dispersio effect. I this sceario, the positive dispersio effects are distributed throughout the set of importat factors. 8. The importat effects are radomly distributed throughout the factor set. Each of the importat factors also has a dispersio effect which is radomly chose to be positive or egative. 9. The importat effects are clustered together at the begiig of the factor set ad the stadard deviatio for the radom error is proportioal to the average respose.. The importat effects are distributed at regular itervals throughout the factor set ad the stadard deviatio for the radom error is proportioal to the average respose.. The importat effects are radomly distributed throughout the factor set ad the stadard deviatio for the radom error is proportioal to the average respose. There were macro-replicatios of each sceario for 2 ad 5 factors ad with %, 5%, ad % of the factors beig importat. Sice both methods elimiate the effects of twofactor iteractios, iteractios were radomly geerated for each simulatio ad the two methods were give the same iteractio matrix for each of the macro-replicatios uder each sceario. The iteractios were geerated from a N (,2) distributio ad radomly assiged 2
22 betwee factors. Factors with importat effects had a 64% chace of iteractio with other importat factors ad a 6% chace of iteractio with factors with uimportat effects. Each pair of uimportat factors had oly a 4% chace of iteractio. Surprisigly, usig the same iteractio matrix did ot seem to iduce much correlatio betwee the macro-replicatios. The average ad stadard deviatio of the umber of replicatios across the macro-replicatios for each method for each sceario are reported i Tables 4 ad 5. The average umber of replicatios is plotted across the scearios i Figures 2-7. It is clear from Figures 2 ad 5 that CSB-X requires fewer replicatios whe oly % of the factors are importat. I most of these cases, the TCFF method takes the miimum umber of rus 4 24 = 496 rus. Figures 3, 4, 6, ad 7 show that the TCFF method is better o average tha CSB-X whe the percetage of importat factors is greater tha 5%. Tables 4 ad 5 show that the stadard deviatio across the macro-replicatios is always lower for TCFF ad that it ca be quite high for CSB-X especially i the scearios where the stadard deviatio of the error is proportioal to the average respose (Scearios 9,, ad ). Also, CSB-X is sesitive to the order i which the importat effects appear, where TCFF is ot. Naturally CSB-X does very well whe the importat effects are clustered, but performs worse whe they are radomly placed or distributed regularly. Fially, recall that CSB-X requires the directio of the factor effects to be kow, where TCFF does ot. Coclusio Two methods, CSB-X ad TCFF, for screeig of a large umber of factors with statistical cotrol of the Type I error ad the power have bee reviewed ad compared. If there are very few importat factors (less tha 5%) ad there is substatial prior iformatio (kow directio of factor effects ad some prior iformatio that allows the importat effects to be clustered), CSB-X ca have substatial gais i efficiecy over the TCFF approach, however, it 22
23 ca have highly variable results depedig o the quality of the prior iformatio ad the extet to which the error variace chages across observatios. The TCFF method requires little prior iformatio, but does ivolve a substatial iitial ivestmet of replicatios. Oce this iitial ivestmet is made, the method is relatively stable ad performs better tha CSB-X whe the percetage of importat factors icreases above 5%. 23
24 Sceario # of Factors # of Importat Factors Sigificat Effects Order Variace Model CSB-X TCFF Ave # of Reps Stdev. of Reps Ave # of Reps 2 2 Clustered Equal Distributed Equal Radom Equal Clustered Clust. Disp. Eff Clustered Dist. Disp. Eff Distributed Clust. Disp. Eff Distributed Dist. Disp. Eff Radom Disp. Eff Clustered Prop. to mea Distributed Prop. to mea Radom Prop. to mea Clustered Equal Distributed Equal Radom Equal Clustered Clust. Disp. Eff Clustered Dist. Disp. Eff Distributed Clust. Disp. Eff Distributed Dist. Disp. Eff Radom Disp. Eff Clustered Prop. to mea Distributed Prop. to mea Radom Prop. to mea Clustered Equal Distributed Equal Radom Equal Clustered Clust. Disp. Eff Clustered Dist. Disp. Eff Distributed Clust. Disp. Eff Distributed Dist. Disp. Eff Radom Disp. Eff Clustered Prop. to mea Distributed Prop. to mea Radom Prop. to mea Table 4. Simulatio results for CSB-X ad TCFF with 2 factors. Stdev. of Reps 24
25 Sceario # of factors # of Importat Factors Sigificat Effects Order Variace Model CSB-X TCFF Ave # of Reps Stdev. of Reps Ave # of Reps 5 5 Clustered Equal Distributed Equal Radom Equal Clustered Clust. Disp. Eff Clustered Dist. Disp. Eff Distributed Clust. Disp. Eff Distributed Dist. Disp. Eff Radom Disp. Eff Clustered Prop. to mea Distributed Prop. to mea Radom Prop. to mea Clustered Equal Distributed Equal Radom Equal Clustered Clust. Disp. Eff Clustered Dist. Disp. Eff Distributed Clust. Disp. Eff Distributed Dist. Disp. Eff Radom Disp. Eff Clustered Prop. to mea Distributed Prop. to mea Radom Prop. to mea Clustered Equal Distributed Equal Radom Equal Clustered Clust. Disp. Eff Clustered Dist. Disp. Eff Distributed Clust. Disp. Eff Distributed Dist. Disp. Eff Radom Disp. Eff Clustered Prop. to mea Distributed Prop. to mea Radom Prop. to mea Table 5. Simulatio results for CSB-X ad TCFF with 5 factors. Stdev. of Reps 25
26 CSB-X vs Two-Stage FFD (% of 2 factors importat) 6 Average # of Reps CSB-X 2-Stage FFD Scearios Figure 2. Average umber of rus with % of 2 factors importat. CSB-X vs Two-Stage FFD (5% of 2 factors importat) Average # of Reps CSB-X 2-Stage FFD Scearios Figure 3. Average umber of rus with 5% of 2 factors importat. CSB-X vs Two-Stage FFD (% of 2 factors importat) Average # of Reps CSB-X 2-Stage FFD Scearios Figure 4. Average umber of rus with % of 2 factors importat. 26
27 CSB-X vs Two-Stage FFD (% of 5 factors importat) Average # of Reps Scearios CSB-X 2-Stage FFD Figure 5. Average umber of rus with % of 5 factors importat. CSB-X vs Two-Stage FFD (5% of 5 factors importat) Average # of Reps CSB-X 2-Stage FFD Scearios Figure 6. Average umber of rus with 5% of 5 factors importat. CSB-X vs Two-Stage FFD (% of 5 factors importat) 6 Average # of Reps CSB-X 2-Stage FFD Scearios Figure 7. Average umber of rus with % of 5 factors importat. 27
28 Appedix: Critical Values for the t distributio: I this appedix, we show how to compute values for c ad c through Mote Carlo simulatio or with a ormal approximatio. Let the cdf of the distributio of the average of N idepedet stadard t-distributed radom variables each with degrees of freedom be called t ( N,, x), where x is the argumet of the cdf. Let c be the -α quatile of this distributio ad c be the -γ quatile of this distributio. Values for c ad c for some commo values of α, γ, ad N are give i Tables A, A2, ad A3. Mote Carlo Simulatio to determie c ad c for t ( N,, x).. Geerate a set of N radom variables, t i {, N}, from a cetral t- distributio with degrees of freedom ad compute the average, called i t j N ti N i = =. 2. Choose a large umber, M, ad repeat Step M times to obtai the average of each set, t j {, M}. j 3. Rak order the t j s so that t ( j) > t ( k) j > k to obtai t ( ) j {, M}. j 4. Let j = Roud [ α M ] ad j Roud [ γ M ] c ( ) j ( ) = t, where Roud [ x ] rouds x to the earest iteger. Normal Approximatio to determie c ad = ( ), the c = t( ) ad c for t ( N,, x). j The variace of a t-distributio with v degrees of freedom is v v 2 for v > 2. If N is relatively large, the by the cetral limit theorem, the average of N idepedet t-distributed 28
29 radom variables is approximately ormal with variace v N v ( 2). Thus, c v N v φ ( 2) ( α ) ad c v N v φ ( 2) ( γ ), where φ ( x) is the iverse fuctio for the stadard ormal distributio. I Tables A, A2, ad A3, critical values c ad c calculated by Mote Carlo simulatio (ad the ormal approximatio) are preseted for cases where the ormal approximatio is ot very accurate. For N > 32, the ormal approximatio is reasoably accurate ad thus tables are ot ecessary. Notice that if ( γ ) α =, the c = c. The ormal approximatio caot be used whe = 3 sice there is o closed form for the variace of the t-distributio with 2 degrees of freedom. = (/a) = (.425) = 5.2 (.63) = 6.3 (.62) = 7.7 (.7) = 8. (.973) = (.95) =.965 (.933) Table A. The critical values, c, for. N = 8 N = 6 N = (/a).98 (.7).848 (.822).737 (.75).726 (.72).72 (.688).668 (.672).676 (.659).426 (/a).758 (.72).63 (.582).536 (.53).57 (.54).57 (.487).483 (.475).473 (.466) α =. These are also the values, c, for γ =.99. (Normal Approximatio results are i paretheses.) 29
30 = 3.29 (/a) = 4.93 (.7) = 5.82 (.822) = (.75) = 7.78 (.72) = (.688) = (.672) =.665 (.659) Table A2. The critical values, c, for.5 N = 8 N = 6 N = (/a).675 (.72).57 (.582).523 (.53).58 (.54).48 (.487).473 (.475).465 (.466).742 (/a).494 (.54).4 (.4).378 (.375).365 (.356).346 (.344).336 (.336).33 (.33) α =. These are also the values, c, for γ =.95. (Normal Approximatio results are i paretheses.) = 3.92 (/a) = (.785) = 5.66 (.64) = (.585) = (.555) = (.536) = 9.58 (.523) =.52 (.54) Table A3. The critical values, c, for. N = 8 N = 6 N = (/a).5 (.555).444 (.543).44 (.44).396 (.392).38 (.379).36 (.37).36 (.363).543 (/a).37 (.392).39 (.32).292 (.292).279 (.277).273 (.268).263 (.262).259 (.257) α =. These are also the values, c, for γ =.9. (Normal Approximatio results are i paretheses.) 3
31 Ackowledgemet Dr. Akema s work was partially supported by EPSRC Grat GR/S9673 ad was udertake while Dr. Akema was visitig Southampto Statistical Scieces Research Istitute at the Uiversity of Southampto i the UK. Dr. Wa s research was partially supported by Purdue research Foudatio (Award No ). We would also like to thak Russell Cheg, Sue Lewis, Barry Nelso, ad a aoymous referee for may helpful discussios ad commets. Refereces Baks, J.; Carso II, J. S.; Nelso, B. L.; ad Nicol, D. M. (25). Discrete-Evet System Simulatio. Fourth Editio, Pretice Hall, New Jersey. Bettovil, B. ad Kleije, J. P. C. (997). Searchig for Importat Factors i Simulatio Models with may Factors: Sequetial Bifurcatio. Europea Joural of Operatioal Research 96, pp Bishop, T. A. ad Dudewicz, E. J. (978). Exact Aalysis of Variace with Uequal Variaces Test Procedures ad Tables. Techometrics 2, pp Bishop, T. A. ad Dudewicz, E. J. (98). Heteroscedastic ANOVA. Sakhya 2, pp Dea, A. M. ad Lewis, S. M., eds. (25). Screeig: Methods for Experimetatio i Idustry. Spriger-Verlag, New York. Kuhfeld, W. F. ad R. D. Tobias, R. D., (25)., Large Factorial Desigs for Product Egieerig ad Marketig Research Applicatios., Techometrics 47, pp Lewis, S. M. ad Dea, A. M. (2). Detectio of Iteractios i Experimets with Large Numbers of Factors (with discussio).. J. R. Statist. Soc. B 63, pp
32 Lu, X. ad Wu, X. (24) A Strategy of Searchig for Active Factors i Supersaturated Screeig Experimets. Joural of Quality Techology 36, pp Motgomery, D. C. (2). The Desig ad Aalysis of Experimets, Fifth Editio, Wiley, 2. Sater, T. J.; Williams, B.; ad Notz, W. (23). The Desig ad Aalysis of Computer Experimets. Spriger-Verlag, New York. Stei, C. (945). A Two Sample Test for a Liear Hypothesis whose Power is Idepedet of the Variace. Aals of Mathematical Statistics 6, pp Wa, H.; Akema, B. E.; ad Nelso, B. L. (25a). Cotrolled Sequetial Bifurcatio: A New Factor-Screeig Method for Discrete-Evet Simulatio. Operatios Research, to appear. Wa, H.; Akema, B. E.; ad Nelso, B. L. (25b). Simulatio Factor Screeig With Cotrolled Sequetial Bifurcatio i the Presece of Iteractios. Workig Paper, Departmet of Idustrial Egieerig ad Maagemet Scieces, Northwester Uiversity, Evasto, IL. Wu, J. C. F. ad Hamada, M. (2). Experimets: Plaig, Aalysis ad Parameter Desig Optimizatio. Wiley, New York. 32
There is no straightforward approach for choosing the warmup period l.
B. Maddah INDE 504 Discrete-Evet Simulatio Output Aalysis () Statistical Aalysis for Steady-State Parameters I a otermiatig simulatio, the iterest is i estimatig the log ru steady state measures of performace.
More informationA statistical method to determine sample size to estimate characteristic value of soil parameters
A statistical method to determie sample size to estimate characteristic value of soil parameters Y. Hojo, B. Setiawa 2 ad M. Suzuki 3 Abstract Sample size is a importat factor to be cosidered i determiig
More informationChapter 6 Sampling Distributions
Chapter 6 Samplig Distributios 1 I most experimets, we have more tha oe measuremet for ay give variable, each measuremet beig associated with oe radomly selected a member of a populatio. Hece we eed to
More informationBecause it tests for differences between multiple pairs of means in one test, it is called an omnibus test.
Math 308 Sprig 018 Classes 19 ad 0: Aalysis of Variace (ANOVA) Page 1 of 6 Itroductio ANOVA is a statistical procedure for determiig whether three or more sample meas were draw from populatios with equal
More informationResampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.
Jauary 1, 2019 Resamplig Methods Motivatio We have so may estimators with the property θ θ d N 0, σ 2 We ca also write θ a N θ, σ 2 /, where a meas approximately distributed as Oce we have a cosistet estimator
More information1 Inferential Methods for Correlation and Regression Analysis
1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet
More informationOutput Analysis (2, Chapters 10 &11 Law)
B. Maddah ENMG 6 Simulatio Output Aalysis (, Chapters 10 &11 Law) Comparig alterative system cofiguratio Sice the output of a simulatio is radom, the comparig differet systems via simulatio should be doe
More informationRandom Variables, Sampling and Estimation
Chapter 1 Radom Variables, Samplig ad Estimatio 1.1 Itroductio This chapter will cover the most importat basic statistical theory you eed i order to uderstad the ecoometric material that will be comig
More informationStatistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.
Statistical Iferece (Chapter 10) Statistical iferece = lear about a populatio based o the iformatio provided by a sample. Populatio: The set of all values of a radom variable X of iterest. Characterized
More informationEstimation for Complete Data
Estimatio for Complete Data complete data: there is o loss of iformatio durig study. complete idividual complete data= grouped data A complete idividual data is the oe i which the complete iformatio of
More information7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals
7-1 Chapter 4 Part I. Samplig Distributios ad Cofidece Itervals 1 7- Sectio 1. Samplig Distributio 7-3 Usig Statistics Statistical Iferece: Predict ad forecast values of populatio parameters... Test hypotheses
More informationProperties and Hypothesis Testing
Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.
More informationSample Size Estimation in the Proportional Hazards Model for K-sample or Regression Settings Scott S. Emerson, M.D., Ph.D.
ample ie Estimatio i the Proportioal Haards Model for K-sample or Regressio ettigs cott. Emerso, M.D., Ph.D. ample ie Formula for a Normally Distributed tatistic uppose a statistic is kow to be ormally
More information1 of 7 7/16/2009 6:06 AM Virtual Laboratories > 6. Radom Samples > 1 2 3 4 5 6 7 6. Order Statistics Defiitios Suppose agai that we have a basic radom experimet, ad that X is a real-valued radom variable
More informationSample Size Determination (Two or More Samples)
Sample Sie Determiatio (Two or More Samples) STATGRAPHICS Rev. 963 Summary... Data Iput... Aalysis Summary... 5 Power Curve... 5 Calculatios... 6 Summary This procedure determies a suitable sample sie
More informationt distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference
EXST30 Backgroud material Page From the textbook The Statistical Sleuth Mea [0]: I your text the word mea deotes a populatio mea (µ) while the work average deotes a sample average ( ). Variace [0]: The
More informationTopic 9: Sampling Distributions of Estimators
Topic 9: Samplig Distributios of Estimators Course 003, 2016 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be
More informationStatistics 511 Additional Materials
Cofidece Itervals o mu Statistics 511 Additioal Materials This topic officially moves us from probability to statistics. We begi to discuss makig ifereces about the populatio. Oe way to differetiate probability
More informationDS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10
DS 00: Priciples ad Techiques of Data Sciece Date: April 3, 208 Name: Hypothesis Testig Discussio #0. Defie these terms below as they relate to hypothesis testig. a) Data Geeratio Model: Solutio: A set
More informationChapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.
Chapter 22 Comparig Two Proportios Copyright 2010 Pearso Educatio, Ic. Comparig Two Proportios Comparisos betwee two percetages are much more commo tha questios about isolated percetages. Ad they are more
More informationSTA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:
STA 2023 Module 10 Comparig Two Proportios Learig Objectives Upo completig this module, you should be able to: 1. Perform large-sample ifereces (hypothesis test ad cofidece itervals) to compare two populatio
More informationThis is an introductory course in Analysis of Variance and Design of Experiments.
1 Notes for M 384E, Wedesday, Jauary 21, 2009 (Please ote: I will ot pass out hard-copy class otes i future classes. If there are writte class otes, they will be posted o the web by the ight before class
More informationChapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.
Chapter 22 Comparig Two Proportios Copyright 2010, 2007, 2004 Pearso Educatio, Ic. Comparig Two Proportios Read the first two paragraphs of pg 504. Comparisos betwee two percetages are much more commo
More information1 Review of Probability & Statistics
1 Review of Probability & Statistics a. I a group of 000 people, it has bee reported that there are: 61 smokers 670 over 5 960 people who imbibe (drik alcohol) 86 smokers who imbibe 90 imbibers over 5
More informationFACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures
FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING Lectures MODULE 5 STATISTICS II. Mea ad stadard error of sample data. Biomial distributio. Normal distributio 4. Samplig 5. Cofidece itervals
More informationChapter 11 Output Analysis for a Single Model. Banks, Carson, Nelson & Nicol Discrete-Event System Simulation
Chapter Output Aalysis for a Sigle Model Baks, Carso, Nelso & Nicol Discrete-Evet System Simulatio Error Estimatio If {,, } are ot statistically idepedet, the S / is a biased estimator of the true variace.
More informationStatistics Lecture 27. Final review. Administrative Notes. Outline. Experiments. Sampling and Surveys. Administrative Notes
Admiistrative Notes s - Lecture 7 Fial review Fial Exam is Tuesday, May 0th (3-5pm Covers Chapters -8 ad 0 i textbook Brig ID cards to fial! Allowed: Calculators, double-sided 8.5 x cheat sheet Exam Rooms:
More informationData Analysis and Statistical Methods Statistics 651
Data Aalysis ad Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasii/teachig.html Suhasii Subba Rao Review of testig: Example The admistrator of a ursig home wats to do a time ad motio
More informationThe Random Walk For Dummies
The Radom Walk For Dummies Richard A Mote Abstract We look at the priciples goverig the oe-dimesioal discrete radom walk First we review five basic cocepts of probability theory The we cosider the Beroulli
More informationOPTIMAL ALGORITHMS -- SUPPLEMENTAL NOTES
OPTIMAL ALGORITHMS -- SUPPLEMENTAL NOTES Peter M. Maurer Why Hashig is θ(). As i biary search, hashig assumes that keys are stored i a array which is idexed by a iteger. However, hashig attempts to bypass
More informationKinetics of Complex Reactions
Kietics of Complex Reactios by Flick Colema Departmet of Chemistry Wellesley College Wellesley MA 28 wcolema@wellesley.edu Copyright Flick Colema 996. All rights reserved. You are welcome to use this documet
More informationTopic 9: Sampling Distributions of Estimators
Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be
More informationComparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading
Topic 15 - Two Sample Iferece I STAT 511 Professor Bruce Craig Comparig Two Populatios Research ofte ivolves the compariso of two or more samples from differet populatios Graphical summaries provide visual
More informationLecture 2: Monte Carlo Simulation
STAT/Q SCI 43: Itroductio to Resamplig ethods Sprig 27 Istructor: Ye-Chi Che Lecture 2: ote Carlo Simulatio 2 ote Carlo Itegratio Assume we wat to evaluate the followig itegratio: e x3 dx What ca we do?
More informationAccess to the published version may require journal subscription. Published with permission from: Elsevier.
This is a author produced versio of a paper published i Statistics ad Probability Letters. This paper has bee peer-reviewed, it does ot iclude the joural pagiatio. Citatio for the published paper: Forkma,
More informationMOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.
XI-1 (1074) MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. R. E. D. WOOLSEY AND H. S. SWANSON XI-2 (1075) STATISTICAL DECISION MAKING Advaced
More informationENGI 4421 Confidence Intervals (Two Samples) Page 12-01
ENGI 44 Cofidece Itervals (Two Samples) Page -0 Two Sample Cofidece Iterval for a Differece i Populatio Meas [Navidi sectios 5.4-5.7; Devore chapter 9] From the cetral limit theorem, we kow that, for sufficietly
More informationOpen book and notes. 120 minutes. Cover page and six pages of exam. No calculators.
IE 330 Seat # Ope book ad otes 120 miutes Cover page ad six pages of exam No calculators Score Fial Exam (example) Schmeiser Ope book ad otes No calculator 120 miutes 1 True or false (for each, 2 poits
More informationIf, for instance, we were required to test whether the population mean μ could be equal to a certain value μ
STATISTICAL INFERENCE INTRODUCTION Statistical iferece is that brach of Statistics i which oe typically makes a statemet about a populatio based upo the results of a sample. I oesample testig, we essetially
More informationInvestigating the Significance of a Correlation Coefficient using Jackknife Estimates
Iteratioal Joural of Scieces: Basic ad Applied Research (IJSBAR) ISSN 2307-4531 (Prit & Olie) http://gssrr.org/idex.php?joural=jouralofbasicadapplied ---------------------------------------------------------------------------------------------------------------------------
More informationApril 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE
April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE TERRY SOO Abstract These otes are adapted from whe I taught Math 526 ad meat to give a quick itroductio to cofidece
More informationStatistical Intervals for a Single Sample
3/5/06 Applied Statistics ad Probability for Egieers Sixth Editio Douglas C. Motgomery George C. Ruger Chapter 8 Statistical Itervals for a Sigle Sample 8 CHAPTER OUTLINE 8- Cofidece Iterval o the Mea
More informationComputing Confidence Intervals for Sample Data
Computig Cofidece Itervals for Sample Data Topics Use of Statistics Sources of errors Accuracy, precisio, resolutio A mathematical model of errors Cofidece itervals For meas For variaces For proportios
More informationCS284A: Representations and Algorithms in Molecular Biology
CS284A: Represetatios ad Algorithms i Molecular Biology Scribe Notes o Lectures 3 & 4: Motif Discovery via Eumeratio & Motif Represetatio Usig Positio Weight Matrix Joshua Gervi Based o presetatios by
More informationApproximate Confidence Interval for the Reciprocal of a Normal Mean with a Known Coefficient of Variation
Metodološki zvezki, Vol. 13, No., 016, 117-130 Approximate Cofidece Iterval for the Reciprocal of a Normal Mea with a Kow Coefficiet of Variatio Wararit Paichkitkosolkul 1 Abstract A approximate cofidece
More informationANALYSIS OF EXPERIMENTAL ERRORS
ANALYSIS OF EXPERIMENTAL ERRORS All physical measuremets ecoutered i the verificatio of physics theories ad cocepts are subject to ucertaities that deped o the measurig istrumets used ad the coditios uder
More informationChapter 13, Part A Analysis of Variance and Experimental Design
Slides Prepared by JOHN S. LOUCKS St. Edward s Uiversity Slide 1 Chapter 13, Part A Aalysis of Variace ad Eperimetal Desig Itroductio to Aalysis of Variace Aalysis of Variace: Testig for the Equality of
More informationTopic 9: Sampling Distributions of Estimators
Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be
More informationExpectation and Variance of a random variable
Chapter 11 Expectatio ad Variace of a radom variable The aim of this lecture is to defie ad itroduce mathematical Expectatio ad variace of a fuctio of discrete & cotiuous radom variables ad the distributio
More informationMATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4
MATH 30: Probability ad Statistics 9. Estimatio ad Testig of Parameters Estimatio ad Testig of Parameters We have bee dealig situatios i which we have full kowledge of the distributio of a radom variable.
More informationOutput Analysis and Run-Length Control
IEOR E4703: Mote Carlo Simulatio Columbia Uiversity c 2017 by Marti Haugh Output Aalysis ad Ru-Legth Cotrol I these otes we describe how the Cetral Limit Theorem ca be used to costruct approximate (1 α%
More informationHypothesis Testing. Evaluation of Performance of Learned h. Issues. Trade-off Between Bias and Variance
Hypothesis Testig Empirically evaluatig accuracy of hypotheses: importat activity i ML. Three questios: Give observed accuracy over a sample set, how well does this estimate apply over additioal samples?
More informationG. R. Pasha Department of Statistics Bahauddin Zakariya University Multan, Pakistan
Deviatio of the Variaces of Classical Estimators ad Negative Iteger Momet Estimator from Miimum Variace Boud with Referece to Maxwell Distributio G. R. Pasha Departmet of Statistics Bahauddi Zakariya Uiversity
More informationChapter 8: Estimating with Confidence
Chapter 8: Estimatig with Cofidece Sectio 8.2 The Practice of Statistics, 4 th editio For AP* STARNES, YATES, MOORE Chapter 8 Estimatig with Cofidece 8.1 Cofidece Itervals: The Basics 8.2 8.3 Estimatig
More informationy ij = µ + α i + ɛ ij,
STAT 4 ANOVA -Cotrasts ad Multiple Comparisos /3/04 Plaed comparisos vs uplaed comparisos Cotrasts Cofidece Itervals Multiple Comparisos: HSD Remark Alterate form of Model I y ij = µ + α i + ɛ ij, a i
More information6.3 Testing Series With Positive Terms
6.3. TESTING SERIES WITH POSITIVE TERMS 307 6.3 Testig Series With Positive Terms 6.3. Review of what is kow up to ow I theory, testig a series a i for covergece amouts to fidig the i= sequece of partial
More informationA quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population
A quick activity - Cetral Limit Theorem ad Proportios Lecture 21: Testig Proportios Statistics 10 Coli Rudel Flip a coi 30 times this is goig to get loud! Record the umber of heads you obtaied ad calculate
More informationDouble Stage Shrinkage Estimator of Two Parameters. Generalized Exponential Distribution
Iteratioal Mathematical Forum, Vol., 3, o. 3, 3-53 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/.9/imf.3.335 Double Stage Shrikage Estimator of Two Parameters Geeralized Expoetial Distributio Alaa M.
More informationCEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering
CEE 5 Autum 005 Ucertaity Cocepts for Geotechical Egieerig Basic Termiology Set A set is a collectio of (mutually exclusive) objects or evets. The sample space is the (collectively exhaustive) collectio
More informationAnalysis of Experimental Measurements
Aalysis of Experimetal Measuremets Thik carefully about the process of makig a measuremet. A measuremet is a compariso betwee some ukow physical quatity ad a stadard of that physical quatity. As a example,
More informationA RANK STATISTIC FOR NON-PARAMETRIC K-SAMPLE AND CHANGE POINT PROBLEMS
J. Japa Statist. Soc. Vol. 41 No. 1 2011 67 73 A RANK STATISTIC FOR NON-PARAMETRIC K-SAMPLE AND CHANGE POINT PROBLEMS Yoichi Nishiyama* We cosider k-sample ad chage poit problems for idepedet data i a
More informationBasis for simulation techniques
Basis for simulatio techiques M. Veeraraghava, March 7, 004 Estimatio is based o a collectio of experimetal outcomes, x, x,, x, where each experimetal outcome is a value of a radom variable. x i. Defiitios
More information6 Sample Size Calculations
6 Sample Size Calculatios Oe of the major resposibilities of a cliical trial statisticia is to aid the ivestigators i determiig the sample size required to coduct a study The most commo procedure for determiig
More informationPSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 9
Hypothesis testig PSYCHOLOGICAL RESEARCH (PYC 34-C Lecture 9 Statistical iferece is that brach of Statistics i which oe typically makes a statemet about a populatio based upo the results of a sample. I
More informationOverview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions
Chapter 9 Slide Ifereces from Two Samples 9- Overview 9- Ifereces about Two Proportios 9- Ifereces about Two Meas: Idepedet Samples 9-4 Ifereces about Matched Pairs 9-5 Comparig Variatio i Two Samples
More informationTests of Hypotheses Based on a Single Sample (Devore Chapter Eight)
Tests of Hypotheses Based o a Sigle Sample Devore Chapter Eight MATH-252-01: Probability ad Statistics II Sprig 2018 Cotets 1 Hypothesis Tests illustrated with z-tests 1 1.1 Overview of Hypothesis Testig..........
More informationSequences A sequence of numbers is a function whose domain is the positive integers. We can see that the sequence
Sequeces A sequece of umbers is a fuctio whose domai is the positive itegers. We ca see that the sequece 1, 1, 2, 2, 3, 3,... is a fuctio from the positive itegers whe we write the first sequece elemet
More information2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2
Chapter 8 Comparig Two Treatmets Iferece about Two Populatio Meas We wat to compare the meas of two populatios to see whether they differ. There are two situatios to cosider, as show i the followig examples:
More informationResponse Variable denoted by y it is the variable that is to be predicted measure of the outcome of an experiment also called the dependent variable
Statistics Chapter 4 Correlatio ad Regressio If we have two (or more) variables we are usually iterested i the relatioship betwee the variables. Associatio betwee Variables Two variables are associated
More informationA sequence of numbers is a function whose domain is the positive integers. We can see that the sequence
Sequeces A sequece of umbers is a fuctio whose domai is the positive itegers. We ca see that the sequece,, 2, 2, 3, 3,... is a fuctio from the positive itegers whe we write the first sequece elemet as
More informationSection 9.2. Tests About a Population Proportion 12/17/2014. Carrying Out a Significance Test H A N T. Parameters & Hypothesis
Sectio 9.2 Tests About a Populatio Proportio P H A N T O M S Parameters Hypothesis Assess Coditios Name the Test Test Statistic (Calculate) Obtai P value Make a decisio State coclusio Sectio 9.2 Tests
More informationMathematical Modeling of Optimum 3 Step Stress Accelerated Life Testing for Generalized Pareto Distribution
America Joural of Theoretical ad Applied Statistics 05; 4(: 6-69 Published olie May 8, 05 (http://www.sciecepublishiggroup.com/j/ajtas doi: 0.648/j.ajtas.05040. ISSN: 6-8999 (Prit; ISSN: 6-9006 (Olie Mathematical
More informationExtreme Value Charts and Analysis of Means (ANOM) Based on the Log Logistic Distribution
Joural of Moder Applied Statistical Methods Volume 11 Issue Article 0 11-1-01 Extreme Value Charts ad Aalysis of Meas (ANOM) Based o the Log Logistic istributio B. Sriivasa Rao R.V.R & J.C. College of
More informationPower and Type II Error
Statistical Methods I (EXST 7005) Page 57 Power ad Type II Error Sice we do't actually kow the value of the true mea (or we would't be hypothesizig somethig else), we caot kow i practice the type II error
More informationModule 1 Fundamentals in statistics
Normal Distributio Repeated observatios that differ because of experimetal error ofte vary about some cetral value i a roughly symmetrical distributio i which small deviatios occur much more frequetly
More informationAgreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times
Sigificace level vs. cofidece level Agreemet of CI ad HT Lecture 13 - Tests of Proportios Sta102 / BME102 Coli Rudel October 15, 2014 Cofidece itervals ad hypothesis tests (almost) always agree, as log
More informationComparison Study of Series Approximation. and Convergence between Chebyshev. and Legendre Series
Applied Mathematical Scieces, Vol. 7, 03, o. 6, 3-337 HIKARI Ltd, www.m-hikari.com http://d.doi.org/0.988/ams.03.3430 Compariso Study of Series Approimatio ad Covergece betwee Chebyshev ad Legedre Series
More informationAP Statistics Review Ch. 8
AP Statistics Review Ch. 8 Name 1. Each figure below displays the samplig distributio of a statistic used to estimate a parameter. The true value of the populatio parameter is marked o each samplig distributio.
More informationClass 23. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700
Class 23 Daiel B. Rowe, Ph.D. Departmet of Mathematics, Statistics, ad Computer Sciece Copyright 2017 by D.B. Rowe 1 Ageda: Recap Chapter 9.1 Lecture Chapter 9.2 Review Exam 6 Problem Solvig Sessio. 2
More informationTHE KALMAN FILTER RAUL ROJAS
THE KALMAN FILTER RAUL ROJAS Abstract. This paper provides a getle itroductio to the Kalma filter, a umerical method that ca be used for sesor fusio or for calculatio of trajectories. First, we cosider
More informationFrequentist Inference
Frequetist Iferece The topics of the ext three sectios are useful applicatios of the Cetral Limit Theorem. Without kowig aythig about the uderlyig distributio of a sequece of radom variables {X i }, for
More informationDiscrete Mathematics for CS Spring 2008 David Wagner Note 22
CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 22 I.I.D. Radom Variables Estimatig the bias of a coi Questio: We wat to estimate the proportio p of Democrats i the US populatio, by takig
More informationGUIDELINES ON REPRESENTATIVE SAMPLING
DRUGS WORKING GROUP VALIDATION OF THE GUIDELINES ON REPRESENTATIVE SAMPLING DOCUMENT TYPE : REF. CODE: ISSUE NO: ISSUE DATE: VALIDATION REPORT DWG-SGL-001 002 08 DECEMBER 2012 Ref code: DWG-SGL-001 Issue
More information11 Correlation and Regression
11 Correlatio Regressio 11.1 Multivariate Data Ofte we look at data where several variables are recorded for the same idividuals or samplig uits. For example, at a coastal weather statio, we might record
More informationCorrelation Regression
Correlatio Regressio While correlatio methods measure the stregth of a liear relatioship betwee two variables, we might wish to go a little further: How much does oe variable chage for a give chage i aother
More informationIntroducing Sample Proportions
Itroducig Sample Proportios Probability ad statistics Aswers & Notes TI-Nspire Ivestigatio Studet 60 mi 7 8 9 0 Itroductio A 00 survey of attitudes to climate chage, coducted i Australia by the CSIRO,
More informationRead through these prior to coming to the test and follow them when you take your test.
Math 143 Sprig 2012 Test 2 Iformatio 1 Test 2 will be give i class o Thursday April 5. Material Covered The test is cummulative, but will emphasize the recet material (Chapters 6 8, 10 11, ad Sectios 12.1
More informationStochastic Simulation
Stochastic Simulatio 1 Itroductio Readig Assigmet: Read Chapter 1 of text. We shall itroduce may of the key issues to be discussed i this course via a couple of model problems. Model Problem 1 (Jackso
More informationUNIVERSITY OF TORONTO Faculty of Arts and Science APRIL/MAY 2009 EXAMINATIONS ECO220Y1Y PART 1 OF 2 SOLUTIONS
PART of UNIVERSITY OF TORONTO Faculty of Arts ad Sciece APRIL/MAY 009 EAMINATIONS ECO0YY PART OF () The sample media is greater tha the sample mea whe there is. (B) () A radom variable is ormally distributed
More informationProblem Set 4 Due Oct, 12
EE226: Radom Processes i Systems Lecturer: Jea C. Walrad Problem Set 4 Due Oct, 12 Fall 06 GSI: Assae Gueye This problem set essetially reviews detectio theory ad hypothesis testig ad some basic otios
More informationStatistical inference: example 1. Inferential Statistics
Statistical iferece: example 1 Iferetial Statistics POPULATION SAMPLE A clothig store chai regularly buys from a supplier large quatities of a certai piece of clothig. Each item ca be classified either
More informationRAINFALL PREDICTION BY WAVELET DECOMPOSITION
RAIFALL PREDICTIO BY WAVELET DECOMPOSITIO A. W. JAYAWARDEA Departmet of Civil Egieerig, The Uiversit of Hog Kog, Hog Kog, Chia P. C. XU Academ of Mathematics ad Sstem Scieces, Chiese Academ of Scieces,
More informationA proposed discrete distribution for the statistical modeling of
It. Statistical Ist.: Proc. 58th World Statistical Cogress, 0, Dubli (Sessio CPS047) p.5059 A proposed discrete distributio for the statistical modelig of Likert data Kidd, Marti Cetre for Statistical
More informationOn an Application of Bayesian Estimation
O a Applicatio of ayesia Estimatio KIYOHARU TANAKA School of Sciece ad Egieerig, Kiki Uiversity, Kowakae, Higashi-Osaka, JAPAN Email: ktaaka@ifokidaiacjp EVGENIY GRECHNIKOV Departmet of Mathematics, auma
More informationStatistical Fundamentals and Control Charts
Statistical Fudametals ad Cotrol Charts 1. Statistical Process Cotrol Basics Chace causes of variatio uavoidable causes of variatios Assigable causes of variatio large variatios related to machies, materials,
More informationThe standard deviation of the mean
Physics 6C Fall 20 The stadard deviatio of the mea These otes provide some clarificatio o the distictio betwee the stadard deviatio ad the stadard deviatio of the mea.. The sample mea ad variace Cosider
More informationEstimation of a population proportion March 23,
1 Social Studies 201 Notes for March 23, 2005 Estimatio of a populatio proportio Sectio 8.5, p. 521. For the most part, we have dealt with meas ad stadard deviatios this semester. This sectio of the otes
More informationPower Comparison of Some Goodness-of-fit Tests
Florida Iteratioal Uiversity FIU Digital Commos FIU Electroic Theses ad Dissertatios Uiversity Graduate School 7-6-2016 Power Compariso of Some Goodess-of-fit Tests Tiayi Liu tliu019@fiu.edu DOI: 10.25148/etd.FIDC000750
More informationMBACATÓLICA. Quantitative Methods. Faculdade de Ciências Económicas e Empresariais UNIVERSIDADE CATÓLICA PORTUGUESA 9. SAMPLING DISTRIBUTIONS
MBACATÓLICA Quatitative Methods Miguel Gouveia Mauel Leite Moteiro Faculdade de Ciêcias Ecoómicas e Empresariais UNIVERSIDADE CATÓLICA PORTUGUESA 9. SAMPLING DISTRIBUTIONS MBACatólica 006/07 Métodos Quatitativos
More informationContinuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised
Questio 1. (Topics 1-3) A populatio cosists of all the members of a group about which you wat to draw a coclusio (Greek letters (μ, σ, Ν) are used) A sample is the portio of the populatio selected for
More information