STATISTICAL PROCESS ADJUSTMENT PROBLEMS IN SHORT-RUN MANUFACTURING

Size: px
Start display at page:

Download "STATISTICAL PROCESS ADJUSTMENT PROBLEMS IN SHORT-RUN MANUFACTURING"

Transcription

1 The Pennsylvania State University The Graduate School Department of Industrial and Manufacturing Engineering STATISTICAL PROCESS ADJUSTMENT PROBLEMS IN SHORT-RUN MANUFACTURING A Thesis in Industrial Engineering and Operations Research by Omer Arda Vanli c 2007 Omer Arda Vanli Submitted in Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy August 2007

2 The thesis of Omer Arda Vanli was reviewed and approved by the following: Enrique Del Castillo Professor of Industrial and Manufacturing Engineering Thesis Adviser Chair of Committee M. Jeya Chandra Professor of Industrial and Manufacturing Engineering Tao Yao Assistant Professor of Industrial and Manufacturing Engineering Murali Haran Assistant Professor of Statistics Nital S. Patel Senior Staff Engineer, Intel Corporation Special Member Richard J. Koubek Professor and Departmental Head of Industrial and Manufacturing Engineering Signatures are on file in the Graduate School.

3 ii ABSTRACT Manufacturing engineers often change or adjust the operating conditions of a production process by manipulating a set of variables, or controllable factors. The goal is usually to keep some other variables of interest, the responses, close to given target values in the presence of uncontrollable variables, the noise factors and the disturbances, that also affect the responses. The performance of a process adjustment technique, which indicates how to change the controllable factors of a process, depends on the amount of information available about the relation between the controllable factors, or inputs, the noise factors and disturbances, and the responses, or outputs. This information usually results in an input-output, or transfer function, model. This dissertation considers problems related to the identification of these models, and proposes new process adjustment techniques when the amount of information available is limited (i.e. process runs are short) and noise factors are present in a process. A specific problem addressed in this dissertation is how to identify the input-output model of a process that is being adjusted. Such a closed-loop identification method is necessary in industrial processes that cannot be left to run without control. Traditional system identification techniques assume open-loop (no control) operation. The first part of this dissertation presents new methodology for the identification of Box- Jenkins transfer function models under closed-loop operating conditions. It is shown how the input-output delay of the process represents crucial information and that, if known a priori, it would facilitate the identification of the rest of the model. Hence, new methods for

4 iii the specific estimation of the input-output delay, while the process operates in closed-loop, are proposed. The methods are based on Time Series change-point detection techniques. In the second part of the dissertation we study a system identification problem frequently found in semiconductor manufacturing. This is the so-called context-based model identification problem, where different process models need to be identified for different batches of products depending on the manufacturing context under which the process data was obtained (e.g. the product type, operation, chamber, tool etc.). A model identification method that uses categorical variable selection methods is developed and applied to a real semiconductor manufacturing data set. The last part of the thesis presents new adjustment methods for processes that involve uncontrollable noise factors. In the statistical process optimization literature, Robust Parameter Design (RPD) methods have been used for designing processes that are insensitive against variation caused by noise factors. These methods, however, are largely applied off-line; that is, they are not process adjustment methods that recommend different controllable factor settings depending on the on-line noise factors measured during production. Instead, they determine the optimal process settings before production starts and they do not alter the optimal settings during production. In this research, new Bayesian process adjustment methods for on-line robust parameter design are proposed. The proposed online RPD controllers are feedforward multiple response controllers that utilize on-line noise factor measurements, assumed available.

5 Contents List of Figures ix List of Tables xi List of Acronyms xiii Acknowledgements xiv 1 Introduction Dissertation Topics and Research Objectives Closed-Loop System Identification Methods Context-Based Model Identification for Run-to-Run Control On-line Robust Parameter Design Dissertation Outline Literature Review Time Series Control and Bayesian Inference Closed-loop System Identification Methods Run-to-Run Control and Model Variable Selection Methods iv

6 2.4 Robust Parameter Design Methods v 3 Closed-Loop System Identification for Small Samples with Constraints Introduction Closed-Loop identification of Box-Jenkins transfer function models Identification and Parameter Estimation of the Process Model Least squares estimation of the process model Identifiability conditions Some constraint types and their specification in practice Effects of adding constraints on the identifiability of a process Simulation Example: Benefits of adding constraints when only small samples are available Estimation of the process model from the simulated closed-loop realizations Assessment of the benefits of using constraints Case Study: A Gas Furnace Process Chapter Summary Change-point Methods for Closed-loop Delay Estimation Introduction Process Assumptions Bayesian approaches Prior specification of the parameters Posterior distribution of the parameters

7 vi Full Bayesian Approach Reduced Bayesian Approach A sequential probability ratio test approach A cumulative sum approach Example 1: Illustration of the proposed methods Performance of the proposed approaches under a second order process Example 2: Comparison with the Laguerre method Guidelines for designing the delay estimation experiment Chapter Summary Appendices of Chapter Appendix 4A: Selection of the reference values of the SPRT and the CUSUM schemes Appendix 4B: Estimation of the parameters required in the change-point methods Appendix 4C: Convergence Diagnostics for the MCMC simulations in the Full Bayesian Approach Context-based Model Identification for Run-to-Run Control Introduction Proposed Model Identification Approach Regression Model Variable Selection ANOVA Model Variable Selection Variable Selection Criterion and the Stopping Rule

8 vii 5.4 Context-Based Model Identification Algorithm Regression Model Variable Selection ANOVA Model Variable Selection Simulation Example Application to a Lithography Alignment Process Chapter Summary Bayesian Approaches for On-line Robust Parameter Design Introduction Process Assumptions Bayesian Posterior Predictive Density Solving the Bayesian Robust Control Problems Existing Approaches Performance evaluation of the proposed and existing methods Extension to Multiple Response Processes Chapter Summary Appendices of Chapter Appendix 6A: Representing the prediction variance as a polynomial in the control factors Appendix 6B: Computing the noise factor forecasts required in the BR control law Appendix 6C: Computing the noise factor posterior moments required in the BRN control law

9 viii Appendix 6D: Obtaining the posterior distribution of the parameters of an MA(1) model using MCMC Summary of contributions and directions for future research Contributions in Closed-loop System Identification Methods Contributions in Context-based Model Identification for Run-to-run Control Contributions in On-line Robust Parameter Design Recommendations for Future Research Bibliography 200

10 List of Figures 2.1 Open-loop and Closed-loop operation of a process Input-output delay of a dynamic process Non-segregated and fully segregated run-to-run controllers Traditional (off-line) and feedforward (on-line) RPD control methods Effect of adding a constraint on the identifiability of the process model The sample squared bias and variance of the identified models Percentages of the estimated controllers Closed-loop identification of the gas furnace process data The unit step response of the dynamics and disturbance transfer functions Input-output delay of a dynamic process Simulated delay estimation experiment(example1) The results of the delay estimation analyses (Example 1) Comparison of the step responses Simulated delay estimation experiments (Example 2) Histograms of the delay estimates (Example 2) Experimentation and operation periods of a hypothetical process ix

11 x 4.8 Sensitivity of the cost functions to operation length The sensitivity of the cost function to number of replications The sensitivity of the cost functions to the true delay Effect of the selection of the reference value on run length Simulated marginal posterior distributions Hypothetical data from a three-context process Sequential regression Flow chart of the model identification algorithm Simulation example disturbance data Pairwise plots of the context variables (Lithography Process) Disturbance values after discounting the autocorrelations (Lithography Process) Disturbance values after removing the offsets (Lithography Process) Predictions and actual measurements (Lithography Process) Flow chart of the BRN control approach Variances of the optimal control factor settings and the empirical cumulative distribution function of MSE(y) Historical noise factor data and the posterior distributions of the parameters The empirical CDF of M SE(y) under different control laws and noise factors The designs and hypothetical process data used in the examples Sample simulation of the Plasma Etch Process over the production period The empirical CDF of the Plasma Etch Process response MSE

12 List of Tables 3.1 Closed-loop equations with PI controllers Constraints used in simulations Cost of identification and cost of operation (gas furnace process) Mean, variance and RMSE of the delay estimates (Example 1) Mean, variance and the root mean square error of the delay estimates of the first order and second order processes Scenarios for prior distributions in the reduced Bayesian approach Mean, Variance and Root Mean Square Error (RMSE) of the delay estimates obtained (Example 2) The variance of the process under MMSE controller as a function of the estimated delay Raftery and Lewis diagnostic and the Geweke diagnostic test results Disturbance data, model error and context variable levels for the 2 tool 2 product example The record numbers of the tools 1,2 and the products 1,2 for the data in Table xi

13 xii 5.3 Drift rates (Simulation Example) Iterations in the first scenario (Simulation example) Iterations in the second scenario (Simulation example) Context variables (Lithography Process) Iterations of the variable selection algorithm (Lithography Process) Autoregressive parameter estimates (Lithography Process) Summary of the estimated model (Lithography Process) Fisher s white noise test results on the residuals Existing and proposed approaches considered in this chapter

14 List of Acronyms xiii RPD SPC APC CDF MMSE MSE ARMA BJ ARMAX CUSUM SPRT ANOVA PI i.i.d AR IMA TF Robust parameter design Statistical process control Automatic process control Cumulative distribution function Minimum mean square error Mean square error Auto Regressive Moving Average Box Jenkins Auto Regressive Moving Average exogeneous Cumulative Sum Sequential Probability Ratio Test Analysis of Variance Proportional Integral independently and identically distributed Autoregressive Integrated Moving Average Transfer Function

15 xiv ACKNOWLEDGEMENTS I would like start by expressing my gratitude to my advisor Dr. Enrique Del Castillo for his genuine interest and attention in my work throughout my doctoral study. Without his constant support, motivation and inspiration this dissertation would not be possible. He not only guided me through my thesis work but also introduced me to many industrial contacts that are very important both in my doctoral study and for my future career. This dissertation work was partially funded by his research grant from Intel Corporation. I would like to thank Dr. Jeya Chandra, my dissertation committee member, for giving me the opportunity to work as his research assistant and as a teaching assistant in the Quality and Manufacturing Management (QMM) program. I convey my thanks to the faculty and staff of the Industrial Engineering department, the QMM program and the Statistics department at Penn State. I thank Dr. Nital S. Patel, my dissertation committee member, for his interest in my dissertation and his sincere support in writing the paper Model Context Selection for Runto-run Control. I also would like to thank the other committee members, Dr. Tao Yao and Dr. Murali Haran. The classes I have taken with my committee members and their feedbacks on my research proposal have been very valuable in improving my dissertation. I also thank Dr. Dennis Lin for reviewing my thesis proposal and his comments. My special thanks are to my colleagues and friends in the Engineering Statistics Lab, Dr. Ramkumar Rajagopal, Dr. Zilong Lian and Eduardo Santiago Duran, for sharing with me many useful research discussions. I thank Dr. Bianca M. Colosimo for collaborating with

16 xv me on the paper Statistical Change-point Methods for Closed-loop Delay Estimation. I also would like thank Dr. Mani Janakiram of Intel Corporation for mentoring me during my internship at Intel and for co-authoring the paper Model Context Selection for Run-to-run Control. Finally, I am grateful to my parents, Ercin Vanli and Gulderen Vanli and my younger brother, Tunca Vanli, for giving me endless emotional support and always being by my side throughout my doctoral study.

17 To My Parents xvi

18 Chapter 1 Introduction Manufacturing engineers often change or adjust the operating conditions of a production process by manipulating a set of variables, or controllable factors. The goal is usually to keep some other variables of interest, the responses, close to given target values in the presence of uncontrollable variables, the noise factors and the disturbances, that also affect the responses. The performance of a process adjustment technique, which indicates how to change the controllable factors of a process, depends on the amount of information available about the relation between the controllable factors, or inputs, the noise factors and disturbances, and the responses, or outputs. This information usually results in an input-output, or transfer function, model. This dissertation considers problems related to the identification of these models, and proposes new process adjustment techniques when the amount of information available is limited and noise factors, which in turn can be explicitly modeled, are present in a process. Throughout this dissertation we assume all measurements occur at discrete, equidistant 1

19 2 points in time. This is a common assumption in statistical process control (SPC) methods. 1 Limited data typically exists in situations in industry when production runs are short. A specific problem addressed in this dissertation is how to identify the input-output model of a process that is being adjusted. Such a closed-loop identification method is necessary in industrial processes that cannot be left to run without control. Traditional system identification techniques assume open-loop (no control) operation. Industrial process models need to be identified and estimated from operating data, hence they involve some level of uncertainty. Automatic process control (APC) methods of feedback and feedforward control can be effective at reducing variation when the process model is accurate, however, their performance can deteriorate to the point of actually increasing the variation if the process model is poorly estimated. According to the modeling methods employed, manufacturing processes are generally classified into two groups: discrete-part and continuous-flow manufacturing processes [16, 14]. In a discrete-part process, the dynamical behavior is not significant and a static transfer function model that reflects an immediate effect of the inputs on the outputs (i.e. an input-output delay of one time period) frequently provides adequate accuracy. These models have a regression model form and are fitted using experimental design methods. By contrast, in continuous-flow processes, the dynamical input-output behavior is significant. In particular, due to the transport time fluids take to flow through piping, the inputoutput delays are longer. Therefore, transfer function models that accurately represent the transient (dynamic) response and multiple periods of delay need to be used for continuous- 1 SPC methods are not control but monitoring methods in the sense that no explicit adjustment methods are utilized and the only goal is instead the detection of process faults and errors.

20 3 flow processes. These models are fitted using Time Series analysis and transfer function model identification methods [13]. The methods presented in this thesis are applicable for both discrete-part and continuous-flow processes. Process variation can be caused by unmeasured disturbances (modeled in statistical models as random errors) or be due to our uncertainty in the parameters of the model we wish to estimate from data. In addition to these sources of variation, some process disturbances can be measured. Measured disturbances, the so-called noise factors, are uncontrollable during actual production. Ambient temperature and raw material variation are two typical examples of noise factors encountered in manufacturing. In the statistical process optimization literature, Robust Parameter Design (RPD) methods [81, 64] have been proposed and studied for designing processes that are insensitive against variation caused by noise factors. These methods, however, are largely applied off-line; that is, they are not process adjustment methods that recommend different controllable factor settings depending on the on-line noise factors measured during production. Instead, they determine the optimal process settings before production starts and they do not alter the optimal settings during production. These settings are hoped to make the process resistent, or robust, to the subsequent variation in the noise factors. An example of discrete-part production processes where the techniques developed in this thesis are applicable is semiconductor manufacturing. In semiconductor manufacturing it is common to find the same tool processing different types of products and operations because of the high capital costs of the tools and the limited capacity of the manufacturing facility [69]. The particular combination of different (categorical) factors related to the process, such as the product, operation, chamber and machine, is defined as the manufacturing

21 context of the process, and the factors that define the context of the process are called 4 the context variables [31]. In production, different products and operations often need to be processed on the same machine or process stage and this causes the parameters of the process model to be non-homogeneous across different contexts. Run-to-run control, a class of process adjustment methods used in semiconductor manufacturing, takes into account the variability in the process parameters by defining a different process model for each context. The necessity to identify the different process models with respect to their context gives rise to a different kind of system identification problem for run-to-run control: the problem of identifying which context variables require a different process model. The contributions of this research focus on system identification and controller design, two widely studied areas in the APC literature. While this dissertation does not consider SPC monitoring problems, some SPC techniques will be used in developing the new identification and control methods. The research in this thesis is motivated by the characteristics of modern manufacturing environments where the processes are operated over relatively short periods of time. In semiconductor manufacturing, for example, the process runs are necessarily short because many different types of products and operations must be processed within a fixed interval of time. As it will be discussed in the following chapters, the limited amount of historical data in short-run processes presents challenges for system identification. In summary, the overall goals of this research are (i) to develop efficient model identification methods based on limited data obtained during the closed-loop operation of a process;

22 (ii) to develop a methodology for context-based model identification problem (as it occurs in semiconductor manufacturing); 5 (iii) to propose new process adjustment methods for on-line Robust Parameter Design control, i.e., methods that utilize noise factor measurements obtained during production to re-optimize the process settings for better control. The next section describes in more detail the problems addressed in this dissertation and the specific objectives related to each of the overall goals mentioned above. 1.1 Dissertation Topics and Research Objectives Closed-Loop System Identification Methods In many industrial settings, it is of interest that the unknown model of the process be identified in closed loop, that is, while the process is being adjusted by a feedback controller. The identified model can then be used to re-design the controller in such a way that the quality characteristic under control better achieves a given objective. Open-loop identification of the process (i.e. observing the process when no feedback control is exercised) would not be feasible or cost effective in an industrial process because the process could be unstable and drift off-target dangerously or could produce expensive scrap when uncontrolled. The main difficulty with closed-loop identification is that the control input level is frequently a linear function of the output and the data collected during closed-loop operation is less informative than open-loop data ([48], pg. 430). As a consequence of this, the

23 6 parameter estimates tend to be less precise and sometimes even a unique solution is not obtainable. Traditional approaches to closed-loop identification of transfer function models require a sufficiently large data set and model forms that are general enough while at the same time requiring some form of external excitation (a dither signal ) be applied to the process. The use of a dither signal, a commonly applied approach in closed-loop identification [18, 19], can unfortunately be too expensive if the cost of changing the controllable factor or the cost of running the process off-target is high, precisely the conditions that justify the use of a controller. This research will concentrate on identifying and estimating the transfer function model of a process in closed-loop without the use of any external excitation. In order to improve the parameter estimates obtained in closed-loop, prior knowledge about the process will be used. In particular, the input-output process delay will be utilized as one form of prior process knowledge in identification. It will be shown that if prior information on the delay is available then identification of the transfer function under closed-loop operation becomes easier. Hence, we investigate methods for estimating the input-output delay only. The first specific objective in this topic is to develop a method for closed-loop identification of transfer function models from small samples that does not rely on the use of a dither signal. The method aims at improving closed-loop model parameter estimates by using some forms of prior process information that are commonly available in practice. The second specific objective in this topic is to develop statistical change-point detection methods for closed-loop estimation of the process delay. The method uses a step-response test to induce a change in the output and applies a change-point detection method to

24 7 estimate the time of the output change and hence, the delay Context-Based Model Identification for Run-to-Run Control Run-to-run control is frequently applied for adjusting semiconductor manufacturing processes. Within each batch (or run) of silicon wafers, the control action is exercised by automatic proportional-integral (PI) controllers, while between batches, the set-points of the automatic controllers are adjusted by a run-to-run controller [27]. As it was mentioned before, due to the many different products and operations that need to be processed simultaneously, it is necessary to define the process model with respect to the manufacturing context of the process. However, due to the large number of products and operations usually encountered in semiconductor manufacturing [31] it is usually not feasible to define a different model for each possible combination of context variables. This research is concerned with model identification for run-to-run control, that is, identifying the model structure that best explains the variability - process offsets or shifts and autocorrelation- in the data with fewest possible explanatory variables. The context of a process will be defined as a set of categorical variables. The specific objective of this topic is to develop a context-based model identification approach that uses statistical linear models and stepwise regression methods to identify the categorical context variables that best explain the autocorrelation and the offsets in the process data.

25 On-line Robust Parameter Design Robust Parameter Design (RPD), a methodology introduced by Taguchi [81], has been widely employed for designing products and processes that are robust against environmental effects. The method consists of determining the levels of controllable factors that make the process least sensitive (or most robust) to the variability transmitted by noise variables. In traditional RPD methods, such as the dual response surface approach (see e.g., Myers et. al., [64]), the robust settings of the controllable factors are computed off-line (i.e. before production starts) and once fixed, the control factor settings are not altered during production. By contrast, automatic process control methods used in production adjust the controllable factors continuously to minimize the variability created by uncontrollable variables, which are not explicitly modeled. In many manufacturing processes, however, the noise factors can be measured during production. This implies that it is possible to use the additional noise factor information in order to continuously update the choice of the controllable factors as production takes place. Furthermore, if the noise factors have strong autocorrelation, then the off-line robust settings (which typically assume i.i.d noise factors) will be ineffective in achieving the desired levels of process variability. It is necessary to recompute the process settings on-line, as production takes place. The specific objective in this topic is to develop Bayesian methods to on-line robust parameter design that incorporate measurements of the noise factors. A Bayesian model accounts for uncertainty in the parameters of the model, therefore, the method is aimed at providing solutions that are robust against both the on-line variability of the uncontrollable noise factors and the uncertainty in the model parameters.

26 9 1.2 Dissertation Outline The remaining chapters of this dissertation are organized as follows. In Chapter 2, the literature relevant to the topics in this dissertation is reviewed. Time Series process control methods and Bayesian methods of inference are reviewed. This is followed by a review of methods for closed-loop system identification (identification of transfer function models, delay estimation and change-point detection), run-to-run control and model variable selection, and robust parameter design. In Chapter 3, a new method for the identification of transfer function models under closed-loop operation is presented. The method is based on constrained non-linear least squares estimation and does not rely on a dither signal. In an effort to improve the parameter estimates obtained from closed-loop data, it incorporates prior knowledge about the process into the estimation procedure as constraints on the parameters. We discuss some forms of prior process knowledge that are commonly available in practice and how they can be represented as constraints. The proposed approach is further compared against a dither signal identification method by using real process data. In Chapter 4, three new methods are proposed to estimate the input-output delay under closed-loop operating conditions. The methods are based on Bayesian, Sequential Probability Ratio Test (SPRT) and Cumulative Sum (CUSUM) change-point detection techniques. The delay is estimated from a step-response test where the target of the process is changed in a step fashion and the output change is measured by a change-point detection method. In Chapter 5, a new model identification method is proposed for use in run-to-run

27 10 control. The method represents the offsets and the autocorrelation in the disturbances as a function of different variables that define the context under which the observations were taken. A fixed effects analysis of variance (ANOVA) model is used to represent the offsets and an autoregressive (AR) time-series model is used to represent the autocorrelations and drift. A simulation example, where the true process model is known, and an application to a real semiconductor lithography process are presented to illustrate the proposed approach. Chapter 6 presents new Bayesian approaches for on-line robust parameter design. Both single and multiple response processes are considered. A Bayesian linear regression model is used to describe the input-output relationship of the process and a vector ARMA time series model is used to model the noise factors. Optimization models are formulated and solved in closed-form to provide an on-line solution to the robust parameter design problem. The proposed approaches are illustrated and compared against existing RPD methods (certainty equivalence and dual response control methods). A semiconductor plasma etch process example is presented to illustrate the proposed multiple response robust control approach. Chapter 7 presents a summary of the research contributions and comments on possible directions for future research.

28 Chapter 2 Literature Review The literature on time series control methods is so extensive that only the themes directly related to this dissertation will be reviewed in this chapter. An overview of Time Series Control methods, and in particular, Bayesian methods of inference for time series models is given in Section 2.1. We review in Section 2.2 the main results existing in the System identification literature, which includes results about the closed-loop identification of transfer function models of the kind discussed in Chapter 3 of this thesis. Existing methods for estimating the input-output delay are also discussed in this section, as this problem is studied in Chapter 4. The run-to-run control problem as it occurs in semiconductor manufacturing is discussed in Section 2.3. Modeling approaches that have been proposed to consider the context of each batch of measurements are discussed. Our approach for identification of contextbased run-to-run models presented in Chapter 5 is based on variable selection methods for Analysis of Variance (ANOVA) models, so these methods are reviewed also in Section 2.3. Finally, we review Robust Parameter Design (RPD) methods in Section 2.4 and include a 11

29 review of recent papers that have been proposed to utilize on-line noise factor information for better control, a problem we study in Chapter Time Series Control and Bayesian Inference Time series analysis In this dissertation Box-Jenkins (BJ) transfer function models [13] will be employed to describe dynamic-stochastic processes. Consider the model y t = G(B)u t k + v t (2.1) where y t is the output (the response variable), u t is the input (the controllable factor) and k is the input-output process delay. Observations are assumed to be obtained (sampled) at equidistant points in time t. The delay is the number of whole time periods that elapses between a change is made in the control input and its effect is observed at the process output. The process transfer function G(B) is assumed to be a linear polynomial in the back shift operator B (BY t = Y t 1 ) and it describes the dynamic (transient) relation between the output and the input. The disturbance v t represents the effect of all sources of variation at the output. The disturbance is unobservable (i.e. not measured) during the operation of the process. In a BJ model, the transfer function is represented as a ratio of two polynomials: G(B) = b(b) a(b) (2.2) and the process disturbance is represented using autoregressive-integrated-moving average

30 13 (ARIMA) time series models. An ARIMA model with drift is given as [13]: v t = δ (1 B) + θ(b) d φ(b)(1 B) ɛ d t (2.3) where {ɛ t } is a Gaussian white noise process, that is, a sequence of an independently and identically distributed (i.i.d) normal random variables with mean 0 and variance σ 2. The parameter d 0 is the degree of integration of the disturbance and δ is a drift constant useful in modelling possible trends in the process mean, such as tool wear which frequently occurs in machining operations. Here θ(b) and φ(b) are polynomials in B with all their roots outside the unit circle. When d 1 the disturbance is said to be non-stationary. For continuous-flow processes with short sampling intervals, the input-output dynamics are significant (i.e. the output depends on the current and past values of itself and those of the input) and a transfer function of the form (2.2) must be used. For discrete-part and run-to-run batch production processes, however, the input-output dynamics are not significant and a static, or pure-gain, transfer function model of the form y t = gu t k + v t. (2.4) usually provides sufficient accuracy [4]. In this model g is the constant gain of the process and it is assumed that all the effect g of a change in the input is completely observed at the output after k time intervals (i.e. there are no transient dynamics). A pure-gain transfer function plus an IMA(1,1) disturbance model v t is widely employed in the control of discrete-part manufacturing processes [16]. A linear regression equation such as y t = β 0 + β 1 u 1,t β r u r,t + v t

31 14 also models only the static (steady-state) response of the process and it is equivalent to a pure-gain transfer function model with r control inputs and a single output. Note that in this model it is implicitly assumed that the input-output delay is one period. Process control A process is said to be under closed-loop control, or to operate in closed-loop, when it operates under the action of a feedback controller. Feedback control can be used to reduce the output variability caused by the unobservable disturbance. A feedback controller determines the control input level as a function of the current and past observations of the output: u t = C(B)(y t T ) (2.5) where T is the target, or set-point, of the process and C(B) is the controller polynomial. In manufacturing processes there are often disturbances that are measured (i.e. observable) during the operation. Observable but uncontrollable factors are referred to as noise factors in the process optimization literature (see Section 2.4). In contrast, feedforward control can be used in reducing the influence of the measured uncontrollable variables on the process variability ([6], pg. 190). For example, consider the process model y t = βu t + γz t + v t where u t is a controllable factor, z t is a noise factor and β and γ are known parameters. Let ẑ t denote the one-step ahead forecast of z t computed from on-line measurements of the noise factor. Then the feedforward control action that will make the expected response to be on target is u t = (T γẑ t )/β.

32 15 Bayesian Inference in time series and regression models In Bayesian models, any unknown quantities, including parameters, are modeled as random variables. Inferences about the parameters are based on the posterior distribution of the parameters, found by applying Bayes rule: p(β y) p(y β)p(β) (2.6) where β is the vector of parameters, p(y β) is the likelihood function of the observed output data y and p(β) is the prior distribution which reflects the knowledge the user has about the parameters before observing the data. Prior distributions that are uniform across all values of the unknown parameter are used to reflect very little prior information about the parameter and are called vague, diffuse or noninformative. Conversely, prior distributions that are not uniform across all values of the parameter are used to express the prior belief that some values of the parameter are more likely than other values and are called informative. The Bayesian method also provides a natural way of making inferences on future values of the response using the posterior predictive distribution. If ỹ is a future value of the response, it is possible to make inferences on ỹ from its posterior distribution given by (Gelman et. al. [34]) p(ỹ y) = p(ỹ β, y)p(β y)dβ. β It is noted that the posterior predictive distribution of the response accounts for the uncertainty in the true value of the parameters β because it assumes that β is a random variable and evaluates its posterior distribution using Bayes rule (2.6).

33 16 In this dissertation, Bayesian time series and regression models will be employed. Bayesian analysis of time series models were studied by Marriott et. al. [53] and Monahan [61]. The method discussed by Monahan [61] uses numerical integration to compute the normalization constant of the posterior distribution of the ARMA parameters. However, he considered only low order ARMA models. Marriott et. al. [53] presented a more general approach that works for any order ARMA models. They used Markov chain Monte Carlo (MCMC) methods to simulate the posterior distribution of the ARMA parameters. In order to account for the stationarity and invertibility constraints in the MCMC sampling procedure, a linear transformation was applied on the ARMA parameters, so that the transformed variables are unconstrained and hence can be more easily sampled. Appendix 6D reviews this method in detail. Press [73] studied Bayesian univariate and multivariate regression models. He showed that the posterior predictive density for a diffuse prior on the parameters is a t distribution in the case of single response, and a multivariate t distribution in the case of multiple responses. 2.2 Closed-loop System Identification Methods In Chapter 3 of this thesis, we present a new method for the identification of Box-Jenkins transfer function models in closed-loop conditions that utilizes prior knowledge about the process. For our purposes, the objective of system identification is to obtain a transfer function (TF) for the process dynamics and an ARIMA model for the disturbance from operating data. Identification methods in both open-loop (i.e. when no feedback control is

34 17 (a) Open-loop operation (b) Closed-loop operation ε t ε t Disturbance ARIMA model Disturbance ARIMA model u t Σ y t -T Σ u t Σ y t Process TF model Controller Process TF model Figure 2.1: Open-loop and Closed-loop operation of a process exercised) and in closed-loop operation have been studied. Figure 2.1 illustrates these two modes of operation. In this section we review methods for closed-loop identification of transfer function models, methods for delay estimation and methods for change-point detection. Identification of transfer function models Parameter Identifiability and Persistency of Excitation Conditions A process is said to be parameter identifiable if the estimators of its model parameters converge to the true values as the sample size tends to infinity (i.e. the estimators are consistent) [45]. The parameter identifiability condition therefore guarantees that all parameters of a process can be uniquely determined from data. The input signal u t that satisfies this requirement is said to be persistently exciting the process ([7], pg. 73). It has been shown that parameter identifiability in closed-loop can be achieved in several ways: (i) By applying an independent signal in the controller, such as, by using a time varying target or set-point T or by adding a random dither signal that is persistently exciting of sufficient order (Söderström et. al., [80]); (ii) By switching between different

35 18 linear controllers (Ljung et. al., [45]); and (iii) By having a complex enough controller in the feedback loop (Söderström et. al., [79]). The persistency of excitation condition on the input is related to the multi-collinearity problem in regression. This will be illustrated in the following example. Example. Persistency of Excitation: This example, taken from Åström and Wittenmark ([6], pg. 82), considers the true process description y t = a 1 y t 1 + b 1 u t 1 + ɛ t. Assuming a model of the correct structure, the input-output data {y 1, u 1,..., y N, u N } can be written in the matrix form as y 2 y 3. y N }{{} y = y 1 u 1 y 2 u 2. y N 1. u N 1 } {{ } X a 1 + b 1 }{{} β ɛ 2 ɛ 3. ɛ N. }{{} ɛ The least squares estimate of the parameter vector is ˆβ = (X X) 1 X y. The parameters β of the true process can be uniquely estimated only when the matrix X X is invertible, that is, when X is of full column rank. If this condition is satisfied than the input signal u t is persistently exciting (of order 2 = dim(β) in this case) and the process is parameter identifiable.

36 19 Suppose a proportional controller u t = c 1 y t is used instead. It can be seen that X = y 1 c 1 y 1 y 2 c 1 y 2.. y N 1 c 1 y N 1 and the columns of X are linearly dependent hence the process is not parameter identifiable. However, if a control input with an added dither signal d t is used, that is, if u t = c 1 y t + d t, then the columns of X are linearly independent and parameter identifiability is restored. It can also be seen that if a higher order controller u t = (c 1 + c 2 B)y t is used then X = y 1 c 1 y 1 + c 2 y 0 y 2 c 1 y 2 + c 2 y 1.. y N 1 c 1 y N 1 + c 2 y N 2 and the columns of X are linearly independent hence parameter identifiability is also restored. Closed-Loop Identification of Box-Jenkins Models: Closed-loop identification with Box-Jenkins transfer function models has been studied by various authors. Box and MacGregor [19] applied the results by Söderström et. al. [80] to processes under minimum mean squared error (MMSE) control. It was assumed that the true input-output delay and model orders were known. They showed that the precision of the parameter estimates can be improved by the addition of a dither signal. Luceño [50] investigated the case of switching between multiple PI controllers during the operation of

37 20 the process and the effects this has on the quality of parameter estimates, but also assumed that the correct model orders and the delay were known. Use of Prior Knowledge in Closed-Loop Identification: The idea of applying prior process knowledge in closed-loop identification to improve parameter estimates was discussed by Box and MacGregor [18] for the cases of prior knowledge on the transfer function dynamics and on the disturbance model. MacGregor and Fogal [51] investigated the benefits of knowing the disturbance model by applying disturbance model pre-filters in identification. Ljung and Forssell [47] showed that complete knowledge of the true disturbance model is sufficient for parameter identifiability. Pan and Del Castillo [68] considered the stationarity and invertibility conditions as a form of prior knowledge and showed by example that closed-loop identification is possible by utilizing these conditions. While there have been several studies on utilizing prior process knowledge in process identification, no systematic study that addresses the relative effectiveness of different forms of prior knowledge was provided. Delay Estimation and Change-point Detection Methods Chapter 4 of this dissertation discusses new methods for the estimation of input-output delay of dynamic processes under closed-loop operating condition. The presented methods use Time Series change-point detection methods. As an illustration of the input-output delay of a transfer function model, consider Figure 2.2. Here, the input u t is increased in a step fashion, the output y t is the response of the process to this input and the input-output delay k is the number of whole time periods

38 21 u t (Input) t y t (Output) t k Figure 2.2: Input-output delay of a dynamic process that elapses between making the change in the input and observing its effect at the output. As it can be seen, due to the dynamics of the process, the output reaches its steady state level exponentially after several time periods rather than in a step fashion after one time period. As it will be discussed in Chapter 4, the process dynamics must be taken into consideration when using a change-point detection method to estimate the delay. Accurate knowledge of the process delay is important in control design and in closedloop process identification. Many optimal control design methods, such as minimum MSE control [6], are very sensitive to the knowledge of the process delay and the closed-loop process under a designed controller may become unstable if the model delay does not coincide with the process delay [44]. For the design of minimum MSE controllers, Bohlin [11] shows that the parameters of such controllers can be uniquely estimated from closedloop data as long as the process delay is known. The knowledge of the delay is also important for the control performance indices that are used to compare the performance of

39 22 a control loop to its best achievable performance and to decide whether it is warranted to re-tune a controller. In the controller performance indices proposed by Harris et. al. [38] and by Horch and Isaksson [40] the delay is the only process information that is required to be known. Delay estimation has been extensively studied in the control systems literature. Björklund and Ljung [10] give a comparison study of some of the recent open-loop delay estimation approaches. Kurz and Goedecke [44] develops a recursive least squares process identification approach for processes with time varying delays. Isaksson, Horch and Dumont [41] propose a closed-loop delay estimation method that uses a Laguerre transfer function model and step response tests. Transfer function models that use Laguerre functions have received considerable attention in the system identification literature. In a Laguerre transfer function model, G(B) is defined as a polynomial in Laguerre filters (which are obtained by taking the Z transform of the Laguerre functions) rather than a polynomial in B. Laguerre functions, through their orthonormality properties, have the advantage over using simple polynomials in B of more accurately approximating transient process behavior and hence are suitable in estimating the delay in a step response test [84]. Delay Estimation Using Laguerre Transfer Function Models The delay estimation method presented by Isaksson et. al. [41] can be summarized as follows. The estimated Laguerre transfer function is first written in the frequency domain and it is factorized into a Minimum Phase part and an All Phase part as G(e iωt s ) = G mp (e iωt s )G ap (e iωt s )

TIME SERIES ANALYSIS. Forecasting and Control. Wiley. Fifth Edition GWILYM M. JENKINS GEORGE E. P. BOX GREGORY C. REINSEL GRETA M.

TIME SERIES ANALYSIS. Forecasting and Control. Wiley. Fifth Edition GWILYM M. JENKINS GEORGE E. P. BOX GREGORY C. REINSEL GRETA M. TIME SERIES ANALYSIS Forecasting and Control Fifth Edition GEORGE E. P. BOX GWILYM M. JENKINS GREGORY C. REINSEL GRETA M. LJUNG Wiley CONTENTS PREFACE TO THE FIFTH EDITION PREFACE TO THE FOURTH EDITION

More information

Elements of Multivariate Time Series Analysis

Elements of Multivariate Time Series Analysis Gregory C. Reinsel Elements of Multivariate Time Series Analysis Second Edition With 14 Figures Springer Contents Preface to the Second Edition Preface to the First Edition vii ix 1. Vector Time Series

More information

Closed-Loop Disturbance Identification and Controller Tuning for Discrete Manufacturing Processes. Overview

Closed-Loop Disturbance Identification and Controller Tuning for Discrete Manufacturing Processes. Overview Closed-Loop Disturbance Identification and Controller Tuning for Discrete Manufacturing Processes Enrique del Castillo Department of Industrial & Manufacturing Engineering The Pennsylvania State University

More information

Time Series Analysis. James D. Hamilton PRINCETON UNIVERSITY PRESS PRINCETON, NEW JERSEY

Time Series Analysis. James D. Hamilton PRINCETON UNIVERSITY PRESS PRINCETON, NEW JERSEY Time Series Analysis James D. Hamilton PRINCETON UNIVERSITY PRESS PRINCETON, NEW JERSEY & Contents PREFACE xiii 1 1.1. 1.2. Difference Equations First-Order Difference Equations 1 /?th-order Difference

More information

Time Series Analysis. James D. Hamilton PRINCETON UNIVERSITY PRESS PRINCETON, NEW JERSEY

Time Series Analysis. James D. Hamilton PRINCETON UNIVERSITY PRESS PRINCETON, NEW JERSEY Time Series Analysis James D. Hamilton PRINCETON UNIVERSITY PRESS PRINCETON, NEW JERSEY PREFACE xiii 1 Difference Equations 1.1. First-Order Difference Equations 1 1.2. pth-order Difference Equations 7

More information

Time Series: Theory and Methods

Time Series: Theory and Methods Peter J. Brockwell Richard A. Davis Time Series: Theory and Methods Second Edition With 124 Illustrations Springer Contents Preface to the Second Edition Preface to the First Edition vn ix CHAPTER 1 Stationary

More information

G. S. Maddala Kajal Lahiri. WILEY A John Wiley and Sons, Ltd., Publication

G. S. Maddala Kajal Lahiri. WILEY A John Wiley and Sons, Ltd., Publication G. S. Maddala Kajal Lahiri WILEY A John Wiley and Sons, Ltd., Publication TEMT Foreword Preface to the Fourth Edition xvii xix Part I Introduction and the Linear Regression Model 1 CHAPTER 1 What is Econometrics?

More information

Statistical Methods for Forecasting

Statistical Methods for Forecasting Statistical Methods for Forecasting BOVAS ABRAHAM University of Waterloo JOHANNES LEDOLTER University of Iowa John Wiley & Sons New York Chichester Brisbane Toronto Singapore Contents 1 INTRODUCTION AND

More information

1 An Overview and Brief History of Feedback Control 1. 2 Dynamic Models 23. Contents. Preface. xiii

1 An Overview and Brief History of Feedback Control 1. 2 Dynamic Models 23. Contents. Preface. xiii Contents 1 An Overview and Brief History of Feedback Control 1 A Perspective on Feedback Control 1 Chapter Overview 2 1.1 A Simple Feedback System 3 1.2 A First Analysis of Feedback 6 1.3 Feedback System

More information

Contents. Preface to Second Edition Preface to First Edition Abbreviations PART I PRINCIPLES OF STATISTICAL THINKING AND ANALYSIS 1

Contents. Preface to Second Edition Preface to First Edition Abbreviations PART I PRINCIPLES OF STATISTICAL THINKING AND ANALYSIS 1 Contents Preface to Second Edition Preface to First Edition Abbreviations xv xvii xix PART I PRINCIPLES OF STATISTICAL THINKING AND ANALYSIS 1 1 The Role of Statistical Methods in Modern Industry and Services

More information

Univariate ARIMA Models

Univariate ARIMA Models Univariate ARIMA Models ARIMA Model Building Steps: Identification: Using graphs, statistics, ACFs and PACFs, transformations, etc. to achieve stationary and tentatively identify patterns and model components.

More information

CONTROL SYSTEMS, ROBOTICS, AND AUTOMATION - Vol. V - Prediction Error Methods - Torsten Söderström

CONTROL SYSTEMS, ROBOTICS, AND AUTOMATION - Vol. V - Prediction Error Methods - Torsten Söderström PREDICTIO ERROR METHODS Torsten Söderström Department of Systems and Control, Information Technology, Uppsala University, Uppsala, Sweden Keywords: prediction error method, optimal prediction, identifiability,

More information

Pattern Recognition and Machine Learning

Pattern Recognition and Machine Learning Christopher M. Bishop Pattern Recognition and Machine Learning ÖSpri inger Contents Preface Mathematical notation Contents vii xi xiii 1 Introduction 1 1.1 Example: Polynomial Curve Fitting 4 1.2 Probability

More information

THE DETECTION OF SHIFTS IN AUTOCORRELATED PROCESSES WITH MR AND EWMA CHARTS

THE DETECTION OF SHIFTS IN AUTOCORRELATED PROCESSES WITH MR AND EWMA CHARTS THE DETECTION OF SHIFTS IN AUTOCORRELATED PROCESSES WITH MR AND EWMA CHARTS Karin Kandananond, kandananond@hotmail.com Faculty of Industrial Technology, Rajabhat University Valaya-Alongkorn, Prathumthani,

More information

Stochastic Models, Estimation and Control Peter S. Maybeck Volumes 1, 2 & 3 Tables of Contents

Stochastic Models, Estimation and Control Peter S. Maybeck Volumes 1, 2 & 3 Tables of Contents Navtech Part #s Volume 1 #1277 Volume 2 #1278 Volume 3 #1279 3 Volume Set #1280 Stochastic Models, Estimation and Control Peter S. Maybeck Volumes 1, 2 & 3 Tables of Contents Volume 1 Preface Contents

More information

Condensed Table of Contents for Introduction to Stochastic Search and Optimization: Estimation, Simulation, and Control by J. C.

Condensed Table of Contents for Introduction to Stochastic Search and Optimization: Estimation, Simulation, and Control by J. C. Condensed Table of Contents for Introduction to Stochastic Search and Optimization: Estimation, Simulation, and Control by J. C. Spall John Wiley and Sons, Inc., 2003 Preface... xiii 1. Stochastic Search

More information

Introduction to Eco n o m et rics

Introduction to Eco n o m et rics 2008 AGI-Information Management Consultants May be used for personal purporses only or by libraries associated to dandelon.com network. Introduction to Eco n o m et rics Third Edition G.S. Maddala Formerly

More information

ECONOMICS 7200 MODERN TIME SERIES ANALYSIS Econometric Theory and Applications

ECONOMICS 7200 MODERN TIME SERIES ANALYSIS Econometric Theory and Applications ECONOMICS 7200 MODERN TIME SERIES ANALYSIS Econometric Theory and Applications Yongmiao Hong Department of Economics & Department of Statistical Sciences Cornell University Spring 2019 Time and uncertainty

More information

at least 50 and preferably 100 observations should be available to build a proper model

at least 50 and preferably 100 observations should be available to build a proper model III Box-Jenkins Methods 1. Pros and Cons of ARIMA Forecasting a) need for data at least 50 and preferably 100 observations should be available to build a proper model used most frequently for hourly or

More information

Choosing the Summary Statistics and the Acceptance Rate in Approximate Bayesian Computation

Choosing the Summary Statistics and the Acceptance Rate in Approximate Bayesian Computation Choosing the Summary Statistics and the Acceptance Rate in Approximate Bayesian Computation COMPSTAT 2010 Revised version; August 13, 2010 Michael G.B. Blum 1 Laboratoire TIMC-IMAG, CNRS, UJF Grenoble

More information

Empirical Market Microstructure Analysis (EMMA)

Empirical Market Microstructure Analysis (EMMA) Empirical Market Microstructure Analysis (EMMA) Lecture 3: Statistical Building Blocks and Econometric Basics Prof. Dr. Michael Stein michael.stein@vwl.uni-freiburg.de Albert-Ludwigs-University of Freiburg

More information

A Course in Time Series Analysis

A Course in Time Series Analysis A Course in Time Series Analysis Edited by DANIEL PENA Universidad Carlos III de Madrid GEORGE C. TIAO University of Chicago RUEY S. TSAY University of Chicago A Wiley-Interscience Publication JOHN WILEY

More information

Exercises - Time series analysis

Exercises - Time series analysis Descriptive analysis of a time series (1) Estimate the trend of the series of gasoline consumption in Spain using a straight line in the period from 1945 to 1995 and generate forecasts for 24 months. Compare

More information

HANDBOOK OF APPLICABLE MATHEMATICS

HANDBOOK OF APPLICABLE MATHEMATICS HANDBOOK OF APPLICABLE MATHEMATICS Chief Editor: Walter Ledermann Volume VI: Statistics PART A Edited by Emlyn Lloyd University of Lancaster A Wiley-Interscience Publication JOHN WILEY & SONS Chichester

More information

VALIDATION OF A DC-DC BOOST CIRCUIT MODEL AND CONTROL ALGORITHM

VALIDATION OF A DC-DC BOOST CIRCUIT MODEL AND CONTROL ALGORITHM VALIDATION OF A DC-DC BOOST CIRCUIT MODEL AND CONTROL ALGORITHM Dissertation Submitted to The School of Engineering of the UNIVERSITY OF DAYTON In Partial Fulfillment of the Requirements for The Degree

More information

7. Forecasting with ARIMA models

7. Forecasting with ARIMA models 7. Forecasting with ARIMA models 309 Outline: Introduction The prediction equation of an ARIMA model Interpreting the predictions Variance of the predictions Forecast updating Measuring predictability

More information

Contents. Part I: Fundamentals of Bayesian Inference 1

Contents. Part I: Fundamentals of Bayesian Inference 1 Contents Preface xiii Part I: Fundamentals of Bayesian Inference 1 1 Probability and inference 3 1.1 The three steps of Bayesian data analysis 3 1.2 General notation for statistical inference 4 1.3 Bayesian

More information

DESIGN AND ANALYSIS OF EXPERIMENTS Third Edition

DESIGN AND ANALYSIS OF EXPERIMENTS Third Edition DESIGN AND ANALYSIS OF EXPERIMENTS Third Edition Douglas C. Montgomery ARIZONA STATE UNIVERSITY JOHN WILEY & SONS New York Chichester Brisbane Toronto Singapore Contents Chapter 1. Introduction 1-1 What

More information

Linear Models 1. Isfahan University of Technology Fall Semester, 2014

Linear Models 1. Isfahan University of Technology Fall Semester, 2014 Linear Models 1 Isfahan University of Technology Fall Semester, 2014 References: [1] G. A. F., Seber and A. J. Lee (2003). Linear Regression Analysis (2nd ed.). Hoboken, NJ: Wiley. [2] A. C. Rencher and

More information

Introduction to. Process Control. Ahmet Palazoglu. Second Edition. Jose A. Romagnoli. CRC Press. Taylor & Francis Group. Taylor & Francis Group,

Introduction to. Process Control. Ahmet Palazoglu. Second Edition. Jose A. Romagnoli. CRC Press. Taylor & Francis Group. Taylor & Francis Group, Introduction to Process Control Second Edition Jose A. Romagnoli Ahmet Palazoglu CRC Press Taylor & Francis Group Boca Raton London NewYork CRC Press is an imprint of the Taylor & Francis Group, an informa

More information

Experimental designs for multiple responses with different models

Experimental designs for multiple responses with different models Graduate Theses and Dissertations Graduate College 2015 Experimental designs for multiple responses with different models Wilmina Mary Marget Iowa State University Follow this and additional works at:

More information

MODELING INFLATION RATES IN NIGERIA: BOX-JENKINS APPROACH. I. U. Moffat and A. E. David Department of Mathematics & Statistics, University of Uyo, Uyo

MODELING INFLATION RATES IN NIGERIA: BOX-JENKINS APPROACH. I. U. Moffat and A. E. David Department of Mathematics & Statistics, University of Uyo, Uyo Vol.4, No.2, pp.2-27, April 216 MODELING INFLATION RATES IN NIGERIA: BOX-JENKINS APPROACH I. U. Moffat and A. E. David Department of Mathematics & Statistics, University of Uyo, Uyo ABSTRACT: This study

More information

Time Series I Time Domain Methods

Time Series I Time Domain Methods Astrostatistics Summer School Penn State University University Park, PA 16802 May 21, 2007 Overview Filtering and the Likelihood Function Time series is the study of data consisting of a sequence of DEPENDENT

More information

MCMC algorithms for fitting Bayesian models

MCMC algorithms for fitting Bayesian models MCMC algorithms for fitting Bayesian models p. 1/1 MCMC algorithms for fitting Bayesian models Sudipto Banerjee sudiptob@biostat.umn.edu University of Minnesota MCMC algorithms for fitting Bayesian models

More information

Lecture 2: Univariate Time Series

Lecture 2: Univariate Time Series Lecture 2: Univariate Time Series Analysis: Conditional and Unconditional Densities, Stationarity, ARMA Processes Prof. Massimo Guidolin 20192 Financial Econometrics Spring/Winter 2017 Overview Motivation:

More information

Response Surface Methodology

Response Surface Methodology Response Surface Methodology Process and Product Optimization Using Designed Experiments Second Edition RAYMOND H. MYERS Virginia Polytechnic Institute and State University DOUGLAS C. MONTGOMERY Arizona

More information

INTRODUCTION TO PATTERN RECOGNITION

INTRODUCTION TO PATTERN RECOGNITION INTRODUCTION TO PATTERN RECOGNITION INSTRUCTOR: WEI DING 1 Pattern Recognition Automatic discovery of regularities in data through the use of computer algorithms With the use of these regularities to take

More information

A Guide to Modern Econometric:

A Guide to Modern Econometric: A Guide to Modern Econometric: 4th edition Marno Verbeek Rotterdam School of Management, Erasmus University, Rotterdam B 379887 )WILEY A John Wiley & Sons, Ltd., Publication Contents Preface xiii 1 Introduction

More information

interval forecasting

interval forecasting Interval Forecasting Based on Chapter 7 of the Time Series Forecasting by Chatfield Econometric Forecasting, January 2008 Outline 1 2 3 4 5 Terminology Interval Forecasts Density Forecast Fan Chart Most

More information

TIME SERIES ANALYSIS AND FORECASTING USING THE STATISTICAL MODEL ARIMA

TIME SERIES ANALYSIS AND FORECASTING USING THE STATISTICAL MODEL ARIMA CHAPTER 6 TIME SERIES ANALYSIS AND FORECASTING USING THE STATISTICAL MODEL ARIMA 6.1. Introduction A time series is a sequence of observations ordered in time. A basic assumption in the time series analysis

More information

Independent Component Analysis. Contents

Independent Component Analysis. Contents Contents Preface xvii 1 Introduction 1 1.1 Linear representation of multivariate data 1 1.1.1 The general statistical setting 1 1.1.2 Dimension reduction methods 2 1.1.3 Independence as a guiding principle

More information

17 : Markov Chain Monte Carlo

17 : Markov Chain Monte Carlo 10-708: Probabilistic Graphical Models, Spring 2015 17 : Markov Chain Monte Carlo Lecturer: Eric P. Xing Scribes: Heran Lin, Bin Deng, Yun Huang 1 Review of Monte Carlo Methods 1.1 Overview Monte Carlo

More information

Computational statistics

Computational statistics Computational statistics Markov Chain Monte Carlo methods Thierry Denœux March 2017 Thierry Denœux Computational statistics March 2017 1 / 71 Contents of this chapter When a target density f can be evaluated

More information

Ross Bettinger, Analytical Consultant, Seattle, WA

Ross Bettinger, Analytical Consultant, Seattle, WA ABSTRACT DYNAMIC REGRESSION IN ARIMA MODELING Ross Bettinger, Analytical Consultant, Seattle, WA Box-Jenkins time series models that contain exogenous predictor variables are called dynamic regression

More information

COPYRIGHTED MATERIAL CONTENTS. Preface Preface to the First Edition

COPYRIGHTED MATERIAL CONTENTS. Preface Preface to the First Edition Preface Preface to the First Edition xi xiii 1 Basic Probability Theory 1 1.1 Introduction 1 1.2 Sample Spaces and Events 3 1.3 The Axioms of Probability 7 1.4 Finite Sample Spaces and Combinatorics 15

More information

Linear Regression Models

Linear Regression Models Linear Regression Models Model Description and Model Parameters Modelling is a central theme in these notes. The idea is to develop and continuously improve a library of predictive models for hazards,

More information

CBE495 LECTURE IV MODEL PREDICTIVE CONTROL

CBE495 LECTURE IV MODEL PREDICTIVE CONTROL What is Model Predictive Control (MPC)? CBE495 LECTURE IV MODEL PREDICTIVE CONTROL Professor Dae Ryook Yang Fall 2013 Dept. of Chemical and Biological Engineering Korea University * Some parts are from

More information

Expressions for the covariance matrix of covariance data

Expressions for the covariance matrix of covariance data Expressions for the covariance matrix of covariance data Torsten Söderström Division of Systems and Control, Department of Information Technology, Uppsala University, P O Box 337, SE-7505 Uppsala, Sweden

More information

Modelling Under Risk and Uncertainty

Modelling Under Risk and Uncertainty Modelling Under Risk and Uncertainty An Introduction to Statistical, Phenomenological and Computational Methods Etienne de Rocquigny Ecole Centrale Paris, Universite Paris-Saclay, France WILEY A John Wiley

More information

On Input Design for System Identification

On Input Design for System Identification On Input Design for System Identification Input Design Using Markov Chains CHIARA BRIGHENTI Masters Degree Project Stockholm, Sweden March 2009 XR-EE-RT 2009:002 Abstract When system identification methods

More information

Bayesian Inference: Probit and Linear Probability Models

Bayesian Inference: Probit and Linear Probability Models Utah State University DigitalCommons@USU All Graduate Plan B and other Reports Graduate Studies 5-1-2014 Bayesian Inference: Probit and Linear Probability Models Nate Rex Reasch Utah State University Follow

More information

Statistics 349(02) Review Questions

Statistics 349(02) Review Questions Statistics 349(0) Review Questions I. Suppose that for N = 80 observations on the time series { : t T} the following statistics were calculated: _ x = 10.54 C(0) = 4.99 In addition the sample autocorrelation

More information

5 Transfer function modelling

5 Transfer function modelling MSc Further Time Series Analysis 5 Transfer function modelling 5.1 The model Consider the construction of a model for a time series (Y t ) whose values are influenced by the earlier values of a series

More information

Bayesian Regression Linear and Logistic Regression

Bayesian Regression Linear and Logistic Regression When we want more than point estimates Bayesian Regression Linear and Logistic Regression Nicole Beckage Ordinary Least Squares Regression and Lasso Regression return only point estimates But what if we

More information

Stat 5101 Lecture Notes

Stat 5101 Lecture Notes Stat 5101 Lecture Notes Charles J. Geyer Copyright 1998, 1999, 2000, 2001 by Charles J. Geyer May 7, 2001 ii Stat 5101 (Geyer) Course Notes Contents 1 Random Variables and Change of Variables 1 1.1 Random

More information

ARIMA Models. Jamie Monogan. January 25, University of Georgia. Jamie Monogan (UGA) ARIMA Models January 25, / 38

ARIMA Models. Jamie Monogan. January 25, University of Georgia. Jamie Monogan (UGA) ARIMA Models January 25, / 38 ARIMA Models Jamie Monogan University of Georgia January 25, 2012 Jamie Monogan (UGA) ARIMA Models January 25, 2012 1 / 38 Objectives By the end of this meeting, participants should be able to: Describe

More information

3 Theory of stationary random processes

3 Theory of stationary random processes 3 Theory of stationary random processes 3.1 Linear filters and the General linear process A filter is a transformation of one random sequence {U t } into another, {Y t }. A linear filter is a transformation

More information

Development of Stochastic Artificial Neural Networks for Hydrological Prediction

Development of Stochastic Artificial Neural Networks for Hydrological Prediction Development of Stochastic Artificial Neural Networks for Hydrological Prediction G. B. Kingston, M. F. Lambert and H. R. Maier Centre for Applied Modelling in Water Engineering, School of Civil and Environmental

More information

Alexander Scheinker Miroslav Krstić. Model-Free Stabilization by Extremum Seeking

Alexander Scheinker Miroslav Krstić. Model-Free Stabilization by Extremum Seeking Alexander Scheinker Miroslav Krstić Model-Free Stabilization by Extremum Seeking 123 Preface Originating in 1922, in its 95-year history, extremum seeking has served as a tool for model-free real-time

More information

Ronald Christensen. University of New Mexico. Albuquerque, New Mexico. Wesley Johnson. University of California, Irvine. Irvine, California

Ronald Christensen. University of New Mexico. Albuquerque, New Mexico. Wesley Johnson. University of California, Irvine. Irvine, California Texts in Statistical Science Bayesian Ideas and Data Analysis An Introduction for Scientists and Statisticians Ronald Christensen University of New Mexico Albuquerque, New Mexico Wesley Johnson University

More information

Design and Performance Analysis of the Exponentially Weighted Moving Average Mean Estimate for Processes Subject to Random Step Changes

Design and Performance Analysis of the Exponentially Weighted Moving Average Mean Estimate for Processes Subject to Random Step Changes Design and Performance Analysis of the Exponentially Weighted Moving Average Mean Estimate for Processes Subject to Random Step Changes Argon Chen Graduate Institute of Industrial Engineering National

More information

Karl-Rudolf Koch Introduction to Bayesian Statistics Second Edition

Karl-Rudolf Koch Introduction to Bayesian Statistics Second Edition Karl-Rudolf Koch Introduction to Bayesian Statistics Second Edition Karl-Rudolf Koch Introduction to Bayesian Statistics Second, updated and enlarged Edition With 17 Figures Professor Dr.-Ing., Dr.-Ing.

More information

Bayesian inference. Fredrik Ronquist and Peter Beerli. October 3, 2007

Bayesian inference. Fredrik Ronquist and Peter Beerli. October 3, 2007 Bayesian inference Fredrik Ronquist and Peter Beerli October 3, 2007 1 Introduction The last few decades has seen a growing interest in Bayesian inference, an alternative approach to statistical inference.

More information

Lessons in Estimation Theory for Signal Processing, Communications, and Control

Lessons in Estimation Theory for Signal Processing, Communications, and Control Lessons in Estimation Theory for Signal Processing, Communications, and Control Jerry M. Mendel Department of Electrical Engineering University of Southern California Los Angeles, California PRENTICE HALL

More information

ARIMA Modelling and Forecasting

ARIMA Modelling and Forecasting ARIMA Modelling and Forecasting Economic time series often appear nonstationary, because of trends, seasonal patterns, cycles, etc. However, the differences may appear stationary. Δx t x t x t 1 (first

More information

Adaptive Dual Control

Adaptive Dual Control Adaptive Dual Control Björn Wittenmark Department of Automatic Control, Lund Institute of Technology Box 118, S-221 00 Lund, Sweden email: bjorn@control.lth.se Keywords: Dual control, stochastic control,

More information

Ensemble Data Assimilation and Uncertainty Quantification

Ensemble Data Assimilation and Uncertainty Quantification Ensemble Data Assimilation and Uncertainty Quantification Jeff Anderson National Center for Atmospheric Research pg 1 What is Data Assimilation? Observations combined with a Model forecast + to produce

More information

STATISTICS-STAT (STAT)

STATISTICS-STAT (STAT) Statistics-STAT (STAT) 1 STATISTICS-STAT (STAT) Courses STAT 158 Introduction to R Programming Credit: 1 (1-0-0) Programming using the R Project for the Statistical Computing. Data objects, for loops,

More information

FORECASTING SUGARCANE PRODUCTION IN INDIA WITH ARIMA MODEL

FORECASTING SUGARCANE PRODUCTION IN INDIA WITH ARIMA MODEL FORECASTING SUGARCANE PRODUCTION IN INDIA WITH ARIMA MODEL B. N. MANDAL Abstract: Yearly sugarcane production data for the period of - to - of India were analyzed by time-series methods. Autocorrelation

More information

Bayesian Methods for Machine Learning

Bayesian Methods for Machine Learning Bayesian Methods for Machine Learning CS 584: Big Data Analytics Material adapted from Radford Neal s tutorial (http://ftp.cs.utoronto.ca/pub/radford/bayes-tut.pdf), Zoubin Ghahramni (http://hunch.net/~coms-4771/zoubin_ghahramani_bayesian_learning.pdf),

More information

A Data-Driven Model for Software Reliability Prediction

A Data-Driven Model for Software Reliability Prediction A Data-Driven Model for Software Reliability Prediction Author: Jung-Hua Lo IEEE International Conference on Granular Computing (2012) Young Taek Kim KAIST SE Lab. 9/4/2013 Contents Introduction Background

More information

ADAPTIVE FILTER THEORY

ADAPTIVE FILTER THEORY ADAPTIVE FILTER THEORY Fourth Edition Simon Haykin Communications Research Laboratory McMaster University Hamilton, Ontario, Canada Front ice Hall PRENTICE HALL Upper Saddle River, New Jersey 07458 Preface

More information

Statistical Methods. for Forecasting

Statistical Methods. for Forecasting Statistical Methods for Forecasting Statistical Methods for Forecasting BOVAS ABRAHAM JOHANNES LEDOLTER WILEY- INTERSCI ENCE A JOHN WILEY & SONS, INC., PUBLICA'TION Copyright 0 1983.2005 by John Wiley

More information

ARIMA Models. Jamie Monogan. January 16, University of Georgia. Jamie Monogan (UGA) ARIMA Models January 16, / 27

ARIMA Models. Jamie Monogan. January 16, University of Georgia. Jamie Monogan (UGA) ARIMA Models January 16, / 27 ARIMA Models Jamie Monogan University of Georgia January 16, 2018 Jamie Monogan (UGA) ARIMA Models January 16, 2018 1 / 27 Objectives By the end of this meeting, participants should be able to: Argue why

More information

Markov Chain Monte Carlo methods

Markov Chain Monte Carlo methods Markov Chain Monte Carlo methods By Oleg Makhnin 1 Introduction a b c M = d e f g h i 0 f(x)dx 1.1 Motivation 1.1.1 Just here Supresses numbering 1.1.2 After this 1.2 Literature 2 Method 2.1 New math As

More information

TAKEHOME FINAL EXAM e iω e 2iω e iω e 2iω

TAKEHOME FINAL EXAM e iω e 2iω e iω e 2iω ECO 513 Spring 2015 TAKEHOME FINAL EXAM (1) Suppose the univariate stochastic process y is ARMA(2,2) of the following form: y t = 1.6974y t 1.9604y t 2 + ε t 1.6628ε t 1 +.9216ε t 2, (1) where ε is i.i.d.

More information

From Practical Data Analysis with JMP, Second Edition. Full book available for purchase here. About This Book... xiii About The Author...

From Practical Data Analysis with JMP, Second Edition. Full book available for purchase here. About This Book... xiii About The Author... From Practical Data Analysis with JMP, Second Edition. Full book available for purchase here. Contents About This Book... xiii About The Author... xxiii Chapter 1 Getting Started: Data Analysis with JMP...

More information

Box-Jenkins ARIMA Advanced Time Series

Box-Jenkins ARIMA Advanced Time Series Box-Jenkins ARIMA Advanced Time Series www.realoptionsvaluation.com ROV Technical Papers Series: Volume 25 Theory In This Issue 1. Learn about Risk Simulator s ARIMA and Auto ARIMA modules. 2. Find out

More information

Sequential Monte Carlo Methods for Bayesian Computation

Sequential Monte Carlo Methods for Bayesian Computation Sequential Monte Carlo Methods for Bayesian Computation A. Doucet Kyoto Sept. 2012 A. Doucet (MLSS Sept. 2012) Sept. 2012 1 / 136 Motivating Example 1: Generic Bayesian Model Let X be a vector parameter

More information

A Perceptions Framework for Categorizing Inventory Policies in Single-stage Inventory Systems

A Perceptions Framework for Categorizing Inventory Policies in Single-stage Inventory Systems 07-036 A Perceptions Framework for Categorizing Inventory Policies in Single-stage Inventory Systems Noel H. Watson Copyright 2006, 2007, 2008 by Noel H. Watson Working papers are in draft form. This working

More information

MODULE -4 BAYEIAN LEARNING

MODULE -4 BAYEIAN LEARNING MODULE -4 BAYEIAN LEARNING CONTENT Introduction Bayes theorem Bayes theorem and concept learning Maximum likelihood and Least Squared Error Hypothesis Maximum likelihood Hypotheses for predicting probabilities

More information

Irr. Statistical Methods in Experimental Physics. 2nd Edition. Frederick James. World Scientific. CERN, Switzerland

Irr. Statistical Methods in Experimental Physics. 2nd Edition. Frederick James. World Scientific. CERN, Switzerland Frederick James CERN, Switzerland Statistical Methods in Experimental Physics 2nd Edition r i Irr 1- r ri Ibn World Scientific NEW JERSEY LONDON SINGAPORE BEIJING SHANGHAI HONG KONG TAIPEI CHENNAI CONTENTS

More information

CPSC 540: Machine Learning

CPSC 540: Machine Learning CPSC 540: Machine Learning MCMC and Non-Parametric Bayes Mark Schmidt University of British Columbia Winter 2016 Admin I went through project proposals: Some of you got a message on Piazza. No news is

More information

Modelling and forecasting of offshore wind power fluctuations with Markov-Switching models

Modelling and forecasting of offshore wind power fluctuations with Markov-Switching models Modelling and forecasting of offshore wind power fluctuations with Markov-Switching models 02433 - Hidden Markov Models Pierre-Julien Trombe, Martin Wæver Pedersen, Henrik Madsen Course week 10 MWP, compiled

More information

Markov Chain Monte Carlo

Markov Chain Monte Carlo Markov Chain Monte Carlo Recall: To compute the expectation E ( h(y ) ) we use the approximation E(h(Y )) 1 n n h(y ) t=1 with Y (1),..., Y (n) h(y). Thus our aim is to sample Y (1),..., Y (n) from f(y).

More information

Estimation for state space models: quasi-likelihood and asymptotic quasi-likelihood approaches

Estimation for state space models: quasi-likelihood and asymptotic quasi-likelihood approaches University of Wollongong Research Online University of Wollongong Thesis Collection 1954-2016 University of Wollongong Thesis Collections 2008 Estimation for state space models: quasi-likelihood and asymptotic

More information

Bayesian Linear Regression

Bayesian Linear Regression Bayesian Linear Regression Sudipto Banerjee 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. September 15, 2010 1 Linear regression models: a Bayesian perspective

More information

Modeling and Identification of Dynamic Systems (vimmd312, 2018)

Modeling and Identification of Dynamic Systems (vimmd312, 2018) Modeling and Identification of Dynamic Systems (vimmd312, 2018) Textbook background of the curriculum taken. In parenthesis: material not reviewed due to time shortage, but which is suggested to be read

More information

ST 740: Markov Chain Monte Carlo

ST 740: Markov Chain Monte Carlo ST 740: Markov Chain Monte Carlo Alyson Wilson Department of Statistics North Carolina State University October 14, 2012 A. Wilson (NCSU Stsatistics) MCMC October 14, 2012 1 / 20 Convergence Diagnostics:

More information

A HYBRID MODEL OF SARIMA AND ANFIS FOR MACAU AIR POLLUTION INDEX FORECASTING. Eason, Lei Kin Seng (M-A ) Supervisor: Dr.

A HYBRID MODEL OF SARIMA AND ANFIS FOR MACAU AIR POLLUTION INDEX FORECASTING. Eason, Lei Kin Seng (M-A ) Supervisor: Dr. A HYBRID MODEL OF SARIMA AND ANFIS FOR MACAU AIR POLLUTION INDEX FORECASTING THESIS DISSERATION By Eason, Lei Kin Seng (M-A7-6560-7) Supervisor: Dr. Wan Feng In Fulfillment of Requirements for the Degree

More information

FE570 Financial Markets and Trading. Stevens Institute of Technology

FE570 Financial Markets and Trading. Stevens Institute of Technology FE570 Financial Markets and Trading Lecture 5. Linear Time Series Analysis and Its Applications (Ref. Joel Hasbrouck - Empirical Market Microstructure ) Steve Yang Stevens Institute of Technology 9/25/2012

More information

Approximating Bayesian Posterior Means Using Multivariate Gaussian Quadrature

Approximating Bayesian Posterior Means Using Multivariate Gaussian Quadrature Approximating Bayesian Posterior Means Using Multivariate Gaussian Quadrature John A.L. Cranfield Paul V. Preckel Songquan Liu Presented at Western Agricultural Economics Association 1997 Annual Meeting

More information

Markov chain optimisation for energy systems (MC-ES)

Markov chain optimisation for energy systems (MC-ES) Markov chain optimisation for energy systems (MC-ES) John Moriarty Queen Mary University of London 19th August 2016 Approaches to the stochastic optimisation of power systems are not mature at research

More information

DATA-DRIVEN TECHNIQUES FOR ESTIMATION AND STOCHASTIC REDUCTION OF MULTI-SCALE SYSTEMS

DATA-DRIVEN TECHNIQUES FOR ESTIMATION AND STOCHASTIC REDUCTION OF MULTI-SCALE SYSTEMS DATA-DRIVEN TECHNIQUES FOR ESTIMATION AND STOCHASTIC REDUCTION OF MULTI-SCALE SYSTEMS A Dissertation Presented to the Faculty of the Department of Mathematics University of Houston In Partial Fulfillment

More information

Analysis and Synthesis of Single-Input Single-Output Control Systems

Analysis and Synthesis of Single-Input Single-Output Control Systems Lino Guzzella Analysis and Synthesis of Single-Input Single-Output Control Systems l+kja» \Uja>)W2(ja»\ um Contents 1 Definitions and Problem Formulations 1 1.1 Introduction 1 1.2 Definitions 1 1.2.1 Systems

More information

PART I INTRODUCTION The meaning of probability Basic definitions for frequentist statistics and Bayesian inference Bayesian inference Combinatorics

PART I INTRODUCTION The meaning of probability Basic definitions for frequentist statistics and Bayesian inference Bayesian inference Combinatorics Table of Preface page xi PART I INTRODUCTION 1 1 The meaning of probability 3 1.1 Classical definition of probability 3 1.2 Statistical definition of probability 9 1.3 Bayesian understanding of probability

More information

Basics: Definitions and Notation. Stationarity. A More Formal Definition

Basics: Definitions and Notation. Stationarity. A More Formal Definition Basics: Definitions and Notation A Univariate is a sequence of measurements of the same variable collected over (usually regular intervals of) time. Usual assumption in many time series techniques is that

More information

PATTERN CLASSIFICATION

PATTERN CLASSIFICATION PATTERN CLASSIFICATION Second Edition Richard O. Duda Peter E. Hart David G. Stork A Wiley-lnterscience Publication JOHN WILEY & SONS, INC. New York Chichester Weinheim Brisbane Singapore Toronto CONTENTS

More information

Contents 1 Introduction 2 Statistical Tools and Concepts

Contents 1 Introduction 2 Statistical Tools and Concepts 1 Introduction... 1 1.1 Objectives and Approach... 1 1.2 Scope of Resource Modeling... 2 1.3 Critical Aspects... 2 1.3.1 Data Assembly and Data Quality... 2 1.3.2 Geologic Model and Definition of Estimation

More information

The performance of estimation methods for generalized linear mixed models

The performance of estimation methods for generalized linear mixed models University of Wollongong Research Online University of Wollongong Thesis Collection 1954-2016 University of Wollongong Thesis Collections 2008 The performance of estimation methods for generalized linear

More information