UTILIZING CLASSICAL SCALING FOR FAULT IDENTIFICATION BASED ON CONTINUOUS-BASED PROCESS MIRA SYAHIRAH BT ABD WAHIT

Similar documents
EVALUATION OF FUSION SCORE FOR FACE VERIFICATION SYSTEM REZA ARFA

ONLINE LOCATION DETERMINATION OF SWITCHED CAPACITOR BANK IN DISTRIBUTION SYSTEM AHMED JAMAL NOORI AL-ANI

DEVELOPMENT OF GEODETIC DEFORMATION ANALYSIS SOFTWARE BASED ON ITERATIVE WEIGHTED SIMILARITY TRANSFORMATION TECHNIQUE ABDALLATEF A. M.

COVRE OPTIMIZATION FOR IMAGE STEGANOGRAPHY BY USING IMAGE FEATURES ZAID NIDHAL KHUDHAIR

DETECTION OF STRUCTURAL DEFORMATION FROM 3D POINT CLOUDS JONATHAN NYOKA CHIVATSI UNIVERSITI TEKNOLOGI MALAYSIA

ISSN: (Online) Volume 3, Issue 5, May 2015 International Journal of Advance Research in Computer Science and Management Studies

Structure in Data. A major objective in data analysis is to identify interesting features or structure in the data.

Dimensionality Reduction Techniques (DRT)

ULTIMATE STRENGTH ANALYSIS OF SHIPS PLATE DUE TO CORROSION ZULFAQIH BIN LAZIM

ARTIFICIAL NEURAL NETWORK AND KALMAN FILTER APPROACHES BASED ON ARIMA FOR DAILY WIND SPEED FORECASTING OSAMAH BASHEER SHUKUR

MODELING AND SIMULATION OF BILAYER GRAPHENE NANORIBBON FIELD EFFECT TRANSISTOR SEYED MAHDI MOUSAVI UNIVERSITI TEKNOLOGI MALAYSIA

RAINFALL SERIES LIM TOU HIN

EFFECTS OF Ni 3 Ti (DO 24 ) PRECIPITATES AND COMPOSITION ON Ni-BASED SUPERALLOYS USING MOLECULAR DYNAMICS METHOD

A COMPARISO OF MODAL FLEXIBILITY METHOD A D MODAL CURVATURE METHOD I STRUCTURAL DAMAGE DETECTIO OOR SABRI A ZAHARUDI

THE DEVELOPMENT OF PORTABLE SENSOR TO DETERMINE THE FRESHNESS AND QUALITY OF FRUITS USING CAPACITIVE TECHNIQUE LAI KAI LING

Basics of Multivariate Modelling and Data Analysis

DYNAMIC SIMULATION OF COLUMNS CONSIDERING GEOMETRIC NONLINEARITY MOSTAFA MIRSHEKARI

SYSTEM IDENTIFICATION MODEL AND PREDICTIVE FUNCTIONAL CONTROL OF AN ELECTRO-HYDRAULIC ACTUATOR SYSTEM NOOR HANIS IZZUDDIN BIN MAT LAZIM

MODELING AND CONTROL OF A CLASS OF AERIAL ROBOTIC SYSTEMS TAN ENG TECK UNIVERSITI TEKNOLOGY MALAYSIA

Table of Contents. Multivariate methods. Introduction II. Introduction I

UNCERTAINTY ANALYSIS OF TWO-SHAFT GAS TURBINE PARAMETER OF ARTIFICIAL NEURAL NETWORK (ANN) APPROXIMATED FUNCTION USING SEQUENTIAL PERTURBATION METHOD

ANOLYTE SOLUTION GENERATED FROM ELECTROCHEMICAL ACTIVATION PROCESS FOR THE TREATMENT OF PHENOL

EFFECTS OF SURFACE ROUGHNESS USING DIFFERENT ELECTRODES ON ELECTRICAL DISCHARGE MACHINING (EDM) MOHD ABD LATIF BIN ABD GHANI

THE DEVELOPMENT OF A HYDRAULIC FLOW METER AZAM BIN AHMAD

ROOFTOP MAPPING AND INFORMATION SYSTEM FOR RAINWATER HARVESTING SURAYA BINTI SAMSUDIN UNIVERSITI TEKNOLOGI MALAYSIA

OPTIMIZATION OF CHROMIUM, NICKEL AND VANADIUM ANALYSIS IN CRUDE OIL USING GRAPHITE FURNACE ATOMIC ABSORPTION SPECTROSCOPY NURUL HANIS KAMARUDIN

MULTISTAGE ARTIFICIAL NEURAL NETWORK IN STRUCTURAL DAMAGE DETECTION GOH LYN DEE

DEMULSIFICATION OF CRUDE OIL EMULSIONS VIA MICROWAVE ASSISTED CHEMICALS (ENVIRONMENTAL FRIENDLY) NUR HAFIZAH BINTI HUSSIN

BOUNDARY INTEGRAL EQUATION WITH THE GENERALIZED NEUMANN KERNEL FOR COMPUTING GREEN S FUNCTION FOR MULTIPLY CONNECTED REGIONS

EFFECT OF NOZZLE ANGLE ON JET IMPINGEMENT COOLING SYSTEM KHAIDER BIN ABU BAKAR

A COMPUTATIONAL FLUID DYNAMIC FRAMEWORK FOR MODELING AND SIMULATION OF PROTON EXCHANGE MEMBRANE FUEL CELL HAMID KAZEMI ESFEH

A STUDY ON THE APPLICABILITY OF SYMBOLIC COMPUTATION IN STABILISING CONTROL DESIGN FOR SWITCHED SYSTEMS AT-TASNEEM BINTI MOHD AMIN

POSITION CONTROL USING FUZZY-BASED CONTROLLER FOR PNEUMATIC-SERVO CYLINDER IN BALL AND BEAM APPLICATION MUHAMMAD ASYRAF BIN AZMAN

OPTIMAL CONTROL BASED ON NONLINEAR CONJUGATE GRADIENT METHOD IN CARDIAC ELECTROPHYSIOLOGY NG KIN WEI UNIVERSITI TEKNOLOGI MALAYSIA

I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN

A STUDY ON THE CHARACTERISTICS OF RAINFALL DATA AND ITS PARAMETER ESTIMATES JAYANTI A/P ARUMUGAM UNIVERSITI TEKNOLOGI MALAYSIA

SEDIMENTATION RATE AT SETIU LAGOON USING NATURAL. RADIOTRACER 210 Pb TECHNIQUE

Introduction to Machine Learning

Principal Components Analysis. Sargur Srihari University at Buffalo

ROOT FINDING OF A SYSTEM OF NONLINEAR EQUATIONS USING THE COMBINATION OF NEWTON, CONJUGATE GRADIENT AND QUADRATURE METHODS

Experimental Design and Data Analysis for Biologists

MODELING AND CONTROLLER DESIGN FOR AN INVERTED PENDULUM SYSTEM AHMAD NOR KASRUDDIN BIN NASIR UNIVERSITI TEKNOLOGI MALAYSIA

FOURIER TRANSFORM TECHNIQUE FOR ANALYTICAL SOLUTION OF DIFFUSION EQUATION OF CONCENTRATION SPHERICAL DROPS IN ROTATING DISC CONTACTOR COLUMN

Principal Component Analysis (PCA) Theory, Practice, and Examples

FRAGMENT REWEIGHTING IN LIGAND-BASED VIRTUAL SCREENING ALI AHMED ALFAKIABDALLA ABDELRAHIM

Structural Equation Modeling and Confirmatory Factor Analysis. Types of Variables

Learning based on kernel-pca for abnormal event detection using filtering EWMA-ED.

INTEGRATED MALAYSIAN METEOROLOGICAL DATA ATMOSPHERIC DISPERSION SOFTWARE FOR AIR POLLUTANT DISPERSION SIMULATION

CORRELATION STUDY OF THE STRAIN AND VIBRATION SIGNALS OF A BEAM HAMIZATUN BINTI MOHD FAZI

Principal component analysis

Evaluating Goodness of Fit in

CHARACTERISTICS OF SOLITARY WAVE IN FIBER BRAGG GRATING MARDIANA SHAHADATUL AINI BINTI ZAINUDIN UNIVERSITI TEKNOLOGI MALAYSIA

OPTICAL TWEEZER INDUCED BY MICRORING RESONATOR MUHAMMAD SAFWAN BIN ABD AZIZ

FLOOD MAPPING OF NORTHERN PENINSULAR MALAYSIA USING SAR IMAGES HAFSAT SALEH DUTSENWAI

MONTE CARLO SIMULATION OF NEUTRON RADIOGRAPHY 2 (NUR-2) SYSTEM AT TRIGA MARK II RESEARCH REACTOR OF MALAYSIAN NUCLEAR AGENCY

Machine Learning 11. week

DEVELOPMENT OF FAULT DETECTION, DIAGNOSIS AND CONTROL SYSTEM IDENTIFICATION USING MULTIVARIATE STATISTICAL PROCESS CONTROL (MSPC)

MODELLING AND EXPERIMENTAL INVESTIGATION ON PERFORATED PLATE MOBILITY CHEAH YEE MUN UNIVERSITI TEKNIKAL MALAYSIA MELAKA

1 A factor can be considered to be an underlying latent variable: (a) on which people differ. (b) that is explained by unknown variables

Principal component analysis

Principle Components Analysis (PCA) Relationship Between a Linear Combination of Variables and Axes Rotation for PCA

Machine Learning. Principal Components Analysis. Le Song. CSE6740/CS7641/ISYE6740, Fall 2012

Predictive analysis on Multivariate, Time Series datasets using Shapelets

MECHANICAL PROPERTIES OF CALCIUM CARBONATE AND COCONUT SHELL FILLED POLYPROPYLENE COMPOSITES MEHDI HEIDARI UNIVERSITI TEKNOLOGI MALAYSIA

Robustness of Principal Components

DIMENSION REDUCTION AND CLUSTER ANALYSIS

Basics of Multivariate Modelling and Data Analysis

Linear Models 1. Isfahan University of Technology Fall Semester, 2014

FINITE ELEMENT METHOD FOR TWO-DIMENSIONAL ELASTICITY PROBLEM HANIMAH OTHMAN

Jurnal Teknologi CHARACTERIZATION OF A SHORT MICROCHANNEL DEVICE FOR SURFACE COOLING. Full Paper. Kamaruzaman N. B *, Saat A.

ECE 661: Homework 10 Fall 2014

FACTOR ANALYSIS AND MULTIDIMENSIONAL SCALING

MONITORING THE VARIABILITY OF PETROCHEMICAL ZA FERTILIZER PRODUCTION PROCESS BASED ON SUBGROUP OBSERVATIONS

Dimension Reduction Techniques. Presented by Jie (Jerry) Yu

Unconstrained Ordination

Statistical Analysis of fmrl Data

MATHEMATICAL MODELLING OF UNSTEADY BIOMAGNETIC FLUID FLOW AND HEAT TRANSFER WITH GRAVITATIONAL ACCELERATION

THE EFFECT OF PLATE GEOMETRY ON FLOW AND HEAT TRANSFER ACROSS A PARALLEL PLATE HEAT EXCHANGER

Synergy between Data Reconciliation and Principal Component Analysis.

SQUEAL SUPPRESSION APPROACHES OF A DISC BRAKE ASSEMBLY SAW CHUN LIN UNIVERSITI TEKNOLOGI MALAYSIA

MDS codec evaluation based on perceptual sound attributes

ADSORPTION AND STRIPPING OF ETHANOL FROM AQUEOUS SOLUTION USING SEPABEADS207 ADSORBENT

EXPERIMENTAL STUDY ON THE NOISE REDUCTION OF HYDRAULIC BENCH MUHAMMAD SAUFI BIN ABDUL

Comparison of statistical process monitoring methods: application to the Eastman challenge problem

A PROPOSED DOUBLE MOVING AVERAGE (DMA) CONTROL CHART

Karhunen-Loève Transform KLT. JanKees van der Poel D.Sc. Student, Mechanical Engineering

ELIMINATION OF RAINDROPS EFFECTS IN INFRARED SENSITIVE CAMERA AHMAD SHARMI BIN ABDULLAH

INDIRECT TENSION TEST OF HOT MIX ASPHALT AS RELATED TO TEMPERATURE CHANGES AND BINDER TYPES AKRIMA BINTI ABU BAKAR

HYDROGEN RECOVERY FROM THE REFORMING GAS USING COMMERCIAL ACTIVATED CARBON MEHDI RAHMANIAN

INDEX SELECTION ENGINE FOR SPATIAL DATABASE SYSTEM MARUTO MASSERIE SARDADI UNIVERSITI TEKNOLOGI MALAYSIA

THE EFFECT OF FREQUENCY ON FLOW AND HEAT TRANSFER OF A HEAT EXCHANGER IN AN OSCILLATORY FLOW CONDITION

BORANG PENGESAHAN STATUS TESIS

DEVELOPMENT OF PROCESS-BASED ENTROPY MEASUREMENT FRAMEWORK FOR ORGANIZATIONS MAHMOOD OLYAIY

Freeman (2005) - Graphic Techniques for Exploring Social Network Data

DENSITY FUNCTIONAL THEORY SIMULATION OF MAGNETISM DUE TO ATOMIC VACANCIES IN GRAPHENE USING SIESTA

New Nonlinear Four-Step Method for

CLASSIFICATION OF MULTIBEAM SNIPPETS DATA USING STATISTICAL ANALYSIS METHOD LAU KUM WENG UNIVERSITITEKNOLOGI MALAYSIA

Introduction to Structural Equation Modeling

Unsupervised Learning Methods

Nur Atikah binti Md.Nor

Transcription:

UTILIZING CLASSICAL SCALING FOR FAULT IDENTIFICATION BASED ON CONTINUOUS-BASED PROCESS MIRA SYAHIRAH BT ABD WAHIT Thesis submitted in fulfillment of the requirements for the degree of Bachelor Of Chemical Engineering Faculty of Chemical Engineering and Chemical Resources UNIVERSITI MALAYSIA PAHANG JANUARY 2013

ABSTRACT This study is about to develop a new method on Fault Identification using Classical Scaling. Process Monitoring is from Statistical Process Control (SPC), the statistical tool which used of statistical methods and to control of a process, by repeated sampling measurements or to predict results. It also help to determine whether the process is working properly or not. This Statistical Process Control (SPC) will forms charts with data (control charts) to shown the result. This study will develop the normal Multivariate Statistical Process Monitoring (MSPM). The data will be run as the normal Multivariate Statistical Process Monitoring (MSPM). So from the technique that used in this study such as Principal Component Analysis, Statistical Process Control and Multivariate Statistical Process Monitoring (MSPM), this study will be develop which the most faster technique that will be detecting the fault in the system used. iv

ABSTRAK Kajian ini adalah untuk membangunkan satu kaedah baru mengenai Pengenalan Kerosakan menggunakan Penskalaan klasik. Proses Pemantauan adalah dari Kawalan Proses Statistik (SPC), alat statistik yang menggunakan kaedah statistik dan untuk mengawal proses, dengan mengulangi ukuran sampel atau untuk meramalkan keputusan. Ia juga membantu untuk menentukan sama ada proses ini berfungsi dengan betul atau tidak. Kawalan Proses Statistik (SPC) akan membentuk carta dengan data (carta kawalan) untuk menunjukkan hasilnya. Kajian ini akan membangunkan Proses Pemantauan Multivariat Statistik (MSPM) normal. Data akan berjalan seperti Proses Pemantauan Multivariat Statistik (MSPM) normal. Jadi dari teknik yang digunakan dalam kajian ini seperti Analisis Komponen Prinsipal, Kawalan Proses Statistik dan Proses Pemantauan Multivariat Statistik (MSPM), kajian ini akan membangunkan satu teknik yang paling cepat untuk mengesan kerosakan dalam sistem yang digunakan. v

TABLE OF CONTENTS TITLE PAGE SUPERVISOR S DECLARATION STUDENT S DECLARATION ACKNOWLEDGEMENTS ABSTRACT ABSTRAK TABLE OF CONTENTS LIST OF TABLES LIST OF FIGURES i ii iii iv v vi viii ix CHAPTER 1 INTRODUCTION 1.1 Background of The Study 1 1.2 Problem Statement 2 1.3 Research Problem 3 1.4 Aims and Objectives 3 1.5 Scope of Proposed Study 3 1.6 Expected Outcome 4 1.7 Significance of Proposed Study 4 1.8 Conclusion 4 CHAPTER 2 LITERATURE REVIEW 2.1 Fundamental of Principal Component Analysis 5 2.2 Fundamental of Multidimensional Scaling 7 vi

CHAPTER 3 METHODOLOGY 3.1 Procedure 3.1.1 Overall Framework in Multivariate Statistical Process Monitoring (MSPM) System 9 3.1.2 Fault Detection and Fault Identification 10 3.2 Case Study 13 CHAPTER 4 RESULTS AND DISCUSSIONS 4.1 Normal Operation Condition (NOC) Data 16 4.2 Fault 8 4.2.1 Abrupt Fault 18 4.2.2 Incipient Fault 19 4.3 Fault 9 4.3.1 Abrupt Incipient Fault 20 4.3.2 Incipient Fault 21 4.4 Fault Identification 22 4.5 Discussion 24 CHAPTER 5 CONCLUSION 26 REFERENCES 27 vii

LIST OF TABLES Table No. Title Page 33.2b List of Abnormal Operation in the CSTR System 14 3.2c List of Variables In The CSTR System For Monitoring 14 viii

LIST OF FIGURES Figure No. Title Page 3.1.1 Overall Framework 9 3.1.2 Off-line Modeling and Monitoring (phase I) and On-line Monitoring (phase II) 10 3.2a CSTR system (Zhang, 2006) 13 4.1a T 2 Statistics versus Observations of NOC Data 17 4.1b SPE Statistics versus Observations of NOC Data 17 4.2.1a T 2 Statistics versus Observations of Fault 8 for abrupt fault 18 4.2.1b SPE Statistics versus Observations of Fault 8 for abrupt fault 19 4.2.2a T 2 Statistics versus Observations of Fault 8 for incipient fault 19 4.2.2b SPE Statistics versus Observations of Fault 8 for incipient fault 20 4.3.1a T 2 Statistics versus Observations of Fault 9 for abrupt fault 20 4.3.1b SPE Statistics versus Observations of Fault 9 for abrupt fault 21 4.3.2a T 2 Statistics versus Observations of Fault 9 for incipient fault 21 4.3.2b SPE Statistics versus Observations of Fault 9 for incipient fault 22 4.4a Example of PCA Scores 22 4.4b Normal Operation Condition (NOC) versus Observations 23 4.4c First Principal Component versus Second Principal Component 23 ix

1 CHAPTER 1 INTRODUCTION 1.1 BACKGROUND OF THE STUDY Process Monitoring is an effective tool used to maintain control of a manufacturing operation, as well as demonstrating ongoing compliance to specifications.( Finepoint, 2003). It help to monitor some of critical points throughout the industrial process to increase productivity, maintain product quality, reduce downtime, prevent product loss or prevent accidents. It also will assist in controlling the product quality that relies on many factors such as temperature, pressure and flowrate. Chemical processes in industry require monitoring and management to maximize production and assist in providing safe product for consumption. With process monitoring, it s target to quickly identify deviation from the baseline performance. Other than that, there is Process Control which defines the active changing of the process based on the results of process monitoring. If the process monitoring tools have detected an out-of-control situation, the person responsible for the process makes a change to bring the process back into the control.

2 If problem happen, it can be solved using the problem resolution process. In this process, there are have 3 parts which are Fault Analysis, Fault Identification or Troubleshooting and Fault Resolution. For Fault Analysis, this process can hypothesize the cause of error and list the potential faults. For Fault Identification, it will eliminate and isolate the list of potential faults until offending fault is found, or we can mention that this fault will locate and identify the cause of the problem. And lastly is Fault Resolution is to apply the fix. In Process Monitoring, there apply the Statistical Process Control (SPC) which is statistical tool for inspects random sample from a process. Statistical Process Control (SPC) also used to determine whether the process is in control or not. From the Statistical Process Control (SPC), it will forms charts with data (Control charts) to obtain the result of the process. For the control charts, there have two types of data which are variable data and attribute data. In Statistical Process Control (SPC), there have three types of fundamental which are understanding the process, understanding the causes of variation and elimination the sources of special cause variation. In understanding a process, usually the process using control charts, where to identify the variation that will be due into special causes. When the process capability is found lacking, it will be effort to determine causes of that variance. They usually used Ishikawa diagrams, designed experiments and Pareto charts as their tools. 1.2 PROBLEM STATEMENT In Process Monitoring, there are many ways on detecting fault in the system process. For example, there have Statistical Process Control (SPC) and Principal Component Analysis (PCA). In the Statistical Process Control (SPC), we can get the results in control charts form. But there are limitations of Principal Component Analysis which it will take long step in detection the

fault in the system. So that, this study will be develop the new Fault Identification method which is Classical Scaling. 3 1.3 RESEARCH PROBLEM In this research, it will be more focusing on the variables of the process, using the concept of Process Monitoring and Statistical Process Control (SPC). Then, it will extend the process using Multivariate Statistical Process Monitoring (MSPM) versus Statistical Process Control (SPC), or using Classical Scaling (CMDS) which to compare the methods that will be detected the variables faster. 1.4 AIMS AND OBJECTIVES 1.4.1 Run the normal Multivariate Statistical Process Monitoring (MSPM) 1.4.2 Develop the new Multivariate Statistical Process Monitoring (MSPM) 1.4.3 Develop Fault Identification new method, which is Classical Scaling (CMDS) 1.5 SCOPE OF PROPOSED STUDY In this research, it will propose a new technique on identify the fault in the process. It will be included many types of methods, to compare which one will be detect the variables faster. This study will be developed and run based on Matlab software.

4 1.6 EXPECTED OUTCOME At the end of this research, the results will shows that using the Fault Identification new method, Classical Scaling (CMDS), that method will be detect the variables more faster than other methods. 1.7 SIGNIFICANT OF PROPOSED STUDY When the new method, Classical Scaling (CMDS) will be develop, it help the system detect the faults and the variables more faster than the other methods. Therefore, it will produce the best quality of results in the system required. 1.8 CONCLUSION This study is about to develop a new Fault Identification method which is Classical Scaling. This method is using the Multivariate Statistical Process Monitoring (MSPM) method. So, it will help to detect the fault faster rather use the Principal Component Analysis (PCA) method. It will help to make a better quality of product in the system. In the second chapter, the literature review will be classified into two sections that are fundamental of the Principal Component Analysis (PCA) and Multivariate Statistical Process Monitoring (MSPM). For the third chapter which is methodology, it will come out with the procedure of this study and the case study used in this study. For the forth chapter, which are results and discussions. It will come out with the results that will generate from the study and discussion about the responses to the results. Finally, the last chapter is conclusion which to conclude the study and to ensure that the study has achieved its objectives.

5 CHAPTER 2 LITERATURE REVIEW 2.1 Fundamental of Principal Component Analysis (PCA) Principal Component Analysis (PCA) is one of the most common methods used by the data analysis to provide a condensed description and describe patterns of variation in multivariate data sets. Moreover, Principal Component Analysis is also able to retain meaningful information in the early axes whereas variation associated to experimental error, measurement in accuracy, and rounding is summarized in later axis (Gauch, 1982). Principal component analysis (PCA) is a well-established technique for monitoring and disturbance detection of multivariate process, as it enables variability assessment through dimensionality reduction. Principal component analysis is also a variable reduction procedure. When there have obtained the data on the number of variables and known that there is some disturbance in those variables, which means that some of the variables are correlated with one another. Because of the disturbance, it should be possible to reduce the observed variables into smaller number of principal component that will gives the most of the variance in the observed variables.

6 In the way to performing a principal component analysis, it is possible to calculate a score for each of subject on a given principal component. Then, the number of components extracted in a principal component analysis is equal to the number of observed variables being analyzed. However, in most analyses, only the first few components account for meaningful amounts of variance, so only these first few components are retained, interpreted, and used in subsequent analyses (such as in multiple regression analyses). This procedure is recommended to determine the minimum number of factors that will account for the maximum variance in the data in use in the particular multivariate analysis. This is a common technique for searching the patterns in the data that consist of high dimensions. From this technique, the data that will get is the standard deviation, covariance, eigenvectors and eigenvalues. The eigenvalues refer to the total variance explained by each factor. The standard deviation measures the variability of the data. The task of principal component analysis is to identify the patterns in the data and to direct the data by highlighting their similarities and differences. In order to perform principal component analysis in an appropriate manner, we needs to subtract the mean from each of the dimensions of the data. The mean subtracted is the average across each dimension. This task is called data adjustment. The next task is to calculate the covariance matrix. The covariance can be computed only if the data is two dimensional. If the dimension of data is more than two, then the covariance is calculated or measured more than once. If the data is two dimensional, then the covariance matrix in principal component analysis is a square matrix with non diagonal elements in this matrix as positive elements. As the definition of eigenvalues, the calculation in principal component analysis involves the extraction of the total variance from each factor. The role of eigenvalues, which are then formed into vectors, is to provide the researcher with information about the patterns in the data. The principal component is referred to as that eigenvector. This has the highest eigenvalue. Once the eigenvectors are found from the covariance matrix, the next task is to

7 sort the eigenvalues from highest to lowest. Thus, principal component analysis gives the original factors in terms of differences and similarities between the factors. But sometimes, Principal component analysis is confused with factor analysis, because there are many important similarities between both of the procedures. Which mean that the both procedures are variables reduction methods that can be used to identify the observed variables that tend to get together empirically. Both procedures sometimes will be provide very similar results. But there are some important conceptual differences between principal component analysis and factor analysis that should be understood. This most related to the assumption of an underlying causal structure. Factor analysis assumes that the covariation in the observed variables is due to the presence of one or more latent variables factors that exert causal influence on these observed variables. In contrast, principal component analysis wills no assumption about the underlying causal model. This is because principal component analysis is simply a variable reduction procedure that typically results in relatively small number of components that gives most of the variance in a set of observed variables. 2.2 Fundamental of Multidimensional Scaling Multidimensional Scaling (MDS) is a method that represents measurements of similarity or dissimilarity among pairs of objects as distances between points of a low-dimensional multidimensional space (Borg & Groenen, 2005). Otherwise, Multidimensional Scaling (MDS) can be describes a family of techniques for the analysis of proximity data on a set of stimuli to reveal the hidden structure underlying the data ( Steyvers, 2002). The proximity data can come from similarity judgments, identification confusion matrices, grouping data, same-different errors or any other measure of pairwise similarity. Multidimensional Scaling (MDS) is a set of data analysis techniques that display the structure of distance-like data as a geometrical picture.

8 Multidimensional Scaling can be classified according to whether the similarities data are qualitative is called as nonmetric MDS or quantitative known as metric MDS. The number of similarity matrices and the nature of the MDS model can also classify MDS types. This classification yields classical MDS which using one matrix and unweighted model, replicated MDS which used several matrices and unweighted model, and weighted MDS which using several matrices and weighted model. This method will give a graphical display of the structure of the data, one that is much easier to understand than an array of numbers and, moreover, one that displays the essential information in the data, smoothing out noise. This method also can be described by values along a set of dimensions that places these stimuli as points in a multidimensional space and that the similarity between stimuli is inversely related to the distances of the corresponding points in the multidimensional space. Groenen & Velden (2004) stated that Multidimensional Scaling (MDS) objective s to represent the dissimilarities as distances between points in allow dimensional space such that the distances correspond as closely as possible to the dissimilarities. This method can be applied with different purposes. For example, by applying exploratory data analysis, which is by placing objects as points in a low dimensional space, the observed complexity in the original data matrix can often be reduced while preserving the essential information in the data. By a representation of the pattern of proximities in two or three dimensions, researchers can visually study the structure in the data. It also has been used to discover the mental representation of stimuli that explains how similarity judgments are generated. Sometimes, MDS reveals the psychological dimensions hidden in the data that can meaningfully describe the data. The multidimensional representations resulting from MDS are also often useful as the representational basis for various mathematical models of categorization, identification, and/or recognition memory (Nosofsky, 1992) or generalization (Shepard, 1987).

9 CHAPTER 3 METHODOLOGY 3.1 Procedure 3.1.1 Overall Framework in Multivariate Statistical Process Monitoring (MSPM) System FAULT DETECTION FAULT IDENTIFICATION FAULT DIAGNOSIS PROCESS RECOVERY Figure 3.1.1: Overall Framework

10 In Multivariate Statistical Process Monitoring (MSPM) System, there are four main steps. First step is Fault Detection, which is to designate the departure of observed samples from an acceptable range using a set of parameters. For the second step is Fault Identification, which to identifying the observed process variables that are most relevant to the fault which is typically identified by using the contribution plot technique. For the third step which is Fault Diagnosis. This step is specially to determine the type of fault which has been significantly contributed to the signal. And lastly for forth step, Process Recovery to remove the causes that contribute to the detected fault. 3.1.2 Fault Detection and Fault Identification This is the procedures of fault detection and fault identification to be complete. Figure 3.1.2 : Off-line Modeling and Monitoring (phase I) and On-line Monitoring (phase II)

11 Phase I, Off-line Modeling and Monitoring stage. In this phase, a set of normal operation condition data are identified off-line based on the historical process data archive. NOC simply implies that the process is operated at the desired setting condition and produces satisfactory products that meet the qualitative as well as quantitative specified standard (Martin et al., 1996). The data that we get will be standardized to zero mean and variance with respective to each of the variables because principle component analysis results depend on data scales. In the second step, the development of principle component analysis model for the normal operation condition (NOC) data requires the establishment of a set of variance-covariance matrix, C mxm. x j, i x j, i x i i c1, 1 1 c2, 1 C X' X n 1 cm, 1 c c c 1, 2 2, 2 m, 2 c c c 1,m 2,m m,m C is then transformed into a set of basic structures of eigen-based formula. C VV Finally, the principle component analysis model can be simply developed by: P p1 x1, 1v, xn, 1v, 1 1 1 1 pm x x P XV v 1,m m, 1 v n,m m, 1 T x x v 1, 1 1,m v n, 1 1,m x x 1,m n,m v v m,m m,m

The third step basically involves calculation of the Hotelling s T 2 and SPE monitoring statistics. 12 For T 2, T 2 i a j1 p 2 i, j j For SPE ~ E X - Xˆ T X - PaVa X XVa Va T X I - V V SPE i a a T ~~ e e i T i For the final step in phase I, it will deals with developing the control limits for both of the statistics. SPE T F A, n A, z A n 1 ( n A) 2 h 1 h0 1 h0 2 2 0 1h0 1 ( 1) 2 1 1 In phase II, is for On-line Monitoring. In this phase, the step 5 until step 7 is similar to the step 1 until 3 in the phase I. Until step 8 which are the last step, there are two main operations that have to be conducted separately, which are fault detection and fault identification. For fault detection, it is a fault situation that regards as a result of an occurrence of a special event that is not in conformance to the common cause nature. A fault situation will be declared if either of the monitoring statistics exceeding its respective control limit for a predefined successive number of samples. In this phase also, the data will be more focusing on the abrupt fault and the incipient fault. After that, fault identification will be based on the contribution plot which the variables have given the most contribution to the fault.

13 3.2 Case Study The multivariate statistical process control approaches are based on noncausal models built from historical process data using multivariate latent variable methods such as PCA and PLS. The faults are then detected by referencing future data against these covariance models, and isolation is attempted through examining contributions to the breakdown of the covariance structure. There are major differences between these approaches arising mainly from the different types of models and data utilized to build them. Each of them has clear, but complementary, strengths and weaknesses. These are discussed using simulated data from a CSTR process shown below. Figure 3.2a: CSTR system (Zhang, 2006)

14 Table 3.2b : List of Abnormal Operation in the CSTR System Fault Fault Causes Cases 1 Pipe 1 blockage 2 External feed-reactant flow rate too high 3 Pipe 2 or 3 is blocked or pump fails 4 Pipe 10 or 11 is blocked or control valve 1 fails low 5 External feed-reactant temperature abnormal 6 Control valve 2 fails high 7 Pipe 7,8 or 9 is blocked or control valve 3 fails low 8 Control valve 1 fails high 9 Pipe 4,5 or 6 is blocked or control valve 3 fails low 10 Control valve 3 fails high 11 External feed-reactant concentration too low Table 3.2c : List Of Variables In The CSTR System For Monitoring Process No Variables Variables Names 1 V1 Tank temperature 2 V2 Tank level 3 V3 Feed temperature 4 V4 Inlet flow rate 5 V5 Recycle flow rate 6 V6 Outlet flow rate 7 V7 Cooling water flow rate 8 V8 Product concentration 9 V9 Feed concentration 10 V10 Heat exchanger entrance pressure Instruments 11 V11 Controller 1 12 V12 Controller 2 13 V13 Controller 3

15 In Figure 3.2a, it showed the case study that used in this study. In this case study, the researchers have listed the abnormal operation that might be happen in the CSTR system in Table 3.2b. There are many fault that might be happen in this case study such as the pipe are blockage, external feed-reactant too high or the control valve are fails. In Table 3.2c, there are list of the variables occurred in the CSTR system to monitor.

16 CHAPTER 4 RESULT AND DISCUSSION 4.1 Normal Operation Condition (NOC) Data When start run the phase I, the first data that will be get is the normal operation condition (NOC) data. this data will be as the reference data when to detecting the variable that contribute to the fault. The data that will be get as below which are T 2 Statistics versus Observations data and SPE Statistics versus Observations data.

SPE Statistics T2 Statistics 17 14 12 NOC 95%limit 99%limit PCA-based MSPM Monitoring Chart 10 8 6 4 2 0 0 10 20 30 40 50 60 70 80 90 100 Observations Figure 4.1a: T 2 Statistics versus Observations 14 12 NOC 95%limit 99%limit PCA-based MSPM Monitoring Chart 10 8 6 4 2 0 0 10 20 30 40 50 60 70 80 90 100 Observations Figure 4.1b: SPE Statistics versus Observations