Characterisation of the plasma density with two artificial neural network models

Similar documents
No. 6 Determining the input dimension of a To model a nonlinear time series with the widely used feed-forward neural network means to fit the a

Portugaliae Electrochimica Acta 26/4 (2008)

ARTIFICIAL NEURAL NETWORK PART I HANIEH BORHANAZAD

Artificial Neural Networks. Edward Gatt

Dynamical analysis and circuit simulation of a new three-dimensional chaotic system

Projective synchronization of a complex network with different fractional order chaos nodes

Effect of Spiral Microwave Antenna Configuration on the Production of Nano-crystalline Film by Chemical Sputtering in ECR Plasma

CHAPTER IX Radial Basis Function Networks

Photodetachment of H in an electric field between two parallel interfaces

Theoretical analysis of ion kinetic energies and DLC film deposition by CH 4 +Ar (He) dielectric barrier discharge plasmas

Neuro-Fuzzy Comp. Ch. 4 March 24, R p

Adjustment of electron temperature in ECR microwave plasma

Sunlight loss for femtosecond microstructured silicon with two impurity bands

Efficient removal of typical dye and Cr(VI) reduction using N-doped

Radiation energy flux of Dirac field of static spherically symmetric black holes

Time-delay feedback control in a delayed dynamical chaos system and its applications

A comparison of the defects introduced during plasma exposure in. high- and low-k dielectrics

PIC/MCC Simulation of Radio Frequency Hollow Cathode Discharge in Nitrogen

Neural Networks Lecture 4: Radial Bases Function Networks

Effect of Sample Configuration on Droplet-Particles of TiN Films Deposited by Pulse Biased Arc Ion Plating

Nonchaotic random behaviour in the second order autonomous system

4. Multilayer Perceptrons

A Novel Analytical Expressions Model for Corona Currents Based on Curve Fitting Method Using Artificial Neural Network

Generalized projective synchronization of a class of chaotic (hyperchaotic) systems with uncertain parameters

Silicon wafer characterisation by laser ultrasonics and neural networks

Long- and short-term average intensity for multi-gaussian beam with a common axis in turbulence

Bifurcation control and chaos in a linear impulsive system

Analysis of second-harmonic generation microscopy under refractive index mismatch

Introduction to Natural Computation. Lecture 9. Multilayer Perceptrons and Backpropagation. Peter Lewis

Neural Networks and the Back-propagation Algorithm

Artificial Intelligence

General Synthesis of Graphene-Supported. Bicomponent Metal Monoxides as Alternative High- Performance Li-Ion Anodes to Binary Spinel Oxides

Multilayer Perceptrons and Backpropagation

Radial Basis Function Networks. Ravi Kaushik Project 1 CSC Neural Networks and Pattern Recognition

A Wavelet Neural Network Forecasting Model Based On ARIMA

932 Yang Wei-Song et al Vol. 12 Table 1. An example of two strategies hold by an agent in a minority game with m=3 and S=2. History Strategy 1 Strateg

Machine Learning for Large-Scale Data Analysis and Decision Making A. Neural Networks Week #6

POWER SYSTEM DYNAMIC SECURITY ASSESSMENT CLASSICAL TO MODERN APPROACH

Introduction to Neural Networks

Electron cyclotron resonance plasma enhanced direct current sputtering discharge with magnetic-mirror plasma confinement

Method for Recognizing Mechanical Status of Container Crane Motor Based on SOM Neural Network

A Modified Radial Basis Function Method for Predicting Debris Flow Mean Velocity

Large Storage Window in a-sinx/nc-si/a-sinx Sandwiched Structure

Anti-synchronization of a new hyperchaotic system via small-gain theorem

PACS: Wc, Bh

Estimation of the Pre-Consolidation Pressure in Soils Using ANN method

Lab 5: 16 th April Exercises on Neural Networks

APPLICATION OF RADIAL BASIS FUNCTION NEURAL NETWORK, TO ESTIMATE THE STATE OF HEALTH FOR LFP BATTERY

Multilayer Perceptron Tutorial

PAD MODELING BY USING ARTIFICIAL NEURAL NETWORK

Structural evolution of silicone oil liquid exposed to Ar plasma

A new four-dimensional chaotic system

Keywords- Source coding, Huffman encoding, Artificial neural network, Multilayer perceptron, Backpropagation algorithm

AN INTRODUCTION TO NEURAL NETWORKS. Scott Kuindersma November 12, 2009

Effect of an axial magnetic field on a DC argon arc

New Feedback Control Model in the Lattice Hydrodynamic Model Considering the Historic Optimal Velocity Difference Effect

Plasma Diagnosis for Microwave ECR Plasma Enhanced Sputtering Deposition of DLC Films

Teleportation of an n-bit one-photon and vacuum entangled GHZ cavity-field state

Discussion About Nonlinear Time Series Prediction Using Least Squares Support Vector Machine

Artificial Neural Networks (ANN) Xiaogang Su, Ph.D. Department of Mathematical Science University of Texas at El Paso

Double-distance propagation of Gaussian beams passing through a tilted cat-eye optical lens in a turbulent atmosphere

Revision: Neural Network

The near-infrared spectra and distribution of excited states of electrodeless discharge rubidium vapour lamps

NONLINEAR CLASSIFICATION AND REGRESSION. J. Elder CSE 4404/5327 Introduction to Machine Learning and Pattern Recognition

Lecture 7 Artificial neural networks: Supervised learning

Supporting Infromation

Multilayer Perceptron

Supervised (BPL) verses Hybrid (RBF) Learning. By: Shahed Shahir

Dynamics of Drift and Flute Modes in Linear Cylindrical ECR Plasma

Calibration and Temperature Retrieval of Improved Ground-based Atmospheric Microwave Sounder

New Application of the (G /G)-Expansion Method to Excite Soliton Structures for Nonlinear Equation

Artificial Neural Network

Transport properties through double-magnetic-barrier structures in graphene

A New Integrable Couplings of Classical-Boussinesq Hierarchy with Self-Consistent Sources

ARTIFICIAL NEURAL NETWORKS گروه مطالعاتي 17 بهار 92

A NOVEL FILLED FUNCTION METHOD FOR GLOBAL OPTIMIZATION. 1. Introduction Consider the following unconstrained programming problem:

Deep Learning for Partial Differential Equations (PDEs)

698 Zou Yan-Li et al Vol. 14 and L 2, respectively, V 0 is the forward voltage drop across the diode, and H(u) is the Heaviside function 8 < 0 u < 0;

In situ electrical characterization of dielectric thin films directly exposed to plasma vacuum-ultraviolet radiation

Application of Artificial Neural Network Model in Calculation of Pressure Drop Values of Nanofluid

Electrohydromechanical analysis based on conductivity gradient in microchannel

Design Collocation Neural Network to Solve Singular Perturbed Problems with Initial Conditions

THE SCATTERING FROM AN ELLIPTIC CYLINDER IRRADIATED BY AN ELECTROMAGNETIC WAVE WITH ARBITRARY DIRECTION AND POLARIZATION

RESEARCH ON AEROCRAFT ATTITUDE TESTING TECHNOLOGY BASED ON THE BP ANN

An Adaptive Neural Network Scheme for Radar Rainfall Estimation from WSR-88D Observations

Investigation of H 2 :CH 4 Plasma Composition by Means of Spatially Resolved Optical Spectroscopy

DEEP LEARNING AND NEURAL NETWORKS: BACKGROUND AND HISTORY

Design and performance of Hall probe measurement system in CSNS

Using Kernel PCA for Initialisation of Variational Bayesian Nonlinear Blind Source Separation Method

Data Mining Part 5. Prediction

C4 Phenomenological Modeling - Regression & Neural Networks : Computational Modeling and Simulation Instructor: Linwei Wang

The Effect of the Discharge Chamber Structure on the Performance of a 5 cm-diameter ECR Ion Thruster

Diffusion during Plasma Formation

Optical time-domain differentiation based on intensive differential group delay

Study of DC Cylindrical Magnetron by Langmuir Probe

Analysis of Multilayer Neural Network Modeling and Long Short-Term Memory

MODELING OF AN ECR SOURCE FOR MATERIALS PROCESSING USING A TWO DIMENSIONAL HYBRID PLASMA EQUIPMENT MODEL. Ron L. Kinder and Mark J.

Introduction to Neural Networks: Structure and Training

A tunable corner-pumped Nd:YAG/YAG composite slab CW laser

MULTIPACTOR ON A DIELECTRIC SURFACE WITH LONGITUDINAL RF ELECTRIC FIELD ACTION

Transcription:

Characterisation of the plasma density with two artificial neural network models Wang Teng( 王腾 ) a)b), Gao Xiang-Dong( 高向东 ) a), and Li Wei( 李炜 ) c) a) Faculty of Electromechanical Engineering, Guangdong University of Technology, Guangzhou 510090, China b) School of Computer, South China Normal University, Guangzhou 510631, China c) School of Physics and Telecommunication Engineering, South China Normal University, Guangzhou 510006, China (Received 25 April 2009; revised manuscript received 4 August 2009) This paper establishes two artificial neural network models by using a multi layer perceptron algorithm and radial based function algorithm in order to predict the plasma density in a plasma system. In this model, the input layer is composed of five neurons: the radial position, the axial position, the gas pressure, the microwave power and the magnet coil current. The output layer is the target output neuron: the plasma density. The accuracy of prediction is tested with the experimental data obtained by the Langmuir probe. The effectiveness of two artificial neural network models are demonstrated, the results show good agreements with corresponding experimental data. The ability of the artificial neural network model to predict the plasma density accurately in an electron cyclotron resonance-plasma enhanced chemical vapour deposition system can be concluded, and the radial based function is more suitable than the multi layer perceptron in this work. Keywords: plasma density, prediction, multi layer perceptron, radial based function PACC: 0540, 5270D, 5225L 1. Introduction The technology of low-temperature plasma has been widely used in the fields of microelectronics, materials, chemical industry, film deposition and plasma etching. It is very important to study inner physical parameters in order to optimise the technology process. During the recent decades, there has been a lot of research on the application of the electron cyclotron resonance (ECR) plasma system, [1 3] because the ECR plasma system has many good characteristics such as low gas pressure and high plasma density. Although the plasma density of ECR plasma is very important to the plasma processing technology, it was less investigated, and the scope of investigating data was limited by the cost and time. [4 7] So, it is necessary to obtain more desired parameters from some other means. Artificial neural network (ANN) is now already a very well developed, natural and routine technique, which utilises interconnected nodes or neurons to form a network that can model a complex functional relationship. The technique is particularly suited to problems which involve the manipulation of multiple parameters and non-linear interpolation. Recently, some researchers attempted to use ANN models for various applications in various fields. [8 20] But there is little information about the application of ANN in the simulation of plasma density distribution. In our work, two ANN models are studied to characterise the nonlinear plasma density of ECR-plasma enhanced chemical vapour deposition (PECVD) plasma system. 2. The experimental methods Project supported by the National Natural Science Foundation of China (Grant No. 60375012). Corresponding author. E-mail: gaoxd666@126.com c 2010 Chinese Physical Society and IOP Publishing Ltd The schematic diagram of the ECR-PECVD plasma system is showed in Fig. 1. It consists of a wave-guide (1), a resonance room with dimensions ϕ14 20 cm (2), a magnetic coil (3), a reaction chamber with dimensions ϕ35 50 cm (4), a substrate table (5), a vacuum system (6), and a mass flow controller (MFC) (7). The various magnetic field strengths can be obtained by adjusting the magnetic coil currents (I m ). The working gas N 2 is introduced into the resonance chamber by the MFC. The energy of the microwave is strongly absorbed by the plasma in the ECR zone. As a result, high-density plasma is generated with high-degree of ionisation. http://www.iop.org/journals/cpb http://cpb.iphy.ac.cn 070505-1

Fig. 1. The schematic diagram of the ECR-PECVD plasma system. The plasma density in the ECR-PECVD plasma system is measured by a Langmuir probe. The plasma density is mainly determined by five factors: the radial position, the axial position, the gas pressure, the microwave power and the magnetic fields. So, we make our measurements at various radial positions (R = 0, 2, 4, 6, 8, 10, 12, 14 cm), axial positions (Z = 0, 10, 15, 20, 25, 30 cm), gas pressure (P 1 = 0.03, 0.05, 0.07, 0.09, 0.10, 0.12, 0.14, 0.16 Pa), microwave power (P 2 = 300, 400, 500, 600, 650, 800 W) and magnet coil currents (I m = 140, 145, 150, 155, 160, 165 A). Three hundreds data from above are randomly chosen as the training set. 3. The ANN model There are several kinds of training algorithm available in the ANN system. In this study, a multi layer perceptron (MLP) algorithm and a radial based function (RBF) algorithm are used to train the artificial neural network. The MLP networks are mainly composed of three layers of neurons: the input layer, the hidden layer and the output layer. The units of the system are connected by the weights. The input layer consists of all the input factors. Information from the input layer is then processed through the hidden layer, and the following information is computed in the output layer. The MLP training algorithm is an iterative gradient algorithm, designed to minimise the mean square error between the predicted output and the actual output. The key steps for establishment of the MLP model can be summarised briefly. Initialisation: prior to the first iteration of loop, initialise the values of the connection weights and the output threshold values of the neurons, which have random values in [ 1, 1]. Maintenance: offer the input data to the neurons of the input layer. Then calculate the input and the output of neurons in the hidden layer and in the output layer. After each loop, the error between the predicted output and the actual output are propagated backward to adjust the weight in a manner mathematically guaranteed to converge. Termination: the iteration continues until the overall error between calculated and target output is approaching to the pre-set error criteria. The RBF networks have a very strong mathematical foundation rooted in regularisation theory for solving ill-conditioned problems. As its name implies, radially symmetric basis function is used as activation functions of nodes in hidden layer. The transformation from the input layer to the output f(x) is computed by n f(x) = W i R i (x) + W 0, i=1 where W i (i = 1, 2,..., n) is the connection weight between the hidden layer and output layer, W 0 is the 070505-2

bias, R i (x) (i = 1, 2,..., n) are RBFs, Chin. Phys. B Vol. 19, No. 7 (2010) 070505 R i (x) = φ X C i, φ X C i is the activation function in the jargon of the ANN, here it is a Gaussian function, n R i = exp j=1 2 X i C i 2σij 2, where Ci T = [C i1, C i2, C i3,..., C in ] (i = 1, 2,..., n) is the centre of the receptive field, and σ ij (i, j = 1, 2,..., n) is the width of the Gaussian function, which indicates the selectivity of the neuron. The centre C can be randomly chosen by the simplest way from the train set. Another approach is to use the k- means technique of clustering input training set into groups and choose the centre of each group as the centre. Also, C can be treated as a network parameter along with W i and adjusted through error-correction training. After the centre C is determined, the connection weights between the hidden layer and output layer can be obtained through ordinary training. In our ANN models, there are five input neurons: the radial position, the axial position, the gas pressure, the microwave power and the magnetic coil current. The numbers of the neurons in the hidden layer is 12. The output layer is our target output neuron: the plasma density. The neural networks are realised in MATLAB7.0 on the platform of Microsoft Windows XP. The neural network requires that the range of the both input values and output values should be between 0 and 1, so all data in our model are unified. Fig. 2. The effects of the training cycles on the different network performance. From Fig. 2 we can know that both ANN models can attain our training goal, and the RBF model is the better efficient design. In this RBF model, when the neural network was trained up to 24 times, the goal of training accuracy was attained, while the MLP model needs 30 times to attain the goal of training accuracy. It is clearly noticed from this figure that further training cycles have no considerable effect on the network performance. The errors of the training set decrease dramatically at the beginning of the training cycles, then they gradually tend to approach the minimum value. The accuracy of prediction in both ANN models is shown in Fig. 3. The vertical axis indicates the error between six random experimental data from the training set and corresponding tested data from two different ANN models. The horizontal axis indicates the serial numbers of those experimental data. From this figure we can conclude that the RBF model has better ability of prediction than the MLP model. It is believed that the number of neurons in the hidden layer plays an important role. 4. Results and discussions The effects of training cycles on the network performance of two ANN models are shown in Fig. 2, in which the vertical axis indicates the difference between the training performance and the training goal. We set the training goal as 1 10 4 to simplify the model. Fig. 3. Comparisons of predicting errors between two models. 070505-3

In order to make the model more effective, we again choose some predicted data that are not same as the data in the training set and measured the predicted data again. We calculate the relative errors, the e MSE and the e MSRE of two models: N (V calc,i V meas,i ) 2 i=1 e MSE =, N [ N Vcalc,i V meas,i i=1 N e MSRE = N where e MSE is the mean squared error, e MSRE is the mean squared relative error, V calc is computed value, V meas is measured value, and N is the number of samples. Corresponding experimental data of test set in two ANN models are given in Table 1. ] 2, Table 1. Data used in test set and the predicting values of two ANN models. gas pressure microwave magnet coil axial position radial position /Pa power/w current/a /cm /cm plasma density/10 10 cm 3 experimental data predicted data 0.06 620 143 0 8 2.260 2.126 2.211 0.11 620 143 10 4 5.406 5.529 5.473 0.13 450 153 10 14 1.985 1.863 1.919 0.11 530 148 15 8 3.824 3.854 3.896 0.08 580 158 20 10 3.405 3.365 3.352 0.15 620 162 30 2 2.284 2.189 2.168 It is shown that the predicted and experimental values are very close to each other. From Table 2, we can see that the relative errors of MLP model are between 0.7% and 6.1%, while those of RBF model are between 1.2% and 5.1%, and the e MSE and the e MSRE of RBF model are less than MLP model. It can be obviously concluded that the predictions of two ANN models are in good agreement with the experimental data, and the RBF model is more accurate than the MLP model. The ability of ANN model to predict the plasma density accurately in an ECR-PECVD system can be concluded. Pankaj and Deo [21] believed that the RBF is better, which can be due to the fact that it involves fewer parameters, and does not use a large amount of user specified parameters and probably has no large requirements of sample size. We think the key is that RBF used the local-response RBF to replace the whole-response function. Because the characteristics of local-response, the RBF can perform better and faster in approach of function. In addition to this, the random centre of the RBF can avoid the choice of initial weight in the MLP. So, in our study, the RBF is more suitable than the MLP when the neuron numbers in hidden layer is the same. Table 2. The error between experimental data and predicted data. MLP relative error 5.9% 2.3% 6.1% 0.7% 1.1% 4.2% 2.2% 1.2% 3.3% 1.8% 1.5% 5.1% RBF e MSE 0.0996 0.0738 e MSRE 0.1660 0.1233 MLP RBF 5. Conclusions In present work, a relative new technique ANN associated with nonlinear system is used in modeling the prediction of the plasma density in an ECR-PECVD plasma system. The RBF and MLP ANN models are established. The effectiveness of two ANN models are demonstrated, the results show good agreements with corresponding experimental data. The ability of ANN model to predict accurately the plasma density in an ECR-PECVD system can be concluded, and the RBF is more suitable than the MLP in our work. These indicate that ANN is a useful tool in dealing with some complex nonlinear problems of the plasma density and can obtain a considerable saving in terms of cost and time. 070505-4

References [1] Toader E I 2004 Plasma Sources Sci. Technol. 13 646 [2] Chen J F and Ren Z X 1999 Vacuum 52 411 [3] Mayuko K and Hiroshi M 2006 Vacuum 80 771 [4] Jin X Y, Qiu X J and Zhu Z Y 2006 Acta Phys. Sin. 55 5338 (in Chinese) [5] Wang L, Cao J X, Wang Y, Niu T Y, Wang G and Zhu Y 2007 Acta Phys. Sin. 56 1429 (in Chinese) [6] Hiroshi M, Doan H T and Yoshinobu K 2005 Surf. Coat. Technol. 200 850 [7] Musil J 1996 Vacuum 47 145 [8] Wang Y S, Sun J, Wang C J and Fan H D 2008 Acta Phys. Sin. 57 6120 (in Chinese) [9] Wang H, Wang A K, Yang Q W, Ding X T, Dong J Q, Sanuki H and Itoh K 2007 Chin. Phys. 16 3738 [10] Du X L, Chen G C, Jiang D Y, Yao X Z and Zhu H S 1999 Acta Phys. Sin. 48 257 (in Chinese) [11] Dorteoust, Wu H M and Graves D B 1994 Plasma Sources Sci. Technol. 3 25 [12] Lungu C P and Iwasak K 2002 Vacuum 66 197 [13] Wu H M, Graves D B and Porteos R K 1995 Plasma Sources Sci. Technol. 4 22 [14] Sterjovski Z and Nolan D 2006 J. Mater. Proc. Technol. 170 536 [15] Xu L J and Xing J D 2007 Mater. Des. 28 1425 [16] Bezerra E M and Ancelotti A C 2007 Mater. Sci. Eng. A 464 177 [17] Scott D J and Coveney P V 2007 J. Eur. Cerma. Soc. 27 4425 [18] Ming D J, Chyuan D L and Wang J T 2005 Appl. Surf. Sci. 245 290 [19] Zhang G and Guessasma S 2006 Surf. Coat. Technol. 200 2610 [20] Hakan C and Hasan O 2006 Wear 261 064 [21] Pankaj S and Deo M C 2007 Applied Soft Computing 7 968 070505-5