Gas Detection System Based on Multi-Sensor Fusion with BP Neural Network

Similar documents
ARTIFICIAL NEURAL NETWORKS گروه مطالعاتي 17 بهار 92

Open Access Research on Data Processing Method of High Altitude Meteorological Parameters Based on Neural Network

A FUZZY NEURAL NETWORK MODEL FOR FORECASTING STOCK PRICE

Artificial Intelligence

CSE 352 (AI) LECTURE NOTES Professor Anita Wasilewska. NEURAL NETWORKS Learning

ANN TECHNIQUE FOR ELECTRONIC NOSE BASED ON SMART SENSORS ARRAY

ARTIFICIAL NEURAL NETWORK PART I HANIEH BORHANAZAD

Neural Networks and the Back-propagation Algorithm

Neural Networks DWML, /25

Artificial Neural Network Method of Rock Mass Blastability Classification

Artificial Neural Network

Artificial Neural Networks Examination, March 2004

Artificial Neural Networks Examination, June 2005

Neural Networks. Bishop PRML Ch. 5. Alireza Ghane. Feed-forward Networks Network Training Error Backpropagation Applications

Keywords- Source coding, Huffman encoding, Artificial neural network, Multilayer perceptron, Backpropagation algorithm

Artificial Neural Networks" and Nonparametric Methods" CMPSCI 383 Nov 17, 2011!

Neural networks. Chapter 19, Sections 1 5 1

An artificial neural networks (ANNs) model is a functional abstraction of the

EEE 241: Linear Systems

Unit III. A Survey of Neural Network Model

Classification goals: Make 1 guess about the label (Top-1 error) Make 5 guesses about the label (Top-5 error) No Bounding Box

4. Multilayer Perceptrons

18.6 Regression and Classification with Linear Models

Machine Learning. Neural Networks

Feed-forward Networks Network Training Error Backpropagation Applications. Neural Networks. Oliver Schulte - CMPT 726. Bishop PRML Ch.

A Logarithmic Neural Network Architecture for Unbounded Non-Linear Function Approximation

Multilayer Neural Networks

Neural networks. Chapter 20. Chapter 20 1

Artificial Neural Networks Examination, June 2004

Serious limitations of (single-layer) perceptrons: Cannot learn non-linearly separable tasks. Cannot approximate (learn) non-linear functions

Bearing fault diagnosis based on EMD-KPCA and ELM

Artificial Neural Network and Fuzzy Logic

Materials Science Forum Online: ISSN: , Vols , pp doi: /

Introduction Neural Networks - Architecture Network Training Small Example - ZIP Codes Summary. Neural Networks - I. Henrik I Christensen

Introduction To Artificial Neural Networks

Artificial Neural Networks

POWER SYSTEM DYNAMIC SECURITY ASSESSMENT CLASSICAL TO MODERN APPROACH

Computational statistics

PATTERN CLASSIFICATION

SPSS, University of Texas at Arlington. Topics in Machine Learning-EE 5359 Neural Networks

RESEARCH ON AEROCRAFT ATTITUDE TESTING TECHNOLOGY BASED ON THE BP ANN

Artificial Neural Networks. Edward Gatt

AN INTRODUCTION TO NEURAL NETWORKS. Scott Kuindersma November 12, 2009

CS:4420 Artificial Intelligence

Machine Learning. Neural Networks. (slides from Domingos, Pardo, others)

Revision: Neural Network

Pattern Recognition Prof. P. S. Sastry Department of Electronics and Communication Engineering Indian Institute of Science, Bangalore

Modeling and Compensation for Capacitive Pressure Sensor by RBF Neural Networks

Artificial Neural Networks

Introduction to Neural Networks

AI Programming CS F-20 Neural Networks

Multilayer Perceptron = FeedForward Neural Network

What Do Neural Networks Do? MLP Lecture 3 Multi-layer networks 1

Multilayer Perceptron

Artificial Neural Networks Francesco DI MAIO, Ph.D., Politecnico di Milano Department of Energy - Nuclear Division IEEE - Italian Reliability Chapter

Lecture 4: Perceptrons and Multilayer Perceptrons

A Novel Activity Detection Method

Neural Networks and Fuzzy Logic Rajendra Dept.of CSE ASCET

CSC 578 Neural Networks and Deep Learning

Effects of BP Algorithm-based Activation Functions on Neural Network Convergence

NONLINEAR CLASSIFICATION AND REGRESSION. J. Elder CSE 4404/5327 Introduction to Machine Learning and Pattern Recognition

ESTIMATION OF HOURLY MEAN AMBIENT TEMPERATURES WITH ARTIFICIAL NEURAL NETWORKS 1. INTRODUCTION

Based on the original slides of Hung-yi Lee

RBF Neural Network Combined with Knowledge Mining Based on Environment Simulation Applied for Photovoltaic Generation Forecasting

Multilayer Perceptrons (MLPs)

Application of Artificial Neural Networks in Evaluation and Identification of Electrical Loss in Transformers According to the Energy Consumption

RESEARCHES ON FUNCTION-LINK ARTIFICIAL NEURAL NETWORK BASED LOAD CELL COMPENSATION

The Perceptron. Volker Tresp Summer 2016

Sections 18.6 and 18.7 Analysis of Artificial Neural Networks

Artificial Neural Networks. Historical description

A thorough derivation of back-propagation for people who really want to understand it by: Mike Gashler, September 2010

Data Mining Part 5. Prediction

Lecture 5: Logistic Regression. Neural Networks

Unit 8: Introduction to neural networks. Perceptrons

Study of Fault Diagnosis Method Based on Data Fusion Technology

ARTIFICIAL INTELLIGENCE. Artificial Neural Networks

Design Collocation Neural Network to Solve Singular Perturbed Problems with Initial Conditions

Sections 18.6 and 18.7 Artificial Neural Networks

Machine Learning for Large-Scale Data Analysis and Decision Making A. Neural Networks Week #6

Multilayer Neural Networks

Reading Group on Deep Learning Session 1

Neural networks. Chapter 20, Section 5 1

22c145-Fall 01: Neural Networks. Neural Networks. Readings: Chapter 19 of Russell & Norvig. Cesare Tinelli 1

Deep Feedforward Networks

Using a Hopfield Network: A Nuts and Bolts Approach

Neural Networks biological neuron artificial neuron 1

Last update: October 26, Neural networks. CMSC 421: Section Dana Nau

2015 Todd Neller. A.I.M.A. text figures 1995 Prentice Hall. Used by permission. Neural Networks. Todd W. Neller

Sections 18.6 and 18.7 Artificial Neural Networks

Neural Networks. Chapter 18, Section 7. TB Artificial Intelligence. Slides from AIMA 1/ 21

Introduction to Machine Learning Spring 2018 Note Neural Networks

The Perceptron. Volker Tresp Summer 2014

CSC321 Lecture 5: Multilayer Perceptrons

Lecture 5 Neural models for NLP

Neural Networks and Deep Learning

Neural Networks. CSE 6363 Machine Learning Vassilis Athitsos Computer Science and Engineering Department University of Texas at Arlington

A Study of the Influence of L-Cut on Resistance of Chip Resistor Based on Finite Element Method

ECE521 Lecture 7/8. Logistic Regression

Machine Learning and Data Mining. Multi-layer Perceptrons & Neural Networks: Basics. Prof. Alexander Ihler

Using Neural Networks for Identification and Control of Systems

Transcription:

Sensors & Transducers 2013 by IFSA http://www.sensorsportal.com Gas Detection System Based on Multi-Sensor Fusion with BP Neural Network Qiu-Xia LIU Department of Physics, Heze University, Heze Shandong 274015, China Tel.: 15963026267 E-mail: liuqiuxia66@163.com Received: 23 September 2013 /Accepted: 25 October 2013 /Published: 31 October 2013 Abstract: Using the sensor technology as the core, combining with the A/D technology, pattern recognition technology and other technologies, we have designed a new type of gas detection system. Gases are detected through BP neural network algorithm. The design of the system, the establishment of BP neural network and the improvement of its algorithm are given. The fusion technology of multi-sensor is expounded. Firstly, gases are detected by the gas sensor array. Secondly, the detected signals are sent to the preconditioning circuits of signals for corresponding process, and then they are sent to the pattern recognition circuit, signals are analyzed and recognized in the pattern recognition circuit. Finally, signals are input into the controlling circuits, relevant information are displayed or broadcast. Repeated experiments show that the gas detection system based on multi-sensor fusion with BP neural network can detect gases well, so it has higher application value. Copyright 2013 IFSA. Keywords: BP neural network, Gas sensor, Information fusion, Detection, MSP430. 1. Introduction With the rapid development of science and technology, in the industrial and agricultural production and social life, people pay more and more attention to various gases on the pollution of the environment, especially with the development and utilization of coal gas, liquefied petroleum gas and natural gas, dangerousness of various gases are increasing. So the detection and monitoring is needed for a variety of flammable, explosive and toxic gases. Electrochemical and optical methods are mainly used to detect gases early, the detection speed is slower, its equipment is more complex, its cost is higher and it is inconvenient for use [1]. What s more, the fusion system based on multi-sensor has the performances of strong fault toleration and high reliability [2]. Therefore, we have studied and designed the gas detection system based on multi-sensor fusion with BP neural network. 2. Fusion Technology of Multi-Sensor 2.1. The Principle of Multi-Sensor Information Fusion Information fusion of multi-sensor is a basic function of human beings and other biological systems, humans has instinctive functions of synthesizing the detected information from body organs with future knowledge, in order to estimate the surrounding environment and the taking place Article number P_1382 31

events, the information fusion structure of the brain network system is shown in Fig. 1. Because human senses with different metric features, various physical phenomena which can detect different spatial scale, the process is complex, and adaptive [3]. levels of the system. According to the abstraction levels of information processed by sensors, currently the signal fusion is divided into three grades, namely the signal level fusion, feature level fusion and decision level fusion [4]. The hierarchy of multisensor information fusion is shown in Fig. 2. Fig. 1. Information fusion structure of the brain network system. Multi-sensor information fusion makes full use of multi-sensor resources, through reasonable control, the use of various sensors and observation information, on the base of optimization criterions, the complementary and redundant information of each sensor are associated. The goal of information fusion is to gain more effective information on the base of separation information from each sensor and through optimizing combination. Its ultimate aim is to improve the effectiveness of the whole sensor system by using the advantage of multiple sensors common or joint operation. Multi-sensor information fusion system through the effective use of multiple sensors, in order to maximize the amount of information of target and environment to be detected. There is essential difference between the method of multi-sensor information fusion and the method of classical information process, the key lies in multi sensor information fusion process has more complex forms, but they are usually on different level fusion algorithm, seeking reasonable and efficient fusion algorithm is crucial to the performance of the fusion system. The unreasonable fusion algorithm may cause rich data and lack information. Therefore, multi-sensor signal processing method is the key to study the theory of information fusion. 2.2. The Hierarchy of Multi-Sensor Information Fusion Multi-sensor information fusion is essentially a method of sensor information processing. Information fusion can be performed at different Fig. 2. Hierarchy of multi-sensor information fusion. Data layer fusion associates the raw data of each sensor directly, and then they are sent to the fusion center, a comprehensive evaluation of the measured object is completed [5]. The character layer fusion refers that can gain full represent the amount or the sufficient statistics as its feature information by using the original data obtained from the sensors. And then they are classified, clustered and synthesized. Finally, the data are correlated and normalized, they are sent to the fusion center for analysis and fusion, a comprehensive evaluation of the measured object is completed [6]. Decision layer fusion is a kind of high-level fusion, its essence is to make the best of every decision according to the standards and the credibility of the decision [7]. In general, the information fusion is closer to the information source, the precision is high. Therefore, with the improvement of fusion layer, although system fault tolerance will increase, but preservation details of fusion information will be reduced, and the accuracy will be reduced. Therefore, the fusion process should consider the possibility of realization, local processing ability of sensor subsystem, communication ability and other aspects. 3. Establishment of BP Neural Network 3.1. Structure Model of BP Artificial Neural Network BP is a network of three or more than three layers of neural network, it includes the input layer, the middle layer (the hidden layer) and the output layer, its structure is shown in Fig. 3. The connection between the upper and lower, and there is no 32

connection between each layer of neurons. In general, the input sample set of BP network is set {(x, y)}, x is the input vector, y is an ideal output vector corresponding to x. The object of BP network is to realize the nonlinear mapping from the input vector to the output vector. The dimension of input vector and output vector is directly determined by the problem, they respectively decide numbers of input neurons and output neurons. The layer of hidden network and the number of each hidden layer neurons directly relate to requirements of specific application and numbers of input and output units, more or less is likely to affect the performance of network [8]. network training faster than the speed of the standard gradient dozens or even hundreds of times. We divide the improved algorithms into two categories, the first method is the improved algorithm based on standard gradient descent, it is the development of the standard gradient descent method. Among them, the BP algorithm with additional momentum, variable learning rate BP algorithm and resilient BP algorithm are selected. The second method is the improved algorithm based on standard numerical optimization, the conjugate gradient method, quasi Newton method and Levenberg-Marquardt method are selected. The standard BP algorithm is based on gradient descent method, it modifies the network weights and bias of the gradient by computing the objective function, it is the development of the standard gradient descent method, and it only uses the firstorder derivatives of the objective function of the weights and bias [9]. The iteration process of standard gradient descent right value and bias correction can be expressed as: 1) ( K) ( K) X = X η f( X ) (2) Fig. 3. Structure of feedforward neural network with a hidden layer. The BP network is a back propagation network, its activation function must be differentiable everywhere, so it cannot use the two value threshold function of type {0, l} or sign function {-1, l}, S type of activation function is often used by back propagation network. The activation function is commonly expressed by S curve of common logarithm or hyperbolic tangent, the relationship of logarithm S type of activation function is: 1 f = 1 + exp[ ( n + b)] 3.2. The Improved Algorithm of BP Artificial Neural Network (1) Though the back propagation method is applied widely, but due to the use of the gradient descent method, it has its own limitations and shortcomings, the main performance is too long training time and is easy to fall into local minima. In fact, the traditional BP algorithm based on gradient descent method in solving practical problems often affects the solution quality because of the slow convergence speed. Therefore, since the middle of 80's, many researchers do the thorough research, on the basis of the standard BP algorithm, people do a lot of useful improvements, and the main goal is to speed up the process of training, avoid falling into local minimum and improve other skills. In addition, a lot of training algorithms based on nonlinear optimization are provided, they can make the convergence speed of Among them, X is the vector composed by weights and bias of the network, η is the learning rate, f( X ) is the objective function, f( X ) expresses the gradient of the objective function. When the additional momentum method modifies its weights, it not only considers the role of error in the gradient, but also considers the impact of changing trends in the error surface. In the absence of additional momentum, the network may get into local minimum value, additional momentum effect is likely over these minima. The method is based on the back propagation method, it adds a proportional to the previous weight changes of values to the change of each weight, and generates new changes of weights based on the back propagation method. The iterative process of BP algorithm with additional momentum weight corrections can be expressed as: 1) ( K) ( K) X = mc X (1 mc) f( X ) (3) Among them, K is the number of training, mc is momentum factor, it is about 0.95. Adaptive learning rate is automatically adjusting learning rate in the training process. The criterion of adjusting the learning rate is: checking whether the correct value of weigh really reduces the error function, if that is the case, then the learning rate is small, it can be increased a weight, if this is not the case, the over modulation happens, then it should reduce the value of the learning rate. The adjustment formula of an adaptive learning rate is shown in the following: 1.05 η ( K) SSE( K) < SSE( K 1) η ( K + 1) = 0.7 η ( K) SSE( K) > 1.04 SSE( K 1) η( K) other 33

SSE is the output sum of the network. Practice has proved that using adaptive learning rate of network training times is just one of dozens of fixed learning rate of network training times. So adaptive learning rate of network training is a very effective method. Resilient BP algorithm only takes the partial derivative of the symbol, not considers the amplitude of partial derivative. Partial derivative symbols decide the update direction s of weights, while the size of the weigh changes is decided by an independent update value. In two successive iterations, if the symbol of partial derivative of the objective functions on a weight is invariant, the corresponding update value will be increased, if the symbol is variable, the corresponding update value will be decreased. The iterative process of the weight correction can be expressed as follows: 4. Design and Test of the System 4.1. Structure Diagram of the System The gas detection system is mainly consisted of gas sensor array, preconditioning circuits of signals, the pattern recognition circuit and controlling circuits [10], its structure diagram is shown in Fig. 4. 1) ( K) ( K) ( K) X = X X s ign( f ( X )) (4) X is the previous update value, the initial value of X should be preset according to the actual application. In resilient BP algorithm, when training is in oscillation, variable weight will be reduced; when the weights in a few iterations are in a direction change, change of weight will be increased. Therefore, in general, the convergence speed of resilient BP algorithm is better than the other methods, it is much faster, but the resilient BP algorithm is not complex, it also does not need to consume more memory. It is different from the gradient descent method, the optimization algorithm based on numerical values not only uses the first-order derivative of objective function, but also uses the second-order derivative of objective function. This kind of algorithm includes the conjugate gradient method, quasi Newton method and Levenberg-Marquardt method, they can be described as: (K + 1 ) f(x )=min f(x + η S(X )) (K + 1 ) X = X +η S(X ), (5) where X is the vector consisted of all weights and bias of the network, S( X ) is the search direction of vector space composed by each component of X, (K + 1 ) η is the dinky step size of f(x ) in the direction of S( X ). In this way, the optimization of the network weight is divided into two steps: firstly, the optimal search direction in the current iteration is determined, and then the optimal iteration step in this direction is found. The selection of optimal search step η is a dimensional search problem. There are many available methods for this problem. The difference among the conjugate gradient method, quasi Newton method and Levenberg-Marquardt method is different in the best choice of search direction [9]. Fig. 4. Structure diagram of the system. 4.1.1. Gas Sensor Array According to the characteristics of the gas sensors, they are often affected by other gases, it is so-called cross sensitivity [10]. This phenomenon is detrimental for the selectivity of the sensor and the measurement accuracy, according to the traditional method, is to eliminate this effect by looking for new sensitive material device structure and compensation circuit. However, this method not only makes the device structure complicated, and the manufacturing cost of the device is increased. So we put forward a new solution, we make multiple sensor compose the gas sensor arrays, making full use of the cross sensitivity of the sensor, combining with the pattern recognition technology to realize the cross sensitivity of sensor and improve the measuring precision of the sensor. 4.1.2. Preconditioning Circuits of Signals In general, the preconditioning circuits of signals include adjusting circuit of signals and A/D. Adjusting circuit of signals in essence is a kind of interface circuit, it changes the response of the sensor to gas into easily measured electric signal. When we design reconditioning circuits, we should consider the output signal's precision and dynamic range, at the same time, we should consider the input range of ADC, quantization precision and further requirements for subsequent process. A/D changes the analog signal into digital signal required for pattern recognition system. The relationship between the preconditioning circuits of signals and A/D is mutual influence and mutual restriction, only good cooperation can give full play to the role of signal precondition [10]. 34

4.1.3. Technology of Pattern Recognition Technology of pattern recognition is one kind of recognition ability to use computers to simulate human [11]. Signals of the sensor array are analyzed properly by the technology of pattern recognition, the response characteristics of sensors are extracted, information and concentration of mixed gas are obtained. The recognition process is divided into two stages. First of all, the outputs of the sensor array are trained by the pattern recognition method to build modules, it links known smell output and knowledge base by using mathematical rules. And then, on the basis of the received training database, it tests the response of the unknown odor, signals of sensor array are converted into types and concentrations of measured objects. The principal component analysis method of pattern recognition is mainly principal component analysis, partial least squares regression, discriminate analysis, cluster analysis, fuzzy inference and artificial neural network. 4.1.4. Controlling Circuit In the gas detection system, we select single chip microcomputer MSP430 as the controlling core, MSP430 has low power consumption, strong function, an internal A/D and USART module [12], it can not only separate analog signal process, it can also communicate with the computer. 4.2. Test of the System In the gas detection system, we select MQ-2, MQ-4, MQ-5, MQ-6, MQ-7 and MQ-8 as sensitive elements of gases, using SCM MSP430 as the controlling core, it can also complete A/D, the number of hidden units of BP network is six, using algebraic algorithm method for data pre-process, different concentrations of CO, CH 4 and H 2 are tested by the system. Part of the measurement results are shown in Table 1. Table 1. Results of quantitative test results. Concentration of the sample Concentration of the forecast gas (ppm) gas (ppm) CO CH 4 H 2 400 396 393 395 800 801 799 792 1200 1198 1202 1201 1600 1603 1604 1598 2000 1992 1994 2006 5. Conclusion We have studied the gas detection system comprehensively and made full use of cross sensitive characteristics of gas sensors. Quantitative responses of unknown gases are detected by gas sensor array, we use outputs of gas sensors as inputs of the BP network, and we use concentrations of gases as outputs of the BP network. In this way, quantitative detections of unknown gases are completed. Testing results show the gas detection system based on multisensor fusion with BP neural network can realize information fusion of multiple gas sensors well, and the errors of output concentrations are smaller. References [1]. B. M. Hooper, M. A. Hooper, Indoor air pollution: A short review, Clean Air, Vol. 5, Issue 2, 2009, pp. 47. [2]. Luozhigang Guoge, The research and evolvement of the method of the multi-sensor data fusion, Mechatronics, Vol. 5, Issue 1, 2003, pp. 12-17. [3]. T. C. Pearce, J. W. Gardner, W. Gopel, The next generation of electronic nose, Sensors, Vol. 15, Issue 3, 2008, pp. 61-73. [4]. D. E. Goldberg, Probability matching, the magnitude of reinforcement, and classifier system bidding, Machine Learning, Vol. 25, Issue 4, 2010, pp. 407. [5]. H. M. Paula, M. W. Roberts, R. E. Battle, Reliability performance of fault-tolerant digital control system, IEEE Transactions on Reliability, Vol. 54, Issue 3, 2005, pp. 413-418. [6]. Tomoyuki Sasaki, Hidehiro Nakano, Akihide Utani, Arata Miyafichi and Hisao Yamamoto, An adaptive selection scfieme of forwarding nodes in wireless sensor networks using a chaotic neural network, ICIC Express Letters, Vol. 4, Issue 5, 2010, pp. 1649-1656. [7]. Zhangshuqing, Denhong, Wangyanlin, The application and improvement of the D-S evidence theories at the data fusion, Sensor Technique Transaction, Vol. 14, Issue 7, 2003, pp. 78-81. [8]. Dong Changhong, Artificial neural network and application with Matlab, Beijing, National Defence Industry Press, 2005. [9]. Yang Jiangang, Practical application of artificial neural network, Hangzhou, Zhejiang University Press, 2001. [10]. Guoliang Qu, John JR. Feddes et al., Measuring odor concentration with an electronic nose, Transactions of the American Society of Agricultural Engineers, Vol. 47, Issue 6, 2011, pp. 1807-1812. [11]. Di Natale Corrado, Fabrizio A. M. Davide, Arnaldo D'Amico, A self-organizing system for pattern classification: time arraying statistics and sensor drift effects, Sensors and Actuators B, Vol. 10, Issue 11, 1995, pp. 237-241. [12]. Li Xin Jia, Yong Wang, Design and practice of electronic system, Beijing: Higher Education Press, 2006. 2013 Copyright, International Frequency Sensor Association (IFSA). All rights reserved. (http://www.sensorsportal.com) 35