Covariance Matrix Estimation for Reinforcement Learning
|
|
- Theresa Briggs
- 6 years ago
- Views:
Transcription
1 Covariance Matrix Estimation for Reinforcement Learning Tomer Lancewicki Deartment of Electrical Engineering and Comuter Science University of Tennessee Knoxville, TN Itamar Arel Deartment of Electrical Engineering and Comuter Science University of Tennessee Knoxville, TN Abstract One of the goals in scaling reinforcement learning RL) ertains to dealing with high-dimensional and continuous stateaction saces. In order to tackle this roblem, recent efforts have focused on harnessing well-develoed methodologies from statistical learning, estimation theory and emirical inference. A key related challenge is tuning the many arameters and efficiently addressing numerical roblems, such that ultimately efficient RL algorithms could be scaled to real-world roblem settings. Methods such as Covariance Matrix Adatation - Evolutionary Strategy CMAES), Policy Imrovement with Path Integral PI ) and their variations heavily deends on the covariance matrix of the noisy data observed by the agent. It is well known that covariance matrix estimation is roblematic when the number of samles is relatively small comared to the number of variables. One way to tackle this roblem is through the use of shrinkage estimators that offer a comromise between the samle covariance matrix and a well-conditioned matrix also known as the target) with the aim of minimizing the mean-squared error MSE). Recently, it has been shown that a Multi-Target Shrinkage Estimator MTSE) can greatly imrove the single-target variation by utilizing several targets simultaneously. Unlike the comutationally comlex cross-validation CV) rocedure, the shrinkage estimators rovide an analytical framework which is an attractive alternative to the CV comuting rocedure. We consider the alication of shrinkage estimators in dealing with a function aroximation roblem, using the quadratic discriminant analysis QDA) technique and show that a two-target shrinkage estimator generates imroved erformance. The aroach aves the way for imroved value function estimation in large-scale RL settings, offering higher efficiency and fewer hyer-arameters. Keywords: covariance matrix estimation, ath integral, classification uncertainty The authors are with the Machine Intelligence Lab at the University of Tennessee - htt://mil.engr.utk.edu
2 1 Introduction Reinforcement learning RL) alied to real-world roblems inherently involves combining otimal control theory and dynamic rogramming methods with learning techniques from statistical estimation theory [1,, 3, 4]. The motivation is achieving efficient value function aroximation for the non-stationary iterative learning rocess involved, articularly when the number of state variables exceeds 10 [5]. Recent efforts in scaling RL address continuous state and/or action saces by otimizing arametrized olicies. For examle, the Policy Imrovement with Path Integral PI ) [5] combines a derivation from first rinciles of stochastic otimal control with tools from statistical estimation theory. It has been shown in [6] that PI is a member of a wider family of methods which share robabilistic modeling concets such as Covariance Matrix Adatation - Evolutionary Strategy CMAES) [7] and the Cross-Entroy Methods CEM) [8]. The Path Integral Policy Imrovement with Covariance Matrix Adatation PI -CMA) [6] takes advantage on the PI method by determining the magnitude of the exloration noise automatically [6]. The PI -SEQ [9] scheme alies PI to sequences of motion rimitives. One alication of the PI -SEQ is concerned with object grasing under uncertainty [9, Sec. 5] while alying the exerimental aradigm of [10]. The latter aroach has illustrated that over time, humans adat their reaching motion and gras to the shae of the object osition distribution, determined by the orientation of the main axis of its covariance matrix. Moreover, it has been shown that the PI otimal control olicy can be aroximated through linear regression [11]. This connection allows the use of well-develoed linear regression algorithms for learning the otimal olicy. The aforementioned methods rely on accurate covariance matrix estimation of the multivariate data involved. Unfortunately, when the number of observations n is comarable to the number of state variables the covariance estimation roblem become more challenging. In such scenarios, the samle covariance matrix is not well-conditioned and is not necessarily invertible desite the fact that those two roerties are required for most alications). When n, the inversion cannot be comuted at all [5, Sec..]. The same covariance roblem arises in other related alications of RL. For examle, in RL with Gaussian rocesses, the covariance matrix is regularized [1, Sec. ]. However, although the regularization arameter lays a ivotal role, it is not clear how it should be set [1, Sec. 3]. Other related work [13] study the ability to mitigate otentially overconfident classifications by assessing how qualified the system is to make a judgment on the current test datum. It is well known that for a small ratio of training observations n to observation dimensionality, conventional Quadratic Discriminant Analysis QDA) classifier erform oorly, due to a highly variable class conditional samle covariance matrices. In order to imrove the classifiers erformance, regularization is recommended, with the aim of roviding an aroriate comromise between the bias and variance of the solution. While other regularization methods [14] define regularization coefficients by the comutationally comlicated cross-validation CV) rocedure, the shrinkage estimators studied in this aer rovide an analytical solution, which is an attractive alternative to the CV rocedure. This aer elaborates on the Multi-Target Shrinkage Estimator MTSE) [15] that addresses the roblem of covariance matrix estimation when the number of samles is relatively small comared to the number of variables. MTSE offers a comromise between the samle covariance matrix and well-conditioned matrices also known as targets) with the aim of minimizing the mean-squared error MSE). Section resents the MTSE and examine the squared biases of two diagonal targets. In Section 3, we conduct a careful exerimental study and examine the two-target and one-target shrinkage estimator, as well as the Lediot-Wolf LW) [16] method for different covariance matrices. We demonstrate an alication for the quadratic discriminant analysis QDA) classifier, showing that the test classification accuracy rate TCAR) is higher when using the two-target, rather than one-target, shrinkage regularization. The QDA classifier is a fundamental comonent in DeSTIN [17] which is a dee learning system for satiotemoral feature extraction. The DeSTIN architecture currently assumes diagonal covariance matrices, which is one of the targets examined in this aer. In our future research we intend to utilize the results shown in this aer in order to imrove the DeSTIN architecture. Multi-Target Shrinkage Estimation Let {x i } n be a samle of indeendent identical distributed i.i.d.) -dimensional vectors drawn from a density having zero mean and covariance Σ = {σ ij }. The most common estimator of Σ is the samle covariance matrix S = {s ij }, defined as S = 1 n x i x T i 1) n and is unbiased, i.e., E {S} = Σ. The MTSE model [15] defined as ) t t ˆΣ γ) = 1 γ i S + γ i T i, ) 1
3 where t is the number of the targets T i, i = 1,..., t and γ = [γ 1,..., γ t ] T is the vector of shrinkage coefficients. Our objective is therefore to find ˆΣ γ) ), which minimizes the MSE loss function { } L γ) = E ˆΣ γ) Σ. 3) The otimal shrinkage coefficient vector γ that minimize L γ) 3) can be found by using a strictly convex quadratic rogram [15]. In this aer, we use the two diagonal targets T 1 = F Tr S) I, T = diags). 4) Following the develoments in [16, Sec..], the covariance matrix Σ can be written as Σ = VΛV T, where V and Λ are the eigenvector and eigenvalue matrices of Σ, resectively. The eigenvalues of Σ are denoted as ζ i, i = 1,..., in increasing order, i.e., ζ 1 ζ... ζ, and it is well known that ζ i = Tr Σ). As a result, the squared bias of T 1 with resect to Σ can be written as E {T 1 } Σ F = 1 Tr Σ) I VΛVT F = ζi ζ ), ζ = Tr Σ) = 1 ζ i 5) where ζ is the mean of the eigenvalues ζ i, i = 1,...,. The above result shows that E {T 1 } Σ F is equal to the disersion of the eigenvalues around their mean. Therefore, T 1 becomes less suitable in describing Σ when the disersion of the eigenvalues 5) increases. On the other hand, the exression of the squared bias of T with resect to Σ can be written as E {T } Σ F = diag Σ) Σ F = σ ij, 6) i j which shows that it is equal to the off-diagonal entries in Σ. Therefore, T becomes less suitable for describing Σ when the variables of Σ are more highly correlated. 3 Exeriments In this section, we resent an extensive exerimental study of one-target and two-target shrinkage estimators. The estimators are affected by the squared bias and the variance of a target, when the latter deends on the number of data observations n. Therefore, we examine cases of different true covariance matrices Σ that result in different biases of T 1 and T. We then examine the estimator s erformance as a function of n. In order to study the effect of the squared biases, we create a covariance matrix Σ with determinant of one, i.e., Σ = 1, according to two arameters. The first arameter is the condition number η, which is the ratio of the largest eigenvalue ζ max to the smallest eigenvalue ζ min of Σ, i.e., η = ζmax ζ min. In the exeriments, the eigenvalues of Σ denoted as ζ i, i = 1,,..., are generated according to ) i 1) ζ i = ζ min η 1) 1) + 1, i = 1,...,. 7) Then, the eigenvalue matrix Σ is defined as having elements ζ i, i = 1,,..., in the matrix form Λ η) = diag ζ 1, ζ,..., ζ ). 8) The second arameter K, controls the rotation of Λ η). Our aroach is to select a set of orthonormal transformations, as in [18, Sec..B] E K) = K k=1 k E k = E 1 E... E K, where each matrix E k is defined as E k = E kl =E k1 E k... E K k). 9) The matrix E kl is an orthonormal rotation of 45 0 in a two-coordinate lane for the coordinates k and + 1 l), i.e., where Φ i k, j k ) is defined as [Φ] ij = l=1 E kl = I + Φ k, + 1 l), 10) if i = j = i k or i = j = j k if i = i k and j = j k if i = j k and j = i k otherwise. 11)
4 The arameter K is an integer value with the range 0 K 1, where K = 0 indicates there is no rotation, and K = 1 indicates full rotation, such that all the coordinates rotate with resect to each other at an angle of Then, by using Λ η) 8) and E 9), the covariance matrix is created by Σ η, K) = E K) Λ η) E T K). 1) By emloying the covariance matrix 1), the biases of T 1 and T can be controlled indeendently for η > 1. The squared bias E {T 1 } Σ F is affected only by η, and increases as η does, when E {T 1} Σ F = 0 for η = 1. The E {T } Σ F is affected only by K, and increases as K does, when E {T } Σ F = 0 for K = 0. It should be noted that if η = 1 then K has no imact while if η is near 1, then K could has minor imact. The shrinkage estimators used in the study are of the one-target variety with T 1 and T. In the figures that aear in this section, these estimators are denoted as T1 and T, resectively. The LW estimator [16] is of the one-target shrinkage variety with T 1, which uses a biased shrinkage coefficient estimator and is denoted as LW. Finally, the two-target shrinkage estimator aears in the figures as TT. We show that the two-target estimator can imrove classification results comared with one-target estimators, when using the quadratic discriminant analysis QDA) method. The urose of the QDA is to assign observations to one of several g = 1,..., G grous with -variate normal distributions 1 ) f g x) = π) Σ g ex 0.5 x m g ) T Σ 1 g x m g ), 13) where m g and Σ g are the oulation mean vector and covariance matrix of the grou g. An observation x is assigned to a class ĝ according to dĝ x) = min d g x), 14) 1 g G with d g x) = x m g ) T Σ 1 g x m g ) + ln Σ g ln π g, 15) where π g is the unconditional rior robability of observing a member from the grou g. In our exeriments, we classify two grous G = ), with observations generated from a normal distribution with zero mean and π 1 = π. The covariance matrix of the first grou is the identity matrix Σ 1 = I, while that of the second grou is the covariance matrix Σ η, K) = Σ η, K) 1), which is generated on the basis of the revious exeriments. The goal is to study the effectiveness of the shrinkage estimators when using QDA, by assigning observations to one of these two grous, based on the classification rule 14). We run our exeriments for n =, 3,..., 30. For each n, twenty sets of data of size n are roduced. a) b) Figure 1: QDA for a) Σ η, 0) = Λ η) with η = 10 and b) an unrestricted Σ 10, K) with K = 5 We summarize for each exeriment the average test classification accuracy rate TCAR) with standard deviations the bars in the figure) over the twenty relications for each n. For each grou, 10 5 test observations were generated in order to exam the efficiency of the classifier. We rovide the best TCAR, calculated by using 14), when the covariance matrices are known, denoted in the figures as Bayes. We also comare the results for a regularization [19, sec. 6], where the zero eigenvalues were relaced with a small number just large enough to ermit numerically stable inversion. This has the effect of roducing a classification rule based on Euclidean distance in the zero-variance subsace. We denote this rocedure as the zero-variance regularization ZVR). In all exeriments, the TCAR of the two-target estimator is higher than the one-target variety. The LW estimator is inferior to its unbiased version when dealing with a small number of observations, and converges to its unbiased version as the number of observations increases. Fig. 1a) resents the result 3
5 when the covariance matrix is a diagonal matrix, i.e., Σ η, 0) = Λ η), with η = 10, and therefore T is unbiased while T 1 is biased. The target T 1 rovides a higher TCAR than T for small numbers of observations, and then T rovides a better TCAR. In Fig. 1b), the covariance matrix is unrestricted, i.e., Σ 10, K), with K = 5. The targets T 1 and T are biased. The squared bias of T 1 is not affected by K; whereas the higher the value of K, the higher the squared bias of T, and therefore T loses its advantage over T 1. In conclusion, it has been shown that the Multi-Target Shrinkage Estimator MTSE) [15] can greatly imrove the singletarget variation in the sense of mean-squared error MSE) by utilizing several targets simultaneously. We consider the alication of shrinkage estimator in the context of a function aroximation roblem, using the quadratic discriminant analysis QDA) technique and show that a two-target shrinkage estimator generates imroved erformance. This is done by a careful exerimental study which examines the squared biases of the two diagonal targets. Unlike the comutationally comlex cross-validation CV) rocedure; the shrinkage estimators rovide an analytical solution which is an attractive alternative to the CV comuting rocedure, commonly used in the QDA. The aroach aves the way for imroved value function estimation in large-scale RL settings, offering higher efficiency and fewer hyer-arameters. References [1] P. Dayan and G. E. Hinton, Using exectation-maximization for reinforcement learning, Neural Comutation, vol. 9, no., , [] M. Ghavamzadeh and Y. Engel, Bayesian actor-critic algorithms, in Proceedings of the 4th international conference on Machine learning. ACM, 007, [3] M. Toussaint and A. Storkey, Probabilistic inference for solving discrete and continuous state markov decision rocesses, in Proceedings of the 3rd international conference on Machine learning. ACM, 006, [4] N. Vlassis, M. Toussaint, G. Kontes, and S. Pieridis, Learning model-free robot control by a monte carlo em algorithm, Autonomous Robots, vol. 7, no., , 009. [5] E. Theodorou, J. Buchli, and S. Schaal, A generalized ath integral control aroach to reinforcement learning, J. Mach. Learn. Res., vol. 11, , Dec [6] F. Stul and O. Sigaud, Path integral olicy imrovement with covariance matrix adatation, in Proceedings of the 9th International Conference on Machine Learning ICML), 01. [7] N. Hansen and A. Ostermeier, Comletely derandomized self-adatation in evolution strategies, Evolutionary Comutation, vol. 9, no., , June 001. [8] S. Mannor, R. Y. Rubinstein, and Y. Gat, The cross entroy method for fast olicy search, in ICML, 003, [9] F. Stul, E. Theodorou, and S. Schaal, Reinforcement learning with sequences of motion rimitives for robust maniulation, IEEE Transactions on Robotics, vol. 8, no. 6, , Dec 01. [10] V. N. Christooulos and P. R. Schrater, Grasing objects with environmentally induced osition uncertainty, PLoS comutational biology, vol. 5, no. 10, 009. [11] F. Farshidian and J. Buchli, Path integral stochastic otimal control for reinforcement learning, in The 1st Multidiscilinary Conference on Reinforcement Learning and Decision Making RLDM013), 013. [1] G. Chowdhary, M. Liu, R. Grande, T. Walsh, J. How, and L. Carin, Off-olicy reinforcement learning with gaussian rocesses, IEEE/CAA Journal of Automatica Sinica, vol. 1, no. 3,. 7 38, 014. [13] H. Grimmett, R. Paul, R. Triebel, and I. Posner, Knowing when we don t know: Introsective classification for mission-critical decision making, in 013 IEEE International Conference on Robotics and Automation ICRA), May 013, [14] P. J. Bickel and E. Levina, Regularized estimation of large covariance matrices, The Annals of Statistics, vol. 36, no. 1, , 008. [15] T. Lancewicki and M. Aladjem, Multi-target shrinkage estimation for covariance matrices, IEEE Transactions on Signal Processing, vol. 6, no. 4, , Dec 014. [16] O. Ledoit and M. Wolf, A well-conditioned estimator for large-dimensional covariance matrices, Journal of Multivariate Analysis, vol. 88, no., , 004. [17] S. Young, J. Lu, J. Holleman, and I. Arel, On the imact of aroximate comutation in an analog destin architecture, IEEE Transactions on Neural Networks and Learning Systems, vol. 5, no. 5, , May 014. [18] G. Cao, L. Bachega, and C. Bouman, The sarse matrix transform for covariance estimation and analysis of high dimensional signals, IEEE Transactions on Image Processing, vol. 0, no. 3, , 011. [19] J. H. Friedman, Regularized discriminant analysis, Journal of the American Statistical Association, vol. 84, no. 405, ,
Combining Logistic Regression with Kriging for Mapping the Risk of Occurrence of Unexploded Ordnance (UXO)
Combining Logistic Regression with Kriging for Maing the Risk of Occurrence of Unexloded Ordnance (UXO) H. Saito (), P. Goovaerts (), S. A. McKenna (2) Environmental and Water Resources Engineering, Deartment
More informationEstimation of the large covariance matrix with two-step monotone missing data
Estimation of the large covariance matrix with two-ste monotone missing data Masashi Hyodo, Nobumichi Shutoh 2, Takashi Seo, and Tatjana Pavlenko 3 Deartment of Mathematical Information Science, Tokyo
More information4. Score normalization technical details We now discuss the technical details of the score normalization method.
SMT SCORING SYSTEM This document describes the scoring system for the Stanford Math Tournament We begin by giving an overview of the changes to scoring and a non-technical descrition of the scoring rules
More informationRadial Basis Function Networks: Algorithms
Radial Basis Function Networks: Algorithms Introduction to Neural Networks : Lecture 13 John A. Bullinaria, 2004 1. The RBF Maing 2. The RBF Network Architecture 3. Comutational Power of RBF Networks 4.
More informationarxiv: v1 [physics.data-an] 26 Oct 2012
Constraints on Yield Parameters in Extended Maximum Likelihood Fits Till Moritz Karbach a, Maximilian Schlu b a TU Dortmund, Germany, moritz.karbach@cern.ch b TU Dortmund, Germany, maximilian.schlu@cern.ch
More informationA Comparison between Biased and Unbiased Estimators in Ordinary Least Squares Regression
Journal of Modern Alied Statistical Methods Volume Issue Article 7 --03 A Comarison between Biased and Unbiased Estimators in Ordinary Least Squares Regression Ghadban Khalaf King Khalid University, Saudi
More informationarxiv: v2 [stat.me] 3 Nov 2014
onarametric Stein-tye Shrinkage Covariance Matrix Estimators in High-Dimensional Settings Anestis Touloumis Cancer Research UK Cambridge Institute University of Cambridge Cambridge CB2 0RE, U.K. Anestis.Touloumis@cruk.cam.ac.uk
More informationInformation collection on a graph
Information collection on a grah Ilya O. Ryzhov Warren Powell February 10, 2010 Abstract We derive a knowledge gradient olicy for an otimal learning roblem on a grah, in which we use sequential measurements
More informationResearch Note REGRESSION ANALYSIS IN MARKOV CHAIN * A. Y. ALAMUTI AND M. R. MESHKANI **
Iranian Journal of Science & Technology, Transaction A, Vol 3, No A3 Printed in The Islamic Reublic of Iran, 26 Shiraz University Research Note REGRESSION ANALYSIS IN MARKOV HAIN * A Y ALAMUTI AND M R
More informationGeneral Linear Model Introduction, Classes of Linear models and Estimation
Stat 740 General Linear Model Introduction, Classes of Linear models and Estimation An aim of scientific enquiry: To describe or to discover relationshis among events (variables) in the controlled (laboratory)
More informationInformation collection on a graph
Information collection on a grah Ilya O. Ryzhov Warren Powell October 25, 2009 Abstract We derive a knowledge gradient olicy for an otimal learning roblem on a grah, in which we use sequential measurements
More informationOn split sample and randomized confidence intervals for binomial proportions
On slit samle and randomized confidence intervals for binomial roortions Måns Thulin Deartment of Mathematics, Usala University arxiv:1402.6536v1 [stat.me] 26 Feb 2014 Abstract Slit samle methods have
More informationA New Asymmetric Interaction Ridge (AIR) Regression Method
A New Asymmetric Interaction Ridge (AIR) Regression Method by Kristofer Månsson, Ghazi Shukur, and Pär Sölander The Swedish Retail Institute, HUI Research, Stockholm, Sweden. Deartment of Economics and
More informationOn parameter estimation in deformable models
Downloaded from orbitdtudk on: Dec 7, 07 On arameter estimation in deformable models Fisker, Rune; Carstensen, Jens Michael Published in: Proceedings of the 4th International Conference on Pattern Recognition
More informationUsing the Divergence Information Criterion for the Determination of the Order of an Autoregressive Process
Using the Divergence Information Criterion for the Determination of the Order of an Autoregressive Process P. Mantalos a1, K. Mattheou b, A. Karagrigoriou b a.deartment of Statistics University of Lund
More informationProbability Estimates for Multi-class Classification by Pairwise Coupling
Probability Estimates for Multi-class Classification by Pairwise Couling Ting-Fan Wu Chih-Jen Lin Deartment of Comuter Science National Taiwan University Taiei 06, Taiwan Ruby C. Weng Deartment of Statistics
More informationApproximating min-max k-clustering
Aroximating min-max k-clustering Asaf Levin July 24, 2007 Abstract We consider the roblems of set artitioning into k clusters with minimum total cost and minimum of the maximum cost of a cluster. The cost
More informationMATHEMATICAL MODELLING OF THE WIRELESS COMMUNICATION NETWORK
Comuter Modelling and ew Technologies, 5, Vol.9, o., 3-39 Transort and Telecommunication Institute, Lomonosov, LV-9, Riga, Latvia MATHEMATICAL MODELLIG OF THE WIRELESS COMMUICATIO ETWORK M. KOPEETSK Deartment
More informationDistributed Rule-Based Inference in the Presence of Redundant Information
istribution Statement : roved for ublic release; distribution is unlimited. istributed Rule-ased Inference in the Presence of Redundant Information June 8, 004 William J. Farrell III Lockheed Martin dvanced
More informationRobustness of classifiers to uniform l p and Gaussian noise Supplementary material
Robustness of classifiers to uniform l and Gaussian noise Sulementary material Jean-Yves Franceschi Ecole Normale Suérieure de Lyon LIP UMR 5668 Omar Fawzi Ecole Normale Suérieure de Lyon LIP UMR 5668
More informationBayesian Model Averaging Kriging Jize Zhang and Alexandros Taflanidis
HIPAD LAB: HIGH PERFORMANCE SYSTEMS LABORATORY DEPARTMENT OF CIVIL AND ENVIRONMENTAL ENGINEERING AND EARTH SCIENCES Bayesian Model Averaging Kriging Jize Zhang and Alexandros Taflanidis Why use metamodeling
More informationSystem Reliability Estimation and Confidence Regions from Subsystem and Full System Tests
009 American Control Conference Hyatt Regency Riverfront, St. Louis, MO, USA June 0-, 009 FrB4. System Reliability Estimation and Confidence Regions from Subsystem and Full System Tests James C. Sall Abstract
More informationESTIMATION OF THE RECIPROCAL OF THE MEAN OF THE INVERSE GAUSSIAN DISTRIBUTION WITH PRIOR INFORMATION
STATISTICA, anno LXVIII, n., 008 ESTIMATION OF THE RECIPROCAL OF THE MEAN OF THE INVERSE GAUSSIAN DISTRIBUTION WITH PRIOR INFORMATION 1. INTRODUCTION The Inverse Gaussian distribution was first introduced
More informationEstimating Time-Series Models
Estimating ime-series Models he Box-Jenkins methodology for tting a model to a scalar time series fx t g consists of ve stes:. Decide on the order of di erencing d that is needed to roduce a stationary
More informationFor q 0; 1; : : : ; `? 1, we have m 0; 1; : : : ; q? 1. The set fh j(x) : j 0; 1; ; : : : ; `? 1g forms a basis for the tness functions dened on the i
Comuting with Haar Functions Sami Khuri Deartment of Mathematics and Comuter Science San Jose State University One Washington Square San Jose, CA 9519-0103, USA khuri@juiter.sjsu.edu Fax: (40)94-500 Keywords:
More informationAI*IA 2003 Fusion of Multiple Pattern Classifiers PART III
AI*IA 23 Fusion of Multile Pattern Classifiers PART III AI*IA 23 Tutorial on Fusion of Multile Pattern Classifiers by F. Roli 49 Methods for fusing multile classifiers Methods for fusing multile classifiers
More informationBayesian Spatially Varying Coefficient Models in the Presence of Collinearity
Bayesian Satially Varying Coefficient Models in the Presence of Collinearity David C. Wheeler 1, Catherine A. Calder 1 he Ohio State University 1 Abstract he belief that relationshis between exlanatory
More informationUnsupervised Hyperspectral Image Analysis Using Independent Component Analysis (ICA)
Unsuervised Hyersectral Image Analysis Using Indeendent Comonent Analysis (ICA) Shao-Shan Chiang Chein-I Chang Irving W. Ginsberg Remote Sensing Signal and Image Processing Laboratory Deartment of Comuter
More informationUncorrelated Multilinear Principal Component Analysis for Unsupervised Multilinear Subspace Learning
TNN-2009-P-1186.R2 1 Uncorrelated Multilinear Princial Comonent Analysis for Unsuervised Multilinear Subsace Learning Haiing Lu, K. N. Plataniotis and A. N. Venetsanooulos The Edward S. Rogers Sr. Deartment
More informationA multiple testing approach to the regularisation of large sample correlation matrices
A multile testing aroach to the regularisation of large samle correlation matrices Natalia Bailey Queen Mary, University of London M. Hashem Pesaran University of Southern California, USA, and rinity College,
More informationDETC2003/DAC AN EFFICIENT ALGORITHM FOR CONSTRUCTING OPTIMAL DESIGN OF COMPUTER EXPERIMENTS
Proceedings of DETC 03 ASME 003 Design Engineering Technical Conferences and Comuters and Information in Engineering Conference Chicago, Illinois USA, Setember -6, 003 DETC003/DAC-48760 AN EFFICIENT ALGORITHM
More informationAn Analysis of Reliable Classifiers through ROC Isometrics
An Analysis of Reliable Classifiers through ROC Isometrics Stijn Vanderlooy s.vanderlooy@cs.unimaas.nl Ida G. Srinkhuizen-Kuyer kuyer@cs.unimaas.nl Evgueni N. Smirnov smirnov@cs.unimaas.nl MICC-IKAT, Universiteit
More informationOn Line Parameter Estimation of Electric Systems using the Bacterial Foraging Algorithm
On Line Parameter Estimation of Electric Systems using the Bacterial Foraging Algorithm Gabriel Noriega, José Restreo, Víctor Guzmán, Maribel Giménez and José Aller Universidad Simón Bolívar Valle de Sartenejas,
More informationThe analysis and representation of random signals
The analysis and reresentation of random signals Bruno TOÉSNI Bruno.Torresani@cmi.univ-mrs.fr B. Torrésani LTP Université de Provence.1/30 Outline 1. andom signals Introduction The Karhunen-Loève Basis
More informationEstimating function analysis for a class of Tweedie regression models
Title Estimating function analysis for a class of Tweedie regression models Author Wagner Hugo Bonat Deartamento de Estatística - DEST, Laboratório de Estatística e Geoinformação - LEG, Universidade Federal
More informationDIFFERENTIAL evolution (DE) [3] has become a popular
Self-adative Differential Evolution with Neighborhood Search Zhenyu Yang, Ke Tang and Xin Yao Abstract In this aer we investigate several self-adative mechanisms to imrove our revious work on [], which
More informationSTABILITY ANALYSIS TOOL FOR TUNING UNCONSTRAINED DECENTRALIZED MODEL PREDICTIVE CONTROLLERS
STABILITY ANALYSIS TOOL FOR TUNING UNCONSTRAINED DECENTRALIZED MODEL PREDICTIVE CONTROLLERS Massimo Vaccarini Sauro Longhi M. Reza Katebi D.I.I.G.A., Università Politecnica delle Marche, Ancona, Italy
More informationFeedback-error control
Chater 4 Feedback-error control 4.1 Introduction This chater exlains the feedback-error (FBE) control scheme originally described by Kawato [, 87, 8]. FBE is a widely used neural network based controller
More informationDeriving Indicator Direct and Cross Variograms from a Normal Scores Variogram Model (bigaus-full) David F. Machuca Mory and Clayton V.
Deriving ndicator Direct and Cross Variograms from a Normal Scores Variogram Model (bigaus-full) David F. Machuca Mory and Clayton V. Deutsch Centre for Comutational Geostatistics Deartment of Civil &
More informationLinear diophantine equations for discrete tomography
Journal of X-Ray Science and Technology 10 001 59 66 59 IOS Press Linear diohantine euations for discrete tomograhy Yangbo Ye a,gewang b and Jiehua Zhu a a Deartment of Mathematics, The University of Iowa,
More informationarxiv: v2 [stat.me] 27 Apr 2018
Nonasymtotic estimation and suort recovery for high dimensional sarse covariance matrices arxiv:17050679v [statme] 7 Ar 018 1 Introduction Adam B Kashlak and Linglong Kong Deartment of Mathematical and
More informationEvaluating Circuit Reliability Under Probabilistic Gate-Level Fault Models
Evaluating Circuit Reliability Under Probabilistic Gate-Level Fault Models Ketan N. Patel, Igor L. Markov and John P. Hayes University of Michigan, Ann Arbor 48109-2122 {knatel,imarkov,jhayes}@eecs.umich.edu
More informationRobust Solutions to Markov Decision Problems
Robust Solutions to Markov Decision Problems Arnab Nilim and Laurent El Ghaoui Deartment of Electrical Engineering and Comuter Sciences University of California, Berkeley, CA 94720 nilim@eecs.berkeley.edu,
More informationHotelling s Two- Sample T 2
Chater 600 Hotelling s Two- Samle T Introduction This module calculates ower for the Hotelling s two-grou, T-squared (T) test statistic. Hotelling s T is an extension of the univariate two-samle t-test
More informationSpectral Analysis by Stationary Time Series Modeling
Chater 6 Sectral Analysis by Stationary Time Series Modeling Choosing a arametric model among all the existing models is by itself a difficult roblem. Generally, this is a riori information about the signal
More informationYixi Shi. Jose Blanchet. IEOR Department Columbia University New York, NY 10027, USA. IEOR Department Columbia University New York, NY 10027, USA
Proceedings of the 2011 Winter Simulation Conference S. Jain, R. R. Creasey, J. Himmelsach, K. P. White, and M. Fu, eds. EFFICIENT RARE EVENT SIMULATION FOR HEAVY-TAILED SYSTEMS VIA CROSS ENTROPY Jose
More informationSession 5: Review of Classical Astrodynamics
Session 5: Review of Classical Astrodynamics In revious lectures we described in detail the rocess to find the otimal secific imulse for a articular situation. Among the mission requirements that serve
More informationNotes on Instrumental Variables Methods
Notes on Instrumental Variables Methods Michele Pellizzari IGIER-Bocconi, IZA and frdb 1 The Instrumental Variable Estimator Instrumental variable estimation is the classical solution to the roblem of
More informationLINEAR SYSTEMS WITH POLYNOMIAL UNCERTAINTY STRUCTURE: STABILITY MARGINS AND CONTROL
LINEAR SYSTEMS WITH POLYNOMIAL UNCERTAINTY STRUCTURE: STABILITY MARGINS AND CONTROL Mohammad Bozorg Deatment of Mechanical Engineering University of Yazd P. O. Box 89195-741 Yazd Iran Fax: +98-351-750110
More informationTowards understanding the Lorenz curve using the Uniform distribution. Chris J. Stephens. Newcastle City Council, Newcastle upon Tyne, UK
Towards understanding the Lorenz curve using the Uniform distribution Chris J. Stehens Newcastle City Council, Newcastle uon Tyne, UK (For the Gini-Lorenz Conference, University of Siena, Italy, May 2005)
More informationAn Ant Colony Optimization Approach to the Probabilistic Traveling Salesman Problem
An Ant Colony Otimization Aroach to the Probabilistic Traveling Salesman Problem Leonora Bianchi 1, Luca Maria Gambardella 1, and Marco Dorigo 2 1 IDSIA, Strada Cantonale Galleria 2, CH-6928 Manno, Switzerland
More informationRatio Estimators in Simple Random Sampling Using Information on Auxiliary Attribute
ajesh Singh, ankaj Chauhan, Nirmala Sawan School of Statistics, DAVV, Indore (M.., India Florentin Smarandache Universit of New Mexico, USA atio Estimators in Simle andom Samling Using Information on Auxiliar
More informationx and y suer from two tyes of additive noise [], [3] Uncertainties e x, e y, where the only rior knowledge is their boundedness and zero mean Gaussian
A New Estimator for Mixed Stochastic and Set Theoretic Uncertainty Models Alied to Mobile Robot Localization Uwe D. Hanebeck Joachim Horn Institute of Automatic Control Engineering Siemens AG, Cororate
More informationSolved Problems. (a) (b) (c) Figure P4.1 Simple Classification Problems First we draw a line between each set of dark and light data points.
Solved Problems Solved Problems P Solve the three simle classification roblems shown in Figure P by drawing a decision boundary Find weight and bias values that result in single-neuron ercetrons with the
More informationRANDOM WALKS AND PERCOLATION: AN ANALYSIS OF CURRENT RESEARCH ON MODELING NATURAL PROCESSES
RANDOM WALKS AND PERCOLATION: AN ANALYSIS OF CURRENT RESEARCH ON MODELING NATURAL PROCESSES AARON ZWIEBACH Abstract. In this aer we will analyze research that has been recently done in the field of discrete
More informationTIME-FREQUENCY BASED SENSOR FUSION IN THE ASSESSMENT AND MONITORING OF MACHINE PERFORMANCE DEGRADATION
Proceedings of IMECE 0 00 ASME International Mechanical Engineering Congress & Exosition New Orleans, Louisiana, November 17-, 00 IMECE00-MED-303 TIME-FREQUENCY BASED SENSOR FUSION IN THE ASSESSMENT AND
More informationA Simple Weight Decay Can Improve. Abstract. It has been observed in numerical simulations that a weight decay can improve
In Advances in Neural Information Processing Systems 4, J.E. Moody, S.J. Hanson and R.P. Limann, eds. Morgan Kaumann Publishers, San Mateo CA, 1995,. 950{957. A Simle Weight Decay Can Imrove Generalization
More informationMODELING THE RELIABILITY OF C4ISR SYSTEMS HARDWARE/SOFTWARE COMPONENTS USING AN IMPROVED MARKOV MODEL
Technical Sciences and Alied Mathematics MODELING THE RELIABILITY OF CISR SYSTEMS HARDWARE/SOFTWARE COMPONENTS USING AN IMPROVED MARKOV MODEL Cezar VASILESCU Regional Deartment of Defense Resources Management
More informationPublished: 14 October 2013
Electronic Journal of Alied Statistical Analysis EJASA, Electron. J. A. Stat. Anal. htt://siba-ese.unisalento.it/index.h/ejasa/index e-issn: 27-5948 DOI: 1.1285/i275948v6n213 Estimation of Parameters of
More informationState Estimation with ARMarkov Models
Deartment of Mechanical and Aerosace Engineering Technical Reort No. 3046, October 1998. Princeton University, Princeton, NJ. State Estimation with ARMarkov Models Ryoung K. Lim 1 Columbia University,
More informationA Recursive Block Incomplete Factorization. Preconditioner for Adaptive Filtering Problem
Alied Mathematical Sciences, Vol. 7, 03, no. 63, 3-3 HIKARI Ltd, www.m-hiari.com A Recursive Bloc Incomlete Factorization Preconditioner for Adative Filtering Problem Shazia Javed School of Mathematical
More informationLower Confidence Bound for Process-Yield Index S pk with Autocorrelated Process Data
Quality Technology & Quantitative Management Vol. 1, No.,. 51-65, 15 QTQM IAQM 15 Lower onfidence Bound for Process-Yield Index with Autocorrelated Process Data Fu-Kwun Wang * and Yeneneh Tamirat Deartment
More informationGenetic Algorithms, Selection Schemes, and the Varying Eects of Noise. IlliGAL Report No November Department of General Engineering
Genetic Algorithms, Selection Schemes, and the Varying Eects of Noise Brad L. Miller Det. of Comuter Science University of Illinois at Urbana-Chamaign David E. Goldberg Det. of General Engineering University
More informationNamed Entity Recognition using Maximum Entropy Model SEEM5680
Named Entity Recognition using Maximum Entroy Model SEEM5680 Named Entity Recognition System Named Entity Recognition (NER): Identifying certain hrases/word sequences in a free text. Generally it involves
More informationHidden Predictors: A Factor Analysis Primer
Hidden Predictors: A Factor Analysis Primer Ryan C Sanchez Western Washington University Factor Analysis is a owerful statistical method in the modern research sychologist s toolbag When used roerly, factor
More informationIntroduction to Probability and Statistics
Introduction to Probability and Statistics Chater 8 Ammar M. Sarhan, asarhan@mathstat.dal.ca Deartment of Mathematics and Statistics, Dalhousie University Fall Semester 28 Chater 8 Tests of Hyotheses Based
More informationConvex Optimization methods for Computing Channel Capacity
Convex Otimization methods for Comuting Channel Caacity Abhishek Sinha Laboratory for Information and Decision Systems (LIDS), MIT sinhaa@mit.edu May 15, 2014 We consider a classical comutational roblem
More informationAN OPTIMAL CONTROL CHART FOR NON-NORMAL PROCESSES
AN OPTIMAL CONTROL CHART FOR NON-NORMAL PROCESSES Emmanuel Duclos, Maurice Pillet To cite this version: Emmanuel Duclos, Maurice Pillet. AN OPTIMAL CONTROL CHART FOR NON-NORMAL PRO- CESSES. st IFAC Worsho
More informationOn Wald-Type Optimal Stopping for Brownian Motion
J Al Probab Vol 34, No 1, 1997, (66-73) Prerint Ser No 1, 1994, Math Inst Aarhus On Wald-Tye Otimal Stoing for Brownian Motion S RAVRSN and PSKIR The solution is resented to all otimal stoing roblems of
More informationDesign of NARMA L-2 Control of Nonlinear Inverted Pendulum
International Research Journal of Alied and Basic Sciences 016 Available online at www.irjabs.com ISSN 51-838X / Vol, 10 (6): 679-684 Science Exlorer Publications Design of NARMA L- Control of Nonlinear
More informationEvaluation of the critical wave groups method for calculating the probability of extreme ship responses in beam seas
Proceedings of the 6 th International Shi Stability Worsho, 5-7 June 207, Belgrade, Serbia Evaluation of the critical wave grous method for calculating the robability of extreme shi resonses in beam seas
More informationShadow Computing: An Energy-Aware Fault Tolerant Computing Model
Shadow Comuting: An Energy-Aware Fault Tolerant Comuting Model Bryan Mills, Taieb Znati, Rami Melhem Deartment of Comuter Science University of Pittsburgh (bmills, znati, melhem)@cs.itt.edu Index Terms
More informationAn Improved Generalized Estimation Procedure of Current Population Mean in Two-Occasion Successive Sampling
Journal of Modern Alied Statistical Methods Volume 15 Issue Article 14 11-1-016 An Imroved Generalized Estimation Procedure of Current Poulation Mean in Two-Occasion Successive Samling G. N. Singh Indian
More informationarxiv: v3 [physics.data-an] 23 May 2011
Date: October, 8 arxiv:.7v [hysics.data-an] May -values for Model Evaluation F. Beaujean, A. Caldwell, D. Kollár, K. Kröninger Max-Planck-Institut für Physik, München, Germany CERN, Geneva, Switzerland
More informationAdaptive estimation with change detection for streaming data
Adative estimation with change detection for streaming data A thesis resented for the degree of Doctor of Philosohy of the University of London and the Diloma of Imerial College by Dean Adam Bodenham Deartment
More informationA Qualitative Event-based Approach to Multiple Fault Diagnosis in Continuous Systems using Structural Model Decomposition
A Qualitative Event-based Aroach to Multile Fault Diagnosis in Continuous Systems using Structural Model Decomosition Matthew J. Daigle a,,, Anibal Bregon b,, Xenofon Koutsoukos c, Gautam Biswas c, Belarmino
More informationPairwise active appearance model and its application to echocardiography tracking
Pairwise active aearance model and its alication to echocardiograhy tracking S. Kevin Zhou 1, J. Shao 2, B. Georgescu 1, and D. Comaniciu 1 1 Integrated Data Systems, Siemens Cororate Research, Inc., Princeton,
More informationScaling Multiple Point Statistics for Non-Stationary Geostatistical Modeling
Scaling Multile Point Statistics or Non-Stationary Geostatistical Modeling Julián M. Ortiz, Steven Lyster and Clayton V. Deutsch Centre or Comutational Geostatistics Deartment o Civil & Environmental Engineering
More informationSIMULATED ANNEALING AND JOINT MANUFACTURING BATCH-SIZING. Ruhul SARKER. Xin YAO
Yugoslav Journal of Oerations Research 13 (003), Number, 45-59 SIMULATED ANNEALING AND JOINT MANUFACTURING BATCH-SIZING Ruhul SARKER School of Comuter Science, The University of New South Wales, ADFA,
More informationPER-PATCH METRIC LEARNING FOR ROBUST IMAGE MATCHING. Sezer Karaoglu, Ivo Everts, Jan C. van Gemert, and Theo Gevers
PER-PATCH METRIC LEARNING FOR ROBUST IMAGE MATCHING Sezer Karaoglu, Ivo Everts, Jan C. van Gemert, and Theo Gevers Intelligent Systems Lab, Amsterdam, University of Amsterdam, 1098 XH Amsterdam, The Netherlands
More informationPArtially observable Markov decision processes
Solving Continuous-State POMDPs via Density Projection Enlu Zhou, Member, IEEE, Michael C. Fu, Fellow, IEEE, and Steven I. Marcus, Fellow, IEEE Abstract Research on numerical solution methods for artially
More informationUse of Transformations and the Repeated Statement in PROC GLM in SAS Ed Stanek
Use of Transformations and the Reeated Statement in PROC GLM in SAS Ed Stanek Introduction We describe how the Reeated Statement in PROC GLM in SAS transforms the data to rovide tests of hyotheses of interest.
More informationDEPARTMENT OF ECONOMICS ISSN DISCUSSION PAPER 20/07 TWO NEW EXPONENTIAL FAMILIES OF LORENZ CURVES
DEPARTMENT OF ECONOMICS ISSN 1441-549 DISCUSSION PAPER /7 TWO NEW EXPONENTIAL FAMILIES OF LORENZ CURVES ZuXiang Wang * & Russell Smyth ABSTRACT We resent two new Lorenz curve families by using the basic
More informationTopology Optimization of Three Dimensional Structures under Self-weight and Inertial Forces
6 th World Congresses of Structural and Multidiscilinary Otimization Rio de Janeiro, 30 May - 03 June 2005, Brazil Toology Otimization of Three Dimensional Structures under Self-weight and Inertial Forces
More informationUncorrelated Multilinear Discriminant Analysis with Regularization and Aggregation for Tensor Object Recognition
TNN-2007-P-0332.R1 1 Uncorrelated Multilinear Discriminant Analysis with Regularization and Aggregation for Tensor Object Recognition Haiing Lu, K.N. Plataniotis and A.N. Venetsanooulos The Edward S. Rogers
More informationTests for Two Proportions in a Stratified Design (Cochran/Mantel-Haenszel Test)
Chater 225 Tests for Two Proortions in a Stratified Design (Cochran/Mantel-Haenszel Test) Introduction In a stratified design, the subects are selected from two or more strata which are formed from imortant
More informationdn i where we have used the Gibbs equation for the Gibbs energy and the definition of chemical potential
Chem 467 Sulement to Lectures 33 Phase Equilibrium Chemical Potential Revisited We introduced the chemical otential as the conjugate variable to amount. Briefly reviewing, the total Gibbs energy of a system
More informationMonte Carlo Studies. Monte Carlo Studies. Sampling Distribution
Monte Carlo Studies Do not let yourself be intimidated by the material in this lecture This lecture involves more theory but is meant to imrove your understanding of: Samling distributions and tests of
More informationAggregate Prediction With. the Aggregation Bias
100 Aggregate Prediction With Disaggregate Models: Behavior of the Aggregation Bias Uzi Landau, Transortation Research nstitute, Technion-srael nstitute of Technology, Haifa Disaggregate travel demand
More informationApplications to stochastic PDE
15 Alications to stochastic PE In this final lecture we resent some alications of the theory develoed in this course to stochastic artial differential equations. We concentrate on two secific examles:
More informationEstimation of component redundancy in optimal age maintenance
EURO MAINTENANCE 2012, Belgrade 14-16 May 2012 Proceedings of the 21 st International Congress on Maintenance and Asset Management Estimation of comonent redundancy in otimal age maintenance Jorge ioa
More informationSampling and Distortion Tradeoffs for Bandlimited Periodic Signals
Samling and Distortion radeoffs for Bandlimited Periodic Signals Elaheh ohammadi and Farokh arvasti Advanced Communications Research Institute ACRI Deartment of Electrical Engineering Sharif University
More informationt 0 Xt sup X t p c p inf t 0
SHARP MAXIMAL L -ESTIMATES FOR MARTINGALES RODRIGO BAÑUELOS AND ADAM OSȨKOWSKI ABSTRACT. Let X be a suermartingale starting from 0 which has only nonnegative jums. For each 0 < < we determine the best
More informationCOMPARISON OF VARIOUS OPTIMIZATION TECHNIQUES FOR DESIGN FIR DIGITAL FILTERS
NCCI 1 -National Conference on Comutational Instrumentation CSIO Chandigarh, INDIA, 19- March 1 COMPARISON OF VARIOUS OPIMIZAION ECHNIQUES FOR DESIGN FIR DIGIAL FILERS Amanjeet Panghal 1, Nitin Mittal,Devender
More informationDistributed K-means over Compressed Binary Data
1 Distributed K-means over Comressed Binary Data Elsa DUPRAZ Telecom Bretagne; UMR CNRS 6285 Lab-STICC, Brest, France arxiv:1701.03403v1 [cs.it] 12 Jan 2017 Abstract We consider a networ of binary-valued
More informationTensor-Based Sparsity Order Estimation for Big Data Applications
Tensor-Based Sarsity Order Estimation for Big Data Alications Kefei Liu, Florian Roemer, João Paulo C. L. da Costa, Jie Xiong, Yi-Sheng Yan, Wen-Qin Wang and Giovanni Del Galdo Indiana University School
More informationCONVOLVED SUBSAMPLING ESTIMATION WITH APPLICATIONS TO BLOCK BOOTSTRAP
Submitted to the Annals of Statistics arxiv: arxiv:1706.07237 CONVOLVED SUBSAMPLING ESTIMATION WITH APPLICATIONS TO BLOCK BOOTSTRAP By Johannes Tewes, Dimitris N. Politis and Daniel J. Nordman Ruhr-Universität
More informationAlgorithms for Air Traffic Flow Management under Stochastic Environments
Algorithms for Air Traffic Flow Management under Stochastic Environments Arnab Nilim and Laurent El Ghaoui Abstract A major ortion of the delay in the Air Traffic Management Systems (ATMS) in US arises
More informationA New GP-evolved Formulation for the Relative Permittivity of Water and Steam
ew GP-evolved Formulation for the Relative Permittivity of Water and Steam S. V. Fogelson and W. D. Potter rtificial Intelligence Center he University of Georgia, US Contact Email ddress: sergeyf1@uga.edu
More informationImplementation and Validation of Finite Volume C++ Codes for Plane Stress Analysis
CST0 191 October, 011, Krabi Imlementation and Validation of Finite Volume C++ Codes for Plane Stress Analysis Chakrit Suvanjumrat and Ekachai Chaichanasiri* Deartment of Mechanical Engineering, Faculty
More information