Asymptotic behavior of Support Vector Machine for spiked population model
|
|
- Giles Wilkerson
- 6 years ago
- Views:
Transcription
1 Asymptotic behavior of Support Vector Machine for spiked population model Supplementary Material This note contains technical proofs of Propositions, and 3 Proof of proposition Denote X [x,, x M ] T, y y,, y M T, then the joint probability density function of X, y is px, y M i px i, y i In statistical mechanics, we start from the partition function Z β X, y dw exp β w M i yi x T i w Θ, S β is the inverse temperature and the SVM constraints are enforced strictly using Heaviside step function Θ defined as Θt if t 0 and Θt 0 if t<0 At the low temperature limit, ie β, Z β X, y in equation S is dominated by the solution vector w of the hard SVM algorithm 3 The properties of the SVM can be computed from the zero temperature average free energy F lim lim β β log Z βx, y X,y, S the bracket stands for the average over all training sets In order to evaluate the integration of a log function, we make use of the replica method based on the identity Z n log Z lim n 0 n lim n 0 n log Zn, S3 and rewrite S as F lim lim β β lim n 0 n log Ξ nβ, S4
2 Ξ n β {Z β X, y} n X,y M {Z β X, y} n px i, y i dx i dy i i S5 Equation S4 can be derived by using the fact that lim n 0 Ξ n β and exchanging the order of the averaging and the differentiation with respect to n In the replica method, we will first evaluate Ξ n β for integer n and then apply to real n and take the limit of n 0 For integer n, in order to represent {Z β X, y} n in the integrand of S5, we use the identity n fxpdx fx fx n pdx pdx n, and obtain {Z β X, y} n [ n dw ν exp { β w ν } M yi x T i w ν Θ ] S6 i we have introduced replicated parameters w ν [w ν,,, w ν, ], ν,, n Exchanging the order of the two limits and n 0 in S4, we have F lim β β lim n 0 lim n log Ξ nβ S7 The integrations over X and y can be performed which leads to Ξ n β n n dw ν exp dw ν exp β β n M w ν n i yi x T i w ν Θ x i,y i } {{ } G n n w ν + M log G n, S8
3 G n n yx T w ν Θ x,y { n x T w ν dx Θ px y p + + n x T w ν Θ x y n } x T w ν Θ px y p Here for the second equation, the two terms inside { } give equal contribution to the integration due to the reflection x x because px y p x y and p + + p By introducing the Fourier integral representation of Θ function x T w ν Θ π dt ν { } dˆt ν exp i t ν xt w ν ˆt ν, we have G n π n { d n t ν d nˆt ν exp i n } t ν xt w ν ˆt ν x y According to Berry-Esseen Central Limit Theorem, in the limit, the joint distribution of x T w /,, x T w n / for fixed w,, w n is a multivariate normal with mean µ T w /,, µ T w n / and covariance matrix Q Q νν, Q νν wν T Σw ν /, Σ is defined in Further define Q νν w T ν w ν / and R ν,m w T ν v m / Since ˆµ v, one has w T ν ˆµ/ R ν, ˆµ µ/µ and µ µ The integration over x T w ν and ˆt gives G n π n/ d n t ν det Q exp { νν t ν µr ν, Q νν t ν µr ν, } S9 We have used a shorthand notation: d n t ν stands for n dt ν Substitute the covariance matrix ex- 3
4 pression into Q νν, we have Q νν σ w ν T w ν + λ m vmw T ν vmw T ν σ Q νν + λ m R ν,m R ν,m m m Therefore the dependence of G n on w ν is explicitly through the order parameters Q νν and R ν,m The integration over w ν are performed in terms of integrations over R ν,m and Q νν, ie dw ν ν ν dw ν dr ν,m δ R ν,m vt mw ν dq νν δ Q νν wt ν w ν ν,m νν We rewrite these delta functions by using the Fourier representations In doing so, constant factors can be applied to the Fourier integration variables, and we choose convenient factors for later calculations After dropping irrelevant prefactors, we get dw ν ν dq νν d ˆQ νν dr ν,m d ˆR ν,m exp νν,m ˆR ν,m ˆQ νν ˆRν,m νν log det ˆQ ˆQνν Q νν + i ν,m ˆR ν,m R ν,m We have used a shorthand notation: dx ν,m stands for K n m dx ν,m X is one of R, ˆR; dq νν and d ˆQ νν stand for νν Q νν and νν ˆQνν respectively To obtain the leading-order contribution, we integrate over the Fourier variables ˆQ νν and ˆR ν,m, and retain only terms in the exponent of the integrand that are extensive in This gives dw ν ν dq νν dr ν,m expt n, T n Q log det R m R T m, S0 m and vector R m R,m,, R n,m T We rewrite S8 in terms of the integrations over R ν,m and 4
5 Q νν Ξ n β { } n dq νν dr ν,m exp β Q νν + T n + α log G n ow we apply steepest descent method to the remaining integrations over Q νν and R ν,m According to Varadhan s proposition Tanaka, 00, only the saddle points of the exponent of the integrand contribute to the integration in the limit of However, looking for saddle-points over all the entire space is in general difficult to perform We assume replica symmetry for saddle-points such that they are invariant under exchange of any two replica indices ν and ν, ν ν Under this symmetry assumption, the space is greatly reduced and the exponent of the integrand can be explicitly evaluated We put Q νν q 0 ν, Q νν q ν, ν, ν ν, R νm R m ν S The matrix Q can be simplified as Q q 0 q I + q Rm T m Further define ˆq 0 σ q 0 + λ m Rm, ˆq σ q + λ m Rm m m Substituting into S9, log G n can be simplified as log G n n Dz log Φt S t µr ˆq z ˆq0 ˆq, 5
6 and Dz dz π exp z denotes the weight of standard normal distribution and Φs s Dz denotes the standard normal cumulative distribution function Similarly, we can simplify S0 as T n n { logq 0 q + q q 0 q K m R m } S3 Substituting S and S3 into S8, and then into S7, we get F q 0 logq 0 q β q K m R m α βq 0 q β Dz log Φt In order for F to be non-trivial and well behaved in the limit β, we introduce the scaled parameter q βq 0 q We then find in the limit β, the free energy becomes F q 0 q K m R m + αˆq zc z c z, S4 q q z c µr σ ˆq, ˆq q + λ m Rm m From S4, we get the saddle-point equations zc q 0 Rm αˆq Dzz c z, m zc R α ˆq µ Dzz c z, σ R m 0 m,, K Denote θ the angle between w and µ, and define ρ cosθ R / q 0, the above equations can be 6
7 simplified as ρ zc α + λ ρ ρ αµ + λ ρ σ zc Dzz c z, Dzz c z S5 S6 Therefore, given α, λ, µ, σ, equations S5 and S6 can be used to solve two unknown parameters ρ and z c Proof of proposition The proof of Proposition is similar to the proof for Proposition We start from the partition function Z β X, y dwd M ξ i exp { βw T w + τ } M M yi x T i w ξ i Θ + ξ i, S7 i i τ is SVM tuning parameter At the low temperature limit, ie β, Z β X, y in equation S7 is dominated by the solution vector w of the soft-margin SVM algorithm 4 The replicated partition function is {Z β X, y} n [ n { dw ν d M ξ ν,i exp β w ν + τ 0 } M M yi x T i w ν ξ ν,i Θ + ξ ν,i ] i i Integration over X and y gives {Z β X, y} n X,y M i n dw ν exp β n w ν n n n yi x T i w dξ ν,i exp βτ ξ ν,i Θ ν + ξ ν,i 0 x i,y i }{{} n dw ν exp β G n n w ν + M log G n, 7
8 G n 0 0 d n ξ ν exp d n ξ ν exp βτ βτ n n ξ ν n n ξ ν yx T w ν Θ + ξ ν x,y x T w ν Θ + ξ ν x y Under replica symmetry assumption S, in the limit n 0 through some lengthy but standard calculation one obtains log G n β nβ ˆq zc qˆτ zc Dz{qˆτ ˆτz c z} Dz z c z, S8 q ˆτ στ ˆq, q βq 0 q Substituting S8 and S3 into S8, and then into S7, we find F q 0 q m R m q q 0 zc qˆτ Dz{qˆτ ˆτz c z} zc Dz z c z, q z c µr σ ˆq, ˆτ στ ˆq 8
9 We then find the saddle-point equations q 0 K m R m αq ˆτ zc qˆτ ˆq q αˆτ R q ˆq αˆτµ σ zc qˆτ zc qˆτ zc Dz α Dzz α q zc zc Dzz c z 0 z c zz 0 Dz αµ Dzz c z 0 qσ R m 0 m,, K, which can be simplified as ρ zc qˆτ + λ ρ αqˆτ q αˆτ ρ q + λ ρ αˆτµ σ zc qˆτ zc qˆτ zc Dz α Dzz α q Dz αµ qσ zc zc Dzz c z 0, S9 z c zz 0, S0 Dzz c z 0, S z c / q 0 µρ σ + λ ρ, ˆτ στ q0 + λ ρ Therefore, given α, λ, µ, σ, τ, equations S9, S0, S can be used to solve 3 unknown parameters ρ, q 0, q Proof of proposition 3 We start from the partition function Z β X, y dw exp [ β { M w M + i x T M i w + r + i x T i w r }] S 9
10 At the low temperature limit, ie β, Z β X, y in equation S is dominated by the vector µ c x + x, x + and x represent the sample means for Class + and Class respectively Following the same procedure as the proofs for Propositions and, we obtain the free energy as F q 0 R x + αq 0 ασ + x αrµ r + r The saddle-point equations are x α, R µ, q 0 4µ + σ αr + r Therefore the squared distance between two class means converges to the value given by 3 References Tanaka, T 00 A statistical-mechanics approach to large-system analysis of cdma multiuser detectors Information Theory, IEEE Transactions on 48,
Asymptotic behavior of Support Vector Machine for spiked population model
Journal of Machine Learning Research 18 (2017) 1-21 Submitted 11/16; Revised 3/17; Published 4/17 Asymptotic behavior of Support Vector Machine for spiked population model Department of Epidemiology and
More informationLecture 25: Review. Statistics 104. April 23, Colin Rundel
Lecture 25: Review Statistics 104 Colin Rundel April 23, 2012 Joint CDF F (x, y) = P [X x, Y y] = P [(X, Y ) lies south-west of the point (x, y)] Y (x,y) X Statistics 104 (Colin Rundel) Lecture 25 April
More informationStochastic processes for symmetric space-time fractional diffusion
Stochastic processes for symmetric space-time fractional diffusion Gianni PAGNINI IKERBASQUE Research Fellow BCAM - Basque Center for Applied Mathematics Bilbao, Basque Country - Spain gpagnini@bcamath.org
More informationRandom Variables and Their Distributions
Chapter 3 Random Variables and Their Distributions A random variable (r.v.) is a function that assigns one and only one numerical value to each simple event in an experiment. We will denote r.vs by capital
More informationPerturbed Gaussian Copula
Perturbed Gaussian Copula Jean-Pierre Fouque Xianwen Zhou August 8, 6 Abstract Gaussian copula is by far the most popular copula used in the financial industry in default dependency modeling. However,
More information4. CONTINUOUS RANDOM VARIABLES
IA Probability Lent Term 4 CONTINUOUS RANDOM VARIABLES 4 Introduction Up to now we have restricted consideration to sample spaces Ω which are finite, or countable; we will now relax that assumption We
More informationIntroduction to Probability and Stocastic Processes - Part I
Introduction to Probability and Stocastic Processes - Part I Lecture 2 Henrik Vie Christensen vie@control.auc.dk Department of Control Engineering Institute of Electronic Systems Aalborg University Denmark
More informationAero III/IV Conformal Mapping
Aero III/IV Conformal Mapping View complex function as a mapping Unlike a real function, a complex function w = f(z) cannot be represented by a curve. Instead it is useful to view it as a mapping. Write
More informationsheng@mail.ncyu.edu.tw Content Joint distribution functions Independent random variables Sums of independent random variables Conditional distributions: discrete case Conditional distributions: continuous
More informationThe Uses of Differential Geometry in Finance
The Uses of Differential Geometry in Finance Andrew Lesniewski Bloomberg, November 21 2005 The Uses of Differential Geometry in Finance p. 1 Overview Joint with P. Hagan and D. Woodward Motivation: Varadhan
More informationwhere r n = dn+1 x(t)
Random Variables Overview Probability Random variables Transforms of pdfs Moments and cumulants Useful distributions Random vectors Linear transformations of random vectors The multivariate normal distribution
More informationMATH 205C: STATIONARY PHASE LEMMA
MATH 205C: STATIONARY PHASE LEMMA For ω, consider an integral of the form I(ω) = e iωf(x) u(x) dx, where u Cc (R n ) complex valued, with support in a compact set K, and f C (R n ) real valued. Thus, I(ω)
More informationGaussian processes. Chuong B. Do (updated by Honglak Lee) November 22, 2008
Gaussian processes Chuong B Do (updated by Honglak Lee) November 22, 2008 Many of the classical machine learning algorithms that we talked about during the first half of this course fit the following pattern:
More informationLecture 8: The Metropolis-Hastings Algorithm
30.10.2008 What we have seen last time: Gibbs sampler Key idea: Generate a Markov chain by updating the component of (X 1,..., X p ) in turn by drawing from the full conditionals: X (t) j Two drawbacks:
More informationSTA 2201/442 Assignment 2
STA 2201/442 Assignment 2 1. This is about how to simulate from a continuous univariate distribution. Let the random variable X have a continuous distribution with density f X (x) and cumulative distribution
More informationEEL 5544 Noise in Linear Systems Lecture 30. X (s) = E [ e sx] f X (x)e sx dx. Moments can be found from the Laplace transform as
L30-1 EEL 5544 Noise in Linear Systems Lecture 30 OTHER TRANSFORMS For a continuous, nonnegative RV X, the Laplace transform of X is X (s) = E [ e sx] = 0 f X (x)e sx dx. For a nonnegative RV, the Laplace
More informationConditional Distributions
Conditional Distributions The goal is to provide a general definition of the conditional distribution of Y given X, when (X, Y ) are jointly distributed. Let F be a distribution function on R. Let G(,
More informationStochastic Processes. M. Sami Fadali Professor of Electrical Engineering University of Nevada, Reno
Stochastic Processes M. Sami Fadali Professor of Electrical Engineering University of Nevada, Reno 1 Outline Stochastic (random) processes. Autocorrelation. Crosscorrelation. Spectral density function.
More informationELEMENTS OF PROBABILITY THEORY
ELEMENTS OF PROBABILITY THEORY Elements of Probability Theory A collection of subsets of a set Ω is called a σ algebra if it contains Ω and is closed under the operations of taking complements and countable
More informationQuestion 1. The correct answers are: (a) (2) (b) (1) (c) (2) (d) (3) (e) (2) (f) (1) (g) (2) (h) (1)
Question 1 The correct answers are: a 2 b 1 c 2 d 3 e 2 f 1 g 2 h 1 Question 2 a Any probability measure Q equivalent to P on F 2 can be described by Q[{x 1, x 2 }] := q x1 q x1,x 2, 1 where q x1, q x1,x
More informationMultivariate Distribution Models
Multivariate Distribution Models Model Description While the probability distribution for an individual random variable is called marginal, the probability distribution for multiple random variables is
More information( ), λ. ( ) =u z. ( )+ iv z
AMS 212B Perturbation Metho Lecture 18 Copyright by Hongyun Wang, UCSC Method of steepest descent Consider the integral I = f exp( λ g )d, g C =u + iv, λ We consider the situation where ƒ() and g() = u()
More information4 Classical Coherence Theory
This chapter is based largely on Wolf, Introduction to the theory of coherence and polarization of light [? ]. Until now, we have not been concerned with the nature of the light field itself. Instead,
More informationHW4 : Bivariate Distributions (1) Solutions
STAT/MATH 395 A - PROBABILITY II UW Winter Quarter 7 Néhémy Lim HW4 : Bivariate Distributions () Solutions Problem. The joint probability mass function of X and Y is given by the following table : X Y
More informationGeometry and Motion Selected answers to Sections A and C Dwight Barkley 2016
MA34 Geometry and Motion Selected answers to Sections A and C Dwight Barkley 26 Example Sheet d n+ = d n cot θ n r θ n r = Θθ n i. 2. 3. 4. Possible answers include: and with opposite orientation: 5..
More informationHigher-order ordinary differential equations
Higher-order ordinary differential equations 1 A linear ODE of general order n has the form a n (x) dn y dx n +a n 1(x) dn 1 y dx n 1 + +a 1(x) dy dx +a 0(x)y = f(x). If f(x) = 0 then the equation is called
More informationIntroduction to Machine Learning
Introduction to Machine Learning 12. Gaussian Processes Alex Smola Carnegie Mellon University http://alex.smola.org/teaching/cmu2013-10-701 10-701 The Normal Distribution http://www.gaussianprocess.org/gpml/chapters/
More informationStatistics 300B Winter 2018 Final Exam Due 24 Hours after receiving it
Statistics 300B Winter 08 Final Exam Due 4 Hours after receiving it Directions: This test is open book and open internet, but must be done without consulting other students. Any consultation of other students
More informationx. Figure 1: Examples of univariate Gaussian pdfs N (x; µ, σ 2 ).
.8.6 µ =, σ = 1 µ = 1, σ = 1 / µ =, σ =.. 3 1 1 3 x Figure 1: Examples of univariate Gaussian pdfs N (x; µ, σ ). The Gaussian distribution Probably the most-important distribution in all of statistics
More informationPerhaps the simplest way of modeling two (discrete) random variables is by means of a joint PMF, defined as follows.
Chapter 5 Two Random Variables In a practical engineering problem, there is almost always causal relationship between different events. Some relationships are determined by physical laws, e.g., voltage
More informationSpring 2012 Math 541A Exam 1. X i, S 2 = 1 n. n 1. X i I(X i < c), T n =
Spring 2012 Math 541A Exam 1 1. (a) Let Z i be independent N(0, 1), i = 1, 2,, n. Are Z = 1 n n Z i and S 2 Z = 1 n 1 n (Z i Z) 2 independent? Prove your claim. (b) Let X 1, X 2,, X n be independent identically
More informationMultiple Random Variables
Multiple Random Variables This Version: July 30, 2015 Multiple Random Variables 2 Now we consider models with more than one r.v. These are called multivariate models For instance: height and weight An
More informationMultiple Choice. Compute the Jacobian, (u, v), of the coordinate transformation x = u2 v 4, y = uv. (a) 2u 2 + 4v 4 (b) xu yv (c) 3u 2 + 7v 6
.(5pts) y = uv. ompute the Jacobian, Multiple hoice (x, y) (u, v), of the coordinate transformation x = u v 4, (a) u + 4v 4 (b) xu yv (c) u + 7v 6 (d) u (e) u v uv 4 Solution. u v 4v u = u + 4v 4..(5pts)
More informationLarge dimensional analysis of general margin based classification methods
Large dimensional analysis of general margin based classification methods Hanwen Huang arxiv:1901.08057v1 [stat.ml] 3 Jan 019 Department of Epidemiology and Biostatistics University of Georgia, Athens,
More informationYaming Yu Department of Statistics, University of California, Irvine Xiao-Li Meng Department of Statistics, Harvard University.
Appendices to To Center or Not to Center: That is Not the Question An Ancillarity-Sufficiency Interweaving Strategy (ASIS) for Boosting MCMC Efficiency Yaming Yu Department of Statistics, University of
More informationMultivariate Distributions (Hogg Chapter Two)
Multivariate Distributions (Hogg Chapter Two) STAT 45-1: Mathematical Statistics I Fall Semester 15 Contents 1 Multivariate Distributions 1 11 Random Vectors 111 Two Discrete Random Variables 11 Two Continuous
More informationPROBABILITY: LIMIT THEOREMS II, SPRING HOMEWORK PROBLEMS
PROBABILITY: LIMIT THEOREMS II, SPRING 218. HOMEWORK PROBLEMS PROF. YURI BAKHTIN Instructions. You are allowed to work on solutions in groups, but you are required to write up solutions on your own. Please
More informationInformation geometry for bivariate distribution control
Information geometry for bivariate distribution control C.T.J.Dodson + Hong Wang Mathematics + Control Systems Centre, University of Manchester Institute of Science and Technology Optimal control of stochastic
More informationMLCC 2017 Regularization Networks I: Linear Models
MLCC 2017 Regularization Networks I: Linear Models Lorenzo Rosasco UNIGE-MIT-IIT June 27, 2017 About this class We introduce a class of learning algorithms based on Tikhonov regularization We study computational
More informationFundamental Limits of PhaseMax for Phase Retrieval: A Replica Analysis
Fundamental Limits of PhaseMax for Phase Retrieval: A Replica Analysis Oussama Dhifallah and Yue M. Lu John A. Paulson School of Engineering and Applied Sciences Harvard University, Cambridge, MA 0238,
More informationThe moment-generating function of the log-normal distribution using the star probability measure
Noname manuscript No. (will be inserted by the editor) The moment-generating function of the log-normal distribution using the star probability measure Yuri Heymann Received: date / Accepted: date Abstract
More informationBefore you begin read these instructions carefully.
MATHEMATICAL TRIPOS Part IA Friday, 1 June, 2018 1:30 pm to 4:30 pm PAPER 2 Before you begin read these instructions carefully. The examination paper is divided into two sections. Each question in Section
More informationDefinite integrals. We shall study line integrals of f (z). In order to do this we shall need some preliminary definitions.
5. OMPLEX INTEGRATION (A) Definite integrals Integrals are extremely important in the study of functions of a complex variable. The theory is elegant, and the proofs generally simple. The theory is put
More informationMultivariate probability distributions and linear regression
Multivariate probability distributions and linear regression Patrik Hoyer 1 Contents: Random variable, probability distribution Joint distribution Marginal distribution Conditional distribution Independence,
More informationCheng Soon Ong & Christian Walder. Canberra February June 2018
Cheng Soon Ong & Christian Walder Research Group and College of Engineering and Computer Science Canberra February June 2018 (Many figures from C. M. Bishop, "Pattern Recognition and ") 1of 89 Part II
More informationBrownian Motion. 1 Definition Brownian Motion Wiener measure... 3
Brownian Motion Contents 1 Definition 2 1.1 Brownian Motion................................. 2 1.2 Wiener measure.................................. 3 2 Construction 4 2.1 Gaussian process.................................
More informationMATH 44041/64041 Applied Dynamical Systems, 2018
MATH 4441/6441 Applied Dynamical Systems, 218 Answers to Assessed Coursework 1 1 Method based on series expansion of the matrix exponential Let the coefficient matrix of α the linear system be A, then
More informationCopulas. MOU Lili. December, 2014
Copulas MOU Lili December, 2014 Outline Preliminary Introduction Formal Definition Copula Functions Estimating the Parameters Example Conclusion and Discussion Preliminary MOU Lili SEKE Team 3/30 Probability
More informationSupplementary material to Expectation Propagation for Likelihoods Depending on an Inner Product of Two Multivariate Random Variables
Supplementary material to Expectation Propagation for Likelihoods Depending on an Inner Product of Two Multivariate Random Variables Tomi Peltola tomi.peltola@aalto.fi Department of Biomedical Engineering
More informationChapter 5 continued. Chapter 5 sections
Chapter 5 sections Discrete univariate distributions: 5.2 Bernoulli and Binomial distributions Just skim 5.3 Hypergeometric distributions 5.4 Poisson distributions Just skim 5.5 Negative Binomial distributions
More informationEE4601 Communication Systems
EE4601 Communication Systems Week 2 Review of Probability, Important Distributions 0 c 2011, Georgia Institute of Technology (lect2 1) Conditional Probability Consider a sample space that consists of two
More informationSolutions to Homework 11
Solutions to Homework 11 Read the statement of Proposition 5.4 of Chapter 3, Section 5. Write a summary of the proof. Comment on the following details: Does the proof work if g is piecewise C 1? Or did
More informationXY model in 2D and 3D
XY model in 2D and 3D Gabriele Sicuro PhD school Galileo Galilei University of Pisa September 18, 2012 The XY model XY model in 2D and 3D; vortex loop expansion Part I The XY model, duality and loop expansion
More informationProblem set 7 Math 207A, Fall 2011 Solutions
Problem set 7 Math 207A, Fall 2011 s 1. Classify the equilibrium (x, y) = (0, 0) of the system x t = x, y t = y + x 2. Is the equilibrium hyperbolic? Find an equation for the trajectories in (x, y)- phase
More informationRadiation by a dielectric wedge
Radiation by a dielectric wedge A D Rawlins Department of Mathematical Sciences, Brunel University, United Kingdom Joe Keller,Cambridge,2-3 March, 2017. We shall consider the problem of determining the
More informationMath Homework 2
Math 73 Homework Due: September 8, 6 Suppose that f is holomorphic in a region Ω, ie an open connected set Prove that in any of the following cases (a) R(f) is constant; (b) I(f) is constant; (c) f is
More informationLinear Classification
Linear Classification Lili MOU moull12@sei.pku.edu.cn http://sei.pku.edu.cn/ moull12 23 April 2015 Outline Introduction Discriminant Functions Probabilistic Generative Models Probabilistic Discriminative
More informationA Probability Review
A Probability Review Outline: A probability review Shorthand notation: RV stands for random variable EE 527, Detection and Estimation Theory, # 0b 1 A Probability Review Reading: Go over handouts 2 5 in
More informationApplied Mathematics Masters Examination Fall 2016, August 18, 1 4 pm.
Applied Mathematics Masters Examination Fall 16, August 18, 1 4 pm. Each of the fifteen numbered questions is worth points. All questions will be graded, but your score for the examination will be the
More informationNonlinear Systems and Control Lecture # 19 Perturbed Systems & Input-to-State Stability
p. 1/1 Nonlinear Systems and Control Lecture # 19 Perturbed Systems & Input-to-State Stability p. 2/1 Perturbed Systems: Nonvanishing Perturbation Nominal System: Perturbed System: ẋ = f(x), f(0) = 0 ẋ
More informationMTH4101 CALCULUS II REVISION NOTES. 1. COMPLEX NUMBERS (Thomas Appendix 7 + lecture notes) ax 2 + bx + c = 0. x = b ± b 2 4ac 2a. i = 1.
MTH4101 CALCULUS II REVISION NOTES 1. COMPLEX NUMBERS (Thomas Appendix 7 + lecture notes) 1.1 Introduction Types of numbers (natural, integers, rationals, reals) The need to solve quadratic equations:
More informationlim n C1/n n := ρ. [f(y) f(x)], y x =1 [f(x) f(y)] [g(x) g(y)]. (x,y) E A E(f, f),
1 Part I Exercise 1.1. Let C n denote the number of self-avoiding random walks starting at the origin in Z of length n. 1. Show that (Hint: Use C n+m C n C m.) lim n C1/n n = inf n C1/n n := ρ.. Show that
More informationNotes, March 4, 2013, R. Dudley Maximum likelihood estimation: actual or supposed
18.466 Notes, March 4, 2013, R. Dudley Maximum likelihood estimation: actual or supposed 1. MLEs in exponential families Let f(x,θ) for x X and θ Θ be a likelihood function, that is, for present purposes,
More informationNotes on the Multivariate Normal and Related Topics
Version: July 10, 2013 Notes on the Multivariate Normal and Related Topics Let me refresh your memory about the distinctions between population and sample; parameters and statistics; population distributions
More informationApplications of Ito s Formula
CHAPTER 4 Applications of Ito s Formula In this chapter, we discuss several basic theorems in stochastic analysis. Their proofs are good examples of applications of Itô s formula. 1. Lévy s martingale
More informationF n = F n 1 + F n 2. F(z) = n= z z 2. (A.3)
Appendix A MATTERS OF TECHNIQUE A.1 Transform Methods Laplace transforms for continuum systems Generating function for discrete systems This method is demonstrated for the Fibonacci sequence F n = F n
More informationStat 5101 Notes: Algorithms
Stat 5101 Notes: Algorithms Charles J. Geyer January 22, 2016 Contents 1 Calculating an Expectation or a Probability 3 1.1 From a PMF........................... 3 1.2 From a PDF...........................
More informationREVIEW OF MAIN CONCEPTS AND FORMULAS A B = Ā B. Pr(A B C) = Pr(A) Pr(A B C) =Pr(A) Pr(B A) Pr(C A B)
REVIEW OF MAIN CONCEPTS AND FORMULAS Boolean algebra of events (subsets of a sample space) DeMorgan s formula: A B = Ā B A B = Ā B The notion of conditional probability, and of mutual independence of two
More information(x 3)(x + 5) = (x 3)(x 1) = x + 5. sin 2 x e ax bx 1 = 1 2. lim
SMT Calculus Test Solutions February, x + x 5 Compute x x x + Answer: Solution: Note that x + x 5 x x + x )x + 5) = x )x ) = x + 5 x x + 5 Then x x = + 5 = Compute all real values of b such that, for fx)
More informationE X A M. Probability Theory and Stochastic Processes Date: December 13, 2016 Duration: 4 hours. Number of pages incl.
E X A M Course code: Course name: Number of pages incl. front page: 6 MA430-G Probability Theory and Stochastic Processes Date: December 13, 2016 Duration: 4 hours Resources allowed: Notes: Pocket calculator,
More informationlaplace s method for ordinary differential equations
Physics 24 Spring 217 laplace s method for ordinary differential equations lecture notes, spring semester 217 http://www.phys.uconn.edu/ rozman/ourses/p24_17s/ Last modified: May 19, 217 It is possible
More informationFormulas for probability theory and linear models SF2941
Formulas for probability theory and linear models SF2941 These pages + Appendix 2 of Gut) are permitted as assistance at the exam. 11 maj 2008 Selected formulae of probability Bivariate probability Transforms
More informationMachine Learning. Support Vector Machines. Fabio Vandin November 20, 2017
Machine Learning Support Vector Machines Fabio Vandin November 20, 2017 1 Classification and Margin Consider a classification problem with two classes: instance set X = R d label set Y = { 1, 1}. Training
More information10 BIVARIATE DISTRIBUTIONS
BIVARIATE DISTRIBUTIONS After some discussion of the Normal distribution, consideration is given to handling two continuous random variables. The Normal Distribution The probability density function f(x)
More information04. Random Variables: Concepts
University of Rhode Island DigitalCommons@URI Nonequilibrium Statistical Physics Physics Course Materials 215 4. Random Variables: Concepts Gerhard Müller University of Rhode Island, gmuller@uri.edu Creative
More informationProbability and Distributions
Probability and Distributions What is a statistical model? A statistical model is a set of assumptions by which the hypothetical population distribution of data is inferred. It is typically postulated
More information3. Probability and Statistics
FE661 - Statistical Methods for Financial Engineering 3. Probability and Statistics Jitkomut Songsiri definitions, probability measures conditional expectations correlation and covariance some important
More informationLegendre-Fenchel transforms in a nutshell
1 2 3 Legendre-Fenchel transforms in a nutshell Hugo Touchette School of Mathematical Sciences, Queen Mary, University of London, London E1 4NS, UK Started: July 11, 2005; last compiled: August 14, 2007
More informationPROBABILITY: LIMIT THEOREMS II, SPRING HOMEWORK PROBLEMS
PROBABILITY: LIMIT THEOREMS II, SPRING 15. HOMEWORK PROBLEMS PROF. YURI BAKHTIN Instructions. You are allowed to work on solutions in groups, but you are required to write up solutions on your own. Please
More informationBivariate Distributions
Bivariate Distributions EGR 260 R. Van Til Industrial & Systems Engineering Dept. Copyright 2013. Robert P. Van Til. All rights reserved. 1 What s It All About? Many random processes produce Examples.»
More informationSupersymmetric gauge theory, representation schemes and random matrices
Supersymmetric gauge theory, representation schemes and random matrices Giovanni Felder, ETH Zurich joint work with Y. Berest, M. Müller-Lennert, S. Patotsky, A. Ramadoss and T. Willwacher MIT, 30 May
More informationAnalyse 3 NA, FINAL EXAM. * Monday, January 8, 2018, *
Analyse 3 NA, FINAL EXAM * Monday, January 8, 08, 4.00 7.00 * Motivate each answer with a computation or explanation. The maximum amount of points for this exam is 00. No calculators!. (Holomorphic functions)
More informationHankel Transforms - Lecture 10
1 Introduction Hankel Transforms - Lecture 1 The Fourier transform was used in Cartesian coordinates. If we have problems with cylindrical geometry we will need to use cylindtical coordinates. Thus suppose
More informationLecture 14: Multivariate mgf s and chf s
Lecture 14: Multivariate mgf s and chf s Multivariate mgf and chf For an n-dimensional random vector X, its mgf is defined as M X (t) = E(e t X ), t R n and its chf is defined as φ X (t) = E(e ıt X ),
More informationVerona Course April Lecture 1. Review of probability
Verona Course April 215. Lecture 1. Review of probability Viorel Barbu Al.I. Cuza University of Iaşi and the Romanian Academy A probability space is a triple (Ω, F, P) where Ω is an abstract set, F is
More informationconditional cdf, conditional pdf, total probability theorem?
6 Multiple Random Variables 6.0 INTRODUCTION scalar vs. random variable cdf, pdf transformation of a random variable conditional cdf, conditional pdf, total probability theorem expectation of a random
More information1. Stochastic Processes and filtrations
1. Stochastic Processes and 1. Stoch. pr., A stochastic process (X t ) t T is a collection of random variables on (Ω, F) with values in a measurable space (S, S), i.e., for all t, In our case X t : Ω S
More informationLecture 1: Pragmatic Introduction to Stochastic Differential Equations
Lecture 1: Pragmatic Introduction to Stochastic Differential Equations Simo Särkkä Aalto University, Finland (visiting at Oxford University, UK) November 13, 2013 Simo Särkkä (Aalto) Lecture 1: Pragmatic
More informationModelling and Estimation of Stochastic Dependence
Modelling and Estimation of Stochastic Dependence Uwe Schmock Based on joint work with Dr. Barbara Dengler Financial and Actuarial Mathematics and Christian Doppler Laboratory for Portfolio Risk Management
More informationMAS113 Introduction to Probability and Statistics. Proofs of theorems
MAS113 Introduction to Probability and Statistics Proofs of theorems Theorem 1 De Morgan s Laws) See MAS110 Theorem 2 M1 By definition, B and A \ B are disjoint, and their union is A So, because m is a
More informationLarge Deviations Theory, Induced Log-Plus and Max-Plus Measures and their Applications
Large Deviations Theory, Induced Log-Plus and Max-Plus Measures and their Applications WM McEneaney Department of Mathematics North Carolina State University Raleigh, NC 7695 85, USA wmm@mathncsuedu Charalambos
More informationDynamics on a General Stage Structured n Parallel Food Chains
Memorial University of Newfoundland Dynamics on a General Stage Structured n Parallel Food Chains Isam Al-Darabsah and Yuan Yuan November 4, 2013 Outline: Propose a general model with n parallel food chains
More informationPhD in Theoretical Particle Physics Academic Year 2017/2018
July 10, 017 SISSA Entrance Examination PhD in Theoretical Particle Physics Academic Year 017/018 S olve two among the four problems presented. Problem I Consider a quantum harmonic oscillator in one spatial
More information4 Pairs of Random Variables
B.Sc./Cert./M.Sc. Qualif. - Statistical Theory 4 Pairs of Random Variables 4.1 Introduction In this section, we consider a pair of r.v. s X, Y on (Ω, F, P), i.e. X, Y : Ω R. More precisely, we define a
More informationBASICS OF PROBABILITY
October 10, 2018 BASICS OF PROBABILITY Randomness, sample space and probability Probability is concerned with random experiments. That is, an experiment, the outcome of which cannot be predicted with certainty,
More informationBoulder School for Condensed Matter and Materials Physics. Laurette Tuckerman PMMH-ESPCI-CNRS
Boulder School for Condensed Matter and Materials Physics Laurette Tuckerman PMMH-ESPCI-CNRS laurette@pmmh.espci.fr Dynamical Systems: A Basic Primer 1 1 Basic bifurcations 1.1 Fixed points and linear
More informationSome asymptotic properties of solutions for Burgers equation in L p (R)
ARMA manuscript No. (will be inserted by the editor) Some asymptotic properties of solutions for Burgers equation in L p (R) PAULO R. ZINGANO Abstract We discuss time asymptotic properties of solutions
More information2.10 Saddles, Nodes, Foci and Centers
2.10 Saddles, Nodes, Foci and Centers In Section 1.5, a linear system (1 where x R 2 was said to have a saddle, node, focus or center at the origin if its phase portrait was linearly equivalent to one
More informationEstimation of the functional Weibull-tail coefficient
1/ 29 Estimation of the functional Weibull-tail coefficient Stéphane Girard Inria Grenoble Rhône-Alpes & LJK, France http://mistis.inrialpes.fr/people/girard/ June 2016 joint work with Laurent Gardes,
More informationExercises for Multivariable Differential Calculus XM521
This document lists all the exercises for XM521. The Type I (True/False) exercises will be given, and should be answered, online immediately following each lecture. The Type III exercises are to be done
More information