CIS 700: algorithms for Big Data
|
|
- Fay Cooper
- 6 years ago
- Views:
Transcription
1 CIS 700: algorthms for Bg Data Lecture 5: Dmenson Reducton Sldes at htt://grgory.us/bg-data-class.html Grgory Yaroslavtsev htt://grgory.us
2 Today Dmensonalty reducton AMS as dmensonalty reducton Johnson-Lndenstrauss transform
3 L -norm Estmaton Stream: m udates x, Δ n R that defne vector f where f j = :x =j Δ. Examle: For n = 4,3, 3, 0.5,,,,,,,,, (4,) f = (4,, 0.5, ) L -norm: f = f
4 L -norm Estmaton L -norm: f = f Two lectures ago: f 0 = F 0 -moment f = F -moment (va AMS sketchng) log n ε Sace: O log δ Technque: lnear sketches f 0 : S f for random sets S f : σ f for random sgns σ
5 AMS as dmensonalty reducton Mantan a lnear sketch vector Z = Z,, Z k = Rf Z = σ j f j, where σ j R *,+ j,n- Estmator Y for f : k k = Z = Rf k Dmensonalty reducton : x Rx, heavy tal Pr Y f c f k c
6 Normal Dstrbuton Normal dstrbuton N(0,) Range: (, + ) Densty: μ x = π e x Mean = 0, Varance = Basc facts: If X and Y are ndeendent r.v. wth normal dstrbuton then X + Y has normal dstrbuton Var cx = c Var,X- If X, Y are ndeendent, then Var X + Y = Var X + Var,Y-
7 Johnson-Lndenstrauss Transform Instead of ± let σ be..d. random varables from normal dstrbuton N(0,) Z = σ f We stll have E Z = f = f because: E σ E σ j = 0; E σ = varance of σ = Defne Z = (Z,, Z k ) and defne: Z = Zj j E Z = k f JL Lemma: There exsts C > 0 s.t. for small enough ε > 0: Pr Z k f > εk f ex Cε k
8 Proof of JL Lemma JL Lemma: C > 0 s.t. for small enough ε > 0: Pr Z k f > εk f ex Cε k Assume f =. We have Z = j σ j f and Z = (Z,, Z k ) E Z = k f = k Alternatve form of JL Lemma: Pr Z > k + ε ex ε k + O(k ε 3 )
9 Proof of JL Lemma Alternatve form of JL Lemma: Pr Z > k + ε ex ε k + O(k ε 3 ) Let Y = Z and α = k + ε For every s > 0 we have: Pr Y > α = Pr,e sy > e sα - By Markov and ndeendence of Z s: Pr e sy > e sα E esy We have Z N(0,), hence: e sα = e sα E e s Z = e sα E e sz E e sz = π e st e t dt = k = s
10 Proof of JL Lemma Alternatve form of JL Lemma: Pr Z > k + ε ex ε k + O(k ε 3 ) For every s > 0 we have: Pr Y > α e sα E e sz k = = e sα s k Let s = k and recall that α = k + ε α A calculaton fnshes the roof: Pr Y > α ex ε k + O(k ε 3 )
11 Johnson-Lndenstrauss Transform Sngle vector: k = O log δ ε Tght: k = Ω log δ ε [Woodruff 0] n vectors smultaneously: k = O log n log δ ε Tght: k = Ω log n log δ ε [Molnaro, Woodruff, Y. 3] Dstances between n vectors = O(n ) vectors: k = O log n log δ ε
12 Random Varables and Norms For a random varable X and let: / X = E X Facts: For any c: cx = c X s a norm (Mnkowsk s nequalty) q for q (Monotoncty of norms) Jensen s nequalty (used a lot for F = x ): If F s convex then F E X E[F(X)]
13 Khntchne Inequalty [Khntchne]For, x R n and σ..d. Rademachers: σ x x For r (ether σ or g N 0, ) exand E r x All odd owers of r are zero All even moments for σ are, and for g are σ x g x g x N 0, x g x x
14 Symmetrzaton [Symmetrzaton]: If Z,, Z n are ndeendent and σ are..d. Rademachers: Z E Z σ Z Let Y Y n be ndeendent wth the same dstrbuton as Z Z E Z = Z E Y Y Z Y (Jensen) = σ Z Y (Z Y are ndeendent and symmetrc) σ Z (trangle nequalty)
15 Decoulng Let x, x n be ndeendent wth mean 0 and x, x n dentcally dstrbuted as x and ndeendent of them. For any a j and : j a j x x j 4 a j x x j Let η,, η n be..d. Bernoulls (0/ w.. /): j a j x x j,j = 4 E η j a j x x j η ( η j ) 4 j a j x x j η ( η j ) (Jensen) There exsts η 0, n such that: j a j x x j η ( η j ) where S = η =. S j S a jx x j
16 Decoulng (contnued) Let x S be an S-dmensonal vector of x for S. S j S a j x x j = S j S a jx x j = E xs E x S,j a j x x (E x j = E x = 0),j a j x x j (Jensen) Overall: a j x x j 4 a j x x j j,j
17 Hanson-Wrght Inequalty For σ,, σ n ndeendent Rademachers and A R n n real and symmetrc for all : σ T Aσ E σ T Aσ A F + A Recall: A F = j a j = Tr(A T A) A = su v 0 Av v
18 Hanson-Wrght Inequalty For σ,, σ n ndeendent Rademachers and A R n n real and symmetrc for all : σ T Aσ E σ T Aσ A F + A σ T Aσ E σ T Aσ σ T Aσ (decoulng) Aσ (Khntchne) = Aσ Aσ / (monotoncty of norms)
19 Hanson-Wrght (contnued) Aσ Aσ E Aσ + Aσ E Aσ = A F + Aσ E Aσ A F + Aσ E Aσ A F + σ T A T Aσ (decoulng) (trangle neq.) A F A T / Aσ (Khntchne) A F A Ax
20 Hanson-Wrght (contnued) Aσ A F A Let E = Ax then E C 4 A Aσ E C A F 0 E larger root of the quadratc equaton above E A F + A (Hanson-Wrght) For σ,, σ n ndeendent Rademachers and A R n n real and symmetrc for all : σ T Aσ E σ T Aσ A F + A
21 Reca For a random varable X and let: X = E X / [Khntchne]For, x R n and σ..d. Rademachers: σ x x [Symmetrzaton]: If Z,, Z n are ndeendent and σ are..d. Rademachers: Z E Z σ Z [Hanson-Wrght]For σ,, σ n ndeendent Rademachers and A R n n real and symmetrc for all : σ T Aσ E σ T Aσ A F + A
22 Bernsten Inequalty Let X,, X n be nde. r.v s such that X K almost surely and E X σ. For all : X E X X E X σ X ( X ) (Khntchne) = X σ + K (symmetrzaton) σ + X E X / (trangle nequalty)
23 Bernsten Inequalty (cont.) X E X ( X ) σ + X σ + σ X σ X 4 / E X σ K X / (symmetrzaton) (Khntchne)
24 Bernsten Inequalty (cont.) Let E = ( X ) then for some C > 0: E C 4 KE Cσ 0 E larger root of ths quadratc equaton E σ + K [Bernsten] Let X,, X n be nde. r.v s such that X K almost surely and E X σ. For all : X E X σ + K
25 Sarse Johnson-Lndenstrauss Transform Let Π R m n be a JL-matrx where m = O ε log δ whch satsfes for x = : Pr Π Πx ε δ Takes O m x 0 tme to comute JL Would be O s x 0 tme Π only had s nonzero entres er column
26 Basc Sarse JL Transform Pck -wse nde. hash functon n m Pck 4-wse nde. hash functon σ n *,+ For each,n- let Π h, = σ, the rest are 0 [Thoru, Zhang ]: Ths s JL f m ε δ Best ossble snce s = Analyss: standard exectaton/varance usng bounded ndeendence + Chebyshev To mrove m let s use Hanson-Wrght (hgher moment than Chebyshev s second)
27 Sarse JL Transform: Constructon Π r, = η r, σ r, / s, where η are Bernoulls and σ are Rademachers For all r, : E η r, = s m For all : η r, = s (s non-zeros er column) η r, are negatvely correlated: E η r, r, S E η r, r, S = s m Each column chosen unformly from Bnom m, s columns of weght s works here S
28 By Hanson-Wrght: Z A x,η F + A x,η Sarse JL Transform: Analyss Thm [KN 4]: If m = O log and s εm: ε δ x: x =, Pr Πx Π ε δ Z = Πx = s m r= j η r, η r,j σ r, σ r,j x x j σ T A x,η σ A x,η s a block-dagonal matrx wth m blocks where r-th block s s x r x r T but wth zeros on the dagonal x r s a vector wth entres x r = η r, x A x,η F + A x,η
29 Sarse JL Transform: Analyss (Oerator norm) Snce A x,η s block-dagonal A x,η s the largest norm of any block Egenvalues n the r-th block are at most A x,η s s max x r, x r s
30 Sarse JL Transform: Analyss m Defne Q,j = r= η r, η r,j so that: A x,η F = /s x x j Q,j j Lemma: If s /m then, j Q,j A x,η F = A x,η F s x x j Q,j s j j / m x x j Q,j / (trangle neq.)
31 Sarse JL Transform: Analyss By Markov (m = O s ε log /δ, s εm, ): m Pr, Πx > ε- = Pr σ T A x,η σ > ε ε E, σ T A x,η σ - (Markov) ε C m + s = ε C ε + ε δ
32 Sarse JL Transform: Analyss Lemma: If s /m then, j Q,j Suose η a,,, η as, are all where a < < a s. s Note that Q j = t= Y t where t s an ndcator r.v. for the event η at,j =. Y t s are not nde. but negatvely correlated -th moment at most -th moments of..d. Bernoulls wth exectaton s (exand Y m t t and comare term by term) By Bernsten nequalty: Q j = t Y t s m +
33 FFT-based Fast JL-Transform [Alon, Chazelle 09] Runnng tme O(n log n) Defne Π R m n as Π = m S H D S = m n samlng matrx (wth relacement) H = unnormalzed bounded orthonormal system,.e. H R n n ; H T H = I; max H,j,j D = dag(α) for (α,, α n )..d. Rademachers If H = Hadamard matrx O(n log n) tme to comute Πx
34 FFT-based Fast JL-Transform Change S to S η = dag(η,, η n ) where η are Bernoulls wth exectaton E η = m/n [CNW 5] If Π = S m ηhd, m ε log log δ εδ x: x =, Pr Π Πx ε δ Let z = HDx so Πx = η m z Wll show that m n = η z s small
35 FFT-based Fast JL-Transform n m η z m = m m m η z 4 (max max max / m σ η z (Khntchne) η z ) η z η z η z m n η z / = η z symmetrzaton + (trangle nequalty)
36 FFT-based Fast JL-Transform max max η z η z max q η z = Eα,η max E α,η,η z q - q (n max = (n max for q = max (, log m) q η z q /q E α,η E α,η η z q ) q E η η E α,z q -) q = (m max max z (m q by choce of q) q = max z q q (Khntchne) E α,z q -) q η z q q =
37 FFT-based Fast JL-Transform Let E = m n = η z E C q m E C q m 0 E max q, q m m Markov: Pr Π Πx ε ε E δ = log /δ and m ε log δ log m δ
Dimensionality Reduction Notes 2
Dmensonalty Reducton Notes 2 Jelan Nelson mnlek@seas.harvard.edu August 11, 2015 1 Optmalty theorems for JL Yesterday we saw for MJL that we could acheve target dmenson m = O(ε 2 log N), and for DJL we
More informationDimensionality Reduction Notes 1
Dmensonalty Reducton Notes 1 Jelan Nelson mnlek@seas.harvard.edu August 10, 2015 1 Prelmnares Here we collect some notaton and basc lemmas used throughout ths note. Throughout, for a random varable X,
More informationMining Data Streams-Estimating Frequency Moment
Mnng Data Streams-Estmatng Frequency Moment Barna Saha October 26, 2017 Frequency Moment Computng moments nvolves dstrbuton of frequences of dfferent elements n the stream. Frequency Moment Computng moments
More informationMatrix Approximation via Sampling, Subspace Embedding. 1 Solving Linear Systems Using SVD
Matrx Approxmaton va Samplng, Subspace Embeddng Lecturer: Anup Rao Scrbe: Rashth Sharma, Peng Zhang 0/01/016 1 Solvng Lnear Systems Usng SVD Two applcatons of SVD have been covered so far. Today we loo
More informationU.C. Berkeley CS294: Beyond Worst-Case Analysis Luca Trevisan September 5, 2017
U.C. Berkeley CS94: Beyond Worst-Case Analyss Handout 4s Luca Trevsan September 5, 07 Summary of Lecture 4 In whch we ntroduce semdefnte programmng and apply t to Max Cut. Semdefnte Programmng Recall that
More informationLecture 4: Universal Hash Functions/Streaming Cont d
CSE 5: Desgn and Analyss of Algorthms I Sprng 06 Lecture 4: Unversal Hash Functons/Streamng Cont d Lecturer: Shayan Oves Gharan Aprl 6th Scrbe: Jacob Schreber Dsclamer: These notes have not been subjected
More informationLecture 4: September 12
36-755: Advanced Statstcal Theory Fall 016 Lecture 4: September 1 Lecturer: Alessandro Rnaldo Scrbe: Xao Hu Ta Note: LaTeX template courtesy of UC Berkeley EECS dept. Dsclamer: These notes have not been
More information6.842 Randomness and Computation February 18, Lecture 4
6.842 Randomness and Computaton February 18, 2014 Lecture 4 Lecturer: Rontt Rubnfeld Scrbe: Amartya Shankha Bswas Topcs 2-Pont Samplng Interactve Proofs Publc cons vs Prvate cons 1 Two Pont Samplng 1.1
More informationprinceton univ. F 13 cos 521: Advanced Algorithm Design Lecture 3: Large deviations bounds and applications Lecturer: Sanjeev Arora
prnceton unv. F 13 cos 521: Advanced Algorthm Desgn Lecture 3: Large devatons bounds and applcatons Lecturer: Sanjeev Arora Scrbe: Today s topc s devaton bounds: what s the probablty that a random varable
More informationCSCI B609: Foundations of Data Science
CSCI B609: Foundatons of Data Scence Lecture 13/14: Gradent Descent, Boostng and Learnng from Experts Sldes at http://grgory.us/data-scence-class.html Grgory Yaroslavtsev http://grgory.us Constraned Convex
More informationEigenvalues of Random Graphs
Spectral Graph Theory Lecture 2 Egenvalues of Random Graphs Danel A. Spelman November 4, 202 2. Introducton In ths lecture, we consder a random graph on n vertces n whch each edge s chosen to be n the
More informationCSCE 790S Background Results
CSCE 790S Background Results Stephen A. Fenner September 8, 011 Abstract These results are background to the course CSCE 790S/CSCE 790B, Quantum Computaton and Informaton (Sprng 007 and Fall 011). Each
More information18.1 Introduction and Recap
CS787: Advanced Algorthms Scrbe: Pryananda Shenoy and Shjn Kong Lecturer: Shuch Chawla Topc: Streamng Algorthmscontnued) Date: 0/26/2007 We contnue talng about streamng algorthms n ths lecture, ncludng
More informationNotes on Frequency Estimation in Data Streams
Notes on Frequency Estmaton n Data Streams In (one of) the data streamng model(s), the data s a sequence of arrvals a 1, a 2,..., a m of the form a j = (, v) where s the dentty of the tem and belongs to
More informationChapter 7 Generalized and Weighted Least Squares Estimation. In this method, the deviation between the observed and expected values of
Chapter 7 Generalzed and Weghted Least Squares Estmaton The usual lnear regresson model assumes that all the random error components are dentcally and ndependently dstrbuted wth constant varance. When
More informationAn introduction to chaining, and applications to sublinear algorithms
An ntroducton to channg, and applcatons to sublnear algorthms Jelan Nelson Harvard August 28, 2015 What s ths talk about? What s ths talk about? Gven a collecton of random varables X 1, X 2,...,, we would
More informationOutline. Multivariate Parametric Methods. Multivariate Data. Basic Multivariate Statistics. Steven J Zeil
Outlne Multvarate Parametrc Methods Steven J Zel Old Domnon Unv. Fall 2010 1 Multvarate Data 2 Multvarate ormal Dstrbuton 3 Multvarate Classfcaton Dscrmnants Tunng Complexty Dscrete Features 4 Multvarate
More informationU.C. Berkeley CS294: Spectral Methods and Expanders Handout 8 Luca Trevisan February 17, 2016
U.C. Berkeley CS94: Spectral Methods and Expanders Handout 8 Luca Trevsan February 7, 06 Lecture 8: Spectral Algorthms Wrap-up In whch we talk about even more generalzatons of Cheeger s nequaltes, and
More informationFirst Year Examination Department of Statistics, University of Florida
Frst Year Examnaton Department of Statstcs, Unversty of Florda May 7, 010, 8:00 am - 1:00 noon Instructons: 1. You have four hours to answer questons n ths examnaton.. You must show your work to receve
More informationFeb 14: Spatial analysis of data fields
Feb 4: Spatal analyss of data felds Mappng rregularly sampled data onto a regular grd Many analyss technques for geophyscal data requre the data be located at regular ntervals n space and/or tme. hs s
More informationAPPENDIX A Some Linear Algebra
APPENDIX A Some Lnear Algebra The collecton of m, n matrces A.1 Matrces a 1,1,..., a 1,n A = a m,1,..., a m,n wth real elements a,j s denoted by R m,n. If n = 1 then A s called a column vector. Smlarly,
More informationPrimer on High-Order Moment Estimators
Prmer on Hgh-Order Moment Estmators Ton M. Whted July 2007 The Errors-n-Varables Model We wll start wth the classcal EIV for one msmeasured regressor. The general case s n Erckson and Whted Econometrc
More informationLecture 5 September 17, 2015
CS 229r: Algorthms for Bg Data Fall 205 Prof. Jelan Nelson Lecture 5 September 7, 205 Scrbe: Yakr Reshef Recap and overvew Last tme we dscussed the problem of norm estmaton for p-norms wth p > 2. We had
More informationHANSON-WRIGHT INEQUALITY AND SUB-GAUSSIAN CONCENTRATION
HANSON-WRIGHT INEQUALITY AND SUB-GAUSSIAN CONCENTRATION MARK RUDELSON AND ROMAN VERSHYNIN Abstract. In ths expostory note, we gve a modern proof of Hanson-Wrght nequalty for quadratc forms n sub-gaussan
More informationSupplementary material: Margin based PU Learning. Matrix Concentration Inequalities
Supplementary materal: Margn based PU Learnng We gve the complete proofs of Theorem and n Secton We frst ntroduce the well-known concentraton nequalty, so the covarance estmator can be bounded Then we
More informationStanford University CS359G: Graph Partitioning and Expanders Handout 4 Luca Trevisan January 13, 2011
Stanford Unversty CS359G: Graph Parttonng and Expanders Handout 4 Luca Trevsan January 3, 0 Lecture 4 In whch we prove the dffcult drecton of Cheeger s nequalty. As n the past lectures, consder an undrected
More informationErrors for Linear Systems
Errors for Lnear Systems When we solve a lnear system Ax b we often do not know A and b exactly, but have only approxmatons  and ˆb avalable. Then the best thng we can do s to solve ˆx ˆb exactly whch
More informationMixture of Gaussians Expectation Maximization (EM) Part 2
Mture of Gaussans Eectaton Mamaton EM Part 2 Most of the sldes are due to Chrstoher Bsho BCS Summer School Eeter 2003. The rest of the sldes are based on lecture notes by A. Ng Lmtatons of K-means Hard
More informationU.C. Berkeley CS294: Beyond Worst-Case Analysis Handout 6 Luca Trevisan September 12, 2017
U.C. Berkeley CS94: Beyond Worst-Case Analyss Handout 6 Luca Trevsan September, 07 Scrbed by Theo McKenze Lecture 6 In whch we study the spectrum of random graphs. Overvew When attemptng to fnd n polynomal
More informationInterval Estimation in the Classical Normal Linear Regression Model. 1. Introduction
ECONOMICS 35* -- NOTE 7 ECON 35* -- NOTE 7 Interval Estmaton n the Classcal Normal Lnear Regresson Model Ths note outlnes the basc elements of nterval estmaton n the Classcal Normal Lnear Regresson Model
More informationDr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur
Analyss of Varance and Desgn of Exerments-I MODULE III LECTURE - 2 EXPERIMENTAL DESIGN MODELS Dr. Shalabh Deartment of Mathematcs and Statstcs Indan Insttute of Technology Kanur 2 We consder the models
More informationMaximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models
ECO 452 -- OE 4: Probt and Logt Models ECO 452 -- OE 4 Maxmum Lkelhood Estmaton of Bnary Dependent Varables Models: Probt and Logt hs note demonstrates how to formulate bnary dependent varables models
More information1 Convex Optimization
Convex Optmzaton We wll consder convex optmzaton problems. Namely, mnmzaton problems where the objectve s convex (we assume no constrants for now). Such problems often arse n machne learnng. For example,
More informationMaximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models
ECO 452 -- OE 4: Probt and Logt Models ECO 452 -- OE 4 Mamum Lkelhood Estmaton of Bnary Dependent Varables Models: Probt and Logt hs note demonstrates how to formulate bnary dependent varables models for
More informationSketching Sampled Data Streams
Sketchng Sampled Data Streams Florn Rusu and Aln Dobra CISE Department Unversty of Florda March 31, 2009 Motvaton & Goal Motvaton Multcore processors How to use all the processng power? Parallel algorthms
More informationSTAT 309: MATHEMATICAL COMPUTATIONS I FALL 2018 LECTURE 16
STAT 39: MATHEMATICAL COMPUTATIONS I FALL 218 LECTURE 16 1 why teratve methods f we have a lnear system Ax = b where A s very, very large but s ether sparse or structured (eg, banded, Toepltz, banded plus
More informationLecture 3 January 31, 2017
CS 224: Advanced Algorthms Sprng 207 Prof. Jelan Nelson Lecture 3 January 3, 207 Scrbe: Saketh Rama Overvew In the last lecture we covered Y-fast tres and Fuson Trees. In ths lecture we start our dscusson
More informationHidden Markov Model Cheat Sheet
Hdden Markov Model Cheat Sheet (GIT ID: dc2f391536d67ed5847290d5250d4baae103487e) Ths document s a cheat sheet on Hdden Markov Models (HMMs). It resembles lecture notes, excet that t cuts to the chase
More informationDISCRIMINANTS AND RAMIFIED PRIMES. 1. Introduction A prime number p is said to be ramified in a number field K if the prime ideal factorization
DISCRIMINANTS AND RAMIFIED PRIMES KEITH CONRAD 1. Introducton A prme number p s sad to be ramfed n a number feld K f the prme deal factorzaton (1.1) (p) = po K = p e 1 1 peg g has some e greater than 1.
More informationHowever, since P is a symmetric idempotent matrix, of P are either 0 or 1 [Eigen-values
Fall 007 Soluton to Mdterm Examnaton STAT 7 Dr. Goel. [0 ponts] For the general lnear model = X + ε, wth uncorrelated errors havng mean zero and varance σ, suppose that the desgn matrx X s not necessarly
More informationSolutions Homework 4 March 5, 2018
1 Solutons Homework 4 March 5, 018 Soluton to Exercse 5.1.8: Let a IR be a translaton and c > 0 be a re-scalng. ˆb1 (cx + a) cx n + a (cx 1 + a) c x n x 1 cˆb 1 (x), whch shows ˆb 1 s locaton nvarant and
More informationMATH 241B FUNCTIONAL ANALYSIS - NOTES EXAMPLES OF C ALGEBRAS
MATH 241B FUNCTIONAL ANALYSIS - NOTES EXAMPLES OF C ALGEBRAS These are nformal notes whch cover some of the materal whch s not n the course book. The man purpose s to gve a number of nontrval examples
More informationAlgorithms for factoring
CSA E0 235: Crytograhy Arl 9,2015 Instructor: Arta Patra Algorthms for factorng Submtted by: Jay Oza, Nranjan Sngh Introducton Factorsaton of large ntegers has been a wdely studed toc manly because of
More information10-701/ Machine Learning, Fall 2005 Homework 3
10-701/15-781 Machne Learnng, Fall 2005 Homework 3 Out: 10/20/05 Due: begnnng of the class 11/01/05 Instructons Contact questons-10701@autonlaborg for queston Problem 1 Regresson and Cross-valdaton [40
More informationBasically, if you have a dummy dependent variable you will be estimating a probability.
ECON 497: Lecture Notes 13 Page 1 of 1 Metropoltan State Unversty ECON 497: Research and Forecastng Lecture Notes 13 Dummy Dependent Varable Technques Studenmund Chapter 13 Bascally, f you have a dummy
More informationChat eld, C. and A.J.Collins, Introduction to multivariate analysis. Chapman & Hall, 1980
MT07: Multvarate Statstcal Methods Mke Tso: emal mke.tso@manchester.ac.uk Webpage for notes: http://www.maths.manchester.ac.uk/~mkt/new_teachng.htm. Introducton to multvarate data. Books Chat eld, C. and
More informationMATH 829: Introduction to Data Mining and Analysis The EM algorithm (part 2)
1/16 MATH 829: Introducton to Data Mnng and Analyss The EM algorthm (part 2) Domnque Gullot Departments of Mathematcal Scences Unversty of Delaware Aprl 20, 2016 Recall 2/16 We are gven ndependent observatons
More informationStanford University CS254: Computational Complexity Notes 7 Luca Trevisan January 29, Notes for Lecture 7
Stanford Unversty CS54: Computatonal Complexty Notes 7 Luca Trevsan January 9, 014 Notes for Lecture 7 1 Approxmate Countng wt an N oracle We complete te proof of te followng result: Teorem 1 For every
More informationHANSON-WRIGHT INEQUALITY AND SUB-GAUSSIAN CONCENTRATION
HANSON-WRIGHT INEQUALITY AND SUB-GAUSSIAN CONCENTRATION MARK RUDELSON AND ROMAN VERSHYNIN Abstract. In ths expostory note, we gve a modern proof of Hanson-Wrght nequalty for quadratc forms n sub-gaussan
More informationMore metrics on cartesian products
More metrcs on cartesan products If (X, d ) are metrc spaces for 1 n, then n Secton II4 of the lecture notes we defned three metrcs on X whose underlyng topologes are the product topology The purpose of
More informationTAIL BOUNDS FOR SUMS OF GEOMETRIC AND EXPONENTIAL VARIABLES
TAIL BOUNDS FOR SUMS OF GEOMETRIC AND EXPONENTIAL VARIABLES SVANTE JANSON Abstract. We gve explct bounds for the tal probabltes for sums of ndependent geometrc or exponental varables, possbly wth dfferent
More informationPh 219a/CS 219a. Exercises Due: Wednesday 23 October 2013
1 Ph 219a/CS 219a Exercses Due: Wednesday 23 October 2013 1.1 How far apart are two quantum states? Consder two quantum states descrbed by densty operators ρ and ρ n an N-dmensonal Hlbert space, and consder
More information10-801: Advanced Optimization and Randomized Methods Lecture 2: Convex functions (Jan 15, 2014)
0-80: Advanced Optmzaton and Randomzed Methods Lecture : Convex functons (Jan 5, 04) Lecturer: Suvrt Sra Addr: Carnege Mellon Unversty, Sprng 04 Scrbes: Avnava Dubey, Ahmed Hefny Dsclamer: These notes
More informationHidden Markov Models & The Multivariate Gaussian (10/26/04)
CS281A/Stat241A: Statstcal Learnng Theory Hdden Markov Models & The Multvarate Gaussan (10/26/04) Lecturer: Mchael I. Jordan Scrbes: Jonathan W. Hu 1 Hdden Markov Models As a bref revew, hdden Markov models
More informationIntroduction to Algorithms
Introducton to Algorthms 6.046J/8.40J Lecture 7 Prof. Potr Indyk Data Structures Role of data structures: Encapsulate data Support certan operatons (e.g., INSERT, DELETE, SEARCH) Our focus: effcency of
More informationLecture 21: Numerical methods for pricing American type derivatives
Lecture 21: Numercal methods for prcng Amercan type dervatves Xaoguang Wang STAT 598W Aprl 10th, 2014 (STAT 598W) Lecture 21 1 / 26 Outlne 1 Fnte Dfference Method Explct Method Penalty Method (STAT 598W)
More informationMATH 281A: Homework #6
MATH 28A: Homework #6 Jongha Ryu Due date: November 8, 206 Problem. (Problem 2..2. Soluton. If X,..., X n Bern(p, then T = X s a complete suffcent statstc. Our target s g(p = p, and the nave guess suggested
More informationMath 217 Fall 2013 Homework 2 Solutions
Math 17 Fall 013 Homework Solutons Due Thursday Sept. 6, 013 5pm Ths homework conssts of 6 problems of 5 ponts each. The total s 30. You need to fully justfy your answer prove that your functon ndeed has
More informationLecture 10: May 6, 2013
TTIC/CMSC 31150 Mathematcal Toolkt Sprng 013 Madhur Tulsan Lecture 10: May 6, 013 Scrbe: Wenje Luo In today s lecture, we manly talked about random walk on graphs and ntroduce the concept of graph expander,
More informationThe Geometry of Logit and Probit
The Geometry of Logt and Probt Ths short note s meant as a supplement to Chapters and 3 of Spatal Models of Parlamentary Votng and the notaton and reference to fgures n the text below s to those two chapters.
More informationDifferentiating Gaussian Processes
Dfferentatng Gaussan Processes Andrew McHutchon Aprl 17, 013 1 Frst Order Dervatve of the Posteror Mean The posteror mean of a GP s gven by, f = x, X KX, X 1 y x, X α 1 Only the x, X term depends on the
More informationSingular Value Decomposition: Theory and Applications
Sngular Value Decomposton: Theory and Applcatons Danel Khashab Sprng 2015 Last Update: March 2, 2015 1 Introducton A = UDV where columns of U and V are orthonormal and matrx D s dagonal wth postve real
More informationStanford University Graph Partitioning and Expanders Handout 3 Luca Trevisan May 8, 2013
Stanford Unversty Graph Parttonng and Expanders Handout 3 Luca Trevsan May 8, 03 Lecture 3 In whch we analyze the power method to approxmate egenvalues and egenvectors, and we descrbe some more algorthmc
More informationEffects of Ignoring Correlations When Computing Sample Chi-Square. John W. Fowler February 26, 2012
Effects of Ignorng Correlatons When Computng Sample Ch-Square John W. Fowler February 6, 0 It can happen that ch-square must be computed for a sample whose elements are correlated to an unknown extent.
More informationStatistical pattern recognition
Statstcal pattern recognton Bayes theorem Problem: decdng f a patent has a partcular condton based on a partcular test However, the test s mperfect Someone wth the condton may go undetected (false negatve
More informationxp(x µ) = 0 p(x = 0 µ) + 1 p(x = 1 µ) = µ
CSE 455/555 Sprng 2013 Homework 7: Parametrc Technques Jason J. Corso Computer Scence and Engneerng SUY at Buffalo jcorso@buffalo.edu Solutons by Yngbo Zhou Ths assgnment does not need to be submtted and
More informationRandomness and Computation
Randomness and Computaton or, Randomzed Algorthms Mary Cryan School of Informatcs Unversty of Ednburgh RC 208/9) Lecture 0 slde Balls n Bns m balls, n bns, and balls thrown unformly at random nto bns usually
More informationCIS526: Machine Learning Lecture 3 (Sept 16, 2003) Linear Regression. Preparation help: Xiaoying Huang. x 1 θ 1 output... θ M x M
CIS56: achne Learnng Lecture 3 (Sept 6, 003) Preparaton help: Xaoyng Huang Lnear Regresson Lnear regresson can be represented by a functonal form: f(; θ) = θ 0 0 +θ + + θ = θ = 0 ote: 0 s a dummy attrbute
More informationPerron Vectors of an Irreducible Nonnegative Interval Matrix
Perron Vectors of an Irreducble Nonnegatve Interval Matrx Jr Rohn August 4 2005 Abstract As s well known an rreducble nonnegatve matrx possesses a unquely determned Perron vector. As the man result of
More informationComputation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models
Computaton of Hgher Order Moments from Two Multnomal Overdsperson Lkelhood Models BY J. T. NEWCOMER, N. K. NEERCHAL Department of Mathematcs and Statstcs, Unversty of Maryland, Baltmore County, Baltmore,
More informationGMM Method (Single-equation) Pongsa Pornchaiwiseskul Faculty of Economics Chulalongkorn University
GMM Method (Sngle-equaton Pongsa Pornchawsesul Faculty of Economcs Chulalongorn Unversty Stochastc ( Gven that, for some, s random COV(, ε E(( µ ε E( ε µ E( ε E( ε (c Pongsa Pornchawsesul, Faculty of Economcs,
More informationWhich Separator? Spring 1
Whch Separator? 6.034 - Sprng 1 Whch Separator? Mamze the margn to closest ponts 6.034 - Sprng Whch Separator? Mamze the margn to closest ponts 6.034 - Sprng 3 Margn of a pont " # y (w $ + b) proportonal
More informationStrong Markov property: Same assertion holds for stopping times τ.
Brownan moton Let X ={X t : t R + } be a real-valued stochastc process: a famlty of real random varables all defned on the same probablty space. Defne F t = nformaton avalable by observng the process up
More informationTwo-factor model. Statistical Models. Least Squares estimation in LM two-factor model. Rats
tatstcal Models Lecture nalyss of Varance wo-factor model Overall mean Man effect of factor at level Man effect of factor at level Y µ + α + β + γ + ε Eε f (, ( l, Cov( ε, ε ) lmr f (, nteracton effect
More informationBezier curves. Michael S. Floater. August 25, These notes provide an introduction to Bezier curves. i=0
Bezer curves Mchael S. Floater August 25, 211 These notes provde an ntroducton to Bezer curves. 1 Bernsten polynomals Recall that a real polynomal of a real varable x R, wth degree n, s a functon of the
More informationLecture 3. Ax x i a i. i i
18.409 The Behavor of Algorthms n Practce 2/14/2 Lecturer: Dan Spelman Lecture 3 Scrbe: Arvnd Sankar 1 Largest sngular value In order to bound the condton number, we need an upper bound on the largest
More informationHanson-Wright inequality and sub-gaussian concentration
Electron. Commun. Probab. 18 (013, no. 8, 1 9. DOI: 10.114/ECP.v18-865 ISSN: 1083-589X ELECTRONIC COMMUNICATIONS n PROBABILITY Hanson-Wrght nequalty and sub-gaussan concentraton Mark Rudelson Roman Vershynn
More informationChapter 5. Solution of System of Linear Equations. Module No. 6. Solution of Inconsistent and Ill Conditioned Systems
Numercal Analyss by Dr. Anta Pal Assstant Professor Department of Mathematcs Natonal Insttute of Technology Durgapur Durgapur-713209 emal: anta.bue@gmal.com 1 . Chapter 5 Soluton of System of Lnear Equatons
More informationError Bars in both X and Y
Error Bars n both X and Y Wrong ways to ft a lne : 1. y(x) a x +b (σ x 0). x(y) c y + d (σ y 0) 3. splt dfference between 1 and. Example: Prmordal He abundance: Extrapolate ft lne to [ O / H ] 0. [ He
More information8.4 COMPLEX VECTOR SPACES AND INNER PRODUCTS
SECTION 8.4 COMPLEX VECTOR SPACES AND INNER PRODUCTS 493 8.4 COMPLEX VECTOR SPACES AND INNER PRODUCTS All the vector spaces you have studed thus far n the text are real vector spaces because the scalars
More informationMLE and Bayesian Estimation. Jie Tang Department of Computer Science & Technology Tsinghua University 2012
MLE and Bayesan Estmaton Je Tang Department of Computer Scence & Technology Tsnghua Unversty 01 1 Lnear Regresson? As the frst step, we need to decde how we re gong to represent the functon f. One example:
More informationCHALMERS, GÖTEBORGS UNIVERSITET. SOLUTIONS to RE-EXAM for ARTIFICIAL NEURAL NETWORKS. COURSE CODES: FFR 135, FIM 720 GU, PhD
CHALMERS, GÖTEBORGS UNIVERSITET SOLUTIONS to RE-EXAM for ARTIFICIAL NEURAL NETWORKS COURSE CODES: FFR 35, FIM 72 GU, PhD Tme: Place: Teachers: Allowed materal: Not allowed: January 2, 28, at 8 3 2 3 SB
More informationComparison of the Population Variance Estimators. of 2-Parameter Exponential Distribution Based on. Multiple Criteria Decision Making Method
Appled Mathematcal Scences, Vol. 7, 0, no. 47, 07-0 HIARI Ltd, www.m-hkar.com Comparson of the Populaton Varance Estmators of -Parameter Exponental Dstrbuton Based on Multple Crtera Decson Makng Method
More informationPattern Classification
attern Classfcaton All materals n these sldes were taken from attern Classfcaton nd ed by R. O. Duda,. E. Hart and D. G. Stork, John Wley & Sons, 000 wth the ermsson of the authors and the ublsher Chater
More informationFall 2012 Analysis of Experimental Measurements B. Eisenstein/rev. S. Errede
Fall 0 Analyss of Expermental easurements B. Esensten/rev. S. Errede We now reformulate the lnear Least Squares ethod n more general terms, sutable for (eventually extendng to the non-lnear case, and also
More informationSUPPLEMENT TO ROP: MATRIX RECOVERY VIA RANK-ONE PROJECTIONS 1
SUPPLEMENT TO ROP: MATRIX RECOVERY VIA RANK-ONE PROJECTIONS BY T. TONY CAI and ANRU ZHANG Deartment of Statstcs The Wharton School Unversty of Pennsylvana We rove n ths sulement the techncal lemmas used
More informationNot-for-Publication Appendix to Optimal Asymptotic Least Aquares Estimation in a Singular Set-up
Not-for-Publcaton Aendx to Otmal Asymtotc Least Aquares Estmaton n a Sngular Set-u Antono Dez de los Ros Bank of Canada dezbankofcanada.ca December 214 A Proof of Proostons A.1 Proof of Prooston 1 Ts roof
More informationBézier curves. Michael S. Floater. September 10, These notes provide an introduction to Bézier curves. i=0
Bézer curves Mchael S. Floater September 1, 215 These notes provde an ntroducton to Bézer curves. 1 Bernsten polynomals Recall that a real polynomal of a real varable x R, wth degree n, s a functon of
More informationQuantifying Uncertainty
Partcle Flters Quantfyng Uncertanty Sa Ravela M. I. T Last Updated: Sprng 2013 1 Quantfyng Uncertanty Partcle Flters Partcle Flters Appled to Sequental flterng problems Can also be appled to smoothng problems
More information6) Derivatives, gradients and Hessian matrices
30C00300 Mathematcal Methods for Economsts (6 cr) 6) Dervatves, gradents and Hessan matrces Smon & Blume chapters: 14, 15 Sldes by: Tmo Kuosmanen 1 Outlne Defnton of dervatve functon Dervatve notatons
More information[ ] λ λ λ. Multicollinearity. multicollinearity Ragnar Frisch (1934) perfect exact. collinearity. multicollinearity. exact
Multcollnearty multcollnearty Ragnar Frsch (934 perfect exact collnearty multcollnearty K exact λ λ λ K K x+ x+ + x 0 0.. λ, λ, λk 0 0.. x perfect ntercorrelated λ λ λ x+ x+ + KxK + v 0 0.. v 3 y β + β
More informationFinding Dense Subgraphs in G(n, 1/2)
Fndng Dense Subgraphs n Gn, 1/ Atsh Das Sarma 1, Amt Deshpande, and Rav Kannan 1 Georga Insttute of Technology,atsh@cc.gatech.edu Mcrosoft Research-Bangalore,amtdesh,annan@mcrosoft.com Abstract. Fndng
More informationTests of Single Linear Coefficient Restrictions: t-tests and F-tests. 1. Basic Rules. 2. Testing Single Linear Coefficient Restrictions
ECONOMICS 35* -- NOTE ECON 35* -- NOTE Tests of Sngle Lnear Coeffcent Restrctons: t-tests and -tests Basc Rules Tests of a sngle lnear coeffcent restrcton can be performed usng ether a two-taled t-test
More informationBOUNDEDNESS OF THE RIESZ TRANSFORM WITH MATRIX A 2 WEIGHTS
BOUNDEDNESS OF THE IESZ TANSFOM WITH MATIX A WEIGHTS Introducton Let L = L ( n, be the functon space wth norm (ˆ f L = f(x C dx d < For a d d matrx valued functon W : wth W (x postve sem-defnte for all
More informationProbability Theory (revisited)
Probablty Theory (revsted) Summary Probablty v.s. plausblty Random varables Smulaton of Random Experments Challenge The alarm of a shop rang. Soon afterwards, a man was seen runnng n the street, persecuted
More informationComposite Hypotheses testing
Composte ypotheses testng In many hypothess testng problems there are many possble dstrbutons that can occur under each of the hypotheses. The output of the source s a set of parameters (ponts n a parameter
More informationFour lectures on probabilistic methods for data science
IAS/Park Cty Mathematcs Seres Volume 00, Pages 000 000 S 1079-5634(XX)0000-0 Four lectures on probablstc methods for data scence Roman Vershynn Abstract. Methods of hgh-dmensonal probablty play a central
More informationEconometrica Supplementary Material
Econometrca Supplementary Materal SUPPLEMENT TO IDENTIFICATION AND ESTIMATION OF TRIANGULAR SIMULTANEOUS EQUATIONS MODELS WITHOUT ADDITIVITY Econometrca, Vol. 77, No. 5, September 009, 48 5) BY GUIDO W.
More informationParametric fractional imputation for missing data analysis. Jae Kwang Kim Survey Working Group Seminar March 29, 2010
Parametrc fractonal mputaton for mssng data analyss Jae Kwang Km Survey Workng Group Semnar March 29, 2010 1 Outlne Introducton Proposed method Fractonal mputaton Approxmaton Varance estmaton Multple mputaton
More informationLecture Space-Bounded Derandomization
Notes on Complexty Theory Last updated: October, 2008 Jonathan Katz Lecture Space-Bounded Derandomzaton 1 Space-Bounded Derandomzaton We now dscuss derandomzaton of space-bounded algorthms. Here non-trval
More information