Hybridized Heredity In Support Vector Machine
|
|
- Lauren Hunter
- 5 years ago
- Views:
Transcription
1 Hybridized Heredity I Suort Vector Machie May 2015 Hybridized Heredity I Suort Vector Machie Timothy Idowu Yougmi Park Uiversity of Wiscosi-Madiso idowu@stat.wisc.edu yougmi@stat.wisc.edu May 2015 Abstract I the resece of high dimesioal classificatio roblem it becomes ecessary to ivoke some form of sarsity. I this reort we make efforts to imrove heredity ricile i the Suort Vector Machie frame work. Heredity ricile is some form of hierarchical structure imosed o the redictor variables which ca make classifiers more iterretable. The heredity riciles itroduced by Wu, S., Zou, H. ad Yua, M. (2008)] are exteded to a more geeral form. The methods here are origially based o the structured variable selectio (SVS) roosed by Yua, M., Joseh, R. ad Zou, H. (2007)]. Obtaied solutios coform with secified heredity requiremets i the resece of sarsity. I. Itroductio I the machie learig world the suort vector machie (SVM) is a widely used suervised learig method for biary classificatio. Let x deote a set of covariates. The class labels, y, are deoted as {1, 1}. Usig some traiig data set {x i, y i }, i = 1, 2,...,, the suort vector machie solves the roblem by usig the followig ealized hige loss ( ˆβ, ˆβ 0 ) = arg mi β,β 0 1 y i (x T i β β 0 )] λ β 2 2 (1) with the subscrit "" meaig ositive arts oly. The solutio is the Sig ( ˆβ 0 x T ˆβ ). I order to make the SVM more iterretable ad reduce classificatio error (roortio of times the SVM mis-classifies) we ca imlemet some structural adjustmet. Oe ossible adjustmet could be sarsity i cases with high dimesio to avoid iclusio of irrelevat redictors. A revious roositio by Bradley, P. ad Magasaria, O. (1998)] suggested a relacemet of the l 2 ealty i 1 with a l 1 ealty ( ˆβ, ˆβ 0 ) = arg mi β,β 0 1 y i (x T i β β 0 )] λ β 1. (2) Methods The aforemetioed methods do ot however take ito accout ay form of relatioshi betwee the redictors. For examle, let us cosider a quadratic classifier with redictors z 1, z 2,..., z q : β 1 z 1... β q z q β 11 z 2 1 β 12z 1 z 2... β q,q 1 z q z q 1... β qq z 2 q (3) oe might wat to activate the heredity ricile. Two well established heredity riciles are the strog ad weak. 1
2 Hybridized Heredity I Suort Vector Machie May 2015 Strog heredity: For a two factor iteractio, say, some z i z j to be active both its aret effects, z i ad z j, should be active. Weak heredity: For a two factor iteractio, say, some z i z j to be active at least oe of its aret effects, z i or z j, should be active. It is cosistet to require z i active for z 2 i to be active. The methods discussed i this reort simultaeously imose sarsity ad heredity. The costraits are also secified such that the roblems remai liear rogrammig oes. I. Geeralized Garrote ad Heredity Priciles Breima, L. 1995] itroduced the oegative garrrote used for variable selectio i liear regressio. This ca be alied i the SVM framework. If we have extracted the coefficiets ˆβ from the l 2 SVM, it is ossible to scale the each redictor x j with some arameter θ j ad the solve the followig otimizatio roblem mi 1 y i x ij ˆβ j θ j β 0 with the costraits θ j M ad θ j 0 j, with M as the shrikage arameter. The classifier ( the takes the form Sig ˆβ 0 x j ˆβ ) j ˆθ j. With a aroriately chose M, some θ j s will be reduced to 0, hece erformig a variable selectio. Adjustmets ca be made o the garrote method to fit certai requiremet by usig aroriate costraits. (4) Strog Heredity (SHSVM) Let the redictor set be of size, the hierarchical structure of the redictors ca be exressed by sets {D j : j = 1,..., }, where D j cosists of the aret effects of the jth redictor. For examle, let the q 1th redictor, x q1 = z 1 z 2 with aret effect x 1 = z 1 ad x 2 = z 2 so we have D q1 = {1, 2}. To imlemet the strog heredity ricile the garrote ca be adjusted with some costraits mi 1 y i x ij ˆβ j θ j β 0 λ θ j (5) with the costraits θ j θ r, r D j, j ad θ j 0 j. The additioal liear iequality costrait o the scalig arameters also hel with sarsity of the coefficiets. I Weak Heredity (WHSVM) To imlemet the weak heredity ricile we have the costraits are mi 1 y i x ij ˆβ j θ j β 0 λ θ j (6) with the costraits θ j r Dj θ r, j ad θ j 0 j. Based o the costraits at least oe aret eeds to be active for a iteractio to be active, which imlies that the resultig model satisfies the weak heredity ricile. 2
3 Hybridized Heredity I Suort Vector Machie May 2015 Hybridized Heredity (HHSVM) We further set the costraits so as to geeralize the heredity ricile by havig the followig: mi 1 y i x ij ˆβ j θ j β 0 λ θ j (7) with the costraits θ j r D θ j r, j ad θ D j j 0 j where D j is the dimesio of D j. Note the followig: If strog heredity is satisfied the the hybridized heredity must be satisfied. If hybridized heredity is satisfied the the weak heredity must be satisfied. I Numerical Results I. Data We use umerical examles to comare our HHSVM to the WHSVM ad SHSVM. For each simulatio examle, we geerated a dataset of size 10,000 for testig. For traiig, we radomly geerated aother datasets of sizes = 50, 100, ad 200. I each examle, all classifiers were fitted o a traiig samle ad their classificatio errors (uder 0 1 loss) were comuted o a test samle. This rocess was reeated 30 times ad the meas of the errors are reorted. Structure The geerated exlaatory variables z 1,..., z 7 are stadard ormal, where the correlatio betwee z r ad z j is ρ r j, with ρ = 0, 0.5. The class labels are geerated from a logistic regressio model. The set of redictors for fittig the SVMs is {z j, z r z j, z 2 j }, with r, j = 1,..., 7. Let θ j ad θ jj be the scalig arameters for z j ad z 2 j resectively while θ rj corresods to z r z j for r = j. To fit the SHSVM the liear costraits i 5 are of the form θ rj θ r ad θ rj θ j, r = j, where r, j = 1,..., 7 To fit the WHSVM the liear costraits i 6 are of the form θ rj θ r θ j, r = j, where r, j = 1,..., 7 To fit the HHSVM the liear costraits i 7 are of the form I θ rj θ rθ j 2, r = j, where r, j = 1,..., 7 Examle with Strog Heredity satisfied ( ) Pr(y = 1 z1,..., z 7 ) log = 2z Pr(y = 1 z 1,..., z 7 ) 1 4z 3 3z 1 z 2 1. The simulatio results are summarized i Table 1. From Table 1 we see that the SHSVM sigificatly outerforms other methods but the HHSVM outerforms the WHSVM. 3
4 Hybridized Heredity I Suort Vector Machie May 2015 Examle with Weak Heredity satisfied ( ) Pr(y = 1 z1,..., z 7 ) log = 3.5z Pr(y = 1 z 1,..., z 7 ) 1 3z 1 z 2 2.5z 1 z 3 2z 1 z 4 1.5z 1 z 5 z 1 z 6 1. The simulatio results are summarized i Table 2. From Table 2 we see that the WHSVM sigificatly outerforms other methods but the HHSVM outerforms the SHSVM. Table 1: Classificatio Error from Strog Heredity Examle ρ=0 SHSVM HHSVM WHSVM l 2 SVM ρ=0.5 SHSVM HHSVM WHSVM l 2 SVM Table 2: Classificatio Error from Weak Heredity Examle ρ=0 WHSVM HHSVM SHSVM l 2 SVM ρ=0.5 WHSVM HHSVM SHSVM l 2 SVM Discussio I the cases where weak heredity is satisfied, HHSVM outerform SHSVM. I the cases where strog heredity is satisfied, HHSVM outerform WHSVM. It is recommeded to aly HHSVM whe the heredity structure is ot kow or well secified. Refereces Bradley, P. ad Magasaria, O. (1998)] Bradley, P. ad Magasaria, O. (1998). Feature selectio via cocave miimizatio ad suort vector machies. I J. Shavlik (eds), ICML 98. Morga Kaufma. Breima, L. 1995] Breima, L. (1995). Better subset regressio usig the oegative garrote. Techometrics, 37, 4,
5 Hybridized Heredity I Suort Vector Machie May 2015 Yua, M., Joseh, R. ad Zou, H. (2007)] Yua, M., Joseh, R. ad Zou, H. (2007). Structured Variable Selectio ad Estimatio. Techical Reort, School of Idustrial ad Systems Egieerig, Georgia Istitute of Techology. Wu, S., Zou, H. ad Yua, M. (2008)] Wu, S., Zou, H. ad Yua, M. (2008). Structured Variable Selectio i Suort Vector Machies. Electroic Joural of Statistics, Vol. 2(2008)
Support vector machine revisited
6.867 Machie learig, lecture 8 (Jaakkola) 1 Lecture topics: Support vector machie ad kerels Kerel optimizatio, selectio Support vector machie revisited Our task here is to first tur the support vector
More informationOverview. Structured learning for feature selection and prediction. Motivation for feature selection. Outline. Part III:
Overview Structured learig for feature selectio ad predictio Yookyug Lee Departmet of Statistics The Ohio State Uiversity Part I: Itroductio to Kerel methods Part II: Learig with Reproducig Kerel Hilbert
More informationConfidence Intervals
Cofidece Itervals Berli Che Deartmet of Comuter Sciece & Iformatio Egieerig Natioal Taiwa Normal Uiversity Referece: 1. W. Navidi. Statistics for Egieerig ad Scietists. Chater 5 & Teachig Material Itroductio
More informationVolume 2, ISSN (Online), Published at:
SHRINKAGE REGRESSION METHODS APPLIED TO AGRICULTURE Sua Akkol Yuzucu Yıl Uiversity, Faculty of Agriculture, Deartmet of Aimal Sciece, Va-Turkey Abstract I Multile Liear Regressio a commo goal is to determie
More informationChapter 7. Support Vector Machine
Chapter 7 Support Vector Machie able of Cotet Margi ad support vectors SVM formulatio Slack variables ad hige loss SVM for multiple class SVM ith Kerels Relevace Vector Machie Support Vector Machie (SVM)
More informationSYMMETRIC POSITIVE SEMI-DEFINITE SOLUTIONS OF AX = B AND XC = D
Joural of Pure ad Alied Mathematics: Advaces ad Alicatios olume, Number, 009, Pages 99-07 SYMMERIC POSIIE SEMI-DEFINIE SOLUIONS OF AX B AND XC D School of Mathematics ad Physics Jiagsu Uiversity of Sciece
More informationDefinitions and Theorems. where x are the decision variables. c, b, and a are constant coefficients.
Defiitios ad Theorems Remember the scalar form of the liear programmig problem, Miimize, Subject to, f(x) = c i x i a 1i x i = b 1 a mi x i = b m x i 0 i = 1,2,, where x are the decisio variables. c, b,
More informationBIOSTATISTICAL METHODS FOR TRANSLATIONAL & CLINICAL RESEARCH
BIOSAISICAL MEHODS FOR RANSLAIONAL & CLINICAL RESEARCH Direct Bioassays: REGRESSION APPLICAIONS COMPONENS OF A BIOASSAY he subject is usually a aimal, a huma tissue, or a bacteria culture, he aget is usually
More informationMachine Learning Theory Tübingen University, WS 2016/2017 Lecture 12
Machie Learig Theory Tübige Uiversity, WS 06/07 Lecture Tolstikhi Ilya Abstract I this lecture we derive risk bouds for kerel methods. We will start by showig that Soft Margi kerel SVM correspods to miimizig
More informationBehavior of Lasso Quantile Regression with Small Sample Sizes
Joural of Multidisciliary Egieerig Sciece ad Techology (JMEST) Behavior of Lasso Quatile Regressio with Small Samle Sizes Dr. Elham Abdul-Razik Ismail Assistat Professor of Statistics, Faculty of Commerce,
More information10-701/ Machine Learning Mid-term Exam Solution
0-70/5-78 Machie Learig Mid-term Exam Solutio Your Name: Your Adrew ID: True or False (Give oe setece explaatio) (20%). (F) For a cotiuous radom variable x ad its probability distributio fuctio p(x), it
More informationSummary and Discussion on Simultaneous Analysis of Lasso and Dantzig Selector
Summary ad Discussio o Simultaeous Aalysis of Lasso ad Datzig Selector STAT732, Sprig 28 Duzhe Wag May 4, 28 Abstract This is a discussio o the work i Bickel, Ritov ad Tsybakov (29). We begi with a short
More informationTo make comparisons for two populations, consider whether the samples are independent or dependent.
Sociology 54 Testig for differeces betwee two samle meas Cocetually, comarig meas from two differet samles is the same as what we ve doe i oe-samle tests, ecet that ow the hyotheses focus o the arameters
More information13.1 Shannon lower bound
ECE598: Iformatio-theoretic methods i high-dimesioal statistics Srig 016 Lecture 13: Shao lower boud, Fao s method Lecturer: Yihog Wu Scribe: Daewo Seo, Mar 8, 016 [Ed Mar 11] I the last class, we leared
More informationLinear Classifiers III
Uiversität Potsdam Istitut für Iformatik Lehrstuhl Maschielles Lere Liear Classifiers III Blaie Nelso, Tobias Scheffer Cotets Classificatio Problem Bayesia Classifier Decisio Liear Classifiers, MAP Models
More informationComposite Quantile Generalized Quasi-Likelihood Ratio Tests for Varying Coefficient Regression Models Jin-ju XU 1 and Zhong-hua LUO 2,*
07 d Iteratioal Coferece o Iformatio Techology ad Maagemet Egieerig (ITME 07) ISBN: 978--60595-45-8 Comosite Quatile Geeralized Quasi-Likelihood Ratio Tests for Varyig Coefficiet Regressio Models Ji-u
More information6.867 Machine learning
6.867 Machie learig Mid-term exam October, ( poits) Your ame ad MIT ID: Problem We are iterested here i a particular -dimesioal liear regressio problem. The dataset correspodig to this problem has examples
More informationTopics Machine learning: lecture 2. Review: the learning problem. Hypotheses and estimation. Estimation criterion cont d. Estimation criterion
.87 Machie learig: lecture Tommi S. Jaakkola MIT CSAIL tommi@csail.mit.edu Topics The learig problem hypothesis class, estimatio algorithm loss ad estimatio criterio samplig, empirical ad epected losses
More informationAlgebra of Least Squares
October 19, 2018 Algebra of Least Squares Geometry of Least Squares Recall that out data is like a table [Y X] where Y collects observatios o the depedet variable Y ad X collects observatios o the k-dimesioal
More informationLecture 24: Variable selection in linear models
Lecture 24: Variable selectio i liear models Cosider liear model X = Z β + ε, β R p ad Varε = σ 2 I. Like the LSE, the ridge regressio estimator does ot give 0 estimate to a compoet of β eve if that compoet
More informationEstimation of Parameters of Johnson s System of Distributions
Joural of Moder Alied tatistical Methods Volume 0 Issue Article 9 --0 Estimatio of Parameters of Johso s ystem of Distributios Florece George Florida Iteratioal iversity, fgeorge@fiu.edu K. M. Ramachadra
More informationGeometry of LS. LECTURE 3 GEOMETRY OF LS, PROPERTIES OF σ 2, PARTITIONED REGRESSION, GOODNESS OF FIT
OCTOBER 7, 2016 LECTURE 3 GEOMETRY OF LS, PROPERTIES OF σ 2, PARTITIONED REGRESSION, GOODNESS OF FIT Geometry of LS We ca thik of y ad the colums of X as members of the -dimesioal Euclidea space R Oe ca
More informationResampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.
Jauary 1, 2019 Resamplig Methods Motivatio We have so may estimators with the property θ θ d N 0, σ 2 We ca also write θ a N θ, σ 2 /, where a meas approximately distributed as Oce we have a cosistet estimator
More informationA statistical method to determine sample size to estimate characteristic value of soil parameters
A statistical method to determie sample size to estimate characteristic value of soil parameters Y. Hojo, B. Setiawa 2 ad M. Suzuki 3 Abstract Sample size is a importat factor to be cosidered i determiig
More informationLecture 10: Performance Evaluation of ML Methods
CSE57A Machie Learig Sprig 208 Lecture 0: Performace Evaluatio of ML Methods Istructor: Mario Neuma Readig: fcml: 5.4 (Performace); esl: 7.0 (Cross-Validatio); optioal book: Evaluatio Learig Algorithms
More informationLecture 33: Bootstrap
Lecture 33: ootstrap Motivatio To evaluate ad compare differet estimators, we eed cosistet estimators of variaces or asymptotic variaces of estimators. This is also importat for hypothesis testig ad cofidece
More informationA Note on Adaptive Group Lasso
A Note o Adaptive Group Lasso Hasheg Wag ad Chelei Leg Pekig Uiversity & Natioal Uiversity of Sigapore July 7, 2006. Abstract Group lasso is a atural extesio of lasso ad selects variables i a grouped maer.
More informationMachine Learning Theory Tübingen University, WS 2016/2017 Lecture 11
Machie Learig Theory Tübige Uiversity, WS 06/07 Lecture Tolstikhi Ilya Abstract We will itroduce the otio of reproducig kerels ad associated Reproducig Kerel Hilbert Spaces (RKHS). We will cosider couple
More informationConfidence intervals for proportions
Cofidece itervals for roortios Studet Activity 7 8 9 0 2 TI-Nsire Ivestigatio Studet 60 mi Itroductio From revious activity This activity assumes kowledge of the material covered i the activity Distributio
More informationECE 8527: Introduction to Machine Learning and Pattern Recognition Midterm # 1. Vaishali Amin Fall, 2015
ECE 8527: Itroductio to Machie Learig ad Patter Recogitio Midterm # 1 Vaishali Ami Fall, 2015 tue39624@temple.edu Problem No. 1: Cosider a two-class discrete distributio problem: ω 1 :{[0,0], [2,0], [2,2],
More informationThe Hong Kong University of Science & Technology ISOM551 Introductory Statistics for Business Assignment 3 Suggested Solution
The Hog Kog Uiversity of ciece & Techology IOM55 Itroductory tatistics for Busiess Assigmet 3 uggested olutio Note All values of statistics i Q ad Q4 are obtaied by Excel. Qa. Let be the robability that
More informationInformation-based Feature Selection
Iformatio-based Feature Selectio Farza Faria, Abbas Kazeroui, Afshi Babveyh Email: {faria,abbask,afshib}@staford.edu 1 Itroductio Feature selectio is a topic of great iterest i applicatios dealig with
More informationMachine Learning Brett Bernstein
Machie Learig Brett Berstei Week 2 Lecture: Cocept Check Exercises Starred problems are optioal. Excess Risk Decompositio 1. Let X = Y = {1, 2,..., 10}, A = {1,..., 10, 11} ad suppose the data distributio
More informationMachine Learning Theory (CS 6783)
Machie Learig Theory (CS 6783) Lecture 2 : Learig Frameworks, Examples Settig up learig problems. X : istace space or iput space Examples: Computer Visio: Raw M N image vectorized X = 0, 255 M N, SIFT
More informationDISCUSSION: LATENT VARIABLE GRAPHICAL MODEL SELECTION VIA CONVEX OPTIMIZATION. By Zhao Ren and Harrison H. Zhou Yale University
Submitted to the Aals of Statistics DISCUSSION: LATENT VARIABLE GRAPHICAL MODEL SELECTION VIA CONVEX OPTIMIZATION By Zhao Re ad Harriso H. Zhou Yale Uiversity 1. Itroductio. We would like to cogratulate
More informationEmpirical likelihood for parametric model under imputation for missing
Emirical likelihood for arametric model uder imutatio for missig data Lichu Wag Ceter for Statistics Limburgs Uiversitair Cetrum Uiversitaire Camus B-3590 Dieebeek Belgium Qihua Wag Istitute of Alied Mathematics
More informationHypothesis Testing, Model Selection, and Prediction in Least Squares and Maximum Likelihood Estimation
Hyothesis Testig, Model Selectio, ad Predictio i Least Squares ad Maximum Likelihood Estimatio Greee Ch.5, 4; Keedy Ch. 4 R scrit mod3sa, mod3sb, mod3sc Testig for Restrictios i a LS Model Liear Restrictios
More informationJohn H. J. Einmahl Tilburg University, NL. Juan Juan Cai Tilburg University, NL
Estimatio of the margial exected shortfall Laures de Haa, Poitiers, 202 Estimatio of the margial exected shortfall Jua Jua Cai Tilburg iversity, NL Laures de Haa Erasmus iversity Rotterdam, NL iversity
More informationLinear Regression Demystified
Liear Regressio Demystified Liear regressio is a importat subject i statistics. I elemetary statistics courses, formulae related to liear regressio are ofte stated without derivatio. This ote iteds to
More informationOptimally Sparse SVMs
A. Proof of Lemma 3. We here prove a lower boud o the umber of support vectors to achieve geeralizatio bouds of the form which we cosider. Importatly, this result holds ot oly for liear classifiers, but
More informationRandom Variables, Sampling and Estimation
Chapter 1 Radom Variables, Samplig ad Estimatio 1.1 Itroductio This chapter will cover the most importat basic statistical theory you eed i order to uderstad the ecoometric material that will be comig
More informationBoosting. Professor Ameet Talwalkar. Professor Ameet Talwalkar CS260 Machine Learning Algorithms March 1, / 32
Boostig Professor Ameet Talwalkar Professor Ameet Talwalkar CS260 Machie Learig Algorithms March 1, 2017 1 / 32 Outlie 1 Admiistratio 2 Review of last lecture 3 Boostig Professor Ameet Talwalkar CS260
More informationLecture 2: Monte Carlo Simulation
STAT/Q SCI 43: Itroductio to Resamplig ethods Sprig 27 Istructor: Ye-Chi Che Lecture 2: ote Carlo Simulatio 2 ote Carlo Itegratio Assume we wat to evaluate the followig itegratio: e x3 dx What ca we do?
More information6.867 Machine learning, lecture 7 (Jaakkola) 1
6.867 Machie learig, lecture 7 (Jaakkola) 1 Lecture topics: Kerel form of liear regressio Kerels, examples, costructio, properties Liear regressio ad kerels Cosider a slightly simpler model where we omit
More informationRound-off Errors and Computer Arithmetic - (1.2)
Roud-off Errors ad Comuter Arithmetic - (1.) 1. Roud-off Errors: Roud-off errors is roduced whe a calculator or comuter is used to erform real umber calculatios. That is because the arithmetic erformed
More informationTHE INTEGRAL TEST AND ESTIMATES OF SUMS
THE INTEGRAL TEST AND ESTIMATES OF SUMS. Itroductio Determiig the exact sum of a series is i geeral ot a easy task. I the case of the geometric series ad the telescoig series it was ossible to fid a simle
More informationEmpirical Process Theory and Oracle Inequalities
Stat 928: Statistical Learig Theory Lecture: 10 Empirical Process Theory ad Oracle Iequalities Istructor: Sham Kakade 1 Risk vs Risk See Lecture 0 for a discussio o termiology. 2 The Uio Boud / Boferoi
More information(all terms are scalars).the minimization is clearer in sum notation:
7 Multiple liear regressio: with predictors) Depedet data set: y i i = 1, oe predictad, predictors x i,k i = 1,, k = 1, ' The forecast equatio is ŷ i = b + Use matrix otatio: k =1 b k x ik Y = y 1 y 1
More informationIntroduction to Machine Learning DIS10
CS 189 Fall 017 Itroductio to Machie Learig DIS10 1 Fu with Lagrage Multipliers (a) Miimize the fuctio such that f (x,y) = x + y x + y = 3. Solutio: The Lagragia is: L(x,y,λ) = x + y + λ(x + y 3) Takig
More informationLinear Support Vector Machines
Liear Support Vector Machies David S. Roseberg The Support Vector Machie For a liear support vector machie (SVM), we use the hypothesis space of affie fuctios F = { f(x) = w T x + b w R d, b R } ad evaluate
More informationInvestigating the Significance of a Correlation Coefficient using Jackknife Estimates
Iteratioal Joural of Scieces: Basic ad Applied Research (IJSBAR) ISSN 2307-4531 (Prit & Olie) http://gssrr.org/idex.php?joural=jouralofbasicadapplied ---------------------------------------------------------------------------------------------------------------------------
More informationECONOMETRIC THEORY. MODULE XIII Lecture - 34 Asymptotic Theory and Stochastic Regressors
ECONOMETRIC THEORY MODULE XIII Lecture - 34 Asymptotic Theory ad Stochastic Regressors Dr. Shalabh Departmet of Mathematics ad Statistics Idia Istitute of Techology Kapur Asymptotic theory The asymptotic
More information11 Correlation and Regression
11 Correlatio Regressio 11.1 Multivariate Data Ofte we look at data where several variables are recorded for the same idividuals or samplig uits. For example, at a coastal weather statio, we might record
More informationLinear Programming and the Simplex Method
Liear Programmig ad the Simplex ethod Abstract This article is a itroductio to Liear Programmig ad usig Simplex method for solvig LP problems i primal form. What is Liear Programmig? Liear Programmig is
More informationTesting Statistical Hypotheses for Compare. Means with Vague Data
Iteratioal Mathematical Forum 5 o. 3 65-6 Testig Statistical Hypotheses for Compare Meas with Vague Data E. Baloui Jamkhaeh ad A. adi Ghara Departmet of Statistics Islamic Azad iversity Ghaemshahr Brach
More informationElement sampling: Part 2
Chapter 4 Elemet samplig: Part 2 4.1 Itroductio We ow cosider uequal probability samplig desigs which is very popular i practice. I the uequal probability samplig, we ca improve the efficiecy of the resultig
More informationECE534, Spring 2018: Final Exam
ECE534, Srig 2018: Fial Exam Problem 1 Let X N (0, 1) ad Y N (0, 1) be ideedet radom variables. variables V = X + Y ad W = X 2Y. Defie the radom (a) Are V, W joitly Gaussia? Justify your aswer. (b) Comute
More informationIP Reference guide for integer programming formulations.
IP Referece guide for iteger programmig formulatios. by James B. Orli for 15.053 ad 15.058 This documet is iteded as a compact (or relatively compact) guide to the formulatio of iteger programs. For more
More informationON SUPERSINGULAR ELLIPTIC CURVES AND HYPERGEOMETRIC FUNCTIONS
ON SUPERSINGULAR ELLIPTIC CURVES AND HYPERGEOMETRIC FUNCTIONS KEENAN MONKS Abstract The Legedre Family of ellitic curves has the remarkable roerty that both its eriods ad its suersigular locus have descritios
More informationStudy on Coal Consumption Curve Fitting of the Thermal Power Based on Genetic Algorithm
Joural of ad Eergy Egieerig, 05, 3, 43-437 Published Olie April 05 i SciRes. http://www.scirp.org/joural/jpee http://dx.doi.org/0.436/jpee.05.34058 Study o Coal Cosumptio Curve Fittig of the Thermal Based
More informationMachine Learning for Data Science (CS 4786)
Machie Learig for Data Sciece CS 4786) Lecture & 3: Pricipal Compoet Aalysis The text i black outlies high level ideas. The text i blue provides simple mathematical details to derive or get to the algorithm
More informationSummary: CORRELATION & LINEAR REGRESSION. GC. Students are advised to refer to lecture notes for the GC operations to obtain scatter diagram.
Key Cocepts: 1) Sketchig of scatter diagram The scatter diagram of bivariate (i.e. cotaiig two variables) data ca be easily obtaied usig GC. Studets are advised to refer to lecture otes for the GC operatios
More informationBasics of Inference. Lecture 21: Bayesian Inference. Review - Example - Defective Parts, cont. Review - Example - Defective Parts
Basics of Iferece Lecture 21: Sta230 / Mth230 Coli Rudel Aril 16, 2014 U util this oit i the class you have almost exclusively bee reseted with roblems where we are usig a robability model where the model
More informationA Note on Matrix Rigidity
A Note o Matrix Rigidity Joel Friedma Departmet of Computer Sciece Priceto Uiversity Priceto, NJ 08544 Jue 25, 1990 Revised October 25, 1991 Abstract I this paper we give a explicit costructio of matrices
More information18. Two-sample problems for population means (σ unknown)
8. Two-samle roblems for oulatio meas (σ ukow) The Practice of Statistics i the Life Scieces Third Editio 04 W.H. Freema ad Comay Objectives (PSLS Chater 8) Comarig two meas (σ ukow) Two-samle situatios
More informationPutnam Training Exercise Counting, Probability, Pigeonhole Principle (Answers)
Putam Traiig Exercise Coutig, Probability, Pigeohole Pricile (Aswers) November 24th, 2015 1. Fid the umber of iteger o-egative solutios to the followig Diohatie equatio: x 1 + x 2 + x 3 + x 4 + x 5 = 17.
More informationConfidence Intervals for the Difference Between Two Proportions
PASS Samle Size Software Chater 6 Cofidece Itervals for the Differece Betwee Two Proortios Itroductio This routie calculates the grou samle sizes ecessary to achieve a secified iterval width of the differece
More informationLecture #20. n ( x p i )1/p = max
COMPSCI 632: Approximatio Algorithms November 8, 2017 Lecturer: Debmalya Paigrahi Lecture #20 Scribe: Yua Deg 1 Overview Today, we cotiue to discuss about metric embeddigs techique. Specifically, we apply
More informationVariable selection in principal components analysis of qualitative data using the accelerated ALS algorithm
Variable selectio i pricipal compoets aalysis of qualitative data usig the accelerated ALS algorithm Masahiro Kuroda Yuichi Mori Masaya Iizuka Michio Sakakihara (Okayama Uiversity of Sciece) (Okayama Uiversity
More informationProperties and Hypothesis Testing
Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.
More informationStatistical Inference Based on Extremum Estimators
T. Rotheberg Fall, 2007 Statistical Iferece Based o Extremum Estimators Itroductio Suppose 0, the true value of a p-dimesioal parameter, is kow to lie i some subset S R p : Ofte we choose to estimate 0
More informationDETERMINATION OF MECHANICAL PROPERTIES OF A NON- UNIFORM BEAM USING THE MEASUREMENT OF THE EXCITED LONGITUDINAL ELASTIC VIBRATIONS.
ICSV4 Cairs Australia 9- July 7 DTRMINATION OF MCHANICAL PROPRTIS OF A NON- UNIFORM BAM USING TH MASURMNT OF TH XCITD LONGITUDINAL LASTIC VIBRATIONS Pavel Aokhi ad Vladimir Gordo Departmet of the mathematics
More informationECE 901 Lecture 12: Complexity Regularization and the Squared Loss
ECE 90 Lecture : Complexity Regularizatio ad the Squared Loss R. Nowak 5/7/009 I the previous lectures we made use of the Cheroff/Hoeffdig bouds for our aalysis of classifier errors. Hoeffdig s iequality
More informationStatistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.
Statistical Iferece (Chapter 10) Statistical iferece = lear about a populatio based o the iformatio provided by a sample. Populatio: The set of all values of a radom variable X of iterest. Characterized
More informationLASSO Variable Selection Techniques in Data Envelopment Analysis
LASSO Variable Selectio Techiques i Data Evelomet Aalysis Jia-Yig Cai Istitute of Maufacturig Iformatio ad Systems Natioal Cheg Kug Uiversity, Taia City 701, Taiwa Tel: (+886) 6-275-7575 ext.34223, Email:
More information1 Duality revisited. AM 221: Advanced Optimization Spring 2016
AM 22: Advaced Optimizatio Sprig 206 Prof. Yaro Siger Sectio 7 Wedesday, Mar. 9th Duality revisited I this sectio, we will give a slightly differet perspective o duality. optimizatio program: f(x) x R
More informationSolutions to Problem Set 7
8.78 Solutios to Problem Set 7. If the umber is i S, we re doe sice it s relatively rime to everythig. So suose S. Break u the remaiig elemets ito airs {, }, {4, 5},..., {, + }. By the Pigeohole Pricile,
More informationMachine Learning. Logistic Regression -- generative verses discriminative classifier. Le Song /15-781, Spring 2008
Machie Learig 070/578 Srig 008 Logistic Regressio geerative verses discrimiative classifier Le Sog Lecture 5 Setember 4 0 Based o slides from Eric Xig CMU Readig: Cha. 3..34 CB Geerative vs. Discrimiative
More informationMachine Learning Brett Bernstein
Machie Learig Brett Berstei Week Lecture: Cocept Check Exercises Starred problems are optioal. Statistical Learig Theory. Suppose A = Y = R ad X is some other set. Furthermore, assume P X Y is a discrete
More informationEconomics 326 Methods of Empirical Research in Economics. Lecture 18: The asymptotic variance of OLS and heteroskedasticity
Ecoomics 326 Methods of Empirical Research i Ecoomics Lecture 8: The asymptotic variace of OLS ad heteroskedasticity Hiro Kasahara Uiversity of British Columbia December 24, 204 Asymptotic ormality I I
More informationThe Method of Least Squares. To understand least squares fitting of data.
The Method of Least Squares KEY WORDS Curve fittig, least square GOAL To uderstad least squares fittig of data To uderstad the least squares solutio of icosistet systems of liear equatios 1 Motivatio Curve
More informationECON 3150/4150, Spring term Lecture 3
Itroductio Fidig the best fit by regressio Residuals ad R-sq Regressio ad causality Summary ad ext step ECON 3150/4150, Sprig term 2014. Lecture 3 Ragar Nymoe Uiversity of Oslo 21 Jauary 2014 1 / 30 Itroductio
More informationChapter 6 Sampling Distributions
Chapter 6 Samplig Distributios 1 I most experimets, we have more tha oe measuremet for ay give variable, each measuremet beig associated with oe radomly selected a member of a populatio. Hece we eed to
More informationDistribution of Sample Proportions
Distributio of Samle Proortios Probability ad statistics Aswers & Teacher Notes TI-Nsire Ivestigatio Studet 90 mi 7 8 9 10 11 12 Itroductio From revious activity: This activity assumes kowledge of the
More informationDimension of a Maximum Volume
Dimesio of a Maximum Volume Robert Kreczer Deartmet of Mathematics ad Comutig Uiversity of Wiscosi-Steves Poit Steves Poit, WI 54481 Phoe: (715) 346-3754 Email: rkrecze@uwsmail.uws.edu 1. INTRODUCTION.
More informationClustering. CM226: Machine Learning for Bioinformatics. Fall Sriram Sankararaman Acknowledgments: Fei Sha, Ameet Talwalkar.
Clusterig CM226: Machie Learig for Bioiformatics. Fall 216 Sriram Sakararama Ackowledgmets: Fei Sha, Ameet Talwalkar Clusterig 1 / 42 Admiistratio HW 1 due o Moday. Email/post o CCLE if you have questios.
More informationMixtures of Gaussians and the EM Algorithm
Mixtures of Gaussias ad the EM Algorithm CSE 6363 Machie Learig Vassilis Athitsos Computer Sciece ad Egieerig Departmet Uiversity of Texas at Arligto 1 Gaussias A popular way to estimate probability desity
More informationControl Charts for Mean for Non-Normally Correlated Data
Joural of Moder Applied Statistical Methods Volume 16 Issue 1 Article 5 5-1-017 Cotrol Charts for Mea for No-Normally Correlated Data J. R. Sigh Vikram Uiversity, Ujjai, Idia Ab Latif Dar School of Studies
More informationStep 1: Function Set. Otherwise, output C 2. Function set: Including all different w and b
Logistic Regressio Step : Fuctio Set We wat to fid P w,b C x σ z = + exp z If P w,b C x.5, output C Otherwise, output C 2 z P w,b C x = σ z z = w x + b = w i x i + b i z Fuctio set: f w,b x = P w,b C x
More informationLinear regression. Daniel Hsu (COMS 4771) (y i x T i β)2 2πσ. 2 2σ 2. 1 n. (x T i β y i ) 2. 1 ˆβ arg min. β R n d
Liear regressio Daiel Hsu (COMS 477) Maximum likelihood estimatio Oe of the simplest liear regressio models is the followig: (X, Y ),..., (X, Y ), (X, Y ) are iid radom pairs takig values i R d R, ad Y
More informationtests 17.1 Simple versus compound
PAS204: Lecture 17. tests UMP ad asymtotic I this lecture, we will idetify UMP tests, wherever they exist, for comarig a simle ull hyothesis with a comoud alterative. We also look at costructig tests based
More informationSelective Prediction
COMS 6998-4 Fall 2017 November 8, 2017 Selective Predictio Preseter: Rog Zhou Scribe: Wexi Che 1 Itroductio I our previous discussio o a variatio o the Valiat Model [3], the described learer has the ability
More information3.1. Introduction Assumptions.
Sectio 3. Proofs 3.1. Itroductio. A roof is a carefully reasoed argumet which establishes that a give statemet is true. Logic is a tool for the aalysis of roofs. Each statemet withi a roof is a assumtio,
More informationLogit regression Logit regression
Logit regressio Logit regressio models the probability of Y= as the cumulative stadard logistic distributio fuctio, evaluated at z = β 0 + β X: Pr(Y = X) = F(β 0 + β X) F is the cumulative logistic distributio
More informationMassachusetts Institute of Technology
Massachusetts Istitute of Techology 6.867 Machie Learig, Fall 6 Problem Set : Solutios. (a) (5 poits) From the lecture otes (Eq 4, Lecture 5), the optimal parameter values for liear regressio give the
More informationSOME TRIBONACCI IDENTITIES
Mathematics Today Vol.7(Dec-011) 1-9 ISSN 0976-38 Abstract: SOME TRIBONACCI IDENTITIES Shah Devbhadra V. Sir P.T.Sarvajaik College of Sciece, Athwalies, Surat 395001. e-mail : drdvshah@yahoo.com The sequece
More informationA RANK STATISTIC FOR NON-PARAMETRIC K-SAMPLE AND CHANGE POINT PROBLEMS
J. Japa Statist. Soc. Vol. 41 No. 1 2011 67 73 A RANK STATISTIC FOR NON-PARAMETRIC K-SAMPLE AND CHANGE POINT PROBLEMS Yoichi Nishiyama* We cosider k-sample ad chage poit problems for idepedet data i a
More informationLecture 22: Review for Exam 2. 1 Basic Model Assumptions (without Gaussian Noise)
Lecture 22: Review for Exam 2 Basic Model Assumptios (without Gaussia Noise) We model oe cotiuous respose variable Y, as a liear fuctio of p umerical predictors, plus oise: Y = β 0 + β X +... β p X p +
More information5. Fractional Hot deck Imputation
5. Fractioal Hot deck Imputatio Itroductio Suppose that we are iterested i estimatig θ EY or eve θ 2 P ry < c where y fy x where x is always observed ad y is subject to missigess. Assume MAR i the sese
More informationarxiv: v1 [math.st] 18 Jun 2008
The Aals of Statistics 28, Vol. 36, No. 3, 8 26 DOI:.24/7-AOS57 c Istitute of Mathematical Statistics, 28 arxiv:86.295v [math.st] 8 Ju 28 COMPOSITE QUANTILE REGRESSION AND THE ORACLE MODEL SELECTION THEORY
More information