Machine Learning 1: Linear Regression

Size: px
Start display at page:

Download "Machine Learning 1: Linear Regression"

Transcription

1 Machine Learning 1: Linear Regression J. Zico Kolter September 18,

2 Motivation How much energy will we consume tommorrow? Difficult to estimate from a priori models But, we have lots of data from which to build a model 2

3 Energy 101 Energy: ability to do work (apply force through a distance) Unit of energy: joule (J), (also btu, kilowatt hour) 1 joule = 1 newton 1 meter = 1 kilogram 1 meter2 1 second 2 Power is a rate of energy use Unit of power: watt (W) (commonly kilo/mega/gigawatt) 1 watt = 1 joule 1 second 3

4 Forms of energy Mechanical kinetic energy: E = 1 2 mv2 Gravitational potential energy: E = mgh Thermal energy: E = 3 2 NkT Electrical energy: E = V Q Electromagnetic energy: E = hf Chemical energy Nuclear energy: E = mc 2 4

5 Energy conversion 5

6 Duquesne Light electricity consumption Hourly Demand (GW) Feb 9 Jul 13 Oct Hour of Day Data: PJM 6

7 Duquesne Light electricity consumption Source: Duquesne Light 7

8 Predict peak demand from high temperature What will peak demand be tomorrow? If we know something else about tomorrow (like the high temperature), we can use this to predict peak demand Peak Hourly Demand (GW) High Temperature (F) Data: PJM, Weather Underground (summer months, June-August) 8

9 A simple model A linear model that predicts demand: predicted peak demand = θ 1 (high temperature) + θ 2 Peak Hourly Demand (GW) Observed data Linear regression prediction High Temperature (F) Parameters of model: θ 1, θ 2 R (θ 1 = 0.046, θ 2 = 1.46) 9

10 A simple model We can use a model like this to make predictions What will be the peak demand for Duquense Light tomorrow? I know from weather report that high temperature will be 80 F (ignore, for the moment, that this too is a prediction) Then predicted peak demand is: θ θ 2 = = 2.19 GW 10

11 Formal problem setting Input: x i R n, i = 1,..., m E.g.: x i R 1 = {high temperature for day i} Output: y i R (regression task) E.g.: y i R = {peak demand for day i} Model Parameters: θ R k Predicted Output: ŷ i R E.g.: ŷ i = θ 1 x i + θ 2 11

12 For convenience, we define a function that maps inputs to feature vectors φ : R n R k For example, in our task above, if we define [ ] xi φ(x i ) = (here n = 1, k = 2) 1 then we can write ŷ i = k θ j φ j (x i ) θ T φ(x i ) j=1 12

13 Loss functions Want a model that performs well on the data we have I.e., ŷ i y i, i We measure closeness of ŷ i and y i using loss function l : R R R + Example: squared loss l(ŷ i, y i ) = (ŷ i y i ) 2 13

14 Finding model parameters, and optimization Want to find model parameters such that minimize sum of costs over all input/output pairs J(θ) = m l(ŷ i, y i ) = i=1 m (θ T φ(x i ) y i ) 2 i=1 Write our objective formally as minimize θ J(θ) simple example of an optimization problem; these will dominate our development of algorithms throughout the course 14

15 Let s write J(θ) a little more compactly using matrix notation; define φ(x 1 ) T y 1 φ(x 2 ) T y 2 Φ R m k =. φ(x m ) T, y Rm =. y m then J(θ) = m (θ T φ(x i ) y i ) 2 = Φθ y 2 2 i=1 ( z 2 is l 2 norm of a vector: z 2 n i=1 z2 i = z T z) Called least-squares objective function 15

16 How do we optimize a function? 1-D case (θ R): J(θ) = θ 2 2θ dj dθ = 2θ 2 θ minimum dj dθ = 0 θ 2θ 2 = 0 θ = 1 16

17 Multi-variate case: θ R k, J : R k R Generalized condition: θ J(θ) θ = 0 θ J(θ) denotes gradient of J with respect to θ θ J(θ) R n J θ 1 J θ 2. J θ k Some important rules and common gradient θ (af(θ) + bg(θ)) = a θ f(θ) + b θ g(θ), (a, b R) θ (θ T Aθ) = (A + A T )θ, (A R k k ) θ (b T θ) = b, (b R k ) 17

18 Optimizing least-squares objective J(θ) = Φθ y 2 2 = (Φθ y) T (Φθ y) = θ T Φ T Φθ 2y T Φθ + y T y Using the previous gradient rules θ J(θ) = θ (θ T Φ T Φθ 2y T Φθ + y T y) = θ (θ T Φ T Φθ) 2 θ (y T Φθ) + θ (y T y) = 2Φ T Φθ 2Φ T y Setting gradient equal to zero 2Φ T Φθ 2Φ T y = 0 θ = (Φ T Φ) 1 Φ T y known as the normal equations 18

19 Let s see how this looks in MATLAB code X = load('high_temperature.txt'); y = load('peak_demand.txt'); n = size(x,2); m = size(x,1); Phi = [X ones(m,1)]; theta = inv(phi' * Phi) * Phi' * y; theta = The normal equations are so common that MATLAB has a special operation for them % same as inv(phi' * Phi) * Phi' * y theta = Phi \ y; 19

20 Aside: general optimization problems In this class we ll consider general optimization problems minimize θ J(θ) subject to g i (θ) 0, i = 1,..., N i h i (θ) = 0, i = 1,..., N e A constrained optimization problem; g i terms are the inequality constraints; h i terms are the equality constraints. Many different classifications of optimization problems (linear programming, quadratic programming, semidefinite programming, integer programming), depending on the form of J, the g i s and the h i s. 20

21 Important distinctions in optimization is between convex (where J, g i are convex and h i linear) and non-convex problems f convex f(aθ + (1 a)θ ) af(θ) + (1 a)f(θ ) for 0 a 1 Informally speaking, we can usually find global solutions of convex problems efficiently, while for non-convex problems we must settle for local solutions or time-consuming optimization 21

22 Solving optimization problems Many generic optimization libraries We will be using YALMIP (Yet Another Linear Matrix Inequality Parser): YALMIP code for least squares optimization: theta = sdpvar(n,1); solvesdp([], sum((phi*theta - y).^2)); double(theta) ans =

23 Alternative loss functions Nothing special about least-squares loss function l(ŷ, y) = (ŷ y) 2. Some alternatives: Absolute loss: l(ŷ, y) = ŷ y Deadband loss: l(ŷ, y) = max{0, ŷ y ɛ}, ɛ R + Loss Squared Loss Absolute Loss Deadband Loss Error: y pred y 23

24 How do we find parameters that minimize absolute loss? minimize θ m θ T φ(x i ) y i i=1 Non-differentiable, can t take gradient Solution: frame as a constrained optimization problem Introduce new variables ν R m, (ν i θ T φ(x i ) y i ) m minimize ν i θ,ν subject to i=1 ν i θ T φ(x i ) y i ν i Linear program (LP): linear object and linear constraints 24

25 To solve LPs, typically need to put them in standard form: minimize c T z z subject to Az b z R n, A R Ni n, b R Ni For absolute loss LP [ ] [ θ 0 z =, c = ν 1 ] [ Φ I, A = Φ I ] [ y, b = y ] 25

26 MATLAB code c = [zeros(n,1); ones(m,1)]; A = [Phi -eye(m); -Phi -eye(m)]; b = [y; -y]; z = linprog(c,a,b); theta = z(1:n) theta = The same solution in YALMIP: theta = sdpvar(n,1); solvesdp([], sum(abs((phi*theta - y)))); double(theta) theta =

27 Which loss function should we use? Peak Hourly Demand (GW) Observed data Squared loss Absolute loss Deadband loss, eps = High Temperature (F) 27

Machine Learning 2: Nonlinear Regression

Machine Learning 2: Nonlinear Regression 15-884 Machine Learning : Nonlinear Regression J. Zico Kolter September 17, 01 1 Non-linear regression Peak Hourly Demand (GW).5 0 0 40 60 80 100 High temperature / peak demand observations for all days

More information

15-388/688 - Practical Data Science: Intro to Machine Learning & Linear Regression. J. Zico Kolter Carnegie Mellon University Spring 2018

15-388/688 - Practical Data Science: Intro to Machine Learning & Linear Regression. J. Zico Kolter Carnegie Mellon University Spring 2018 15-388/688 - Practical Data Science: Intro to Machine Learning & Linear Regression J. Zico Kolter Carnegie Mellon University Spring 2018 1 Outline Least squares regression: a simple example Machine learning

More information

Foundations of Physical Science. Unit 2: Work and Energy

Foundations of Physical Science. Unit 2: Work and Energy Foundations of Physical Science Unit 2: Work and Energy Chapter 5: Work, Energy, and Power 5.1 Work 5.2 Energy Conservation 5.3 Energy Transformations Learning Goals Calculate the amount of work done by

More information

Energy and the Environment

Energy and the Environment Energy and the Environment Energy physics definition the capacity to do work and conjunction used to connect grammatically coordinate words, phrases, or clauses the Environment the aggregate of surrounding

More information

1 Review of Winnow Algorithm

1 Review of Winnow Algorithm COS 511: Theoretical Machine Learning Lecturer: Rob Schapire Lecture # 17 Scribe: Xingyuan Fang, Ethan April 9th, 2013 1 Review of Winnow Algorithm We have studied Winnow algorithm in Algorithm 1. Algorithm

More information

Energy. on this world and elsewhere. Instructor: Gordon D. Cates Office: Physics 106a, Phone: (434)

Energy. on this world and elsewhere. Instructor: Gordon D. Cates Office: Physics 106a, Phone: (434) Energy on this world and elsewhere Instructor: Gordon D. Cates Office: Physics 106a, Phone: (434) 924-4792 email: cates@virginia.edu Course web site available at www.phys.virginia.edu, click on classes

More information

Chapter 3-4 Energy Work Power

Chapter 3-4 Energy Work Power Chapter 3-4 Energy 3-1. The Meaning of Work 3-2. Power 3-3. Kinetic Energy 3-4. Potential Energy 3-5. Energy Transformations 3-6. Conservation of Energy 3-7. The Nature of Heat 3-8. Linear Momentum 3-9.

More information

Chapter 3-4 Energy. Horsepower Kinetic Energy Work Potential Energy Power. James Watt

Chapter 3-4 Energy. Horsepower Kinetic Energy Work Potential Energy Power. James Watt Chapter 3-4 Energy Horsepower 3-1. The Meaning of Work 3-2. Power 3-3. Kinetic Energy 3-4. Potential Energy 3-5. Energy Transformations 3-6. Conservation of Energy 3-7. The Nature of Heat 3-8. Linear Momentum

More information

CPSC 540: Machine Learning

CPSC 540: Machine Learning CPSC 540: Machine Learning Proximal-Gradient Mark Schmidt University of British Columbia Winter 2018 Admin Auditting/registration forms: Pick up after class today. Assignment 1: 2 late days to hand in

More information

XI PHYSICS. M. Affan Khan LECTURER PHYSICS, AKHSS, K. https://promotephysics.wordpress.com

XI PHYSICS. M. Affan Khan LECTURER PHYSICS, AKHSS, K. https://promotephysics.wordpress.com XI PHYSICS M. Affan Khan LECTURER PHYSICS, AKHSS, K affan_414@live.com https://promotephysics.wordpress.com [WORK, POWER AND ENERGY] CHAPTER NO. 7 A little concept of vector mathematics is applied here

More information

Mark your answers ON THE EXAM ITSELF. If you are not sure of your answer you may wish to provide a brief explanation.

Mark your answers ON THE EXAM ITSELF. If you are not sure of your answer you may wish to provide a brief explanation. CS 189 Spring 2015 Introduction to Machine Learning Midterm You have 80 minutes for the exam. The exam is closed book, closed notes except your one-page crib sheet. No calculators or electronic items.

More information

Physics of Energy. Premise of this course in order to come up with such a solution, we need to understand how energy works?

Physics of Energy. Premise of this course in order to come up with such a solution, we need to understand how energy works? Physics of Energy As we discussed. Our society needs to find a sustainable energy solution that Fulfills global energy needs in the long term. Doesn t degrade the environment. Premise of this course in

More information

2.3 Work, Energy and Power

2.3 Work, Energy and Power 2.3 Work, Energy and Power Kari Eloranta 2016 Jyväskylän Lyseon lukio International Baccalaureate October 23, 2016 2.3 Work as Energy Transfer Work as Energy Transfer Work W is a quantity that gives the

More information

Lecture 8. Strong Duality Results. September 22, 2008

Lecture 8. Strong Duality Results. September 22, 2008 Strong Duality Results September 22, 2008 Outline Lecture 8 Slater Condition and its Variations Convex Objective with Linear Inequality Constraints Quadratic Objective over Quadratic Constraints Representation

More information

Electrical Energy Modeling In Y2E2 Building Based On Distributed Sensors Information

Electrical Energy Modeling In Y2E2 Building Based On Distributed Sensors Information Electrical Energy Modeling In Y2E2 Building Based On Distributed Sensors Information Mahmoud Saadat Saman Ghili Introduction Close to 40% of the primary energy consumption in the U.S. comes from commercial

More information

Energy, Work, and Power

Energy, Work, and Power Matthew W. Milligan, Work, and Power Conservation Laws an Alternative to Newton s Laws Matthew W. Milligan, Work, and Power I. - kinetic and potential - conservation II. Work - dot product - work-energy

More information

Homework 4. Convex Optimization /36-725

Homework 4. Convex Optimization /36-725 Homework 4 Convex Optimization 10-725/36-725 Due Friday November 4 at 5:30pm submitted to Christoph Dann in Gates 8013 (Remember to a submit separate writeup for each problem, with your name at the top)

More information

Support Vector Machines for Classification and Regression

Support Vector Machines for Classification and Regression CIS 520: Machine Learning Oct 04, 207 Support Vector Machines for Classification and Regression Lecturer: Shivani Agarwal Disclaimer: These notes are designed to be a supplement to the lecture. They may

More information

Energy and Work. What is energy? What is work? What is power? What is efficiency? Unit 02 Energy Slide 1

Energy and Work. What is energy? What is work? What is power? What is efficiency? Unit 02 Energy Slide 1 Energy and Work What is energy? What is work? What is power? What is efficiency? Unit 02 Energy Slide 1 Energy and Work Energy The ability to do work Work How we chance energy from one form to another

More information

Introduction to Optimization

Introduction to Optimization Introduction to Optimization Konstantin Tretyakov (kt@ut.ee) MTAT.03.227 Machine Learning So far Machine learning is important and interesting The general concept: Fitting models to data So far Machine

More information

Support Vector Machines

Support Vector Machines Support Vector Machines Support vector machines (SVMs) are one of the central concepts in all of machine learning. They are simply a combination of two ideas: linear classification via maximum (or optimal

More information

Energy and the Environment. HNRS 228 Spring 2010 Prof. Geller

Energy and the Environment. HNRS 228 Spring 2010 Prof. Geller Energy and the Environment HNRS 228 Spring 2010 Prof. Geller Good to Know Units of length, mass and time Metric Prefixes Relationship of Mass, Volume and Density The Scientific Method Speed, velocity,

More information

Contents. 1 Introduction. 1.1 History of Optimization ALG-ML SEMINAR LISSA: LINEAR TIME SECOND-ORDER STOCHASTIC ALGORITHM FEBRUARY 23, 2016

Contents. 1 Introduction. 1.1 History of Optimization ALG-ML SEMINAR LISSA: LINEAR TIME SECOND-ORDER STOCHASTIC ALGORITHM FEBRUARY 23, 2016 ALG-ML SEMINAR LISSA: LINEAR TIME SECOND-ORDER STOCHASTIC ALGORITHM FEBRUARY 23, 2016 LECTURERS: NAMAN AGARWAL AND BRIAN BULLINS SCRIBE: KIRAN VODRAHALLI Contents 1 Introduction 1 1.1 History of Optimization.....................................

More information

A Quick Tour of Linear Algebra and Optimization for Machine Learning

A Quick Tour of Linear Algebra and Optimization for Machine Learning A Quick Tour of Linear Algebra and Optimization for Machine Learning Masoud Farivar January 8, 2015 1 / 28 Outline of Part I: Review of Basic Linear Algebra Matrices and Vectors Matrix Multiplication Operators

More information

HNRS 227 Chapter 3. Energy presented by Prof. Geller Fall 2008

HNRS 227 Chapter 3. Energy presented by Prof. Geller Fall 2008 HNRS 227 Chapter 3 Energy presented by Prof. Geller Fall 2008 Don t Forget the Following Units of length, mass and time Metric Prefixes The Scientific Method Speed, velocity, acceleration Force Falling

More information

INTRODUCTION TO DATA SCIENCE

INTRODUCTION TO DATA SCIENCE INTRODUCTION TO DATA SCIENCE JOHN P DICKERSON Lecture #13 3/9/2017 CMSC320 Tuesdays & Thursdays 3:30pm 4:45pm ANNOUNCEMENTS Mini-Project #1 is due Saturday night (3/11): Seems like people are able to do

More information

Chapter 4. Energy. Work Power Kinetic Energy Potential Energy Conservation of Energy. W = Fs Work = (force)(distance)

Chapter 4. Energy. Work Power Kinetic Energy Potential Energy Conservation of Energy. W = Fs Work = (force)(distance) Chapter 4 Energy In This Chapter: Work Kinetic Energy Potential Energy Conservation of Energy Work Work is a measure of the amount of change (in a general sense) that a force produces when it acts on a

More information

Work, Power and Energy Worksheet

Work, Power and Energy Worksheet Work, Power and Energy Worksheet Name: 1. Which of the following statements are true about work? Include all that apply. a. Work is the transfer of energy into or out of a system by means of an external

More information

15-884/484 Electric Power Systems 1: DC and AC Circuits

15-884/484 Electric Power Systems 1: DC and AC Circuits 15-884/484 Electric Power Systems 1: DC and AC Circuits J. Zico Kolter October 8, 2013 1 Hydro Estimated U.S. Energy Use in 2010: ~98.0 Quads Lawrence Livermore National Laboratory Solar 0.11 0.01 8.44

More information

Support Vector Machines for Classification and Regression. 1 Linearly Separable Data: Hard Margin SVMs

Support Vector Machines for Classification and Regression. 1 Linearly Separable Data: Hard Margin SVMs E0 270 Machine Learning Lecture 5 (Jan 22, 203) Support Vector Machines for Classification and Regression Lecturer: Shivani Agarwal Disclaimer: These notes are a brief summary of the topics covered in

More information

ICS-E4030 Kernel Methods in Machine Learning

ICS-E4030 Kernel Methods in Machine Learning ICS-E4030 Kernel Methods in Machine Learning Lecture 3: Convex optimization and duality Juho Rousu 28. September, 2016 Juho Rousu 28. September, 2016 1 / 38 Convex optimization Convex optimisation This

More information

Physics Year 11 Term 1 Week 7

Physics Year 11 Term 1 Week 7 Physics Year 11 Term 1 Week 7 Energy According to Einstein, a counterpart to mass An enormously important but abstract concept Energy can be stored (coal, oil, a watch spring) Energy is something moving

More information

Machine Learning for NLP

Machine Learning for NLP Machine Learning for NLP Linear Models Joakim Nivre Uppsala University Department of Linguistics and Philology Slides adapted from Ryan McDonald, Google Research Machine Learning for NLP 1(26) Outline

More information

Forecasting the electricity consumption by aggregating specialized experts

Forecasting the electricity consumption by aggregating specialized experts Forecasting the electricity consumption by aggregating specialized experts Pierre Gaillard (EDF R&D, ENS Paris) with Yannig Goude (EDF R&D) Gilles Stoltz (CNRS, ENS Paris, HEC Paris) June 2013 WIPFOR Goal

More information

System Identification and Optimization Methods Based on Derivatives. Chapters 5 & 6 from Jang

System Identification and Optimization Methods Based on Derivatives. Chapters 5 & 6 from Jang System Identification and Optimization Methods Based on Derivatives Chapters 5 & 6 from Jang Neuro-Fuzzy and Soft Computing Model space Adaptive networks Neural networks Fuzzy inf. systems Approach space

More information

Physics 130: Questions to study for midterm #1 from Chapter 7

Physics 130: Questions to study for midterm #1 from Chapter 7 Physics 130: Questions to study for midterm #1 from Chapter 7 1. Kinetic energy is defined to be one-half the a. mass times the speed. b. mass times the speed squared. c. mass times the acceleration. d.

More information

Linear Classifiers. Michael Collins. January 18, 2012

Linear Classifiers. Michael Collins. January 18, 2012 Linear Classifiers Michael Collins January 18, 2012 Today s Lecture Binary classification problems Linear classifiers The perceptron algorithm Classification Problems: An Example Goal: build a system that

More information

Optimization for Machine Learning

Optimization for Machine Learning Optimization for Machine Learning Elman Mansimov 1 September 24, 2015 1 Modified based on Shenlong Wang s and Jake Snell s tutorials, with additional contents borrowed from Kevin Swersky and Jasper Snoek

More information

Lecture 7: Kernels for Classification and Regression

Lecture 7: Kernels for Classification and Regression Lecture 7: Kernels for Classification and Regression CS 194-10, Fall 2011 Laurent El Ghaoui EECS Department UC Berkeley September 15, 2011 Outline Outline A linear regression problem Linear auto-regressive

More information

Barrier Method. Javier Peña Convex Optimization /36-725

Barrier Method. Javier Peña Convex Optimization /36-725 Barrier Method Javier Peña Convex Optimization 10-725/36-725 1 Last time: Newton s method For root-finding F (x) = 0 x + = x F (x) 1 F (x) For optimization x f(x) x + = x 2 f(x) 1 f(x) Assume f strongly

More information

Convex Optimization & Parsimony of L p-balls representation

Convex Optimization & Parsimony of L p-balls representation Convex Optimization & Parsimony of L p -balls representation LAAS-CNRS and Institute of Mathematics, Toulouse, France IMA, January 2016 Motivation Unit balls associated with nonnegative homogeneous polynomials

More information

Applications of Linear Programming

Applications of Linear Programming Applications of Linear Programming lecturer: András London University of Szeged Institute of Informatics Department of Computational Optimization Lecture 9 Non-linear programming In case of LP, the goal

More information

Stanford Machine Learning - Week V

Stanford Machine Learning - Week V Stanford Machine Learning - Week V Eric N Johnson August 13, 2016 1 Neural Networks: Learning What learning algorithm is used by a neural network to produce parameters for a model? Suppose we have a neural

More information

Examples of Applications of Potential Functions in Problem Solving (Web Appendix to the Paper)

Examples of Applications of Potential Functions in Problem Solving (Web Appendix to the Paper) Examples of Applications of otential Functions in roblem Solving (Web Appendix to the aper) Ali Mehrizi-Sani and Reza Iravani May 5, 2010 1 Introduction otential functions may be exploited to formulate

More information

Physics 23 Notes Chapter 6 Part Two

Physics 23 Notes Chapter 6 Part Two Physics 23 Notes Chapter 6 Part Two Dr. Alward Conservation of Energy Object moves freely upward under the influence of Earth only. Its acceleration is a = -g. v 2 = vo 2 + 2ax = vo 2-2g (h-ho) = vo 2-2gh

More information

Machine Learning and Data Mining. Linear regression. Kalev Kask

Machine Learning and Data Mining. Linear regression. Kalev Kask Machine Learning and Data Mining Linear regression Kalev Kask Supervised learning Notation Features x Targets y Predictions ŷ Parameters q Learning algorithm Program ( Learner ) Change q Improve performance

More information

Lecture 10. Neural networks and optimization. Machine Learning and Data Mining November Nando de Freitas UBC. Nonlinear Supervised Learning

Lecture 10. Neural networks and optimization. Machine Learning and Data Mining November Nando de Freitas UBC. Nonlinear Supervised Learning Lecture 0 Neural networks and optimization Machine Learning and Data Mining November 2009 UBC Gradient Searching for a good solution can be interpreted as looking for a minimum of some error (loss) function

More information

r=1 r=1 argmin Q Jt (20) After computing the descent direction d Jt 2 dt H t d + P (x + d) d i = 0, i / J

r=1 r=1 argmin Q Jt (20) After computing the descent direction d Jt 2 dt H t d + P (x + d) d i = 0, i / J 7 Appendix 7. Proof of Theorem Proof. There are two main difficulties in proving the convergence of our algorithm, and none of them is addressed in previous works. First, the Hessian matrix H is a block-structured

More information

The SI unit for Energy is the joule, usually abbreviated J. One joule is equal to one kilogram meter squared per second squared:

The SI unit for Energy is the joule, usually abbreviated J. One joule is equal to one kilogram meter squared per second squared: Chapter 2 Energy Energy is an extremely loaded term. It is used in everyday parlance to mean a number of different things, many of which bear at most a passing resemblance to the term as used in physical

More information

Machine Learning. Support Vector Machines. Fabio Vandin November 20, 2017

Machine Learning. Support Vector Machines. Fabio Vandin November 20, 2017 Machine Learning Support Vector Machines Fabio Vandin November 20, 2017 1 Classification and Margin Consider a classification problem with two classes: instance set X = R d label set Y = { 1, 1}. Training

More information

The N k Problem using AC Power Flows

The N k Problem using AC Power Flows The N k Problem using AC Power Flows Sean Harnett 5-19-2011 Outline Introduction AC power flow model The optimization problem Some results Goal: find a small set of lines whose removal will cause the power

More information

T m / A. Table C2 Submicroscopic Masses [2] Symbol Meaning Best Value Approximate Value

T m / A. Table C2 Submicroscopic Masses [2] Symbol Meaning Best Value Approximate Value APPENDIX C USEFUL INFORMATION 1247 C USEFUL INFORMATION This appendix is broken into several tables. Table C1, Important Constants Table C2, Submicroscopic Masses Table C3, Solar System Data Table C4,

More information

Chapter 7. Work and Kinetic Energy

Chapter 7. Work and Kinetic Energy Chapter 7 Work and Kinetic Energy P. Lam 7_16_2018 Learning Goals for Chapter 7 To understand the concept of kinetic energy (energy of motion) To understand the meaning of work done by a force. To apply

More information

UC Berkeley Department of Electrical Engineering and Computer Science. EECS 227A Nonlinear and Convex Optimization. Solutions 6 Fall 2009

UC Berkeley Department of Electrical Engineering and Computer Science. EECS 227A Nonlinear and Convex Optimization. Solutions 6 Fall 2009 UC Berkele Department of Electrical Engineering and Computer Science EECS 227A Nonlinear and Convex Optimization Solutions 6 Fall 2009 Solution 6.1 (a) p = 1 (b) The Lagrangian is L(x,, λ) = e x + λx 2

More information

Unit 5: Energy (Part 2)

Unit 5: Energy (Part 2) SUPERCHARGED SCIENCE Unit 5: Energy (Part 2) www.sciencelearningspace.com Appropriate for Grades: Lesson 1 (K-12), Lesson 2 (K-12) Duration: 6-15 hours, depending on how many activities you do! We covered

More information

Advanced Introduction to Machine Learning

Advanced Introduction to Machine Learning 10-715 Advanced Introduction to Machine Learning Homework Due Oct 15, 10.30 am Rules Please follow these guidelines. Failure to do so, will result in loss of credit. 1. Homework is due on the due date

More information

Physics 2414 Group Exercise 8. Conservation of Energy

Physics 2414 Group Exercise 8. Conservation of Energy Physics 244 Group Exercise 8 Name : OUID : Name 2: OUID 2: Name 3: OUID 3: Name 4: OUID 4: Section Number: Solutions Solutions Conservation of Energy A mass m moves from point i to point f under the action

More information

Short Course Robust Optimization and Machine Learning. 3. Optimization in Supervised Learning

Short Course Robust Optimization and Machine Learning. 3. Optimization in Supervised Learning Short Course Robust Optimization and 3. Optimization in Supervised EECS and IEOR Departments UC Berkeley Spring seminar TRANSP-OR, Zinal, Jan. 16-19, 2012 Outline Overview of Supervised models and variants

More information

10-725/36-725: Convex Optimization Prerequisite Topics

10-725/36-725: Convex Optimization Prerequisite Topics 10-725/36-725: Convex Optimization Prerequisite Topics February 3, 2015 This is meant to be a brief, informal refresher of some topics that will form building blocks in this course. The content of the

More information

Optimization. Yuh-Jye Lee. March 21, Data Science and Machine Intelligence Lab National Chiao Tung University 1 / 29

Optimization. Yuh-Jye Lee. March 21, Data Science and Machine Intelligence Lab National Chiao Tung University 1 / 29 Optimization Yuh-Jye Lee Data Science and Machine Intelligence Lab National Chiao Tung University March 21, 2017 1 / 29 You Have Learned (Unconstrained) Optimization in Your High School Let f (x) = ax

More information

Overview. Linear Regression with One Variable Gradient Descent Linear Regression with Multiple Variables Gradient Descent with Multiple Variables

Overview. Linear Regression with One Variable Gradient Descent Linear Regression with Multiple Variables Gradient Descent with Multiple Variables Overview Linear Regression with One Variable Gradient Descent Linear Regression with Multiple Variables Gradient Descent with Multiple Variables Example: Advertising Data Data taken from An Introduction

More information

wave speed (metre/second, m/s) = frequency (hertz, Hz) x wavelength (metre, m)

wave speed (metre/second, m/s) = frequency (hertz, Hz) x wavelength (metre, m) Physics formulae: Unit P1 (GCSE Science ( Core Science )): The relationship between wave speed, frequency and wavelength: wave speed (metre/second, m/s) = frequency (hertz, Hz) x wavelength (metre, m)

More information

Optimisation and Operations Research

Optimisation and Operations Research Optimisation and Operations Research Lecture 22: Linear Programming Revisited Matthew Roughan http://www.maths.adelaide.edu.au/matthew.roughan/ Lecture_notes/OORII/ School

More information

Data Science - Convex optimization and application

Data Science - Convex optimization and application 1 Data Science - Convex optimization and application Data Science - Convex optimization and application Summary We begin by some illustrations in challenging topics in modern data science. Then, this session

More information

P1 Quick Revision Questions. P1 for AQA GCSE examination 2018 onwards

P1 Quick Revision Questions. P1 for AQA GCSE examination 2018 onwards P1 Quick Revision Questions Question 1... of 50 What type of energy is stored in a stretched elastic band? Answer 1... of 50 Elastic potential energy. Question 2... of 50 What type of energy is stored

More information

SOLUTION: The domain of a square root function only includes values for which the radicand is nonnegative.

SOLUTION: The domain of a square root function only includes values for which the radicand is nonnegative. 19. Graph each function. State the domain and range. 21. The domain of a square root function only includes values for which the radicand is nonnegative. esolutions Manual - Powered by Cognero Page 1 23.

More information

1 Review of the Perceptron Algorithm

1 Review of the Perceptron Algorithm COS 511: Theoretical Machine Learning Lecturer: Rob Schapire Lecture #15 Scribe: (James) Zhen XIANG April, 008 1 Review of the Perceptron Algorithm In the last few lectures, we talked about various kinds

More information

Gradient Descent. Ryan Tibshirani Convex Optimization /36-725

Gradient Descent. Ryan Tibshirani Convex Optimization /36-725 Gradient Descent Ryan Tibshirani Convex Optimization 10-725/36-725 Last time: canonical convex programs Linear program (LP): takes the form min x subject to c T x Gx h Ax = b Quadratic program (QP): like

More information

Chapter 6 Work, Energy, and Power. Copyright 2010 Pearson Education, Inc.

Chapter 6 Work, Energy, and Power. Copyright 2010 Pearson Education, Inc. Chapter 6 Work, Energy, and Power What Is Physics All About? Matter Energy Force Work Done by a Constant Force The definition of work, when the force is parallel to the displacement: W = Fs SI unit: newton-meter

More information

Determine the trend for time series data

Determine the trend for time series data Extra Online Questions Determine the trend for time series data Covers AS 90641 (Statistics and Modelling 3.1) Scholarship Statistics and Modelling Chapter 1 Essent ial exam notes Time series 1. The value

More information

has a lot of good notes on GR and links to other pages. General Relativity Philosophy of general relativity.

has a lot of good notes on GR and links to other pages. General Relativity Philosophy of general relativity. http://preposterousuniverse.com/grnotes/ has a lot of good notes on GR and links to other pages. General Relativity Philosophy of general relativity. As with any major theory in physics, GR has been framed

More information

Linear Models in Machine Learning

Linear Models in Machine Learning CS540 Intro to AI Linear Models in Machine Learning Lecturer: Xiaojin Zhu jerryzhu@cs.wisc.edu We briefly go over two linear models frequently used in machine learning: linear regression for, well, regression,

More information

Lecture 2 Machine Learning Review

Lecture 2 Machine Learning Review Lecture 2 Machine Learning Review CMSC 35246: Deep Learning Shubhendu Trivedi & Risi Kondor University of Chicago March 29, 2017 Things we will look at today Formal Setup for Supervised Learning Things

More information

Convex Optimization / Homework 1, due September 19

Convex Optimization / Homework 1, due September 19 Convex Optimization 1-725/36-725 Homework 1, due September 19 Instructions: You must complete Problems 1 3 and either Problem 4 or Problem 5 (your choice between the two). When you submit the homework,

More information

5.3: Calculate kinetic energy, gravitational potential energy, and elastic potential energy. Do Now: 1. Hand in your Forms of Energy Wheel

5.3: Calculate kinetic energy, gravitational potential energy, and elastic potential energy. Do Now: 1. Hand in your Forms of Energy Wheel Do Now: 1. Hand in your Forms of Energy Wheel 2. Identify the following forms of energy: a. A hiker at the top of a mountain b. A dog chasing a cat c. A rubber band being stretched Agenda: How can we calculate

More information

Part of the slides are adapted from Ziko Kolter

Part of the slides are adapted from Ziko Kolter Part of the slides are adapted from Ziko Kolter OUTLINE 1 Supervised learning: classification........................................................ 2 2 Non-linear regression/classification, overfitting,

More information

Some Applications of Stochastic. Gradient Processes

Some Applications of Stochastic. Gradient Processes Applied Mathematical Sciences, Vol. 2, 2008, no. 25, 1219-1228 Some Applications of Stochastic Gradient Processes A. Bennar 1, A. Bouamaine 2 and A. Namir 1 1 Département de Mathématiques et Informatique

More information

UNITS AND DEFINITIONS RELATED TO BIOMECHANICAL AND ELECTROMYOGRAPHICAL MEASUREMENTS

UNITS AND DEFINITIONS RELATED TO BIOMECHANICAL AND ELECTROMYOGRAPHICAL MEASUREMENTS APPENDIX B UNITS AND DEFINITIONS RELATED TO BIOMECHANICAL AND ELECTROMYOGRAPHICAL MEASUREMENTS All units used are SI (Système International d Unités). The system is based on seven well-defined base units

More information

The Representor Theorem, Kernels, and Hilbert Spaces

The Representor Theorem, Kernels, and Hilbert Spaces The Representor Theorem, Kernels, and Hilbert Spaces We will now work with infinite dimensional feature vectors and parameter vectors. The space l is defined to be the set of sequences f 1, f, f 3,...

More information

carroll/notes/ has a lot of good notes on GR and links to other pages. General Relativity Philosophy of general

carroll/notes/ has a lot of good notes on GR and links to other pages. General Relativity Philosophy of general http://pancake.uchicago.edu/ carroll/notes/ has a lot of good notes on GR and links to other pages. General Relativity Philosophy of general relativity. As with any major theory in physics, GR has been

More information

Rirdge Regression. Szymon Bobek. Institute of Applied Computer science AGH University of Science and Technology

Rirdge Regression. Szymon Bobek. Institute of Applied Computer science AGH University of Science and Technology Rirdge Regression Szymon Bobek Institute of Applied Computer science AGH University of Science and Technology Based on Carlos Guestrin adn Emily Fox slides from Coursera Specialization on Machine Learnign

More information

100 Physics Facts. 1. The standard international unit (SI unit) for mass (m) is. kg (kilograms) s (seconds)

100 Physics Facts. 1. The standard international unit (SI unit) for mass (m) is. kg (kilograms) s (seconds) 100 Physics Facts 1. The standard international unit (SI unit) for mass (m) is. kg (kilograms) 2. The standard international unit (SI unit) for time (t) is. s (seconds) 3. The standard international unit

More information

is both a Thing and a Process

is both a Thing and a Process ENERGY = Matter + ENERGY is both a Thing and a Process Matter HAS Energy. Energy is usually observable only during transfer. Allows WORK to be done. Matter is bottled-up energy. Energy is the capacity

More information

Problem Set 6: Magnetism

Problem Set 6: Magnetism University of Alabama Department of Physics and Astronomy PH 10- / LeClair Spring 008 Problem Set 6: Magnetism 1. 10 points. A wire with a weight per unit length of 0.10 N/m is suspended directly above

More information

Today. Exam 1. The Electric Force Work, Energy and Power. Comments on exam extra credit. What do these pictures have in common?

Today. Exam 1. The Electric Force Work, Energy and Power. Comments on exam extra credit. What do these pictures have in common? Today Exam 1 Announcements: The average on the first exam was 31/40 Exam extra credit is due by :00 pm Thursday February 18th. (It opens on LONCAPA today) The Electric Force Work, Energy and Power Number

More information

CS229 Supplemental Lecture notes

CS229 Supplemental Lecture notes CS229 Supplemental Lecture notes John Duchi Binary classification In binary classification problems, the target y can take on at only two values. In this set of notes, we show how to model this problem

More information

Linear Classifiers and the Perceptron

Linear Classifiers and the Perceptron Linear Classifiers and the Perceptron William Cohen February 4, 2008 1 Linear classifiers Let s assume that every instance is an n-dimensional vector of real numbers x R n, and there are only two possible

More information

WORK, POWER & ENERGY

WORK, POWER & ENERGY WORK, POWER & ENERGY Work An applied force acting over a displacement. The force being applied must be parallel to the displacement for work to be occurring. Work Force displacement Units: Newton meter

More information

Big Data Analytics. Lucas Rego Drumond

Big Data Analytics. Lucas Rego Drumond Big Data Analytics Lucas Rego Drumond Information Systems and Machine Learning Lab (ISMLL) Institute of Computer Science University of Hildesheim, Germany Predictive Models Predictive Models 1 / 34 Outline

More information

CPSC 540: Machine Learning

CPSC 540: Machine Learning CPSC 540: Machine Learning Multivariate Gaussians Mark Schmidt University of British Columbia Winter 2019 Last Time: Multivariate Gaussian http://personal.kenyon.edu/hartlaub/mellonproject/bivariate2.html

More information

15-388/688 - Practical Data Science: Nonlinear modeling, cross-validation, regularization, and evaluation

15-388/688 - Practical Data Science: Nonlinear modeling, cross-validation, regularization, and evaluation 15-388/688 - Practical Data Science: Nonlinear modeling, cross-validation, regularization, and evaluation J. Zico Kolter Carnegie Mellon University Fall 2016 1 Outline Example: return to peak demand prediction

More information

Kernel Learning via Random Fourier Representations

Kernel Learning via Random Fourier Representations Kernel Learning via Random Fourier Representations L. Law, M. Mider, X. Miscouridou, S. Ip, A. Wang Module 5: Machine Learning L. Law, M. Mider, X. Miscouridou, S. Ip, A. Wang Kernel Learning via Random

More information

CPSC 540: Machine Learning

CPSC 540: Machine Learning CPSC 540: Machine Learning Convex Optimization Mark Schmidt University of British Columbia Winter 2018 Admin Auditting/registration forms: Submit them at end of class, pick them up end of next class. I

More information

15-388/688 - Practical Data Science: Decision trees and interpretable models. J. Zico Kolter Carnegie Mellon University Spring 2018

15-388/688 - Practical Data Science: Decision trees and interpretable models. J. Zico Kolter Carnegie Mellon University Spring 2018 15-388/688 - Practical Data Science: Decision trees and interpretable models J. Zico Kolter Carnegie Mellon University Spring 2018 1 Outline Decision trees Training (classification) decision trees Interpreting

More information

Sparse Gaussian conditional random fields

Sparse Gaussian conditional random fields Sparse Gaussian conditional random fields Matt Wytock, J. ico Kolter School of Computer Science Carnegie Mellon University Pittsburgh, PA 53 {mwytock, zkolter}@cs.cmu.edu Abstract We propose sparse Gaussian

More information

OCR Physics Specification A - H156/H556

OCR Physics Specification A - H156/H556 OCR Physics Specification A - H156/H556 Module 3: Forces and Motion You should be able to demonstrate and show your understanding of: 3.1 Motion Displacement, instantaneous speed, average speed, velocity

More information

Experiment 1: Linear Regression

Experiment 1: Linear Regression Experiment 1: Linear Regression August 27, 2018 1 Description This first exercise will give you practice with linear regression. These exercises have been extensively tested with Matlab, but they should

More information

Approximation Theoretical Questions for SVMs

Approximation Theoretical Questions for SVMs Ingo Steinwart LA-UR 07-7056 October 20, 2007 Statistical Learning Theory: an Overview Support Vector Machines Informal Description of the Learning Goal X space of input samples Y space of labels, usually

More information

ENERGY UNITS. 1 cal J 1 ev J 1 btu 1055 J 1 kw h J. Your body gets 8,000 J (1900cal=1.9kcal) of energy from eating a

ENERGY UNITS. 1 cal J 1 ev J 1 btu 1055 J 1 kw h J. Your body gets 8,000 J (1900cal=1.9kcal) of energy from eating a ENERGY 1 ENERGY UNITS Energy: The ability to do work (make something happen) Joule (J) Calorie (cal) The calories on food packages are really kcal Electron Volt (ev) British Thermal Unit (btu) Kilo watt

More information

NONLINEAR CLASSIFICATION AND REGRESSION. J. Elder CSE 4404/5327 Introduction to Machine Learning and Pattern Recognition

NONLINEAR CLASSIFICATION AND REGRESSION. J. Elder CSE 4404/5327 Introduction to Machine Learning and Pattern Recognition NONLINEAR CLASSIFICATION AND REGRESSION Nonlinear Classification and Regression: Outline 2 Multi-Layer Perceptrons The Back-Propagation Learning Algorithm Generalized Linear Models Radial Basis Function

More information