Machine Learning 1: Linear Regression
|
|
- Egbert Morrison
- 5 years ago
- Views:
Transcription
1 Machine Learning 1: Linear Regression J. Zico Kolter September 18,
2 Motivation How much energy will we consume tommorrow? Difficult to estimate from a priori models But, we have lots of data from which to build a model 2
3 Energy 101 Energy: ability to do work (apply force through a distance) Unit of energy: joule (J), (also btu, kilowatt hour) 1 joule = 1 newton 1 meter = 1 kilogram 1 meter2 1 second 2 Power is a rate of energy use Unit of power: watt (W) (commonly kilo/mega/gigawatt) 1 watt = 1 joule 1 second 3
4 Forms of energy Mechanical kinetic energy: E = 1 2 mv2 Gravitational potential energy: E = mgh Thermal energy: E = 3 2 NkT Electrical energy: E = V Q Electromagnetic energy: E = hf Chemical energy Nuclear energy: E = mc 2 4
5 Energy conversion 5
6 Duquesne Light electricity consumption Hourly Demand (GW) Feb 9 Jul 13 Oct Hour of Day Data: PJM 6
7 Duquesne Light electricity consumption Source: Duquesne Light 7
8 Predict peak demand from high temperature What will peak demand be tomorrow? If we know something else about tomorrow (like the high temperature), we can use this to predict peak demand Peak Hourly Demand (GW) High Temperature (F) Data: PJM, Weather Underground (summer months, June-August) 8
9 A simple model A linear model that predicts demand: predicted peak demand = θ 1 (high temperature) + θ 2 Peak Hourly Demand (GW) Observed data Linear regression prediction High Temperature (F) Parameters of model: θ 1, θ 2 R (θ 1 = 0.046, θ 2 = 1.46) 9
10 A simple model We can use a model like this to make predictions What will be the peak demand for Duquense Light tomorrow? I know from weather report that high temperature will be 80 F (ignore, for the moment, that this too is a prediction) Then predicted peak demand is: θ θ 2 = = 2.19 GW 10
11 Formal problem setting Input: x i R n, i = 1,..., m E.g.: x i R 1 = {high temperature for day i} Output: y i R (regression task) E.g.: y i R = {peak demand for day i} Model Parameters: θ R k Predicted Output: ŷ i R E.g.: ŷ i = θ 1 x i + θ 2 11
12 For convenience, we define a function that maps inputs to feature vectors φ : R n R k For example, in our task above, if we define [ ] xi φ(x i ) = (here n = 1, k = 2) 1 then we can write ŷ i = k θ j φ j (x i ) θ T φ(x i ) j=1 12
13 Loss functions Want a model that performs well on the data we have I.e., ŷ i y i, i We measure closeness of ŷ i and y i using loss function l : R R R + Example: squared loss l(ŷ i, y i ) = (ŷ i y i ) 2 13
14 Finding model parameters, and optimization Want to find model parameters such that minimize sum of costs over all input/output pairs J(θ) = m l(ŷ i, y i ) = i=1 m (θ T φ(x i ) y i ) 2 i=1 Write our objective formally as minimize θ J(θ) simple example of an optimization problem; these will dominate our development of algorithms throughout the course 14
15 Let s write J(θ) a little more compactly using matrix notation; define φ(x 1 ) T y 1 φ(x 2 ) T y 2 Φ R m k =. φ(x m ) T, y Rm =. y m then J(θ) = m (θ T φ(x i ) y i ) 2 = Φθ y 2 2 i=1 ( z 2 is l 2 norm of a vector: z 2 n i=1 z2 i = z T z) Called least-squares objective function 15
16 How do we optimize a function? 1-D case (θ R): J(θ) = θ 2 2θ dj dθ = 2θ 2 θ minimum dj dθ = 0 θ 2θ 2 = 0 θ = 1 16
17 Multi-variate case: θ R k, J : R k R Generalized condition: θ J(θ) θ = 0 θ J(θ) denotes gradient of J with respect to θ θ J(θ) R n J θ 1 J θ 2. J θ k Some important rules and common gradient θ (af(θ) + bg(θ)) = a θ f(θ) + b θ g(θ), (a, b R) θ (θ T Aθ) = (A + A T )θ, (A R k k ) θ (b T θ) = b, (b R k ) 17
18 Optimizing least-squares objective J(θ) = Φθ y 2 2 = (Φθ y) T (Φθ y) = θ T Φ T Φθ 2y T Φθ + y T y Using the previous gradient rules θ J(θ) = θ (θ T Φ T Φθ 2y T Φθ + y T y) = θ (θ T Φ T Φθ) 2 θ (y T Φθ) + θ (y T y) = 2Φ T Φθ 2Φ T y Setting gradient equal to zero 2Φ T Φθ 2Φ T y = 0 θ = (Φ T Φ) 1 Φ T y known as the normal equations 18
19 Let s see how this looks in MATLAB code X = load('high_temperature.txt'); y = load('peak_demand.txt'); n = size(x,2); m = size(x,1); Phi = [X ones(m,1)]; theta = inv(phi' * Phi) * Phi' * y; theta = The normal equations are so common that MATLAB has a special operation for them % same as inv(phi' * Phi) * Phi' * y theta = Phi \ y; 19
20 Aside: general optimization problems In this class we ll consider general optimization problems minimize θ J(θ) subject to g i (θ) 0, i = 1,..., N i h i (θ) = 0, i = 1,..., N e A constrained optimization problem; g i terms are the inequality constraints; h i terms are the equality constraints. Many different classifications of optimization problems (linear programming, quadratic programming, semidefinite programming, integer programming), depending on the form of J, the g i s and the h i s. 20
21 Important distinctions in optimization is between convex (where J, g i are convex and h i linear) and non-convex problems f convex f(aθ + (1 a)θ ) af(θ) + (1 a)f(θ ) for 0 a 1 Informally speaking, we can usually find global solutions of convex problems efficiently, while for non-convex problems we must settle for local solutions or time-consuming optimization 21
22 Solving optimization problems Many generic optimization libraries We will be using YALMIP (Yet Another Linear Matrix Inequality Parser): YALMIP code for least squares optimization: theta = sdpvar(n,1); solvesdp([], sum((phi*theta - y).^2)); double(theta) ans =
23 Alternative loss functions Nothing special about least-squares loss function l(ŷ, y) = (ŷ y) 2. Some alternatives: Absolute loss: l(ŷ, y) = ŷ y Deadband loss: l(ŷ, y) = max{0, ŷ y ɛ}, ɛ R + Loss Squared Loss Absolute Loss Deadband Loss Error: y pred y 23
24 How do we find parameters that minimize absolute loss? minimize θ m θ T φ(x i ) y i i=1 Non-differentiable, can t take gradient Solution: frame as a constrained optimization problem Introduce new variables ν R m, (ν i θ T φ(x i ) y i ) m minimize ν i θ,ν subject to i=1 ν i θ T φ(x i ) y i ν i Linear program (LP): linear object and linear constraints 24
25 To solve LPs, typically need to put them in standard form: minimize c T z z subject to Az b z R n, A R Ni n, b R Ni For absolute loss LP [ ] [ θ 0 z =, c = ν 1 ] [ Φ I, A = Φ I ] [ y, b = y ] 25
26 MATLAB code c = [zeros(n,1); ones(m,1)]; A = [Phi -eye(m); -Phi -eye(m)]; b = [y; -y]; z = linprog(c,a,b); theta = z(1:n) theta = The same solution in YALMIP: theta = sdpvar(n,1); solvesdp([], sum(abs((phi*theta - y)))); double(theta) theta =
27 Which loss function should we use? Peak Hourly Demand (GW) Observed data Squared loss Absolute loss Deadband loss, eps = High Temperature (F) 27
Machine Learning 2: Nonlinear Regression
15-884 Machine Learning : Nonlinear Regression J. Zico Kolter September 17, 01 1 Non-linear regression Peak Hourly Demand (GW).5 0 0 40 60 80 100 High temperature / peak demand observations for all days
More information15-388/688 - Practical Data Science: Intro to Machine Learning & Linear Regression. J. Zico Kolter Carnegie Mellon University Spring 2018
15-388/688 - Practical Data Science: Intro to Machine Learning & Linear Regression J. Zico Kolter Carnegie Mellon University Spring 2018 1 Outline Least squares regression: a simple example Machine learning
More informationFoundations of Physical Science. Unit 2: Work and Energy
Foundations of Physical Science Unit 2: Work and Energy Chapter 5: Work, Energy, and Power 5.1 Work 5.2 Energy Conservation 5.3 Energy Transformations Learning Goals Calculate the amount of work done by
More informationEnergy and the Environment
Energy and the Environment Energy physics definition the capacity to do work and conjunction used to connect grammatically coordinate words, phrases, or clauses the Environment the aggregate of surrounding
More information1 Review of Winnow Algorithm
COS 511: Theoretical Machine Learning Lecturer: Rob Schapire Lecture # 17 Scribe: Xingyuan Fang, Ethan April 9th, 2013 1 Review of Winnow Algorithm We have studied Winnow algorithm in Algorithm 1. Algorithm
More informationEnergy. on this world and elsewhere. Instructor: Gordon D. Cates Office: Physics 106a, Phone: (434)
Energy on this world and elsewhere Instructor: Gordon D. Cates Office: Physics 106a, Phone: (434) 924-4792 email: cates@virginia.edu Course web site available at www.phys.virginia.edu, click on classes
More informationChapter 3-4 Energy Work Power
Chapter 3-4 Energy 3-1. The Meaning of Work 3-2. Power 3-3. Kinetic Energy 3-4. Potential Energy 3-5. Energy Transformations 3-6. Conservation of Energy 3-7. The Nature of Heat 3-8. Linear Momentum 3-9.
More informationChapter 3-4 Energy. Horsepower Kinetic Energy Work Potential Energy Power. James Watt
Chapter 3-4 Energy Horsepower 3-1. The Meaning of Work 3-2. Power 3-3. Kinetic Energy 3-4. Potential Energy 3-5. Energy Transformations 3-6. Conservation of Energy 3-7. The Nature of Heat 3-8. Linear Momentum
More informationCPSC 540: Machine Learning
CPSC 540: Machine Learning Proximal-Gradient Mark Schmidt University of British Columbia Winter 2018 Admin Auditting/registration forms: Pick up after class today. Assignment 1: 2 late days to hand in
More informationXI PHYSICS. M. Affan Khan LECTURER PHYSICS, AKHSS, K. https://promotephysics.wordpress.com
XI PHYSICS M. Affan Khan LECTURER PHYSICS, AKHSS, K affan_414@live.com https://promotephysics.wordpress.com [WORK, POWER AND ENERGY] CHAPTER NO. 7 A little concept of vector mathematics is applied here
More informationMark your answers ON THE EXAM ITSELF. If you are not sure of your answer you may wish to provide a brief explanation.
CS 189 Spring 2015 Introduction to Machine Learning Midterm You have 80 minutes for the exam. The exam is closed book, closed notes except your one-page crib sheet. No calculators or electronic items.
More informationPhysics of Energy. Premise of this course in order to come up with such a solution, we need to understand how energy works?
Physics of Energy As we discussed. Our society needs to find a sustainable energy solution that Fulfills global energy needs in the long term. Doesn t degrade the environment. Premise of this course in
More information2.3 Work, Energy and Power
2.3 Work, Energy and Power Kari Eloranta 2016 Jyväskylän Lyseon lukio International Baccalaureate October 23, 2016 2.3 Work as Energy Transfer Work as Energy Transfer Work W is a quantity that gives the
More informationLecture 8. Strong Duality Results. September 22, 2008
Strong Duality Results September 22, 2008 Outline Lecture 8 Slater Condition and its Variations Convex Objective with Linear Inequality Constraints Quadratic Objective over Quadratic Constraints Representation
More informationElectrical Energy Modeling In Y2E2 Building Based On Distributed Sensors Information
Electrical Energy Modeling In Y2E2 Building Based On Distributed Sensors Information Mahmoud Saadat Saman Ghili Introduction Close to 40% of the primary energy consumption in the U.S. comes from commercial
More informationEnergy, Work, and Power
Matthew W. Milligan, Work, and Power Conservation Laws an Alternative to Newton s Laws Matthew W. Milligan, Work, and Power I. - kinetic and potential - conservation II. Work - dot product - work-energy
More informationHomework 4. Convex Optimization /36-725
Homework 4 Convex Optimization 10-725/36-725 Due Friday November 4 at 5:30pm submitted to Christoph Dann in Gates 8013 (Remember to a submit separate writeup for each problem, with your name at the top)
More informationSupport Vector Machines for Classification and Regression
CIS 520: Machine Learning Oct 04, 207 Support Vector Machines for Classification and Regression Lecturer: Shivani Agarwal Disclaimer: These notes are designed to be a supplement to the lecture. They may
More informationEnergy and Work. What is energy? What is work? What is power? What is efficiency? Unit 02 Energy Slide 1
Energy and Work What is energy? What is work? What is power? What is efficiency? Unit 02 Energy Slide 1 Energy and Work Energy The ability to do work Work How we chance energy from one form to another
More informationIntroduction to Optimization
Introduction to Optimization Konstantin Tretyakov (kt@ut.ee) MTAT.03.227 Machine Learning So far Machine learning is important and interesting The general concept: Fitting models to data So far Machine
More informationSupport Vector Machines
Support Vector Machines Support vector machines (SVMs) are one of the central concepts in all of machine learning. They are simply a combination of two ideas: linear classification via maximum (or optimal
More informationEnergy and the Environment. HNRS 228 Spring 2010 Prof. Geller
Energy and the Environment HNRS 228 Spring 2010 Prof. Geller Good to Know Units of length, mass and time Metric Prefixes Relationship of Mass, Volume and Density The Scientific Method Speed, velocity,
More informationContents. 1 Introduction. 1.1 History of Optimization ALG-ML SEMINAR LISSA: LINEAR TIME SECOND-ORDER STOCHASTIC ALGORITHM FEBRUARY 23, 2016
ALG-ML SEMINAR LISSA: LINEAR TIME SECOND-ORDER STOCHASTIC ALGORITHM FEBRUARY 23, 2016 LECTURERS: NAMAN AGARWAL AND BRIAN BULLINS SCRIBE: KIRAN VODRAHALLI Contents 1 Introduction 1 1.1 History of Optimization.....................................
More informationA Quick Tour of Linear Algebra and Optimization for Machine Learning
A Quick Tour of Linear Algebra and Optimization for Machine Learning Masoud Farivar January 8, 2015 1 / 28 Outline of Part I: Review of Basic Linear Algebra Matrices and Vectors Matrix Multiplication Operators
More informationHNRS 227 Chapter 3. Energy presented by Prof. Geller Fall 2008
HNRS 227 Chapter 3 Energy presented by Prof. Geller Fall 2008 Don t Forget the Following Units of length, mass and time Metric Prefixes The Scientific Method Speed, velocity, acceleration Force Falling
More informationINTRODUCTION TO DATA SCIENCE
INTRODUCTION TO DATA SCIENCE JOHN P DICKERSON Lecture #13 3/9/2017 CMSC320 Tuesdays & Thursdays 3:30pm 4:45pm ANNOUNCEMENTS Mini-Project #1 is due Saturday night (3/11): Seems like people are able to do
More informationChapter 4. Energy. Work Power Kinetic Energy Potential Energy Conservation of Energy. W = Fs Work = (force)(distance)
Chapter 4 Energy In This Chapter: Work Kinetic Energy Potential Energy Conservation of Energy Work Work is a measure of the amount of change (in a general sense) that a force produces when it acts on a
More informationWork, Power and Energy Worksheet
Work, Power and Energy Worksheet Name: 1. Which of the following statements are true about work? Include all that apply. a. Work is the transfer of energy into or out of a system by means of an external
More information15-884/484 Electric Power Systems 1: DC and AC Circuits
15-884/484 Electric Power Systems 1: DC and AC Circuits J. Zico Kolter October 8, 2013 1 Hydro Estimated U.S. Energy Use in 2010: ~98.0 Quads Lawrence Livermore National Laboratory Solar 0.11 0.01 8.44
More informationSupport Vector Machines for Classification and Regression. 1 Linearly Separable Data: Hard Margin SVMs
E0 270 Machine Learning Lecture 5 (Jan 22, 203) Support Vector Machines for Classification and Regression Lecturer: Shivani Agarwal Disclaimer: These notes are a brief summary of the topics covered in
More informationICS-E4030 Kernel Methods in Machine Learning
ICS-E4030 Kernel Methods in Machine Learning Lecture 3: Convex optimization and duality Juho Rousu 28. September, 2016 Juho Rousu 28. September, 2016 1 / 38 Convex optimization Convex optimisation This
More informationPhysics Year 11 Term 1 Week 7
Physics Year 11 Term 1 Week 7 Energy According to Einstein, a counterpart to mass An enormously important but abstract concept Energy can be stored (coal, oil, a watch spring) Energy is something moving
More informationMachine Learning for NLP
Machine Learning for NLP Linear Models Joakim Nivre Uppsala University Department of Linguistics and Philology Slides adapted from Ryan McDonald, Google Research Machine Learning for NLP 1(26) Outline
More informationForecasting the electricity consumption by aggregating specialized experts
Forecasting the electricity consumption by aggregating specialized experts Pierre Gaillard (EDF R&D, ENS Paris) with Yannig Goude (EDF R&D) Gilles Stoltz (CNRS, ENS Paris, HEC Paris) June 2013 WIPFOR Goal
More informationSystem Identification and Optimization Methods Based on Derivatives. Chapters 5 & 6 from Jang
System Identification and Optimization Methods Based on Derivatives Chapters 5 & 6 from Jang Neuro-Fuzzy and Soft Computing Model space Adaptive networks Neural networks Fuzzy inf. systems Approach space
More informationPhysics 130: Questions to study for midterm #1 from Chapter 7
Physics 130: Questions to study for midterm #1 from Chapter 7 1. Kinetic energy is defined to be one-half the a. mass times the speed. b. mass times the speed squared. c. mass times the acceleration. d.
More informationLinear Classifiers. Michael Collins. January 18, 2012
Linear Classifiers Michael Collins January 18, 2012 Today s Lecture Binary classification problems Linear classifiers The perceptron algorithm Classification Problems: An Example Goal: build a system that
More informationOptimization for Machine Learning
Optimization for Machine Learning Elman Mansimov 1 September 24, 2015 1 Modified based on Shenlong Wang s and Jake Snell s tutorials, with additional contents borrowed from Kevin Swersky and Jasper Snoek
More informationLecture 7: Kernels for Classification and Regression
Lecture 7: Kernels for Classification and Regression CS 194-10, Fall 2011 Laurent El Ghaoui EECS Department UC Berkeley September 15, 2011 Outline Outline A linear regression problem Linear auto-regressive
More informationBarrier Method. Javier Peña Convex Optimization /36-725
Barrier Method Javier Peña Convex Optimization 10-725/36-725 1 Last time: Newton s method For root-finding F (x) = 0 x + = x F (x) 1 F (x) For optimization x f(x) x + = x 2 f(x) 1 f(x) Assume f strongly
More informationConvex Optimization & Parsimony of L p-balls representation
Convex Optimization & Parsimony of L p -balls representation LAAS-CNRS and Institute of Mathematics, Toulouse, France IMA, January 2016 Motivation Unit balls associated with nonnegative homogeneous polynomials
More informationApplications of Linear Programming
Applications of Linear Programming lecturer: András London University of Szeged Institute of Informatics Department of Computational Optimization Lecture 9 Non-linear programming In case of LP, the goal
More informationStanford Machine Learning - Week V
Stanford Machine Learning - Week V Eric N Johnson August 13, 2016 1 Neural Networks: Learning What learning algorithm is used by a neural network to produce parameters for a model? Suppose we have a neural
More informationExamples of Applications of Potential Functions in Problem Solving (Web Appendix to the Paper)
Examples of Applications of otential Functions in roblem Solving (Web Appendix to the aper) Ali Mehrizi-Sani and Reza Iravani May 5, 2010 1 Introduction otential functions may be exploited to formulate
More informationPhysics 23 Notes Chapter 6 Part Two
Physics 23 Notes Chapter 6 Part Two Dr. Alward Conservation of Energy Object moves freely upward under the influence of Earth only. Its acceleration is a = -g. v 2 = vo 2 + 2ax = vo 2-2g (h-ho) = vo 2-2gh
More informationMachine Learning and Data Mining. Linear regression. Kalev Kask
Machine Learning and Data Mining Linear regression Kalev Kask Supervised learning Notation Features x Targets y Predictions ŷ Parameters q Learning algorithm Program ( Learner ) Change q Improve performance
More informationLecture 10. Neural networks and optimization. Machine Learning and Data Mining November Nando de Freitas UBC. Nonlinear Supervised Learning
Lecture 0 Neural networks and optimization Machine Learning and Data Mining November 2009 UBC Gradient Searching for a good solution can be interpreted as looking for a minimum of some error (loss) function
More informationr=1 r=1 argmin Q Jt (20) After computing the descent direction d Jt 2 dt H t d + P (x + d) d i = 0, i / J
7 Appendix 7. Proof of Theorem Proof. There are two main difficulties in proving the convergence of our algorithm, and none of them is addressed in previous works. First, the Hessian matrix H is a block-structured
More informationThe SI unit for Energy is the joule, usually abbreviated J. One joule is equal to one kilogram meter squared per second squared:
Chapter 2 Energy Energy is an extremely loaded term. It is used in everyday parlance to mean a number of different things, many of which bear at most a passing resemblance to the term as used in physical
More informationMachine Learning. Support Vector Machines. Fabio Vandin November 20, 2017
Machine Learning Support Vector Machines Fabio Vandin November 20, 2017 1 Classification and Margin Consider a classification problem with two classes: instance set X = R d label set Y = { 1, 1}. Training
More informationThe N k Problem using AC Power Flows
The N k Problem using AC Power Flows Sean Harnett 5-19-2011 Outline Introduction AC power flow model The optimization problem Some results Goal: find a small set of lines whose removal will cause the power
More informationT m / A. Table C2 Submicroscopic Masses [2] Symbol Meaning Best Value Approximate Value
APPENDIX C USEFUL INFORMATION 1247 C USEFUL INFORMATION This appendix is broken into several tables. Table C1, Important Constants Table C2, Submicroscopic Masses Table C3, Solar System Data Table C4,
More informationChapter 7. Work and Kinetic Energy
Chapter 7 Work and Kinetic Energy P. Lam 7_16_2018 Learning Goals for Chapter 7 To understand the concept of kinetic energy (energy of motion) To understand the meaning of work done by a force. To apply
More informationUC Berkeley Department of Electrical Engineering and Computer Science. EECS 227A Nonlinear and Convex Optimization. Solutions 6 Fall 2009
UC Berkele Department of Electrical Engineering and Computer Science EECS 227A Nonlinear and Convex Optimization Solutions 6 Fall 2009 Solution 6.1 (a) p = 1 (b) The Lagrangian is L(x,, λ) = e x + λx 2
More informationUnit 5: Energy (Part 2)
SUPERCHARGED SCIENCE Unit 5: Energy (Part 2) www.sciencelearningspace.com Appropriate for Grades: Lesson 1 (K-12), Lesson 2 (K-12) Duration: 6-15 hours, depending on how many activities you do! We covered
More informationAdvanced Introduction to Machine Learning
10-715 Advanced Introduction to Machine Learning Homework Due Oct 15, 10.30 am Rules Please follow these guidelines. Failure to do so, will result in loss of credit. 1. Homework is due on the due date
More informationPhysics 2414 Group Exercise 8. Conservation of Energy
Physics 244 Group Exercise 8 Name : OUID : Name 2: OUID 2: Name 3: OUID 3: Name 4: OUID 4: Section Number: Solutions Solutions Conservation of Energy A mass m moves from point i to point f under the action
More informationShort Course Robust Optimization and Machine Learning. 3. Optimization in Supervised Learning
Short Course Robust Optimization and 3. Optimization in Supervised EECS and IEOR Departments UC Berkeley Spring seminar TRANSP-OR, Zinal, Jan. 16-19, 2012 Outline Overview of Supervised models and variants
More information10-725/36-725: Convex Optimization Prerequisite Topics
10-725/36-725: Convex Optimization Prerequisite Topics February 3, 2015 This is meant to be a brief, informal refresher of some topics that will form building blocks in this course. The content of the
More informationOptimization. Yuh-Jye Lee. March 21, Data Science and Machine Intelligence Lab National Chiao Tung University 1 / 29
Optimization Yuh-Jye Lee Data Science and Machine Intelligence Lab National Chiao Tung University March 21, 2017 1 / 29 You Have Learned (Unconstrained) Optimization in Your High School Let f (x) = ax
More informationOverview. Linear Regression with One Variable Gradient Descent Linear Regression with Multiple Variables Gradient Descent with Multiple Variables
Overview Linear Regression with One Variable Gradient Descent Linear Regression with Multiple Variables Gradient Descent with Multiple Variables Example: Advertising Data Data taken from An Introduction
More informationwave speed (metre/second, m/s) = frequency (hertz, Hz) x wavelength (metre, m)
Physics formulae: Unit P1 (GCSE Science ( Core Science )): The relationship between wave speed, frequency and wavelength: wave speed (metre/second, m/s) = frequency (hertz, Hz) x wavelength (metre, m)
More informationOptimisation and Operations Research
Optimisation and Operations Research Lecture 22: Linear Programming Revisited Matthew Roughan http://www.maths.adelaide.edu.au/matthew.roughan/ Lecture_notes/OORII/ School
More informationData Science - Convex optimization and application
1 Data Science - Convex optimization and application Data Science - Convex optimization and application Summary We begin by some illustrations in challenging topics in modern data science. Then, this session
More informationP1 Quick Revision Questions. P1 for AQA GCSE examination 2018 onwards
P1 Quick Revision Questions Question 1... of 50 What type of energy is stored in a stretched elastic band? Answer 1... of 50 Elastic potential energy. Question 2... of 50 What type of energy is stored
More informationSOLUTION: The domain of a square root function only includes values for which the radicand is nonnegative.
19. Graph each function. State the domain and range. 21. The domain of a square root function only includes values for which the radicand is nonnegative. esolutions Manual - Powered by Cognero Page 1 23.
More information1 Review of the Perceptron Algorithm
COS 511: Theoretical Machine Learning Lecturer: Rob Schapire Lecture #15 Scribe: (James) Zhen XIANG April, 008 1 Review of the Perceptron Algorithm In the last few lectures, we talked about various kinds
More informationGradient Descent. Ryan Tibshirani Convex Optimization /36-725
Gradient Descent Ryan Tibshirani Convex Optimization 10-725/36-725 Last time: canonical convex programs Linear program (LP): takes the form min x subject to c T x Gx h Ax = b Quadratic program (QP): like
More informationChapter 6 Work, Energy, and Power. Copyright 2010 Pearson Education, Inc.
Chapter 6 Work, Energy, and Power What Is Physics All About? Matter Energy Force Work Done by a Constant Force The definition of work, when the force is parallel to the displacement: W = Fs SI unit: newton-meter
More informationDetermine the trend for time series data
Extra Online Questions Determine the trend for time series data Covers AS 90641 (Statistics and Modelling 3.1) Scholarship Statistics and Modelling Chapter 1 Essent ial exam notes Time series 1. The value
More informationhas a lot of good notes on GR and links to other pages. General Relativity Philosophy of general relativity.
http://preposterousuniverse.com/grnotes/ has a lot of good notes on GR and links to other pages. General Relativity Philosophy of general relativity. As with any major theory in physics, GR has been framed
More informationLinear Models in Machine Learning
CS540 Intro to AI Linear Models in Machine Learning Lecturer: Xiaojin Zhu jerryzhu@cs.wisc.edu We briefly go over two linear models frequently used in machine learning: linear regression for, well, regression,
More informationLecture 2 Machine Learning Review
Lecture 2 Machine Learning Review CMSC 35246: Deep Learning Shubhendu Trivedi & Risi Kondor University of Chicago March 29, 2017 Things we will look at today Formal Setup for Supervised Learning Things
More informationConvex Optimization / Homework 1, due September 19
Convex Optimization 1-725/36-725 Homework 1, due September 19 Instructions: You must complete Problems 1 3 and either Problem 4 or Problem 5 (your choice between the two). When you submit the homework,
More information5.3: Calculate kinetic energy, gravitational potential energy, and elastic potential energy. Do Now: 1. Hand in your Forms of Energy Wheel
Do Now: 1. Hand in your Forms of Energy Wheel 2. Identify the following forms of energy: a. A hiker at the top of a mountain b. A dog chasing a cat c. A rubber band being stretched Agenda: How can we calculate
More informationPart of the slides are adapted from Ziko Kolter
Part of the slides are adapted from Ziko Kolter OUTLINE 1 Supervised learning: classification........................................................ 2 2 Non-linear regression/classification, overfitting,
More informationSome Applications of Stochastic. Gradient Processes
Applied Mathematical Sciences, Vol. 2, 2008, no. 25, 1219-1228 Some Applications of Stochastic Gradient Processes A. Bennar 1, A. Bouamaine 2 and A. Namir 1 1 Département de Mathématiques et Informatique
More informationUNITS AND DEFINITIONS RELATED TO BIOMECHANICAL AND ELECTROMYOGRAPHICAL MEASUREMENTS
APPENDIX B UNITS AND DEFINITIONS RELATED TO BIOMECHANICAL AND ELECTROMYOGRAPHICAL MEASUREMENTS All units used are SI (Système International d Unités). The system is based on seven well-defined base units
More informationThe Representor Theorem, Kernels, and Hilbert Spaces
The Representor Theorem, Kernels, and Hilbert Spaces We will now work with infinite dimensional feature vectors and parameter vectors. The space l is defined to be the set of sequences f 1, f, f 3,...
More informationcarroll/notes/ has a lot of good notes on GR and links to other pages. General Relativity Philosophy of general
http://pancake.uchicago.edu/ carroll/notes/ has a lot of good notes on GR and links to other pages. General Relativity Philosophy of general relativity. As with any major theory in physics, GR has been
More informationRirdge Regression. Szymon Bobek. Institute of Applied Computer science AGH University of Science and Technology
Rirdge Regression Szymon Bobek Institute of Applied Computer science AGH University of Science and Technology Based on Carlos Guestrin adn Emily Fox slides from Coursera Specialization on Machine Learnign
More information100 Physics Facts. 1. The standard international unit (SI unit) for mass (m) is. kg (kilograms) s (seconds)
100 Physics Facts 1. The standard international unit (SI unit) for mass (m) is. kg (kilograms) 2. The standard international unit (SI unit) for time (t) is. s (seconds) 3. The standard international unit
More informationis both a Thing and a Process
ENERGY = Matter + ENERGY is both a Thing and a Process Matter HAS Energy. Energy is usually observable only during transfer. Allows WORK to be done. Matter is bottled-up energy. Energy is the capacity
More informationProblem Set 6: Magnetism
University of Alabama Department of Physics and Astronomy PH 10- / LeClair Spring 008 Problem Set 6: Magnetism 1. 10 points. A wire with a weight per unit length of 0.10 N/m is suspended directly above
More informationToday. Exam 1. The Electric Force Work, Energy and Power. Comments on exam extra credit. What do these pictures have in common?
Today Exam 1 Announcements: The average on the first exam was 31/40 Exam extra credit is due by :00 pm Thursday February 18th. (It opens on LONCAPA today) The Electric Force Work, Energy and Power Number
More informationCS229 Supplemental Lecture notes
CS229 Supplemental Lecture notes John Duchi Binary classification In binary classification problems, the target y can take on at only two values. In this set of notes, we show how to model this problem
More informationLinear Classifiers and the Perceptron
Linear Classifiers and the Perceptron William Cohen February 4, 2008 1 Linear classifiers Let s assume that every instance is an n-dimensional vector of real numbers x R n, and there are only two possible
More informationWORK, POWER & ENERGY
WORK, POWER & ENERGY Work An applied force acting over a displacement. The force being applied must be parallel to the displacement for work to be occurring. Work Force displacement Units: Newton meter
More informationBig Data Analytics. Lucas Rego Drumond
Big Data Analytics Lucas Rego Drumond Information Systems and Machine Learning Lab (ISMLL) Institute of Computer Science University of Hildesheim, Germany Predictive Models Predictive Models 1 / 34 Outline
More informationCPSC 540: Machine Learning
CPSC 540: Machine Learning Multivariate Gaussians Mark Schmidt University of British Columbia Winter 2019 Last Time: Multivariate Gaussian http://personal.kenyon.edu/hartlaub/mellonproject/bivariate2.html
More information15-388/688 - Practical Data Science: Nonlinear modeling, cross-validation, regularization, and evaluation
15-388/688 - Practical Data Science: Nonlinear modeling, cross-validation, regularization, and evaluation J. Zico Kolter Carnegie Mellon University Fall 2016 1 Outline Example: return to peak demand prediction
More informationKernel Learning via Random Fourier Representations
Kernel Learning via Random Fourier Representations L. Law, M. Mider, X. Miscouridou, S. Ip, A. Wang Module 5: Machine Learning L. Law, M. Mider, X. Miscouridou, S. Ip, A. Wang Kernel Learning via Random
More informationCPSC 540: Machine Learning
CPSC 540: Machine Learning Convex Optimization Mark Schmidt University of British Columbia Winter 2018 Admin Auditting/registration forms: Submit them at end of class, pick them up end of next class. I
More information15-388/688 - Practical Data Science: Decision trees and interpretable models. J. Zico Kolter Carnegie Mellon University Spring 2018
15-388/688 - Practical Data Science: Decision trees and interpretable models J. Zico Kolter Carnegie Mellon University Spring 2018 1 Outline Decision trees Training (classification) decision trees Interpreting
More informationSparse Gaussian conditional random fields
Sparse Gaussian conditional random fields Matt Wytock, J. ico Kolter School of Computer Science Carnegie Mellon University Pittsburgh, PA 53 {mwytock, zkolter}@cs.cmu.edu Abstract We propose sparse Gaussian
More informationOCR Physics Specification A - H156/H556
OCR Physics Specification A - H156/H556 Module 3: Forces and Motion You should be able to demonstrate and show your understanding of: 3.1 Motion Displacement, instantaneous speed, average speed, velocity
More informationExperiment 1: Linear Regression
Experiment 1: Linear Regression August 27, 2018 1 Description This first exercise will give you practice with linear regression. These exercises have been extensively tested with Matlab, but they should
More informationApproximation Theoretical Questions for SVMs
Ingo Steinwart LA-UR 07-7056 October 20, 2007 Statistical Learning Theory: an Overview Support Vector Machines Informal Description of the Learning Goal X space of input samples Y space of labels, usually
More informationENERGY UNITS. 1 cal J 1 ev J 1 btu 1055 J 1 kw h J. Your body gets 8,000 J (1900cal=1.9kcal) of energy from eating a
ENERGY 1 ENERGY UNITS Energy: The ability to do work (make something happen) Joule (J) Calorie (cal) The calories on food packages are really kcal Electron Volt (ev) British Thermal Unit (btu) Kilo watt
More informationNONLINEAR CLASSIFICATION AND REGRESSION. J. Elder CSE 4404/5327 Introduction to Machine Learning and Pattern Recognition
NONLINEAR CLASSIFICATION AND REGRESSION Nonlinear Classification and Regression: Outline 2 Multi-Layer Perceptrons The Back-Propagation Learning Algorithm Generalized Linear Models Radial Basis Function
More information