Mixed Integer Programming (MIP) for Daily Fantasy Sports, Statistics and Marketing

Size: px
Start display at page:

Download "Mixed Integer Programming (MIP) for Daily Fantasy Sports, Statistics and Marketing"

Transcription

1 Mixed Integer Programming (MIP) for Daily Fantasy Sports, Statistics and Marketing Juan Pablo Vielma Massachusetts Institute of Technology AM/ES 121, SEAS, Harvard. Boston, MA, November, 2016.

2 MIP & Daily Fantasy Sports

3 Example Entry

4 100% of the money in the top 20% lineups 26% of the money in the top 10 lineups (0.04%)

5 Building a Lineup

6 Integer Programming Formulation We will make a bunch of lineups consisting of 9 players each Use an integer programming approach to find these lineups Decision variables

7 Basic Feasibility 9 different players Salary less than $50,000 Basic constraints

8 Position Feasibility Between 2 and 3 centers Between 3 and 4 wingers Between 2 and 3 defensemen 1 goalie Position constraints

9 Team Feasibility At least 3 different NHL teams Team constraints

10 Maximize Points Forecasted points for player p: Score type Points Goal 3 Assist 2 Shot on Goal 0.5 Blocked Shot 0.5 Short Handed Point Bonus (Goal/Assist) 1 Shootout Goal 0.2 Hat Trick Bonus 1.5 Win (goalie only) 3 Save (goalie only) 0.2 Goal allowed (goalie only) -1 Shutout Bonus (goalie only) 2 Table 1 Points system for NHL contests in DraftKings. Points Objective Function

11 Lineup Projections: $9500 $2700 $4600 $3800 $4600 $6400 $5200 $5100 $8000 W UTIL D D C C W W G 23 points on average

12 Need > 38 points for a chance to win

13 Increase variance to have a chance

14 Structural Correlations - Teams

15 Structural Correlations - Lines Goal = 3 pt, assist = 2 pt

16 Structural Correlations Lines = Stacking At least 1 complete line (3 players per line) At least 2 partial lines (at least 2 players per line) 1 complete line constraint 2 partial lines constraint

17 Structural Correlations Goalie Against Opposing Players

18 Structural Correlations Goalie Against Skaters No skater against goalie No skater against goalie constraint

19 Good, but not great chance

20 Play many diverse Lineups Make sure lineup l has no more than g players in common with lineups 1 to l-1 Diversity constraint

21 Were we able to do it? 200 lineups

22 Policy Change 200 lineups -> 100 lineups

23 Were we able to continue it? December 12, 2015 > $15K 100 lineups

24 How can you do it? Download Code from Github:

25 Performance Time < 30 Minutes Runtime (minutes) CBC GLPK Gurobi CBC GLPK Gurobi Solver

26 MIP and Statistics: Inference for the Chilean Earthquake

27 The 2010 Chilean Earthquake

28 6 th Strongest in Recorded History (8.8)

29 Impact on Educational Achievement? PSU = SAT

30 Earthquake Intensity + Great Demographic Info

31 Randomized experiment Treatment / control have similar characteristics (covariates). Treatment( Control(

32 Covariate Balance Important for Inference Dose Covariate 1 2 Gender Male Female School SES Low Mid-low Medium Mid-high High Mother s education Primary Secondary Technical College Missing

33 Observational Study: e.g. After Earthquake Treatment / control can have different characteristics. Treatment( Control( Solution = Matching?

34 Matching Treated Units: T = {t 1,...,t T } Control Units: C = {c 1,...,c C } Observed Covariates: P = {p 1,...,p P } Covariate Values: x t = x t p p2p, x c = x c p p2p, t 2 T c 2 C

35 Nearest Neighbor Matching minimize m subject to 0 apple m t,c apple 1 X t2t X c2c X t2t X c2c t,cm t,c + X i2i m t,c = m, t 2 T 1 m t,c apple 1, c 2 C! i µ i (m m t,c 2 {0, 1}, t 2 T, c 2 C i (m) apple " i, i 2 I e.g. t,c = x t x c 2 Easy to solve

36 Balance Before After Matching SIMCE school (decile) SIMCE student (decile) GPA ranking (decile) Attendance (decile) Rural school Catholic school High SES school Mid High SES school Mid SES school Mid Low SES school Public School Voucher School

37 Maximum Cardinality Matching max s.t. X t2t p,k X X t2t c2c m t,c X m t,c apple 1, t2t X m t,c apple 1, c2c X c/2c p,k m t,c = X t/2t p,k X c2c p,k m t,c 8c 2 C 8t 2 T 8p 2 P,k 2 K(p) m t,c 2 {0, 1} 8t 2 T, c 2 C. K(p) ={x c p} c2p [ {x t p} t2t C p,k = {c 2 C : x c p = k} T p,k = {t 2 T : x t p = k} Very hard to solve ( and very hard to understand! )

38 Advanced Maximum Cardinality Matching max s.t. X t2t X t2t X x t t2t p,k x t = x t = X c2c y c, X Matching without matching variables K(p) ={x c p} c2p [ {x t p} t2t C p,k = {c 2 C : x c p = k} c2c p,k y c, 8p 2 P, k 2 K(p) x t 2 {0, 1} 8t 2 T y c 2 {0, 1} 8c 2 C. T p,k = {t 2 T : x t p = k} Easy to solve: Small, but inherits matching properties

39 Balance Before After Cardinality Matching SIMCE school (decile) SIMCE student (decile) GPA ranking (decile) Attendance (decile) Rural school Catholic school High SES school Mid High SES school Mid SES school Mid Low SES school Public School Voucher School

40 Can Also do Multiple Doses Dose 1. No quake 2. Medium quake 3. Bad quake Dose Covariate Gender Male Female School SES Low Mid-low Medium Mid-high High Mother s education Primary Secondary Technical College Missing

41 Relative (To no Quake) Attendance Impact 3 Doses Attendance (%) Medium quake Bad quake

42 Relative (To no Quake) PSU Score Impact 3 Doses PSU Score !? 2 3 Medium quake Bad quake

43 MIP and Marketing: Chewbacca or BB-8?

44 Adaptive Preference Questionnaires Feature SX530 RX100 Zoom 50x 3.6x Prize $ $ Weight ounces 7.5 ounces Prefer Feature TG-4 G9 Waterproof Yes No Prize $ $ Weight 7.36 lb 7.5 lb Prefer We recommend: Feature TG-4 Galaxy 2 Waterproof Yes No Prize $ $ Viewfinder Electronic Optical Prefer

45 Choice-based Conjoint Analysis (CBCA) Feature Chewbacca BB-8 Wookiee Yes No Droid No Yes Blaster Yes No 0 1 1A = x 2 0 I would buy toy Product Profile x 1 x 2

46 Preference Model and Geometric Interpretation Utilities for 2 products, d features, logitmodel U 1 = U 2 = x = X d i=1 i x 1 i + 1 x = X d i=1 i x 2 i + 2 part-worths product profile noise (gumbel) Utility maximizing customer Geometric interpretation of preference for product 1 without error 2 x 1 x 2 0 x 1 x 2, U 1 U 2 1

47 Next Question = Minimize (Expected) Volume Good estimator Question? for? Center Postchoice Choice of ellipsoid balance symmetry

48 With Error = Volume of Ellipsoid f x 1,x 2 Prior distribution Answer likelihood Posterior distribution Prior ellipsoid Question/Answer Posterior ellipsoid

49 Rules of Thumb Still Good For Ellipsoid Volume ( µ) 0 1 ( µ) apple r Choice balance: Minimize distance to center µ x 1 x 2 µ Postchoice symmetry: Maximize variance of question x 1 x 2 0 x 1 x 2

50 Simple Formula for Expected Volume Expected Volume = Non-convex function distance: d := µ x 1 x 2 f(d, v) variance: v := x 1 x 2 0 x 1 x 2 of Can evaluate f(d, v) with 1-dim integral!

51 Optimization Model min f(d, v) s.t. µ x 1 x 2 = d x 1 x 2 0 x 1 x 2 = v Formulation trick: linearize x k i xl j A 1 x 1 + A 2 x 2 apple b x 1 6= x 2 x 1,x 2 2 {0, 1} n Experimental Design with MIP 50 / 27

52 Technique 2: Piecewise Linear Functions D-efficiency = Non-convex function distance: d := µ x 1 x 2 variance: v := x 1 x 2 0 x 1 x 2 of f(d, v) Can evaluate f(d, v) with 1-dim integral! Piecewise Linear Interpolation MIP formulation

53 Computational Performance Advanced formulations provide an computational advantage Advantage is significantly more important for free solvers State of the art commercial solvers can be significantly better that free solvers Still, free is free! Time [s] Time [s] Simple CPLEX GLPK Advanced Simple Advanced

54 Summary and Main Messages Always choose Chewbacca! How to YOU use MIP / Optimization / OR / Analytics? Study for the 2 nd midterm! Use JuMP and Julia Opt. How about grad school down the river? Masters of Business Analytics / OR Ph.D. in Operations Research

55 How Hard is MIP?

56 How hard is MIP: Traveling Salesman Problem? Paradoxes, Contradictions, and the Limits of Science Many research results define boundaries of what cannot be known, predicted, or described. Classifying these limitations shows us the structure of science and reason. Noson S. Yanofsky S A computer would have to check all these possible routes to find the shortest one.

57 MIP = Avoid Enumeration Number of tours for 49 cities Fastest supercomputer Assuming one floating point operation per tour: > years times the age of the universe! How long does it take on an iphone? Less than a second! 4 iterations of cutting plane method! = 48!/ flops Dantzig, Fulkerson and Johnson 1954 did it by hand! For more info see tutorial in ConcordeTSP app Cutting planes are the key for effectively solving (even NPhard) MIP problems in practice.

58 50+ Years of MIP = Significant Solver Speedups Algorithmic Improvements (Machine Independent): CPLEX v1.2 (1991) v11 (2007): 29,000x speedup Gurobi v1 (2009) v6.5 (2015): 48.7x speedup Commercial, but free for academic use (Reasonably) effective free / open source solvers: GLPK, COIN-OR (CBC) and SCIP (only for non-commercial) Easy to use, fast and versatile modeling languages Julia based JuMP modelling language

59 Technique 1: Binary Quadratic x 1,x 2 2 {0, 1} n x 1 x 2 0 x 1 x 2 = v X l i,j apple x l i, X l i,j apple x l j, X l i,j x l i + x l j 1, X l i,j 0 apple X l i,j = x l i x l j (l 2 {1, 2}, i,j 2 {1,...,n}) : W i,j = x 1 i x 2 j : W i,j apple x 1 i, W i,j apple x 2 j, W i,j x 1 i + x 2 j 1, W i,j 0 nx i,j=1 X 1 i,j + X 2 i,j W i,j W j,i i,j = v

60 Technique 1: Binary Quadratic x 1,x 2 2 {0, 1} n x 1 6= x 2, kx 1 x 2 k Xi,j l = x l i x l j (l 2 {1, 2}, i,j 2 {1,...,n}) : Xi,j l apple x l i, Xi,j l apple x l j, Xi,j l x l i + x l j 1, Xi,j l 0 W i,j = x 1 i x 2 j : W i,j apple x 1 i, W i,j apple x 2 j, W i,j x 1 i + x 2 j 1, W i,j 0 nx i,j=1 X 1 i,j + X 2 i,j W i,j W j,i 1

61 f(d 3 ) f(d 2 ) f(d 5 ) f(d 1 ) f(d 4 ) Simple Formulation for Univariate Functions z = f(x) d 1 d 2 d 3 d 4 d 5 Size = O (# of segments) x z Non-Ideal: Fractional Extreme Points = X 5 1= X 5 j=1 j=1 dj f(d j ) j j, j 0

62 f(d 3 ) f(d 2 ) f(d 5 ) f(d 1 ) f(d 4 ) Advanced Formulation for Univariate Functions z = f(x) x z d 1 d 2 d 3 d 4 d 5 Size = O (log 2 # of segments) Ideal: Integral Extreme Points = X 5 1= X 5 j=1 j=1 y 2 {0, 1} 2 dj f(d j ) j j, j 0

Advanced Mixed Integer Programming Formulations for Non-Convex Optimization Problems in Statistical Learning

Advanced Mixed Integer Programming Formulations for Non-Convex Optimization Problems in Statistical Learning Advanced Mixed Integer Programming Formulations for Non-Convex Optimization Problems in Statistical Learning Juan Pablo Vielma Massachusetts Institute of Technology 2016 IISA International Conference on

More information

Advanced Mixed Integer Programming (MIP) Formulation Techniques

Advanced Mixed Integer Programming (MIP) Formulation Techniques Advanced Mixed Integer Programming (MIP) Formulation Techniques Juan Pablo Vielma Massachusetts Institute of Technology Center for Nonlinear Studies, Los Alamos National Laboratory. Los Alamos, New Mexico,

More information

Mixed Integer Programming Approaches for Experimental Design

Mixed Integer Programming Approaches for Experimental Design Mixed Integer Programming Approaches for Experimental Design Juan Pablo Vielma Massachusetts Institute of Technology Joint work with Denis Saure DRO brown bag lunch seminars, Columbia Business School New

More information

Ellipsoidal Methods for Adaptive Choice-based Conjoint Analysis (CBCA)

Ellipsoidal Methods for Adaptive Choice-based Conjoint Analysis (CBCA) Ellipsoidal Methods for Adaptive Choice-based Conjoint Analysis (CBCA) Juan Pablo Vielma Massachusetts Institute of Technology Joint work with Denis Saure Operations Management Seminar, Rotman School of

More information

Mixed Integer Programming (MIP) for Causal Inference and Beyond

Mixed Integer Programming (MIP) for Causal Inference and Beyond Mixed Integer Programming (MIP) for Causal Inference and Beyond Juan Pablo Vielma Massachusetts Institute of Technology Columbia Business School New York, NY, October, 2016. Traveling Salesman Problem

More information

The Traveling Salesman Problem: An Overview. David P. Williamson, Cornell University Ebay Research January 21, 2014

The Traveling Salesman Problem: An Overview. David P. Williamson, Cornell University Ebay Research January 21, 2014 The Traveling Salesman Problem: An Overview David P. Williamson, Cornell University Ebay Research January 21, 2014 (Cook 2012) A highly readable introduction Some terminology (imprecise) Problem Traditional

More information

Encodings in Mixed Integer Linear Programming

Encodings in Mixed Integer Linear Programming Encodings in Mixed Integer Linear Programming Juan Pablo Vielma Sloan School of usiness, Massachusetts Institute of Technology Universidad de hile, December, 23 Santiago, hile. Mixed Integer er inary Formulations

More information

18 hours nodes, first feasible 3.7% gap Time: 92 days!! LP relaxation at root node: Branch and bound

18 hours nodes, first feasible 3.7% gap Time: 92 days!! LP relaxation at root node: Branch and bound The MIP Landscape 1 Example 1: LP still can be HARD SGM: Schedule Generation Model Example 157323 1: LP rows, still can 182812 be HARD columns, 6348437 nzs LP relaxation at root node: 18 hours Branch and

More information

21. Set cover and TSP

21. Set cover and TSP CS/ECE/ISyE 524 Introduction to Optimization Spring 2017 18 21. Set cover and TSP ˆ Set covering ˆ Cutting problems and column generation ˆ Traveling salesman problem Laurent Lessard (www.laurentlessard.com)

More information

Recent Advances in Mixed Integer Programming Modeling and Computation

Recent Advances in Mixed Integer Programming Modeling and Computation Recent Advances in Mixed Integer Programming Modeling and Computation Juan Pablo Vielma Massachusetts Institute of Technology MIT Center for Transportation & Logistics. Cambridge, Massachusetts, June,

More information

What is an integer program? Modelling with Integer Variables. Mixed Integer Program. Let us start with a linear program: max cx s.t.

What is an integer program? Modelling with Integer Variables. Mixed Integer Program. Let us start with a linear program: max cx s.t. Modelling with Integer Variables jesla@mandtudk Department of Management Engineering Technical University of Denmark What is an integer program? Let us start with a linear program: st Ax b x 0 where A

More information

19. Fixed costs and variable bounds

19. Fixed costs and variable bounds CS/ECE/ISyE 524 Introduction to Optimization Spring 2017 18 19. Fixed costs and variable bounds ˆ Fixed cost example ˆ Logic and the Big M Method ˆ Example: facility location ˆ Variable lower bounds Laurent

More information

From structures to heuristics to global solvers

From structures to heuristics to global solvers From structures to heuristics to global solvers Timo Berthold Zuse Institute Berlin DFG Research Center MATHEON Mathematics for key technologies OR2013, 04/Sep/13, Rotterdam Outline From structures to

More information

1. A machine produces packets of sugar. The weights in grams of thirty packets chosen at random are shown below.

1. A machine produces packets of sugar. The weights in grams of thirty packets chosen at random are shown below. No Gdc 1. A machine produces packets of sugar. The weights in grams of thirty packets chosen at random are shown below. Weight (g) 9.6 9.7 9.8 9.9 30.0 30.1 30. 30.3 Frequency 3 4 5 7 5 3 1 Find unbiased

More information

Gestion de la production. Book: Linear Programming, Vasek Chvatal, McGill University, W.H. Freeman and Company, New York, USA

Gestion de la production. Book: Linear Programming, Vasek Chvatal, McGill University, W.H. Freeman and Company, New York, USA Gestion de la production Book: Linear Programming, Vasek Chvatal, McGill University, W.H. Freeman and Company, New York, USA 1 Contents 1 Integer Linear Programming 3 1.1 Definitions and notations......................................

More information

Machine Learning Practice Page 2 of 2 10/28/13

Machine Learning Practice Page 2 of 2 10/28/13 Machine Learning 10-701 Practice Page 2 of 2 10/28/13 1. True or False Please give an explanation for your answer, this is worth 1 pt/question. (a) (2 points) No classifier can do better than a naive Bayes

More information

CSE 546 Final Exam, Autumn 2013

CSE 546 Final Exam, Autumn 2013 CSE 546 Final Exam, Autumn 0. Personal info: Name: Student ID: E-mail address:. There should be 5 numbered pages in this exam (including this cover sheet).. You can use any material you brought: any book,

More information

Statistics 135 Fall 2008 Final Exam

Statistics 135 Fall 2008 Final Exam Name: SID: Statistics 135 Fall 2008 Final Exam Show your work. The number of points each question is worth is shown at the beginning of the question. There are 10 problems. 1. [2] The normal equations

More information

APPLIED MECHANISM DESIGN FOR SOCIAL GOOD

APPLIED MECHANISM DESIGN FOR SOCIAL GOOD APPLIED MECHANISM DESIGN FOR SOCIAL GOOD JOHN P DICKERSON Lecture #4 09/08/2016 CMSC828M Tuesdays & Thursdays 12:30pm 1:45pm PRESENTATION LIST IS ONLINE! SCRIBE LIST COMING SOON 2 THIS CLASS: (COMBINATORIAL)

More information

Resource Constrained Project Scheduling Linear and Integer Programming (1)

Resource Constrained Project Scheduling Linear and Integer Programming (1) DM204, 2010 SCHEDULING, TIMETABLING AND ROUTING Lecture 3 Resource Constrained Project Linear and Integer Programming (1) Marco Chiarandini Department of Mathematics & Computer Science University of Southern

More information

Computer Vision Group Prof. Daniel Cremers. 2. Regression (cont.)

Computer Vision Group Prof. Daniel Cremers. 2. Regression (cont.) Prof. Daniel Cremers 2. Regression (cont.) Regression with MLE (Rep.) Assume that y is affected by Gaussian noise : t = f(x, w)+ where Thus, we have p(t x, w, )=N (t; f(x, w), 2 ) 2 Maximum A-Posteriori

More information

Which boxplot represents the same information as the histogram? Test Scores Test Scores

Which boxplot represents the same information as the histogram? Test Scores Test Scores 01 013 SEMESTER EXAMS SEMESTER 1. Mrs. Johnson created this histogram of her 3 rd period students test scores. 8 Frequency of Test Scores 6 4 50 60 70 80 90 100 Test Scores Which boxplot represents the

More information

MVE165/MMG630, Applied Optimization Lecture 6 Integer linear programming: models and applications; complexity. Ann-Brith Strömberg

MVE165/MMG630, Applied Optimization Lecture 6 Integer linear programming: models and applications; complexity. Ann-Brith Strömberg MVE165/MMG630, Integer linear programming: models and applications; complexity Ann-Brith Strömberg 2011 04 01 Modelling with integer variables (Ch. 13.1) Variables Linear programming (LP) uses continuous

More information

Math for Machine Learning Open Doors to Data Science and Artificial Intelligence. Richard Han

Math for Machine Learning Open Doors to Data Science and Artificial Intelligence. Richard Han Math for Machine Learning Open Doors to Data Science and Artificial Intelligence Richard Han Copyright 05 Richard Han All rights reserved. CONTENTS PREFACE... - INTRODUCTION... LINEAR REGRESSION... 4 LINEAR

More information

Identify the scale of measurement most appropriate for each of the following variables. (Use A = nominal, B = ordinal, C = interval, D = ratio.

Identify the scale of measurement most appropriate for each of the following variables. (Use A = nominal, B = ordinal, C = interval, D = ratio. Answers to Items from Problem Set 1 Item 1 Identify the scale of measurement most appropriate for each of the following variables. (Use A = nominal, B = ordinal, C = interval, D = ratio.) a. response latency

More information

Logic, Optimization and Data Analytics

Logic, Optimization and Data Analytics Logic, Optimization and Data Analytics John Hooker Carnegie Mellon University United Technologies Research Center, Cork, Ireland August 2015 Thesis Logic and optimization have an underlying unity. Ideas

More information

CS 580: Algorithm Design and Analysis. Jeremiah Blocki Purdue University Spring 2018

CS 580: Algorithm Design and Analysis. Jeremiah Blocki Purdue University Spring 2018 CS 580: Algorithm Design and Analysis Jeremiah Blocki Purdue University Spring 2018 Recap Network Flow Problems Max-Flow Min Cut Theorem Ford Fulkerson Augmenting Paths Residual Flow Graph Integral Solutions

More information

# of units, X P(X) Show that the probability distribution for X is legitimate.

# of units, X P(X) Show that the probability distribution for X is legitimate. Probability Distributions A. El Dorado Community College considers a student to be full-time if he or she is taking between 12 and 18 units. The number of units X that a randomly selected El Dorado Community

More information

16.410/413 Principles of Autonomy and Decision Making

16.410/413 Principles of Autonomy and Decision Making 6.4/43 Principles of Autonomy and Decision Making Lecture 8: (Mixed-Integer) Linear Programming for Vehicle Routing and Motion Planning Emilio Frazzoli Aeronautics and Astronautics Massachusetts Institute

More information

Manage the Lesson: Divide students into teams of 3 or 4 students.

Manage the Lesson: Divide students into teams of 3 or 4 students. Unit Name: Unit : Quadratic Functions and Modeling Lesson Plan Number & Title: Lesson : The Changing Rate of Change Grade Level: High School Math II Lesson Overview: Students will analyze functions using

More information

Outline. Outline. Outline DMP204 SCHEDULING, TIMETABLING AND ROUTING. 1. Scheduling CPM/PERT Resource Constrained Project Scheduling Model

Outline. Outline. Outline DMP204 SCHEDULING, TIMETABLING AND ROUTING. 1. Scheduling CPM/PERT Resource Constrained Project Scheduling Model Outline DMP204 SCHEDULING, TIMETABLING AND ROUTING Lecture 3 and Mixed Integer Programg Marco Chiarandini 1. Resource Constrained Project Model 2. Mathematical Programg 2 Outline Outline 1. Resource Constrained

More information

Eco 391, J. Sandford, spring 2013 April 5, Midterm 3 4/5/2013

Eco 391, J. Sandford, spring 2013 April 5, Midterm 3 4/5/2013 Midterm 3 4/5/2013 Instructions: You may use a calculator, and one sheet of notes. You will never be penalized for showing work, but if what is asked for can be computed directly, points awarded will depend

More information

Integer programming (part 2) Lecturer: Javier Peña Convex Optimization /36-725

Integer programming (part 2) Lecturer: Javier Peña Convex Optimization /36-725 Integer programming (part 2) Lecturer: Javier Peña Convex Optimization 10-725/36-725 Last time: integer programming Consider the problem min x subject to f(x) x C x j Z, j J where f : R n R, C R n are

More information

Edexcel past paper questions

Edexcel past paper questions Edexcel past paper questions Statistics 1 Discrete Random Variables Past examination questions Discrete Random variables Page 1 Discrete random variables Discrete Random variables Page 2 Discrete Random

More information

10-701/ Machine Learning - Midterm Exam, Fall 2010

10-701/ Machine Learning - Midterm Exam, Fall 2010 10-701/15-781 Machine Learning - Midterm Exam, Fall 2010 Aarti Singh Carnegie Mellon University 1. Personal info: Name: Andrew account: E-mail address: 2. There should be 15 numbered pages in this exam

More information

ECONOMETRIC MODEL WITH QUALITATIVE VARIABLES

ECONOMETRIC MODEL WITH QUALITATIVE VARIABLES ECONOMETRIC MODEL WITH QUALITATIVE VARIABLES How to quantify qualitative variables to quantitative variables? Why do we need to do this? Econometric model needs quantitative variables to estimate its parameters

More information

Evolutionary Computation: introduction

Evolutionary Computation: introduction Evolutionary Computation: introduction Dirk Thierens Universiteit Utrecht The Netherlands Dirk Thierens (Universiteit Utrecht) EC Introduction 1 / 42 What? Evolutionary Computation Evolutionary Computation

More information

10701/15781 Machine Learning, Spring 2007: Homework 2

10701/15781 Machine Learning, Spring 2007: Homework 2 070/578 Machine Learning, Spring 2007: Homework 2 Due: Wednesday, February 2, beginning of the class Instructions There are 4 questions on this assignment The second question involves coding Do not attach

More information

Midterm. Introduction to Machine Learning. CS 189 Spring Please do not open the exam before you are instructed to do so.

Midterm. Introduction to Machine Learning. CS 189 Spring Please do not open the exam before you are instructed to do so. CS 89 Spring 07 Introduction to Machine Learning Midterm Please do not open the exam before you are instructed to do so. The exam is closed book, closed notes except your one-page cheat sheet. Electronic

More information

Computational complexity theory

Computational complexity theory Computational complexity theory Introduction to computational complexity theory Complexity (computability) theory deals with two aspects: Algorithm s complexity. Problem s complexity. References S. Cook,

More information

Machine Learning, Fall 2009: Midterm

Machine Learning, Fall 2009: Midterm 10-601 Machine Learning, Fall 009: Midterm Monday, November nd hours 1. Personal info: Name: Andrew account: E-mail address:. You are permitted two pages of notes and a calculator. Please turn off all

More information

Disconnecting Networks via Node Deletions

Disconnecting Networks via Node Deletions 1 / 27 Disconnecting Networks via Node Deletions Exact Interdiction Models and Algorithms Siqian Shen 1 J. Cole Smith 2 R. Goli 2 1 IOE, University of Michigan 2 ISE, University of Florida 2012 INFORMS

More information

IE 4521 Midterm #1. Prof. John Gunnar Carlsson. March 2, 2010

IE 4521 Midterm #1. Prof. John Gunnar Carlsson. March 2, 2010 IE 4521 Midterm #1 Prof. John Gunnar Carlsson March 2, 2010 Before you begin: This exam has 9 pages (including the normal distribution table) and a total of 8 problems. Make sure that all pages are present.

More information

0615AI Common Core State Standards

0615AI Common Core State Standards 0615AI Common Core State Standards 1 The cost of airing a commercial on television is modeled by the function C(n) = 110n + 900, where n is the number of times the commercial is aired. Based on this model,

More information

RP7-31 Using Proportions to Solve Percentage Problems I

RP7-31 Using Proportions to Solve Percentage Problems I RP-1 Using Proportions to Solve Percentage Problems I These are equivalent statements: 6 of the circles are shaded. of the circles are shaded. 6 is of. 6 : : part whole 1. Write four equivalent statements

More information

Classification: Linear Discriminant Analysis

Classification: Linear Discriminant Analysis Classification: Linear Discriminant Analysis Discriminant analysis uses sample information about individuals that are known to belong to one of several populations for the purposes of classification. Based

More information

Branch-and-Bound. Leo Liberti. LIX, École Polytechnique, France. INF , Lecture p. 1

Branch-and-Bound. Leo Liberti. LIX, École Polytechnique, France. INF , Lecture p. 1 Branch-and-Bound Leo Liberti LIX, École Polytechnique, France INF431 2011, Lecture p. 1 Reminders INF431 2011, Lecture p. 2 Problems Decision problem: a question admitting a YES/NO answer Example HAMILTONIAN

More information

Travelling Salesman Problem

Travelling Salesman Problem Travelling Salesman Problem Fabio Furini November 10th, 2014 Travelling Salesman Problem 1 Outline 1 Traveling Salesman Problem Separation Travelling Salesman Problem 2 (Asymmetric) Traveling Salesman

More information

Introduction to Statistics

Introduction to Statistics Why Statistics? Introduction to Statistics To develop an appreciation for variability and how it effects products and processes. Study methods that can be used to help solve problems, build knowledge and

More information

Data Structures for Efficient Inference and Optimization

Data Structures for Efficient Inference and Optimization Data Structures for Efficient Inference and Optimization in Expressive Continuous Domains Scott Sanner Ehsan Abbasnejad Zahra Zamani Karina Valdivia Delgado Leliane Nunes de Barros Cheng Fang Discrete

More information

Practice Questions for Exam 1

Practice Questions for Exam 1 Practice Questions for Exam 1 1. A used car lot evaluates their cars on a number of features as they arrive in the lot in order to determine their worth. Among the features looked at are miles per gallon

More information

Which boxplot represents the same information as the histogram? Test Scores Test Scores

Which boxplot represents the same information as the histogram? Test Scores Test Scores Frequency of Test Scores ALGEBRA I 01 013 SEMESTER EXAMS SEMESTER 1. Mrs. Johnson created this histogram of her 3 rd period students test scores. 8 6 4 50 60 70 80 90 100 Test Scores Which boplot represents

More information

0815AI Common Core State Standards

0815AI Common Core State Standards 0815AI Common Core State Standards 1 Given the graph of the line represented by the equation f(x) = 2x + b, if b is increased by 4 units, the graph of the new line would be shifted 4 units 1) right 2)

More information

Lecture Note 1: Introduction to optimization. Xiaoqun Zhang Shanghai Jiao Tong University

Lecture Note 1: Introduction to optimization. Xiaoqun Zhang Shanghai Jiao Tong University Lecture Note 1: Introduction to optimization Xiaoqun Zhang Shanghai Jiao Tong University Last updated: September 23, 2017 1.1 Introduction 1. Optimization is an important tool in daily life, business and

More information

College Algebra. Word Problems

College Algebra. Word Problems College Algebra Word Problems Example 2 (Section P6) The table shows the numbers N (in millions) of subscribers to a cellular telecommunication service in the United States from 2001 through 2010, where

More information

Integer Linear Programming (ILP)

Integer Linear Programming (ILP) Integer Linear Programming (ILP) Zdeněk Hanzálek, Přemysl Šůcha hanzalek@fel.cvut.cz CTU in Prague March 8, 2017 Z. Hanzálek (CTU) Integer Linear Programming (ILP) March 8, 2017 1 / 43 Table of contents

More information

Using Matrices And Hungarian Method To Solve The Traveling Salesman Problem

Using Matrices And Hungarian Method To Solve The Traveling Salesman Problem Salem State University Digital Commons at Salem State University Honors Theses Student Scholarship 2018-01-01 Using Matrices And Hungarian Method To Solve The Traveling Salesman Problem Briana Couto Salem

More information

Symbolic Variable Elimination in Discrete and Continuous Graphical Models. Scott Sanner Ehsan Abbasnejad

Symbolic Variable Elimination in Discrete and Continuous Graphical Models. Scott Sanner Ehsan Abbasnejad Symbolic Variable Elimination in Discrete and Continuous Graphical Models Scott Sanner Ehsan Abbasnejad Inference for Dynamic Tracking No one previously did this inference exactly in closed-form! Exact

More information

The Bayes classifier

The Bayes classifier The Bayes classifier Consider where is a random vector in is a random variable (depending on ) Let be a classifier with probability of error/risk given by The Bayes classifier (denoted ) is the optimal

More information

Computational complexity theory

Computational complexity theory Computational complexity theory Introduction to computational complexity theory Complexity (computability) theory deals with two aspects: Algorithm s complexity. Problem s complexity. References S. Cook,

More information

Modeling with Integer Programming

Modeling with Integer Programming Modeling with Integer Programg Laura Galli December 18, 2014 We can use 0-1 (binary) variables for a variety of purposes, such as: Modeling yes/no decisions Enforcing disjunctions Enforcing logical conditions

More information

Notes on Dantzig-Wolfe decomposition and column generation

Notes on Dantzig-Wolfe decomposition and column generation Notes on Dantzig-Wolfe decomposition and column generation Mette Gamst November 11, 2010 1 Introduction This note introduces an exact solution method for mathematical programming problems. The method is

More information

fsat We next show that if sat P, then fsat has a polynomial-time algorithm. c 2010 Prof. Yuh-Dauh Lyuu, National Taiwan University Page 425

fsat We next show that if sat P, then fsat has a polynomial-time algorithm. c 2010 Prof. Yuh-Dauh Lyuu, National Taiwan University Page 425 fsat fsat is this function problem: Let φ(x 1, x 2,..., x n ) be a boolean expression. If φ is satisfiable, then return a satisfying truth assignment. Otherwise, return no. We next show that if sat P,

More information

Section #2: Linear and Integer Programming

Section #2: Linear and Integer Programming Section #2: Linear and Integer Programming Prof. Dr. Sven Seuken 8.3.2012 (with most slides borrowed from David Parkes) Housekeeping Game Theory homework submitted? HW-00 and HW-01 returned Feedback on

More information

Hidden Markov Models (HMM) and Support Vector Machine (SVM)

Hidden Markov Models (HMM) and Support Vector Machine (SVM) Hidden Markov Models (HMM) and Support Vector Machine (SVM) Professor Joongheon Kim School of Computer Science and Engineering, Chung-Ang University, Seoul, Republic of Korea 1 Hidden Markov Models (HMM)

More information

Midterm. Introduction to Machine Learning. CS 189 Spring You have 1 hour 20 minutes for the exam.

Midterm. Introduction to Machine Learning. CS 189 Spring You have 1 hour 20 minutes for the exam. CS 189 Spring 2013 Introduction to Machine Learning Midterm You have 1 hour 20 minutes for the exam. The exam is closed book, closed notes except your one-page crib sheet. Please use non-programmable calculators

More information

Mark your answers ON THE EXAM ITSELF. If you are not sure of your answer you may wish to provide a brief explanation.

Mark your answers ON THE EXAM ITSELF. If you are not sure of your answer you may wish to provide a brief explanation. CS 189 Spring 2015 Introduction to Machine Learning Midterm You have 80 minutes for the exam. The exam is closed book, closed notes except your one-page crib sheet. No calculators or electronic items.

More information

Independence 1 2 P(H) = 1 4. On the other hand = P(F ) =

Independence 1 2 P(H) = 1 4. On the other hand = P(F ) = Independence Previously we considered the following experiment: A card is drawn at random from a standard deck of cards. Let H be the event that a heart is drawn, let R be the event that a red card is

More information

Topics in Complexity

Topics in Complexity Topics in Complexity Please evaluate this course on Axess! Your feedback really does make a difference. Applied Complexity Theory Complexity theory has enormous practical relevance across various domains

More information

Reformulation and Sampling to Solve a Stochastic Network Interdiction Problem

Reformulation and Sampling to Solve a Stochastic Network Interdiction Problem Network Interdiction Stochastic Network Interdiction and to Solve a Stochastic Network Interdiction Problem Udom Janjarassuk Jeff Linderoth ISE Department COR@L Lab Lehigh University jtl3@lehigh.edu informs

More information

Announcements - Homework

Announcements - Homework Announcements - Homework Homework 1 is graded, please collect at end of lecture Homework 2 due today Homework 3 out soon (watch email) Ques 1 midterm review HW1 score distribution 40 HW1 total score 35

More information

An example of LP problem: Political Elections

An example of LP problem: Political Elections Linear Programming An example of LP problem: Political Elections Suppose that you are a politician trying to win an election. Your district has three different types of areas: urban, suburban, and rural.

More information

Overview of course. Introduction to Optimization, DIKU Monday 12 November David Pisinger

Overview of course. Introduction to Optimization, DIKU Monday 12 November David Pisinger Introduction to Optimization, DIKU 007-08 Monday November David Pisinger Lecture What is OR, linear models, standard form, slack form, simplex repetition, graphical interpretation, extreme points, basic

More information

Announcements. Midterm Review Session. Extended TA session on Friday 9am to 12 noon.

Announcements. Midterm Review Session. Extended TA session on Friday 9am to 12 noon. Announcements Extended TA session on Friday 9am to 12 noon. Midterm Review Session The midterm prep set indicates the difficulty level of the midterm and the type of questions that can be asked. Please

More information

Introduction to Machine Learning Midterm, Tues April 8

Introduction to Machine Learning Midterm, Tues April 8 Introduction to Machine Learning 10-701 Midterm, Tues April 8 [1 point] Name: Andrew ID: Instructions: You are allowed a (two-sided) sheet of notes. Exam ends at 2:45pm Take a deep breath and don t spend

More information

Harvard CS 121 and CSCI E-121 Lecture 22: The P vs. NP Question and NP-completeness

Harvard CS 121 and CSCI E-121 Lecture 22: The P vs. NP Question and NP-completeness Harvard CS 121 and CSCI E-121 Lecture 22: The P vs. NP Question and NP-completeness Harry Lewis November 19, 2013 Reading: Sipser 7.4, 7.5. For culture : Computers and Intractability: A Guide to the Theory

More information

Marketing Research Session 10 Hypothesis Testing with Simple Random samples (Chapter 12)

Marketing Research Session 10 Hypothesis Testing with Simple Random samples (Chapter 12) Marketing Research Session 10 Hypothesis Testing with Simple Random samples (Chapter 12) Remember: Z.05 = 1.645, Z.01 = 2.33 We will only cover one-sided hypothesis testing (cases 12.3, 12.4.2, 12.5.2,

More information

Applied Lagrange Duality for Constrained Optimization

Applied Lagrange Duality for Constrained Optimization Applied Lagrange Duality for Constrained Optimization February 12, 2002 Overview The Practical Importance of Duality ffl Review of Convexity ffl A Separating Hyperplane Theorem ffl Definition of the Dual

More information

MTTTS16 Learning from Multiple Sources

MTTTS16 Learning from Multiple Sources MTTTS16 Learning from Multiple Sources 5 ECTS credits Autumn 2018, University of Tampere Lecturer: Jaakko Peltonen Lecture 6: Multitask learning with kernel methods and nonparametric models On this lecture:

More information

Large-scale optimization and decomposition methods: outline. Column Generation and Cutting Plane methods: a unified view

Large-scale optimization and decomposition methods: outline. Column Generation and Cutting Plane methods: a unified view Large-scale optimization and decomposition methods: outline I Solution approaches for large-scaled problems: I Delayed column generation I Cutting plane methods (delayed constraint generation) 7 I Problems

More information

NP-Completeness. Andreas Klappenecker. [based on slides by Prof. Welch]

NP-Completeness. Andreas Klappenecker. [based on slides by Prof. Welch] NP-Completeness Andreas Klappenecker [based on slides by Prof. Welch] 1 Prelude: Informal Discussion (Incidentally, we will never get very formal in this course) 2 Polynomial Time Algorithms Most of the

More information

6. 5x Division Property. CHAPTER 2 Linear Models, Equations, and Inequalities. Toolbox Exercises. 1. 3x = 6 Division Property

6. 5x Division Property. CHAPTER 2 Linear Models, Equations, and Inequalities. Toolbox Exercises. 1. 3x = 6 Division Property CHAPTER Linear Models, Equations, and Inequalities CHAPTER Linear Models, Equations, and Inequalities Toolbox Exercises. x = 6 Division Property x 6 = x =. x 7= Addition Property x 7= x 7+ 7= + 7 x = 8.

More information

Quantum computing with superconducting qubits Towards useful applications

Quantum computing with superconducting qubits Towards useful applications Quantum computing with superconducting qubits Towards useful applications Stefan Filipp IBM Research Zurich Switzerland Forum Teratec 2018 June 20, 2018 Palaiseau, France Why Quantum Computing? Why now?

More information

Unit 1A: Computational Complexity

Unit 1A: Computational Complexity Unit 1A: Computational Complexity Course contents: Computational complexity NP-completeness Algorithmic Paradigms Readings Chapters 3, 4, and 5 Unit 1A 1 O: Upper Bounding Function Def: f(n)= O(g(n)) if

More information

Chapter # classifications of unlikely, likely, or very likely to describe possible buying of a product?

Chapter # classifications of unlikely, likely, or very likely to describe possible buying of a product? A. Attribute data B. Numerical data C. Quantitative data D. Sample data E. Qualitative data F. Statistic G. Parameter Chapter #1 Match the following descriptions with the best term or classification given

More information

Algebra 1 Honors EOC Review #2 Calculator Portion

Algebra 1 Honors EOC Review #2 Calculator Portion Algebra 1 Honors EOC Review # Calculator Portion 1. Ms. Robinson wrote the six numbers listed below. 18 3 7 6 4 7 5 8 4 She asked the students in her math class to identify two irrational numbers from

More information

Correlation Coefficient: the quantity, measures the strength and direction of a linear relationship between 2 variables.

Correlation Coefficient: the quantity, measures the strength and direction of a linear relationship between 2 variables. AFM Unit 9 Regression Day 1 notes A mathematical model is an equation that best describes a particular set of paired data. These mathematical models are referred to as models and are used to one variable

More information

1. Consider the following graphs and choose the correct name of each function.

1. Consider the following graphs and choose the correct name of each function. Name Date Summary of Functions Comparing Linear, Quadratic, and Exponential Functions - Part 1 Independent Practice 1. Consider the following graphs and choose the correct name of each function. Part A:

More information

CS 361 Sample Midterm 2

CS 361 Sample Midterm 2 CS 361 Sample Midterm 2 NAME: NETID: CIRCLE YOUR DISCUSSION: Thu 2-3 Thu 4-5 Fri 10-11 Fri 11-12 Be sure that your exam booklet has 6 pages including this cover page Make sure to write your name exactly

More information

Indicator Constraints in Mixed-Integer Programming

Indicator Constraints in Mixed-Integer Programming Indicator Constraints in Mixed-Integer Programming Andrea Lodi University of Bologna, Italy - andrea.lodi@unibo.it Amaya Nogales-Gómez, Universidad de Sevilla, Spain Pietro Belotti, FICO, UK Matteo Fischetti,

More information

Machine Learning for Signal Processing Bayes Classification and Regression

Machine Learning for Signal Processing Bayes Classification and Regression Machine Learning for Signal Processing Bayes Classification and Regression Instructor: Bhiksha Raj 11755/18797 1 Recap: KNN A very effective and simple way of performing classification Simple model: For

More information

Formation of Student Groups with the Help of Optimization

Formation of Student Groups with the Help of Optimization Department of Mathematics Formation of Student Groups with the Help of Optimization Kaj Holmberg LiTH-MAT-R--2014/14--SE Department of Mathematics Linköping University S-581 83 Linköping, Sweden. Formation

More information

Name: Teacher: DO NOT OPEN THE EXAMINATION PAPER UNTIL YOU ARE TOLD BY THE SUPERVISOR TO BEGIN. Mathematics 3201 PRE-PUBLIC EXAMINATION JUNE 2014

Name: Teacher: DO NOT OPEN THE EXAMINATION PAPER UNTIL YOU ARE TOLD BY THE SUPERVISOR TO BEGIN. Mathematics 3201 PRE-PUBLIC EXAMINATION JUNE 2014 Name: Teacher: DO NOT OPEN THE EXAMINATION PAPER UNTIL YOU ARE TOLD BY THE SUPERVISOR TO BEGIN Mathematics 301 PRE-PUBLIC EXAMINATION JUNE 014 Value: 100 Marks Duration: 3 Hours General Instructions This

More information

Tutorial: PART 2. Optimization for Machine Learning. Elad Hazan Princeton University. + help from Sanjeev Arora & Yoram Singer

Tutorial: PART 2. Optimization for Machine Learning. Elad Hazan Princeton University. + help from Sanjeev Arora & Yoram Singer Tutorial: PART 2 Optimization for Machine Learning Elad Hazan Princeton University + help from Sanjeev Arora & Yoram Singer Agenda 1. Learning as mathematical optimization Stochastic optimization, ERM,

More information

Integer Programming Introduction. C, T int (3) 6C +7T 21 (1) Maximize : 12C +13T Subject to

Integer Programming Introduction. C, T int (3) 6C +7T 21 (1) Maximize : 12C +13T Subject to Integer Programming Introduction Divisibility assumption of LP allows for fractions: Produce 7.8 units of a product, buy 12500.33 liters of oil, hire 12.123 people for full time. Clearly some activities

More information

56:171 Operations Research Final Exam December 12, 1994

56:171 Operations Research Final Exam December 12, 1994 56:171 Operations Research Final Exam December 12, 1994 Write your name on the first page, and initial the other pages. The response "NOTA " = "None of the above" Answer both parts A & B, and five sections

More information

Inderjit Dhillon The University of Texas at Austin

Inderjit Dhillon The University of Texas at Austin Inderjit Dhillon The University of Texas at Austin ( Universidad Carlos III de Madrid; 15 th June, 2012) (Based on joint work with J. Brickell, S. Sra, J. Tropp) Introduction 2 / 29 Notion of distance

More information

Mrs. Poyner/Mr. Page Chapter 3 page 1

Mrs. Poyner/Mr. Page Chapter 3 page 1 Name: Date: Period: Chapter 2: Take Home TEST Bivariate Data Part 1: Multiple Choice. (2.5 points each) Hand write the letter corresponding to the best answer in space provided on page 6. 1. In a statistics

More information

1 Which expression represents 5 less than twice x? 1) 2) 3) 4)

1 Which expression represents 5 less than twice x? 1) 2) 3) 4) 1 Which expression represents 5 less than twice x? 2 Gabriella has 20 quarters, 15 dimes, 7 nickels, and 8 pennies in a jar. After taking 6 quarters out of the jar, what will be the probability of Gabriella

More information