Regression. Jordan Boyd-Graber. University of Colorado Boulder LECTURE 11. Jordan Boyd-Graber Boulder Regression 1 of 19

Size: px

Start display at page:

Download "Regression. Jordan Boyd-Graber. University of Colorado Boulder LECTURE 11. Jordan Boyd-Graber Boulder Regression 1 of 19"

Candice Patrick
5 years ago
Views:

1 Regression Jordan Boyd-Graber University of Colorado Boulder LECTURE 11 Jordan Boyd-Graber Boulder Regression 1 of 19

2 Content Questions Jordan Boyd-Graber Boulder Regression 2 of 19

3 Content Questions Jordan Boyd-Graber Boulder Regression 2 of 19

4 Content Questions Jordan Boyd-Graber Boulder Regression 2 of 19

5 Content Questions Jordan Boyd-Graber Boulder Regression 2 of 19

6 Content Questions Jordan Boyd-Graber Boulder Regression 2 of 19

7 Content Questions Jordan Boyd-Graber Boulder Regression 2 of 19

8 Content Questions Jordan Boyd-Graber Boulder Regression 2 of 19

9 Content Questions Jordan Boyd-Graber Boulder Regression 2 of 19

10 Content Questions Jordan Boyd-Graber Boulder Regression 2 of 19

11 Content Questions. Jordan Boyd-Graber Boulder Regression 2 of 19

12 Administrivia Learnability HW due Friday SVM HW next week I m out of town next week Jordan Boyd-Graber Boulder Regression 3 of 19

13 Project Default project or choose your own I ll meet with your group early in the week of Oct 12 Project proposal First deliverable due Nov 6: data / baseline Jordan Boyd-Graber Boulder Regression 4 of 19

14 Basics Plan Basics Regularization Sklearn Jordan Boyd-Graber Boulder Regression 5 of 19

15 Basics Predictions dimension weight b 1 w w σ x 1 = {0.0, 0.0}; y 1 = 2. x 2 = {1.0, 1.0}; y 2 = 3. x 3 = {.5, 2}; y 3 = Jordan Boyd-Graber Boulder Regression 6 of 19

16 Basics Predictions dimension weight b 1 w w σ x 1 = {0.0, 0.0}; y 1 = x 2 = {1.0, 1.0}; y 2 = 3. x 3 = {.5, 2}; y 3 = Jordan Boyd-Graber Boulder Regression 6 of 19

17 Basics Predictions dimension weight b 1 w w σ x 1 = {0.0, 0.0}; y 1 = x 2 = {1.0, 1.0}; y 2 = x 3 = {.5, 2}; y 3 = Jordan Boyd-Graber Boulder Regression 6 of 19

18 Basics Predictions dimension weight b 1 w w σ x 1 = {0.0, 0.0}; y 1 = x 2 = {1.0, 1.0}; y 2 = x 3 = {.5, 2}; y 3 =0.0 Jordan Boyd-Graber Boulder Regression 6 of 19

19 Basics Probabilities dimension weight w 0 1 w w σ 1.0 p(y x) = y N b + exp p(y x) = { (y ŷ)2 2 2π p w j x j, σ 2 j=1 } 1. p(y 1 = 1 x 1 = {0.0, 0.0}) = 2. p(y 2 = 3 x 2 = {1.0, 1.0}) = 3. p(y 3 = 1 x 3 = {.5, 2}) = Jordan Boyd-Graber Boulder Regression 7 of 19

20 Basics Probabilities dimension weight w 0 1 w w σ 1.0 p(y x) = y N b + exp p(y x) = { (y ŷ)2 2 2π p w j x j, σ 2 j=1 } 1. p(y 1 = 1 x 1 = {0.0, 0.0}) = p(y 2 = 3 x 2 = {1.0, 1.0}) = 3. p(y 3 = 1 x 3 = {.5, 2}) = Jordan Boyd-Graber Boulder Regression 7 of 19

21 Basics Probabilities dimension weight w 0 1 w w σ 1.0 p(y x) = y N b + exp p(y x) = { (y ŷ)2 2 2π p w j x j, σ 2 j=1 } 1. p(y 1 = 1 x 1 = {0.0, 0.0}) = p(y 2 = 3 x 2 = {1.0, 1.0}) = p(y 3 = 1 x 3 = {.5, 2}) = Jordan Boyd-Graber Boulder Regression 7 of 19

22 Basics Probabilities dimension weight w 0 1 w w σ 1.0 p(y x) = y N b + exp p(y x) = { (y ŷ)2 2 2π p w j x j, σ 2 j=1 } 1. p(y 1 = 1 x 1 = {0.0, 0.0}) = p(y 2 = 3 x 2 = {1.0, 1.0}) = p(y 3 = 1 x 3 = {.5, 2}) = Jordan Boyd-Graber Boulder Regression 7 of 19

23 Regularization Plan Basics Regularization Sklearn Jordan Boyd-Graber Boulder Regression 8 of 19

24 Regularization Consider these points and data w=.75,b=.25 w=1,b=0 Jordan Boyd-Graber Boulder Regression 9 of 19

25 Regularization Consider these points and data w=.75,b=.25 w=1,b=0 Which is the better OLS solution? Jordan Boyd-Graber Boulder Regression 9 of 19

26 Regularization Consider these points and data w=.75,b=.25 w=1,b=0 Blue! It has lower RSS. Jordan Boyd-Graber Boulder Regression 9 of 19

27 Regularization Consider these points and data w=.75,b=.25 w=1,b=0 What is the RSS of the better solution? Jordan Boyd-Graber Boulder Regression 9 of 19

28 Regularization Consider these points and data w=.75,b=.25 w=1,b=0 1 2 i r i 2 = 1 ( 2 (1 1) 2 + (2.5 2) 2 + (2.5 3) 2) = 1 4 Jordan Boyd-Graber Boulder Regression 9 of 19

29 Regularization Consider these points and data w=.75,b=.25 w=1,b=0 What is the RSS of the red line? Jordan Boyd-Graber Boulder Regression 9 of 19

30 Regularization Consider these points and data w=.75,b=.25 w=1,b=0 1 2 i r i 2 = 1 ( 2 (1 1) 2 + ( ) 2 + ( ) 2) = 3 8 Jordan Boyd-Graber Boulder Regression 9 of 19

31 Regularization Consider these points and data w=.75,b=.25 w=1,b=0 For what λ does the blue line have a better regularized solution with L 2 and L 1? Jordan Boyd-Graber Boulder Regression 9 of 19

32 Regularization When Regularization Wins L 2 L 1 Jordan Boyd-Graber Boulder Regression 10 of 19

33 Regularization When Regularization Wins L 2 RSS(x, y, w) + λ d w 2 d > RSS(x, y, w) + λ d w 2 d L 1 Jordan Boyd-Graber Boulder Regression 10 of 19

34 Regularization When Regularization Wins L 2 RSS(x, y, w) + λ d w 2 d > RSS(x, y, w) + λ d w 2 d λ1 > λ 9 16 L 1 Jordan Boyd-Graber Boulder Regression 10 of 19

35 Regularization When Regularization Wins L λ1 > λ λ > 1 8 L 1 Jordan Boyd-Graber Boulder Regression 10 of 19

36 Regularization When Regularization Wins L λ > 1 8 λ > 2 7 L 1 Jordan Boyd-Graber Boulder Regression 10 of 19

37 Regularization When Regularization Wins L 2 λ > 2 7 L 1 RSS(x, y, w) + λ d w d > RSS(x, y, w) + λ d w d Jordan Boyd-Graber Boulder Regression 10 of 19

38 Regularization When Regularization Wins L 2 λ > 2 7 L 1 RSS(x, y, w) + λ d w d > RSS(x, y, w) + λ d w d λ1 > λ3 4 Jordan Boyd-Graber Boulder Regression 10 of 19

39 Regularization When Regularization Wins L 2 λ > 2 7 L λ1 > λ3 4 Jordan Boyd-Graber Boulder Regression 10 of 19

40 Regularization When Regularization Wins L 2 λ > 2 7 L λ > 1 8 Jordan Boyd-Graber Boulder Regression 10 of 19

41 Regularization When Regularization Wins L 2 λ > 2 7 L λ > 1 8 λ > 1 2 Jordan Boyd-Graber Boulder Regression 10 of 19

42 Regularization When Regularization Wins L 2 λ > 2 7 L 1 λ > 1 2 Bigger λ: preference for lower weights w Jordan Boyd-Graber Boulder Regression 10 of 19

43 Sklearn Plan Basics Regularization Sklearn Jordan Boyd-Graber Boulder Regression 11 of 19

44 Sklearn MPG Dataset Predict mpg from features of a car 1. Number of cylinders 2. Displacement 3. Horsepower 4. Weight 5. Acceleration 6. Year 7. Country (ignore this) Jordan Boyd-Graber Boulder Regression 12 of 19

45 Sklearn Simple Regression If w = 0, what s the intercept? Jordan Boyd-Graber Boulder Regression 13 of 19

46 Sklearn Simple Regression If w = 0, what s the intercept? 23.4 Jordan Boyd-Graber Boulder Regression 13 of 19

47 Sklearn Simple Linear Regression What are the coefficients for OLS? Jordan Boyd-Graber Boulder Regression 14 of 19

48 Sklearn Simple Linear Regression What are the coefficients for OLS? c y l d i s hp wgt a c l y r Jordan Boyd-Graber Boulder Regression 14 of 19

49 Sklearn Simple Linear Regression What are the coefficients for OLS? c y l d i s hp wgt a c l y r Intercept: Jordan Boyd-Graber Boulder Regression 14 of 19

50 Sklearn Simple Linear Regression from s k l e a r n import l i n e a r m o d e l l i n e a r m o d e l. L i n e a r R e g r e s s i o n ( ) f i t = model. f i t ( x, y ) Jordan Boyd-Graber Boulder Regression 15 of 19

51 Sklearn Lasso As you increase the weight of alpha, what feature dominates? What happens to the other features? Jordan Boyd-Graber Boulder Regression 16 of 19

52 Sklearn Weight is Everything Jordan Boyd-Graber Boulder mpg = Weight Regression 17 of 19

53 Sklearn How is ridge different? Jordan Boyd-Graber Boulder Regression 18 of 19

54 Sklearn Regression isn t special Feature engineering Regularization Overfitting Development / Test Data Jordan Boyd-Graber Boulder Regression 19 of 19

Solving Regression. Jordan Boyd-Graber. University of Colorado Boulder LECTURE 12. Slides adapted from Matt Nedrich and Trevor Hastie

Solving Regression. Jordan Boyd-Graber. University of Colorado Boulder LECTURE 12. Slides adapted from Matt Nedrich and Trevor Hastie Solving Regression Jordan Boyd-Graber University of Colorado Boulder LECTURE 12 Slides adapted from Matt Nedrich and Trevor Hastie Jordan Boyd-Graber Boulder Solving Regression 1 of 17 Roadmap We talked