Tempered Stable and Pareto Distributions: Predictions Under Uncertainty

Size: px
Start display at page:

Download "Tempered Stable and Pareto Distributions: Predictions Under Uncertainty"

Transcription

1 Tempered Stable and Pareto Distributions: Predictions Under Uncertainty Robert L Wolpert & Kerrie L Mengersen Duke University & Queensland University of Technology Session 2A: Foundations I Thu 1:50 2:15pm Frontiers of Statistical Decision Making & Bayesian Analysis

2 Motivation What s at stake Modeling Power Laws Tempering Uncertainty Parameters Predictions Conclusions Discussion Future Work

3 Caribbean Island of Montserrat

4 Soufrière Hills Volcano View from MVO at sunset, 18 April 2007

5 Where is Montserrat?

6 What s been happening since 1995? Annual Rate e+04 5e+05 5e+06 5e+07 5e+08 PF Volume (m 3 ) Seems kind of linear, on log-log scale... remember that.

7 Pyroclastic Flow Volumes PF s exceeding 10 4 m 3 : Daily, km runout PF s exceeding 10 5 m 3 : Fortnightly, km runout PF s exceeding 10 6 m 3 : Quarterly, km runout PF s exceeding 10 7 m 3 : Yearly, km runout PF s exceeding 10 8 m 3 : Just one, unknown runout PF s exceeding 10 9 m 3 : None YET, but...

8 Pyroclastic Flow Volumes PF s exceeding 10 4 m 3 : Daily, km runout PF s exceeding 10 5 m 3 : Fortnightly, km runout PF s exceeding 10 6 m 3 : Quarterly, km runout PF s exceeding 10 7 m 3 : Yearly, km runout PF s exceeding 10 8 m 3 : Just one, unknown runout PF s exceeding 10 9 m 3 : None YET, but...

9 Pyroclastic Flow Volumes PF s exceeding 10 4 m 3 : Daily, km runout PF s exceeding 10 5 m 3 : Fortnightly, km runout PF s exceeding 10 6 m 3 : Quarterly, km runout PF s exceeding 10 7 m 3 : Yearly, km runout PF s exceeding 10 8 m 3 : Just one, unknown runout PF s exceeding 10 9 m 3 : None YET, but...

10 Flow Volume Context How much is 10 9 m 3 of flow? V Cube Width Depth in 1km 2 Radius 1m deep m (32.8 ) 1mm 18 m m (328 ) 1m 564 m m (0.6 mi) 1km 17.8 km m (1.3 mi) 10 km 56 km Moral: Flows bigger than 10 9 m 3 spell trouble. Flows bigger than m 3 are unrealistic. Flows bigger than m 3 cause mass extinction.

11 Flow Volume Context How much is 10 9 m 3 of flow? V Cube Width Depth in 1km 2 Radius 1m deep m (32.8 ) 1mm 18 m m (328 ) 1m 564 m m (0.6 mi) 1km 17.8 km m (1.3 mi) 10 km 56 km Montserrat is 16 km long 10 km wide. Moral: Flows bigger than 10 9 m 3 spell trouble. Flows bigger than m 3 are unrealistic. Flows bigger than m 3 cause mass extinction.

12 Flow Volume Context How much is 10 9 m 3 of flow? V Cube Width Depth in 1km 2 Radius 1m deep m (32.8 ) 1mm 18 m m (328 ) 1m 564 m m (0.6 mi) 1km 17.8 km m (1.3 mi) 10 km 56 km Montserrat is 16 km long 10 km wide. Largest volcanoes in recorded history: Tambora, Indonesia 1815 AD m 3 Baekdu Mt., China 979 AD m 3 Moral: Flows bigger than 10 9 m 3 spell trouble. Flows bigger than m 3 are unrealistic. Flows bigger than m 3 cause mass extinction.

13 Flow Volume Context How much is 10 9 m 3 of flow? V Cube Width Depth in 1km 2 Radius 1m deep m (32.8 ) 1mm 18 m m (328 ) 1m 564 m m (0.6 mi) 1km 17.8 km m (1.3 mi) 10 km 56 km Montserrat is 16 km long 10 km wide. Moral: Flows bigger than 10 9 m 3 spell trouble. Flows bigger than m 3 are unrealistic. Flows bigger than m 3 cause mass extinction.

14 Flow Volume Context How much is 10 9 m 3 of flow? V Cube Width Depth in 1km 2 Radius 1m deep m (32.8 ) 1mm 18 m m (328 ) 1m 564 m m (0.6 mi) 1km 17.8 km m (1.3 mi) 10 km 56 km Montserrat is 16 km long 10 km wide. Moral: Flows bigger than 10 9 m 3 spell trouble. Flows bigger than m 3 are unrealistic. Flows bigger than m 3 cause mass extinction.

15 Flow Volume Context How much is 10 9 m 3 of flow? V Cube Width Depth in 1km 2 Radius 1m deep m (32.8 ) 1mm 18 m m (328 ) 1m 564 m m (0.6 mi) 1km 17.8 km m (1.3 mi) 10 km 56 km Montserrat is 16 km long 10 km wide. Moral: Flows bigger than 10 9 m 3 spell trouble. Flows bigger than m 3 are unrealistic. Flows bigger than m 3 cause mass extinction.

16 How big is bad? V= V= V= V=10 9 V= V= Kinda depends which way it goes... but Ψ = or Ψ = 10 9 seems bad.

17 Power Laws Remember the nearly-linear Freq. vs. Volume log-log plot? For some slope α and intercept c, log ( P[V > v ]) αlog(v) + c, v > ǫ After exponentiation, P[V > v] = (v/ǫ) α, v > ǫ. This is the Pareto Pa(α,ǫ) distribution! Which is kind of bad news.

18 Power Laws Remember the nearly-linear Freq. vs. Volume log-log plot? For some slope α and intercept c, log ( P[V > v ]) αlog(v) + c, v > ǫ After exponentiation, P[V > v] = (v/ǫ) α, v > ǫ. This is the Pareto Pa(α,ǫ) distribution! Which is kind of bad news.

19 Power Laws Remember the nearly-linear Freq. vs. Volume log-log plot? For some slope α and intercept c, log ( P[V > v ]) αlog(v) + c, v > ǫ After exponentiation, P[V > v] = (v/ǫ) α, v > ǫ. This is the Pareto Pa(α,ǫ) distribution! Which is kind of bad news.

20 The Pareto Distribution The slope seems to be in the α 0.64 or so for 0 < α < 1, the Pareto has: Heavy tails; Infinite mean E[V ] = ; Infinite variance E[V 2 ] = ; No Central Limit Theorem for sums aggregate flow is approx. α-stable, not Gaussian.

21 Frequency of Large Flows MLE and reference Bayes estimates give α 0.64, and λ ǫ 22 flows/year exceeding ǫ = m 3. In twenty years we should expect about: 16.3 flows exceeding 10 7 m flows exceeding 10 8 m flows exceeding 10 9 m flows exceeding m flows exceeding m flows exceeding m 3 Suggest about 18% chance of V > in two decades, and about 1% chance of V > 10 12! Which are unrealistically high (we think). An idea

22 Frequency of Large Flows MLE and reference Bayes estimates give α 0.64, and λ ǫ 22 flows/year exceeding ǫ = m 3. In twenty years we should expect about: 16.3 flows exceeding 10 7 m flows exceeding 10 8 m flows exceeding 10 9 m flows exceeding m flows exceeding m flows exceeding m 3 Suggest about 18% chance of V > in two decades, and about 1% chance of V > 10 12! Which are unrealistically high (we think). An idea

23 Frequency of Large Flows MLE and reference Bayes estimates give α 0.64, and λ ǫ 22 flows/year exceeding ǫ = m 3. In twenty years we should expect about: 16.3 flows exceeding 10 7 m flows exceeding 10 8 m flows exceeding 10 9 m flows exceeding m flows exceeding m flows exceeding m 3 Suggest about 18% chance of V > in two decades, and about 1% chance of V > 10 12! Which are unrealistically high (we think). An idea

24 Frequency of Large Flows MLE and reference Bayes estimates give α 0.64, and λ ǫ 22 flows/year exceeding ǫ = m 3. In twenty years we should expect about: 16.3 flows exceeding 10 7 m flows exceeding 10 8 m flows exceeding 10 9 m flows exceeding m flows exceeding m flows exceeding m 3 Suggest about 18% chance of V > in two decades, and about 1% chance of V > 10 12! Which are unrealistically high (we think). An idea

25 Tempered Pareto Models There s a slight suggestion of a downturn in the Freq-vs-Vol log-log plot; perhaps a better model is Tempered Pareto, with P[V > v] = (v/ǫ) α e β(v ǫ), v > ǫ : Tempered Pareto TP(α = 0.43, ε = 10 4, κ = ) Distribution log 10 Prob [V > v] Note Downturn log 10 PF volume v (m 3 ) See the drop-off near 10 8 m 3 or 10 9 m 3? That s Good news!

26 Tempered Pareto Models There s a slight suggestion of a downturn in the Freq-vs-Vol log-log plot; perhaps a better model is Tempered Pareto, with P[V > v] = (v/ǫ) α e β(v ǫ), v > ǫ : Tempered Pareto TP(α = 0.43, ε = 10 4, κ = ) Distribution log 10 Prob [V > v] Note Downturn log 10 PF volume v (m 3 ) See the drop-off near 10 8 m 3 or 10 9 m 3? That s Good news!

27 Uncertainty MLE and reference Bayesian posterior means (± SD s) for the three parameters of this model are: Shape α 0.64 ± 0.04 Rate λ ± 1.3 /year Bend β ± /m 3 Note how uncertain β is... only 1.5 SD above zero. How does this affect risk predictions? How does Pareto (β = 0) model differ from Tempered Pareto (β > 0)?

28 Uncertainty MLE and reference Bayesian posterior means (± SD s) for the three parameters of this model are: Shape α 0.64 ± 0.04 Rate λ ± 1.3 /year Bend β ± /m 3 Note how uncertain β is... only 1.5 SD above zero. How does this affect risk predictions? How does Pareto (β = 0) model differ from Tempered Pareto (β > 0)?

29 Uncertainty MLE and reference Bayesian posterior means (± SD s) for the three parameters of this model are: Shape α 0.64 ± 0.04 Rate λ ± 1.3 /year Bend β ± /m 3 Note how uncertain β is... only 1.5 SD above zero. How does this affect risk predictions? How does Pareto (β = 0) model differ from Tempered Pareto (β > 0)?

30 Uncertainty MLE and reference Bayesian posterior means (± SD s) for the three parameters of this model are: Shape α 0.64 ± 0.04 Rate λ ± 1.3 /year Bend β ± /m 3 Note how uncertain β is... only 1.5 SD above zero. How does this affect risk predictions? How does Pareto (β = 0) model differ from Tempered Pareto (β > 0)?

31 Risk for Fifty Years of V > 10 8 m 3 Risk Comparisons, Tempered vs. Untempered Pareto, for Ψ 10 8 m 3 P[ Catastrophe in t years Data ] Pareto Model (Untempered) Pareto Model (Tempered) Time t (years)

32 Risk for Fifty Years of V > m 3 Risk Comparisons, Tempered vs. Untempered Pareto, for Ψ m 3 P[ Catastrophe in t years Data ] Pareto Model (Untempered) Pareto Model (Tempered) Time t (years)

33 Why the wide uncertainty? Annual Rate e+04 5e+05 5e+06 5e+07 5e+08 PF Volume (m 3 )

34 How does that affect estimated risk? P[ Catastrophe in t years Data ] Risk for Ψ m 3 Tempered Pareto model: Eα = 0.63, Eβ 1 = m Time t (years)

35 What if there is no tempering? P[ Catastrophe in t years Data ] Risk for Ψ m 3 Untempered Pareto model: Eα = Time t (years)

36 A little more extreme Truncated Pareto P[V > v] = v α L α ǫ α L α ǫ v L : Log 10 Prob [V > v] Log 10 PF volume V (m 3 ) Now the drop-off approaching 10 9 m 3 is fast!

37 A little more extreme Truncated Pareto Annual Rate e+04 5e+05 5e+06 5e+07 5e+08 PF Volume (m 3 )

38 A little more extreme Truncated Pareto P[ Catastrophe in t years Data ] Risk for Ψ 10 8 m 3 Truncated Zipf model: Eα = 0.63, EL = m 3 Prior L Ga(1, 1e 09) Time t (years)

39 Morals It s hard to tell about tail behaviour (extremes) from data; Clearer picture of tempering would help working to elicit expert opinion about physical limits Other model assumptions might bear closer scrutiny.

40 Morals It s hard to tell about tail behaviour (extremes) from data; Clearer picture of tempering would help working to elicit expert opinion about physical limits Other model assumptions might bear closer scrutiny.

41 Morals It s hard to tell about tail behaviour (extremes) from data; Clearer picture of tempering would help working to elicit expert opinion about physical limits Other model assumptions might bear closer scrutiny.

42 Morals It s hard to tell about tail behaviour (extremes) from data; Clearer picture of tempering would help working to elicit expert opinion about physical limits Other model assumptions might bear closer scrutiny.

43 Model Strengths Inspired by data feature (Flow vs. Frequency linearity on log-log scale); Accommodates wide range of censoring; Simulations from model appear similar to real data; Missing observations, irregular space and time are okay.

44 Model Shortcomings Assumes Stationarity, but patterns change over decades Assumes Independence, but patterns exist over short periods Employs Pareto, whose extreme tails are too flat after all, the m 3 volume of the earth is an absolute limit!

45 Open Questions How much does false Stationarity assumption matter? Can we make a dynamic time-varying version? How much does false Independence assumption matter? Should we construct cluster models? Do we need to? How much does false Pareto assumption matter? Does it help to temper the tails like P[V > v] = (v/ǫ) α e β(v ǫ), v > ǫ?

46 Open Questions How much does false Stationarity assumption matter? Can we make a dynamic time-varying version? How much does false Independence assumption matter? Should we construct cluster models? Do we need to? How much does false Pareto assumption matter? Does it help to temper the tails like P[V > v] = (v/ǫ) α e β(v ǫ), v > ǫ?

47 Open Questions How much does false Stationarity assumption matter? Can we make a dynamic time-varying version? How much does false Independence assumption matter? Should we construct cluster models? Do we need to? How much does false Pareto assumption matter? Does it help to temper the tails like P[V > v] = (v/ǫ) α e β(v ǫ), v > ǫ? or P[V > v] (v/ǫ) α 1 {v<l}, v > ǫ?

48 Ongoing Work & Extensions Studying possible variable rate of eruption process; Studying possible dependence between angle φ and flow volume V ; Refining dataset and accumulating additional data; Improving our censorship models for complex data. Group trip to Montserrat to see data acquisition first-hand!

49 Ongoing Work & Extensions Studying possible variable rate of eruption process; Studying possible dependence between angle φ and flow volume V ; Refining dataset and accumulating additional data; Improving our censorship models for complex data. Group trip to Montserrat to see data acquisition first-hand!

50 A Collaborative Effort

51 Thanks! More details (references, this talk in.pdf, related work) are available from Froggy website or on request from Special thanks to Kerrie Mengersen, collaborator in ongoing work on tempering extremes in a wide area of applications. Thanks to SAMSI and to NSF/FRG and, especially, to Jim Berger!

52 Collaboration is such hard work...

Hazard Assessment for Pyroclastic Flows

Hazard Assessment for Pyroclastic Flows Hazard Assessment for Pyroclastic Flows Robert L Wolpert Duke University Department of Statistical Science with James O Berger (Duke), E. Bruce Pitman, Eliza S. Calder & Abani K. Patra (U Buffalo), M.J.

More information

Inference for non-stationary, non-gaussian, Irregularly-Sampled Processes

Inference for non-stationary, non-gaussian, Irregularly-Sampled Processes Inference for non-stationary, non-gaussian, Irregularly-Sampled Processes Robert Wolpert Duke Department of Statistical Science wolpert@stat.duke.edu SAMSI ASTRO Opening WS Aug 22-26 2016 Aug 24 Hamner

More information

EM Algorithm & High Dimensional Data

EM Algorithm & High Dimensional Data EM Algorithm & High Dimensional Data Nuno Vasconcelos (Ken Kreutz-Delgado) UCSD Gaussian EM Algorithm For the Gaussian mixture model, we have Expectation Step (E-Step): Maximization Step (M-Step): 2 EM

More information

Machine Learning CSE546 Carlos Guestrin University of Washington. September 30, What about continuous variables?

Machine Learning CSE546 Carlos Guestrin University of Washington. September 30, What about continuous variables? Linear Regression Machine Learning CSE546 Carlos Guestrin University of Washington September 30, 2014 1 What about continuous variables? n Billionaire says: If I am measuring a continuous variable, what

More information

Statistical inference

Statistical inference Statistical inference Contents 1. Main definitions 2. Estimation 3. Testing L. Trapani MSc Induction - Statistical inference 1 1 Introduction: definition and preliminary theory In this chapter, we shall

More information

STA414/2104 Statistical Methods for Machine Learning II

STA414/2104 Statistical Methods for Machine Learning II STA414/2104 Statistical Methods for Machine Learning II Murat A. Erdogdu & David Duvenaud Department of Computer Science Department of Statistical Sciences Lecture 3 Slide credits: Russ Salakhutdinov Announcements

More information

Imperfect Data in an Uncertain World

Imperfect Data in an Uncertain World Imperfect Data in an Uncertain World James B. Elsner Department of Geography, Florida State University Tallahassee, Florida Corresponding author address: Dept. of Geography, Florida State University Tallahassee,

More information

UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Final, Fall 2013

UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Final, Fall 2013 UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Final, Fall 2013 Exam policy: This exam allows two one-page, two-sided cheat sheets; No other materials. Time: 2 hours. Be sure to write your name and

More information

Fundamentals of Machine Learning for Predictive Data Analytics

Fundamentals of Machine Learning for Predictive Data Analytics Fundamentals of Machine Learning for Predictive Data Analytics Chapter 6: Probability-based Learning Sections 6.1, 6.2, 6.3 John Kelleher and Brian Mac Namee and Aoife D Arcy john.d.kelleher@dit.ie brian.macnamee@ucd.ie

More information

Bayesian hierarchical space time model applied to highresolution hindcast data of significant wave height

Bayesian hierarchical space time model applied to highresolution hindcast data of significant wave height Bayesian hierarchical space time model applied to highresolution hindcast data of Presented at Wave workshop 2013, Banff, Canada Erik Vanem Introduction The Bayesian hierarchical space-time models were

More information

Physics 509: Bootstrap and Robust Parameter Estimation

Physics 509: Bootstrap and Robust Parameter Estimation Physics 509: Bootstrap and Robust Parameter Estimation Scott Oser Lecture #20 Physics 509 1 Nonparametric parameter estimation Question: what error estimate should you assign to the slope and intercept

More information

Fitting a Straight Line to Data

Fitting a Straight Line to Data Fitting a Straight Line to Data Thanks for your patience. Finally we ll take a shot at real data! The data set in question is baryonic Tully-Fisher data from http://astroweb.cwru.edu/sparc/btfr Lelli2016a.mrt,

More information

Point Estimation. Vibhav Gogate The University of Texas at Dallas

Point Estimation. Vibhav Gogate The University of Texas at Dallas Point Estimation Vibhav Gogate The University of Texas at Dallas Some slides courtesy of Carlos Guestrin, Chris Bishop, Dan Weld and Luke Zettlemoyer. Basics: Expectation and Variance Binary Variables

More information

Why Bayesian? Rigorous approach to address statistical estimation problems. The Bayesian philosophy is mature and powerful.

Why Bayesian? Rigorous approach to address statistical estimation problems. The Bayesian philosophy is mature and powerful. Why Bayesian? Rigorous approach to address statistical estimation problems. The Bayesian philosophy is mature and powerful. Even if you aren t Bayesian, you can define an uninformative prior and everything

More information

Estimation of Operational Risk Capital Charge under Parameter Uncertainty

Estimation of Operational Risk Capital Charge under Parameter Uncertainty Estimation of Operational Risk Capital Charge under Parameter Uncertainty Pavel V. Shevchenko Principal Research Scientist, CSIRO Mathematical and Information Sciences, Sydney, Locked Bag 17, North Ryde,

More information

Machine Learning CSE546 Sham Kakade University of Washington. Oct 4, What about continuous variables?

Machine Learning CSE546 Sham Kakade University of Washington. Oct 4, What about continuous variables? Linear Regression Machine Learning CSE546 Sham Kakade University of Washington Oct 4, 2016 1 What about continuous variables? Billionaire says: If I am measuring a continuous variable, what can you do

More information

Machine Learning CSE546 Carlos Guestrin University of Washington. September 30, 2013

Machine Learning CSE546 Carlos Guestrin University of Washington. September 30, 2013 Bayesian Methods Machine Learning CSE546 Carlos Guestrin University of Washington September 30, 2013 1 What about prior n Billionaire says: Wait, I know that the thumbtack is close to 50-50. What can you

More information

Measuring Uncertainty in Spatial Data via Bayesian Melding

Measuring Uncertainty in Spatial Data via Bayesian Melding Measuring Uncertainty in Spatial Data via Bayesian Melding Matt Falk Queensland University of Technology (QUT) m.falk@qut.edu.au Joint work with Robert Denham (NRW) and Kerrie Mengersen (QUT) Data Driven

More information

Approximate Bayesian Computation and Particle Filters

Approximate Bayesian Computation and Particle Filters Approximate Bayesian Computation and Particle Filters Dennis Prangle Reading University 5th February 2014 Introduction Talk is mostly a literature review A few comments on my own ongoing research See Jasra

More information

Statistical Inference

Statistical Inference Statistical Inference Robert L. Wolpert Institute of Statistics and Decision Sciences Duke University, Durham, NC, USA Spring, 2006 1. DeGroot 1973 In (DeGroot 1973), Morrie DeGroot considers testing the

More information

Model Fitting. Jean Yves Le Boudec

Model Fitting. Jean Yves Le Boudec Model Fitting Jean Yves Le Boudec 0 Contents 1. What is model fitting? 2. Linear Regression 3. Linear regression with norm minimization 4. Choosing a distribution 5. Heavy Tail 1 Virus Infection Data We

More information

LECTURE NOTE #3 PROF. ALAN YUILLE

LECTURE NOTE #3 PROF. ALAN YUILLE LECTURE NOTE #3 PROF. ALAN YUILLE 1. Three Topics (1) Precision and Recall Curves. Receiver Operating Characteristic Curves (ROC). What to do if we do not fix the loss function? (2) The Curse of Dimensionality.

More information

Lecture notes on statistical decision theory Econ 2110, fall 2013

Lecture notes on statistical decision theory Econ 2110, fall 2013 Lecture notes on statistical decision theory Econ 2110, fall 2013 Maximilian Kasy March 10, 2014 These lecture notes are roughly based on Robert, C. (2007). The Bayesian choice: from decision-theoretic

More information

A Bayesian Model for Root Computation

A Bayesian Model for Root Computation A Bayesian Model for Root Computation Josef Schicho, RICAM joint work with Hanna K. Pikkarainen, RICAM Approximate Algebraic Computation Observation: choose any computational problem in algebra that involves

More information

Lecture : Probabilistic Machine Learning

Lecture : Probabilistic Machine Learning Lecture : Probabilistic Machine Learning Riashat Islam Reasoning and Learning Lab McGill University September 11, 2018 ML : Many Methods with Many Links Modelling Views of Machine Learning Machine Learning

More information

Fin285a:Computer Simulations and Risk Assessment Section 6.2 Extreme Value Theory Daníelson, 9 (skim), skip 9.5

Fin285a:Computer Simulations and Risk Assessment Section 6.2 Extreme Value Theory Daníelson, 9 (skim), skip 9.5 Fin285a:Computer Simulations and Risk Assessment Section 6.2 Extreme Value Theory Daníelson, 9 (skim), skip 9.5 Overview Extreme value distributions Generalized Pareto distributions Tail shapes Using power

More information

1 Some Statistical Basics.

1 Some Statistical Basics. Q Some Statistical Basics. Statistics treats random errors. (There are also systematic errors e.g., if your watch is 5 minutes fast, you will always get the wrong time, but it won t be random.) The two

More information

Weakly informative priors

Weakly informative priors Department of Statistics and Department of Political Science Columbia University 21 Oct 2011 Collaborators (in order of appearance): Gary King, Frederic Bois, Aleks Jakulin, Vince Dorie, Sophia Rabe-Hesketh,

More information

Bayesian Analysis for Natural Language Processing Lecture 2

Bayesian Analysis for Natural Language Processing Lecture 2 Bayesian Analysis for Natural Language Processing Lecture 2 Shay Cohen February 4, 2013 Administrativia The class has a mailing list: coms-e6998-11@cs.columbia.edu Need two volunteers for leading a discussion

More information

Learning Theory. Machine Learning CSE546 Carlos Guestrin University of Washington. November 25, Carlos Guestrin

Learning Theory. Machine Learning CSE546 Carlos Guestrin University of Washington. November 25, Carlos Guestrin Learning Theory Machine Learning CSE546 Carlos Guestrin University of Washington November 25, 2013 Carlos Guestrin 2005-2013 1 What now n We have explored many ways of learning from data n But How good

More information

Statistical Models for Rainfall with Applications to Index Insura

Statistical Models for Rainfall with Applications to Index Insura Statistical Models for Rainfall with Applications to April 21, 2008 Overview The idea: Insure farmers against the risk of crop failure, like drought, instead of crop failure itself. It reduces moral hazard

More information

Dimensionality Reduction and Principle Components

Dimensionality Reduction and Principle Components Dimensionality Reduction and Principle Components Ken Kreutz-Delgado (Nuno Vasconcelos) UCSD ECE Department Winter 2012 Motivation Recall, in Bayesian decision theory we have: World: States Y in {1,...,

More information

Bayesian Learning (II)

Bayesian Learning (II) Universität Potsdam Institut für Informatik Lehrstuhl Maschinelles Lernen Bayesian Learning (II) Niels Landwehr Overview Probabilities, expected values, variance Basic concepts of Bayesian learning MAP

More information

Linear Models for Classification

Linear Models for Classification Linear Models for Classification Oliver Schulte - CMPT 726 Bishop PRML Ch. 4 Classification: Hand-written Digit Recognition CHINE INTELLIGENCE, VOL. 24, NO. 24, APRIL 2002 x i = t i = (0, 0, 0, 1, 0, 0,

More information

MTH What is a Difference Quotient?

MTH What is a Difference Quotient? MTH 111 - What is a Difference Quotient? Another way to think about the difference quotient is that it yields a new function that gives the average rate of change of the original function, for two points

More information

Ridge regression. Patrick Breheny. February 8. Penalized regression Ridge regression Bayesian interpretation

Ridge regression. Patrick Breheny. February 8. Penalized regression Ridge regression Bayesian interpretation Patrick Breheny February 8 Patrick Breheny High-Dimensional Data Analysis (BIOS 7600) 1/27 Introduction Basic idea Standardization Large-scale testing is, of course, a big area and we could keep talking

More information

Business Statistics. Lecture 10: Course Review

Business Statistics. Lecture 10: Course Review Business Statistics Lecture 10: Course Review 1 Descriptive Statistics for Continuous Data Numerical Summaries Location: mean, median Spread or variability: variance, standard deviation, range, percentiles,

More information

(1) Introduction to Bayesian statistics

(1) Introduction to Bayesian statistics Spring, 2018 A motivating example Student 1 will write down a number and then flip a coin If the flip is heads, they will honestly tell student 2 if the number is even or odd If the flip is tails, they

More information

Simple Spatial Growth Models The Origins of Scaling in Size Distributions

Simple Spatial Growth Models The Origins of Scaling in Size Distributions Lectures on Spatial Complexity 17 th 28 th October 2011 Lecture 3: 21 st October 2011 Simple Spatial Growth Models The Origins of Scaling in Size Distributions Michael Batty m.batty@ucl.ac.uk @jmichaelbatty

More information

Floating point Numbers

Floating point Numbers Floating point Numbers William Gould President StataCorp LP July 2014, Boston W. Gould (StataCorp) Floating point Numbers: A visit through the looking glass July 2014 1 / 62 Apologia No Report to Users

More information

PHASES OF STATISTICAL ANALYSIS 1. Initial Data Manipulation Assembling data Checks of data quality - graphical and numeric

PHASES OF STATISTICAL ANALYSIS 1. Initial Data Manipulation Assembling data Checks of data quality - graphical and numeric PHASES OF STATISTICAL ANALYSIS 1. Initial Data Manipulation Assembling data Checks of data quality - graphical and numeric 2. Preliminary Analysis: Clarify Directions for Analysis Identifying Data Structure:

More information

SYDE 372 Introduction to Pattern Recognition. Probability Measures for Classification: Part I

SYDE 372 Introduction to Pattern Recognition. Probability Measures for Classification: Part I SYDE 372 Introduction to Pattern Recognition Probability Measures for Classification: Part I Alexander Wong Department of Systems Design Engineering University of Waterloo Outline 1 2 3 4 Why use probability

More information

LECTURE 5. Introduction to Econometrics. Hypothesis testing

LECTURE 5. Introduction to Econometrics. Hypothesis testing LECTURE 5 Introduction to Econometrics Hypothesis testing October 18, 2016 1 / 26 ON TODAY S LECTURE We are going to discuss how hypotheses about coefficients can be tested in regression models We will

More information

ORF 245 Fundamentals of Statistics Chapter 9 Hypothesis Testing

ORF 245 Fundamentals of Statistics Chapter 9 Hypothesis Testing ORF 245 Fundamentals of Statistics Chapter 9 Hypothesis Testing Robert Vanderbei Fall 2014 Slides last edited on November 24, 2014 http://www.princeton.edu/ rvdb Coin Tossing Example Consider two coins.

More information

Parameter Estimation. Industrial AI Lab.

Parameter Estimation. Industrial AI Lab. Parameter Estimation Industrial AI Lab. Generative Model X Y w y = ω T x + ε ε~n(0, σ 2 ) σ 2 2 Maximum Likelihood Estimation (MLE) Estimate parameters θ ω, σ 2 given a generative model Given observed

More information

Gaussians Linear Regression Bias-Variance Tradeoff

Gaussians Linear Regression Bias-Variance Tradeoff Readings listed in class website Gaussians Linear Regression Bias-Variance Tradeoff Machine Learning 10701/15781 Carlos Guestrin Carnegie Mellon University January 22 nd, 2007 Maximum Likelihood Estimation

More information

Machine Learning! in just a few minutes. Jan Peters Gerhard Neumann

Machine Learning! in just a few minutes. Jan Peters Gerhard Neumann Machine Learning! in just a few minutes Jan Peters Gerhard Neumann 1 Purpose of this Lecture Foundations of machine learning tools for robotics We focus on regression methods and general principles Often

More information

Sharp statistical tools Statistics for extremes

Sharp statistical tools Statistics for extremes Sharp statistical tools Statistics for extremes Georg Lindgren Lund University October 18, 2012 SARMA Background Motivation We want to predict outside the range of observations Sums, averages and proportions

More information

An Introduction to Path Analysis

An Introduction to Path Analysis An Introduction to Path Analysis PRE 905: Multivariate Analysis Lecture 10: April 15, 2014 PRE 905: Lecture 10 Path Analysis Today s Lecture Path analysis starting with multivariate regression then arriving

More information

Bayesian methods for missing data: part 1. Key Concepts. Nicky Best and Alexina Mason. Imperial College London

Bayesian methods for missing data: part 1. Key Concepts. Nicky Best and Alexina Mason. Imperial College London Bayesian methods for missing data: part 1 Key Concepts Nicky Best and Alexina Mason Imperial College London BAYES 2013, May 21-23, Erasmus University Rotterdam Missing Data: Part 1 BAYES2013 1 / 68 Outline

More information

PAC-learning, VC Dimension and Margin-based Bounds

PAC-learning, VC Dimension and Margin-based Bounds More details: General: http://www.learning-with-kernels.org/ Example of more complex bounds: http://www.research.ibm.com/people/t/tzhang/papers/jmlr02_cover.ps.gz PAC-learning, VC Dimension and Margin-based

More information

Parameter estimation Conditional risk

Parameter estimation Conditional risk Parameter estimation Conditional risk Formalizing the problem Specify random variables we care about e.g., Commute Time e.g., Heights of buildings in a city We might then pick a particular distribution

More information

Winter. Here s what a weak La Nina usually brings to the nation with tempseraures:

Winter. Here s what a weak La Nina usually brings to the nation with tempseraures: 2017-2018 Winter Time again for my annual Winter Weather Outlook. Here's just a small part of the items I considered this year and how I think they will play out with our winter of 2017-2018. El Nino /

More information

ECO 317 Economics of Uncertainty Fall Term 2009 Slides to accompany 13. Markets and Efficient Risk-Bearing: Examples and Extensions

ECO 317 Economics of Uncertainty Fall Term 2009 Slides to accompany 13. Markets and Efficient Risk-Bearing: Examples and Extensions ECO 317 Economics of Uncertainty Fall Term 2009 Slides to accompany 13. Markets and Efficient Risk-Bearing: Examples and Extensions 1. Allocation of Risk in Mean-Variance Framework S states of the world,

More information

Chapter 18. Sampling Distribution Models /51

Chapter 18. Sampling Distribution Models /51 Chapter 18 Sampling Distribution Models 1 /51 Homework p432 2, 4, 6, 8, 10, 16, 17, 20, 30, 36, 41 2 /51 3 /51 Objective Students calculate values of central 4 /51 The Central Limit Theorem for Sample

More information

Appendix 1: UK climate projections

Appendix 1: UK climate projections Appendix 1: UK climate projections The UK Climate Projections 2009 provide the most up-to-date estimates of how the climate may change over the next 100 years. They are an invaluable source of information

More information

The Generalized Likelihood Uncertainty Estimation methodology

The Generalized Likelihood Uncertainty Estimation methodology CHAPTER 4 The Generalized Likelihood Uncertainty Estimation methodology Calibration and uncertainty estimation based upon a statistical framework is aimed at finding an optimal set of models, parameters

More information

Bayesian spatial quantile regression

Bayesian spatial quantile regression Brian J. Reich and Montserrat Fuentes North Carolina State University and David B. Dunson Duke University E-mail:reich@stat.ncsu.edu Tropospheric ozone Tropospheric ozone has been linked with several adverse

More information

Stable Limit Laws for Marginal Probabilities from MCMC Streams: Acceleration of Convergence

Stable Limit Laws for Marginal Probabilities from MCMC Streams: Acceleration of Convergence Stable Limit Laws for Marginal Probabilities from MCMC Streams: Acceleration of Convergence Robert L. Wolpert Institute of Statistics and Decision Sciences Duke University, Durham NC 778-5 - Revised April,

More information

Quantile POD for Hit-Miss Data

Quantile POD for Hit-Miss Data Quantile POD for Hit-Miss Data Yew-Meng Koh a and William Q. Meeker a a Center for Nondestructive Evaluation, Department of Statistics, Iowa State niversity, Ames, Iowa 50010 Abstract. Probability of detection

More information

Graphing and Density

Graphing and Density Graphing and Density In addition to values that you can directly measure like length, mass, or volume, chemistry is filled with values that must be calculated. The most common of these is density. Density

More information

Spatial Inference of Nitrate Concentrations in Groundwater

Spatial Inference of Nitrate Concentrations in Groundwater Spatial Inference of Nitrate Concentrations in Groundwater Dawn Woodard Operations Research & Information Engineering Cornell University joint work with Robert Wolpert, Duke Univ. Dept. of Statistical

More information

Weakly informative priors

Weakly informative priors Department of Statistics and Department of Political Science Columbia University 23 Apr 2014 Collaborators (in order of appearance): Gary King, Frederic Bois, Aleks Jakulin, Vince Dorie, Sophia Rabe-Hesketh,

More information

Introduction to Bayesian Inference

Introduction to Bayesian Inference University of Pennsylvania EABCN Training School May 10, 2016 Bayesian Inference Ingredients of Bayesian Analysis: Likelihood function p(y φ) Prior density p(φ) Marginal data density p(y ) = p(y φ)p(φ)dφ

More information

Today. Statistical Learning. Coin Flip. Coin Flip. Experiment 1: Heads. Experiment 1: Heads. Which coin will I use? Which coin will I use?

Today. Statistical Learning. Coin Flip. Coin Flip. Experiment 1: Heads. Experiment 1: Heads. Which coin will I use? Which coin will I use? Today Statistical Learning Parameter Estimation: Maximum Likelihood (ML) Maximum A Posteriori (MAP) Bayesian Continuous case Learning Parameters for a Bayesian Network Naive Bayes Maximum Likelihood estimates

More information

Contents. Decision Making under Uncertainty 1. Meanings of uncertainty. Classical interpretation

Contents. Decision Making under Uncertainty 1. Meanings of uncertainty. Classical interpretation Contents Decision Making under Uncertainty 1 elearning resources Prof. Ahti Salo Helsinki University of Technology http://www.dm.hut.fi Meanings of uncertainty Interpretations of probability Biases in

More information

Machine Learning

Machine Learning Machine Learning 10-701 Tom M. Mitchell Machine Learning Department Carnegie Mellon University January 13, 2011 Today: The Big Picture Overfitting Review: probability Readings: Decision trees, overfiting

More information

Lecture 2 APPLICATION OF EXREME VALUE THEORY TO CLIMATE CHANGE. Rick Katz

Lecture 2 APPLICATION OF EXREME VALUE THEORY TO CLIMATE CHANGE. Rick Katz 1 Lecture 2 APPLICATION OF EXREME VALUE THEORY TO CLIMATE CHANGE Rick Katz Institute for Study of Society and Environment National Center for Atmospheric Research Boulder, CO USA email: rwk@ucar.edu Home

More information

Modeling Orbital Debris Problems

Modeling Orbital Debris Problems Modeling Orbital Debris Problems NAME Space Debris: Is It Really That Bad? One problem with which NASA and space scientists from other countries must deal is the accumulation of space debris in orbit around

More information

9/12/17. Types of learning. Modeling data. Supervised learning: Classification. Supervised learning: Regression. Unsupervised learning: Clustering

9/12/17. Types of learning. Modeling data. Supervised learning: Classification. Supervised learning: Regression. Unsupervised learning: Clustering Types of learning Modeling data Supervised: we know input and targets Goal is to learn a model that, given input data, accurately predicts target data Unsupervised: we know the input only and want to make

More information

appstats27.notebook April 06, 2017

appstats27.notebook April 06, 2017 Chapter 27 Objective Students will conduct inference on regression and analyze data to write a conclusion. Inferences for Regression An Example: Body Fat and Waist Size pg 634 Our chapter example revolves

More information

CS 361: Probability & Statistics

CS 361: Probability & Statistics October 17, 2017 CS 361: Probability & Statistics Inference Maximum likelihood: drawbacks A couple of things might trip up max likelihood estimation: 1) Finding the maximum of some functions can be quite

More information

An Introduction to Mplus and Path Analysis

An Introduction to Mplus and Path Analysis An Introduction to Mplus and Path Analysis PSYC 943: Fundamentals of Multivariate Modeling Lecture 10: October 30, 2013 PSYC 943: Lecture 10 Today s Lecture Path analysis starting with multivariate regression

More information

Statistical Distribution Assumptions of General Linear Models

Statistical Distribution Assumptions of General Linear Models Statistical Distribution Assumptions of General Linear Models Applied Multilevel Models for Cross Sectional Data Lecture 4 ICPSR Summer Workshop University of Colorado Boulder Lecture 4: Statistical Distributions

More information

value of the sum standard units

value of the sum standard units Stat 1001 Winter 1998 Geyer Homework 7 Problem 18.1 20 and 25. Problem 18.2 (a) Average of the box. (1+3+5+7)=4=4. SD of the box. The deviations from the average are,3,,1, 1, 3. The squared deviations

More information

Bayesian inference J. Daunizeau

Bayesian inference J. Daunizeau Bayesian inference J. Daunizeau Brain and Spine Institute, Paris, France Wellcome Trust Centre for Neuroimaging, London, UK Overview of the talk 1 Probabilistic modelling and representation of uncertainty

More information

Our first case consists of those sequences, which are obtained by adding a constant number d to obtain subsequent elements:

Our first case consists of those sequences, which are obtained by adding a constant number d to obtain subsequent elements: Week 13 Sequences and Series Many images below are excerpts from the multimedia textbook. You can find them there and in your textbook in sections 7.2 and 7.3. We have encountered the first sequences and

More information

A Bayesian Approach to Phylogenetics

A Bayesian Approach to Phylogenetics A Bayesian Approach to Phylogenetics Niklas Wahlberg Based largely on slides by Paul Lewis (www.eeb.uconn.edu) An Introduction to Bayesian Phylogenetics Bayesian inference in general Markov chain Monte

More information

Bayesian Statistics. State University of New York at Buffalo. From the SelectedWorks of Joseph Lucke. Joseph F. Lucke

Bayesian Statistics. State University of New York at Buffalo. From the SelectedWorks of Joseph Lucke. Joseph F. Lucke State University of New York at Buffalo From the SelectedWorks of Joseph Lucke 2009 Bayesian Statistics Joseph F. Lucke Available at: https://works.bepress.com/joseph_lucke/6/ Bayesian Statistics Joseph

More information

PMR Learning as Inference

PMR Learning as Inference Outline PMR Learning as Inference Probabilistic Modelling and Reasoning Amos Storkey Modelling 2 The Exponential Family 3 Bayesian Sets School of Informatics, University of Edinburgh Amos Storkey PMR Learning

More information

PROFESSOR: WELCOME BACK TO THE LAST LECTURE OF THE SEMESTER. PLANNING TO DO TODAY WAS FINISH THE BOOK. FINISH SECTION 6.5

PROFESSOR: WELCOME BACK TO THE LAST LECTURE OF THE SEMESTER. PLANNING TO DO TODAY WAS FINISH THE BOOK. FINISH SECTION 6.5 1 MATH 16A LECTURE. DECEMBER 9, 2008. PROFESSOR: WELCOME BACK TO THE LAST LECTURE OF THE SEMESTER. I HOPE YOU ALL WILL MISS IT AS MUCH AS I DO. SO WHAT I WAS PLANNING TO DO TODAY WAS FINISH THE BOOK. FINISH

More information

Sequence convergence, the weak T-axioms, and first countability

Sequence convergence, the weak T-axioms, and first countability Sequence convergence, the weak T-axioms, and first countability 1 Motivation Up to now we have been mentioning the notion of sequence convergence without actually defining it. So in this section we will

More information

Ages of stellar populations from color-magnitude diagrams. Paul Baines. September 30, 2008

Ages of stellar populations from color-magnitude diagrams. Paul Baines. September 30, 2008 Ages of stellar populations from color-magnitude diagrams Paul Baines Department of Statistics Harvard University September 30, 2008 Context & Example Welcome! Today we will look at using hierarchical

More information

Bayesian Approaches Data Mining Selected Technique

Bayesian Approaches Data Mining Selected Technique Bayesian Approaches Data Mining Selected Technique Henry Xiao xiao@cs.queensu.ca School of Computing Queen s University Henry Xiao CISC 873 Data Mining p. 1/17 Probabilistic Bases Review the fundamentals

More information

Measuring Keepers S E S S I O N 1. 5 A

Measuring Keepers S E S S I O N 1. 5 A S E S S I O N 1. 5 A Measuring Keepers Math Focus Points Naming, notating, and telling time to the hour on a digital and an analog clock Understanding the meaning of at least in the context of linear measurement

More information

Math 1 Exponential Functions Unit 2018

Math 1 Exponential Functions Unit 2018 1 Math 1 Exponential Functions Unit 2018 Points: /10 Name: Graphing Exponential Functions/Domain and Range Exponential Functions (Growth and Decay) Tables/Word Problems Linear vs Exponential Functions

More information

Chapter 6. Logistic Regression. 6.1 A linear model for the log odds

Chapter 6. Logistic Regression. 6.1 A linear model for the log odds Chapter 6 Logistic Regression In logistic regression, there is a categorical response variables, often coded 1=Yes and 0=No. Many important phenomena fit this framework. The patient survives the operation,

More information

Obnoxious lateness humor

Obnoxious lateness humor Obnoxious lateness humor 1 Using Bayesian Model Averaging For Addressing Model Uncertainty in Environmental Risk Assessment Louise Ryan and Melissa Whitney Department of Biostatistics Harvard School of

More information

=.55 = = 5.05

=.55 = = 5.05 MAT1193 4c Definition of derivative With a better understanding of limits we return to idea of the instantaneous velocity or instantaneous rate of change. Remember that in the example of calculating the

More information

Stat 451 Lecture Notes Markov Chain Monte Carlo. Ryan Martin UIC

Stat 451 Lecture Notes Markov Chain Monte Carlo. Ryan Martin UIC Stat 451 Lecture Notes 07 12 Markov Chain Monte Carlo Ryan Martin UIC www.math.uic.edu/~rgmartin 1 Based on Chapters 8 9 in Givens & Hoeting, Chapters 25 27 in Lange 2 Updated: April 4, 2016 1 / 42 Outline

More information

Dimensionality Reduction and Principal Components

Dimensionality Reduction and Principal Components Dimensionality Reduction and Principal Components Nuno Vasconcelos (Ken Kreutz-Delgado) UCSD Motivation Recall, in Bayesian decision theory we have: World: States Y in {1,..., M} and observations of X

More information

Ways to make neural networks generalize better

Ways to make neural networks generalize better Ways to make neural networks generalize better Seminar in Deep Learning University of Tartu 04 / 10 / 2014 Pihel Saatmann Topics Overview of ways to improve generalization Limiting the size of the weights

More information

Artificial Intelligence Uncertainty

Artificial Intelligence Uncertainty Artificial Intelligence Uncertainty Ch. 13 Uncertainty Let action A t = leave for airport t minutes before flight Will A t get me there on time? A 25, A 60, A 3600 Uncertainty: partial observability (road

More information

Towards a more physically based approach to Extreme Value Analysis in the climate system

Towards a more physically based approach to Extreme Value Analysis in the climate system N O A A E S R L P H Y S IC A L S C IE N C E S D IV IS IO N C IR E S Towards a more physically based approach to Extreme Value Analysis in the climate system Prashant Sardeshmukh Gil Compo Cecile Penland

More information

Contents. Part I: Fundamentals of Bayesian Inference 1

Contents. Part I: Fundamentals of Bayesian Inference 1 Contents Preface xiii Part I: Fundamentals of Bayesian Inference 1 1 Probability and inference 3 1.1 The three steps of Bayesian data analysis 3 1.2 General notation for statistical inference 4 1.3 Bayesian

More information

Uncertainty. Chapter 13

Uncertainty. Chapter 13 Uncertainty Chapter 13 Outline Uncertainty Probability Syntax and Semantics Inference Independence and Bayes Rule Uncertainty Let s say you want to get to the airport in time for a flight. Let action A

More information

5.1. EXPONENTIAL FUNCTIONS AND THEIR GRAPHS

5.1. EXPONENTIAL FUNCTIONS AND THEIR GRAPHS 5.1. EXPONENTIAL FUNCTIONS AND THEIR GRAPHS 1 What You Should Learn Recognize and evaluate exponential functions with base a. Graph exponential functions and use the One-to-One Property. Recognize, evaluate,

More information

Parameter Estimation

Parameter Estimation Parameter Estimation Chapters 13-15 Stat 477 - Loss Models Chapters 13-15 (Stat 477) Parameter Estimation Brian Hartman - BYU 1 / 23 Methods for parameter estimation Methods for parameter estimation Methods

More information

Generating Lévy Random Fields

Generating Lévy Random Fields Robert L Wolpert http://www.stat.duke.edu/ rlw/ Department of Statistical Science and Nicholas School of the Environment Duke University Durham, NC USA Århus University 2009 May 21 22 Outline Motivation:

More information

HIERARCHICAL MODELS IN EXTREME VALUE THEORY

HIERARCHICAL MODELS IN EXTREME VALUE THEORY HIERARCHICAL MODELS IN EXTREME VALUE THEORY Richard L. Smith Department of Statistics and Operations Research, University of North Carolina, Chapel Hill and Statistical and Applied Mathematical Sciences

More information