Incorporating Time-Invariant Covariates into the Pareto/NBD and BG/NBD Models

Size: px
Start display at page:

Download "Incorporating Time-Invariant Covariates into the Pareto/NBD and BG/NBD Models"

Transcription

1 Incorporating Time-Invariant Covariates into the Pareto/NBD and BG/NBD Models Peter S Fader wwwpetefadercom Bruce G S Hardie wwwbrucehardiecom August Introduction This note documents how to incorporate the effects of time-invariant covariates into the Pareto/NBD and BG/NBD models It is assumed that the reader is familiar with the derivations of these models, as presented in Fader and Hardie 2005 and Fader et al 2005 Let z 1 be the vector of time-invariant covariates that are assumed to explain some of the cross-sectional heterogeneity in the purchasing process, and z 2 be the vector of time-invariant covariates that are assumed to explain some of the cross-sectional heterogeneity in the dropout process While it will typically be the case that z 1 = z 2, this equality is not assumed in these derivations Note that since these covariates are specific to each individual, both z 1 and z 2 should have an individual-specific subscript i; this has been suppressed for notational convenience The basic results are as follows: For the Pareto/NBD model, we simply replace α and β with α = α 0 exp γ 1z 1 β = β 0 exp γ 2z 2 where γ 1 and γ 2 capture the effects of these two vectors of covariates; r and s remain unchanged c 2007 Peter S Fader and Bruce G S Hardie This document can be found at < 1

2 For the BG/NBD model, we simply replace α, a, and b with α = α 0 exp γ 1z 1 a = a 0 exp γ 2z 2 b = b 0 exp γ 3z 2 where γ 1, γ 2 and γ 3 capture the effects of these two vectors of covariates; r remains unchanged 2 Preliminaries A popular, easily interpretable method for incorporating the effects of exogenous covariates in event-time models is the proportional hazards approach In this framework, the covariates have a multiplicative effect on the hazard rate More specifically, let F 0 t θ be the so-called baseline cdf for the distribution of an individual s interpurchase times, and f 0 t θ and h 0 t θ the associated pdf and hazard rate function The most common formulation of the proportional hazards specification states that ht θ, γ, z i =h 0 t θ expγ z where z denotes the vector of time-invariant covariates and γ denotes the effects of these covariates Note that z must not include an intercept term When the baseline is distributed exponential with rate parameter θ, ht θ, γ, z =θ expγ z It follows from the definition of the hazard rate function, ht = that, since F 0=0, F t =1 exp Therefore, and ft 1 F t t F t θ, γ, z =1 e θ expγ zt 0 hudu ft θ, γ, z =θ expγ ze θ expγ zt 2

3 3 The Case of the Pareto/NBD Model The Pareto/NBD model is based on the following assumptions: i Customers go through two stages in their lifetime with a specific firm: they are alive for some period of time, then become permanently inactive ii While alive, the number of transactions made by a customer follows a Poisson process with transaction rate λ This is equivalent to assuming that the time between transactions is distributed exponential with transaction rate λ, ft j t j 1 λ =λe λt j t j 1, t j >t j 1 > 0, where t j is the time of the jth purchase iii Heterogeneity in transaction rates across customers follows a gamma distribution with shape parameter r and scale parameter α: gλ r, α = αr λ r 1 e λα iv A customer s unobserved lifetime of length ω after which he is viewed as being inactive is exponentially distributed with dropout rate μ: fω μ =μe μω Note: Previous discussions of the Pareto/NBD have used τ to denote the time at which the customer becomes inactive Because of notational conflicts, we now use ω to denote this quantity v Heterogeneity in dropout rates across customers follows a gamma distribution with shape parameter s and scale parameter β gμ s, β = βs μ s 1 e μβ Γs vi The transaction rate λ and the dropout rate μ vary independently across customers We now assume that interpurchase times are distributed according to the with-covariates exponential distribution ft j t j 1 λ 0, γ 1, z 1 =λ 0 expγ 1z 1 e λ 0 expγ 1 z 1t j t j 1 and F t j t j 1 λ 0, γ 1, z 1 =1 e λ 0 expγ 1 z 1t j t j 1, 3

4 with the unobserved heterogeneity in λ 0 captured by a gamma distribution with shape parameter r and scale parameter α 0 : gλ 0 r, α 0 = αr 0 λr 1 0 e λ 0α 0 We also assume that lifetimes are distributed according to the withcovariates exponential distribution fω μ 0, γ 2, z 2 =μ 0 expγ 2z 2 e μ 0 expγ 2 z2ω and F ω μ 0, γ 2, z 2 =1 e μ 0 expγ 2 z2ω, with the unobserved heterogeneity in μ 0 captured by a gamma distribution with shape parameter s and scale parameter β 0 : gμ 0 s, β 0 = βs 0 μs 1 0 e μ 0β 0 Γs Following the logic of the derivation presented in Fader and Hardie 2005, it follows that and Lλ 0, γ 1 z 1,t 1,,t x,t, ω > T =λ x 0 expγ 1z 1 x e λ 0 expγ 1 z 1T Lλ 0, γ 1 z 1,t 1,,t x,t, inactive at ω t x,t] = λ x 0 expγ 1z 1 x e λ 0 expγ 1 z1ω Removing the conditioning on ω yields the following expression for the individual-level likelihood function: Lλ 0,μ 0, γ 1, γ 2 z 1, z 2,x,t x,t = Lλ 0, γ 1 z 1,x,T,ω >TP ω >T μ 0, γ 2, z 2 T { + Lλ 0, γ 1 z 1,x,t x,t,inactive at ω t x,t] t x } fω μ 0, γ 2, z 2 dω = λx 0 expγ 1 z 1 x μ 0 expγ 2 z 2 λ 0 expγ 1 z 1+μ 0 expγ 2 z 2 e λ 0 expγ 1 z 1+μ 0 expγ 2 z 2t x + λ x+1 0 expγ 1 z 1 x+1 λ 0 expγ 1 z 1+μ 0 expγ 2 z 2 e λ 0 expγ 1 z 1+μ 0 expγ 2 z 2T 1 We remove the conditioning on unobserved latent variables λ 0 and μ 0 by taking the expectation of 1 over the distributions of λ 0 and μ 0 To facilitate 4

5 this calculation, we perform the change of variables λ = λ 0 expγ 1 z 1 and μ = μ 0 expγ 2 z 2, which implies gλ r, α = αr λ r 1 e λα gμ s, β = βs μ s 1 e μβ Γs, where α = α 0 exp γ 1z 1, and, where β = β 0 exp γ 2z 2 Note the addition of the addition of minus signs in the exponential terms as we go from multipliers of λ 0,μ 0 to multipliers of α 0,β 0 This gives us Lr, α 0,s, β 0, γ 1, γ 2 z 1, z 2,x,t x,t { λ x μ = 0 0 λ + μ e λ+μtx + λx+1 λ + μ e λ+μt } gλ r, αgμ s, β dλdμ We know from Fader and Hardie 2005 that the solution to this is where Lr, α 0,s, β 0, γ 1, γ 2 z 1, z 2,x,t x,t = Γr + xαr β s { } s r + x A 1 + A 2 r + s + x r + s + x A 1 = 2F 1 r + s + x, s +1;r + s + x +1; α β α+t x α + t x r+s+x if α β 2F 1 r + s + x, r + x; r + s + x +1; β α β+t x β + t x r+s+x if α β 2 3 and A 2 = 2F 1 r + s + x, s; r + s + x +1; α β α+t α + T r+s+x if α β 2F 1 r + s + x, r + x +1;r + s + x +1; β α β+t β + T r+s+x if α β 4 In other words, the likelihood function for the time-invariant-covariates version of the Pareto/NBD model is the likelihood function associated with the basic model where α and β are replaced by α = α 0 exp γ 1 z 1 and β = β 0 exp γ 2 z 2; r and s remain unchanged Suppose we have transaction data for a sample of N customers, where customer i made x i purchases in the period 0,T i ] with the last transaction occurring at t xi, and covariate vectors z 1i and z 2i The sample 5

6 log-likelihood function is given by LLr, α 0,s,β 0, γ 1, γ 2 = N ln [ Lr, α 0,s,β 0, γ 1, γ 2 z 1i, z 2i,x i,t xi,t i ] i=1 In numerically evaluating this function, we note that the α / β condition associated with 2 4 must now be checked for each i For any given α 0,β 0, it may be the case that α>βfor person i while α<βfor person j because of the values of z 1i, z 2i and z 1j, z 2j and γ 1, γ 2 For all related expressions eg, E[Xt],EY t x, t x,t, it is easy to show that we simply replace α and β with α = α 0 exp γ 1 z 1 and β = β 0 exp γ 2 z 2 to arrive at their with-time-invariant-covariates equivalents 4 The Case of the BG/NBD Model The BG/NBD model is based on the following assumptions the first three of which are identical to the corresponding Pareto/NBD assumptions: i Customers go through two stages in their lifetime with a specific firm: they are alive for some period of time, then become permanently inactive ii While alive, the number of transactions made by a customer follows a Poisson process with transaction rate λ This is equivalent to assuming that the time between transactions is distributed exponential with transaction rate λ, ft j t j 1 λ =λe λt j t j 1, t j >t j 1 > 0, where t j is the time of the jth purchase iii Heterogeneity in transaction rates across customers follows a gamma distribution with shape parameter r and scale parameter α: gλ r, α = αr λ r 1 e λα iv After any transaction, a customer becomes inactive with probability p Therefore the point at which the customer drops out is distributed across transactions according to a shifted geometric distribution with pmf P inactive immediately after jth transaction = p1 p j 1, j =1, 2, 3, 6

7 v Heterogeneity in dropout probabilities follows a beta distribution with parameters a and b: fp a, b = pa 1 1 p b 1 Ba, b, 0 p 1 vi The transaction rate λ and the dropout probability p vary independently across customers We now assume that interpurchase times are distributed according to the with-covariates exponential distribution ft j t j 1 λ 0, γ 1, z 1 =λ 0 expγ 1z 1 e λ 0 expγ 1 z 1t j t j 1 and that the unobserved heterogeneity in λ 0 is distributed gamma with shape parameter r and scale parameter α 0 : gλ 0 r, α 0 = αr 0 λr 1 0 e λ 0α 0 Following the logic of the derivation presented in Fader et al 2005, Lλ 0,p,γ 1 z 1,x,t x,t=1 p x λ x 0 expγ 1z 1 x e λ 0 expγ 1 z 1T + δ x>0 p1 p x 1 λ x 0 expγ 1z 1 x e λ 0 expγ 1 z 1t x Taking the expectation of this over the distribution of λ 0 gives us Lr, α 0,p,γ 1 z 1,x,t x,t =1 p x Γr + xα0 r expγ 1 z 1 x α 0 + expγ 1 z 1T r+x + δ x>0 p1 p x 1 Γr + xα0 r expγ 1 z 1 x α 0 + expγ 1 z 1t x r+x =1 p x Γr + x[α 0 exp γ 1 z 1] r α 0 exp γ 1 z 1+T r+x + δ x>0 p1 p x 1 Γr + x[α 0 exp γ 1 z 1] r α 0 exp γ 1 z 5 1+t x r+x Note that we have yet to include the effects of z 2, the vector of timeinvariant covariates that are assumed to explain some of the cross-sectional heterogeneity in the dropout process It is not possible to include the effects of covariates into p and then allow for unobserved heterogeneity using the beta distribution Using the logic of the beta-logistic model Heckman and Willis 1977, Rao and Steckel 1995, we incorporate the effects of time-invariant covariates on the dropout process via the parameters of the beta mixing distribution 7

8 by making a and b functions of z 2 That is, we assume that heterogeneity in the dropout probabilities follows a beta distribution with parameters a 0 exp γ 2 z 2 and b 0 exp γ 3 z 2: fp a 0,b 0,γ 2,γ 3 ; z 2 = pa 0 exp γ 2 z p b 0 exp γ 3 z 2 1 Ba 0 exp γ 2 z 2,b 0 exp γ 3 z 2 Taking the expectation of 5 over this distribution of p gives us Lr, α 0,a 0,b 0,γ 1, γ 2, γ 3 z 1, z 2,x,t x,t { Ba0 exp γ 2 = z 2,b 0 exp γ 3 z 2+x Ba 0 exp γ 2 z 2,b 0 exp γ 3 z 2 Γr + x[α 0 exp γ 1 z 1] r } α 0 exp γ 1 z 1+T r+x { Ba0 exp γ 2 + δ z 2+1,b 0 exp γ 3 z 2+x 1 x>0 Ba 0 exp γ 2 z 2,b 0 exp γ 3 z 2 Γr + x[α 0 exp γ 1 z 1] r } α 0 exp γ 1 z 1+t x r+x In other words, the likelihood function for the time-invariant-covariates version of the BG/NBD model is the likelihood function associated with the basic model where α, a, and b are replaced by α = α 0 exp γ 1 z 1, a = a 0 exp γ 2 z 2, and b = b 0 exp γ 3 z 2; r remains unchanged 1 For all related expressions eg, E[Xt],EY t x, t x,t, it is easy to show that we simply replace α, a, and b with α = α 0 exp γ 1 z 1, a = a 0 exp γ 2 z 2, and b = b 0 exp γ 3 z 2 to arrive at their with-time-invariantcovariates equivalents 1 We note that the BG/NBD has 50% more covariate parameters than the Pareto/NBD While this difference could be removed by making only a a function of covariates ie, set γ 3 to 0, doing so would constrain the way in which the covariates could influence the shape eg, variance of the beta distribution Furthermore, it would be inconsistent with the basic beta-logistic formulation 8

9 References Fader, Peter S and Bruce G S Hardie 2005, A Note on Deriving the Pareto/NBD Model and Related Expressions < Fader, Peter S, Bruce G S Hardie, and Ka Lok Lee 2005, "Counting Your Customers" the Easy Way: An Alternative to the Pareto/NBD Model, Marketing Science, 24 Spring, Heckman, James J and Robert J Willis 1977, A Beta-logistic Model for the Analysis of Sequential Labor Force Participation by Married Women, Journal of Political Economy, 85 February, Rao, Vithala R and Joel H Steckel 1995, Selecting, Evaluating, and Updating Prospects in Direct Marketing, Journal of Direct Marketing, 9 Spring,

Implementing the Pareto/NBD Model Given Interval-Censored Data

Implementing the Pareto/NBD Model Given Interval-Censored Data Implementing the Pareto/NBD Model Given Interval-Censored Data Peter S. Fader www.petefader.com Bruce G. S. Hardie www.brucehardie.com November 2005 Revised August 2010 1 Introduction The Pareto/NBD model

More information

New Perspectives on Customer Death Using a Generalization of the Pareto/NBD Model

New Perspectives on Customer Death Using a Generalization of the Pareto/NBD Model University of Pennsylvania ScholarlyCommons Finance Papers Wharton Faculty Research 9-211 New Perspectives on Customer Death Using a Generalization of the Pareto/NBD Model Kinshuk Jerath Peter S Fader

More information

Modeling Discrete-Time Transactions Using the BG/BB Model

Modeling Discrete-Time Transactions Using the BG/BB Model University of Pennsylvania ScholarlyCommons Wharton Research Scholars Wharton School May 2008 Modeling Discrete-Time Transactions Using the BG/BB Model Harvey Yang Zhang University of Pennsylvania Follow

More information

Applied Probability Models in Marketing Research: Introduction

Applied Probability Models in Marketing Research: Introduction Applied Probability Models in Marketing Research: Introduction (Supplementary Materials for the A/R/T Forum Tutorial) Bruce G. S. Hardie London Business School bhardie@london.edu www.brucehardie.com Peter

More information

Probability Models for Customer-Base Analysis

Probability Models for Customer-Base Analysis Probability Models for Customer-Base Analysis Peter S Fader University of Pennsylvania wwwpetefadercom Bruce G S Hardie London Business School wwwbrucehardiecom Workshop on Customer-Base Analysis Johann

More information

Duration Analysis. Joan Llull

Duration Analysis. Joan Llull Duration Analysis Joan Llull Panel Data and Duration Models Barcelona GSE joan.llull [at] movebarcelona [dot] eu Introduction Duration Analysis 2 Duration analysis Duration data: how long has an individual

More information

Applied Probability Models in Marketing Research: Extensions

Applied Probability Models in Marketing Research: Extensions Applied Probability Models in Marketing Research: Extensions Bruce G S Hardie London Business School bhardie@londonedu Peter S Fader University of Pennsylvania faderp@whartonupennedu 12th Annual Advanced

More information

Modeling Recurrent Events in Panel Data Using Mixed Poisson Models

Modeling Recurrent Events in Panel Data Using Mixed Poisson Models Modeling Recurrent Events in Panel Data Using Mixed Poisson Models V. Savani and A. Zhigljavsky Abstract This paper reviews the applicability of the mixed Poisson process as a model for recurrent events

More information

Things to remember when learning probability distributions:

Things to remember when learning probability distributions: SPECIAL DISTRIBUTIONS Some distributions are special because they are useful They include: Poisson, exponential, Normal (Gaussian), Gamma, geometric, negative binomial, Binomial and hypergeometric distributions

More information

Continuous Distributions

Continuous Distributions A normal distribution and other density functions involving exponential forms play the most important role in probability and statistics. They are related in a certain way, as summarized in a diagram later

More information

Probability Distributions Columns (a) through (d)

Probability Distributions Columns (a) through (d) Discrete Probability Distributions Columns (a) through (d) Probability Mass Distribution Description Notes Notation or Density Function --------------------(PMF or PDF)-------------------- (a) (b) (c)

More information

A Count Data Frontier Model

A Count Data Frontier Model A Count Data Frontier Model This is an incomplete draft. Cite only as a working paper. Richard A. Hofler (rhofler@bus.ucf.edu) David Scrogin Both of the Department of Economics University of Central Florida

More information

Stat410 Probability and Statistics II (F16)

Stat410 Probability and Statistics II (F16) Stat4 Probability and Statistics II (F6 Exponential, Poisson and Gamma Suppose on average every /λ hours, a Stochastic train arrives at the Random station. Further we assume the waiting time between two

More information

Implementing the S BB -G/B Model in MATLAB

Implementing the S BB -G/B Model in MATLAB Implementing the S BB -G/B Model in MATLAB Peter S. Fader www.petefader.com Bruce G. S. Hardie www.brucehardie.com January 2011 This note documents a somewhat informal MATLAB-based implementation of the

More information

IE 336 Seat # Name (one point) < KEY > Closed book. Two pages of hand-written notes, front and back. No calculator. 60 minutes.

IE 336 Seat # Name (one point) < KEY > Closed book. Two pages of hand-written notes, front and back. No calculator. 60 minutes. Closed book. Two pages of hand-written notes, front and back. No calculator. 6 minutes. Cover page and four pages of exam. Four questions. To receive full credit, show enough work to indicate your logic.

More information

Sampling Distributions

Sampling Distributions In statistics, a random sample is a collection of independent and identically distributed (iid) random variables, and a sampling distribution is the distribution of a function of random sample. For example,

More information

Survival Analysis. Lu Tian and Richard Olshen Stanford University

Survival Analysis. Lu Tian and Richard Olshen Stanford University 1 Survival Analysis Lu Tian and Richard Olshen Stanford University 2 Survival Time/ Failure Time/Event Time We will introduce various statistical methods for analyzing survival outcomes What is the survival

More information

Northwestern University Department of Electrical Engineering and Computer Science

Northwestern University Department of Electrical Engineering and Computer Science Northwestern University Department of Electrical Engineering and Computer Science EECS 454: Modeling and Analysis of Communication Networks Spring 2008 Probability Review As discussed in Lecture 1, probability

More information

Step-Stress Models and Associated Inference

Step-Stress Models and Associated Inference Department of Mathematics & Statistics Indian Institute of Technology Kanpur August 19, 2014 Outline Accelerated Life Test 1 Accelerated Life Test 2 3 4 5 6 7 Outline Accelerated Life Test 1 Accelerated

More information

Classical and Bayesian inference

Classical and Bayesian inference Classical and Bayesian inference AMS 132 January 18, 2018 Claudia Wehrhahn (UCSC) Classical and Bayesian inference January 18, 2018 1 / 9 Sampling from a Bernoulli Distribution Theorem (Beta-Bernoulli

More information

Probability Models for Customer-Base Analysis

Probability Models for Customer-Base Analysis Probability Models for Customer-Base Analysis Peter S Fader niversity of Pennsylvania Bruce G S Hardie London Business School 27th Annual Advanced Research Techniques Forum June 26 29, 2016 2016 Peter

More information

p y (1 p) 1 y, y = 0, 1 p Y (y p) = 0, otherwise.

p y (1 p) 1 y, y = 0, 1 p Y (y p) = 0, otherwise. 1. Suppose Y 1, Y 2,..., Y n is an iid sample from a Bernoulli(p) population distribution, where 0 < p < 1 is unknown. The population pmf is p y (1 p) 1 y, y = 0, 1 p Y (y p) = (a) Prove that Y is the

More information

3 Continuous Random Variables

3 Continuous Random Variables Jinguo Lian Math437 Notes January 15, 016 3 Continuous Random Variables Remember that discrete random variables can take only a countable number of possible values. On the other hand, a continuous random

More information

The exponential distribution and the Poisson process

The exponential distribution and the Poisson process The exponential distribution and the Poisson process 1-1 Exponential Distribution: Basic Facts PDF f(t) = { λe λt, t 0 0, t < 0 CDF Pr{T t) = 0 t λe λu du = 1 e λt (t 0) Mean E[T] = 1 λ Variance Var[T]

More information

Random Processes. DS GA 1002 Probability and Statistics for Data Science.

Random Processes. DS GA 1002 Probability and Statistics for Data Science. Random Processes DS GA 1002 Probability and Statistics for Data Science http://www.cims.nyu.edu/~cfgranda/pages/dsga1002_fall17 Carlos Fernandez-Granda Aim Modeling quantities that evolve in time (or space)

More information

GOV 2001/ 1002/ E-2001 Section 10 1 Duration II and Matching

GOV 2001/ 1002/ E-2001 Section 10 1 Duration II and Matching GOV 2001/ 1002/ E-2001 Section 10 1 Duration II and Matching Mayya Komisarchik Harvard University April 13, 2016 1 Heartfelt thanks to all of the Gov 2001 TFs of yesteryear; this section draws heavily

More information

Closed book and notes. 60 minutes. Cover page and four pages of exam. No calculators.

Closed book and notes. 60 minutes. Cover page and four pages of exam. No calculators. IE 230 Seat # Closed book and notes. 60 minutes. Cover page and four pages of exam. No calculators. Score Exam #3a, Spring 2002 Schmeiser Closed book and notes. 60 minutes. 1. True or false. (for each,

More information

Dynamic Models Part 1

Dynamic Models Part 1 Dynamic Models Part 1 Christopher Taber University of Wisconsin December 5, 2016 Survival analysis This is especially useful for variables of interest measured in lengths of time: Length of life after

More information

THE QUEEN S UNIVERSITY OF BELFAST

THE QUEEN S UNIVERSITY OF BELFAST THE QUEEN S UNIVERSITY OF BELFAST 0SOR20 Level 2 Examination Statistics and Operational Research 20 Probability and Distribution Theory Wednesday 4 August 2002 2.30 pm 5.30 pm Examiners { Professor R M

More information

1 Introduction. P (n = 1 red ball drawn) =

1 Introduction. P (n = 1 red ball drawn) = Introduction Exercises and outline solutions. Y has a pack of 4 cards (Ace and Queen of clubs, Ace and Queen of Hearts) from which he deals a random of selection 2 to player X. What is the probability

More information

Unobserved Heterogeneity

Unobserved Heterogeneity Unobserved Heterogeneity Germán Rodríguez grodri@princeton.edu Spring, 21. Revised Spring 25 This unit considers survival models with a random effect representing unobserved heterogeneity of frailty, a

More information

1 Probability and Random Variables

1 Probability and Random Variables 1 Probability and Random Variables The models that you have seen thus far are deterministic models. For any time t, there is a unique solution X(t). On the other hand, stochastic models will result in

More information

Chapter 3 Common Families of Distributions

Chapter 3 Common Families of Distributions Lecture 9 on BST 631: Statistical Theory I Kui Zhang, 9/3/8 and 9/5/8 Review for the previous lecture Definition: Several commonly used discrete distributions, including discrete uniform, hypergeometric,

More information

Sampling Distributions

Sampling Distributions Sampling Distributions In statistics, a random sample is a collection of independent and identically distributed (iid) random variables, and a sampling distribution is the distribution of a function of

More information

Survival Distributions, Hazard Functions, Cumulative Hazards

Survival Distributions, Hazard Functions, Cumulative Hazards BIO 244: Unit 1 Survival Distributions, Hazard Functions, Cumulative Hazards 1.1 Definitions: The goals of this unit are to introduce notation, discuss ways of probabilistically describing the distribution

More information

Midterm Examination. STA 205: Probability and Measure Theory. Wednesday, 2009 Mar 18, 2:50-4:05 pm

Midterm Examination. STA 205: Probability and Measure Theory. Wednesday, 2009 Mar 18, 2:50-4:05 pm Midterm Examination STA 205: Probability and Measure Theory Wednesday, 2009 Mar 18, 2:50-4:05 pm This is a closed-book examination. You may use a single sheet of prepared notes, if you wish, but you may

More information

BMIR Lecture Series on Probability and Statistics Fall, 2015 Uniform Distribution

BMIR Lecture Series on Probability and Statistics Fall, 2015 Uniform Distribution Lecture #5 BMIR Lecture Series on Probability and Statistics Fall, 2015 Department of Biomedical Engineering and Environmental Sciences National Tsing Hua University s 5.1 Definition ( ) A continuous random

More information

FIRST YEAR EXAM Monday May 10, 2010; 9:00 12:00am

FIRST YEAR EXAM Monday May 10, 2010; 9:00 12:00am FIRST YEAR EXAM Monday May 10, 2010; 9:00 12:00am NOTES: PLEASE READ CAREFULLY BEFORE BEGINNING EXAM! 1. Do not write solutions on the exam; please write your solutions on the paper provided. 2. Put the

More information

Continuous Probability Spaces

Continuous Probability Spaces Continuous Probability Spaces Ω is not countable. Outcomes can be any real number or part of an interval of R, e.g. heights, weights and lifetimes. Can not assign probabilities to each outcome and add

More information

Introduction to GSEM in Stata

Introduction to GSEM in Stata Introduction to GSEM in Stata Christopher F Baum ECON 8823: Applied Econometrics Boston College, Spring 2016 Christopher F Baum (BC / DIW) Introduction to GSEM in Stata Boston College, Spring 2016 1 /

More information

Joint Modeling of Longitudinal Item Response Data and Survival

Joint Modeling of Longitudinal Item Response Data and Survival Joint Modeling of Longitudinal Item Response Data and Survival Jean-Paul Fox University of Twente Department of Research Methodology, Measurement and Data Analysis Faculty of Behavioural Sciences Enschede,

More information

HW7 Solutions. f(x) = 0 otherwise. 0 otherwise. The density function looks like this: = 20 if x [10, 90) if x [90, 100]

HW7 Solutions. f(x) = 0 otherwise. 0 otherwise. The density function looks like this: = 20 if x [10, 90) if x [90, 100] HW7 Solutions. 5 pts.) James Bond James Bond, my favorite hero, has again jumped off a plane. The plane is traveling from from base A to base B, distance km apart. Now suppose the plane takes off from

More information

EE/CpE 345. Modeling and Simulation. Fall Class 5 September 30, 2002

EE/CpE 345. Modeling and Simulation. Fall Class 5 September 30, 2002 EE/CpE 345 Modeling and Simulation Class 5 September 30, 2002 Statistical Models in Simulation Real World phenomena of interest Sample phenomena select distribution Probabilistic, not deterministic Model

More information

Chapter 3, 4 Random Variables ENCS Probability and Stochastic Processes. Concordia University

Chapter 3, 4 Random Variables ENCS Probability and Stochastic Processes. Concordia University Chapter 3, 4 Random Variables ENCS6161 - Probability and Stochastic Processes Concordia University ENCS6161 p.1/47 The Notion of a Random Variable A random variable X is a function that assigns a real

More information

MAS3301 / MAS8311 Biostatistics Part II: Survival

MAS3301 / MAS8311 Biostatistics Part II: Survival MAS330 / MAS83 Biostatistics Part II: Survival M. Farrow School of Mathematics and Statistics Newcastle University Semester 2, 2009-0 8 Parametric models 8. Introduction In the last few sections (the KM

More information

ECE353: Probability and Random Processes. Lecture 18 - Stochastic Processes

ECE353: Probability and Random Processes. Lecture 18 - Stochastic Processes ECE353: Probability and Random Processes Lecture 18 - Stochastic Processes Xiao Fu School of Electrical Engineering and Computer Science Oregon State University E-mail: xiao.fu@oregonstate.edu From RV

More information

Part 3: Parametric Models

Part 3: Parametric Models Part 3: Parametric Models Matthew Sperrin and Juhyun Park August 19, 2008 1 Introduction There are three main objectives to this section: 1. To introduce the concepts of probability and random variables.

More information

Slides 8: Statistical Models in Simulation

Slides 8: Statistical Models in Simulation Slides 8: Statistical Models in Simulation Purpose and Overview The world the model-builder sees is probabilistic rather than deterministic: Some statistical model might well describe the variations. An

More information

IE 230 Probability & Statistics in Engineering I. Closed book and notes. 60 minutes.

IE 230 Probability & Statistics in Engineering I. Closed book and notes. 60 minutes. Closed book and notes. 60 minutes. A summary table of some univariate continuous distributions is provided. Four Pages. In this version of the Key, I try to be more complete than necessary to receive full

More information

8. Parametric models in survival analysis General accelerated failure time models for parametric regression

8. Parametric models in survival analysis General accelerated failure time models for parametric regression 8. Parametric models in survival analysis 8.1. General accelerated failure time models for parametric regression The accelerated failure time model Let T be the time to event and x be a vector of covariates.

More information

Mathematical statistics

Mathematical statistics October 1 st, 2018 Lecture 11: Sufficient statistic Where are we? Week 1 Week 2 Week 4 Week 7 Week 10 Week 14 Probability reviews Chapter 6: Statistics and Sampling Distributions Chapter 7: Point Estimation

More information

System Simulation Part II: Mathematical and Statistical Models Chapter 5: Statistical Models

System Simulation Part II: Mathematical and Statistical Models Chapter 5: Statistical Models System Simulation Part II: Mathematical and Statistical Models Chapter 5: Statistical Models Fatih Cavdur fatihcavdur@uludag.edu.tr March 20, 2012 Introduction Introduction The world of the model-builder

More information

Stat 475 Life Contingencies I. Chapter 2: Survival models

Stat 475 Life Contingencies I. Chapter 2: Survival models Stat 475 Life Contingencies I Chapter 2: Survival models The future lifetime random variable Notation We are interested in analyzing and describing the future lifetime of an individual. We use (x) to denote

More information

Design of Optimal Bayesian Reliability Test Plans for a Series System

Design of Optimal Bayesian Reliability Test Plans for a Series System Volume 109 No 9 2016, 125 133 ISSN: 1311-8080 (printed version); ISSN: 1314-3395 (on-line version) url: http://wwwijpameu ijpameu Design of Optimal Bayesian Reliability Test Plans for a Series System P

More information

15 Discrete Distributions

15 Discrete Distributions Lecture Note 6 Special Distributions (Discrete and Continuous) MIT 4.30 Spring 006 Herman Bennett 5 Discrete Distributions We have already seen the binomial distribution and the uniform distribution. 5.

More information

Final. Fall 2016 (Dec 16, 2016) Please copy and write the following statement:

Final. Fall 2016 (Dec 16, 2016) Please copy and write the following statement: ECE 30: Probabilistic Methods in Electrical and Computer Engineering Fall 06 Instructor: Prof. Stanley H. Chan Final Fall 06 (Dec 6, 06) Name: PUID: Please copy and write the following statement: I certify

More information

MATH : EXAM 2 INFO/LOGISTICS/ADVICE

MATH : EXAM 2 INFO/LOGISTICS/ADVICE MATH 3342-004: EXAM 2 INFO/LOGISTICS/ADVICE INFO: WHEN: Friday (03/11) at 10:00am DURATION: 50 mins PROBLEM COUNT: Appropriate for a 50-min exam BONUS COUNT: At least one TOPICS CANDIDATE FOR THE EXAM:

More information

ASM Study Manual for Exam P, First Edition By Dr. Krzysztof M. Ostaszewski, FSA, CFA, MAAA Errata

ASM Study Manual for Exam P, First Edition By Dr. Krzysztof M. Ostaszewski, FSA, CFA, MAAA Errata ASM Study Manual for Exam P, First Edition By Dr. Krzysztof M. Ostaszewski, FSA, CFA, MAAA (krzysio@krzysio.net) Errata Effective July 5, 3, only the latest edition of this manual will have its errata

More information

On Weighted Exponential Distribution and its Length Biased Version

On Weighted Exponential Distribution and its Length Biased Version On Weighted Exponential Distribution and its Length Biased Version Suchismita Das 1 and Debasis Kundu 2 Abstract In this paper we consider the weighted exponential distribution proposed by Gupta and Kundu

More information

Transform Techniques - CF

Transform Techniques - CF Transform Techniques - CF [eview] Moment Generating Function For a real t, the MGF of the random variable is t t M () t E[ e ] e Characteristic Function (CF) k t k For a real ω, the characteristic function

More information

Algorithms for Uncertainty Quantification

Algorithms for Uncertainty Quantification Algorithms for Uncertainty Quantification Tobias Neckel, Ionuț-Gabriel Farcaș Lehrstuhl Informatik V Summer Semester 2017 Lecture 2: Repetition of probability theory and statistics Example: coin flip Example

More information

Lecture 2: Repetition of probability theory and statistics

Lecture 2: Repetition of probability theory and statistics Algorithms for Uncertainty Quantification SS8, IN2345 Tobias Neckel Scientific Computing in Computer Science TUM Lecture 2: Repetition of probability theory and statistics Concept of Building Block: Prerequisites:

More information

Stat 512 Homework key 2

Stat 512 Homework key 2 Stat 51 Homework key October 4, 015 REGULAR PROBLEMS 1 Suppose continuous random variable X belongs to the family of all distributions having a linear probability density function (pdf) over the interval

More information

Qualifying Exam CS 661: System Simulation Summer 2013 Prof. Marvin K. Nakayama

Qualifying Exam CS 661: System Simulation Summer 2013 Prof. Marvin K. Nakayama Qualifying Exam CS 661: System Simulation Summer 2013 Prof. Marvin K. Nakayama Instructions This exam has 7 pages in total, numbered 1 to 7. Make sure your exam has all the pages. This exam will be 2 hours

More information

Modeling Binary Outcomes: Logit and Probit Models

Modeling Binary Outcomes: Logit and Probit Models Modeling Binary Outcomes: Logit and Probit Models Eric Zivot December 5, 2009 Motivating Example: Women s labor force participation y i = 1 if married woman is in labor force = 0 otherwise x i k 1 = observed

More information

Analysis of Gamma and Weibull Lifetime Data under a General Censoring Scheme and in the presence of Covariates

Analysis of Gamma and Weibull Lifetime Data under a General Censoring Scheme and in the presence of Covariates Communications in Statistics - Theory and Methods ISSN: 0361-0926 (Print) 1532-415X (Online) Journal homepage: http://www.tandfonline.com/loi/lsta20 Analysis of Gamma and Weibull Lifetime Data under a

More information

2 Continuous Random Variables and their Distributions

2 Continuous Random Variables and their Distributions Name: Discussion-5 1 Introduction - Continuous random variables have a range in the form of Interval on the real number line. Union of non-overlapping intervals on real line. - We also know that for any

More information

STAT2201. Analysis of Engineering & Scientific Data. Unit 3

STAT2201. Analysis of Engineering & Scientific Data. Unit 3 STAT2201 Analysis of Engineering & Scientific Data Unit 3 Slava Vaisman The University of Queensland School of Mathematics and Physics What we learned in Unit 2 (1) We defined a sample space of a random

More information

First Year Examination Department of Statistics, University of Florida

First Year Examination Department of Statistics, University of Florida First Year Examination Department of Statistics, University of Florida August 20, 2009, 8:00 am - 2:00 noon Instructions:. You have four hours to answer questions in this examination. 2. You must show

More information

CIMAT Taller de Modelos de Capture y Recaptura Known Fate Survival Analysis

CIMAT Taller de Modelos de Capture y Recaptura Known Fate Survival Analysis CIMAT Taller de Modelos de Capture y Recaptura 2010 Known Fate urvival Analysis B D BALANCE MODEL implest population model N = λ t+ 1 N t Deeper understanding of dynamics can be gained by identifying variation

More information

Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis"

Ninth ARTNeT Capacity Building Workshop for Trade Research Trade Flows and Trade Policy Analysis Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis" June 2013 Bangkok, Thailand Cosimo Beverelli and Rainer Lanz (World Trade Organization) 1 Selected econometric

More information

Probability and Distributions

Probability and Distributions Probability and Distributions What is a statistical model? A statistical model is a set of assumptions by which the hypothetical population distribution of data is inferred. It is typically postulated

More information

Exponential Distribution and Poisson Process

Exponential Distribution and Poisson Process Exponential Distribution and Poisson Process Stochastic Processes - Lecture Notes Fatih Cavdur to accompany Introduction to Probability Models by Sheldon M. Ross Fall 215 Outline Introduction Exponential

More information

Random variables. DS GA 1002 Probability and Statistics for Data Science.

Random variables. DS GA 1002 Probability and Statistics for Data Science. Random variables DS GA 1002 Probability and Statistics for Data Science http://www.cims.nyu.edu/~cfgranda/pages/dsga1002_fall17 Carlos Fernandez-Granda Motivation Random variables model numerical quantities

More information

Continuous Random Variables

Continuous Random Variables Continuous Random Variables Recall: For discrete random variables, only a finite or countably infinite number of possible values with positive probability. Often, there is interest in random variables

More information

Review for the previous lecture

Review for the previous lecture Lecture 1 and 13 on BST 631: Statistical Theory I Kui Zhang, 09/8/006 Review for the previous lecture Definition: Several discrete distributions, including discrete uniform, hypergeometric, Bernoulli,

More information

Statistical Inference and Methods

Statistical Inference and Methods Department of Mathematics Imperial College London d.stephens@imperial.ac.uk http://stats.ma.ic.ac.uk/ das01/ 31st January 2006 Part VI Session 6: Filtering and Time to Event Data Session 6: Filtering and

More information

Topic 4: Continuous random variables

Topic 4: Continuous random variables Topic 4: Continuous random variables Course 003, 2018 Page 0 Continuous random variables Definition (Continuous random variable): An r.v. X has a continuous distribution if there exists a non-negative

More information

Practice Exam #1 CAS Exam 3L

Practice Exam #1 CAS Exam 3L Practice Exam #1 CAS Exam 3L These practice exams should be used during the last month prior to your exam. This practice exam contains 25 questions, of equal value, corresponding to an exam of about 2.5

More information

Probability Midterm Exam 2:15-3:30 pm Thursday, 21 October 1999

Probability Midterm Exam 2:15-3:30 pm Thursday, 21 October 1999 Name: 2:15-3:30 pm Thursday, 21 October 1999 You may use a calculator and your own notes but may not consult your books or neighbors. Please show your work for partial credit, and circle your answers.

More information

CTDL-Positive Stable Frailty Model

CTDL-Positive Stable Frailty Model CTDL-Positive Stable Frailty Model M. Blagojevic 1, G. MacKenzie 2 1 Department of Mathematics, Keele University, Staffordshire ST5 5BG,UK and 2 Centre of Biostatistics, University of Limerick, Ireland

More information

ECON 721: Lecture Notes on Duration Analysis. Petra E. Todd

ECON 721: Lecture Notes on Duration Analysis. Petra E. Todd ECON 721: Lecture Notes on Duration Analysis Petra E. Todd Fall, 213 2 Contents 1 Two state Model, possible non-stationary 1 1.1 Hazard function.......................... 1 1.2 Examples.............................

More information

11 Survival Analysis and Empirical Likelihood

11 Survival Analysis and Empirical Likelihood 11 Survival Analysis and Empirical Likelihood The first paper of empirical likelihood is actually about confidence intervals with the Kaplan-Meier estimator (Thomas and Grunkmeier 1979), i.e. deals with

More information

Ph.D. Qualifying Exam Friday Saturday, January 6 7, 2017

Ph.D. Qualifying Exam Friday Saturday, January 6 7, 2017 Ph.D. Qualifying Exam Friday Saturday, January 6 7, 2017 Put your solution to each problem on a separate sheet of paper. Problem 1. (5106) Let X 1, X 2,, X n be a sequence of i.i.d. observations from a

More information

Key Words: survival analysis; bathtub hazard; accelerated failure time (AFT) regression; power-law distribution.

Key Words: survival analysis; bathtub hazard; accelerated failure time (AFT) regression; power-law distribution. POWER-LAW ADJUSTED SURVIVAL MODELS William J. Reed Department of Mathematics & Statistics University of Victoria PO Box 3060 STN CSC Victoria, B.C. Canada V8W 3R4 reed@math.uvic.ca Key Words: survival

More information

Review 1: STAT Mark Carpenter, Ph.D. Professor of Statistics Department of Mathematics and Statistics. August 25, 2015

Review 1: STAT Mark Carpenter, Ph.D. Professor of Statistics Department of Mathematics and Statistics. August 25, 2015 Review : STAT 36 Mark Carpenter, Ph.D. Professor of Statistics Department of Mathematics and Statistics August 25, 25 Support of a Random Variable The support of a random variable, which is usually denoted

More information

Ching-Han Hsu, BMES, National Tsing Hua University c 2015 by Ching-Han Hsu, Ph.D., BMIR Lab. = a + b 2. b a. x a b a = 12

Ching-Han Hsu, BMES, National Tsing Hua University c 2015 by Ching-Han Hsu, Ph.D., BMIR Lab. = a + b 2. b a. x a b a = 12 Lecture 5 Continuous Random Variables BMIR Lecture Series in Probability and Statistics Ching-Han Hsu, BMES, National Tsing Hua University c 215 by Ching-Han Hsu, Ph.D., BMIR Lab 5.1 1 Uniform Distribution

More information

Department of Mathematics

Department of Mathematics Department of Mathematics Ma 3/13 KC Border Introduction to Probability and Statistics Winter 217 Lecture 13: The Poisson Process Relevant textbook passages: Sections 2.4,3.8, 4.2 Larsen Marx [8]: Sections

More information

Graduate Econometrics I: What is econometrics?

Graduate Econometrics I: What is econometrics? Graduate Econometrics I: What is econometrics? Yves Dominicy Université libre de Bruxelles Solvay Brussels School of Economics and Management ECARES Yves Dominicy Graduate Econometrics I: What is econometrics?

More information

Augmented Statistical Models for Speech Recognition

Augmented Statistical Models for Speech Recognition Augmented Statistical Models for Speech Recognition Mark Gales & Martin Layton 31 August 2005 Trajectory Models For Speech Processing Workshop Overview Dependency Modelling in Speech Recognition: latent

More information

Final Exam # 3. Sta 230: Probability. December 16, 2012

Final Exam # 3. Sta 230: Probability. December 16, 2012 Final Exam # 3 Sta 230: Probability December 16, 2012 This is a closed-book exam so do not refer to your notes, the text, or any other books (please put them on the floor). You may use the extra sheets

More information

PROBABILITY DISTRIBUTIONS

PROBABILITY DISTRIBUTIONS Review of PROBABILITY DISTRIBUTIONS Hideaki Shimazaki, Ph.D. http://goo.gl/visng Poisson process 1 Probability distribution Probability that a (continuous) random variable X is in (x,x+dx). ( ) P x < X

More information

Two hours. To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER.

Two hours. To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER. Two hours MATH38181 To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER EXTREME VALUES AND FINANCIAL RISK Examiner: Answer any FOUR

More information

Random Variables. Definition: A random variable (r.v.) X on the probability space (Ω, F, P) is a mapping

Random Variables. Definition: A random variable (r.v.) X on the probability space (Ω, F, P) is a mapping Random Variables Example: We roll a fair die 6 times. Suppose we are interested in the number of 5 s in the 6 rolls. Let X = number of 5 s. Then X could be 0, 1, 2, 3, 4, 5, 6. X = 0 corresponds to the

More information

Introduction: exponential family, conjugacy, and sufficiency (9/2/13)

Introduction: exponential family, conjugacy, and sufficiency (9/2/13) STA56: Probabilistic machine learning Introduction: exponential family, conjugacy, and sufficiency 9/2/3 Lecturer: Barbara Engelhardt Scribes: Melissa Dalis, Abhinandan Nath, Abhishek Dubey, Xin Zhou Review

More information

Delta Method. Example : Method of Moments for Exponential Distribution. f(x; λ) = λe λx I(x > 0)

Delta Method. Example : Method of Moments for Exponential Distribution. f(x; λ) = λe λx I(x > 0) Delta Method Often estimators are functions of other random variables, for example in the method of moments. These functions of random variables can sometimes inherit a normal approximation from the underlying

More information

STAT331. Cox s Proportional Hazards Model

STAT331. Cox s Proportional Hazards Model STAT331 Cox s Proportional Hazards Model In this unit we introduce Cox s proportional hazards (Cox s PH) model, give a heuristic development of the partial likelihood function, and discuss adaptations

More information

Chapter 5. Statistical Models in Simulations 5.1. Prof. Dr. Mesut Güneş Ch. 5 Statistical Models in Simulations

Chapter 5. Statistical Models in Simulations 5.1. Prof. Dr. Mesut Güneş Ch. 5 Statistical Models in Simulations Chapter 5 Statistical Models in Simulations 5.1 Contents Basic Probability Theory Concepts Discrete Distributions Continuous Distributions Poisson Process Empirical Distributions Useful Statistical Models

More information

1 Appendix A: Matrix Algebra

1 Appendix A: Matrix Algebra Appendix A: Matrix Algebra. Definitions Matrix A =[ ]=[A] Symmetric matrix: = for all and Diagonal matrix: 6=0if = but =0if 6= Scalar matrix: the diagonal matrix of = Identity matrix: the scalar matrix

More information

STAT J535: Chapter 5: Classes of Bayesian Priors

STAT J535: Chapter 5: Classes of Bayesian Priors STAT J535: Chapter 5: Classes of Bayesian Priors David B. Hitchcock E-Mail: hitchcock@stat.sc.edu Spring 2012 The Bayesian Prior A prior distribution must be specified in a Bayesian analysis. The choice

More information