Applied Bayesian Nonparametric Mixture Modeling Session 1 Introduction/Dirichlet process priors

Size: px
Start display at page:

Download "Applied Bayesian Nonparametric Mixture Modeling Session 1 Introduction/Dirichlet process priors"

Transcription

1 Applied Bayesian Nnparametric Mixture Mdeling Sessin 1 Intrductin/Dirichlet prcess prirs Athanasis Kttas (thans@ams.ucsc.edu) Abel Rdriguez (abel@ams.ucsc.edu) University f Califrnia, Santa Cruz 2014 ISBA Wrld Meeting Cancún, Mexic Sunday July 13, 2014

2 Outline 1 Intrductin and mtivatin 2 Definitin f the Dirichlet prcess 3 Cnstructive definitin f the Dirichlet prcess 4 Pólya urn representatin f the Dirichlet prcess 5 Psterir inference 6 Applicatins

3 Mtivating example: ROC curves fr bimarker evaluatin Individuals are drawn frm ne f tw ppulatins (call them Healthy and Diseased). We measure a univariate bimarker Y in rder t diagnse a disease (e.g., prstate-specific antigen PSA t detect prstate cancer). Individuals with an elevated bimarker (i.e., Y > t fr sme threshld t) are declared Diseased. Sensitivity (r true psitive rate) Specificity (r true negative rate) We assume that a gld-standard is available: disease status can be assumed knwn fr the training set used t estimate the reliability f the diagnstic test.

4 Mtivating example: ROC curves fr bimarker evaluatin Let F H and F D be the c.d.f.s f the bimarker fr the healthy and diseased grups. FPR(t) = 1 F H (t) = F H (t). FNR(t) = F D (t) = 1 F D (t). Selecting t implies a tradeff between FPR and FNR! Healthy Diseased FNR FPR Bimarker

5 Mtivating example: ROC curves fr bimarker evaluatin That tradeff is captured by the ROC curve, implicitly defined as {(FNR(t), TPR(t)) : t (, )} Alternatively ROC(u) = 1 F D (F 1 (1 u)) fr u [0, 1]. H Hence, estimating the ROC curve requires estimatin f tw distributin functins. TPR FPR

6 Mtivating example: ROC curves fr bimarker evaluatin Bld PSA levels fr 454 healthy and 229 diseased males, with ages ranging frm 47 t 81 years (Etzini et al., 1999). Healthy Diseased Density Density Lg ttal PSA Lg ttal PSA Tw simple (frequentist) analyses fr the data invlve Parametric: Assume nrmality f lgscres (skewness?). Nnparametric: Estimate c.d.f.s thrugh the empirical c.d.f.s.

7 Mtivating example: ROC curves fr bimarker evaluatin ROC curves and btstrap-based pintwise cnfidence intervals fr bth simple analyses. Sensitivity Sensitivity Sensitivity Specificity Specificity Specificity As a Bayesian, btaining an analg t the left-hand picture is easy. But hw d we get a picture that is analgus t the ne n the center?

8 Parametric vs. nnparametric Bayes: A simple example Let y i Y, y i F iid F, F F, F = {N(y µ, τ 2 ), µ R, τ R + }. In this parametric specificatin a prir n F bils dwn t a prir n (µ, τ 2 ). Hwever, F is tiny cmpared t F = {All distributins n Y}. Nnparametric Bayes invlves prirs n much larger subsets f F (infinitedimensinal spaces). One handy way t d this is t use stchastic prcesses.

9 Bayesian nnparametrics Bayesian nnparametrics prirs n spaces f functins, {g( ) : g G} (infinite dimensinal spaces). An xymrn? Even thugh we fcus n distributin functins, the methds are much mre widely useful: hazard r cumulative hazard functin, link functin, calibratin functin... Mre generally, enriching usual parametric mdels, typically leading t semiparametric mdels. Wandering nnparametrically near a standard class. What makes a nnparametric mdel gd? (e.g., Fergusn, 1973) The mdel shuld be tractable, i.e., it shuld be easily cmputed, either analytically r thrugh simulatins. The mdel shuld be rich, in the sense f having large supprt. The hyperparameters in the mdel shuld be easily interpretable.

10 Sme references General review papers n Bayesian nnparametrics: Walker, Damien, Laud and Smith (1999); Müller and Quintana (2004); Hansn, Branscum and Jhnsn (2005). Review papers n specific applicatin areas f Bayesian nnparametric and semiparametric methds: Hjrt (1996); Sinha and Dey (1997); Gelfand (1999). Bks: Dey, Müller and Sinha (1998) (edited vlume with a cllectin f papers, mainly, n applicatins f Bayesian nnparametrics); Ghsh and Ramamrthi (2003) (emphasis n theretical develpment f Bayesian nnparametric prirs); Hjrt, Hlmes, Müller and Walker (2010) (edited vlume with an verview f thery, methds, and sme applicatins f Bayesian nnparametrics); Müller and Rdriguez (2013) (lecture ntes frm a week-lng curse at UCSC). Sftware: functins fr nnparametric Bayesian inference are spread ver varius R packages. Only cmprehensive package we are aware f is the DPpackage.

11 The Dirichlet prcess as a mdel fr randm distributins A Bayesian nnparametric apprach t mdeling, say, distributin functins, requires prirs fr spaces f distributin functins. Frmally, it requires stchastic prcesses with sample paths that are distributin functins defined n an apprpriate sample space X (e.g., X = R, r R +, r R d ), equipped with a σ-field B f subsets f X (e.g., the Brel σ-field fr X R d ) The Dirichlet prcess (DP), anticipated in the wrk f Freedman (1963) and Fabius (1964), and frmally develped by Fergusn (1973, 1974), is the first prir defined fr spaces f distributin functins. The DP is, frmally, a (randm) prbability measure n the space f prbability measures (distributins) n (X, B) Hence, the DP generates randm distributins n (X, B), and thus, fr X R d, equivalently, randm c.d.f.s n X

12 Mtivating the cnstructin f the Dirichlet prcess Suppse yu are dealing with a sample space with nly tw utcmes, say, X = {0, 1} and yu are interested in estimating x, the prbability f bserving 1. A natural prir fr x is a beta distributin, p(x) = Γ(a + b) Γ(a)Γ(b) x a 1 (1 x) b 1, 0 x 1. Mre generally, if X is finite with q elements, the prbability distributin ver X is given by q numbers x 1,..., x q such that q i=1 x i = 1. A natural prir fr (x 1,..., x q ), which generalizes the Beta distributin, is the Dirichlet distributin (see the next tw slides). With the Dirichlet prcess we further generalize t infinite but cuntable spaces.

13 Prperties f the Dirichlet distributin Start with independent randm variables with a j > 0. Define Z j Gam(a j, 1), j = 1,..., k, Y j = Z j k l=1 Z, j = 1,..., k. l Then (Y 1,..., Y k ) Dirichlet(a 1,..., a k ). This distributin is singular w.r.t. Lebesgue measure n R k, since k j=1 Y j = 1.

14 Prperties f the Dirichlet distributin (Y 1,..., Y k 1 ) has density ( k ) Γ j=1 a ( j k j=1 Γ (a 1 k 1 j) j=1 y j ) ak 1 k 1 j=1 Nte that fr k = 2, Dirichlet(a 1, a 2 ) Beta(a 1, a 2 ). The mments f the Dirichlet distributin are: E(Y j ) = y a j 1 j. a j k l=1 a, E(Y 2 a j (a j + 1) j ) = k l l=1 a l(1 + k l=1 a l), a i a j E(Y i Y j ) = k l=1 a l(1 + k l=1 a l). We can think abut the Dirichlet as having tw parameters: g = {a j / k l=1 a l}, the mean vectr. α = k l=1 a l, a cncentratin parameter cntrlling its variance.

15 Definitin f the Dirichlet prcess The DP is characterized by tw parameters: A psitive scalar parameter α. A specified prbability measure n (X, B), Q 0 (r, equivalently, a distributin functin n X, G 0). DEFINITION (Fergusn, 1973): The DP generates randm prbability measures (randm distributins) Q n (X, B) such that fr any finite measurable partitin B 1,...,B k f X, (Q(B 1 ),..., Q(B k )) Dirichlet(αQ 0 (B 1 ),..., αq 0 (B k )). Here, Q(B i ) (a randm variable) and Q 0(B i ) (a cnstant) dente the prbability f set B i under Q and Q 0, respectively. Als, the B i, i = 1,..., k, define a measurable partitin if B i B, they are pairwise disjint, and their unin is X.

16 Interpreting the parameters f the Dirichlet prcess Fr any measurable subset B f X, we have frm the definitin that Q(B) beta(αq 0 (B), αq 0 (B c )), and thus E {Q(B)} = Q 0 (B), Var {Q(B)} = Q 0(B){1 Q 0 (B)} α + 1 Q 0 plays the rle f the center f the DP (als referred t as baseline prbability measure, r baseline distributin). α can be viewed as a precisin parameter: fr large α there is small variability in DP realizatins; the larger α is, the clser we expect a realizatin Q frm the prcess t be t Q 0. See Fergusn (1973) fr the rle f Q 0 n mre technical prperties f the DP (e.g., Fergusn shws that the supprt f the DP cntains all prbability measures n (X, B) that are abslutely cntinuus w.r.t. Q 0 ).

17 Interpreting the parameters f the Dirichlet prcess Analgus definitin fr the randm distributin functin G n X generated frm a DP with parameters α and G 0, a specified distributin functin n X Fr example, with X = R, B = (, x], x R, and Q(B) = G(x), and thus G(x) beta(αg 0 (x), α{1 G 0 (x)}), E {G(x)} = G 0 (x), Var {G(x)} = G 0(x){1 G 0 (x)}. α + 1 Ntatin: depending n the cntext, G will dente either the randm prbability measure r the randm distributin functin. G DP(α, G 0 ) will indicate that a DP prir is placed n G.

18 Simulating c.d.f. realizatins frm a Dirichlet prcess Cnsider any grid f pints x 1 < x 2 <... < x k in X R. Then, the randm vectr (G(x 1), G(x 2) G(x 1),..., G(x k ) G(x k 1 ), 1 G(x k )) fllws a Dirichlet distributin with parameter vectr (αg 0(x 1), α(g 0(x 2) G 0(x 1)),..., α(g 0(x k ) G 0(x k 1 )), α(1 G 0(x k ))) Hence, if (u 1, u 2,..., u k ) is a draw frm this Dirichlet distributin, then (u 1,..., i j=1 u j,..., k j=1 u j) is a draw frm the distributin f (G(x 1 ),..., G(x i ),..., G(x k )).

19 Simulating c.d.f. realizatins frm a Dirichlet prcess library(mcmcpack) alpha = 1 x = seq(0.005, 0.995, by=0.005) k = length(x) dg0 = punif(x[-1],0,1)-punif(x[-k],0,1) dg = rdirichlet(1, alpha*dg0) G = cumsum(dg) Distributin x

20 Simulating c.d.f. realizatins frm a Dirichlet prcess α = 0.1 α = 10 α = 100 Distributin Distributin Distributin x x x Figure 1.1: Realizatins frm a Dirichlet prcess with different values fr the precisin parameter. The slid black line crrespnds t the baseline measure (a unifrm n the interval [0, 1] in this case), while the dashed clred lines represent multiple realizatins.

21 Cnstructive definitin f the DP Due t Sethuraman and Tiwari (1982) and Sethuraman (1994). Let {z r : r = 1, 2,...} and {ϑ l : l = 1, 2,...} be independent sequences f i.i.d. randm variables z r beta(1, α), r = 1, 2,... ϑ l G 0, l = 1, 2,... Define ω 1 = z 1 and ω l = z l l 1 r=1 (1 z r ), fr l = 2, 3,... Then, a realizatin G frm DP(α, G 0 ) is (almst surely) f the frm G( ) = ω l δ ϑl ( ) l=1 where δ z ( ) dentes a pint mass at z. Nte this definitin implies l=1 ω l = 1 (almst surely).

22 Mre n the cnstructive definitin f the DP The DP generates distributins that have an (almst sure) representatin as cuntable mixtures f pint masses: The lcatins ϑ l are i.i.d. draws frm the base distributin. The assciated weights ω l are defined using the stick-breaking (smetimes called residual allcatin) cnstructin. This is nt as restrictive as it might sund: Any distributin n R d can be apprximated arbitrarily well using a cuntable mixture f pint masses. The realizatins we shwed befre already hinted at this fact.

23 The stick-breaking cnstructin Start with a stick f length 1 (representing the ttal prbability t be distributed amng the different atms). Draw a randm z 1 beta(1, α), which defines the prtin f the riginal stick assigned t atm 1, s that ω 1 = z 1 then, the remaining part f the stick has length 1 z 1. Draw a randm z 2 beta(1, α) (independently f z 1 ), which defines the prtin f the remaining stick assigned t atm 2, therefre, ω 2 = z 2 (1 z 1 ) nw, the remaining part f the stick has length (1 z 2 )(1 z 1 ). Cntinue ad infinitum... We dente the jint distributin n the vectr f weights by (ω 1, ω 2,...) SB(α).

24 The stick-breaking cnstructin

25 Mre n the cnstructive definitin f the DP The DP cnstructive definitin yields anther methd t simulate frm DP prirs in fact, it prvides (up t a truncatin apprximatin) the entire distributin G, nt just c.d.f. sample paths. Fr example, a pssible apprximatin is G J = J p j δ ϑj, with p j = ω j fr j = 1,..., J 1, and p J = 1 J 1 j=1 ω j = J 1 r=1 (1 z r ). T specify J, nte that J J E = 1 E(1 z r ) = 1 j=1 ω j r=1 J r=1 j=1 ( ) J α α α + 1 = 1 α + 1 Hence, J culd be chsen such that (α/(α + 1)) J = ε, fr small ε.

26 Mre n the cnstructive definitin f the DP w P(X<x) x x Figure 1.2: Illustratin fr a DP with G 0 = N(0, 1) and α = 20. In the left panel, the spiked lines are lcated at 1000 N(0, 1) draws with heights given by the (truncated) stick-breaking weights. These spikes are then summed t generate ne c.d.f. sample path. The right panel shws 8 such sample paths indicated by the lighter jagged lines. The heavy smth line indicates the N(0, 1) c.d.f.

27 Generalizing the DP Many randm prbability measures can be defined by means f a stick-breaking cnstructin the z r are drawn independently frm a distributin n [0, 1]. Fr example, the Beta tw-parameter prcess (Ishwaran and Zarepur, 2000) is defined by chsing z r beta(a, b). If z r beta(1 a, b + ra), fr r = 1, 2,... and sme a [0, 1) and b ( a, ) we btain the tw-parameter Pissn-Dirichlet prcess (e.g., Pitman and Yr, 1997). The general case, z r beta(a r, b r ) (Ishwaran and James, 2001). The prbit stick-breaking prcess, where z r = Φ(x r ) with x r N(µ, σ 2 ) and Φ denting the standard nrmal c.d.f. (Rdríguez and Dunsn, 2011).

28 Further extensins based n the DP cnstructive definitin The cnstructive definitin f the DP has mtivated several f its extensins, including: ɛ-dp (Muliere and Tardella, 1998), generalized DPs (Hjrt, 2000); general stick-breaking prirs (Ishwaran and James, 2001). Dependent DP prirs (MacEachern, 1999, 2000; De Iri et al., 2004; Griffin and Steel, 2006). Hierarchical DPs (Tmlinsn and Escbar, 1999; Teh et al., 2006). Spatial DPs mdels (Gelfand, Kttas and MacEachern, 2005; Kttas, Duan and Gelfand, 2008; Duan, Guindani and Gelfand, 2007). Nested DPs (Rdriguez, Dunsn and Gelfand, 2008).

29 Pólya urn characterizatin f the DP If, fr i = 1,..., n, x i G are i.i.d. frm G, and G DP(α, G 0 ), the jint distributin fr the x i induced by marginalizing G ver its DP prir is given by n α p(x 1,..., x n ) = G 0 (x 1 ) α + i 1 G 1 i 1 0(x i ) + δ xj (x i ) α + i 1 i=2 (Blackwell and MacQueen, 1973). That is, the sequence f the x i fllws a generalized Pólya urn scheme such that: x 1 G 0, and fr any i = 2,..., n, x i x 1,..., x i 1 fllws the mixed distributin that places pint mass (α + i 1) 1 at x j, j = 1,..., i 1, and cntinuus mass α(α + i 1) 1 n G 0. j=1

30 The Chinese restaurant prcess The Pólya urn characterizatin f the DP can be visualized using the Chinese restaurant analgy: A custmer arriving at the restaurant jins a table that already has sme custmers, with prbability prprtinal t the number f peple in the table, r takes the first seat at a new table with prbability prprtinal t α. All custmers sitting in the same table share a dish.

31 Prir t psterir updating with DP prirs Let G dente the randm distributin functin fr the fllwing results. Fergusn (1973) has shwn that if the bservatins y i G are i.i.d. frm G, i = 1,..., n, and G DP(α, G 0 ), then the psterir distributin f G is a DP( α, G 0 ), with α = α + n, and G 0 (t) = α α + n G 0(t) + 1 α + n n 1 [yi, )(t) i=1 Hence, the DP is a cnjugate prir. All the results and prperties develped fr DPs can be used directly fr the psterir distributin f G.

32 Prir t psterir updating with DP prirs Fr example, the psterir mean estimate fr G(t), E {G(t) y 1,..., y n } = α α + n G 0(t) + n α + n G n(t) where G n (t) = n 1 n i=1 1 [y i, )(t) is the empirical distributin functin f the data (the standard classical nnparametric estimatr). Fr small α relative t n, little weight is placed n the prir guess G 0. Fr large α relative t n, little weight is placed n the data. Hence, α can be viewed as a measure f faith in the prir guess G 0 measured in units f number f bservatins (thus, α = 1 indicates strength f belief in G 0 wrth ne bservatin). Hwever, taking α very small in rder t be nninfrmative is very dangerus.

33 C.d.f. estimatin using DP prirs α = 5, n = 10 α = 5, n = 50 F(x) True Baseline E(F) ECDF Psterir E(F y) F(x) True Baseline E(F) ECDF Psterir E(F y) x x Figure 1.3: Estimating the distributin functin under a DP prir, using simulated data. Bth the true distributin generating the data and the baseline distributin are Gaussian. The left panel crrespnds t a sample f n = 10 bservatins while the right panel crrespnds t a sample f n = 50 bservatins.

34 A Bayesian nnparametric apprach t ROC estimatin Let y i,h F H F H fr i = 1,..., n H, and y j,d F D F D fr j = 1,..., n D. Assign independent Dirichlet prcess prirs t each F H and F D. F H DP(α H, G 0,H ), F D DP(α D, G 0,D ). (Neither α H and α D nr G 0,H and G 0,D need t be the same). Questins abut the PSA example: Hw wuld yu chse the baseline measure? Hw wuld yu assess the impact f these chices in yur analysis?

35 A Bayesian nnparametric apprach t ROC estimatin Hw wuld yu chse the baseline measure? In this case, it wuld make sense t take G 0,D = G 0,H = N(0, 1) (bth frm ur knwledge f cmmn PSA levels and frm ur descriptive analysis f the data). It is less clear hw t chse α D and α H. Maybe α D = α H = 5? Recall the equivalent sample size interpretatin! Hw wuld yu assess the impact f these chices in yur analysis? We can simulate frm this prir using the truncatin apprximatin. Since G 0,D = G 0,H, the mean ROC curve is a diagnal line (ignring reflectins). Larger values f α H and α D imply a prir mre cncentrated arund this mean.

36 A Bayesian nnparametric apprach t ROC estimatin Psterir distributins ( F H {y i,h } DP F D {y j,d } DP ( α H + n H, α D + n D, α H α H + n H G 0,H + α D α D + n D G 0,D + 1 α H + n H 1 α D + n D n H 1 [yi,h, ) i=1 n D 1 [yj,d, ) j=1 ) ) Inferences abut the ROC curve can be btained by sampling psterir realizatins f F H and F D using a truncatin. Nte that E(ROC(u) data) 1 F D ( F 1 (1 u))! H Because sample sizes are fairly large, results are similar t thse frm the (frequentist) nnparametric mdel we presented in the intrductin.

37 Dse-respnse mdeling with Dirichlet prcess prirs Study ptency f a stimulus by administering it at k dse levels t a number f subjects at each level. x i : dse levels (with x 1 < x 2 <... < x k ). n i : number f subjects at dse level i. y i : number f psitive respnses at dse level i. F (x) = Pr(psitive respnse at dse level x) (i.e., the ptency f level x f the stimulus). F is referred t as the ptency curve, r dse-respnse curve, r tlerance distributin. Standard assumptin in biassay settings: the prbability f a psitive respnse increases with the dse level, i.e., F is a nn-decreasing functin, i.e., F can be mdeled as a c.d.f. n X R.

38 Dse-respnse mdeling with Dirichlet prcess prirs Questins f interest: Inference fr F (x) fr specified dse levels x. Inference fr unbserved dse level x 0 such that F (x 0) = γ fr specified γ (0, 1). Optimal selectin f {x i, n i } t best accmplish gals 1 and 2 abve (design prblem). Parametric mdeling: F is assumed t be a member f a parametric family f c.d.f.s (e.g., lgit, r prbit mdels). Bayesian nnparametric mdeling: uses a nnparametric prir fr the infinite dimensinal parameter F, i.e., a prir fr the space f c.d.f.s n X wrk based n a DP prir fr F : Antniak (1974), Bhattacharya (1981), Disch (1981), Ku (1983), Gelfand and Ku (1991), Mukhpadhyay (2000).

39 Dse-respnse mdeling with Dirichlet prcess prirs Assuming (cnditinally) independent utcmes at different dse levels, the likelihd is given by k i=1 where p i = F (x i ) fr i = 1,..., k. p y i i (1 p i ) n i y i, If the prir fr F is a DP with precisin parameter α > 0 and base c.d.f. F 0 (the prir guess fr the ptency curve), then a priri (p 1, p 2 p 1,..., p k p k 1, 1 p k ) fllws a Dirichlet distributin with parameters (αf 0 (x 1 ), α(f 0 (x 2 ) F 0 (x 1 )),..., α(f 0 (x k ) F 0 (x k 1 )), α(1 F 0 (x k ))). The psterir fr F is a mixture f Dirichlet prcesses (Antniak, 1974).

40 Mixtures f Dirichlet prcesses Extensin f the DP t a hierarchical versin: set G α, ψ DP(α, G 0 ( ψ)), where (parametric) prirs are added t the precisin parameter α and/r the parameters f the base distributin, ψ. Nt surprisingly, mixtures f Dirichlet prcesses are als cnjugate prirs. Mixtures f Dirichlet prcesses are different frm Dirichlet prcess mixture mdels (which we will study in Sessin 2). Hwever, there are imprtant cnnectins between the tw.

41 Bayesian nnparametric mdeling fr cytgenetic dsimetry Cytgenetic dsimetry (in vitr setting): samples f cell cultures expsed t a range f dses f a given agent in each sample, at each dse level, a measure f cell disability is recrded. Dse-respnse mdeling framewrk, where dse is the frm f expsure t radiatin, and respnse is the measure f genetic aberratin (in viv setting, human expsures), r cell disability (in vitr setting, cell cultures f human lymphcytes) Fcus n (rdered) plytmus categrical respnses: x i : dse levels (with x 1 < x 2 <... < x k ). n i : number f cells at dse level i. y i = (y i1,..., y ir ): respnse vectr (r 2 classificatins) at dse level i. Hence, nw y i p i Mult(n i, p i ), where p i = (p i1,..., p ir ).

42 Bayesian nnparametric mdeling fr cytgenetic dsimetry Bayesian nnparametric mdeling fr plytmus respnse (Kttas, Branc and Gelfand, 2002). Cnsider simple case with r = 3 mdel fr p i1 and p i2 is needed. Mdel p i1 = F 1 (x i ) and p i1 + p i2 = F 2 (x i ), and thus F 1 ( ) F 2 ( ). Bayesian nnparametric mdel requires prir n the space {(F 1, F 2 ) : F 1 ( ) F 2 ( )} f stchastically rdered pairs f c.d.f.s (F 1, F 2 ). Cnstructive apprach: F 1 ( ) = G 1 ( )G 2 ( ), and F 2 ( ) = G 1 ( ), with independent DP(α l, G 0l ) prirs fr G l, l = 1, 2. Induced prir fr q l = (q l,1,..., q l,k ), l = 1, 2, where q l,i = G l (x i ).

43 Bayesian nnparametric mdeling fr cytgenetic dsimetry Cmbining with the likelihd, the psterir fr (q 1, q 2 ) is p(q 1, q 2 data) k { } (1 q 1i ) y i3 q y i1 2i (1 q 2i ) y i2 where i=1 q y i1+y i2 1i q γ (q 12 q 11 ) γ (q 1k q 1,k 1 ) γ k 1 (1 q 1k ) γ k+1 1 q δ (q 22 q 21 ) δ (q 2k q 2,k 1 ) δ k 1 (1 q 2k ) δ k+1 1 γ i = α 1 (G 01 (x i ) G 01 (x i 1 )), δ i = α 2 (G 02 (x i ) G 02 (x i 1 )). Psterirs fr G l (x i ) fr l = 1, 2 prvide psterirs fr F 1 (x i ) and F 2 (x i ), fr all x i.

44 Bayesian nnparametric mdeling fr cytgenetic dsimetry Inference based n an MCMC algrithm. Fr any unbserved dse level x 0, the psterir (predictive) distributin fr q l,0 = G l (x 0) fr l = 1, 2, is given by p(q l,0 data) = p(q l,0 q l )p(q l data)dq l where p(q l,0 q l ) is a rescaled Beta distributin. The inversin prblem (inference fr an unknwn x 0 fr specified respnse values y 0 = (y 01, y 02, y 03)) can be handled by extending the MCMC algrithm t the augmented psterir that includes the additinal parameter vectr (x 0, q 10, q 20). Fr the data illustratins, we cmpare with a parametric lgit mdel, lg p ij p i3 = β 1j + β 2j x i, i = 1,..., k, j = 1, 2 (mdel fitting, predictin, and inversin are straightfrward under this mdel).

45 Cytgenetic dsimetry: simulated data examples Figure 1.4: Tw data sets generated frm the parametric mdel. Psterir inference fr F 1 (upper panels) and F 2 (lwer panels) under the parametric (dashed lines) and nnparametric (slid lines) mdel. dentes the bserved data. The left and right panels crrespnd t the data set with the smaller and large sample sizes, respectively.

46 Cytgenetic dsimetry: simulated data examples Figure 1.5: Tw data sets generated using nn-standard (bimdal) shapes fr F 1 and F 2. Psterir inference fr F 1 (upper panels) and F 2 (lwer panels) under the parametric (dashed lines) and nnparametric (slid lines) mdel. dentes the bserved data. The left and right panels crrespnd t the data set with the smaller and large sample sizes, respectively.

47 Semiparametric regressin fr categrical respnses Applicatin f DP-based mdeling t semiparametric regressin with categrical respnses. Categrical respnses y i, i = 1,..., N (e.g., cunts r prprtins). Cvariate vectr x i fr the i-th respnse, cmprising either categrical predictrs r quantitative predictrs with a finite set f pssible values. K N predictr prfiles (cells), where each cell k (k = 1,..., K) is a cmbinatin f bserved predictr values k(i) dentes the cell crrespnding t the i-th respnse. Assume that all respnses in a cell are exchangeable with distributin F k, k = 1,..., K.

48 Semiparametric regressin fr categrical respnses Prduct f mixtures f Dirichlet prcesses prir (Cifarelli and Regazzini, 1978) fr the cell-specific randm distributins F k, k = 1,..., K: cnditinally n hyperparameters α k and θ k, the F k are assigned independent DP(α k, F 0k ( ; θ k )) prirs, where, in general, θ k = (θ 1k,..., θ Dk ) the F k are related by mdeling the α k (k = 1,..., K) and/r the θ dk (k = 1,..., K; d = 1,..., D) as linear cmbinatins f the predictrs (thrugh specified link functins h d, d = 0, 1,..., D) h 0(α k ) = x T k γ, k = 1,..., K h d (θ dk ) = x T k β d, k = 1,..., K; d = 1,..., D (parametric) prirs fr the vectrs f regressin cefficients γ and β d DP-based prir mdel that induces dependence in the finite cllectin f distributins {F 1,..., F K }, thugh a weaker type f dependence than dependent DP prirs (MacEachern, 2000). [Dependent nnparametric prir mdels will be studied in Sessin 4.]

49 Semiparametric regressin fr categrical respnses Semiparametric structure centered arund a parametric backbne defined by the F 0k ( ; θ k ) useful interpretatin and cnnectins with parametric regressin mdels. Example: regressin mdel fr cunts (Carta and Parmigiani, 2002) y i {F 1,..., F K } N F k(i) (y i ) ind. i=1 F k α k, θ k DP(α k, Pissn( ; θ k )), k = 1,..., K lg(α k ) = x T k γ lg(θ k) = x T k β, k = 1,..., K with prirs fr β and γ Related wrk fr: change-pint prblems (Mira and Petrne, 1996); dse-respnse mdeling fr txiclgy data (Dminici and Parmigiani, 2001); variable selectin in survival analysis (Giudici, Mezzetti and Muliere, 2003).

Bayesian nonparametric modeling approaches for quantile regression

Bayesian nonparametric modeling approaches for quantile regression Bayesian nnparametric mdeling appraches fr quantile regressin Athanasis Kttas Department f Applied Mathematics and Statistics University f Califrnia, Santa Cruz Department f Statistics Athens University

More information

Bootstrap Method > # Purpose: understand how bootstrap method works > obs=c(11.96, 5.03, 67.40, 16.07, 31.50, 7.73, 11.10, 22.38) > n=length(obs) >

Bootstrap Method > # Purpose: understand how bootstrap method works > obs=c(11.96, 5.03, 67.40, 16.07, 31.50, 7.73, 11.10, 22.38) > n=length(obs) > Btstrap Methd > # Purpse: understand hw btstrap methd wrks > bs=c(11.96, 5.03, 67.40, 16.07, 31.50, 7.73, 11.10, 22.38) > n=length(bs) > mean(bs) [1] 21.64625 > # estimate f lambda > lambda = 1/mean(bs);

More information

Lecture 2: Supervised vs. unsupervised learning, bias-variance tradeoff

Lecture 2: Supervised vs. unsupervised learning, bias-variance tradeoff Lecture 2: Supervised vs. unsupervised learning, bias-variance tradeff Reading: Chapter 2 STATS 202: Data mining and analysis September 27, 2017 1 / 20 Supervised vs. unsupervised learning In unsupervised

More information

MATCHING TECHNIQUES. Technical Track Session VI. Emanuela Galasso. The World Bank

MATCHING TECHNIQUES. Technical Track Session VI. Emanuela Galasso. The World Bank MATCHING TECHNIQUES Technical Track Sessin VI Emanuela Galass The Wrld Bank These slides were develped by Christel Vermeersch and mdified by Emanuela Galass fr the purpse f this wrkshp When can we use

More information

Lecture 2: Supervised vs. unsupervised learning, bias-variance tradeoff

Lecture 2: Supervised vs. unsupervised learning, bias-variance tradeoff Lecture 2: Supervised vs. unsupervised learning, bias-variance tradeff Reading: Chapter 2 STATS 202: Data mining and analysis September 27, 2017 1 / 20 Supervised vs. unsupervised learning In unsupervised

More information

Distributions, spatial statistics and a Bayesian perspective

Distributions, spatial statistics and a Bayesian perspective Distributins, spatial statistics and a Bayesian perspective Dug Nychka Natinal Center fr Atmspheric Research Distributins and densities Cnditinal distributins and Bayes Thm Bivariate nrmal Spatial statistics

More information

MATCHING TECHNIQUES Technical Track Session VI Céline Ferré The World Bank

MATCHING TECHNIQUES Technical Track Session VI Céline Ferré The World Bank MATCHING TECHNIQUES Technical Track Sessin VI Céline Ferré The Wrld Bank When can we use matching? What if the assignment t the treatment is nt dne randmly r based n an eligibility index, but n the basis

More information

What is Statistical Learning?

What is Statistical Learning? What is Statistical Learning? Sales 5 10 15 20 25 Sales 5 10 15 20 25 Sales 5 10 15 20 25 0 50 100 200 300 TV 0 10 20 30 40 50 Radi 0 20 40 60 80 100 Newspaper Shwn are Sales vs TV, Radi and Newspaper,

More information

Maximum A Posteriori (MAP) CS 109 Lecture 22 May 16th, 2016

Maximum A Posteriori (MAP) CS 109 Lecture 22 May 16th, 2016 Maximum A Psteriri (MAP) CS 109 Lecture 22 May 16th, 2016 Previusly in CS109 Game f Estimatrs Maximum Likelihd Nn spiler: this didn t happen Side Plt argmax argmax f lg Mther f ptimizatins? Reviving an

More information

Simple Linear Regression (single variable)

Simple Linear Regression (single variable) Simple Linear Regressin (single variable) Intrductin t Machine Learning Marek Petrik January 31, 2017 Sme f the figures in this presentatin are taken frm An Intrductin t Statistical Learning, with applicatins

More information

SUPPLEMENTARY MATERIAL GaGa: a simple and flexible hierarchical model for microarray data analysis

SUPPLEMENTARY MATERIAL GaGa: a simple and flexible hierarchical model for microarray data analysis SUPPLEMENTARY MATERIAL GaGa: a simple and flexible hierarchical mdel fr micrarray data analysis David Rssell Department f Bistatistics M.D. Andersn Cancer Center, Hustn, TX 77030, USA rsselldavid@gmail.cm

More information

Tree Structured Classifier

Tree Structured Classifier Tree Structured Classifier Reference: Classificatin and Regressin Trees by L. Breiman, J. H. Friedman, R. A. Olshen, and C. J. Stne, Chapman & Hall, 98. A Medical Eample (CART): Predict high risk patients

More information

AP Statistics Notes Unit Two: The Normal Distributions

AP Statistics Notes Unit Two: The Normal Distributions AP Statistics Ntes Unit Tw: The Nrmal Distributins Syllabus Objectives: 1.5 The student will summarize distributins f data measuring the psitin using quartiles, percentiles, and standardized scres (z-scres).

More information

COMP 551 Applied Machine Learning Lecture 5: Generative models for linear classification

COMP 551 Applied Machine Learning Lecture 5: Generative models for linear classification COMP 551 Applied Machine Learning Lecture 5: Generative mdels fr linear classificatin Instructr: Herke van Hf (herke.vanhf@mail.mcgill.ca) Slides mstly by: Jelle Pineau Class web page: www.cs.mcgill.ca/~hvanh2/cmp551

More information

Computational modeling techniques

Computational modeling techniques Cmputatinal mdeling techniques Lecture 4: Mdel checing fr ODE mdels In Petre Department f IT, Åb Aademi http://www.users.ab.fi/ipetre/cmpmd/ Cntent Stichimetric matrix Calculating the mass cnservatin relatins

More information

Resampling Methods. Cross-validation, Bootstrapping. Marek Petrik 2/21/2017

Resampling Methods. Cross-validation, Bootstrapping. Marek Petrik 2/21/2017 Resampling Methds Crss-validatin, Btstrapping Marek Petrik 2/21/2017 Sme f the figures in this presentatin are taken frm An Intrductin t Statistical Learning, with applicatins in R (Springer, 2013) with

More information

Bayesian Nonparametric Modeling in Quantile Regression

Bayesian Nonparametric Modeling in Quantile Regression Bayesian Nnparametric Mdeling in Quantile Regressin Athanasis Kttas and Milvan Krnjajić Abstract We prpse Bayesian nnparametric methdlgy fr quantile regressin mdeling. In particular, e develp Dirichlet

More information

Resampling Methods. Chapter 5. Chapter 5 1 / 52

Resampling Methods. Chapter 5. Chapter 5 1 / 52 Resampling Methds Chapter 5 Chapter 5 1 / 52 1 51 Validatin set apprach 2 52 Crss validatin 3 53 Btstrap Chapter 5 2 / 52 Abut Resampling An imprtant statistical tl Pretending the data as ppulatin and

More information

CS 109 Lecture 23 May 18th, 2016

CS 109 Lecture 23 May 18th, 2016 CS 109 Lecture 23 May 18th, 2016 New Datasets Heart Ancestry Netflix Our Path Parameter Estimatin Machine Learning: Frmally Many different frms f Machine Learning We fcus n the prblem f predictin Want

More information

Eric Klein and Ning Sa

Eric Klein and Ning Sa Week 12. Statistical Appraches t Netwrks: p1 and p* Wasserman and Faust Chapter 15: Statistical Analysis f Single Relatinal Netwrks There are fur tasks in psitinal analysis: 1) Define Equivalence 2) Measure

More information

CAUSAL INFERENCE. Technical Track Session I. Phillippe Leite. The World Bank

CAUSAL INFERENCE. Technical Track Session I. Phillippe Leite. The World Bank CAUSAL INFERENCE Technical Track Sessin I Phillippe Leite The Wrld Bank These slides were develped by Christel Vermeersch and mdified by Phillippe Leite fr the purpse f this wrkshp Plicy questins are causal

More information

, which yields. where z1. and z2

, which yields. where z1. and z2 The Gaussian r Nrmal PDF, Page 1 The Gaussian r Nrmal Prbability Density Functin Authr: Jhn M Cimbala, Penn State University Latest revisin: 11 September 13 The Gaussian r Nrmal Prbability Density Functin

More information

4th Indian Institute of Astrophysics - PennState Astrostatistics School July, 2013 Vainu Bappu Observatory, Kavalur. Correlation and Regression

4th Indian Institute of Astrophysics - PennState Astrostatistics School July, 2013 Vainu Bappu Observatory, Kavalur. Correlation and Regression 4th Indian Institute f Astrphysics - PennState Astrstatistics Schl July, 2013 Vainu Bappu Observatry, Kavalur Crrelatin and Regressin Rahul Ry Indian Statistical Institute, Delhi. Crrelatin Cnsider a tw

More information

PSU GISPOPSCI June 2011 Ordinary Least Squares & Spatial Linear Regression in GeoDa

PSU GISPOPSCI June 2011 Ordinary Least Squares & Spatial Linear Regression in GeoDa There are tw parts t this lab. The first is intended t demnstrate hw t request and interpret the spatial diagnstics f a standard OLS regressin mdel using GeDa. The diagnstics prvide infrmatin abut the

More information

Differentiation Applications 1: Related Rates

Differentiation Applications 1: Related Rates Differentiatin Applicatins 1: Related Rates 151 Differentiatin Applicatins 1: Related Rates Mdel 1: Sliding Ladder 10 ladder y 10 ladder 10 ladder A 10 ft ladder is leaning against a wall when the bttm

More information

Module 3: Gaussian Process Parameter Estimation, Prediction Uncertainty, and Diagnostics

Module 3: Gaussian Process Parameter Estimation, Prediction Uncertainty, and Diagnostics Mdule 3: Gaussian Prcess Parameter Estimatin, Predictin Uncertainty, and Diagnstics Jerme Sacks and William J Welch Natinal Institute f Statistical Sciences and University f British Clumbia Adapted frm

More information

T Algorithmic methods for data mining. Slide set 6: dimensionality reduction

T Algorithmic methods for data mining. Slide set 6: dimensionality reduction T-61.5060 Algrithmic methds fr data mining Slide set 6: dimensinality reductin reading assignment LRU bk: 11.1 11.3 PCA tutrial in mycurses (ptinal) ptinal: An Elementary Prf f a Therem f Jhnsn and Lindenstrauss,

More information

3.4 Shrinkage Methods Prostate Cancer Data Example (Continued) Ridge Regression

3.4 Shrinkage Methods Prostate Cancer Data Example (Continued) Ridge Regression 3.3.4 Prstate Cancer Data Example (Cntinued) 3.4 Shrinkage Methds 61 Table 3.3 shws the cefficients frm a number f different selectin and shrinkage methds. They are best-subset selectin using an all-subsets

More information

CHAPTER 24: INFERENCE IN REGRESSION. Chapter 24: Make inferences about the population from which the sample data came.

CHAPTER 24: INFERENCE IN REGRESSION. Chapter 24: Make inferences about the population from which the sample data came. MATH 1342 Ch. 24 April 25 and 27, 2013 Page 1 f 5 CHAPTER 24: INFERENCE IN REGRESSION Chapters 4 and 5: Relatinships between tw quantitative variables. Be able t Make a graph (scatterplt) Summarize the

More information

A New Evaluation Measure. J. Joiner and L. Werner. The problems of evaluation and the needed criteria of evaluation

A New Evaluation Measure. J. Joiner and L. Werner. The problems of evaluation and the needed criteria of evaluation III-l III. A New Evaluatin Measure J. Jiner and L. Werner Abstract The prblems f evaluatin and the needed criteria f evaluatin measures in the SMART system f infrmatin retrieval are reviewed and discussed.

More information

The Law of Total Probability, Bayes Rule, and Random Variables (Oh My!)

The Law of Total Probability, Bayes Rule, and Random Variables (Oh My!) The Law f Ttal Prbability, Bayes Rule, and Randm Variables (Oh My!) Administrivia Hmewrk 2 is psted and is due tw Friday s frm nw If yu didn t start early last time, please d s this time. Gd Milestnes:

More information

Chapter 3: Cluster Analysis

Chapter 3: Cluster Analysis Chapter 3: Cluster Analysis } 3.1 Basic Cncepts f Clustering 3.1.1 Cluster Analysis 3.1. Clustering Categries } 3. Partitining Methds 3..1 The principle 3.. K-Means Methd 3..3 K-Medids Methd 3..4 CLARA

More information

Internal vs. external validity. External validity. This section is based on Stock and Watson s Chapter 9.

Internal vs. external validity. External validity. This section is based on Stock and Watson s Chapter 9. Sectin 7 Mdel Assessment This sectin is based n Stck and Watsn s Chapter 9. Internal vs. external validity Internal validity refers t whether the analysis is valid fr the ppulatin and sample being studied.

More information

COMP 551 Applied Machine Learning Lecture 11: Support Vector Machines

COMP 551 Applied Machine Learning Lecture 11: Support Vector Machines COMP 551 Applied Machine Learning Lecture 11: Supprt Vectr Machines Instructr: (jpineau@cs.mcgill.ca) Class web page: www.cs.mcgill.ca/~jpineau/cmp551 Unless therwise nted, all material psted fr this curse

More information

LHS Mathematics Department Honors Pre-Calculus Final Exam 2002 Answers

LHS Mathematics Department Honors Pre-Calculus Final Exam 2002 Answers LHS Mathematics Department Hnrs Pre-alculus Final Eam nswers Part Shrt Prblems The table at the right gives the ppulatin f Massachusetts ver the past several decades Using an epnential mdel, predict the

More information

Part 3 Introduction to statistical classification techniques

Part 3 Introduction to statistical classification techniques Part 3 Intrductin t statistical classificatin techniques Machine Learning, Part 3, March 07 Fabi Rli Preamble ØIn Part we have seen that if we knw: Psterir prbabilities P(ω i / ) Or the equivalent terms

More information

Pattern Recognition 2014 Support Vector Machines

Pattern Recognition 2014 Support Vector Machines Pattern Recgnitin 2014 Supprt Vectr Machines Ad Feelders Universiteit Utrecht Ad Feelders ( Universiteit Utrecht ) Pattern Recgnitin 1 / 55 Overview 1 Separable Case 2 Kernel Functins 3 Allwing Errrs (Sft

More information

CN700 Additive Models and Trees Chapter 9: Hastie et al. (2001)

CN700 Additive Models and Trees Chapter 9: Hastie et al. (2001) CN700 Additive Mdels and Trees Chapter 9: Hastie et al. (2001) Madhusudana Shashanka Department f Cgnitive and Neural Systems Bstn University CN700 - Additive Mdels and Trees March 02, 2004 p.1/34 Overview

More information

Comparing Several Means: ANOVA. Group Means and Grand Mean

Comparing Several Means: ANOVA. Group Means and Grand Mean STAT 511 ANOVA and Regressin 1 Cmparing Several Means: ANOVA Slide 1 Blue Lake snap beans were grwn in 12 pen-tp chambers which are subject t 4 treatments 3 each with O 3 and SO 2 present/absent. The ttal

More information

A Matrix Representation of Panel Data

A Matrix Representation of Panel Data web Extensin 6 Appendix 6.A A Matrix Representatin f Panel Data Panel data mdels cme in tw brad varieties, distinct intercept DGPs and errr cmpnent DGPs. his appendix presents matrix algebra representatins

More information

Hypothesis Tests for One Population Mean

Hypothesis Tests for One Population Mean Hypthesis Tests fr One Ppulatin Mean Chapter 9 Ala Abdelbaki Objective Objective: T estimate the value f ne ppulatin mean Inferential statistics using statistics in rder t estimate parameters We will be

More information

initially lcated away frm the data set never win the cmpetitin, resulting in a nnptimal nal cdebk, [2] [3] [4] and [5]. Khnen's Self Organizing Featur

initially lcated away frm the data set never win the cmpetitin, resulting in a nnptimal nal cdebk, [2] [3] [4] and [5]. Khnen's Self Organizing Featur Cdewrd Distributin fr Frequency Sensitive Cmpetitive Learning with One Dimensinal Input Data Aristides S. Galanpuls and Stanley C. Ahalt Department f Electrical Engineering The Ohi State University Abstract

More information

Introduction to Regression

Introduction to Regression Intrductin t Regressin Administrivia Hmewrk 6 psted later tnight. Due Friday after Break. 2 Statistical Mdeling Thus far we ve talked abut Descriptive Statistics: This is the way my sample is Inferential

More information

Chapter 8: The Binomial and Geometric Distributions

Chapter 8: The Binomial and Geometric Distributions Sectin 8.1: The Binmial Distributins Chapter 8: The Binmial and Gemetric Distributins A randm variable X is called a BINOMIAL RANDOM VARIABLE if it meets ALL the fllwing cnditins: 1) 2) 3) 4) The MOST

More information

Computational modeling techniques

Computational modeling techniques Cmputatinal mdeling techniques Lecture 2: Mdeling change. In Petre Department f IT, Åb Akademi http://users.ab.fi/ipetre/cmpmd/ Cntent f the lecture Basic paradigm f mdeling change Examples Linear dynamical

More information

CHAPTER 4 DIAGNOSTICS FOR INFLUENTIAL OBSERVATIONS

CHAPTER 4 DIAGNOSTICS FOR INFLUENTIAL OBSERVATIONS CHAPTER 4 DIAGNOSTICS FOR INFLUENTIAL OBSERVATIONS 1 Influential bservatins are bservatins whse presence in the data can have a distrting effect n the parameter estimates and pssibly the entire analysis,

More information

The blessing of dimensionality for kernel methods

The blessing of dimensionality for kernel methods fr kernel methds Building classifiers in high dimensinal space Pierre Dupnt Pierre.Dupnt@ucluvain.be Classifiers define decisin surfaces in sme feature space where the data is either initially represented

More information

AP Statistics Practice Test Unit Three Exploring Relationships Between Variables. Name Period Date

AP Statistics Practice Test Unit Three Exploring Relationships Between Variables. Name Period Date AP Statistics Practice Test Unit Three Explring Relatinships Between Variables Name Perid Date True r False: 1. Crrelatin and regressin require explanatry and respnse variables. 1. 2. Every least squares

More information

INSTRUMENTAL VARIABLES

INSTRUMENTAL VARIABLES INSTRUMENTAL VARIABLES Technical Track Sessin IV Sergi Urzua University f Maryland Instrumental Variables and IE Tw main uses f IV in impact evaluatin: 1. Crrect fr difference between assignment f treatment

More information

MODULE FOUR. This module addresses functions. SC Academic Elementary Algebra Standards:

MODULE FOUR. This module addresses functions. SC Academic Elementary Algebra Standards: MODULE FOUR This mdule addresses functins SC Academic Standards: EA-3.1 Classify a relatinship as being either a functin r nt a functin when given data as a table, set f rdered pairs, r graph. EA-3.2 Use

More information

making triangle (ie same reference angle) ). This is a standard form that will allow us all to have the X= y=

making triangle (ie same reference angle) ). This is a standard form that will allow us all to have the X= y= Intrductin t Vectrs I 21 Intrductin t Vectrs I 22 I. Determine the hrizntal and vertical cmpnents f the resultant vectr by cunting n the grid. X= y= J. Draw a mangle with hrizntal and vertical cmpnents

More information

Sequential Allocation with Minimal Switching

Sequential Allocation with Minimal Switching In Cmputing Science and Statistics 28 (1996), pp. 567 572 Sequential Allcatin with Minimal Switching Quentin F. Stut 1 Janis Hardwick 1 EECS Dept., University f Michigan Statistics Dept., Purdue University

More information

Least Squares Optimal Filtering with Multirate Observations

Least Squares Optimal Filtering with Multirate Observations Prc. 36th Asilmar Cnf. n Signals, Systems, and Cmputers, Pacific Grve, CA, Nvember 2002 Least Squares Optimal Filtering with Multirate Observatins Charles W. herrien and Anthny H. Hawes Department f Electrical

More information

Department of Economics, University of California, Davis Ecn 200C Micro Theory Professor Giacomo Bonanno. Insurance Markets

Department of Economics, University of California, Davis Ecn 200C Micro Theory Professor Giacomo Bonanno. Insurance Markets Department f Ecnmics, University f alifrnia, Davis Ecn 200 Micr Thery Prfessr Giacm Bnann Insurance Markets nsider an individual wh has an initial wealth f. ith sme prbability p he faces a lss f x (0

More information

7 TH GRADE MATH STANDARDS

7 TH GRADE MATH STANDARDS ALGEBRA STANDARDS Gal 1: Students will use the language f algebra t explre, describe, represent, and analyze number expressins and relatins 7 TH GRADE MATH STANDARDS 7.M.1.1: (Cmprehensin) Select, use,

More information

Source Coding and Compression

Source Coding and Compression Surce Cding and Cmpressin Heik Schwarz Cntact: Dr.-Ing. Heik Schwarz heik.schwarz@hhi.fraunhfer.de Heik Schwarz Surce Cding and Cmpressin September 22, 2013 1 / 60 PartI: Surce Cding Fundamentals Heik

More information

The general linear model and Statistical Parametric Mapping I: Introduction to the GLM

The general linear model and Statistical Parametric Mapping I: Introduction to the GLM The general linear mdel and Statistical Parametric Mapping I: Intrductin t the GLM Alexa Mrcm and Stefan Kiebel, Rik Hensn, Andrew Hlmes & J-B J Pline Overview Intrductin Essential cncepts Mdelling Design

More information

ENSC Discrete Time Systems. Project Outline. Semester

ENSC Discrete Time Systems. Project Outline. Semester ENSC 49 - iscrete Time Systems Prject Outline Semester 006-1. Objectives The gal f the prject is t design a channel fading simulatr. Upn successful cmpletin f the prject, yu will reinfrce yur understanding

More information

SAMPLING DYNAMICAL SYSTEMS

SAMPLING DYNAMICAL SYSTEMS SAMPLING DYNAMICAL SYSTEMS Melvin J. Hinich Applied Research Labratries The University f Texas at Austin Austin, TX 78713-8029, USA (512) 835-3278 (Vice) 835-3259 (Fax) hinich@mail.la.utexas.edu ABSTRACT

More information

BOUNDED UNCERTAINTY AND CLIMATE CHANGE ECONOMICS. Christopher Costello, Andrew Solow, Michael Neubert, and Stephen Polasky

BOUNDED UNCERTAINTY AND CLIMATE CHANGE ECONOMICS. Christopher Costello, Andrew Solow, Michael Neubert, and Stephen Polasky BOUNDED UNCERTAINTY AND CLIMATE CHANGE ECONOMICS Christpher Cstell, Andrew Slw, Michael Neubert, and Stephen Plasky Intrductin The central questin in the ecnmic analysis f climate change plicy cncerns

More information

Probability, Random Variables, and Processes. Probability

Probability, Random Variables, and Processes. Probability Prbability, Randm Variables, and Prcesses Prbability Prbability Prbability thery: branch f mathematics fr descriptin and mdelling f randm events Mdern prbability thery - the aximatic definitin f prbability

More information

Kinetic Model Completeness

Kinetic Model Completeness 5.68J/10.652J Spring 2003 Lecture Ntes Tuesday April 15, 2003 Kinetic Mdel Cmpleteness We say a chemical kinetic mdel is cmplete fr a particular reactin cnditin when it cntains all the species and reactins

More information

Determining the Accuracy of Modal Parameter Estimation Methods

Determining the Accuracy of Modal Parameter Estimation Methods Determining the Accuracy f Mdal Parameter Estimatin Methds by Michael Lee Ph.D., P.E. & Mar Richardsn Ph.D. Structural Measurement Systems Milpitas, CA Abstract The mst cmmn type f mdal testing system

More information

Emphases in Common Core Standards for Mathematical Content Kindergarten High School

Emphases in Common Core Standards for Mathematical Content Kindergarten High School Emphases in Cmmn Cre Standards fr Mathematical Cntent Kindergarten High Schl Cntent Emphases by Cluster March 12, 2012 Describes cntent emphases in the standards at the cluster level fr each grade. These

More information

Modelling of Clock Behaviour. Don Percival. Applied Physics Laboratory University of Washington Seattle, Washington, USA

Modelling of Clock Behaviour. Don Percival. Applied Physics Laboratory University of Washington Seattle, Washington, USA Mdelling f Clck Behaviur Dn Percival Applied Physics Labratry University f Washingtn Seattle, Washingtn, USA verheads and paper fr talk available at http://faculty.washingtn.edu/dbp/talks.html 1 Overview

More information

COMP 551 Applied Machine Learning Lecture 4: Linear classification

COMP 551 Applied Machine Learning Lecture 4: Linear classification COMP 551 Applied Machine Learning Lecture 4: Linear classificatin Instructr: Jelle Pineau (jpineau@cs.mcgill.ca) Class web page: www.cs.mcgill.ca/~jpineau/cmp551 Unless therwise nted, all material psted

More information

Math Foundations 20 Work Plan

Math Foundations 20 Work Plan Math Fundatins 20 Wrk Plan Units / Tpics 20.8 Demnstrate understanding f systems f linear inequalities in tw variables. Time Frame December 1-3 weeks 6-10 Majr Learning Indicatrs Identify situatins relevant

More information

SticiGui Chapter 4: Measures of Location and Spread Philip Stark (2013)

SticiGui Chapter 4: Measures of Location and Spread Philip Stark (2013) SticiGui Chapter 4: Measures f Lcatin and Spread Philip Stark (2013) Summarizing data can help us understand them, especially when the number f data is large. This chapter presents several ways t summarize

More information

February 28, 2013 COMMENTS ON DIFFUSION, DIFFUSIVITY AND DERIVATION OF HYPERBOLIC EQUATIONS DESCRIBING THE DIFFUSION PHENOMENA

February 28, 2013 COMMENTS ON DIFFUSION, DIFFUSIVITY AND DERIVATION OF HYPERBOLIC EQUATIONS DESCRIBING THE DIFFUSION PHENOMENA February 28, 2013 COMMENTS ON DIFFUSION, DIFFUSIVITY AND DERIVATION OF HYPERBOLIC EQUATIONS DESCRIBING THE DIFFUSION PHENOMENA Mental Experiment regarding 1D randm walk Cnsider a cntainer f gas in thermal

More information

x 1 Outline IAML: Logistic Regression Decision Boundaries Example Data

x 1 Outline IAML: Logistic Regression Decision Boundaries Example Data Outline IAML: Lgistic Regressin Charles Suttn and Victr Lavrenk Schl f Infrmatics Semester Lgistic functin Lgistic regressin Learning lgistic regressin Optimizatin The pwer f nn-linear basis functins Least-squares

More information

AP Statistics Notes Unit Five: Randomness and Probability

AP Statistics Notes Unit Five: Randomness and Probability AP Statistics Ntes Unit Five: Randmness and Prbability Syllabus Objectives: 3.1 The student will interpret prbability, including the lng-term relative frequency distributin. 3.2 The student will discuss

More information

Smoothing, penalized least squares and splines

Smoothing, penalized least squares and splines Smthing, penalized least squares and splines Duglas Nychka, www.image.ucar.edu/~nychka Lcally weighted averages Penalized least squares smthers Prperties f smthers Splines and Reprducing Kernels The interplatin

More information

5 th grade Common Core Standards

5 th grade Common Core Standards 5 th grade Cmmn Cre Standards In Grade 5, instructinal time shuld fcus n three critical areas: (1) develping fluency with additin and subtractin f fractins, and develping understanding f the multiplicatin

More information

NUMBERS, MATHEMATICS AND EQUATIONS

NUMBERS, MATHEMATICS AND EQUATIONS AUSTRALIAN CURRICULUM PHYSICS GETTING STARTED WITH PHYSICS NUMBERS, MATHEMATICS AND EQUATIONS An integral part t the understanding f ur physical wrld is the use f mathematical mdels which can be used t

More information

Rigid Body Dynamics (continued)

Rigid Body Dynamics (continued) Last time: Rigid dy Dynamics (cntinued) Discussin f pint mass, rigid bdy as useful abstractins f reality Many-particle apprach t rigid bdy mdeling: Newtn s Secnd Law, Euler s Law Cntinuus bdy apprach t

More information

You need to be able to define the following terms and answer basic questions about them:

You need to be able to define the following terms and answer basic questions about them: CS440/ECE448 Sectin Q Fall 2017 Midterm Review Yu need t be able t define the fllwing terms and answer basic questins abut them: Intr t AI, agents and envirnments Pssible definitins f AI, prs and cns f

More information

We can see from the graph above that the intersection is, i.e., [ ).

We can see from the graph above that the intersection is, i.e., [ ). MTH 111 Cllege Algebra Lecture Ntes July 2, 2014 Functin Arithmetic: With nt t much difficulty, we ntice that inputs f functins are numbers, and utputs f functins are numbers. S whatever we can d with

More information

Name: Block: Date: Science 10: The Great Geyser Experiment A controlled experiment

Name: Block: Date: Science 10: The Great Geyser Experiment A controlled experiment Science 10: The Great Geyser Experiment A cntrlled experiment Yu will prduce a GEYSER by drpping Ments int a bttle f diet pp Sme questins t think abut are: What are yu ging t test? What are yu ging t measure?

More information

Module 4: General Formulation of Electric Circuit Theory

Module 4: General Formulation of Electric Circuit Theory Mdule 4: General Frmulatin f Electric Circuit Thery 4. General Frmulatin f Electric Circuit Thery All electrmagnetic phenmena are described at a fundamental level by Maxwell's equatins and the assciated

More information

Lead/Lag Compensator Frequency Domain Properties and Design Methods

Lead/Lag Compensator Frequency Domain Properties and Design Methods Lectures 6 and 7 Lead/Lag Cmpensatr Frequency Dmain Prperties and Design Methds Definitin Cnsider the cmpensatr (ie cntrller Fr, it is called a lag cmpensatr s K Fr s, it is called a lead cmpensatr Ntatin

More information

How do scientists measure trees? What is DBH?

How do scientists measure trees? What is DBH? Hw d scientists measure trees? What is DBH? Purpse Students develp an understanding f tree size and hw scientists measure trees. Students bserve and measure tree ckies and explre the relatinship between

More information

Subject description processes

Subject description processes Subject representatin 6.1.2. Subject descriptin prcesses Overview Fur majr prcesses r areas f practice fr representing subjects are classificatin, subject catalging, indexing, and abstracting. The prcesses

More information

Inference in the Multiple-Regression

Inference in the Multiple-Regression Sectin 5 Mdel Inference in the Multiple-Regressin Kinds f hypthesis tests in a multiple regressin There are several distinct kinds f hypthesis tests we can run in a multiple regressin. Suppse that amng

More information

Methods for Determination of Mean Speckle Size in Simulated Speckle Pattern

Methods for Determination of Mean Speckle Size in Simulated Speckle Pattern 0.478/msr-04-004 MEASUREMENT SCENCE REVEW, Vlume 4, N. 3, 04 Methds fr Determinatin f Mean Speckle Size in Simulated Speckle Pattern. Hamarvá, P. Šmíd, P. Hrváth, M. Hrabvský nstitute f Physics f the Academy

More information

Statistics, Numerical Models and Ensembles

Statistics, Numerical Models and Ensembles Statistics, Numerical Mdels and Ensembles Duglas Nychka, Reinhard Furrer,, Dan Cley Claudia Tebaldi, Linda Mearns, Jerry Meehl and Richard Smith (UNC). Spatial predictin and data assimilatin Precipitatin

More information

COMP 551 Applied Machine Learning Lecture 9: Support Vector Machines (cont d)

COMP 551 Applied Machine Learning Lecture 9: Support Vector Machines (cont d) COMP 551 Applied Machine Learning Lecture 9: Supprt Vectr Machines (cnt d) Instructr: Herke van Hf (herke.vanhf@mail.mcgill.ca) Slides mstly by: Class web page: www.cs.mcgill.ca/~hvanh2/cmp551 Unless therwise

More information

Lab 1 The Scientific Method

Lab 1 The Scientific Method INTRODUCTION The fllwing labratry exercise is designed t give yu, the student, an pprtunity t explre unknwn systems, r universes, and hypthesize pssible rules which may gvern the behavir within them. Scientific

More information

ECEN 4872/5827 Lecture Notes

ECEN 4872/5827 Lecture Notes ECEN 4872/5827 Lecture Ntes Lecture #5 Objectives fr lecture #5: 1. Analysis f precisin current reference 2. Appraches fr evaluating tlerances 3. Temperature Cefficients evaluatin technique 4. Fundamentals

More information

Revision: August 19, E Main Suite D Pullman, WA (509) Voice and Fax

Revision: August 19, E Main Suite D Pullman, WA (509) Voice and Fax .7.4: Direct frequency dmain circuit analysis Revisin: August 9, 00 5 E Main Suite D Pullman, WA 9963 (509) 334 6306 ice and Fax Overview n chapter.7., we determined the steadystate respnse f electrical

More information

Cells though to send feedback signals from the medulla back to the lamina o L: Lamina Monopolar cells

Cells though to send feedback signals from the medulla back to the lamina o L: Lamina Monopolar cells Classificatin Rules (and Exceptins) Name: Cell type fllwed by either a clumn ID (determined by the visual lcatin f the cell) r a numeric identifier t separate ut different examples f a given cell type

More information

Edinburgh Research Explorer

Edinburgh Research Explorer Edinburgh Research Explrer Supprting User-Defined Functins n Uncertain Data Citatin fr published versin: Tran, TTL, Dia, Y, Suttn, CA & Liu, A 23, 'Supprting User-Defined Functins n Uncertain Data' Prceedings

More information

CS 477/677 Analysis of Algorithms Fall 2007 Dr. George Bebis Course Project Due Date: 11/29/2007

CS 477/677 Analysis of Algorithms Fall 2007 Dr. George Bebis Course Project Due Date: 11/29/2007 CS 477/677 Analysis f Algrithms Fall 2007 Dr. Gerge Bebis Curse Prject Due Date: 11/29/2007 Part1: Cmparisn f Srting Algrithms (70% f the prject grade) The bjective f the first part f the assignment is

More information

k-nearest Neighbor How to choose k Average of k points more reliable when: Large k: noise in attributes +o o noise in class labels

k-nearest Neighbor How to choose k Average of k points more reliable when: Large k: noise in attributes +o o noise in class labels Mtivating Example Memry-Based Learning Instance-Based Learning K-earest eighbr Inductive Assumptin Similar inputs map t similar utputs If nt true => learning is impssible If true => learning reduces t

More information

Activity Guide Loops and Random Numbers

Activity Guide Loops and Random Numbers Unit 3 Lessn 7 Name(s) Perid Date Activity Guide Lps and Randm Numbers CS Cntent Lps are a relatively straightfrward idea in prgramming - yu want a certain chunk f cde t run repeatedly - but it takes a

More information

Simulation Based Optimal Design

Simulation Based Optimal Design BAYESIAN STATISTICS 6, pp. 000 000 J. M. Bernard, J. O. Berger, A. P. Dawid and A. F. M. Smith (Eds.) Oxfrd University Press, 1998 Simulatin Based Optimal Design PETER MULLER Duke University SUMMARY We

More information

Biplots in Practice MICHAEL GREENACRE. Professor of Statistics at the Pompeu Fabra University. Chapter 13 Offprint

Biplots in Practice MICHAEL GREENACRE. Professor of Statistics at the Pompeu Fabra University. Chapter 13 Offprint Biplts in Practice MICHAEL GREENACRE Prfessr f Statistics at the Pmpeu Fabra University Chapter 13 Offprint CASE STUDY BIOMEDICINE Cmparing Cancer Types Accrding t Gene Epressin Arrays First published:

More information

Guide to Using the Rubric to Score the Klf4 PREBUILD Model for Science Olympiad National Competitions

Guide to Using the Rubric to Score the Klf4 PREBUILD Model for Science Olympiad National Competitions Guide t Using the Rubric t Scre the Klf4 PREBUILD Mdel fr Science Olympiad 2010-2011 Natinal Cmpetitins These instructins are t help the event supervisr and scring judges use the rubric develped by the

More information

Data Mining: Concepts and Techniques. Classification and Prediction. Chapter February 8, 2007 CSE-4412: Data Mining 1

Data Mining: Concepts and Techniques. Classification and Prediction. Chapter February 8, 2007 CSE-4412: Data Mining 1 Data Mining: Cncepts and Techniques Classificatin and Predictin Chapter 6.4-6 February 8, 2007 CSE-4412: Data Mining 1 Chapter 6 Classificatin and Predictin 1. What is classificatin? What is predictin?

More information

NOTE ON A CASE-STUDY IN BOX-JENKINS SEASONAL FORECASTING OF TIME SERIES BY STEFFEN L. LAURITZEN TECHNICAL REPORT NO. 16 APRIL 1974

NOTE ON A CASE-STUDY IN BOX-JENKINS SEASONAL FORECASTING OF TIME SERIES BY STEFFEN L. LAURITZEN TECHNICAL REPORT NO. 16 APRIL 1974 NTE N A CASE-STUDY IN B-JENKINS SEASNAL FRECASTING F TIME SERIES BY STEFFEN L. LAURITZEN TECHNICAL REPRT N. 16 APRIL 1974 PREPARED UNDER CNTRACT N00014-67-A-0112-0030 (NR-042-034) FR THE FFICE F NAVAL

More information

Hierarchical priors and mixture models, with applications in regression and density estimation

Hierarchical priors and mixture models, with applications in regression and density estimation Hierarchical prirs and mixture mdels, with applicatins in regressin and density estimatin By Mike West, Peter Müller & Michael D Escbar In: Aspects f Uncertainty: A Tribute t D.V. Lindley (eds: P Freeman

More information