Hypothesis Tests, Significance

Size: px
Start display at page:

Download "Hypothesis Tests, Significance"

Transcription

1 Hypothesis Tests, Significance Significance as hypothesis test Pitfalls Avoiding pitfalls Goodness-of-fit Context is frequency statistics. 1 Frank Porter, BaBar analysis school Statistics, February 2008

2 Significance as Hypothesis Test When asking for the significance of an observation (of, perhaps a new effect), you ask for a test of the hypotheses: Null hypothesis H 0 : There is no new effect; against Alternative hypothesis H 1 : There is a new effect. Reject the null hypothesis (that is, claim a new effect) if the observation falls in a region that is unlikely if the null hypothesis is correct. Significance (as typically used in HEP, e.g., a significance of 5σ ) is the probability that we erroneously reject the null hypothesis. Also called confidence level, or P -value, or the probability of a Type I error. 2 Frank Porter, BaBar analysis school Statistics, February 2008

3 Significance A 68% confidence interval does not always tell you much about significance. The tails may be non-normal. A separate analysis is generally required, which models the tails appropriately. No recommendation on when to label result as significant. Label implies interpretation. No uniform prescription seems to make sense, involves judgement. Eg, bizarre new particle vs. expected branching fraction. Not our most essential experimental role; up to consumer ultimately to decide what they want to believe. Nevertheless, people insist on making qualitative statements ( observation of, evidence for, discovery of, not significant, consistent with ) Code: observation of > 4σ, evidence for > 3σ 3 Frank Porter, BaBar analysis school Statistics, February 2008

4 Significance (more semantics...) From Physics Today, (coloring mine, references deleted) nb: Just an observation on human nature, no criticism should be inferred. In March, back-to-back papers in Physical Review Letters reported the measurement of CP symmetry violation in the decay of neutral B mesons by groups in Japan and California. Now the word measurement has been replaced by observation in the titles of two new back-to-back reports by these same groups in the 27 August Physical Review Letters. That is to say, with a lot more data and improved event reconstruction, the BaBar collaboration at SLAC and the Belle collaboration at KEK in Japanhaveatlastproducedthefirst compelling evidence of CP violation in any system other than the neutral K mesons. Some people think a measurement should not be called a measurement unless the result is significantly different from zero. A senior Assistant Editor at a prominent journal suggested that bounds on mightbemore appropriate than measurement in reference to a CP asymmetry angle which was observed as consistent with zero. Finding sin 2β =0.00 ± 0.01 would be pretty exciting. Butitisn ta measurement? 4 Frank Porter, BaBar analysis school Statistics, February 2008

5 Computation of significance: Example Flat normal background (avg 100/bin) + gaussian signal (100 events, fixed) withmean 0, σ = Events/ Generated signal = 100 events Cut&Count signal = 194 +_ 39 events Significance = 5.0 Expected significance = x Distribution is normal here, so P = (two-tail). Note: H 0 is flat distribution with no signal. H 1 is N sig 0. 5 Frank Porter, BaBar analysis school Statistics, February 2008

6 Note: The significance is not obtained by dividing the signal estimate (194) by the uncertainty in the signal (39), 194/39 = 5.0. Thatwould be akin to asking how likely a signal of the estimated size would be to fluctuate to zero. It is a good approximation in this example, however, because B/S is large. The significance was estimated here with a simple cut-and-count method: Level of background estimated from region x > 3. Counts in signal region x < 3 added up. The excess in the signal region over the estimated background is divided by the square root of the estimated background in the signal region. In this example, we assumed we knew the mean and width of the signal we were looking for, as well as the background shape. The uncertainty in the background estimate is negligible in this example. A more sophisticated fit may yield a more powerful test by incorporating the known shape of the signal. 6 Frank Porter, BaBar analysis school Statistics, February 2008

7 What about Systematic Uncertainties? B(e + e Nobel prize) = 10 ± 1 ± 5. They may be important! Maybethe±5is a systematic uncertainty in the estimate of the background expectation. A 10σ statistical significance is really only a 2σ effect. They may be irrelevant Maybethe±5is a systematic uncertainty on the efficiency, entering as a multiplicative factor. It makes no difference to the significance whether the result is 10 ± 1 or 5 ± 0.5. They may be fuzzy E.g., how is the background expectation estimated? What is the sampling distribution? E.g., Are theoretical uncertainties present? 7 Frank Porter, BaBar analysis school Statistics, February 2008

8 Systematic Uncertainties Blind checks and educated checks : Blind check: testing for mistakes; no correction is expected. If pass test, no contribution to systematic error. Eg, divide data into chronological subsets and compare results. Educated check: measuring biases, corrections. May affect quoted result. Always contributes to systematic error. Eg, dependence of efficiency on model. Quote systematic uncertainty separately from statistical Systematic uncertainty may contain statistical components, eg, MC statistics in evaluation of efficiency. 8 Frank Porter, BaBar analysis school Statistics, February 2008

9 Systematic Uncertainties Example: D mixing and DCSD revisited Want simple procedure. Willing to accept approximation. Scale statistical-only contour uniformly along ray from best-fit value. Factoris 1+ m 2 i,wherem i is an estimate of the effect of systematic uncertainty i measured in units of the statistical uncertainty. This estimate is obtained by determining the effect of the systematic uncertainty on x 2, ŷ. y Physical (x, y ) 2 Central (x, y ) No CPV 95% CL CPV allowed 95% CL CPV allowed, stat only 95% CL CP conserved 95% CL CP conserved, stat only Method conservative ( lazy) in sense that scaling for a given systematic in one direction is applied uniformly in all directions. On the other hand, a linear approximation is being made. x 2 / Frank Porter, BaBar analysis school Statistics, February 2008

10 Aside: Significance as nσ HEP parlance is to say an effect has, e.g., 5σ significance. At face value, this means the observation is 5 standard deviations away from the mean: σ (x x) 2. But we often don t really mean this. Note that a 5σ effect of this sort may not be improbable: P(x) 0.8 P ( x x =5σ) =20%! x 10 Frank Porter, BaBar analysis school Statistics, February 2008

11 Aside: Significance as nσ (continued) Instead, we often mean that the probability (P -value) for the effect is given by the probability of a fluctuation in a normal distribution 5σ from the mean, i.e., (two-tailed probability). P = P ( x > 5), for x N (0, 1) = But sometimes we really do mean 5σ, usually presuming that the sampling distribution is approximately normal. [This may not be an accurate presumption when far out in the tails!] Also now popular to call 2Δ ln L the n in nσ. [ From: L 0 (θ = 0;x) = 1 exp 1 2πσ 2 giving 2Δ ln L = Δχ 2 = x/σ = n. ( x/σ ) 2 ], L max ( θ = x; x) = 1 2πσ, Desirable to be more concise by quoting probabilities, or P -values as is common in the statistics world. At least say what you mean! 11 Frank Porter, BaBar analysis school Statistics, February 2008

12 Estimating significance: Pitfalls What are the dangers? In a nutshell: Unknown or unknowable sampling distributions Ways to not know the distribution: The Improbable Tails Systematic Unknowns The Stopping Problem The exploratory Bump Hunt 12 Frank Porter, BaBar analysis school Statistics, February 2008

13 The Improbable Tails Our earlier example was known to be normal sampling. Events/ Generated signal = 100 events Cut&Count signal = 194 +_ 39 events Significance = 5.0 Expected significance = x Often, this is true approximately (central limit theorem). But for significance, often interested in distribution far into the tails. The normal approximation may be very bad here! If there is any doubt, need to compute the actual distribution. Typically this is done with a toy Monte Carlo to simulate the distribution of the significance statistic. To get to the tails, this may require a fair amount of computing time. Still need to be wary of pushing calculation beyond its validity as a model of the actual distribution. BaBar (and others) now routinely performs these calculations. 13 Frank Porter, BaBar analysis school Statistics, February 2008

14 Nuisance parameters Systematic Unknowns Unknown, but relevant parameters. Estimated somehow, but with some uncertainty. Even if sampling distribution is known, cannot in general derive exact P -values in lower dimensional parameter space. Central Limit Theorem is our friend. Can try other values besides best estimate of nuisance parameter. See our discussion of confidence intervals. Theoretical systematic uncertainties. Guesses, no sampling distribution. Use worst case values when evaluating significance. requires understanding what is meant by the theory errors. Or, give the dependence, e.g., as a range. 14 Frank Porter, BaBar analysis school Statistics, February 2008

15 The Stopping Problem There is a strong tendency to work on an analysis until we are convinced that we got it right, then we stop. Simple example: Keep sampling until we are satisfied. Motivate our example: Ample historical evidence that experimental measurements are sometimes biased by some preconception of what the answer should be. For example, a preconception could be based on the result of another experiment, or on some theoretical prejudice. A model for such a biased experiment is that the experimenter works hard until s/he gets the expected result, and then quits. Let s Consider a simple example of a distribution which could result from such a scenario. 15 Frank Porter, BaBar analysis school Statistics, February 2008

16 Stopping Problem: Normal likelihood function example Consider an experiment in which a measurement of a parameter θ corresponds to sampling from a Gaussian distribution of standard deviation one: N (x; θ, 1)dx = 1 2π e (x θ)2 /2 dx. Frequency θ 2 θ θ+2 x Suppose the experimenter has a prejudice that θ is greater than one. Subconsciously, s/he makes measurements until the sample mean, m = 1 n ni=1 x i, is greater than one, or until s/he becomes convinced (or tired) after a maximum of N measurements. The experimenter then uses the sample mean, m, toestimateθ. 16 Frank Porter, BaBar analysis school Statistics, February 2008

17 Stopping Problem: Normal likelihood function example For illustration, assume that N =2. In terms of the random variables m and n, thepdfis: f(m, n; θ) = 1 2π e 1 2 (m θ)2, n =1,m>1 0, n =1,m<1 1 1 π e (m θ)2 e (x m)2 dx n =2 Number of experiments Histogram of sampling distribution for m, withpdf given by above equation, for θ = Sample mean The likelihood function, as a function of θ, has the shape of a normal distribution, given any experimental result. The peak is at θ = m, som is the maximum likelihood estimator for θ. 17 Frank Porter, BaBar analysis school Statistics, February 2008

18 Stopping Problem: Normal likelihood function example In spite of the normal form of the likelihood function, the sample mean is not sampled from a normal distribution. The 4σ tail is more probable (for some θ) than the experimenter thinks Probability( θ < m - 4s ) Straight line is probability for a 4 σ fluctuation of a normal distrinbution θ Frank Porter, BaBar analysis school Statistics, February 2008

19 Stopping Problem: Normal likelihood function example The likelihood function, as a function of θ, has the shape of a normal distribution, given any experimental result. The peak is at θ = m, som is the maximum likelihood estimator for θ. In spite of the normal form of the likelihood function, the sample mean is not sampled from a normal distribution. The interval defined by where the likelihood function falls by e 1/2 does not correspond to a 68% CI: 0.75 Probability (θ - <θ<θ + ) θ 19 Frank Porter, BaBar analysis school Statistics, February 2008

20 Stopping Problem: Normal likelihood function example The experimenter in this scenario thinks s/he is taking n samples from a normal distribution, and makes probability statements (e.g., about significance) according to a normal distribution. S/he gets an erroneous result because of the mistake in the distribution. If the experimenter realizes that sampling was actually from a nonnormal distribution, s/he can do a more careful analysis to obtain more valid results. Related effect: First observations tend to be biased high. Example? BES early f D = 371 vs f D 220 MeV more recently. Nothing wrong with this, but best estimates should average in earlier null data. Importance of reporting null results and quoting combinable results, e.g., two-sided confidence intervals. 20 Frank Porter, BaBar analysis school Statistics, February 2008

21 The Exploratory Bump Hunt Context is the typical bump-hunting approach in HEP That is, analysis is developed concurrent with looking at the data We see a bump, how significant is it? Events / 20 MeV/c BaBar e + e γ ISR π + π J/ψ P (H 0 ) m(π + π - J/ψ) (GeV/c ) If the analysis is our typical bump-hunt approach, it is impossible to compute the significance! We don t know the sampling distribution (under null hypothesis). Hence, we cannot compute probabilities (under null hypothesis, in particular). 21 Frank Porter, BaBar analysis school Statistics, February 2008

22 Recall our earlier example: Bump Hunting Example I Flat normal background (avg 100/bin) + gaussian signal (100 events) with mean 0, σ = Events/ Generated signal = 100 events Cut&Count signal = 194 +_ 39 events Significance = 5.0 Expected significance = x Distribution is normal here, so P = (two-tail). Note: H 0 is flat distribution with no signal. H 1 is N sig 0. As long as we knew at the outset that we were looking for an effect with mean 0 and σ =1, we make correct inferences. 22 Frank Porter, BaBar analysis school Statistics, February 2008

23 Looking for a mass bump. Bump Hunting Example II 160 Evidence for a threshold enhancement! 140 Events/10 MeV Background level = 100 events/bin Signal = 40 +_ 12 events Statistical significance = 4.0σ One-tailed probability (P-value) = _ 5 X _ m(pp) - 2m(p) (MeV) 23 Frank Porter, BaBar analysis school Statistics, February 2008

24 So, what does it mean? Even though we cannot estimate probabilities, we still quote numbers intended to relay an idea of significance, usually quoted as some number of sigma. What we (usually) mean by this is: If I had done the analysis in a controlled manner, and had been interested in the observed value for the mean and width, then the null hypothesis would require a fluctuation of this number of standard deviations of a Gaussian distribution to produce a bump as large as I see 24 Frank Porter, BaBar analysis school Statistics, February 2008

25 Is it useful? With this understanding of the meaning, it is perhaps not completely useless, since we can interpret it in the context of our experience. However, our experience is that we are sometimes fooled! pentaquarks example Life is short, we have also made great discoveries with this approach. Just remember that quoted significances are highly misleading. Note: The threshold enhancement in Example II is a statistical fluctuation! The sampling distribution was the same N (100, 10) for all bins. 160 Evidence for a threshold enhancement! 140 Events/10 MeV Background level = 100 events/bin Signal = 40 +_ 12 events Statistical significance = 4.0σ One-tailed probability (P-value) = _ 5 X _ m(pp) - 2m(p) (MeV) 25 Frank Porter, BaBar analysis school Statistics, February 2008

26 Avoiding pitfalls Model those tails! Conservatism Based on our experience with the failings of our methodology, we don t claim new discoveries lightly. Do it better: Be Blind 26 Frank Porter, BaBar analysis school Statistics, February 2008

27 Itcanbedone better Better means with more meaningful significance estimates, and avoidance of bias. Do a blind analysis Blind means you design the experiment (analysis) before you look at the results. Goal is to know the sampling distribution. BaBar routinely blinds it s analyses, when there is a well-defined quantity (e.g., CP asymmetry or branching fraction) being measured. 27 Frank Porter, BaBar analysis school Statistics, February 2008

28 Blinding Methodologies There are several basic approaches to blinding an analysis, which may be chosen according to the problem. Many variations on these themes! Don t look inside the box You can look, but keep the answer hidden Obscure the real data (e.g., adding simulated signal to data) Design on a dataset that will be thrown away Divide and conquer (e.g., BNL muon g 2 result, ω p (magnetic field) and ω a (muon precession) were analyzed independently, before combining to obtain g 2; arxiv:hep v3) [Reference: Klein & Roodman, Ann.Rev.Nucl.Part.Sci. 55 (2005) 141.] 28 Frank Porter, BaBar analysis school Statistics, February 2008

29 Blind Analysis Many BaBar results are obtained in blind analyses. avoid introduction of bias. Purpose is to Not all: Exploratory analyses not blind, eg, discovery of the DsJ (2317)+. Example of a blind analysis: B ± K ± e + e After event selection, look for signal in distribution of two kinematic variables ΔE and m ES. Monte Carlo, control sample (usually a type of data resembling signal) studies of entire sideband and fit region. Data only looked at in large sideband region prior to unblinding. Δ E (GeV) B + K + e + e - Signal MC signal region large sideband fit region mes (GeV/c 2 ) Data after unblinding 29 Frank Porter, BaBar analysis school Statistics, February 2008

30 Blind Analysis (continued) Issue: Updating a blind analysis with additional data. Simply add data, no change in analysis. May be impractical, undesirable, eg, re-reconstruction of entire dataset; improvements in tools such as particle identification. Sometimes would like to work harder to optimize analysis, or optimize on different criterion (eg, for most precision instead of most sensitivity). Notion of reblinding and re-optimization. Considered safe to use variables which have not been inspected too carefully in blind region. Not called a blind analysis. Pure approach: Do a blind analysis only on the new dateset. OK to use whatever was learned from the old dataset. Can combine the results from the old and the new data. 30 Frank Porter, BaBar analysis school Statistics, February 2008

31 Blind Analysis Hiding the answer In this approach, no data is hidden, just the answer (typically, a number). For example, a hidden, possibly random, offset may be applied to the real answer to prevent it from being seen before the analysis is designed. Δt(Hidden) = ( ) 1 1 (TAG)Δt + Offset, Top: Not hidden; Bottom: Hidden BaBar, from Klein& Roodman, op. cit. where TAG = ±1 according to the B flavor tag. This algorithm permits viewing the decay time distributions without revealing the asymmetry. 31 Frank Porter, BaBar analysis school Statistics, February 2008

32 Blinding a bump hunt No different in principle than our other blind analyses. The more you know (about the signal you are interested in), the better you can do. Yes, it is harder. It takes discipline, and it might even cost sensitivity. But it is at least worth thinking about. 32 Frank Porter, BaBar analysis school Statistics, February 2008

33 What cuts do I make? Again, the more we know, the better off we are! If we have good simulation of background processes, or background data, that s great! We ll use it. The less we know, the bigger the risk that we won t be optimal. That s life. Being non-optimal doesn t mean wrong. One approach: Sacrifice a portion of data for cut optimization. Ignore any signals. Or tune on them... If signal is real, it should show up in remaining blinded data. If it is a fluctuation, it will disappear. Events/10 MeV Evidence for a threshold enhancement! Background level = 100 events/bin Signal = 40 +_ 12 events Statistical significance = 4.0σ One-tailed probability (P-value) = _ 5 X _ m(pp) - 2m(p) (MeV) Systematic effects will show up both places. But that s another matter. 33 Frank Porter, BaBar analysis school Statistics, February 2008

34 Significance/GOF: Counting Degrees of Freedom The following situation arises with some frequency (with variations): I do two fits to the same dataset (say a histogram with N bins): Fit A has n A parameters, with χ 2 A [or perhaps 2lnL A]. Fit B has a subset n B of the parameters in fit A, withχ 2 B,wherethe n A n B other parameters (call them θ) are fixed at zero. What is the distribution of χ 2 B χ2 A? See SWiG thread: Luc Demortier in: [this is recommended reading] 34 Frank Porter, BaBar analysis school Statistics, February 2008

35 Counting Degrees of Freedom (continued) In the asymptotic limit (that is, as long as the normal sampling distribution is a valid approximation), Δχ 2 χ 2 B χ2 A is the same as a likelihood ratio ( 2lnλ) statistic for the test: H 0 : θ =0 against H 1 : some θ = 0 Δχ 2 is distributed according to a χ 2 (NDOF = n A n B ) distribution as long as: Parameter estimates in λ are consistent (converge to the correct values) under H 0. Parameter values in the null hypothesis are interior points of the maintained hypothesis (union of H 0 and H 1 ). There are no additional nuisance parameters under the alternative hypothesis. 35 Frank Porter, BaBar analysis school Statistics, February 2008

36 Counting Degrees of Freedom (continued) A commonly-encountered situation violates these requirements: We compare fits with and without a signal component to estimate significance of the signal. If the signal fit has, e.g., a parameter for mass, this constitutes an additional nuisance parameter under the alternative hypothesis. If the fit for signal constrains the yield to be non-negative, this violates the interior point requirement. Frequency H0: Fit with no signal H1: Float signal yield Curve: χ 2 (1 dof) Frequency H0: Fit with no signal H1: Float signal yield, constrained _> 0 Curve: χ 2 (1 dof) Frequency H0: Fit with no signal H1: Float signal yield and location Blue: χ 2 (1 dof) Red: χ 2 (2 dof) Δχ (H0-H1) Δχ (H0-H1) Δχ (H0-H1) 36 Frank Porter, BaBar analysis school Statistics, February 2008

37 Counting Degrees of Freedom - Sample fits H0 fit H1 fit H0 fit H1 fit Counts/bin H1 fit Any signal Fix mean H1 fit Counts/bin H1 fit Any signal Fix mean H1 fit Signal>=0 Fix mean x Any signal Float mean x Signal>=0 Fix mean x Any signal Float mean x Fits to data with no signal. Fits to data with a signal. 37 Frank Porter, BaBar analysis school Statistics, February 2008

38 Significance Conclusions Evaluation of significance is a hypothesis test. It is essentially the same problem as evaluating confidence intervals. Except for the more obvious role played by improbable tails. Pitfalls amount to (not) knowing the sampling distribution. Techniques exist to avoid pitfalls: Simulating the sampling distribution Vary the nuisance/theory parameters Blind your experiment Doing it properly requires patience and discipline; the benefit is a more meaningful, convincing result to yourself and to others. 38 Frank Porter, BaBar analysis school Statistics, February 2008

39 Goodness-of-Fit No perfect general goodness-of-fit test: Given a dataset generated under null hypothesis, can usually find a test which rejects the null hypothesis (ie, choosing the test after you see the data is dangerous). Given a dataset generated under alternative hypothesis, can usually find a test for which the null passes (ie, should think about what you want to test for). Nominal recommendation: If you have a specific question, test for that. χ 2 test when valid. Consider more general likelihood ratio test, Kolmogorov-Smirnov, etc., otherwise. Monte Carlo evaluation of distribution of test statistic. 39 Frank Porter, BaBar analysis school Statistics, February 2008

40 Goodness-of-Fit (continued) But recognize when test may not answer desired question, eg, in sin 2β analysis, a likelihood ratio (or a χ 2 ) test on the time distribution may have little sensitivity to testing goodness-of-fit of the asymmetry. Entries / 0.6 ps B 0 tags B 0 tags Background 50 Raw Asymmetry Δt (ps) 40 Frank Porter, BaBar analysis school Statistics, February 2008

41 Consistency of two correlated results BaBar has encountered several times the question of whether a new analysis is consistent with an old analysis. Often, new analysis is a combination of additional data plus changed (improved...) analysis of original data. The stickiest issue is handling the correlation in testing for consistency in the overlapping data. People sometimes have difficulty understanding that statistical differences can arise even comparing results based on the same events. Given a sampling θ 1, θ 2 from a bivariate normal distribution N(θ, σ 1,σ 2,ρ), with θ 1 = θ 2 = θ, the difference Δθ θ 2 θ 1 is N(0,σ)-distributed with σ 2 = σ σ2 2 2ρσ 1σ 2. If the correlation is unknown, all we can say is that the variance of the difference is in the range (σ 1 σ 2 ) 2...(σ 1 + σ 2 ) 2. If we at least believe ρ 0 then the maximum variance of the difference is σ1 2 + σ Frank Porter, BaBar analysis school Statistics, February 2008

42 Consistency Simple example of two analyses on same events Suppose we measure a neutrino mass, m, in a sample of n = 10 independent events. The measurements are x i,i =1,...,10. Assume the sampling distribution for x i is N(m, σ i ). We may form unbiased estimator, m 1,form: m 1 = 1 n ni=1 x i ± 1 n 2 ni=1 σ 2 i. The result (from a MC) is m 1 =0.058 ± Then we notice that we have some further information which might be useful: we know the experimental resolutions, σ i for each measurement. We form another unbiased estimator, m 2,form: m 2 = ni=1 x i σ 2 i ni=1 1 σ 2 i ± 1 ni=1 1 σ 2 i. The result (from the same MC) is m 1 =0.000 ± Frank Porter, BaBar analysis school Statistics, February 2008

43 Example continued The results are certainly correlated, so question of consistency arises (we know the error on the difference is between and 0.055). In this example, the difference between the results is ± 0.036, wherethe error includes the correlation (ρ =0.41). 43 Frank Porter, BaBar analysis school Statistics, February 2008

44 Consistency Evaluating the Correlation Art Snyder developed an approximate formula for evaluating the correlation in a comparison of maximum likelihood analyses (eg, in one-dimensional case). Suppose we perform two maximum likelihood analysis, with event likelihoods L 1, L 2,on the same set of events [nb, may use different information in each analysis]. The results are estimators θ 1, θ 2 for parameter θ. The correlation coefficient ρ may be estimated according to: ρ ( N N i=1 R i d ln L 1i i=1 dθ d ln L 2i θ= θ1 dθ θ= θ2 d 2 ln L 1i dθ 2 θ=θ0 )( N i=1 d 2 ln L 2i dθ 2 θ=θ0 ), where (θ 0 is an expansion reference point) R i = [ 1 ( θ 1 θ 0 ) d2 ln L 1i dθ 2 If θ 0 θ 1 θ 2,then / ][ d ln L1i θ=θ0 dθ θ= θ0 1 ( θ 2 θ 0 ) d2 ln L 2i dθ 2 / ] d ln L2i θ=θ0 dθ θ= θ0. ρ σ θ1 σ θ2 N i=1 d ln L 1i dθ d ln L 2i θ= θ0 dθ θ= θ0, where σ θ 2 k 1/ ( ) 2 N dlki i=1 dθ θ=θ Frank Porter, BaBar analysis school Statistics, February 2008

45 Consistency Example: sin 2β B B pairs PRL, vol 87, 27 August 2001: sin 2β =0.59 ± 0.14(stat) ± 0.05(syst) B B pairs SLAC-PUB-9153, March 2002: sin 2β =0.75 ± 0.09(stat) ± 0.04(syst) Second result includes the earlier data, re-reconstructed. Analysis involves multivariate maximum likelihood fits; reprocessing changes, eg, relative likelihood for an event to be signal or background. Not simply counting events. Question: are the two results statistically consistent? If these were independent data sets, a difference of 0.16 ± 0.17 would not be a worry. The issue is the correlation. A specialized analysis deriving from the previous formula is performed on the events in common between the two analyses. A correlation of ρ =0.87 is deduced, yielding a difference of 2.2σ. 45 Frank Porter, BaBar analysis school Statistics, February 2008

Frank Porter February 26, 2013

Frank Porter February 26, 2013 116 Frank Porter February 26, 2013 Chapter 6 Hypothesis Tests Often, we want to address questions such as whether the possible observation of a new effect is really significant, or merely a chance fluctuation.

More information

Statistical Issues in Particle Physics A View from BaBar

Statistical Issues in Particle Physics A View from BaBar PHYSTAT3, SLAC, Stanford, California, September 8-, 3 Statistical Issues in Particle Physics A View from BaBar F. C. Porter (For the BaBar collaboration) Lauritsen Laboratory of Physics, Caltech, Pasadena,

More information

Statistical Data Analysis Stat 3: p-values, parameter estimation

Statistical Data Analysis Stat 3: p-values, parameter estimation Statistical Data Analysis Stat 3: p-values, parameter estimation London Postgraduate Lectures on Particle Physics; University of London MSci course PH4515 Glen Cowan Physics Department Royal Holloway,

More information

Statistics. Lecture 4 August 9, 2000 Frank Porter Caltech. 1. The Fundamentals; Point Estimation. 2. Maximum Likelihood, Least Squares and All That

Statistics. Lecture 4 August 9, 2000 Frank Porter Caltech. 1. The Fundamentals; Point Estimation. 2. Maximum Likelihood, Least Squares and All That Statistics Lecture 4 August 9, 2000 Frank Porter Caltech The plan for these lectures: 1. The Fundamentals; Point Estimation 2. Maximum Likelihood, Least Squares and All That 3. What is a Confidence Interval?

More information

Goodness-of-Fit Pitfalls and Power Example: Testing Consistency of Two Histograms

Goodness-of-Fit Pitfalls and Power Example: Testing Consistency of Two Histograms Goodness-of-Fit Pitfalls and Power Example: Testing Consistency of Two Histograms Sometimes we have two histograms and are faced with the question: Are they consistent? That is, are our two histograms

More information

Evidence for D 0 - D 0 mixing. Marko Starič. J. Stefan Institute, Ljubljana, Slovenia. March XLII Rencontres de Moriond, La Thuile, Italy

Evidence for D 0 - D 0 mixing. Marko Starič. J. Stefan Institute, Ljubljana, Slovenia. March XLII Rencontres de Moriond, La Thuile, Italy Evidence for D - D mixing - D mixing (page 1) Introduction Mixing observed in K, Bd and B s (26), not yet in D system D mixing in the SM governed by box diagrams Effective GIM suppression mixing in D system

More information

Measurement of time dependent CP violation in B φks. Mahalaxmi Krishnamurthy University of Tennessee. BaBar Collaboration Beauty 2003

Measurement of time dependent CP violation in B φks. Mahalaxmi Krishnamurthy University of Tennessee. BaBar Collaboration Beauty 2003 Measurement of time dependent CP violation in B φks Mahalaxmi Krishnamurthy University of Tennessee BaBar Collaboration Beauty 2003 CMU, Pittsburgh 1 b-> sss Decays CP violation from interference of decay

More information

Physics 509: Bootstrap and Robust Parameter Estimation

Physics 509: Bootstrap and Robust Parameter Estimation Physics 509: Bootstrap and Robust Parameter Estimation Scott Oser Lecture #20 Physics 509 1 Nonparametric parameter estimation Question: what error estimate should you assign to the slope and intercept

More information

Search for exotic charmonium states

Search for exotic charmonium states Search for exotic charmonium states on behalf of the BABAR and BELLE Collaborations August 25th, 2014 (*), Forschungszentrum Jülich (Germany) (Germany) 1 (*) Previously addressed at JGU University of Mainz

More information

Hypothesis testing. Chapter Formulating a hypothesis. 7.2 Testing if the hypothesis agrees with data

Hypothesis testing. Chapter Formulating a hypothesis. 7.2 Testing if the hypothesis agrees with data Chapter 7 Hypothesis testing 7.1 Formulating a hypothesis Up until now we have discussed how to define a measurement in terms of a central value, uncertainties, and units, as well as how to extend these

More information

Search for heavy neutrinos in kaon decays

Search for heavy neutrinos in kaon decays Search for heavy neutrinos in kaon decays L. Littenberg (work mainly done by A.T.Shaikhiev INR RAS) HQL-2016 Outline Motivation Previous heavy neutrino searches Experiment BNL-E949 Selection criteria Efficiency

More information

Relative branching ratio measurements of charmless B ± decays to three hadrons

Relative branching ratio measurements of charmless B ± decays to three hadrons LHCb-CONF-011-059 November 10, 011 Relative branching ratio measurements of charmless B ± decays to three hadrons The LHCb Collaboration 1 LHCb-CONF-011-059 10/11/011 Abstract With an integrated luminosity

More information

Statistics. Lecture 2 August 7, 2000 Frank Porter Caltech. The Fundamentals; Point Estimation. Maximum Likelihood, Least Squares and All That

Statistics. Lecture 2 August 7, 2000 Frank Porter Caltech. The Fundamentals; Point Estimation. Maximum Likelihood, Least Squares and All That Statistics Lecture 2 August 7, 2000 Frank Porter Caltech The plan for these lectures: The Fundamentals; Point Estimation Maximum Likelihood, Least Squares and All That What is a Confidence Interval? Interval

More information

CP Violation in the B(s) meson system at LHCb Julian Wishahi on behalf of the LHCb collaboration

CP Violation in the B(s) meson system at LHCb Julian Wishahi on behalf of the LHCb collaboration CP Violation in the B(s) meson system at Julian Wishahi on behalf of the collaboration 5th Rencontres de Moriond, Electroweak Session, 2th of March 215 CPV in Interference of Mixing/Decay interference

More information

Studies of CP Violation at BABAR

Studies of CP Violation at BABAR SLAC-PUB-878 BABAR-PROC-/32 hep-ex/1336 February, 21 Studies of CP Violation at BABAR Anders Ryd 1 California Institute of Technology, 356-48, Pasadena CA 91125 (for the BABAR Collaboration) Abstract BABAR

More information

E. Santovetti lesson 4 Maximum likelihood Interval estimation

E. Santovetti lesson 4 Maximum likelihood Interval estimation E. Santovetti lesson 4 Maximum likelihood Interval estimation 1 Extended Maximum Likelihood Sometimes the number of total events measurements of the experiment n is not fixed, but, for example, is a Poisson

More information

Physics 6720 Introduction to Statistics April 4, 2017

Physics 6720 Introduction to Statistics April 4, 2017 Physics 6720 Introduction to Statistics April 4, 2017 1 Statistics of Counting Often an experiment yields a result that can be classified according to a set of discrete events, giving rise to an integer

More information

Hypothesis testing I. - In particular, we are talking about statistical hypotheses. [get everyone s finger length!] n =

Hypothesis testing I. - In particular, we are talking about statistical hypotheses. [get everyone s finger length!] n = Hypothesis testing I I. What is hypothesis testing? [Note we re temporarily bouncing around in the book a lot! Things will settle down again in a week or so] - Exactly what it says. We develop a hypothesis,

More information

Fitting a Straight Line to Data

Fitting a Straight Line to Data Fitting a Straight Line to Data Thanks for your patience. Finally we ll take a shot at real data! The data set in question is baryonic Tully-Fisher data from http://astroweb.cwru.edu/sparc/btfr Lelli2016a.mrt,

More information

Discovery and Significance. M. Witherell 5/10/12

Discovery and Significance. M. Witherell 5/10/12 Discovery and Significance M. Witherell 5/10/12 Discovering particles Much of what we know about physics at the most fundamental scale comes from discovering particles. We discovered these particles by

More information

ωω: J. Albert Caltech March 24, 2006

ωω: J. Albert Caltech March 24, 2006 ωω: Search for B 0 ωφ and B 0 ωω Are CP and polarization in loops consistent Are CP and polarization in loops consistent with the Standard Model? with the Standard Model? March 24, 2006 J. Albert Caltech

More information

arxiv: v1 [hep-ex] 14 Sep 2015

arxiv: v1 [hep-ex] 14 Sep 2015 WSU HEP XXYY September 5, 5 arxiv:59.3v [hep-ex] Sep 5 Search for CP violation in D K + K, π + π and D π π Tara Nanut Department for Experimental Particle Physics Jozef Stefan Institute, Jamova 39, Ljubljana,

More information

Discovery searches for light new physics with BaBar

Discovery searches for light new physics with BaBar SLAC-PUB-1548 Discovery searches for light new physics with BaBar Neus Lopez-March BABAR Collaboration E-mail: neus.lopezmarch@epfl.ch The BABAR experiment collected large samples of events during the

More information

Measurements of Leptonic B Decays from BaBar

Measurements of Leptonic B Decays from BaBar Measurements of Leptonic B Decays from BaBar Gregory Dubois-Felsmann Caltech for the BaBar collaboration ICHEP 2004 Heavy Quark Mesons and Baryons session Overview Motivations for studying leptonic B decays

More information

E (GeV) E (GeV) E (GeV) Entries/2.5 MeV/c

E (GeV) E (GeV) E (GeV) Entries/2.5 MeV/c SLAC-PUB-8731 BABAR-PROC-/4 hep-ex/1237 December, 2 The BABAR Measurement of sin2 and its Future Prospects James Weatherall University of Manchester Department of Physics and Astronomy, University of Manchester,

More information

Recall the Basics of Hypothesis Testing

Recall the Basics of Hypothesis Testing Recall the Basics of Hypothesis Testing The level of significance α, (size of test) is defined as the probability of X falling in w (rejecting H 0 ) when H 0 is true: P(X w H 0 ) = α. H 0 TRUE H 1 TRUE

More information

Journeys of an Accidental Statistician

Journeys of an Accidental Statistician Journeys of an Accidental Statistician A partially anecdotal account of A Unified Approach to the Classical Statistical Analysis of Small Signals, GJF and Robert D. Cousins, Phys. Rev. D 57, 3873 (1998)

More information

Lecture 2. G. Cowan Lectures on Statistical Data Analysis Lecture 2 page 1

Lecture 2. G. Cowan Lectures on Statistical Data Analysis Lecture 2 page 1 Lecture 2 1 Probability (90 min.) Definition, Bayes theorem, probability densities and their properties, catalogue of pdfs, Monte Carlo 2 Statistical tests (90 min.) general concepts, test statistics,

More information

B. Hoeneisen. Universidad San Francisco de Quito Representing the DØ Collaboration Flavor Physics and CP violation (2013)

B. Hoeneisen. Universidad San Francisco de Quito Representing the DØ Collaboration Flavor Physics and CP violation (2013) Status report on the like-sign dimuon charge asymmetry in p p collisions and Measurement of the direct CP asymmetry in B J/ψK and B J/ψπ decays B. Hoeneisen Universidad San Francisco de Quito Representing

More information

Lecture 3. G. Cowan. Lecture 3 page 1. Lectures on Statistical Data Analysis

Lecture 3. G. Cowan. Lecture 3 page 1. Lectures on Statistical Data Analysis Lecture 3 1 Probability (90 min.) Definition, Bayes theorem, probability densities and their properties, catalogue of pdfs, Monte Carlo 2 Statistical tests (90 min.) general concepts, test statistics,

More information

Review of Statistics 101

Review of Statistics 101 Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods

More information

Higgs and Z τ + τ in CMS

Higgs and Z τ + τ in CMS Higgs and Z τ + τ in CMS Christian Veelken for the CMS Collaboration Moriond EWK Conference, March 14 th 2011 Z τ + τ - Production @ 7 TeV τ + Z τ - CMS Measurement of Z/γ* l + l -, l = e/µ: σ BR(Z/γ*

More information

Systematic uncertainties in statistical data analysis for particle physics. DESY Seminar Hamburg, 31 March, 2009

Systematic uncertainties in statistical data analysis for particle physics. DESY Seminar Hamburg, 31 March, 2009 Systematic uncertainties in statistical data analysis for particle physics DESY Seminar Hamburg, 31 March, 2009 Glen Cowan Physics Department Royal Holloway, University of London g.cowan@rhul.ac.uk www.pp.rhul.ac.uk/~cowan

More information

Search for b Ø bz. CDF note Adam Scott, David Stuart UCSB. 1 Exotics Meeting. Blessing

Search for b Ø bz. CDF note Adam Scott, David Stuart UCSB. 1 Exotics Meeting. Blessing Search for b Ø bz CDF note 8465 Adam Scott, David Stuart UCSB Exotics Meeting Blessing 1 Exotics Meeting Analysis in a Nutshell Looking for new particles decaying to Z+jets Select Z s in the dielectron

More information

Chapter 27 Summary Inferences for Regression

Chapter 27 Summary Inferences for Regression Chapter 7 Summary Inferences for Regression What have we learned? We have now applied inference to regression models. Like in all inference situations, there are conditions that we must check. We can test

More information

new measurements of sin(2) & cos(2) at BaBar

new measurements of sin(2) & cos(2) at BaBar new measurements of sin(2) & cos(2) at BaBar, UC Irvine For the BaBar collaboration ICHEP24 August 16th, Beijing bruinsma@slac.stanford.edu Decay rates of B mesons 2 Time-dependent rates for B (f + ) or

More information

Constructing Ensembles of Pseudo-Experiments

Constructing Ensembles of Pseudo-Experiments Constructing Ensembles of Pseudo-Experiments Luc Demortier The Rockefeller University, New York, NY 10021, USA The frequentist interpretation of measurement results requires the specification of an ensemble

More information

, (1) e i = ˆσ 1 h ii. c 2016, Jeffrey S. Simonoff 1

, (1) e i = ˆσ 1 h ii. c 2016, Jeffrey S. Simonoff 1 Regression diagnostics As is true of all statistical methodologies, linear regression analysis can be a very effective way to model data, as along as the assumptions being made are true. For the regression

More information

Measurement of ϕ 2 /α using

Measurement of ϕ 2 /α using Measurement of ϕ 2 /α using Decays at elle and aar lexander Somov CKM 6, Nagoya 26 Introduction (CP violation in decays Measurements in φ 2 constraint from an Isospin analysis CP Violation in decays Tree

More information

Please bring the task to your first physics lesson and hand it to the teacher.

Please bring the task to your first physics lesson and hand it to the teacher. Pre-enrolment task for 2014 entry Physics Why do I need to complete a pre-enrolment task? This bridging pack serves a number of purposes. It gives you practice in some of the important skills you will

More information

Charm Baryon Studies at BABAR

Charm Baryon Studies at BABAR W.Mader@Physik.TU-Dresden.de Institut für Kern- und Teilchenphysik Technische Universität Dresden Institutsseminar IKTP 15. Juni 006 Outline 1 Introduction The BABAR Detector and PEP-II 3 Reconstruction

More information

Quantitative Introduction ro Risk and Uncertainty in Business Module 5: Hypothesis Testing

Quantitative Introduction ro Risk and Uncertainty in Business Module 5: Hypothesis Testing Quantitative Introduction ro Risk and Uncertainty in Business Module 5: Hypothesis Testing M. Vidyasagar Cecil & Ida Green Chair The University of Texas at Dallas Email: M.Vidyasagar@utdallas.edu October

More information

appstats27.notebook April 06, 2017

appstats27.notebook April 06, 2017 Chapter 27 Objective Students will conduct inference on regression and analyze data to write a conclusion. Inferences for Regression An Example: Body Fat and Waist Size pg 634 Our chapter example revolves

More information

Results on Tau Physics from BaBar

Results on Tau Physics from BaBar Results on Tau Physics from BaBar Eric Torrence University of Oregon Search for Lepton Flavor Violation Study of τ 3h 2h + ν τ (5 prong) Tau Lifetime The BaBar Experiment Integrated Luminosity (fb -1 )

More information

CP violation in B 0 π + π decays in the BABAR experiment. Muriel Pivk, CERN. 22 March 2004, Lausanne

CP violation in B 0 π + π decays in the BABAR experiment. Muriel Pivk, CERN. 22 March 2004, Lausanne CP violation in B π + π decays in the BABAR experiment, 22 March 24, The BABAR experiment at SLAC 2 Phenomenology 3 The h + h analysis 4 A new tool: s Plots 5 Results 6 Interpretation of the results 7

More information

Introduction to Algorithms / Algorithms I Lecturer: Michael Dinitz Topic: Intro to Learning Theory Date: 12/8/16

Introduction to Algorithms / Algorithms I Lecturer: Michael Dinitz Topic: Intro to Learning Theory Date: 12/8/16 600.463 Introduction to Algorithms / Algorithms I Lecturer: Michael Dinitz Topic: Intro to Learning Theory Date: 12/8/16 25.1 Introduction Today we re going to talk about machine learning, but from an

More information

Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institute of Technology, Kharagpur

Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institute of Technology, Kharagpur Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institute of Technology, Kharagpur Lecture No. # 38 Goodness - of fit tests Hello and welcome to this

More information

Statistical Methods in Particle Physics

Statistical Methods in Particle Physics Statistical Methods in Particle Physics Lecture 11 January 7, 2013 Silvia Masciocchi, GSI Darmstadt s.masciocchi@gsi.de Winter Semester 2012 / 13 Outline How to communicate the statistical uncertainty

More information

STAT 536: Genetic Statistics

STAT 536: Genetic Statistics STAT 536: Genetic Statistics Tests for Hardy Weinberg Equilibrium Karin S. Dorman Department of Statistics Iowa State University September 7, 2006 Statistical Hypothesis Testing Identify a hypothesis,

More information

Bayesian Inference for Normal Mean

Bayesian Inference for Normal Mean Al Nosedal. University of Toronto. November 18, 2015 Likelihood of Single Observation The conditional observation distribution of y µ is Normal with mean µ and variance σ 2, which is known. Its density

More information

Statistical Methods for Particle Physics Lecture 1: parameter estimation, statistical tests

Statistical Methods for Particle Physics Lecture 1: parameter estimation, statistical tests Statistical Methods for Particle Physics Lecture 1: parameter estimation, statistical tests http://benasque.org/2018tae/cgi-bin/talks/allprint.pl TAE 2018 Benasque, Spain 3-15 Sept 2018 Glen Cowan Physics

More information

Semileptonic B decays at BABAR

Semileptonic B decays at BABAR at (University of Oregon) representing collaboration March 4, 4 1.5 excluded area has

More information

Why I never believed the Tevatron Higgs sensitivity claims for Run 2ab Michael Dittmar

Why I never believed the Tevatron Higgs sensitivity claims for Run 2ab Michael Dittmar Why I never believed the Tevatron Higgs sensitivity claims for Run 2ab Michael Dittmar 18.03.09 Why I never believed the Tevatron Higgs sensitivity claims for Run 2ab Michael Dittmar 18.03.09 How to judge

More information

How to find a Higgs boson. Jonathan Hays QMUL 12 th October 2012

How to find a Higgs boson. Jonathan Hays QMUL 12 th October 2012 How to find a Higgs boson Jonathan Hays QMUL 12 th October 2012 Outline Introducing the scalar boson Experimental overview Where and how to search Higgs properties Prospects and summary 12/10/2012 2 The

More information

Advanced Statistical Methods. Lecture 6

Advanced Statistical Methods. Lecture 6 Advanced Statistical Methods Lecture 6 Convergence distribution of M.-H. MCMC We denote the PDF estimated by the MCMC as. It has the property Convergence distribution After some time, the distribution

More information

Lecture 21: October 19

Lecture 21: October 19 36-705: Intermediate Statistics Fall 2017 Lecturer: Siva Balakrishnan Lecture 21: October 19 21.1 Likelihood Ratio Test (LRT) To test composite versus composite hypotheses the general method is to use

More information

Matthew Charles, University of Iowa

Matthew Charles, University of Iowa Charm Decays in Matthew Charles, University of Iowa Representing the Collaboration The 3nd International Conference on High Energy Physics Beijing August 16, 4 Introduction Search for D l + l [11-964]

More information

Why Try Bayesian Methods? (Lecture 5)

Why Try Bayesian Methods? (Lecture 5) Why Try Bayesian Methods? (Lecture 5) Tom Loredo Dept. of Astronomy, Cornell University http://www.astro.cornell.edu/staff/loredo/bayes/ p.1/28 Today s Lecture Problems you avoid Ambiguity in what is random

More information

Computing Likelihood Functions for High-Energy Physics Experiments when Distributions are Defined by Simulators with Nuisance Parameters

Computing Likelihood Functions for High-Energy Physics Experiments when Distributions are Defined by Simulators with Nuisance Parameters Computing Likelihood Functions for High-Energy Physics Experiments when Distributions are Defined by Simulators with Nuisance Parameters Radford M. Neal Dept. of Statistics, University of Toronto Abstract

More information

Chapter 26: Comparing Counts (Chi Square)

Chapter 26: Comparing Counts (Chi Square) Chapter 6: Comparing Counts (Chi Square) We ve seen that you can turn a qualitative variable into a quantitative one (by counting the number of successes and failures), but that s a compromise it forces

More information

Binary Logistic Regression

Binary Logistic Regression The coefficients of the multiple regression model are estimated using sample data with k independent variables Estimated (or predicted) value of Y Estimated intercept Estimated slope coefficients Ŷ = b

More information

Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur

Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur Lecture No. # 36 Sampling Distribution and Parameter Estimation

More information

Achilles: Now I know how powerful computers are going to become!

Achilles: Now I know how powerful computers are going to become! A Sigmoid Dialogue By Anders Sandberg Achilles: Now I know how powerful computers are going to become! Tortoise: How? Achilles: I did curve fitting to Moore s law. I know you are going to object that technological

More information

Rare Hadronic B Decays

Rare Hadronic B Decays XLI st Rencontres de Moriond QCD and High-Energy Hadronic Interactions La Thuile, Italy, March 18-5, 6 Rare Hadronic B Decays Jürgen Kroseberg Santa Cruz Institute for Particle Physics University of California,

More information

arxiv: v1 [hep-ex] 25 Aug 2015

arxiv: v1 [hep-ex] 25 Aug 2015 Anti-neutrino oscillations with TK arxiv:508.05v [hep-ex] 5 Aug 05 Université de Genève (H E-mail: melody.ravonel@cern.ch TK is a long-baseline neutrino oscillation experiment, in which a muon neutrino

More information

Goodness-of-Fit Considerations and Comparisons Example: Testing Consistency of Two Histograms

Goodness-of-Fit Considerations and Comparisons Example: Testing Consistency of Two Histograms Goodness-of-Fit Considerations and Comparisons Example: Testing Consistency of Two Histograms Sometimes we have two histograms and are faced with the question: Are they consistent? That is, are our two

More information

Testing Research and Statistical Hypotheses

Testing Research and Statistical Hypotheses Testing Research and Statistical Hypotheses Introduction In the last lab we analyzed metric artifact attributes such as thickness or width/thickness ratio. Those were continuous variables, which as you

More information

Introduction to the Terascale: DESY 2015

Introduction to the Terascale: DESY 2015 Introduction to the Terascale: DESY 2015 Analysis walk-through exercises EXERCISES Ivo van Vulpen Exercise 0: Root and Fitting basics We start with a simple exercise for those who are new to Root. If you

More information

p-values and Discovery

p-values and Discovery p-values and Discovery Louis Lyons Oxford l.lyons@physics.ox.ac.uk SLUO Lecture 4, February 2007 1 2 TOPICS Discoveries H0 or H0 v H1 p-values: For Gaussian, Poisson and multi-variate data Goodness of

More information

Part I. Experimental Error

Part I. Experimental Error Part I. Experimental Error 1 Types of Experimental Error. There are always blunders, mistakes, and screwups; such as: using the wrong material or concentration, transposing digits in recording scale readings,

More information

Recent results and prospects of exotic hadrons at B-factory

Recent results and prospects of exotic hadrons at B-factory Recent results and prospects of exotic hadrons at B-factory XiaoLong Wang Virginia Tech & Belle Collaboration XVI International Conference on Hadron Spectroscopy Jefferson Lab, Newport News, Virginia 9/13

More information

Inferring from data. Theory of estimators

Inferring from data. Theory of estimators Inferring from data Theory of estimators 1 Estimators Estimator is any function of the data e(x) used to provide an estimate ( a measurement ) of an unknown parameter. Because estimators are functions

More information

Rare Exclusive Decays. Results from BaBar on the. DPF-2002, College of William and Mary. University of California, Jeffrey Berryhill.

Rare Exclusive Decays. Results from BaBar on the. DPF-2002, College of William and Mary. University of California, Jeffrey Berryhill. Results from BaBar on the Rare Exclusive Decays B K l + l and B K* l + l Jeffrey Berryhill University of California, Santa Barbara DPF2002, College of William and Mary May 24, 2002 B K (*) l + l in the

More information

LECTURE 15: SIMPLE LINEAR REGRESSION I

LECTURE 15: SIMPLE LINEAR REGRESSION I David Youngberg BSAD 20 Montgomery College LECTURE 5: SIMPLE LINEAR REGRESSION I I. From Correlation to Regression a. Recall last class when we discussed two basic types of correlation (positive and negative).

More information

Mathematical Statistics

Mathematical Statistics Mathematical Statistics MAS 713 Chapter 8 Previous lecture: 1 Bayesian Inference 2 Decision theory 3 Bayesian Vs. Frequentist 4 Loss functions 5 Conjugate priors Any questions? Mathematical Statistics

More information

Physics 403. Segev BenZvi. Credible Intervals, Confidence Intervals, and Limits. Department of Physics and Astronomy University of Rochester

Physics 403. Segev BenZvi. Credible Intervals, Confidence Intervals, and Limits. Department of Physics and Astronomy University of Rochester Physics 403 Credible Intervals, Confidence Intervals, and Limits Segev BenZvi Department of Physics and Astronomy University of Rochester Table of Contents 1 Summarizing Parameters with a Range Bayesian

More information

Solution: chapter 2, problem 5, part a:

Solution: chapter 2, problem 5, part a: Learning Chap. 4/8/ 5:38 page Solution: chapter, problem 5, part a: Let y be the observed value of a sampling from a normal distribution with mean µ and standard deviation. We ll reserve µ for the estimator

More information

Recent results on charmless B and B s decays from Belle

Recent results on charmless B and B s decays from Belle ICHEP 2016 Recent results on charmless B and B s decays from Belle Kamal Jyoti Nath IIT Guwahati, India On behalf of Belle Collaboration August 4, 2016 Kamal J Nath ICHEP 2016 August 4, 2016 1 / 14 Outline

More information

arxiv: v3 [hep-ex] 11 Feb 2013

arxiv: v3 [hep-ex] 11 Feb 2013 q/p Measurement from B 0 D lν Partial Reconstruction arxiv:1301.0417v3 [hep-ex] 11 Feb 2013 Martino Margoni on behalf of the BaBar Collaboration Università di Padova and INFN sezione di Padova Padova,

More information

Physics 509: Error Propagation, and the Meaning of Error Bars. Scott Oser Lecture #10

Physics 509: Error Propagation, and the Meaning of Error Bars. Scott Oser Lecture #10 Physics 509: Error Propagation, and the Meaning of Error Bars Scott Oser Lecture #10 1 What is an error bar? Someone hands you a plot like this. What do the error bars indicate? Answer: you can never be

More information

Nicolas Berger SLAC, Menlo Park, California, U.S.A.

Nicolas Berger SLAC, Menlo Park, California, U.S.A. SLAC-PUB-11414 August 25 Frascati Physics Series Vol. VVVVVV (xxxx), pp. - DAΦNE 24: Physics at meson factories Frascati, June. 7-11, 24 Selected Contribution in Plenary Session INCLUSIVE HADRONIC RESULTS

More information

Dalitz Plot Analyses of B D + π π, B + π + π π + and D + s π+ π π + at BABAR

Dalitz Plot Analyses of B D + π π, B + π + π π + and D + s π+ π π + at BABAR Proceedings of the DPF-9 Conference, Detroit, MI, July 7-3, 9 SLAC-PUB-98 Dalitz Plot Analyses of B D + π π, B + π + π π + and D + s π+ π π + at BABAR Liaoyuan Dong (On behalf of the BABAR Collaboration

More information

Statistical Methods for Particle Physics (I)

Statistical Methods for Particle Physics (I) Statistical Methods for Particle Physics (I) https://agenda.infn.it/conferencedisplay.py?confid=14407 Glen Cowan Physics Department Royal Holloway, University of London g.cowan@rhul.ac.uk www.pp.rhul.ac.uk/~cowan

More information

Statistics for the LHC Lecture 1: Introduction

Statistics for the LHC Lecture 1: Introduction Statistics for the LHC Lecture 1: Introduction Academic Training Lectures CERN, 14 17 June, 2010 indico.cern.ch/conferencedisplay.py?confid=77830 Glen Cowan Physics Department Royal Holloway, University

More information

Department of Physics and Astronomy, University of California, Riverside, CA, USA

Department of Physics and Astronomy, University of California, Riverside, CA, USA Recent results on charmless B meson decays from Babar Department of Physics and Astronomy, University of California, Riverside, CA, 92521 USA E-mail: bill.gary@ucr.edu Recent results from the Babar experiment

More information

STA Module 10 Comparing Two Proportions

STA Module 10 Comparing Two Proportions STA 2023 Module 10 Comparing Two Proportions Learning Objectives Upon completing this module, you should be able to: 1. Perform large-sample inferences (hypothesis test and confidence intervals) to compare

More information

Hypothesis Testing. 1 Definitions of test statistics. CB: chapter 8; section 10.3

Hypothesis Testing. 1 Definitions of test statistics. CB: chapter 8; section 10.3 Hypothesis Testing CB: chapter 8; section 0.3 Hypothesis: statement about an unknown population parameter Examples: The average age of males in Sweden is 7. (statement about population mean) The lowest

More information

Density Estimation. We are concerned more here with the non-parametric case (see Roger Barlow s lectures for parametric statistics)

Density Estimation. We are concerned more here with the non-parametric case (see Roger Barlow s lectures for parametric statistics) Density Estimation Density Estimation: Deals with the problem of estimating probability density functions (PDFs) based on some data sampled from the PDF. May use assumed forms of the distribution, parameterized

More information

Is there evidence for a peak in this data?

Is there evidence for a peak in this data? Is there evidence for a peak in this data? 1 Is there evidence for a peak in this data? Observation of an Exotic S=+1 Baryon in Exclusive Photoproduction from the Deuteron S. Stepanyan et al, CLAS Collab,

More information

An overview of applied econometrics

An overview of applied econometrics An overview of applied econometrics Jo Thori Lind September 4, 2011 1 Introduction This note is intended as a brief overview of what is necessary to read and understand journal articles with empirical

More information

Discovery significance with statistical uncertainty in the background estimate

Discovery significance with statistical uncertainty in the background estimate Glen Cowan, Eilam Gross ATLAS Statistics Forum 8 May, 2008 Discovery significance with statistical uncertainty in the background estimate Introduction In a search for a new type of event, data samples

More information

Statistics and Data Analysis

Statistics and Data Analysis Statistics and Data Analysis The Crash Course Physics 226, Fall 2013 "There are three kinds of lies: lies, damned lies, and statistics. Mark Twain, allegedly after Benjamin Disraeli Statistics and Data

More information

Physics 403. Segev BenZvi. Choosing Priors and the Principle of Maximum Entropy. Department of Physics and Astronomy University of Rochester

Physics 403. Segev BenZvi. Choosing Priors and the Principle of Maximum Entropy. Department of Physics and Astronomy University of Rochester Physics 403 Choosing Priors and the Principle of Maximum Entropy Segev BenZvi Department of Physics and Astronomy University of Rochester Table of Contents 1 Review of Last Class Odds Ratio Occam Factors

More information

Chapter 9. Non-Parametric Density Function Estimation

Chapter 9. Non-Parametric Density Function Estimation 9-1 Density Estimation Version 1.2 Chapter 9 Non-Parametric Density Function Estimation 9.1. Introduction We have discussed several estimation techniques: method of moments, maximum likelihood, and least

More information

V cb. Determination of. and related results from BABAR. Masahiro Morii, Harvard University on behalf of the BABAR Collaboration

V cb. Determination of. and related results from BABAR. Masahiro Morii, Harvard University on behalf of the BABAR Collaboration Determination of and related results from BABAR V cb Masahiro Morii, Harvard University on behalf of the BABAR Collaboration V cb from inclusive B semileptonic decays Lepton energy moments Hadron mass

More information

Hadronic Charm Decays: Experimental Review

Hadronic Charm Decays: Experimental Review Hadronic Charm Decays: Experimental Review Featuring results from Peter Onyisi ICHEP, 1 Aug 2008 Peter Onyisi Hadronic Charm Decays: Experimental Review ICHEP, 1 Aug 2008 1 What Can We Learn? Hadronic

More information

Modern Methods of Data Analysis - SS 2009

Modern Methods of Data Analysis - SS 2009 Modern Methods of Data Analysis Lecture X (2.6.09) Contents: Frequentist vs. Bayesian approach Confidence Level Re: Bayes' Theorem (1) Conditional ( bedingte ) probability Examples: Rare Disease (1) Probability

More information

Time-dependent CP violation

Time-dependent CP violation Time-dependent CP violation experimental results and prospects Paul Seyfert on behalf of the collaboration INFN Milano Bicocca 13th October 216 Paul Seyfert (INFN MIB) time dependent CPV implications workshop

More information

Measuring CP violation in. B s φφ with LHCb. Jim Libby (University of Oxford) for the LHCb collaboration. 14/12/2006 CKM 2006 Nagoya 1

Measuring CP violation in. B s φφ with LHCb. Jim Libby (University of Oxford) for the LHCb collaboration. 14/12/2006 CKM 2006 Nagoya 1 Measuring CP violation in B s φφ with LHCb Jim Libby (University of Oxford) for the LHCb collaboration 4//6 CKM 6 Nagoya Outline Motivation for the study B s φφ Brief introduction to LHCb Expected signal

More information

Chapter Three. Hypothesis Testing

Chapter Three. Hypothesis Testing 3.1 Introduction The final phase of analyzing data is to make a decision concerning a set of choices or options. Should I invest in stocks or bonds? Should a new product be marketed? Are my products being

More information