Negative binomial distribution and multiplicities in p p( p) collisions

Similar documents
Statistical Data Analysis Stat 3: p-values, parameter estimation

arxiv: v1 [hep-ph] 22 Apr 2018

Two Early Exotic searches with dijet events at ATLAS

Search for Quark Substructure in 7 TeV pp Collisions with the ATLAS Detector

Statistical Methods for Particle Physics Lecture 3: Systematics, nuisance parameters

Summary of Columbia REU Research. ATLAS: Summer 2014

Physics 736. Experimental Methods in Nuclear-, Particle-, and Astrophysics. - Statistics and Error Analysis -

Measurements of net-particle fluctuations in Pb-Pb collisions at ALICE

arxiv: v1 [hep-ph] 28 Mar 2013

Distinguishing quark and gluon jets at the LHC

Statistical Data Analysis Stat 5: More on nuisance parameters, Bayesian methods

Discovery Potential for the Standard Model Higgs at ATLAS

Modern Methods of Data Analysis - WS 07/08

Primer on statistics:

Parameter Estimation and Fitting to Data

Discovery significance with statistical uncertainty in the background estimate

Three-component multiplicity distribution, oscillation of combinants and properties of clans in pp collisions at the LHC

FYST17 Lecture 8 Statistics and hypothesis testing. Thanks to T. Petersen, S. Maschiocci, G. Cowan, L. Lyons

Introductory Statistics Course Part II

Lecture 2. G. Cowan Lectures on Statistical Data Analysis Lecture 2 page 1

Federico Antinori (INFN Padova, Italy) on behalf of the ALICE Collaboration

Muon reconstruction performance in ATLAS at Run-2

Contact the LHC

Quarkonium results in pp collisions from ALICE

Systematic uncertainties in statistical data analysis for particle physics. DESY Seminar Hamburg, 31 March, 2009

Kristjan Gulbrandsen NBI ALICE/Discovery group

Statistics for the LHC Lecture 2: Discovery

Statistical Methods for Particle Physics Lecture 3: systematic uncertainties / further topics

arxiv: v1 [hep-ex] 21 Aug 2011

Asymptotic formulae for likelihood-based tests of new physics

Statistical Methods for Astronomy

Statistical Methods for Particle Physics Lecture 1: parameter estimation, statistical tests

Statistical Methods for Discovery and Limits in HEP Experiments Day 3: Exclusion Limits

Statistical Methods in Particle Physics Lecture 2: Limits and Discovery

The Compact Muon Solenoid Experiment. Conference Report. Mailing address: CMS CERN, CH-1211 GENEVA 23, Switzerland

Some Statistical Tools for Particle Physics

Multijet in Neutral Current Deep Inelastic Scattering

Use Tag & Probe method to measure electron efficiency

Inclusive top pair production at Tevatron and LHC in electron/muon final states

32. STATISTICS. 32. Statistics 1

arxiv: v1 [hep-ex] 9 Jan 2019

Statistics for the LHC Lecture 1: Introduction

Luminosity measurement and K-short production with first LHCb data. Sophie Redford University of Oxford for the LHCb collaboration

Hypothesis testing: theory and methods

arxiv: v2 [hep-ex] 8 Aug 2013

Statistical Methods in Particle Physics

1 The pion bump in the gamma reay flux

Statistical Methods for Particle Physics Lecture 2: statistical tests, multivariate methods

Lecture 5. G. Cowan Lectures on Statistical Data Analysis Lecture 5 page 1

Top Quark Production at the LHC. Masato Aoki (Nagoya University, Japan) For the ATLAS, CMS Collaborations

Statistical Methods for Particle Physics Lecture 4: discovery, exclusion limits

October 4, :33 ws-rv9x6 Book Title main page 1. Chapter 1. Measurement of Minimum Bias Observables with ATLAS

Neutral particles energy spectra for 900 GeV and 7 TeV p-p collisions, measured by the LHCf experiment

Analysis and discussion of the recent W mass measurements

The photon PDF from high-mass Drell Yan data at the LHC

Physics 403. Segev BenZvi. Classical Hypothesis Testing: The Likelihood Ratio Test. Department of Physics and Astronomy University of Rochester

STAT 461/561- Assignments, Year 2015

32. STATISTICS. 32. Statistics 1

Lecture 10: Generalized likelihood ratio test

arxiv: v1 [nucl-ex] 29 Sep 2014

Lecture 3. G. Cowan. Lecture 3 page 1. Lectures on Statistical Data Analysis

Results from combined CMS-TOTEM data

The Compact Muon Solenoid Experiment. Conference Report. Mailing address: CMS CERN, CH-1211 GENEVA 23, Switzerland

Measurement of W-boson Mass in ATLAS

Discovery and Significance. M. Witherell 5/10/12

Global polarization of Λ and Λ hyperons in Pb Pb collisions at s NN = 2.76 TeV

Massimo Venaruzzo for the ALICE Collaboration INFN and University of Trieste

Fluctuations and Search for the QCD Critical Point

PoS(IHEP-LHC-2011)008

Statistics Ph.D. Qualifying Exam: Part I October 18, 2003

Review. December 4 th, Review

Recent Heavy Flavors results from Tevatron. Aleksei Popov (Institute for High Energy Physics, Protvino) on behalf of the CDF and DØ Collaborations

B. Hoeneisen. Universidad San Francisco de Quito Representing the DØ Collaboration Flavor Physics and CP violation (2013)

Theory of Statistical Tests

Statistics and Data Analysis

Massimo Venaruzzo for the ALICE Collaboration INFN and University of Trieste

FERMI NATIONAL ACCELERATOR LABORATORY

Detection of Z Gauge Bosons in the Di-muon Decay Mode in CMS

PoS(CHARM2016)074. Searches for CPV in D + decays at LHCb

Measurement of Properties of Electroweak Bosons with the DØ Detector

arxiv: v1 [hep-ex] 18 May 2015

Performance Evaluation and Comparison

Momentum Correlations in Nuclear Collisions

On the double-ridge effect at the LHC

arxiv: v1 [nucl-ex] 14 Oct 2013

arxiv: v1 [hep-ex] 30 Nov 2009

Statistical Models with Uncertain Error Parameters (G. Cowan, arxiv: )

QCD at the Tevatron: The Production of Jets & Photons plus Jets

Final Examination Statistics 200C. T. Ferguson June 11, 2009

The Compact Muon Solenoid Experiment. Conference Report. Mailing address: CMS CERN, CH-1211 GENEVA 23, Switzerland. Rare B decays at CMS

FIRST MEASUREMENTS OF PROTON-PROTON ELASTIC SCATTERING AND TOTAL CROSS-SECTION AT THE LHC BY TOTEM

Error analysis for efficiency

Recent QCD results from ATLAS

Hypothesis Testing - Frequentist

arxiv: v1 [hep-ex] 27 Nov 2017

Charged jets in p Pb collisions measured with the ALICE detector

Statistics Challenges in High Energy Physics Search Experiments

PoS(ICHEP2012)300. Electroweak boson production at LHCb

Analysis of Top Quarks Using a Kinematic Likelihood Method

arxiv: v3 [physics.data-an] 24 Jun 2013

Transcription:

Negative binomial distribution and multiplicities in p p( p) collisions Institute of Theoretical Physics University of Wroc law Zakopane June 12, 2011

Summary s are performed for the hypothesis that charged-particle multiplicity distributions measured in the limited pseudo-rapidity windows of p p( p) collisions at s = 0.9 and 2.36 TeV are negative binomial. indicate that the hypothesis should be rejected in all cases of ALICE-LHC measurements, whereas should be accepted in the corresponding cases of UA5 data. Possible explanations of that and of the disagreement with the least-squares method are given. based on: DP, arxiv:1101.0787 [hep-ph]

Motivation The fitted quantity is a probability distribution function (p.d.f.), so the most natural way is to use the maximum likelihood (ML) method, where the likelihood function is constructed directly from the tested p.d.f.. Because of Wilks s Theorem one can define a statistic, the distribution of which converges to a χ 2 distribution as the number of measurements goes to infinity. Thus for the large sample the goodness-of-fit can be expressed as a p-value computed with the corresponding χ 2 distribution. The most commonly used method, the least-squares method (LS) (called also χ 2 minimization), has the disadvantage of providing only the qualitative measure of the significance of the fit. Only if observables are represented by Gaussian random variables with known variances, the conclusion about the goodness-of-fit equivalent to that mentioned in the first point can be derived.

Negative binomial distribution P (n; p, k) = k(k + 1)(k + 2)...(k + n 1) (1 p) n p k n! 0 p 1, k is a positive real number n = 0, 1, 2,... - the number of charged particles in an event n = k(1 p) p, V (n) = k(1 p) p 2.

The maximum likelihood method For N events in a sample there are N measurements of N ch, say X = (X 1, X 2,..., X N ). L(X p, k) = N j=1 P (X j ; p, k) The values ˆp and ˆk for which L(X p, k) has its maximum are the maximum likelihood (ML) estimators of parameters p and k. The log-likelihood function ln L(X p, k) = N j=1 ln P (X j ; p, k)

Maximization of the log-likelihood function N p ln L(X p, k) = j=1 N k ln L(X p, k) = j=1 p ln P (X j; p, k) = 0 k ln P (X j; p, k) = 0 For NBD the upper equation gives n = N ch = 1 p = N ch + 1 k

- Wilks s theorem X - a random variable with p.d.f f(x, θ), which depends on parameters θ = (θ 1, θ 2,..., θ d ) Θ, Θ is an open set in R d. X = (X 1,..., X N ) - a sample of N independent observations of X H 0 - a k-dimensional subset of Θ, k < d. The maximum likelihood ratio: λ = max θ H 0 L(X θ) max θ Θ L(X θ) If the hypothesis H 0 is true, i.e. it is true that θ H 0, then the distribution of the statistic 2 ln λ converges to a χ 2 distribution with d k degrees of freedom as N.

2-in-1 χ 2 function Let define the function: χ 2 (X θ) θ H0 = 2 ln L(X θ) max θ Θ L(X θ ) The minimum of χ 2 with respect to θ H 0 is at ˆθ - the ML estimators. The test statistic χ 2 min = χ2 (X ˆθ) has a χ 2 distribution in the large sample limit.

p-value of the test statistic The probability of obtaining the value of the test statistic equal to or greater then the value just obtained for the present data set (i.e. χ 2 min ), when repeating the whole experiment many times: p = P (χ 2 χ 2 min; n dof ) = χ 2 min f(z; n dof )dz, f(z; n dof ) - the χ 2 p.d.f. n dof = d k - the number of degrees of freedom

2-in-1 χ 2 function for binned data Let divide the sample X = (X 1, X 2,..., X N ) into m bins defined by the number of measured charged particles {0, 1, 2, 3,..., m 1} and with n i entries in the ith bin, N = m i=1 n i. χ 2 = 2 ln λ = 2 m i=1 n i ln n i ν i ν i = N P (i 1; p, k) Details in: G. Cowan, Statistical data analysis, (Oxford University Press, Oxford, 1998)

2-in-1 χ 2 function for binned data, cont. χ 2 (p, k) = 2 m i=1 n i ln n i ν i = 2 N m i=1 Pi ex P (i 1; p, k) ln Pi ex P ex i = n i /N - the experimental probability (frequency) This χ 2 function depends explicitly on the number of events in the sample! But does not depend on actual experimental errors!

The χ 2 function of the least-squares method The sum of squares of normalized residuals: χ 2 LS(p, k) = m i=1 (P ex i P (i 1; p, k)) 2 err 2 i err i - the uncertainty of the ith measurement NOT MINIMIZED HERE!!! but χ 2 LS = χ 2 LS(ˆp, ˆk) ˆp, ˆk - ML estimators of parameters p and k

Sources of data UA5 Collaboration: R. E. Ansorge et al., Z. Phys. C 43, 357 (1989) ALICE Collaboration: K. Aamodt et al., Eur. Phys. J. C 68, 89 (2010) of the LS analysis of ALICE data available in: T.Mizoguchi, M. Biyajima, Eur. Phys. J. C 70, 1061 (2010)

Tests of NBD for UA5 and ALICE data at s = 0.9 TeV χ 2 /n dof p-value χ 2 LS /n dof with errors: Case N event [%] quad. sum stat. ni /N ev UA5 8550.0 0.21 99.998 0.07 na na 0.20 η < 0.5 ALICE 149663.2 14.5 0 0.73 0.38 2.46 15.1 η < 0.5 ALICE 128476.5 36.9 0 1.72 0.95 11.0 38.0 η < 1.0 ALICE 60142.8 24.3 0 2.21 1.28 15.2 25.8 η < 1.3 UA5 8550.0 1.1 28.9 0.36 na na 1.14 η < 1.5

Tests of NBD for ALICE data at s = 2.36 TeV χ 2 /n dof p-value χ 2 LS /n dof with errors: Case N event [%] quad. sum stat. ni /N ev ALICE 38970.79 7.0 0 0.76 0.43 3.8 7.5 η < 0.5 ALICE 37883.99 18.5 0 2.29 1.36 18.8 20.3 η < 1.0 ALICE 22189.40 18.2 0 4.25 2.60 39.6 20.0 η < 1.3

ALICE data at s = 0.9 TeV with UA5 N event χ 2 /n dof p-value χ 2 LS /n dof with errors: Case N event [%] quad. sum stat. ni /N ev UA5 8550.0 0.21 99.998 0.07 na na 0.20 η < 0.5 ALICE 8550.0 0.83 70.4 0.73 0.38 2.46 0.86 η < 0.5 ALICE 8550.0 2.45 5 10 5 1.72 0.95 11.0 2.53 η < 1.0 ALICE 8550.0 3.46 7 10 13 2.21 1.28 15.2 3.66 η < 1.3 UA5 8550.0 1.1 28.9 0.36 na na 1.14 η < 1.5

1 of the likelihood ratio tests suggest that the hypothesis about the NBD of charged-particle multiplicities measured by the ALICE Collaboration in limited pseudo-rapidity windows of proton-proton collisions at s = 0.9 and 2.36 TeV should be rejected. 2 The significant systematic errors of ALICE data are the reasons for acceptable values of the LS test statistic for the narrowest pseudo-rapidity window cases. 3 The size of the sample is very important for the validation of a hypothesis about the p.d.f. of an observable. If the hypothesis is true, the distribution is exact for the whole population. Thus for the very large samples (as in all ALICE cases) the measured distribution should be very close to that postulated. How close is controlled by the size of the sample (discrepancies n i /N event ), not by the size of errors, rather.