arxiv: v1 [physics.data-an] 26 Oct 2012

Similar documents
4. Score normalization technical details We now discuss the technical details of the score normalization method.

Notes on Instrumental Variables Methods

STA 250: Statistics. Notes 7. Bayesian Approach to Statistics. Book chapters: 7.2

Estimation of the large covariance matrix with two-step monotone missing data

Morten Frydenberg Section for Biostatistics Version :Friday, 05 September 2014

CHAPTER 5 STATISTICAL INFERENCE. 1.0 Hypothesis Testing. 2.0 Decision Errors. 3.0 How a Hypothesis is Tested. 4.0 Test for Goodness of Fit

Tests for Two Proportions in a Stratified Design (Cochran/Mantel-Haenszel Test)

On split sample and randomized confidence intervals for binomial proportions

Paper C Exact Volume Balance Versus Exact Mass Balance in Compositional Reservoir Simulation

arxiv: v3 [physics.data-an] 23 May 2011

Hidden Predictors: A Factor Analysis Primer

A Comparison between Biased and Unbiased Estimators in Ordinary Least Squares Regression

Using the Divergence Information Criterion for the Determination of the Order of an Autoregressive Process

Uncorrelated Multilinear Principal Component Analysis for Unsupervised Multilinear Subspace Learning

Solved Problems. (a) (b) (c) Figure P4.1 Simple Classification Problems First we draw a line between each set of dark and light data points.

Information collection on a graph

An Improved Calibration Method for a Chopped Pyrgeometer

Radial Basis Function Networks: Algorithms

Evaluating Circuit Reliability Under Probabilistic Gate-Level Fault Models

CHAPTER-II Control Charts for Fraction Nonconforming using m-of-m Runs Rules

Information collection on a graph

The Poisson Regression Model

RANDOM WALKS AND PERCOLATION: AN ANALYSIS OF CURRENT RESEARCH ON MODELING NATURAL PROCESSES

Churilova Maria Saint-Petersburg State Polytechnical University Department of Applied Mathematics

System Reliability Estimation and Confidence Regions from Subsystem and Full System Tests

MATH 2710: NOTES FOR ANALYSIS

State Estimation with ARMarkov Models

Feedback-error control

Robustness of classifiers to uniform l p and Gaussian noise Supplementary material

Monte Carlo Studies. Monte Carlo Studies. Sampling Distribution

Convex Optimization methods for Computing Channel Capacity

Introduction to Probability and Statistics

Performance of lag length selection criteria in three different situations

Characterizing the Behavior of a Probabilistic CMOS Switch Through Analytical Models and Its Verification Through Simulations

Statics and dynamics: some elementary concepts

dn i where we have used the Gibbs equation for the Gibbs energy and the definition of chemical potential

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

8.7 Associated and Non-associated Flow Rules

Chapter 7 Sampling and Sampling Distributions. Introduction. Selecting a Sample. Introduction. Sampling from a Finite Population

arxiv:cond-mat/ v1 [cond-mat.stat-mech] 5 Jul 1998

CMSC 425: Lecture 4 Geometry and Geometric Programming

Pairwise active appearance model and its application to echocardiography tracking

Finite Mixture EFA in Mplus

Period-two cycles in a feedforward layered neural network model with symmetric sequence processing

Combining Logistic Regression with Kriging for Mapping the Risk of Occurrence of Unexploded Ordnance (UXO)

A Qualitative Event-based Approach to Multiple Fault Diagnosis in Continuous Systems using Structural Model Decomposition

Dortmund, Germany; 3 Institute Laue Langevin, Grenoble, France; 4 Palacky University, Olomouc, Czech Republic;

Wolfgang POESSNECKER and Ulrich GROSS*

Wave Drift Force in a Two-Layer Fluid of Finite Depth

arxiv:cond-mat/ v2 25 Sep 2002

Machine Learning: Homework 4

John Weatherwax. Analysis of Parallel Depth First Search Algorithms

A generalization of Amdahl's law and relative conditions of parallelism

Spectral Analysis by Stationary Time Series Modeling

Effective conductivity in a lattice model for binary disordered media with complex distributions of grain sizes

The non-stochastic multi-armed bandit problem

Fault Tolerant Quantum Computing Robert Rogers, Thomas Sylwester, Abe Pauls

START Selected Topics in Assurance

arxiv: v1 [hep-ex] 8 Jun 2017

Topic 7: Using identity types

A Bound on the Error of Cross Validation Using the Approximation and Estimation Rates, with Consequences for the Training-Test Split

Elementary Analysis in Q p

Estimating function analysis for a class of Tweedie regression models

8 STOCHASTIC PROCESSES

Asymptotically Optimal Simulation Allocation under Dependent Sampling

FUGACITY. It is simply a measure of molar Gibbs energy of a real gas.

Chapter 7 Rational and Irrational Numbers

Towards understanding the Lorenz curve using the Uniform distribution. Chris J. Stephens. Newcastle City Council, Newcastle upon Tyne, UK

Estimation of Separable Representations in Psychophysical Experiments

MODELING THE RELIABILITY OF C4ISR SYSTEMS HARDWARE/SOFTWARE COMPONENTS USING AN IMPROVED MARKOV MODEL

Applied Fitting Theory VI. Formulas for Kinematic Fitting

97.398*, Physical Electronics, Lecture 8. Diode Operation

Numerical Linear Algebra

A MIXED CONTROL CHART ADAPTED TO THE TRUNCATED LIFE TEST BASED ON THE WEIBULL DISTRIBUTION

The Noise Power Ratio - Theory and ADC Testing

Distributed Rule-Based Inference in the Presence of Redundant Information

Bayesian Spatially Varying Coefficient Models in the Presence of Collinearity

Performance of a First-Level Muon Trigger with High Momentum Resolution Based on the ATLAS MDT Chambers for HL-LHC

A SIMPLE PLASTICITY MODEL FOR PREDICTING TRANSVERSE COMPOSITE RESPONSE AND FAILURE

Unsupervised Hyperspectral Image Analysis Using Independent Component Analysis (ICA)

Deriving Indicator Direct and Cross Variograms from a Normal Scores Variogram Model (bigaus-full) David F. Machuca Mory and Clayton V.

Objectives. 6.1, 7.1 Estimating with confidence (CIS: Chapter 10) CI)

MULTIVARIATE STATISTICAL PROCESS OF HOTELLING S T CONTROL CHARTS PROCEDURES WITH INDUSTRIAL APPLICATION

Scaling Multiple Point Statistics for Non-Stationary Geostatistical Modeling

Meshless Methods for Scientific Computing Final Project

arxiv: v1 [quant-ph] 22 Apr 2017

Linear diophantine equations for discrete tomography

Elements of Asymptotic Theory. James L. Powell Department of Economics University of California, Berkeley

A PEAK FACTOR FOR PREDICTING NON-GAUSSIAN PEAK RESULTANT RESPONSE OF WIND-EXCITED TALL BUILDINGS

Modeling and Estimation of Full-Chip Leakage Current Considering Within-Die Correlation

Hotelling s Two- Sample T 2

Uncertainty Modeling with Interval Type-2 Fuzzy Logic Systems in Mobile Robotics

The Binomial Approach for Probability of Detection

MATHEMATICAL MODELLING OF THE WIRELESS COMMUNICATION NETWORK

Chapter 2 Introductory Concepts of Wave Propagation Analysis in Structures

Topology Optimization of Three Dimensional Structures under Self-weight and Inertial Forces

Developing A Deterioration Probabilistic Model for Rail Wear

ON POLYNOMIAL SELECTION FOR THE GENERAL NUMBER FIELD SIEVE

which is a convenient way to specify the piston s position. In the simplest case, when φ

ε(ω,k) =1 ω = ω'+kv (5) ω'= e2 n 2 < 0, where f is the particle distribution function and v p f v p = 0 then f v = 0. For a real f (v) v ω (kv T

Transcription:

Constraints on Yield Parameters in Extended Maximum Likelihood Fits Till Moritz Karbach a, Maximilian Schlu b a TU Dortmund, Germany, moritz.karbach@cern.ch b TU Dortmund, Germany, maximilian.schlu@cern.ch (Dated: October 29, 212) The method of extended maximum likelihood is a well known concet of arameter estimation. One can imlement external knowledge on the unknown arameters by multilying the likelihood by constraint terms. In this note, we emhasize that this is also true for yield arameters in an extended maximum likelihood fit, which is widely used in the article hysics community. We recommend a way to generate seudo-exeriments in resence of constraint terms on yield arameters, and oint to itfalls inside the RooFit framework. arxiv:121.7141v1 [hysics.data-an] 26 Oct 212 Keywords: extended maximum likelihood, constraints, toy Monte Carlo, RooFit I. INTRODUCTION The concet of extended maximum likelihood (EML) is widely used for arameter estimation in article hysics. It is described in [1], and we shall summarize its main features here. In EML, the total number of events is regarded as a free arameter. Its best value is determined by maximizing the likelihood function. The number of observed events follows a robability density function (PDF), tyically a Poisson PDF. In some situations, the observed number of events is not the most efficient estimator for the exected number of events. These situations occur when there is at least one free arameter (or a combination of arameters) that simultaneously changes both shae and normalization of the PDF. Then, an EML fit is suerior to a regular maximum likelihood (ML) fit. These genuine EML situations are labeled tye A, following the notation of [1]. The textbook examle of a tye A situation is that of an unknown signal over a known background of N b events. Suose both signal and background are described by unit Gaussian PDFs G(x; µ, σ = 1), then one ossible (non-normalized) total PDF is g(x) = N s G(x; µ 1 = ) + N b G(x; µ 2 =.5), (1) with N s being the only free arameter. Besides the genuine EML situation, there are also tye B EML situations (or bogus, following again [1]), where both EML and ML give equivalent results. This is the case when in Eq. 1 also N b is a free arameter. Then we can rewrite g(x) = N(fG(µ 1 = ) + (1 f)g(µ 2 =.5)), (2) with the total number of events N = N s + N b and the signal fraction f = N s /N. Now f controls only the shae of the PDF, while N controls only the normalization. It might still be beneficial to formulate a roblem using EML terms as in Eq. 1, even if it truly is a tye B roblem. This is because the ML notation from Eq. 2 quickly leads to less intuitive fraction arameters if more than one background comonent is resent, while the yields of Eq. 1 are interreted easily. The extended likelihood is formed by multilying the classical likelihood by a Poisson term, L(N, λ) = e N N N obs N obs! P( x i ; λ), (3) N obs i=1 where N and N obs are the number of exected and observed events, resectively, P is the total PDF, x is the vector of observables, λ is the vector of arameters to be estimated. The constant factorial term (N obs!) is usually omitted as it does not change the shae of ln L at its minimum. In the following we discuss how to include external constraints into the (extended) likelihood and review the effects of such terms. Then we describe a way to generate seudo ( toy ) exeriments, and demonstrate, that it will lead to unbiased results, if the correct ull statistic is chosen. We will show that this is still the case when constraints on yield arameters are resent. At last, we will oint out several itfalls that are resent in the toy exeriment tools of a current version of the RooFit framework. II. CONSTRAINTS If there is knowledge available on the true value of a fit arameter, we can incororate this knowledge into the fit rocedure. For examle, a revious exeriment might have already measured the arameter at hand, and we have access to their ublished result, say λ e ± σ λe. It is well known how to incororate such constraints into maximum likelihood fits. The full likelihood function is multilied by the constraint PDF C(λ) (where λ be a comonent of λ) L c = C(λ) L. (4) This holds also in the EML case, and also for constraints on yield arameters even though the likelihood is not Poissonian anymore in the total yield, but contains the roduct of a Poisson term in the total yield and a non- Poissonian constraint term in a comonent yield. Often

2 a Gaussian distribution is assumed for C(λ), 1 C(λ) = ex ( (λ e λ) 2 ) 2πσλe 2σ λe. (5) If more than one arameter is constrained, there can in rincile be an external correlation between them. This external correlation is different from the internal one. It can easily be accounted for by, for examle, relacing the single Gaussian of Eq. 5 by a multivariate one, ( ex ( λ e λ)ve 1 ( λ e ) λ) T C(λ) = (2π) l/2, (6) V e where V e is the known l l external covariance matrix. We now recall two effects of including a constraint term for arameter λ: they include external knowledge, and are a means of error roagation. Comared to a situation with a floating arameter and no constraint, including the constraint term will reduce the reorted error on this arameter. Suose that when λ is left floating without constraint, the result be λ u ± σ λu, and with the constraint term included it be λ c ±σ λc. If both the unconstrained likelihood and the constraint term are uncorrelated and Gaussian in λ, the likelihood fit is equivalent to the weighted average of λ u and λ e. Thus the error will be given by 1 σ 2 λ c = 1 σ 2 λ u + 1 σ 2 λ e (7) so that σ λc < σ λu. Constraint terms are also a means of error roagation. If the likelihood deends not only on the fit arameters, but also on arameters that are fixed, one may want to roagate the errors of the fixed arameters into the fit result. This can be done by including constraint terms in the fixed arameters, and letting the reviously fixed arameters float, too. If there are non-zero correlations between the reviously fixed and the floating arameters, the errors on the latter will increase, reflecting the roagated uncertainty on the reviously fixed arameters. The reorted errors on the reviously fixed arameters will in general be smaller than given by the constraint. This is because the dataset can also hold information on them. In addition to the above effects, constraint terms can also be incororated to hel the fit converge. When doing this, the errors are modified, for examle as indicated by Eq. 7. This might soil the interretation of the reorted fit errors as being statistical, if σ λe is not statistical and also of same order as σ λu. The effects of constraints described above are not limited to shae arameters. They also aly to normalization arameters such as the fraction arameters of Eq. 2 and the yield arameters of Eq. 1. But constraining fraction arameters is not equivalent to constraining yield arameters. If, for examle, we know the rate of a background rocess as a fraction of the rate of a control rocess, we should constrain this fraction. If, on the other hand, we know the absolute rate, we should constrain the yield. As ointed out above, the full likelihood is not required to be Poissonian in its yield arameters. Thus a Gaussian constraint on a Poissonian yield arameter is the correct imlementation, even if the sum of a Gaussian and a Poissonian random variable does not follow a Poissonian PDF. The constraint term on a yield arameter can even have a width smaller than N. This haens, for examle, when the constraint is derived from a large control yield Y ± Y by scaling down by a factor ε that has no uncertainty: y e = εy ± ε Y. In such situations, the constraint term will ush the fit into the genuine tye A EML regime. III. PSEUDO EXPERIMENTS Generating and fitting back a large number of seudo exeriments is a owerful tool to understand and validate a fit rocedure. Pseudo exeriments are generated by drawing a seudo dataset from the full PDF, for examle through a hit-and-miss algorithm. In an EML situation it is imortant that in the seudo datasets the comonent event yields all fluctuate like a Poissonian. As a consequence, also the total yield fluctuates like a Poissonian, and each seudo dataset contains a different number of events. Note that each yield must fluctuate indeendently, so that their ratios are not constant across the toy exeriments. It is not enough that the total yield fluctuates like a Poissonian. If constraints are resent, they have to be considered when generating and fitting a toy dataset. In articular, there is a right and a wrong way of doing it, as outlined in Ref. [2]. The right way is to interret the constraint as stemming from an external measurement: We not only have to reeat our own measurement (by drawing events from the full PDF), but also have to reeat the external measurement by drawing from the constraint PDF. So each toy exeriment will be erformed with a different constraint term, but using the same shae for the total PDF. The wrong way is to fluctuate the total shae and not the constraint term, so that each exeriment uses the same constraint term, but draws events from different total PDFs. This will lead to biased results. If there are constraints resent for yield arameters, their correct treatment in toy generation is still the above right way. This is even though the likelihood function does not only contain a Poisson term (the EML term), but also a generally non-poissonian term (the constraint). Thus one might conclude, that the total yield should not be generated from a Poissonian, while this in fact is the case. Let us be more secific. Fixing the notation, we will denote for a arameter A, its true value as A t, its value as estimated by the fit as A f, its value as determined by an external measurement as A e, and a generated value as A. Suose the total PDF is that of Eq. 1, and we

3 add a Gaussian constraint on the background yield, corresonding to an external measurement of N b,e ± σ Nb,e. Obviously the fit will be biased if we constrain a arameter to anything else but its true value, so we ll assume N b,e = N b,t (if N b,e was obtained from a genuine external measurement, this bias will likely go in the direction of the true value). To generate a toy exeriment, we have to 1. draw a value N b from a Poissonian P (N; N b,t), and a value N s from a Poissonian P (N; N s,t ), 2. generate N b background events from G(x; µ 2) and N s signal events from G(x; µ 1 ), 3. draw a toy constraint value N b,e from the constraint PDF G(N b ; N b,e, σ Nb,e ). Then the likelihood to be maximized for this articular exeriment is L(N s, N b ) = e N N N N N G(N b ; N! b,e, σ Nb,e ) g(x i), (8) i=1 where N = N s + N b and N = N s + N b is the exected total yield to be estimated by the fit. It is interesting to note, that the Poisson EML term is technically also a constraint. It constrains the fitted total number of events to the observed number of events. It also varies with the toy exeriments, because the generated observed number of events varies. IV. PULL DEFINITIONS The ull statistic is defined as = λ f λ t σ λf, (9) where λ f ± σ λf is the fit result of one articular seudo exeriment, and λ t is the true value. One exects the ull to follow a unit Gaussian, so from its observed distribution one can draw conclusions about whether or not the fit reorts unbiased central values and errors of correct coverage. If the ull distribution has mean µ ± σ µ and width w ±σ w that are not equal to and 1, resectively, one can decide to correct the fit result for these biases: λ c f = λ f µ σ λf, (1) σ c λ f = w σ λf. (11) The ull formed with the corrected quantities then has mean and width 1. If a constraint to λ e ± σ λe is resent, the usual ull of Eq. 9 still follows a unit Gaussian, rovided the toy exeriments are generated in the right way as described in Section III. Reference [2] defines a second ull statistic as 2 = λ f λ e σ. (12) 2 σ 2 λe λf The square root is always defined as a consequence of Eq. 7. Ref. [2] oints out that this definition may exhibit a slower convergence towards the unit Gaussian distribution, i.e. for large number of events in the toy samles (not large number of toy exeriments). However, the authors do not discuss constraints in the context of EML fits, and we found 2 to not follow a unit Gaussian even with sufficiently large samles. A third ossibility is 3 = λ f λ e σ λf, (13) where the generated constraint value λ e is used rather than the fixed λ e. This definition is used in certain situations by the RooFit framework [3], which we will discuss later. When generating toy exeriments in the right way, we found that also 3 does not follow a unit Gaussian. Using ull definitions with different convergence rates comes with an additional comlication: If a bias correction is necessary in a situation with too few events for the limit to be valid, the correction will deend on the ull definition. In conclusion, we recommend to use 1 in combination with the right way of generating toy exeriments. This combination gives unit ulls even if constraints on yield arameters are resent. V. EXAMPLES Let us consider the following examle. We will add to the scenario of Eq. 1 a third, low-yield Gaussian, to make the situation symmetric. The observable might reresent an invariant mass of a reconstructed comosite article, and the low-yield Gaussians might corresond to backgrounds, in which a daughter article was misreconstructed: g(x) =N s G(x; µ 1 = m = 14) + N b1 G(x; µ b1 = m + 2) + N b2 G(x; µ b2 = m 2). (14) Each Gaussian has unit width. We will assume the true yields N s,t = 5 for the signal, and each N b1,t = N b2,t = 1 for the backgrounds. We consider Gaussian constraint terms for both backgrounds N b1 and N b2, G(N bi ; µ = N bi,t, σ = N bi,t ). An examle of such a seudo exeriment is shown in Fig. 1. In Figure 2 we show the ull distributions of each ull definition in Section IV, using 5 toy exeriments. While the standard definition Eq. 9 is consistent with a unit Gaussian (µ =.31±.14, w =.993±.11),

4 Events / (.12 ) 35 Total PDF Signal 3 Bkg 1 Bkg 2 25 9 8 7 6 Pull Definition (13) Pull Definition (12) Pull Definition (9) Unit Gaussian 2 5 15 4 1 3 2 5 1 134 136 138 14 142 144 146 Mass -6-4 -2 2 4 6 Pull of N b2,f FIG. 1: Examle toy exeriment drawn from Eq. 14, with N s,t = 5 and N b1,t = N b2,t = 1. FIG. 3: Distributions of the same ull definitions as described in the cation of Figure 2, but when erforming toy exeriments in the wrong way. the other definitions (12, 13) are not. When enlarging the samle sizes to N t = 7, the distributions remain unchanged. In Figure 3 we show again the three ull distributions for generating and fitting the wrong way. Now definitions Eq. 12 and Eq. 13 follow a unit Gaussian, while definition Eq. 9 does not. However, this deends on the width of the constraint. These examles suort our conclusion of Section IV. 22 2 18 16 14 12 1 8 6 4 = 2 σ Nbi,e = 7 σ Nbi,e = σ Nbi,e nbkg nbkg nbkg 1 8 6 4 Pull Definition (13) Pull Definition (12) Pull Definition (9) Unit Gaussian 2-25 -2-15 -1-5 5 1 15 2 25 N f - N' FIG. 4: Distribution of the difference of fitted and generated total number of events N f N for different values of the constraint width on the N bi : σ Nbi,e = 2 (red), 7 (green), 1 (blue). 2 VI. ROOFIT -6-4 -2 2 4 6 Pull of N b2,f FIG. 2: Pull distributions according to Eq. 9 (blue), Eq. 12 (red), and Eq. 13 (green). Overlaid is a unit Gaussian (dashed). To illustrate how a tight constraint term can ush the fit into the tye A EML regime, we now subsequently tighten the constraints on the N bi we observe in Figure 4 that the difference between fitted and generated total number of events N f N can grow larger, as the constraints get stronger. The widest distribution is reached at a value of about N bi,t. Then, deviations of u to 1 events are ossible, corresonding to 1.4% of the events in the considered scenario. The RooFit framework [3] is widely used in exerimental article hysics to imlement sohisticated maximum likelihood fits. It also features a mechanism to automate ull studies, RooMCStudy. We would like to oint out several itfalls resent in RooFit version 3.5.4 (bundled with Root version 5.34.). There are two ways to configure RooMCStudy for the use with constraint terms. The first, using the Constrain() argument, is suosed to be used when the constraint term is art of the original PDF definition. The second, using the ExternalConstraints() argument, should be used when the constraint terms are sulied searately. Both ways do not give identical results. In the following, we refer to ulls obtained through RooMCstudy::lotPull(). Using Constrain(): RooMCstudy generates the

5 wrong way sketched in Section III. This is articularly imortant if constraints on yield arameters are resent. Then, RooFit first fluctuates the exected yield using the constraint term, and then again fluctuates the result using the EML Poisson term. As a consequence, the total generated yield does not follow a Poissonian anymore. For the ull comutation Eq. 13 is used. If the width of a yield constraint is much larger than N, one exects results similar to those obtained in the unconstrained situation. But in our examle scenario, we observe a moderate bias of µ =.3. Also, the distributions of both the central value and the error are much wider comared to the unconstrained situation. This is shown in Figure 5. It can also haen, that the effects cancel by chance: In a second test scenario, corresonding to Eq. 1, with N s,t = 1, N b,t = 5, and G(N b ; µ = N b,t, σ = 15), we observed a unit Gaussian ull, while the error and central value distributions of N b were still too wide. Events / (.5 ) 1 8 6 4 2 µ = -.229 ±.15 w = 1.89 ±.11 µ = -.661 ±.2 w = 1.425 ±.14-5 -4-3 -2-1 1 2 3 4 5 Pull of N b2,f FIG. 6: Pull distribution (red) obtained from RooMCstudy when using the Constrain() aroach and a wide constraint G(N bi ; µ = N bi,t = 1, σ = N bi,t ). The other distribution (green) is for the same scenario, but obtained by exlicitly stating N t = 7 in the generateandfit() function call. Events / ( 1.3 ) 9 8 7 6 5 4 3 2 1 Events / (.5 ) 1 8 6 4 µ = -.15 ±.14 w =.112 ±.1 µ = -.39 ±.63 w =.4487 ±.45 µ = -.1 ±.13 w =.919 ±.9 1 2 3 4 5 Error of N b2,f 2 FIG. 5: Error distribution (red) obtained from RooMCstudy when using the Constrain() aroach and a wide constraint G(N bi ; µ = N bi,t, σ = N bi,t ). Also shown is the error distribution in the unconstrained case (green). Another itfall when using Constrain() is that if the generateandfit() function is called in an EML scenario, and if one exlicitly secifies the total number of events to be generated in the function call, then the ulls deend on the width of the constraint: For wide constraints, the ull distribution will be too wide. This is illustrated in Figure 6. Using ExternalConstraints(): RooMCstudy generates the right way, i.e. the comonent yields fluctuate like a Poissonian. But during fitting, always the same, fixed constraint is used, and the ull is comuted using Eq. 9. As a consequence, the resulting ull distribution is too narrow. Thus, if the constraint is wide enough comared to N, the unconstrained situation is recovered. This is illustrated in Figure 7. Considering these difficulties it is clear that, in order to be able to conclude on a otential fit bias, the user needs a detailed understanding of RooMCstudy. -5-4 -3-2 -1 1 2 3 4 5 Pull of N b2,f FIG. 7: Pull distribution obtained from RooMCstudy when using the ExternalConstraints() aroach and a wide constraint G(N bi ; µ = N bi,t = 1, σ = 5) (green), a medium constraint (σ = 1, red), and a narrow constraint (σ = 1, blue). VII. CONCLUSION We have discussed the basic features of extended maximum likelihood fits, and how to use constraint terms to incororate external knowledge into these fits. If constraint terms are resent, the generation of seudo datasets requires care. We recommend to use the right way, in which the constraint is fluctuated in the generation ste and the PDF is not, and to use the usual ull definition. Then we find the ull to follow a unit Gaussian even if constraints on yield arameters are resent. The authors wish to thank Niels Tuning for useful discussion.

6 [1] R. J. Barlow, Nucl.Instrum.Meth. A297, 496 (199). [2] CDF Statistics Commitee, L. Demortier and L. Lyons, CDF Public Note 5776 (22). [3] W. Verkerke and D. Kirkby, htt://roofit.sourceforge.net.