A general assistant tool for the checking results from Monte Carlo simulations

Similar documents
UNCERTAINTY ANALYSIS IN

Chapter 24. Comparing Means. Copyright 2010 Pearson Education, Inc.

LAB 2. HYPOTHESIS TESTING IN THE BIOLOGICAL SCIENCES- Part 2

Unit 6 - Simple linear regression

Stat 231 Exam 2 Fall 2013

Hypothesis Testing One Sample Tests

Inference in Regression Analysis

Chapter 27 Summary Inferences for Regression

EM375 STATISTICS AND MEASUREMENT UNCERTAINTY CORRELATION OF EXPERIMENTAL DATA

Instrumentation (cont.) Statistics vs. Parameters. Descriptive Statistics. Types of Numerical Data

Chapter Eight: Assessment of Relationships 1/42

Module 8: Linear Regression. The Applied Research Center

appstats27.notebook April 06, 2017

Week 11 Sample Means, CLT, Correlation

Inference in Normal Regression Model. Dr. Frank Wood

* Tuesday 17 January :30-16:30 (2 hours) Recored on ESSE3 General introduction to the course.

Chapter 23. Inferences About Means. Monday, May 6, 13. Copyright 2009 Pearson Education, Inc.

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015

Unit 6 - Introduction to linear regression

Statistical Inference for Means

Observed Global Warming and Climate Change

The t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary

Finding Relationships Among Variables

Inferences for Regression

INTERVAL ESTIMATION AND HYPOTHESES TESTING

Regression and the 2-Sample t

Chapter 16: Correlation

Harvard University. Rigorous Research in Engineering Education

Overview. Overview. Overview. Specific Examples. General Examples. Bivariate Regression & Correlation

Chapter 12: Estimation

THE ROYAL STATISTICAL SOCIETY 2015 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE MODULE 3

Bivariate statistics: correlation

Multiple Linear Regression

" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2

Visual interpretation with normal approximation

Section 3: Simple Linear Regression

Statistics Boot Camp. Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018

The t-test Pivots Summary. Pivots and t-tests. Patrick Breheny. October 15. Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/18

Two-sample inference: Continuous data

Spatio-temporal correlations in fuel pin simulation : prediction of true uncertainties on local neutron flux (preliminary results)

Statistical methods and data analysis

This is a multiple choice and short answer practice exam. It does not count towards your grade. You may use the tables in your book.

CENTRAL LIMIT THEOREM (CLT)

MORE ON SIMPLE REGRESSION: OVERVIEW

Chapter 8. Linear Regression. Copyright 2010 Pearson Education, Inc.


Monte Carlo modeling of an electronic brachytherapy source using MCNP5 and EGSnrc

1 Hypothesis testing for a single mean

Keller: Stats for Mgmt & Econ, 7th Ed July 17, 2006

Chapter 10: Analysis of variance (ANOVA)

CMS: Priming the Network for LHC Startup and Beyond

Econometrics -- Final Exam (Sample)

Stats Review Chapter 14. Mary Stangler Center for Academic Success Revised 8/16

Warm-up Using the given data Create a scatterplot Find the regression line

16.400/453J Human Factors Engineering. Design of Experiments II

Stat 529 (Winter 2011) Experimental Design for the Two-Sample Problem. Motivation: Designing a new silver coins experiment

HYPOTHESIS TESTING: FREQUENTIST APPROACH.

1. How will an increase in the sample size affect the width of the confidence interval?

Formulas and Tables by Mario F. Triola

Section 9.4. Notation. Requirements. Definition. Inferences About Two Means (Matched Pairs) Examples

7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between

hp calculators HP 20b Probability Distributions The HP 20b probability distributions Practice solving problems involving probability distributions

General Certificate of Education Advanced Level Examination June 2014

Statistics: revision

Open book and notes. 120 minutes. Covers Chapters 8 through 14 of Montgomery and Runger (fourth edition).

Universität Potsdam Institut für Informatik Lehrstuhl Maschinelles Lernen. Hypothesis testing. Anna Wegloop Niels Landwehr/Tobias Scheffer

Statistical Intervals (One sample) (Chs )

Practical Statistics

Y i = η + ɛ i, i = 1,...,n.

Chapter 10. Regression. Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania

Review for Final. Chapter 1 Type of studies: anecdotal, observational, experimental Random sampling

Linear Correlation and Regression Analysis

Brandon C. Kelly (Harvard Smithsonian Center for Astrophysics)

7 Estimation. 7.1 Population and Sample (P.91-92)

Interpreting Regression Results

Correlation and Linear Regression

Primer on statistics:

Statistics Primer. ORC Staff: Jayme Palka Peter Boedeker Marcus Fagan Trey Dejong

The regression model with one stochastic regressor.

Chapter 23: Inferences About Means

Chapter 5 Confidence Intervals

UNIT 4 RANK CORRELATION (Rho AND KENDALL RANK CORRELATION

Testing Linear Restrictions: cont.

Lecture Slides. Elementary Statistics. by Mario F. Triola. and the Triola Statistics Series

Quantitative Introduction ro Risk and Uncertainty in Business Module 5: Hypothesis Testing

One-sample categorical data: approximate inference

HYPOTHESIS TESTING: THE CHI-SQUARE STATISTIC

Chapter 16. Simple Linear Regression and dcorrelation

Seasonal drought predictability in Portugal using statistical-dynamical techniques

Robust Backtesting Tests for Value-at-Risk Models

Filter Methods. Part I : Basic Principles and Methods

However, reliability analysis is not limited to calculation of the probability of failure.

STA Module 11 Inferences for Two Population Means

STA Rev. F Learning Objectives. Two Population Means. Module 11 Inferences for Two Population Means

Area1 Scaled Score (NAPLEX) .535 ** **.000 N. Sig. (2-tailed)

Regression Analysis. BUS 735: Business Decision Making and Research. Learn how to detect relationships between ordinal and categorical variables.

Parameter estimation and forecasting. Cristiano Porciani AIfA, Uni-Bonn

Linear Regression with Multiple Regressors

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS

Probability and Statistics

Transcription:

A general assistant tool for the checking results from Monte Carlo simulations Koi, Tatsumi SLAC/SCCS Contents Motivation Precision and Accuracy Central Limit Theorem Testing Method Current Status of Development Summary

Motivation After a Monte Carlo simulation, we get an answer. However how to estimate quality of the answer. What we must remember is Large number of history does not valid result of simulation. Small elative Error does not valid result of simulation Motivation (Cont.) To provide statistical information to assist establishing valid confidence intervals for Monte Carlo results for users, something like MCNPs did.

Subject of this study Precision of the Monte Carlo simulation Accuracy of the result is NOT a subject of this study At first we have to define Precision and Accuracy of simulations Precision and Accuracy Precision: Uncertainty caused by statistical fluctuation Accuracy: Difference between expected value and true physical quantity. Accuracy Precision True Value Mote Carlo esults

Subject of this study (Cont.) Precision of the Monte Carlo simulation is subject of this study. To state accuracy of simulations, we should consider details of simulation, i.e., uncertainties of physical data, modeling of physical processes, variance reduction techniques and so on. To make a generalized tool, we have to limit subjects only for precision. Accuracy is a subject for most of presentations in this workshop. Principal of this study is Central Limit Theorem

Central Limit Theorem Every data which are influenced by many small and unrelated random effects has normally distribution. The estimated mean will appear to be sampled from normal distribution with a KNOWN standard deviation when N approaches infinity. ( σ N ) Central Limit Theorem (Cont.) Therefore, We have to check that N have approached infinity in the sense of the CLT, or not. This corresponds to the checking the complete sampling of interested phase space has occurred, or not.

This is not a simple static test but check of results from nature of Monte Carlo simulations Mean Checking Values Variance and Standard Deviation elative error Variance of Variance S = x x x S = = 1 N N ( xi x) i= 1 N i= 1 N 1, where S = x S VOV = x i S N ( S ) x 4 S x

Checking Values (Cont.) Figure of Merit Scoring Efficiency intrinsic and efficiency Shift q = SHIFT SLOPE Fit to the Largest history scores 1 FOM = T number of NON ZEO histories N 3 ( x x) ( S N ) = i What we check? Behavior of MEAN Values of Time profile of Values of VOV Time profile of VOV Time profile of FOM Behavior of FOM Value of SLOPE Value of SHIFT Boolean Answer Effect of the largest history score occurs on the next history. MEAN ( intrinsic and efficiency) VOV FOM SHIFT Numeric Answer

Of cause, Boolean check is carried out mathematically (statistically) value behavior time profile For behaviors and time profiles check Derive Pearson s r from data (results and theoretical values) r=1(-1), perfect positive (negative) correlation r=0, uncorrelated null hypothesis is set to uncorrelated Then, t = r N 1 r follows student t distribution of degree of freedom ν = N Checking significance of r with null hypothesis. ejection level of null hypothesis is 68.8% (1σ)

Example Checking value: Observable Energy of Sampling Calorimeter. Material Pb (Lead)-Scinitillator Thickens Pb: 8.0 mm/layer, Sci:.0 mm/layer Layers 10 layers 1 m x 1 m interaction surface Beam Electon 4 GeV ange Cuts 1 mm mm e- Pb 8mm Sci. Example 100 histories MEAN mean 80 79.5 79 78.5 78 77.5 77 0 0 40 60 80 100 10 mean 10 9.9 9.8 9.7 9.6 9.5 9.4 9.3 9. 0 0 40 60 80 100 10 0.0 0.018 0.016 0.014 0.01 0.01 0.008 0.006 0.004 0.00 0 0 0 40 60 80 100 10 VOV 0.04 0.035 0.03 0.05 0.0 0.015 0.01 0.005 0 0 0 40 60 80 100 10 vov Does not pass most of Boolean tests

Example 1,000 histories MEAN mean 80 79.5 79 78.5 78 77.5 77 0 00 400 600 800 1000 100 mean 11 10.8 10.6 10.4 10. 10 9.8 9.6 9.4 9. 9 0 00 400 600 800 1000 100 0.007 0.006 0.005 0.004 0.003 0.00 0.001 0 0 00 400 600 800 1000 100 vov VOV 0.005 0.0045 0.004 0.0035 0.003 0.005 0.00 0.0015 0.001 0.0005 0 0 00 400 600 800 1000 100 vov Does not pass some of Boolean tests 80 79.5 79 78.5 78 77.5 Example 10,000 histories MEAN mean 77 0 000 4000 6000 8000 10000 1000 mean 11 10.8 10.6 10.4 10. 10 9.8 9.6 9.4 9. 9 0 000 4000 6000 8000 10000 1000 0.00 0.0018 0.0016 0.0014 0.001 0.001 0.0008 0.0006 0.0004 0.000 0 0 000 4000 6000 8000 10000 1000 vov VOV 0.0005 0.00045 0.0004 0.00035 0.0003 0.0005 0.000 0.00015 0.0001 0.00005 0 0 000 4000 6000 8000 10000 1000 vov Does not pass one of Boolean tests (SLOPE check)

How to apply Energy Spectrum estimation etc. V/E P1 P P3 P4 Checking each confidence level of P1, P, P3, P4,,,, Of course, scoring efficiency becomes low. E Unfortunately, this tool does not work well with some deterministic variance reduction techniques. This is come from limitation of CLT (means some variance are required for distribution), so that we can not over come.

And some simulations becomes deterministic without awaking of users. Please check your simulation carefully. Current Status of Development Most part of developments has been done. Following items are remained under development. Output of testing result Class or function for minimization of multi dimensional functions

We want to include this tool in Geant4 but what category is suite for this tool? un,, Hits and its collections, Tally?? Summary We have successfully developed a general assistant tool for the checking the results from Monte Carlo simulations like MCNPs. Through this tool, users easily know the confidence intervals for Monte Carlo results.