Modern Methods of Data Analysis - SS Introduction

Size: px
Start display at page:

Download "Modern Methods of Data Analysis - SS Introduction"

Transcription

1 Modern Methods of Data Analysis Introduction SS 2010 Universität Heidelberg

2 What Do You Learn in this Lecture? How to extract PHYSICS knowledge from measurements in (particle) physics Acquire competence in understanding the statistical tools needed for data analysis understanding the role of uncertainties and probabilities in relating experimental data and theory Beeing able to perform analysis of real data applying these techniques

3 What you NOT learn in this Lecture? How experimental detectors work How the standard model is build How to calculate Feynman diagrams How to design large software projects Mathematical regirous proof of all tools

4 The Particle Physics case... Why would one need to know statistical methods for data analysis in physics? Why should I need to learn such methods? Let's just consider the case of the Standard Model in particle physics...

5 Components of the Standard Model

6 THE STANDARD MODEL!!! Experiments confirm Standard Model to incredible accuracy! Everything is great We found THE THEORY!... is this really all??? Example: Z cross-section: uncertainties smaller than symbol size, data confirm hypothesis: 3 light neutrinos

7 Methodology of Particle Physics Here statistical methods are crucial!

8 How do we reach those goals? Introduction to statistical concepts in the lectures with integrated exercises Hands-on work in the computer exercises optional at end of semester: reproduce physics analysis on real (LHC) data -> presentation Slides of lecture available Monday afternoon before the lecture

9 Computer Exercises: Place: CIP-Pool, Albert-Ueberle-Str. 3-5 Potential dates: Wednesday: h Thursday: h Which programming skills do you have? C++, ROOT, other languages Do you plan to attend computer course? Do you have a Rechenzentrums-account?

10 Credit Points Lecture: 2 CP - requirements: short oral exam at end of the term (can be decided shortly before end of the term) Computer Exercises: 2 CP - requirement: (very) frequent presence

11 Outline of the Lecture Basics Concepts & Definitions Characteristics of distribution Monte Carlo generators Important distributions Error propagation Estimators Maximum Likelihood Least Square method Confidence Level and Limits Hypothesis Tests introduction/bring everyone to same level core part of the lecture Add. Material multivariant systems, analysis bias, numerical methods, function minimization

12 The Cheating Baker Once upon a time, in a holiday resort the landlord L. ran a profitable B&B, and every morning bought 30 rolls for breakfast. By law the mass of a single roll was required to be 75 g. One fine day the owner of the bakery changed, and L. suspected that the new baker B. might be cheating. So he decided to check the mass of what he bought, using a kitchen scales with a resolution of 1g. After one month he had collected a fair amount of data:

13 Data Reduction the raw list of number is not very useful need some kind of data reduction assume that all measurements are equivalent the sequence of individual data does not matter all relevant information is contained in the number of counts per reading - much improved presentation of the collected information - the above numbers cover the entire data set - most of the measurements are lower than 75g...

14 Visualization an even better presentation of the available information bar-chart of the tables given before example for the concept of a histogram define bins for the possible values of a variable plot the number of entries in each bin immediate grasp of center and width of the distribution The rolls produced by baker B. definitely show a lack of doe. So L. was right in his suspicion, that B tried to make some extra profit by cheating...

15 and the Conclusion As a consequence of his findings, L. complained to B. and even threatened to inform the police about his doings. B. apologized and claimed that the low mass of the rolls was an accident which will be corrected in the future. L., however, continues to monitor the quality delivered by the baker. One month later, B. inquired again about the quality of his products, asking whether now everything was all right. L., for his part, acknowledged that the weight of the rolls now matched his expectations, but he also voiced the opinion that B. was still cheating... B. simply selected the heaviest rolls for L.!

16 Literature Statistische und numerische Methoden der Datenanalyse (V. Blobel/E.Lohrmann) Statistical Data Analysis (Glen Cowan) Statistics: A guide to the use of statistical methods in Physical Sciences (R. J. Barlow) Recommendation for a rainy Sunday afternoon:

17 Many good lectures on statistical methods around Lecture SS07 Christoph Grab & Christian Regenfus Uni Zuerich Michael Feindt, Guenther Quast, IEKP Karlsruhe, many different lectures and many good exercise examples! Guenter Duckeck, Madjid Boutemeur Lecture Ian C. Brock, WS 2000/01, Universitaet Bonn Michael Schmelling, Heidelberger Graduiertentag 2006

18 Modern Methods of Data Analysis Lecture I Content: - Basic Concepts of Probability - Definition of random variables, Probability density, mean, variance...

19 Predictable For easy classical physical processes the result is precisely determined determinism Determinisums Examples are : pendular, orbit of planets, billard...

20 Contingency Random events are as a matter of principle not predictable (even with precise knowledge of the starting conditions) Examples: Lotto (depend on many quantities, deterministic chaos) radioactive decay (quantum mechanics) electronic noise measurement uncertainties

21 Quantum Mechanics Each time something different happens... events of the CDF experiment

22 Measurements Experiments measure frequency distributions:

23 Probability vs. Statistics Probability: from theory to data start with a well-defined problem and calculate all possible outcomes of a specific experiment Statistics: from data to theory Try to solve the inverse problem: starting from a data set, want to deduce the rules/laws this is analyzing experimental data deals with parameter estimation determine parameters and uncertainties in unbiased and efficient way hypothesis testing (agreement,confidence ) - most challenging: to match question and answers

24 Definition/Interpretation of Probability mathematical probability (axiomatic) all definitions of probability need to follow these axioms interpretation as rel. Frequency(Frequentists) subjective probability (Bayesian) Beware of different names used in literature! Lucky enough, in all useful cases they use the same combinatorial rules of probabilities! Interpretation of results differ sometimes... Not go in detail about difference at this stage of the lecture.

25 Kolmogorov Axioms (1931) Elementary events are considered, which exclude each other: elementary event set of all elements probability of event positive definite additive normalized Formal approach, however from pragmatic point of view rather useless...

26 Combination of Probabilities : A or B : A and B : not A : P(A) if is reduced to B (conditional bedingte probability) Can be deduced from Kolmogorov axioms: A and B are called independent if:

27 Frequentist Probability empirical definition via frequency of occurrence (R. von Mises) perform experiment N times in identical trials, assume event E occurs k times, then: P(E) = lim k/n intuitive definition for particle physics (many repetitions) very useful, but has some problems limes in strict mathematical sense does not exist, i.e. can not be proven that it converges... how large is N? When does it converge? What means repeatable under identical conditions? Is similar OK as well? This IS difficult, if not impossible... Not applicable for single events.

28 Subjective (Bayesian) Probability Reverend Thomas Bayes ( ) Probability is the degree of believe, that an experiment has a specific result. Subjective probability compatible with Kolmogorov axioms Essay Towards Solving a Problem in the Doctrine of Chances (1763), published postum in Philosophical Transactions of the Royal Society of London

29 Examples for Bayesian Probability Frequentists interpretation often not applicable. Than Bayesian interpretation only possible one. Probability is the degree of belief that a hypothesis is true: - The particle in this event is a positron. - Nature is supersymmetric. - It will rain tomorrow. - Germany will win the soccer championship this year. Often criticized as subjective and non scientific. However based on simple probability computation (axioms). Not contradicting the Frequentists approach, but include it.

30 Comments on Probability There are strong rivaling schools of these approaches Be beware of names: Bayesians call frequentists the classical ones some call the frequency def. of probability objective Most scientist would claim to belong to the frequency school But we often talk in Bayesian probability terms; and we think of probabilites as objective numbers In particular, any attempt of interpreating results of an experiment falls into the trap of repeatability. Bayes' Theorem will highlight this more...

31 Bayes' Theorem (1) Conditional ( bedingte ) probability Due to follows:

32 Examples: Rare Disease (1) Probability of disease A: P(A) = P(not A) = Test for disease: P(+ A) = 0.98 P(- A) = 0.02 P(+ not A) = 0.03 P(- not A) = 0.97 Do you need to be worried if you get + as test result? What is the posteriori probability P(A +)?

33 Example: Rare Disease (2) The posterior probability is only 3.2%, due to the tiny prior probability and the non negligible misidentification rate.

34 Exercise: Particle ID Example particle ID detector: proton-antiproton collision: 90% pions, 10% kaons kaon ID 95% efficient, pion mis-id 6% Question: If the particle ID indicates a kaon, what is the probability that it is a real kaon/a real pion?

35 Bayes' Theorem (2) Important through the interpretation A=theory,B=data Likelihood Posterior Prior Evidence

36 Confirmation and Rejection P(no Higgs below 200 GeV SM is true) < Likelihood If I believe the SM is true, what is the probability to see no Higgs below 200 GeV: no Higgs below 200 GeV SM is ruled out P(SM is true Higgs below 200 GeV exists) depends on P(Higgs below 200 GeV in any model) and on the a priori probability that P(SM is true) Posteriori Probability If I see a Higgs below 200 GeV, how large is the probability that the SM is true?

37 Some Definitions experiment e.g. throwing 2 times a dice realization: outcome of one specific experiment event space/sample space: all possible outcomes random variable Zufallsvariable function of outcome of experiment (e.g. sum of dices) event space and realization defined accordingly for random variables random variable can be either discrete or continues

38 Example experiment: throwing two dices event space of experiment {11,12,13,14,15,16,21,22,23,...,64,65,66} random variable x = sum of dices event space of x: {2,3,4,5,6,7,8,9,10,11,12} realization of experiment e.g. 25 accordingly realization of x=7 cumulated distribution u(x): probability to observe x or smaller value

39 Probability density function (pdf) Repeat an experiment with outcome characterized by single continuous variable x Definition: the probability to measure a value x in the interval [x,x+dx] is give by probability density function f(x) (pdf) Wahrscheinlichkeitsdichte pdf f(x) >=0 and normalized pdf f(x) is NOT a probability, it has dimension 1/x!

40 Question on PDFs List some examples of a probability density function f(x) continuous ones discrete ones

41 Cumulative distribution function (cdf) Cumulative distribution function F(x), also know as probability distribution function. Wahrscheinlichkeitsverteilung = Verteilungsfunktion F(x') is interpreted as probability to find value x <= x' F(x) is continuously non-decreasing function F(- ) = 0 and F(+ )=1 is directly related to the probability density function f(x) by: f(x) is given (for well-behaved distributions) by:

42

43 Example: cdf f(x) = 1-x for x in [0,2] 0 elsewhere Compute the function F(x) What is the probability to find x > 1.5? What is the probability to find x in [0.5,1]

44 Arithmetic Mean Definition: Examples: Average number of children per family in Germany is 2.3 Average lifetime expectation for men is 74 for women 78 Average amount of semester for physics studies in Heidelberg is

45 Weighted Mean Definition: Example: 5 measurements { uncertainties { } with different } arithmetic mean is special case of weighted means (same weight for each measurement)

46 ... even more averages Mode/Modus: most probable value (highest bin in distribution) definition not unique, unimodal, bimodal distributions Median: smallest value which is than 50% of the events (median more robust against outliers than artithm.mean) For unimodal, symmetric functions centered around : median = mode = arithmetic mean else, for unimodal functions, empiric rule mean-mode = 3 x (mean-median)

47 Examples: Give median, mode, arithmetic mean of:

48 Energy loss in Material Assume tracking stations with 12 layers The energy loss per traversed material (de/dx) of the particle traversing a layer follows a Landau distribution mode de/dx

49 Energy loss in Material The mode of the de/dx distribution as a function of particle momentum is used to seperate different particle species 1 10 momentum [GeV/c] How to get estimate of mode from 12 measurements?

50 Truncated Mean Discard 10/20% measurements with larges value (symmetrize the function) Take arithmetic mean of remaining ones All about estimating the true mode/median/mean of distributions from given data set. More about estimators later.

Statistical Methods in Particle Physics

Statistical Methods in Particle Physics Statistical Methods in Particle Physics Introduction October 15, 2012 Silvia Masciocchi, GSI Darmstadt s.masciocchi@gsi.de Niklaus Berger, University of Heidelberg nberger@physi.uni-heidelberg.de Winter

More information

Statistical Methods in Particle Physics. Lecture 2

Statistical Methods in Particle Physics. Lecture 2 Statistical Methods in Particle Physics Lecture 2 October 17, 2011 Silvia Masciocchi, GSI Darmstadt s.masciocchi@gsi.de Winter Semester 2011 / 12 Outline Probability Definition and interpretation Kolmogorov's

More information

Introduction to Statistical Methods for High Energy Physics

Introduction to Statistical Methods for High Energy Physics Introduction to Statistical Methods for High Energy Physics 2011 CERN Summer Student Lectures Glen Cowan Physics Department Royal Holloway, University of London g.cowan@rhul.ac.uk www.pp.rhul.ac.uk/~cowan

More information

Statistical Methods in Particle Physics Lecture 1: Bayesian methods

Statistical Methods in Particle Physics Lecture 1: Bayesian methods Statistical Methods in Particle Physics Lecture 1: Bayesian methods SUSSP65 St Andrews 16 29 August 2009 Glen Cowan Physics Department Royal Holloway, University of London g.cowan@rhul.ac.uk www.pp.rhul.ac.uk/~cowan

More information

Statistische Methoden der Datenanalyse. Kapitel 1: Fundamentale Konzepte. Professor Markus Schumacher Freiburg / Sommersemester 2009

Statistische Methoden der Datenanalyse. Kapitel 1: Fundamentale Konzepte. Professor Markus Schumacher Freiburg / Sommersemester 2009 Prof. M. Schumacher Stat Meth. der Datenanalyse Kapi,1: Fundamentale Konzepten Uni. Freiburg / SoSe09 1 Statistische Methoden der Datenanalyse Kapitel 1: Fundamentale Konzepte Professor Markus Schumacher

More information

Statistical Methods for Particle Physics Lecture 1: parameter estimation, statistical tests

Statistical Methods for Particle Physics Lecture 1: parameter estimation, statistical tests Statistical Methods for Particle Physics Lecture 1: parameter estimation, statistical tests http://benasque.org/2018tae/cgi-bin/talks/allprint.pl TAE 2018 Benasque, Spain 3-15 Sept 2018 Glen Cowan Physics

More information

Lectures on Statistical Data Analysis

Lectures on Statistical Data Analysis Lectures on Statistical Data Analysis London Postgraduate Lectures on Particle Physics; University of London MSci course PH4515 Glen Cowan Physics Department Royal Holloway, University of London g.cowan@rhul.ac.uk

More information

Statistical Methods for Particle Physics (I)

Statistical Methods for Particle Physics (I) Statistical Methods for Particle Physics (I) https://agenda.infn.it/conferencedisplay.py?confid=14407 Glen Cowan Physics Department Royal Holloway, University of London g.cowan@rhul.ac.uk www.pp.rhul.ac.uk/~cowan

More information

YETI IPPP Durham

YETI IPPP Durham YETI 07 @ IPPP Durham Young Experimentalists and Theorists Institute Glen Cowan Physics Department Royal Holloway, University of London g.cowan@rhul.ac.uk www.pp.rhul.ac.uk/~cowan Course web page: www.pp.rhul.ac.uk/~cowan/stat_yeti.html

More information

Lecture 5. G. Cowan Lectures on Statistical Data Analysis Lecture 5 page 1

Lecture 5. G. Cowan Lectures on Statistical Data Analysis Lecture 5 page 1 Lecture 5 1 Probability (90 min.) Definition, Bayes theorem, probability densities and their properties, catalogue of pdfs, Monte Carlo 2 Statistical tests (90 min.) general concepts, test statistics,

More information

LECTURE NOTES FYS 4550/FYS EXPERIMENTAL HIGH ENERGY PHYSICS AUTUMN 2013 PART I A. STRANDLIE GJØVIK UNIVERSITY COLLEGE AND UNIVERSITY OF OSLO

LECTURE NOTES FYS 4550/FYS EXPERIMENTAL HIGH ENERGY PHYSICS AUTUMN 2013 PART I A. STRANDLIE GJØVIK UNIVERSITY COLLEGE AND UNIVERSITY OF OSLO LECTURE NOTES FYS 4550/FYS9550 - EXPERIMENTAL HIGH ENERGY PHYSICS AUTUMN 2013 PART I PROBABILITY AND STATISTICS A. STRANDLIE GJØVIK UNIVERSITY COLLEGE AND UNIVERSITY OF OSLO Before embarking on the concept

More information

Modern Methods of Data Analysis - SS 2009

Modern Methods of Data Analysis - SS 2009 Modern Methods of Data Analysis Lecture X (2.6.09) Contents: Frequentist vs. Bayesian approach Confidence Level Re: Bayes' Theorem (1) Conditional ( bedingte ) probability Examples: Rare Disease (1) Probability

More information

Systematic uncertainties in statistical data analysis for particle physics. DESY Seminar Hamburg, 31 March, 2009

Systematic uncertainties in statistical data analysis for particle physics. DESY Seminar Hamburg, 31 March, 2009 Systematic uncertainties in statistical data analysis for particle physics DESY Seminar Hamburg, 31 March, 2009 Glen Cowan Physics Department Royal Holloway, University of London g.cowan@rhul.ac.uk www.pp.rhul.ac.uk/~cowan

More information

Statistics for the LHC Lecture 1: Introduction

Statistics for the LHC Lecture 1: Introduction Statistics for the LHC Lecture 1: Introduction Academic Training Lectures CERN, 14 17 June, 2010 indico.cern.ch/conferencedisplay.py?confid=77830 Glen Cowan Physics Department Royal Holloway, University

More information

Lecture 1: Probability Fundamentals

Lecture 1: Probability Fundamentals Lecture 1: Probability Fundamentals IB Paper 7: Probability and Statistics Carl Edward Rasmussen Department of Engineering, University of Cambridge January 22nd, 2008 Rasmussen (CUED) Lecture 1: Probability

More information

Single Maths B: Introduction to Probability

Single Maths B: Introduction to Probability Single Maths B: Introduction to Probability Overview Lecturer Email Office Homework Webpage Dr Jonathan Cumming j.a.cumming@durham.ac.uk CM233 None! http://maths.dur.ac.uk/stats/people/jac/singleb/ 1 Introduction

More information

Statistical Data Analysis 2017/18

Statistical Data Analysis 2017/18 Statistical Data Analysis 2017/18 London Postgraduate Lectures on Particle Physics; University of London MSci course PH4515 Glen Cowan Physics Department Royal Holloway, University of London g.cowan@rhul.ac.uk

More information

RWTH Aachen Graduiertenkolleg

RWTH Aachen Graduiertenkolleg RWTH Aachen Graduiertenkolleg 9-13 February, 2009 Glen Cowan Physics Department Royal Holloway, University of London g.cowan@rhul.ac.uk www.pp.rhul.ac.uk/~cowan Course web page: www.pp.rhul.ac.uk/~cowan/stat_aachen.html

More information

The language of random variables

The language of random variables The language of random variables 1 Methods in Experimental Particle Physics 04/10/18 Support Material G. Cowan, Statistical Data Analysis, Clarendon Press, Oxford, 1998. PDG L. Lista Statistical methods

More information

Statistics for the LHC Lecture 2: Discovery

Statistics for the LHC Lecture 2: Discovery Statistics for the LHC Lecture 2: Discovery Academic Training Lectures CERN, 14 17 June, 2010 indico.cern.ch/conferencedisplay.py?confid=77830 Glen Cowan Physics Department Royal Holloway, University of

More information

Probability - Lecture 4

Probability - Lecture 4 1 Introduction Probability - Lecture 4 Many methods of computation physics and the comparison of data to a mathematical representation, apply stochastic methods. These ideas were first introduced in the

More information

Statistics for Data Analysis. Niklaus Berger. PSI Practical Course Physics Institute, University of Heidelberg

Statistics for Data Analysis. Niklaus Berger. PSI Practical Course Physics Institute, University of Heidelberg Statistics for Data Analysis PSI Practical Course 2014 Niklaus Berger Physics Institute, University of Heidelberg Overview You are going to perform a data analysis: Compare measured distributions to theoretical

More information

Numerical Methods for Data Analysis

Numerical Methods for Data Analysis Michael O. Distler distler@uni-mainz.de Bosen (Saar), August 29 - September 3, 2010 Fundamentals Probability distributions Expectation values, error propagation Parameter estimation Regression analysis

More information

Preliminary Statistics Lecture 2: Probability Theory (Outline) prelimsoas.webs.com

Preliminary Statistics Lecture 2: Probability Theory (Outline) prelimsoas.webs.com 1 School of Oriental and African Studies September 2015 Department of Economics Preliminary Statistics Lecture 2: Probability Theory (Outline) prelimsoas.webs.com Gujarati D. Basic Econometrics, Appendix

More information

2. A Basic Statistical Toolbox

2. A Basic Statistical Toolbox . A Basic Statistical Toolbo Statistics is a mathematical science pertaining to the collection, analysis, interpretation, and presentation of data. Wikipedia definition Mathematical statistics: concerned

More information

STA Module 4 Probability Concepts. Rev.F08 1

STA Module 4 Probability Concepts. Rev.F08 1 STA 2023 Module 4 Probability Concepts Rev.F08 1 Learning Objectives Upon completing this module, you should be able to: 1. Compute probabilities for experiments having equally likely outcomes. 2. Interpret

More information

Terminology. Experiment = Prior = Posterior =

Terminology. Experiment = Prior = Posterior = Review: probability RVs, events, sample space! Measures, distributions disjoint union property (law of total probability book calls this sum rule ) Sample v. population Law of large numbers Marginals,

More information

Modern Methods of Data Analysis - WS 07/08

Modern Methods of Data Analysis - WS 07/08 Modern Methods of Data Analysis Lecture VII (26.11.07) Contents: Maximum Likelihood (II) Exercise: Quality of Estimators Assume hight of students is Gaussian distributed. You measure the size of N students.

More information

Statistical Data Analysis Stat 3: p-values, parameter estimation

Statistical Data Analysis Stat 3: p-values, parameter estimation Statistical Data Analysis Stat 3: p-values, parameter estimation London Postgraduate Lectures on Particle Physics; University of London MSci course PH4515 Glen Cowan Physics Department Royal Holloway,

More information

Probability, Entropy, and Inference / More About Inference

Probability, Entropy, and Inference / More About Inference Probability, Entropy, and Inference / More About Inference Mário S. Alvim (msalvim@dcc.ufmg.br) Information Theory DCC-UFMG (2018/02) Mário S. Alvim (msalvim@dcc.ufmg.br) Probability, Entropy, and Inference

More information

Statistical Methods in Particle Physics

Statistical Methods in Particle Physics Statistical Methods in Particle Physics Lecture 3 October 29, 2012 Silvia Masciocchi, GSI Darmstadt s.masciocchi@gsi.de Winter Semester 2012 / 13 Outline Reminder: Probability density function Cumulative

More information

Probability deals with modeling of random phenomena (phenomena or experiments whose outcomes may vary)

Probability deals with modeling of random phenomena (phenomena or experiments whose outcomes may vary) Chapter 14 From Randomness to Probability How to measure a likelihood of an event? How likely is it to answer correctly one out of two true-false questions on a quiz? Is it more, less, or equally likely

More information

Physics 509: Bootstrap and Robust Parameter Estimation

Physics 509: Bootstrap and Robust Parameter Estimation Physics 509: Bootstrap and Robust Parameter Estimation Scott Oser Lecture #20 Physics 509 1 Nonparametric parameter estimation Question: what error estimate should you assign to the slope and intercept

More information

Probability Density Functions

Probability Density Functions Statistical Methods in Particle Physics / WS 13 Lecture II Probability Density Functions Niklaus Berger Physics Institute, University of Heidelberg Recap of Lecture I: Kolmogorov Axioms Ingredients: Set

More information

STT When trying to evaluate the likelihood of random events we are using following wording.

STT When trying to evaluate the likelihood of random events we are using following wording. Introduction to Chapter 2. Probability. When trying to evaluate the likelihood of random events we are using following wording. Provide your own corresponding examples. Subjective probability. An individual

More information

Grundlagen der Künstlichen Intelligenz

Grundlagen der Künstlichen Intelligenz Grundlagen der Künstlichen Intelligenz Uncertainty & Probabilities & Bandits Daniel Hennes 16.11.2017 (WS 2017/18) University Stuttgart - IPVS - Machine Learning & Robotics 1 Today Uncertainty Probability

More information

Math Review Sheet, Fall 2008

Math Review Sheet, Fall 2008 1 Descriptive Statistics Math 3070-5 Review Sheet, Fall 2008 First we need to know about the relationship among Population Samples Objects The distribution of the population can be given in one of the

More information

Topic 3: Introduction to Probability

Topic 3: Introduction to Probability Topic 3: Introduction to Probability 1 Contents 1. Introduction 2. Simple Definitions 3. Types of Probability 4. Theorems of Probability 5. Probabilities under conditions of statistically independent events

More information

Course Evaluation, FYTN04 Theoretical Particle. Particle Physics, Fall 11, Department of Astronomy and Theoretical Physics

Course Evaluation, FYTN04 Theoretical Particle. Particle Physics, Fall 11, Department of Astronomy and Theoretical Physics 1 of 16 03/07/2012 03:41 PM Course Evaluation, FYTN04 Theoretical Particle Physics, Fall 11, Department of Astronomy and Theoretical Physics Course Evaluation, FYTN04 Theoretical Particle Physics, Fall

More information

2.3 Estimating PDFs and PDF Parameters

2.3 Estimating PDFs and PDF Parameters .3 Estimating PDFs and PDF Parameters estimating means - discrete and continuous estimating variance using a known mean estimating variance with an estimated mean estimating a discrete pdf estimating a

More information

Data Analysis and Monte Carlo Methods

Data Analysis and Monte Carlo Methods Lecturer: Allen Caldwell, Max Planck Institute for Physics & TUM Recitation Instructor: Oleksander (Alex) Volynets, MPP & TUM General Information: - Lectures will be held in English, Mondays 16-18:00 -

More information

Machine Learning. Bayes Basics. Marc Toussaint U Stuttgart. Bayes, probabilities, Bayes theorem & examples

Machine Learning. Bayes Basics. Marc Toussaint U Stuttgart. Bayes, probabilities, Bayes theorem & examples Machine Learning Bayes Basics Bayes, probabilities, Bayes theorem & examples Marc Toussaint U Stuttgart So far: Basic regression & classification methods: Features + Loss + Regularization & CV All kinds

More information

Confidence Intervals. First ICFA Instrumentation School/Workshop. Harrison B. Prosper Florida State University

Confidence Intervals. First ICFA Instrumentation School/Workshop. Harrison B. Prosper Florida State University Confidence Intervals First ICFA Instrumentation School/Workshop At Morelia,, Mexico, November 18-29, 2002 Harrison B. Prosper Florida State University Outline Lecture 1 Introduction Confidence Intervals

More information

Statistics: Learning models from data

Statistics: Learning models from data DS-GA 1002 Lecture notes 5 October 19, 2015 Statistics: Learning models from data Learning models from data that are assumed to be generated probabilistically from a certain unknown distribution is a crucial

More information

Frequentist Confidence Limits and Intervals. Roger Barlow SLUO Lectures on Statistics August 2006

Frequentist Confidence Limits and Intervals. Roger Barlow SLUO Lectures on Statistics August 2006 Frequentist Confidence Limits and Intervals Roger Barlow SLUO Lectures on Statistics August 2006 Confidence Intervals A common part of the physicist's toolkit Especially relevant for results which are

More information

Random Number Generation. CS1538: Introduction to simulations

Random Number Generation. CS1538: Introduction to simulations Random Number Generation CS1538: Introduction to simulations Random Numbers Stochastic simulations require random data True random data cannot come from an algorithm We must obtain it from some process

More information

Machine Learning

Machine Learning Machine Learning 10-701 Tom M. Mitchell Machine Learning Department Carnegie Mellon University January 13, 2011 Today: The Big Picture Overfitting Review: probability Readings: Decision trees, overfiting

More information

6 Event-based Independence and Conditional Probability. Sneak peek: Figure 3: Conditional Probability Example: Sneak Peek

6 Event-based Independence and Conditional Probability. Sneak peek: Figure 3: Conditional Probability Example: Sneak Peek 6 Event-based Independence and Conditional Probability Example Example Roll 6.1. a fair Roll dicea dice... Sneak peek: Figure 3: Conditional Probability Example: Sneak Peek Example 6.2 (Slides). Diagnostic

More information

Probability Review Lecturer: Ji Liu Thank Jerry Zhu for sharing his slides

Probability Review Lecturer: Ji Liu Thank Jerry Zhu for sharing his slides Probability Review Lecturer: Ji Liu Thank Jerry Zhu for sharing his slides slide 1 Inference with Bayes rule: Example In a bag there are two envelopes one has a red ball (worth $100) and a black ball one

More information

MATH MW Elementary Probability Course Notes Part I: Models and Counting

MATH MW Elementary Probability Course Notes Part I: Models and Counting MATH 2030 3.00MW Elementary Probability Course Notes Part I: Models and Counting Tom Salisbury salt@yorku.ca York University Winter 2010 Introduction [Jan 5] Probability: the mathematics used for Statistics

More information

Probability (Devore Chapter Two)

Probability (Devore Chapter Two) Probability (Devore Chapter Two) 1016-345-01: Probability and Statistics for Engineers Fall 2012 Contents 0 Administrata 2 0.1 Outline....................................... 3 1 Axiomatic Probability 3

More information

Statistics, Data Analysis, and Simulation SS 2017

Statistics, Data Analysis, and Simulation SS 2017 Statistics, Data Analysis, and Simulation SS 2017 08.128.730 Statistik, Datenanalyse und Simulation Dr. Michael O. Distler Mainz, 20. April 2017 The Mainz Microtron 1.6 GeV cw electron

More information

Physics 6720 Introduction to Statistics April 4, 2017

Physics 6720 Introduction to Statistics April 4, 2017 Physics 6720 Introduction to Statistics April 4, 2017 1 Statistics of Counting Often an experiment yields a result that can be classified according to a set of discrete events, giving rise to an integer

More information

Probability and statistics; Rehearsal for pattern recognition

Probability and statistics; Rehearsal for pattern recognition Probability and statistics; Rehearsal for pattern recognition Václav Hlaváč Czech Technical University in Prague Czech Institute of Informatics, Robotics and Cybernetics 166 36 Prague 6, Jugoslávských

More information

Information Science 2

Information Science 2 Information Science 2 Probability Theory: An Overview Week 12 College of Information Science and Engineering Ritsumeikan University Agenda Terms and concepts from Week 11 Basic concepts of probability

More information

Probability Theory and Random Variables

Probability Theory and Random Variables Probability Theory and Random Variables One of the most noticeable aspects of many computer science related phenomena is the lack of certainty. When a job is submitted to a batch oriented computer system,

More information

7.1 What is it and why should we care?

7.1 What is it and why should we care? Chapter 7 Probability In this section, we go over some simple concepts from probability theory. We integrate these with ideas from formal language theory in the next chapter. 7.1 What is it and why should

More information

Statistics and Data Analysis

Statistics and Data Analysis Statistics and Data Analysis The Crash Course Physics 226, Fall 2013 "There are three kinds of lies: lies, damned lies, and statistics. Mark Twain, allegedly after Benjamin Disraeli Statistics and Data

More information

FYST17 Lecture 8 Statistics and hypothesis testing. Thanks to T. Petersen, S. Maschiocci, G. Cowan, L. Lyons

FYST17 Lecture 8 Statistics and hypothesis testing. Thanks to T. Petersen, S. Maschiocci, G. Cowan, L. Lyons FYST17 Lecture 8 Statistics and hypothesis testing Thanks to T. Petersen, S. Maschiocci, G. Cowan, L. Lyons 1 Plan for today: Introduction to concepts The Gaussian distribution Likelihood functions Hypothesis

More information

Lecture 10: Probability distributions TUESDAY, FEBRUARY 19, 2019

Lecture 10: Probability distributions TUESDAY, FEBRUARY 19, 2019 Lecture 10: Probability distributions DANIEL WELLER TUESDAY, FEBRUARY 19, 2019 Agenda What is probability? (again) Describing probabilities (distributions) Understanding probabilities (expectation) Partial

More information

Discrete Mathematics and Probability Theory Spring 2016 Rao and Walrand Note 14

Discrete Mathematics and Probability Theory Spring 2016 Rao and Walrand Note 14 CS 70 Discrete Mathematics and Probability Theory Spring 2016 Rao and Walrand Note 14 Introduction One of the key properties of coin flips is independence: if you flip a fair coin ten times and get ten

More information

With Question/Answer Animations. Chapter 7

With Question/Answer Animations. Chapter 7 With Question/Answer Animations Chapter 7 Chapter Summary Introduction to Discrete Probability Probability Theory Bayes Theorem Section 7.1 Section Summary Finite Probability Probabilities of Complements

More information

Discovery and Significance. M. Witherell 5/10/12

Discovery and Significance. M. Witherell 5/10/12 Discovery and Significance M. Witherell 5/10/12 Discovering particles Much of what we know about physics at the most fundamental scale comes from discovering particles. We discovered these particles by

More information

Introduction to Bayesian Statistics and Markov Chain Monte Carlo Estimation. EPSY 905: Multivariate Analysis Spring 2016 Lecture #10: April 6, 2016

Introduction to Bayesian Statistics and Markov Chain Monte Carlo Estimation. EPSY 905: Multivariate Analysis Spring 2016 Lecture #10: April 6, 2016 Introduction to Bayesian Statistics and Markov Chain Monte Carlo Estimation EPSY 905: Multivariate Analysis Spring 2016 Lecture #10: April 6, 2016 EPSY 905: Intro to Bayesian and MCMC Today s Class An

More information

Probabilistic Reasoning

Probabilistic Reasoning Course 16 :198 :520 : Introduction To Artificial Intelligence Lecture 7 Probabilistic Reasoning Abdeslam Boularias Monday, September 28, 2015 1 / 17 Outline We show how to reason and act under uncertainty.

More information

Modern Methods of Data Analysis - WS 07/08

Modern Methods of Data Analysis - WS 07/08 Modern Methods of Data Analysis Lecture V (12.11.07) Contents: Central Limit Theorem Uncertainties: concepts, propagation and properties Central Limit Theorem Consider the sum X of n independent variables,

More information

Lecture 1: Basics of Probability

Lecture 1: Basics of Probability Lecture 1: Basics of Probability (Luise-Vitetta, Chapter 8) Why probability in data science? Data acquisition is noisy Sampling/quantization external factors: If you record your voice saying machine learning

More information

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review STATS 200: Introduction to Statistical Inference Lecture 29: Course review Course review We started in Lecture 1 with a fundamental assumption: Data is a realization of a random process. The goal throughout

More information

Lecture 11: Information theory THURSDAY, FEBRUARY 21, 2019

Lecture 11: Information theory THURSDAY, FEBRUARY 21, 2019 Lecture 11: Information theory DANIEL WELLER THURSDAY, FEBRUARY 21, 2019 Agenda Information and probability Entropy and coding Mutual information and capacity Both images contain the same fraction of black

More information

Cogs 14B: Introduction to Statistical Analysis

Cogs 14B: Introduction to Statistical Analysis Cogs 14B: Introduction to Statistical Analysis Statistical Tools: Description vs. Prediction/Inference Description Averages Variability Correlation Prediction (Inference) Regression Confidence intervals/

More information

6.867 Machine Learning

6.867 Machine Learning 6.867 Machine Learning Problem set 1 Solutions Thursday, September 19 What and how to turn in? Turn in short written answers to the questions explicitly stated, and when requested to explain or prove.

More information

An introduction to Bayesian reasoning in particle physics

An introduction to Bayesian reasoning in particle physics An introduction to Bayesian reasoning in particle physics Graduiertenkolleg seminar, May 15th 2013 Overview Overview A concrete example Scientific reasoning Probability and the Bayesian interpretation

More information

Bayes Theorem (Recap) Adrian Bevan.

Bayes Theorem (Recap) Adrian Bevan. Bayes Theorem (Recap) Adrian Bevan email: a.j.bevan@qmul.ac.uk Overview Bayes theorem is given by The probability you want to compute: The probability of the hypothesis B given the data A. This is sometimes

More information

Bayesian Models in Machine Learning

Bayesian Models in Machine Learning Bayesian Models in Machine Learning Lukáš Burget Escuela de Ciencias Informáticas 2017 Buenos Aires, July 24-29 2017 Frequentist vs. Bayesian Frequentist point of view: Probability is the frequency of

More information

Probability Dr. Manjula Gunarathna 1

Probability Dr. Manjula Gunarathna 1 Probability Dr. Manjula Gunarathna Probability Dr. Manjula Gunarathna 1 Introduction Probability theory was originated from gambling theory Probability Dr. Manjula Gunarathna 2 History of Probability Galileo

More information

Bayesian Inference. STA 121: Regression Analysis Artin Armagan

Bayesian Inference. STA 121: Regression Analysis Artin Armagan Bayesian Inference STA 121: Regression Analysis Artin Armagan Bayes Rule...s! Reverend Thomas Bayes Posterior Prior p(θ y) = p(y θ)p(θ)/p(y) Likelihood - Sampling Distribution Normalizing Constant: p(y

More information

Fundamentals of Probability CE 311S

Fundamentals of Probability CE 311S Fundamentals of Probability CE 311S OUTLINE Review Elementary set theory Probability fundamentals: outcomes, sample spaces, events Outline ELEMENTARY SET THEORY Basic probability concepts can be cast in

More information

Error analysis for efficiency

Error analysis for efficiency Glen Cowan RHUL Physics 28 July, 2008 Error analysis for efficiency To estimate a selection efficiency using Monte Carlo one typically takes the number of events selected m divided by the number generated

More information

Experiment 2 Random Error and Basic Statistics

Experiment 2 Random Error and Basic Statistics PHY9 Experiment 2: Random Error and Basic Statistics 8/5/2006 Page Experiment 2 Random Error and Basic Statistics Homework 2: Turn in at start of experiment. Readings: Taylor chapter 4: introduction, sections

More information

Origins of Probability Theory

Origins of Probability Theory 1 16.584: INTRODUCTION Theory and Tools of Probability required to analyze and design systems subject to uncertain outcomes/unpredictability/randomness. Such systems more generally referred to as Experiments.

More information

Supplemental Resource: Brain and Cognitive Sciences Statistics & Visualization for Data Analysis & Inference January (IAP) 2009

Supplemental Resource: Brain and Cognitive Sciences Statistics & Visualization for Data Analysis & Inference January (IAP) 2009 MIT OpenCourseWare http://ocw.mit.edu Supplemental Resource: Brain and Cognitive Sciences Statistics & Visualization for Data Analysis & Inference January (IAP) 2009 For information about citing these

More information

Randomized Algorithms

Randomized Algorithms Randomized Algorithms Prof. Tapio Elomaa tapio.elomaa@tut.fi Course Basics A new 4 credit unit course Part of Theoretical Computer Science courses at the Department of Mathematics There will be 4 hours

More information

Some Statistical Tools for Particle Physics

Some Statistical Tools for Particle Physics Some Statistical Tools for Particle Physics Particle Physics Colloquium MPI für Physik u. Astrophysik Munich, 10 May, 2016 Glen Cowan Physics Department Royal Holloway, University of London g.cowan@rhul.ac.uk

More information

Statistical Methods for Particle Physics Lecture 3: Systematics, nuisance parameters

Statistical Methods for Particle Physics Lecture 3: Systematics, nuisance parameters Statistical Methods for Particle Physics Lecture 3: Systematics, nuisance parameters http://benasque.org/2018tae/cgi-bin/talks/allprint.pl TAE 2018 Centro de ciencias Pedro Pascual Benasque, Spain 3-15

More information

AST 418/518 Instrumentation and Statistics

AST 418/518 Instrumentation and Statistics AST 418/518 Instrumentation and Statistics Class Website: http://ircamera.as.arizona.edu/astr_518 Class Texts: Practical Statistics for Astronomers, J.V. Wall, and C.R. Jenkins Measuring the Universe,

More information

Topics in Statistical Data Analysis for HEP Lecture 1: Bayesian Methods CERN-JINR European School of High Energy Physics Bautzen, June 2009

Topics in Statistical Data Analysis for HEP Lecture 1: Bayesian Methods CERN-JINR European School of High Energy Physics Bautzen, June 2009 Topics in Statistical Data Analysis for HEP Lecture 1: Bayesian Methods CERN-JINR European School of High Energy Physics Bautzen, 14 27 June 2009 Glen Cowan Physics Department Royal Holloway, University

More information

Statistische Methoden der Datenanalyse. Kapitel 3: Die Monte-Carlo-Methode

Statistische Methoden der Datenanalyse. Kapitel 3: Die Monte-Carlo-Methode 1 Statistische Methoden der Datenanalyse Kapitel 3: Die Monte-Carlo-Methode Professor Markus Schumacher Freiburg / Sommersemester 2009 Basiert auf Vorlesungen und Folien von Glen Cowan und Abbildungen

More information

E. Santovetti lesson 4 Maximum likelihood Interval estimation

E. Santovetti lesson 4 Maximum likelihood Interval estimation E. Santovetti lesson 4 Maximum likelihood Interval estimation 1 Extended Maximum Likelihood Sometimes the number of total events measurements of the experiment n is not fixed, but, for example, is a Poisson

More information

Probability theory basics

Probability theory basics Probability theory basics Michael Franke Basics of probability theory: axiomatic definition, interpretation, joint distributions, marginalization, conditional probability & Bayes rule. Random variables:

More information

Fundamentals to Biostatistics. Prof. Chandan Chakraborty Associate Professor School of Medical Science & Technology IIT Kharagpur

Fundamentals to Biostatistics. Prof. Chandan Chakraborty Associate Professor School of Medical Science & Technology IIT Kharagpur Fundamentals to Biostatistics Prof. Chandan Chakraborty Associate Professor School of Medical Science & Technology IIT Kharagpur Statistics collection, analysis, interpretation of data development of new

More information

BACKGROUND NOTES FYS 4550/FYS EXPERIMENTAL HIGH ENERGY PHYSICS AUTUMN 2016 PROBABILITY A. STRANDLIE NTNU AT GJØVIK AND UNIVERSITY OF OSLO

BACKGROUND NOTES FYS 4550/FYS EXPERIMENTAL HIGH ENERGY PHYSICS AUTUMN 2016 PROBABILITY A. STRANDLIE NTNU AT GJØVIK AND UNIVERSITY OF OSLO ACKGROUND NOTES FYS 4550/FYS9550 - EXERIMENTAL HIGH ENERGY HYSICS AUTUMN 2016 ROAILITY A. STRANDLIE NTNU AT GJØVIK AND UNIVERSITY OF OSLO efore embarking on the concept of probability, we will first define

More information

Statistics Challenges in High Energy Physics Search Experiments

Statistics Challenges in High Energy Physics Search Experiments Statistics Challenges in High Energy Physics Search Experiments The Weizmann Institute of Science, Rehovot, Israel E-mail: eilam.gross@weizmann.ac.il Ofer Vitells The Weizmann Institute of Science, Rehovot,

More information

Statistical Methods in Particle Physics

Statistical Methods in Particle Physics Statistical Methods in Particle Physics Lecture 11 January 7, 2013 Silvia Masciocchi, GSI Darmstadt s.masciocchi@gsi.de Winter Semester 2012 / 13 Outline How to communicate the statistical uncertainty

More information

CONTINUOUS RANDOM VARIABLES

CONTINUOUS RANDOM VARIABLES the Further Mathematics network www.fmnetwork.org.uk V 07 REVISION SHEET STATISTICS (AQA) CONTINUOUS RANDOM VARIABLES The main ideas are: Properties of Continuous Random Variables Mean, Median and Mode

More information

Statistics for Particle Physics. Kyle Cranmer. New York University. Kyle Cranmer (NYU) CERN Academic Training, Feb 2-5, 2009

Statistics for Particle Physics. Kyle Cranmer. New York University. Kyle Cranmer (NYU) CERN Academic Training, Feb 2-5, 2009 Statistics for Particle Physics Kyle Cranmer New York University 1 Hypothesis Testing 55 Hypothesis testing One of the most common uses of statistics in particle physics is Hypothesis Testing! assume one

More information

Basic Probability. Introduction

Basic Probability. Introduction Basic Probability Introduction The world is an uncertain place. Making predictions about something as seemingly mundane as tomorrow s weather, for example, is actually quite a difficult task. Even with

More information

FYST17 Lecture 6 LHC Physics II

FYST17 Lecture 6 LHC Physics II FYST17 Lecture 6 LHC Physics II 1 Today & Monday The LHC accelerator Challenges The experiments (mainly CMS and ATLAS) Important variables Preparations Soft physics EWK physics Some recent results Focus

More information

Chapter ML:IV. IV. Statistical Learning. Probability Basics Bayes Classification Maximum a-posteriori Hypotheses

Chapter ML:IV. IV. Statistical Learning. Probability Basics Bayes Classification Maximum a-posteriori Hypotheses Chapter ML:IV IV. Statistical Learning Probability Basics Bayes Classification Maximum a-posteriori Hypotheses ML:IV-1 Statistical Learning STEIN 2005-2017 Area Overview Mathematics Statistics...... Stochastics

More information

Course Evaluation, Department of Theoretical Physics - FYS230 Theoretical Particle Physics, Fall 2006

Course Evaluation, Department of Theoretical Physics - FYS230 Theoretical Particle Physics, Fall 2006 Course Evaluation, Department of Theoretical Physics - FYS230 Theoretical Particle Physics, Fall 2006 Course Evaluation, Department of Theoretical Physics - FYS230 Theoretical Particle Physics, Fall 2006

More information

the probability of getting either heads or tails must be 1 (excluding the remote possibility of getting it to land on its edge).

the probability of getting either heads or tails must be 1 (excluding the remote possibility of getting it to land on its edge). Probability One of the most useful and intriguing aspects of quantum mechanics is the Heisenberg Uncertainty Principle. Before I get to it however, we need some initial comments on probability. Let s first

More information