Some Statistics. V. Lindberg. May 16, 2007

Size: px
Start display at page:

Download "Some Statistics. V. Lindberg. May 16, 2007"

Transcription

1 Some Statistics V. Lindberg May 16, Go here for full details An excellent reference written by physicists with sample programs available is Data Reduction and Error Analysis for the Physical Sciences, Philip R. Bevington and D. Keith Robinson, McGraw Hill 2003, ISBN Basic Statistics You all have, I hope, understand the following terms: (a) Random versus systematic error (b) Gaussian or Normal distribution (c) Measures of central tendency: mean, median, mode (d) Measure of distribution spread: standard deviation We believe that there is an underlying answer to our measurements: that is there is some average and standard deviation in the quantity being measured. This is called the population mean, µ and the population standard deviation, σ. Our objective is to determine the population values based on a finite number of measurements. Suppose we make N measurements of a quantity x this is called sampling the distribution. The sample mean is defined as x = 1 N x i (1) N the sample variance is s 2 x = 1 N 1 i=1 N (x i x) 2 (2) i=1 1

2 and the sample standard deviation is the square root of the variance. As we make more and more measurements we are more and more certain that our sample measurements accurately represent the population, that is that x = µ and s x = σ. In some situations we have discrete values that a variable can take. For example, in the hydrogen spectra there are only certain energy photons that exist. It is convenient to bin the measurements giving the measured value and the number of times it is seen. So in a particular measurement of 100 photons from hydrogen we might measure 30 photons with energy 10.2 ev, 22 photons with energy 1.9 ev, etc. In this case we can describe the probability, P (x j ), of observing a particular energy x. Putting this on a formal basis: we make N measurements, and there are n values that the variable can have. A particular result x j is observed n j times. The probability of observing that particular value is P (x j ) = n j (3) N and the average can be written using this probability Variance is given by x = 1 N n NP (x j ) x j = j=1 s 2 x = n P (x j ) x j (4) j=1 n P (x j ) (x j x) 2 (5) j=1 Example The following measurements are made. Find the mean and standard deviation. Value Applying Equations (1) and (2) we get a mean of 4.27 and a standard deviation of Excel functions AVRERAGE and STDEV are useful. A more compact representation of the data is the following 2

3 Value Frequency Probability Using Equation (4) and we get mean = ((3)(0.2727) + (4)(0.3636) + (5)(0.2727) + 7(0.0909)) = 4.27 (6) The variance is done similarly. Equations (1) and (2) are appropriate for discrete distributions. Many times we have variables that are continuous and we define a probability density, p(x) such that the probability, dp (x) of making a measurement in an infinitesimal range from x to x + dx is dp (x) = p(x) dx (7) Using this we can write Equations(4) and (5) as x = p(x) x dx (8) and s 2 x = p(x) (x x) 2 dx (9) 3 Distributions In one standard treatment of statistics one makes an a priori assumption that the distributions are Gaussian. In many cases this is a fine assumption. In many other cases it is completely wrong. 3.1 Gaussian Distribution (Continuous) The Gaussian, or Normal, or Bell-shaped curve is a symmetric, continuous distribution in the measured variable x. The variable x can have any value: < x <. Population mean and standard deviations are µ and σ and the probability of making a measurement in the range x to x + dx is dp G (x : µ, σ) = p G (x; µ, σ) dx (10) 3

4 where [ p G (x; µ, σ) = 1 σ 2π exp 1 2 ( ) ] x µ 2 Our notation (after Bevington) uses the subscript G to indicate Gaussian, x as the variable, and µ, σ as the parameters describing the distribution. Consider first the math SAT scores for college students as a whole. These may well follow a normal distribution. By contrast, the math SAT scores for physics and math majors should not be distributed normally you are supposed to be better at math! At one time memory chips were manufactured and measured for speed of the chip. Those that could reliably operate at higher speed were sold at a higher price with a different part number than those that were slower. Imagine a normal distribution for the initial population of chips, draw a line somewhere separating the distribution into two parts and you can see that the distribution is not normal! Part of the job of a physicist is to determine what distribution applies to an experimental situation, and apply the appropriate statistics. This is not done nearly as carefully as it ought to be done! Fortunately the distribution is frequently Gaussian, and the researcher gets away with some unjustified assumptions. Excel has a function NORMDIST that returns the probablility density, =NORMDIST(x, mean, stdev, FALSE) or the probability gotten by the integral from to x, =NOR- MDIST(x, mean, stdev, TRUE). Example A Gaussian distribution of energy measurements, E, has a mean of 13.6 ev with a standard deviation of 1.2 ev. (a) What is the probability density for E = 11.0 ev? =NORMDIST(11.0, 13.6,1.2,FALSE) = (b) What is the probability of measuring a value between 10.9 and 11.1 ev? σ (11) Here the range in energies is small so try just multiplying the probability density by the width. P = ( )(0.2) = The exact answer is , so this is close. (c) What is the probability of measuring a value between 10.9 and 13.0 ev? Now we evaluate the integral, P =NORMDIST(13.0, 13.6,1.2,TRUE) - NOR- MDIST(10.9, 13.6,1.2,TRUE) = Three other distributions are seen regularly in physics: Binomial Distributions, Poisson Distributions, and Lorentzian Distributions. 4

5 3.2 Binomial Distribution Consider a coin toss that can have only two outcomes, heads or tails. If n coins are tossed, what is the probability that there are exactly x heads? We allow for the possibility of a biased coin by saying that the probability of a head from a single toss of the coin is p, and this may not be the fair-coin value of P B (x : n, p) = The mean of the binomial distribution, not surprisingly, is n! x!(n x)! px (1 p) n x (12) µ = n p (13) and the standard deviation is σ = n p (1 p) (14) What is the difference between the Binomial and the Gaussian? The binomial has finite limits: x can only run from 0 to n. For large values of n the Binomial can be approximated by the Gaussian however. example Preliminary measurements show that of 1000 measurements, 472 result in scattering in a forward direction and 528 result in scattering in the reverse direction. What is the standard deviation to be quoted? Here n = 1000 and we estimate that p = 472/1000 = From Equation (14) we get σ = Hence we can say that the number that forward scattered is (472 ± 16). 3.3 Poisson Distribution The Poisson Distribution is regularly seen for the case of counting statistics for photons or radioactivity, where we ask, What is the probability of measuring x counts in a time interval when the mean number of counts in that interval is µ. This probability is with the mean being µ and the standard deviation being P P (x : µ) = µx exp( µ) (15) x! σ = µ (16) example You count the number of photons arriving for 100 s and find 123 counts. 5

6 (i) What are the average and standard deviation of the the count rate? The mean is 123/100 = 1.23 counts per second. The standard deviation in the number of counts is 123 = 11 so the standard deviation in the count rate based on this sample is 11/100 = Hence we could quote the count rate to be 1.23 ± 0.11 counts/s. (ii) If you count for 2 seconds, what is the chance of getting 0 counts? 1 count?... 8 counts? Use Equation (15) with the mean count in 2 seconds being 2(1.23) = 2.46 we find n probability The Poisson distribution is different from the Binomial or Normal in one very important way it is not symmetrical around the mean. Unlike the Binomial which is bounded on both sides, 0 x n, the Poisson is only bounded on the lower side, 0 x. If the mean of a Poisson distribution is large, it can be approximated by a normal. 3.4 Lorentzian The Lorentzian or Cauchy Distribution is used to describe behavior of resonant systems and has a probability density distribution p L (x : µ, Γ) = 1 π Γ/2 (x µ) 2 + (Γ/2) 2 (17) The standard deviation is not defined for the Lorentzian. Instead the full-width-at-halfmaximum, Γ characterizes the distribution. Figure (1) compares the different distributions for a sample with mean of 12. 6

7 7

8 4 Error in the Mean Consider the following set of measurements. (a) Three measurements of the lifetime of an excited state resulting in a mean value of ns and standard deviation of ns (b) Ten measurements resulting in mean ns and standard deviation ns. (c) One Hundred measurements resulting in a mean of ns with standard deviation of ns All measurements agree that the mean is approximately 12 ns and the standard deviation is approximately 0.2 ns. Yet somehow we know that we trust the results from 100 measurements more than the results for just 3 measurements. What we want is an easy way to estimate the uncertainty or error in the value of the mean based on N measurements this is called the standard error in the mean and is defined as σ µ S E = σ (18) N Example From the three cases mentioned above we would get standard errors of 0.211/ 3 = 0.12 ns, 0.242/ 10 = ns, and 0.223/ 100 = ns, and we see that increasing the number of measurements reduces the standard error. Hence the lifetimes would be reported as ± 0.12 ns, ± ns, and ± ns. When you include either a standard deviation or a standard error, be sure to identify which it is in the text of the paper. 5 Chi-Square Test of Fit Many times we wish to compare experimental data to a predicted function. The chi-square (χ 2 ) statistic does this based on a histogram of data. We consider the case of N measurements of a quantity x. The quantity may be discrete, such as the number of heads obtained from tossing 40 coins, or it may be continuous. If it is continuous it must be binned. A bin is a range of values for x, and given some data our first task is to choose the size and boundaries of the bins.i know of no definitive technique for doing this: here are two methods mentioned Bin width = 2σ/ 3 N 8

9 Number of bins = ln N Suppose that there are n bins, and we know the predicted number of events for each bin. The χ 2 statistic is χ 2 = Observed Expected σ 2 Observed Expected Expected where we use the Poisson estimate of the standard deviation. Example 20 coins are tossed N = 40 times and the number of heads is recorded. On average the number of heads is 7.75, meaning the probability of getting a head is p = 7.75/20 = Using the Binomial Distribution, Equation (12), we can predict the distribution of heads. Number of Heads Times Observed Times Predicted Contribution to χ The sum of the last column is χ 2 = 7.50 Is this a good χ 2 or not? A full statistics treatment tells us that we can expect a value equal to the number of degrees of freedom that is defined as the number of bins (12 in our example) less the number of constraints in our fit, and that is 2 in our example the number of measurements and the mean number of heads. Since our χ 2 < (12 2) we trust that we have a good fit. (19) 6 Linear Least Squares Simplest Case Consider an independent variable x with no uncertainty and a dependent variable y that we expect to be related by the relation y = m x + b (20) 9

10 We take N measurements of (x i, y i ) pairs and want to find the best values for the slope m and intercept b. Each value y i has a corresponding standard deviation σ i. Define the deviation, y as and we can then form the χ 2 from the deviation, y = y i y(x i ) = y i mx i b (21) χ 2 = N (y i mx i b) 2 σ 2 i=1 i (22) For simplicity assume that all measurements have the same standard deviation. We choose to define the best fit as the choice of slope and intercept that minimizes this χ 2. There are two variables, so we minimize by setting partial derivatives to zero, χ 2 / m = 0, χ 2 / b = 0, and solving the resulting equations for m and b. χ 2 / m = 2 (y i mx i b)x i = 0 (23) or xi y i = m x 2 i + b x i (24) The other partial derivative results in yi = m x i + b N (25) where we use the fact that N i=1 1 = N. Solving the simultaneous equations results in our best fit parameters b = x 2 i yi x i xi y i N x 2 i ( x i ) 2 (26) and m = N x i y i x i yi N x 2 i ( x i ) 2 (27) This technique can easily be extended to a polynomial fit, just with more simultaneous equations, typically solved by using determinants. Likewise we can sometimes make a non-polynomial function look like a polynomial. For example, y = y 0 e kx (28) 10

11 can be written as ln y = ln y 0 kx (29) One subtlety is that while the standard deviation in y may be the same for all points, the standard deviation in ln y will not be the same for all points. Other functions cannot be linearized, and so the techniques described will not work. A simple example of such a non-linear function is y = a 1 + a 2 e a 4x + a 3 e a 5x (30) Finding ways to fit such functions is very difficult, and well beyond these notes. 11

Chapter 8: An Introduction to Probability and Statistics

Chapter 8: An Introduction to Probability and Statistics Course S3, 200 07 Chapter 8: An Introduction to Probability and Statistics This material is covered in the book: Erwin Kreyszig, Advanced Engineering Mathematics (9th edition) Chapter 24 (not including

More information

Chapter 5. Means and Variances

Chapter 5. Means and Variances 1 Chapter 5 Means and Variances Our discussion of probability has taken us from a simple classical view of counting successes relative to total outcomes and has brought us to the idea of a probability

More information

Introduction to Statistics and Error Analysis

Introduction to Statistics and Error Analysis Introduction to Statistics and Error Analysis Physics116C, 4/3/06 D. Pellett References: Data Reduction and Error Analysis for the Physical Sciences by Bevington and Robinson Particle Data Group notes

More information

STATISTICS OF OBSERVATIONS & SAMPLING THEORY. Parent Distributions

STATISTICS OF OBSERVATIONS & SAMPLING THEORY. Parent Distributions ASTR 511/O Connell Lec 6 1 STATISTICS OF OBSERVATIONS & SAMPLING THEORY References: Bevington Data Reduction & Error Analysis for the Physical Sciences LLM: Appendix B Warning: the introductory literature

More information

Statistical Methods for Astronomy

Statistical Methods for Astronomy Statistical Methods for Astronomy Probability (Lecture 1) Statistics (Lecture 2) Why do we need statistics? Useful Statistics Definitions Error Analysis Probability distributions Error Propagation Binomial

More information

Chapter 5: HYPOTHESIS TESTING

Chapter 5: HYPOTHESIS TESTING MATH411: Applied Statistics Dr. YU, Chi Wai Chapter 5: HYPOTHESIS TESTING 1 WHAT IS HYPOTHESIS TESTING? As its name indicates, it is about a test of hypothesis. To be more precise, we would first translate

More information

Treatment of Error in Experimental Measurements

Treatment of Error in Experimental Measurements in Experimental Measurements All measurements contain error. An experiment is truly incomplete without an evaluation of the amount of error in the results. In this course, you will learn to use some common

More information

Physics 6720 Introduction to Statistics April 4, 2017

Physics 6720 Introduction to Statistics April 4, 2017 Physics 6720 Introduction to Statistics April 4, 2017 1 Statistics of Counting Often an experiment yields a result that can be classified according to a set of discrete events, giving rise to an integer

More information

Chapter 6 Continuous Probability Distributions

Chapter 6 Continuous Probability Distributions Math 3 Chapter 6 Continuous Probability Distributions The observations generated by different statistical experiments have the same general type of behavior. The followings are the probability distributions

More information

Measurements and Data Analysis

Measurements and Data Analysis Measurements and Data Analysis 1 Introduction The central point in experimental physical science is the measurement of physical quantities. Experience has shown that all measurements, no matter how carefully

More information

COUNTING ERRORS AND STATISTICS RCT STUDY GUIDE Identify the five general types of radiation measurement errors.

COUNTING ERRORS AND STATISTICS RCT STUDY GUIDE Identify the five general types of radiation measurement errors. LEARNING OBJECTIVES: 2.03.01 Identify the five general types of radiation measurement errors. 2.03.02 Describe the effect of each source of error on radiation measurements. 2.03.03 State the two purposes

More information

Discrete and continuous

Discrete and continuous Discrete and continuous A curve, or a function, or a range of values of a variable, is discrete if it has gaps in it - it jumps from one value to another. In practice in S2 discrete variables are variables

More information

Statistical Methods in Particle Physics

Statistical Methods in Particle Physics Statistical Methods in Particle Physics Lecture 3 October 29, 2012 Silvia Masciocchi, GSI Darmstadt s.masciocchi@gsi.de Winter Semester 2012 / 13 Outline Reminder: Probability density function Cumulative

More information

Introduction and Overview STAT 421, SP Course Instructor

Introduction and Overview STAT 421, SP Course Instructor Introduction and Overview STAT 421, SP 212 Prof. Prem K. Goel Mon, Wed, Fri 3:3PM 4:48PM Postle Hall 118 Course Instructor Prof. Goel, Prem E mail: goel.1@osu.edu Office: CH 24C (Cockins Hall) Phone: 614

More information

Statistics, Probability Distributions & Error Propagation. James R. Graham

Statistics, Probability Distributions & Error Propagation. James R. Graham Statistics, Probability Distributions & Error Propagation James R. Graham Sample & Parent Populations Make measurements x x In general do not expect x = x But as you take more and more measurements a pattern

More information

Originality in the Arts and Sciences: Lecture 2: Probability and Statistics

Originality in the Arts and Sciences: Lecture 2: Probability and Statistics Originality in the Arts and Sciences: Lecture 2: Probability and Statistics Let s face it. Statistics has a really bad reputation. Why? 1. It is boring. 2. It doesn t make a lot of sense. Actually, the

More information

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages: Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the

More information

Chapter (4) Discrete Probability Distributions Examples

Chapter (4) Discrete Probability Distributions Examples Chapter (4) Discrete Probability Distributions Examples Example () Two balanced dice are rolled. Let X be the sum of the two dice. Obtain the probability distribution of X. Solution When the two balanced

More information

MTH302 Quiz # 4. Solved By When a coin is tossed once, the probability of getting head is. Select correct option:

MTH302 Quiz # 4. Solved By When a coin is tossed once, the probability of getting head is. Select correct option: MTH302 Quiz # 4 Solved By konenuchiha@gmail.com When a coin is tossed once, the probability of getting head is. 0.55 0.52 0.50 (1/2) 0.51 Suppose the slope of regression line is 20 and the intercept is

More information

Statistics and data analyses

Statistics and data analyses Statistics and data analyses Designing experiments Measuring time Instrumental quality Precision Standard deviation depends on Number of measurements Detection quality Systematics and methology σ tot =

More information

Probability Distributions - Lecture 5

Probability Distributions - Lecture 5 Probability Distributions - Lecture 5 1 Introduction There are a number of mathematical models of probability density functions that represent the behavior of physical systems. In this lecture we explore

More information

X = X X n, + X 2

X = X X n, + X 2 CS 70 Discrete Mathematics for CS Fall 2003 Wagner Lecture 22 Variance Question: At each time step, I flip a fair coin. If it comes up Heads, I walk one step to the right; if it comes up Tails, I walk

More information

MASSACHUSETTS INSTITUTE OF TECHNOLOGY PHYSICS DEPARTMENT

MASSACHUSETTS INSTITUTE OF TECHNOLOGY PHYSICS DEPARTMENT G. Clark 7oct96 1 MASSACHUSETTS INSTITUTE OF TECHNOLOGY PHYSICS DEPARTMENT 8.13/8.14 Junior Laboratory STATISTICS AND ERROR ESTIMATION The purpose of this note is to explain the application of statistics

More information

Probability Distribution

Probability Distribution Economic Risk and Decision Analysis for Oil and Gas Industry CE81.98 School of Engineering and Technology Asian Institute of Technology January Semester Presented by Dr. Thitisak Boonpramote Department

More information

STAT2201. Analysis of Engineering & Scientific Data. Unit 3

STAT2201. Analysis of Engineering & Scientific Data. Unit 3 STAT2201 Analysis of Engineering & Scientific Data Unit 3 Slava Vaisman The University of Queensland School of Mathematics and Physics What we learned in Unit 2 (1) We defined a sample space of a random

More information

Introduction to Statistics and Error Analysis II

Introduction to Statistics and Error Analysis II Introduction to Statistics and Error Analysis II Physics116C, 4/14/06 D. Pellett References: Data Reduction and Error Analysis for the Physical Sciences by Bevington and Robinson Particle Data Group notes

More information

Introduction to Error Analysis

Introduction to Error Analysis Introduction to Error Analysis Part 1: the Basics Andrei Gritsan based on lectures by Petar Maksimović February 1, 2010 Overview Definitions Reporting results and rounding Accuracy vs precision systematic

More information

Chapter 2: Statistical Methods. 4. Total Measurement System and Errors. 2. Characterizing statistical distribution. 3. Interpretation of Results

Chapter 2: Statistical Methods. 4. Total Measurement System and Errors. 2. Characterizing statistical distribution. 3. Interpretation of Results 36 Chapter : Statistical Methods 1. Introduction. Characterizing statistical distribution 3. Interpretation of Results 4. Total Measurement System and Errors 5. Regression Analysis 37 1.Introduction The

More information

CSE 103 Homework 8: Solutions November 30, var(x) = np(1 p) = P r( X ) 0.95 P r( X ) 0.

CSE 103 Homework 8: Solutions November 30, var(x) = np(1 p) = P r( X ) 0.95 P r( X ) 0. () () a. X is a binomial distribution with n = 000, p = /6 b. The expected value, variance, and standard deviation of X is: E(X) = np = 000 = 000 6 var(x) = np( p) = 000 5 6 666 stdev(x) = np( p) = 000

More information

Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur

Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur Lecture No. # 36 Sampling Distribution and Parameter Estimation

More information

Statistics Boot Camp. Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018

Statistics Boot Camp. Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018 Statistics Boot Camp Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018 March 21, 2018 Outline of boot camp Summarizing and simplifying data Point and interval estimation Foundations of statistical

More information

Fourier and Stats / Astro Stats and Measurement : Stats Notes

Fourier and Stats / Astro Stats and Measurement : Stats Notes Fourier and Stats / Astro Stats and Measurement : Stats Notes Andy Lawrence, University of Edinburgh Autumn 2013 1 Probabilities, distributions, and errors Laplace once said Probability theory is nothing

More information

If we want to analyze experimental or simulated data we might encounter the following tasks:

If we want to analyze experimental or simulated data we might encounter the following tasks: Chapter 1 Introduction If we want to analyze experimental or simulated data we might encounter the following tasks: Characterization of the source of the signal and diagnosis Studying dependencies Prediction

More information

Statistical Methods in Particle Physics

Statistical Methods in Particle Physics Statistical Methods in Particle Physics. Probability Distributions Prof. Dr. Klaus Reygers (lectures) Dr. Sebastian Neubert (tutorials) Heidelberg University WS 07/8 Gaussian g(x; µ, )= p exp (x µ) https://en.wikipedia.org/wiki/normal_distribution

More information

Chapter 2 Random Variables

Chapter 2 Random Variables Stochastic Processes Chapter 2 Random Variables Prof. Jernan Juang Dept. of Engineering Science National Cheng Kung University Prof. Chun-Hung Liu Dept. of Electrical and Computer Eng. National Chiao Tung

More information

MA 1125 Lecture 33 - The Sign Test. Monday, December 4, Objectives: Introduce an example of a non-parametric test.

MA 1125 Lecture 33 - The Sign Test. Monday, December 4, Objectives: Introduce an example of a non-parametric test. MA 1125 Lecture 33 - The Sign Test Monday, December 4, 2017 Objectives: Introduce an example of a non-parametric test. For the last topic of the semester we ll look at an example of a non-parametric test.

More information

Unit 4 Probability. Dr Mahmoud Alhussami

Unit 4 Probability. Dr Mahmoud Alhussami Unit 4 Probability Dr Mahmoud Alhussami Probability Probability theory developed from the study of games of chance like dice and cards. A process like flipping a coin, rolling a die or drawing a card from

More information

Probability theory basics

Probability theory basics Probability theory basics Michael Franke Basics of probability theory: axiomatic definition, interpretation, joint distributions, marginalization, conditional probability & Bayes rule. Random variables:

More information

Probability Density Functions

Probability Density Functions Statistical Methods in Particle Physics / WS 13 Lecture II Probability Density Functions Niklaus Berger Physics Institute, University of Heidelberg Recap of Lecture I: Kolmogorov Axioms Ingredients: Set

More information

System Identification

System Identification System Identification Arun K. Tangirala Department of Chemical Engineering IIT Madras July 27, 2013 Module 3 Lecture 1 Arun K. Tangirala System Identification July 27, 2013 1 Objectives of this Module

More information

CONTINUOUS RANDOM VARIABLES

CONTINUOUS RANDOM VARIABLES the Further Mathematics network www.fmnetwork.org.uk V 07 REVISION SHEET STATISTICS (AQA) CONTINUOUS RANDOM VARIABLES The main ideas are: Properties of Continuous Random Variables Mean, Median and Mode

More information

EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix)

EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix) 1 EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix) Taisuke Otsu London School of Economics Summer 2018 A.1. Summation operator (Wooldridge, App. A.1) 2 3 Summation operator For

More information

What students need to know for... Functions, Statistics & Trigonometry (FST)

What students need to know for... Functions, Statistics & Trigonometry (FST) What students need to know for... Functions, Statistics & Trigonometry (FST) 2018-2019 NAME: This is a MANDATORY assignment that will be GRADED. It is due the first day of the course. Your teacher will

More information

Part 3: Parametric Models

Part 3: Parametric Models Part 3: Parametric Models Matthew Sperrin and Juhyun Park August 19, 2008 1 Introduction There are three main objectives to this section: 1. To introduce the concepts of probability and random variables.

More information

Probability and Statistics

Probability and Statistics Probability and Statistics Kristel Van Steen, PhD 2 Montefiore Institute - Systems and Modeling GIGA - Bioinformatics ULg kristel.vansteen@ulg.ac.be CHAPTER 4: IT IS ALL ABOUT DATA 4a - 1 CHAPTER 4: IT

More information

3/30/2009. Probability Distributions. Binomial distribution. TI-83 Binomial Probability

3/30/2009. Probability Distributions. Binomial distribution. TI-83 Binomial Probability Random variable The outcome of each procedure is determined by chance. Probability Distributions Normal Probability Distribution N Chapter 6 Discrete Random variables takes on a countable number of values

More information

Chapter 4: An Introduction to Probability and Statistics

Chapter 4: An Introduction to Probability and Statistics Chapter 4: An Introduction to Probability and Statistics 4. Probability The simplest kinds of probabilities to understand are reflected in everyday ideas like these: (i) if you toss a coin, the probability

More information

Topic 3 Random variables, expectation, and variance, II

Topic 3 Random variables, expectation, and variance, II CSE 103: Probability and statistics Fall 2010 Topic 3 Random variables, expectation, and variance, II 3.1 Linearity of expectation If you double each value of X, then you also double its average; that

More information

PHYS 114 Exam 1 Answer Key NAME:

PHYS 114 Exam 1 Answer Key NAME: PHYS 4 Exam Answer Key AME: Please answer all of the questions below. Each part of each question is worth points, except question 5, which is worth 0 points.. Explain what the following MatLAB commands

More information

Algebra Calculator Skills Inventory Solutions

Algebra Calculator Skills Inventory Solutions Algebra Calculator Skills Inventory Solutions 1. The equation P = 1.25x 15 represents the profit in dollars when x widgets are sold. Find the profit if 450 widgets are sold. A. $427.50 B. $697.50 C. $562.50

More information

The Central Limit Theorem

The Central Limit Theorem The Central Limit Theorem Suppose n tickets are drawn at random with replacement from a box of numbered tickets. The central limit theorem says that when the probability histogram for the sum of the draws

More information

Practice Problems Section Problems

Practice Problems Section Problems Practice Problems Section 4-4-3 4-4 4-5 4-6 4-7 4-8 4-10 Supplemental Problems 4-1 to 4-9 4-13, 14, 15, 17, 19, 0 4-3, 34, 36, 38 4-47, 49, 5, 54, 55 4-59, 60, 63 4-66, 68, 69, 70, 74 4-79, 81, 84 4-85,

More information

Bus 216: Business Statistics II Introduction Business statistics II is purely inferential or applied statistics.

Bus 216: Business Statistics II Introduction Business statistics II is purely inferential or applied statistics. Bus 216: Business Statistics II Introduction Business statistics II is purely inferential or applied statistics. Study Session 1 1. Random Variable A random variable is a variable that assumes numerical

More information

CENTRAL LIMIT THEOREM (CLT)

CENTRAL LIMIT THEOREM (CLT) CENTRAL LIMIT THEOREM (CLT) A sampling distribution is the probability distribution of the sample statistic that is formed when samples of size n are repeatedly taken from a population. If the sample statistic

More information

ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 2 MATH00040 SEMESTER / Probability

ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 2 MATH00040 SEMESTER / Probability ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 2 MATH00040 SEMESTER 2 2017/2018 DR. ANTHONY BROWN 5.1. Introduction to Probability. 5. Probability You are probably familiar with the elementary

More information

Statistics notes. A clear statistical framework formulates the logic of what we are doing and why. It allows us to make precise statements.

Statistics notes. A clear statistical framework formulates the logic of what we are doing and why. It allows us to make precise statements. Statistics notes Introductory comments These notes provide a summary or cheat sheet covering some basic statistical recipes and methods. These will be discussed in more detail in the lectures! What is

More information

FE 490 Engineering Probability and Statistics. Donald E.K. Martin Department of Statistics

FE 490 Engineering Probability and Statistics. Donald E.K. Martin Department of Statistics FE 490 Engineering Probability and Statistics Donald E.K. Martin Department of Statistics 1 Dispersion, Mean, Mode 1. The population standard deviation of the data points 2,1,6 is: (A) 1.00 (B) 1.52 (C)

More information

Practical Statistics

Practical Statistics Practical Statistics Lecture 1 (Nov. 9): - Correlation - Hypothesis Testing Lecture 2 (Nov. 16): - Error Estimation - Bayesian Analysis - Rejecting Outliers Lecture 3 (Nov. 18) - Monte Carlo Modeling -

More information

8: Statistical Distributions

8: Statistical Distributions : Statistical Distributions The Uniform Distribution 1 The Normal Distribution The Student Distribution Sample Calculations The Central Limit Theory Calculations with Samples Histograms & the Normal Distribution

More information

the probability of getting either heads or tails must be 1 (excluding the remote possibility of getting it to land on its edge).

the probability of getting either heads or tails must be 1 (excluding the remote possibility of getting it to land on its edge). Probability One of the most useful and intriguing aspects of quantum mechanics is the Heisenberg Uncertainty Principle. Before I get to it however, we need some initial comments on probability. Let s first

More information

STT 315 Problem Set #3

STT 315 Problem Set #3 1. A student is asked to calculate the probability that x = 3.5 when x is chosen from a normal distribution with the following parameters: mean=3, sd=5. To calculate the answer, he uses this command: >

More information

Math/Stat 352 Lecture 10. Section 4.11 The Central Limit Theorem

Math/Stat 352 Lecture 10. Section 4.11 The Central Limit Theorem Math/Stat 352 Lecture 10 Section 4.11 The Central Limit Theorem 1 Summing random variables Summing random variables Summing random variables Generally summation changes the shape of the distribution: range

More information

Math489/889 Stochastic Processes and Advanced Mathematical Finance Solutions for Homework 7

Math489/889 Stochastic Processes and Advanced Mathematical Finance Solutions for Homework 7 Math489/889 Stochastic Processes and Advanced Mathematical Finance Solutions for Homework 7 Steve Dunbar Due Mon, November 2, 2009. Time to review all of the information we have about coin-tossing fortunes

More information

Poisson distribution and χ 2 (Chap 11-12)

Poisson distribution and χ 2 (Chap 11-12) Poisson distribution and χ 2 (Chap 11-12) Announcements: Last lecture today! Labs will continue. Homework assignment will be posted tomorrow or Thursday (I will send email) and is due Thursday, February

More information

OPIM 303, Managerial Statistics H Guy Williams, 2006

OPIM 303, Managerial Statistics H Guy Williams, 2006 OPIM 303 Lecture 6 Page 1 The height of the uniform distribution is given by 1 b a Being a Continuous distribution the probability of an exact event is zero: 2 0 There is an infinite number of points in

More information

Discrete Mathematics for CS Spring 2007 Luca Trevisan Lecture 20

Discrete Mathematics for CS Spring 2007 Luca Trevisan Lecture 20 CS 70 Discrete Mathematics for CS Spring 2007 Luca Trevisan Lecture 20 Today we shall discuss a measure of how close a random variable tends to be to its expectation. But first we need to see how to compute

More information

Math 10 - Compilation of Sample Exam Questions + Answers

Math 10 - Compilation of Sample Exam Questions + Answers Math 10 - Compilation of Sample Exam Questions + Sample Exam Question 1 We have a population of size N. Let p be the independent probability of a person in the population developing a disease. Answer the

More information

Statistical Data Analysis Stat 3: p-values, parameter estimation

Statistical Data Analysis Stat 3: p-values, parameter estimation Statistical Data Analysis Stat 3: p-values, parameter estimation London Postgraduate Lectures on Particle Physics; University of London MSci course PH4515 Glen Cowan Physics Department Royal Holloway,

More information

Lab 0 Appendix C L0-1 APPENDIX C ACCURACY OF MEASUREMENTS AND TREATMENT OF EXPERIMENTAL UNCERTAINTY

Lab 0 Appendix C L0-1 APPENDIX C ACCURACY OF MEASUREMENTS AND TREATMENT OF EXPERIMENTAL UNCERTAINTY Lab 0 Appendix C L0-1 APPENDIX C ACCURACY OF MEASUREMENTS AND TREATMENT OF EXPERIMENTAL UNCERTAINTY A measurement whose accuracy is unknown has no use whatever. It is therefore necessary to know how to

More information

11/16/2017. Chapter. Copyright 2009 by The McGraw-Hill Companies, Inc. 7-2

11/16/2017. Chapter. Copyright 2009 by The McGraw-Hill Companies, Inc. 7-2 7 Chapter Continuous Probability Distributions Describing a Continuous Distribution Uniform Continuous Distribution Normal Distribution Normal Approximation to the Binomial Normal Approximation to the

More information

Physics 403 Probability Distributions II: More Properties of PDFs and PMFs

Physics 403 Probability Distributions II: More Properties of PDFs and PMFs Physics 403 Probability Distributions II: More Properties of PDFs and PMFs Segev BenZvi Department of Physics and Astronomy University of Rochester Table of Contents 1 Last Time: Common Probability Distributions

More information

Probability Distributions

Probability Distributions Probability Distributions Probability This is not a math class, or an applied math class, or a statistics class; but it is a computer science course! Still, probability, which is a math-y concept underlies

More information

Parameter Estimation and Fitting to Data

Parameter Estimation and Fitting to Data Parameter Estimation and Fitting to Data Parameter estimation Maximum likelihood Least squares Goodness-of-fit Examples Elton S. Smith, Jefferson Lab 1 Parameter estimation Properties of estimators 3 An

More information

PHYS 275 Experiment 2 Of Dice and Distributions

PHYS 275 Experiment 2 Of Dice and Distributions PHYS 275 Experiment 2 Of Dice and Distributions Experiment Summary Today we will study the distribution of dice rolling results Two types of measurement, not to be confused: frequency with which we obtain

More information

1 Some Statistical Basics.

1 Some Statistical Basics. Q Some Statistical Basics. Statistics treats random errors. (There are also systematic errors e.g., if your watch is 5 minutes fast, you will always get the wrong time, but it won t be random.) The two

More information

CS 361: Probability & Statistics

CS 361: Probability & Statistics February 19, 2018 CS 361: Probability & Statistics Random variables Markov s inequality This theorem says that for any random variable X and any value a, we have A random variable is unlikely to have an

More information

Uncertainty in Physical Measurements: Module 5 Data with Two Variables

Uncertainty in Physical Measurements: Module 5 Data with Two Variables : Module 5 Data with Two Variables Often data have two variables, such as the magnitude of the force F exerted on an object and the object s acceleration a. In this Module we will examine some ways to

More information

MAT Mathematics in Today's World

MAT Mathematics in Today's World MAT 1000 Mathematics in Today's World Last Time We discussed the four rules that govern probabilities: 1. Probabilities are numbers between 0 and 1 2. The probability an event does not occur is 1 minus

More information

Statistical Methods for Astronomy

Statistical Methods for Astronomy Statistical Methods for Astronomy If your experiment needs statistics, you ought to have done a better experiment. -Ernest Rutherford Lecture 1 Lecture 2 Why do we need statistics? Definitions Statistical

More information

Multiple Random Variables

Multiple Random Variables Multiple Random Variables Joint Probability Density Let X and Y be two random variables. Their joint distribution function is F ( XY x, y) P X x Y y. F XY ( ) 1, < x

More information

Week 1 Quantitative Analysis of Financial Markets Distributions A

Week 1 Quantitative Analysis of Financial Markets Distributions A Week 1 Quantitative Analysis of Financial Markets Distributions A Christopher Ting http://www.mysmu.edu/faculty/christophert/ Christopher Ting : christopherting@smu.edu.sg : 6828 0364 : LKCSB 5036 October

More information

Arkansas Tech University MATH 3513: Applied Statistics I Dr. Marcel B. Finan

Arkansas Tech University MATH 3513: Applied Statistics I Dr. Marcel B. Finan 2.4 Random Variables Arkansas Tech University MATH 3513: Applied Statistics I Dr. Marcel B. Finan By definition, a random variable X is a function with domain the sample space and range a subset of the

More information

(A) Incorrect! A parameter is a number that describes the population. (C) Incorrect! In a Random Sample, not just a sample.

(A) Incorrect! A parameter is a number that describes the population. (C) Incorrect! In a Random Sample, not just a sample. AP Statistics - Problem Drill 15: Sampling Distributions No. 1 of 10 Instructions: (1) Read the problem statement and answer choices carefully (2) Work the problems on paper 1. Which one of the following

More information

Part 8: GLMs and Hierarchical LMs and GLMs

Part 8: GLMs and Hierarchical LMs and GLMs Part 8: GLMs and Hierarchical LMs and GLMs 1 Example: Song sparrow reproductive success Arcese et al., (1992) provide data on a sample from a population of 52 female song sparrows studied over the course

More information

Chapter 8. Some Approximations to Probability Distributions: Limit Theorems

Chapter 8. Some Approximations to Probability Distributions: Limit Theorems Chapter 8. Some Approximations to Probability Distributions: Limit Theorems Sections 8.2 -- 8.3: Convergence in Probability and in Distribution Jiaping Wang Department of Mathematical Science 04/22/2013,

More information

MATH 3670 First Midterm February 17, No books or notes. No cellphone or wireless devices. Write clearly and show your work for every answer.

MATH 3670 First Midterm February 17, No books or notes. No cellphone or wireless devices. Write clearly and show your work for every answer. No books or notes. No cellphone or wireless devices. Write clearly and show your work for every answer. Name: Question: 1 2 3 4 Total Points: 30 20 20 40 110 Score: 1. The following numbers x i, i = 1,...,

More information

ΔP(x) Δx. f "Discrete Variable x" (x) dp(x) dx. (x) f "Continuous Variable x" Module 3 Statistics. I. Basic Statistics. A. Statistics and Physics

ΔP(x) Δx. f Discrete Variable x (x) dp(x) dx. (x) f Continuous Variable x Module 3 Statistics. I. Basic Statistics. A. Statistics and Physics Module 3 Statistics I. Basic Statistics A. Statistics and Physics 1. Why Statistics Up to this point, your courses in physics and engineering have considered systems from a macroscopic point of view. For

More information

Algebra I. Mathematics Curriculum Framework. Revised 2004 Amended 2006

Algebra I. Mathematics Curriculum Framework. Revised 2004 Amended 2006 Algebra I Mathematics Curriculum Framework Revised 2004 Amended 2006 Course Title: Algebra I Course/Unit Credit: 1 Course Number: Teacher Licensure: Secondary Mathematics Grades: 9-12 Algebra I These are

More information

Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z).

Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z). Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z). For example P(X.04) =.8508. For z < 0 subtract the value from,

More information

Lecture 8 Sampling Theory

Lecture 8 Sampling Theory Lecture 8 Sampling Theory Thais Paiva STA 111 - Summer 2013 Term II July 11, 2013 1 / 25 Thais Paiva STA 111 - Summer 2013 Term II Lecture 8, 07/11/2013 Lecture Plan 1 Sampling Distributions 2 Law of Large

More information

UC Berkeley Department of Electrical Engineering and Computer Science. EE 126: Probablity and Random Processes. Solutions 5 Spring 2006

UC Berkeley Department of Electrical Engineering and Computer Science. EE 126: Probablity and Random Processes. Solutions 5 Spring 2006 Review problems UC Berkeley Department of Electrical Engineering and Computer Science EE 6: Probablity and Random Processes Solutions 5 Spring 006 Problem 5. On any given day your golf score is any integer

More information

are the objects described by a set of data. They may be people, animals or things.

are the objects described by a set of data. They may be people, animals or things. ( c ) E p s t e i n, C a r t e r a n d B o l l i n g e r 2016 C h a p t e r 5 : E x p l o r i n g D a t a : D i s t r i b u t i o n s P a g e 1 CHAPTER 5: EXPLORING DATA DISTRIBUTIONS 5.1 Creating Histograms

More information

Binomial random variable

Binomial random variable Binomial random variable Toss a coin with prob p of Heads n times X: # Heads in n tosses X is a Binomial random variable with parameter n,p. X is Bin(n, p) An X that counts the number of successes in many

More information

MFM Practitioner Module: Quantitative Risk Management. John Dodson. September 23, 2015

MFM Practitioner Module: Quantitative Risk Management. John Dodson. September 23, 2015 MFM Practitioner Module: Quantitative Risk Management September 23, 2015 Mixtures Mixtures Mixtures Definitions For our purposes, A random variable is a quantity whose value is not known to us right now

More information

Statistics for Data Analysis. Niklaus Berger. PSI Practical Course Physics Institute, University of Heidelberg

Statistics for Data Analysis. Niklaus Berger. PSI Practical Course Physics Institute, University of Heidelberg Statistics for Data Analysis PSI Practical Course 2014 Niklaus Berger Physics Institute, University of Heidelberg Overview You are going to perform a data analysis: Compare measured distributions to theoretical

More information

A Measurement of Randomness in Coin Tossing

A Measurement of Randomness in Coin Tossing PHYS 0040 Brown University Spring 2011 Department of Physics Abstract A Measurement of Randomness in Coin Tossing A single penny was flipped by hand to obtain 100 readings and the results analyzed to check

More information

Describing distributions with numbers

Describing distributions with numbers Describing distributions with numbers A large number or numerical methods are available for describing quantitative data sets. Most of these methods measure one of two data characteristics: The central

More information

Inferential Statistics

Inferential Statistics Inferential Statistics Part 1 Sampling Distributions, Point Estimates & Confidence Intervals Inferential statistics are used to draw inferences (make conclusions/judgements) about a population from a sample.

More information

Definition: A random variable X is a real valued function that maps a sample space S into the space of real numbers R. X : S R

Definition: A random variable X is a real valued function that maps a sample space S into the space of real numbers R. X : S R Random Variables Definition: A random variable X is a real valued function that maps a sample space S into the space of real numbers R. X : S R As such, a random variable summarizes the outcome of an experiment

More information

1 Probability Distributions

1 Probability Distributions 1 Probability Distributions In the chapter about descriptive statistics sample data were discussed, and tools introduced for describing the samples with numbers as well as with graphs. In this chapter

More information