Probability and statistics: basic terms

Similar documents
Lecture 7: Properties of Random Samples

Closed book and notes. No calculators. 60 minutes, but essentially unlimited time.

IE 230 Seat # Name < KEY > Please read these directions. Closed book and notes. 60 minutes.

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators

IE 230 Probability & Statistics in Engineering I. Closed book and notes. No calculators. 120 minutes.

NOTES ON DISTRIBUTIONS

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering

4. Basic probability theory

Modeling and Performance Analysis with Discrete-Event Simulation

Joint Probability Distributions and Random Samples. Jointly Distributed Random Variables. Chapter { }


Lecture 18: Sampling distributions

Chapter 6 Sampling Distributions

Distribution of Random Samples & Limit theorems

4. Partial Sums and the Central Limit Theorem

Binomial Distribution

Random Variables, Sampling and Estimation

Quick Review of Probability

Simulation. Two Rule For Inverting A Distribution Function

EE 4TM4: Digital Communications II Probability Theory

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen)

Basis for simulation techniques

Quick Review of Probability

Exam II Review. CEE 3710 November 15, /16/2017. EXAM II Friday, November 17, in class. Open book and open notes.

Direction: This test is worth 150 points. You are required to complete this test within 55 minutes.

Probability and Statistics

[ ] ( ) ( ) [ ] ( ) 1 [ ] [ ] Sums of Random Variables Y = a 1 X 1 + a 2 X 2 + +a n X n The expected value of Y is:

Parameter, Statistic and Random Samples

AMS570 Lecture Notes #2

Economics 250 Assignment 1 Suggested Answers. 1. We have the following data set on the lengths (in minutes) of a sample of long-distance phone calls

CH5. Discrete Probability Distributions

Chapter 6 Principles of Data Reduction

Discrete probability distributions

This section is optional.

Stat 319 Theory of Statistics (2) Exercises

7.1 Convergence of sequences of random variables

Parameter, Statistic and Random Samples

Chapter 2 The Monte Carlo Method

17. Joint distributions of extreme order statistics Lehmann 5.1; Ferguson 15

Lecture 5. Random variable and distribution of probability

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.

Limit Theorems. Convergence in Probability. Let X be the number of heads observed in n tosses. Then, E[X] = np and Var[X] = np(1-p).

Module 1 Fundamentals in statistics

Lecture 4. Random variable and distribution of probability

7.1 Convergence of sequences of random variables

Last time: Moments of the Poisson distribution from its generating function. Example: Using telescope to measure intensity of an object

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

It is always the case that unions, intersections, complements, and set differences are preserved by the inverse image of a function.

Probability and Distributions. A Brief Introduction

Econ 325: Introduction to Empirical Economics

Lecture 1 Probability and Statistics

PROBABILITY DISTRIBUTION RELATIONSHIPS. Y.H. Abdelkader, Z.A. Al-Marzouq 1. INTRODUCTION

Handout #5. Discrete Random Variables and Probability Distributions

Introduction to Extreme Value Theory Laurens de Haan, ISM Japan, Erasmus University Rotterdam, NL University of Lisbon, PT

Statisticians use the word population to refer the total number of (potential) observations under consideration

Expectation and Variance of a random variable

Topic 10: Introduction to Estimation

(6) Fundamental Sampling Distribution and Data Discription

Stat 421-SP2012 Interval Estimation Section

Lecture 1 Probability and Statistics

Lecture 2: Poisson Sta*s*cs Probability Density Func*ons Expecta*on and Variance Es*mators

Empirical Distributions

Introduction to Probability and Statistics Twelfth Edition

Summarizing Data. Major Properties of Numerical Data

Approximations and more PMFs and PDFs

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara

Lecture 2: Monte Carlo Simulation

An Introduction to Randomized Algorithms

The variance of a sum of independent variables is the sum of their variances, since covariances are zero. Therefore. V (xi )= n n 2 σ2 = σ2.

Confidence interval for the two-parameter exponentiated Gumbel distribution based on record values

The Random Walk For Dummies

Some Basic Probability Concepts. 2.1 Experiments, Outcomes and Random Variables

IIT JAM Mathematical Statistics (MS) 2006 SECTION A

1.010 Uncertainty in Engineering Fall 2008

6. Sufficient, Complete, and Ancillary Statistics

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.

B Supplemental Notes 2 Hypergeometric, Binomial, Poisson and Multinomial Random Variables and Borel Sets

STAT 515 fa 2016 Lec Sampling distribution of the mean, part 2 (central limit theorem)

Access to the published version may require journal subscription. Published with permission from: Elsevier.

Chapter 2 Descriptive Statistics

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting

5. Likelihood Ratio Tests

Department of Civil Engineering-I.I.T. Delhi CEL 899: Environmental Risk Assessment HW5 Solution

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.

Median and IQR The median is the value which divides the ordered data values in half.

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

NANYANG TECHNOLOGICAL UNIVERSITY SYLLABUS FOR ENTRANCE EXAMINATION FOR INTERNATIONAL STUDENTS AO-LEVEL MATHEMATICS

f(x i ; ) L(x; p) = i=1 To estimate the value of that maximizes L or equivalently ln L we will set =0, for i =1, 2,...,m p x i (1 p) 1 x i i=1

Statistics 511 Additional Materials

32 estimating the cumulative distribution function

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting

AMS 216 Stochastic Differential Equations Lecture 02 Copyright by Hongyun Wang, UCSC ( ( )) 2 = E X 2 ( ( )) 2

Final Review for MATH 3510

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

Lecture 19: Convergence

Sampling Distributions, Z-Tests, Power

Bayesian Methods: Introduction to Multi-parameter Models

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

Transcription:

Probability ad statistics: basic terms M. Veeraraghava August 203 A radom variable is a rule that assigs a umerical value to each possible outcome of a experimet. Outcomes of a experimet form the sample space S. Defiitio: A radom variable X o a sample space S is a fuctio X : S R that assigs a real umber X( s) to each sample poit s S. We defie the evet space A x to be the subset of S to which the radom variable X assigs the value x. A x { s S X( s) x} () Example: Take 3 Beroulli trials. Sample space has 8 possible outcomes: 000, 00, 00, 00, 0, 0, 0,. But if we oly iterested i the umber of successes, the 00, 00, 00 sample poits map to the value (for umber of s withi the three trials). Similarly, 0, 0, 0 map to 2. The evet space has oly 4 outcomes 0,, 2, 3 istead of the origial sample space which has 8 poits. Typically, we are oly iterested i the evet space rather tha i the whole sample space S if our iterest lies oly i the experimetal values of the rv X. A x Discrete radom variables: A radom variable is discrete if it ca take o values from a discrete set of umbers. Probability mass fuctio (pmf): P( A x ) P( s) P[ X x] p X X( s) x (2)

Probability distributio fuctio or Cumulative distributio fuctio (cdf): F X ( t) P( X t) p X x t (3) Ofte we will see the statemet, X is a discrete rv with pmf P X with o hit or metio of the sample space. Kow the pmf, cdf, mea (expected value) ad variace for the followig types of discrete radom variables:. Beroulli - oe parameter p (probability of success) A discrete radom variable X that is either 0 or has the followig pmf: p X ( 0) p q Mea is p ad variace is pq. 2. Biomial - cout umber of successes i idepedet trials with p as probability of success of each trial - two parameters ad p. Its pmf is as follows: Mea is p ad variace is p( p) or pq. 3. Geometric - cout umber of trials upto first success (while biomial couts umber of successes) - oe parameter p p X ( k) p X ( ) p k k p ( p) 0 k k 0 otherwise p X ( i) p( p) i for i, 2, F X ( t) ( p) t for ( t 0) ad (4) (5) 4. Negative biomial - cout umber of trials util r th success - two parameters p ad r. 5. Poisso distributio: A discrete radom variable X has a Poisso distributio with parameter λ ( λ > 0 ). The pmf o is e λ λ x p X ------------ for x 0,, 2, x! Mea ad variace are both λ (6) 2

6. Hypergeometric - samplig without replacemet while biomial is samplig with replacemet; N compoets of which d are defective. I a sample of compoets what is the probability that k are defective. 7. Uiform - oe parameter - rage ; discrete versio: Mea is ( + ) 2 ad variace is ( 2 ) 2. 8. Costat - oe parameter c p X ( i) 9. Zipf - log-tailed discrete-value distributio -- for i. Cotiuous radom variables: Probability desity fuctio (pdf): df X ----------------- dx Probability distributio fuctio or Cumulative distributio fuctio (cdf): (7) F X ( t) P( X t) ( r) dr Kow the pdf, cdf, mea (expected value) ad variace for the followig types of radom variables: t (8). Expoetial - parameter λ F e λx if( 0 x < ) 0 otherwise (9) f if( x > 0) λe λx 0 otherwise (0) E[ X] -- λ () Var[ X] ---- λ 2 (2) 3

2. Logormal [Devore] A oegative radom variable X is said to have a logormal distributio if the radom variable Y l( X) has a ormal distributio. The resultig pdf of a logormal radom variable whe l ( X) is ormally distributed with mea µ ad variace σ 2 is Mea is. Media is e µ. Variace is ( e σ2 ) ( e ). 3. Gamma fuctio [Devore] For α > 0, the gamma fuctio Γ( α) is defied by Importat properties are: a. For ay α >, Γ( α) ( α )Γ( α ) b. For ay positive iteger,, Γ( ) ( )! c. e µ + σ2 2 Γ( 2) π Gamma distributio [Devore] A cotiuous radom variable X is said to have a gamma distributio if the pdf o is where α ad β are both greater tha 0. The stadard gamma distributio has β. If α ad β λ, the gamma distributio reduces to a expoetial distributio. Mea is αβ ad variace is αβ 2. The parameter β is called the scale parameter because values other tha either stretch or compress the pdf i the x directio (Devore), ad α is called the shape parameter. 4. Chi-Squared distributio [Devore] -----------------e 2πσx [ l µ ] 2 ( 2σ 2 ) x 0 0 x < 0 2µ + σ2 Γ( α) x α e x dx Let ν be a positive iteger. The a radom variable X is said to have a chi-squared distributio with parameter ν if the pdf o is the gamma desity with α ν 2 ad β 2. The parameter ν is called the umber of degrees of freedom (df) o. The symbol χ 2 is used to represet chisquared. Mea is ν ad variace is 2ν. I is a stadard ormal radom variable, ad Y X 2, the Y has a chi-squared distributio with ν degree of freedom [Devore] 0 ------------------x α β α e x β x 0 Γ( α) 0 otherwise 4

5. Cauchy distributio It is the same as the t-distributio with degree of freedom. Mea ad variace are udefied. Media ad mode are both 0. Desity fuctio: ----------------------,. π( + x 2 < x < ) 6. Uiform distributio A cotiuous radom variable X has a uiform distributio o the iterval [A,B] if the pdf o is Mea is ( B + A) 2 ad variace is ( B A) 2 2. 7. Studet-t distributios The t-distributio has oly oe parameter, the degrees of freedom. Type help(pt) i R to see the PDF of the t-distributio. The mea is 0 whe >, ad the variace is ( 2) for > 2. As 40, the t distributio becomes approximately equal to the ormal distributio. P( t α 2, < T < t α 2, ) α P( T < t α, ) α, the value t α, is called a t critical value. 8. Erlag distributio 9. Hypoexpoetial distributio 0. Hyperexpoetial distributio. Normal (Gaussia) distributio: two parameters: mea ad stadard deviatio. 2. Pareto distributio B ------------ ( A A < x ) < B 0 otherwise αθ ------------ α, x θ x α + (3) where α is the shape parameter ad θ is the scale parameter. Mea value is αθ ( α ). 3. Weibull distributio 5

Relatioships betwee radom variables. Mutually exclusive evets P( A B) P( A) + P( B) ; P( A B) 0. (4) I geeral P( A B) P( A) + P( B) P( A B) (5) 2. Idepedet evets P( A B) P( A)P( B) (6) 3. Law of total probability: Let evets be mutually exclusive ad exhaustive. B i 4. Bayes Theorem (Devore) P( A) P( A B i )P( B i ) i (7) Let A, A 2,, A k be a collectio of k mutually exclusive ad exhaustive evets with prior probability P( A i ), where i, 2,, k. The for ay other evet B for which P( B) > 0, the posteriori probability of give that B has occurred is A j P( A j B) 5. Correlatio of two radom variables P( A j B) P( B A ------------------------ j )P( A j ) ------------------------------------------- P( B) k j, 2,, k P( B A i )P( A i ) i (8) r XY E[ XY] (9) E[ XY] xyp X, Y ( x, y) x S X y S Y for a discrete r.v. (20) I ad Y are idepedet radom variables the because P X, Y ( x, y) P X P Y ( y), E[ XY] E[ X]E[ Y] (see [2, page 07]). Secod momet of a sigle radom variable is E[ X 2 ]. 6. Covariace of two radom variables cov( X, Y) E[ X µ X ][ Y µ Y ] E[ XY] µ X µ Y, where µ X E[ X] ad µ Y E[ Y]. Relate this to variace of a sigle radom variable Var[ X] E[ ( X µ X ) 2 ] E[ X 2 2 ] µ X (2) 6

7. Correlatio coefficiet: ρ XY cov( X, Y) -------------------------------------- var( X)var( Y) (22) Memoryless property: Two distributios, the expoetial cotiuous r.v. distributio ad geometric discrete r.v. distributio ejoy this property. Let X be the lifetime of a compoet. Suppose we have observed that the system has bee operatioal for time t. We would like to kow the probability that it will be operatioal for hours. Say Y X t. We will show that y more P( Y y X > t) P( Y y), i other words, how log it lasts beyod some time t is idepedet of how log the compoet has bee operatioal so far, i.e. t. Expoetial distributio: P( Y y X > t) Geometric distributio: P( Y y X > t) P( X t y X > t) P( X y + t X > t) P( X y + t ad X > t) P( t < X y + t) ---------------------------------------------------- ------------------------------------ P( X > t) P( X > t) ( y + t) t t λe λx dx e λt ( ------------------------------- e λy ) -------------------------------- ( e λy ) P( Y y) λe λx dx e λt (23) (24) Let X be a rv with a geometric distributio ad Y X t. We will show that P( Y y X > t) P( Y y). P( Y y X > t) ( p) y + t ( ( p) t --------------------------------------------------------------------------------- ) ( p) t ( ( p) y ) ( p) t ------------------------------------------------------------ ( p) t P( Y y) Note: Y X t has the same distributio as X because t is a costat. 7

Relatio betwee expoetial distributio (for cotiuous rv) ad geometric distributio (for discrete rv): [2] I is a expoetial radom variable with parameter a, the K X is a geometric radom variable with parameter p e a. Uivariate data statistics terms:. what are sample mea, sample media, sample variace? Quitiles, quartiles, percetiles? R book, page 43: The sample mea of the umeric data set, x, x 2,, x is x x + x 2 + + x --------------------------------------- R book, page 44: The sample media, m, of x, x 2,, x is the middle value of the sorted values. Let the sorted data be deoted by x ( ) x ( 2) x ( ). The m x ( k + ) 2k + (odd) -- ( x 2 ( k) + x ( k + ) ) 2k (eve) R book, page 49: The sample variace ad sample stadard deviatio. For a umeric data set x, x 2,, x, the sample variace is s 2 ----------- x, ad ( i x) 2 i sample stadard deviatio, s is the square root of sample variace. R book, page 50: The p th quatile is at positio + p( ) i the sorted data. If this is ot a iteger, a weighted average is used. For example, as the media is the 0.5 quatile, if is eve, the positio is a fractio, ad hece a average is used as show above i the defiitio of media. The p th quatile splits the dataset so that 00 p % is smaller ad 00 ( p) % is larger. R book, page 50: Percetiles do the same thig as quatiles, except thata scale of 0 to 00 is used istead of 0 to. Devore book: Order the sample observatios from smallest to largest. The the i th smallest observatio i the list is take to be the ---------------------------- 00( i 0.5) th sample percetile. R book, page 50. Quartiles refer to the 0, 25, 50, 75 ad 00 percetiles. Quitiles refer to the 0, 20, 40, 60, 80, 00 percetiles. 8

Radom variables vs. sample values: Devore book: Radom variables are customarily deoted by uppercase letters, such as X ad Y, ear the ed of the alphabet. Lowercase letters are used to represet some particular value of the correspodig radom variable, e.g., X x. Refereces [] K. S. Trivedi, Probability, Statistics with Reliability, Queueig ad Computer Sciece Applicatios, Secod Editio, Wiley, 2002, ISBN 0-47-3334-7. [2] R. Yates ad D. Goodma, Probability ad Stochastic Processes, Wiley, ISBN 0-47-7837-3. [3] Jay L. Devore, Probability ad Statistics for Egieerig ad the Scieces. [4] J Verzai, Usig R for Itroductory Statistics, Chapma & Hall. 9