Discrete Mathematics and Probability Theory Fall 2016 Walrand Probability: An Overview
CS 70 Discrete Mathematics and Probability Theory, Fall 2016, Walrand

Probability: An Overview

Probability is a fascinating theory. It provides a precise, clean, and useful model of uncertainty. The successes of Probability Theory in Computer Science are remarkable: data science, machine learning, artificial intelligence, voice and image recognition, and communication theory are based on that theory. The objective of these notes is to introduce the key ideas of Probability Theory on simple examples. Hopefully, this overview will help you see the forest as you explore its different trees in the course.

1 Pick a Marble

Setup

Imagine a bag with 100 marbles that are identical, except for their color. Among those, 10 are blue, 20 are red, 30 are green, and 40 are white. You shake the bag and pick a marble without looking.

Probability

You will certainly agree that the odds that you picked a green marble are 30 out of 100. Similarly, the odds that you picked a blue marble are 10 out of 100. We say that the probability that the marble is green is 30/100 = 0.3. We write Pr[green] = 0.3.

Interpretation

What does this mean precisely? Well, this is not really that obvious. Two interpretations are useful.

The first interpretation is a subjective willingness to bet on the outcome. Imagine the following game of chance. You bet some amount and you get $ if the marble is green. How much are you willing to bet? I would be willing to bet $

The second interpretation is frequentist. It says that if you were to repeat this experiment (shaking the bag with 100 marbles and picking a marble without looking), you would pick a green marble about 30% of the time. Note that this is an interpretation at this point, not a theorem.

Additivity

Consider the event "the marble is blue or green." The odds of that event are 40/100. We write Pr[blue or green] = 0.4. Note that Pr[blue or green] = Pr[blue] + Pr[green]. This is not surprising since the number of marbles that are blue or green is the sum of the number of blue marbles and the number of green marbles.
We say that probability is additive.

Conditional Probability

Assume that the marble you picked is blue or green. What are the odds that it is blue or red? Well, since you picked one of the 40 marbles that are either blue or green, that marble is blue or red only if it is one of the 10 blue marbles. Since 10 out of the 40 blue or green marbles are blue, we see that the odds that you picked a blue or red marble, given that you picked a blue or green marble, are 10/40. We say that the conditional probability of blue or red given blue or green is 10/40. We write Pr[blue or red | blue or green] = 10/40.
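The marble bag is small enough to check additivity and conditional probability by direct counting. The following is a minimal sketch in Python; the names `counts`, `pr`, and `pr_given` are illustrative choices and do not come from the notes.

```python
from fractions import Fraction

# Marble counts from the setup: 100 marbles in total.
counts = {"blue": 10, "red": 20, "green": 30, "white": 40}
total = sum(counts.values())


def pr(event):
    """Probability that the color of the picked marble lies in the set `event`."""
    return Fraction(sum(counts[color] for color in event), total)


def pr_given(event, condition):
    """Conditional probability Pr[event | condition], computed by counting."""
    return pr(event & condition) / pr(condition)


p_green = pr({"green"})
# Additivity: blue and green are disjoint, so their probabilities add up.
additive = pr({"blue", "green"}) == pr({"blue"}) + pr({"green"})
p_conditional = pr_given({"blue", "red"}, {"blue", "green"})

print(p_green, additive, p_conditional)
```

Exact fractions are used so that the comparison in the additivity check is not affected by floating-point rounding.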
Note that

\[ \Pr[\text{blue or red} \mid \text{blue or green}] = \frac{\Pr[(\text{blue or red}) \text{ and } (\text{blue or green})]}{\Pr[\text{blue or green}]} = \frac{\Pr[\text{blue}]}{\Pr[\text{blue or green}]}. \]

Bayes' Rule

Assume that we paint a black dot on half of the blue and half of the red marbles, and also on 20% of the green and 20% of the white marbles. You pick a marble at random and are told that the marble has a black dot. What are the odds that the marble is red?

To answer this question, we note that there are 5 + 10 + 6 + 8 = 29 marbles with a black dot, out of which 10 are red. Thus, the answer is 10/29.

This calculation is an example of Bayes' Rule. The idea is that one specified Pr[black dot | blue] = 0.5 and similarly for the other colors. One also knows Pr[blue] = 0.1, and similarly for the other colors. The calculation determines Pr[red | black dot], which in a sense is the reverse of the specification. Written with probabilities instead of counts, it reads

\[ \Pr[\text{red} \mid \text{black dot}] = \frac{\Pr[\text{black dot} \mid \text{red}]\,\Pr[\text{red}]}{\sum_{c} \Pr[\text{black dot} \mid c]\,\Pr[c]} = \frac{0.5 \times 0.2}{0.5 \times 0.1 + 0.5 \times 0.2 + 0.2 \times 0.3 + 0.2 \times 0.4} = \frac{0.10}{0.29} = \frac{10}{29}, \]

where the sum is over the four colors c. A similar calculation determines the likelihood of a disease (e.g., flu) given a symptom (e.g., fever). Here, the symptom is the black dot and the disease is the color of the marble.

Random Variable

Say that you get $8.00 if you pick a blue marble, $5.00 if it is red, $2.00 if it is green, and $2.00 if it is white. The amount you get is then a function of the color of the marble you picked. This function is fixed. Let us call the function X(·). Thus, X(blue) = 8 and X(white) = 2, and so on. We call X a random variable. Thus, we say that a random variable is a real-valued function of the outcome of a random experiment.

Here, the random experiment is choosing a marble. The outcome is the color of the marble. We have specified all the possible outcomes: blue, red, green, white. Also, we know the probability of each outcome. For instance, Pr[blue] = 0.1. The set of outcomes and their probabilities specifies the random experiment. The function X assigns a real number to each outcome. Note that the values assigned to different outcomes do not have to be different. Here, X(green) = X(white) = 2.

Distribution

Assume that we are interested only in how much you get, not in the details of the experiment that produces that gain.
In that case, we can describe X by saying that X = 8 with probability 0.1 (which is the probability that you pick a blue marble), X = 5 with probability 0.2, and X = 2 with probability 0.7 (the probability that you pick a green or white marble). Thus, the possible values of X are 8, 5, 2 and their probabilities are 0.1, 0.2, 0.7, respectively. These values and their probabilities are called the distribution of the random variable X.

Expectation

Imagine that you repeat the experiment (shake, pick, collect X) a very large number N of times. The frequentist interpretation suggests that the fraction of the times that you collect 8 is 0.1, that you collect 5 is 0.2, and that you collect 2 is 0.7. Thus, you collect 8 about 0.1N times, 5 about 0.2N times, and 2 about 0.7N times. Hence, the total amount you collect over the N experiments is about

\[ 8 \times 0.1N + 5 \times 0.2N + 2 \times 0.7N = (8 \times 0.1 + 5 \times 0.2 + 2 \times 0.7)N. \]

Accordingly, the average amount you collect per experiment is

\[ 8 \times 0.1 + 5 \times 0.2 + 2 \times 0.7. \]

We call this value the expectation of X and we write it as E[X]. We also call E[X] the mean value or the expected value of X. Thus,

\[ E[X] = 8 \times 0.1 + 5 \times 0.2 + 2 \times 0.7 = 3.2. \]

That is, E[X] is the sum of the values of X multiplied by their probabilities.

Function
Would you rather play the game (pick a marble and get X) or get $3.20 without playing the game? The answer depends on a key factor that economists call the utility that you have for money.

To make the situation a bit more dramatic, say that you can either get $1.00 or play a game and get $ with probability 0.01 or $0.00 otherwise. What do you prefer? Many people tend to choose to play the game. In fact, many people play the California Lottery, where the odds of winning $100M are far smaller still.

Let h(x) be the utility that you have for $x. Say that (this is a silly example, but it will illustrate a point) h(8) = 10 and h(5) = h(3.2) = h(2) = 0. For instance, $8.00 buys a ticket to the latest Pokemon movie you crave, and you cannot do anything of comparable value with less than $8.00. Then we find that, after playing the marble game, h(X) = 10 with probability 0.1 and h(X) = 0 with probability 0.9. Hence,

\[ E[h(X)] = 10 \times 0.1 + 0 \times 0.9 = 1. \]

On the other hand, if you don't play the game and get 3.2, then h(3.2) = 0. Thus, you would rather play the game. Similarly, people play the lottery because winning would change their life, presumably for the better, whereas losing $1.00 does not affect their life.

Note that we calculated E[h(X)] by finding the distribution (recall that this means the set of possible values and their probabilities) of h(X). We could have calculated E[h(X)] directly from the distribution of X:

\[ E[h(X)] = h(8)\Pr[X = 8] + h(5)\Pr[X = 5] + h(2)\Pr[X = 2] = 10 \times 0.1 + 0 \times 0.2 + 0 \times 0.7 = 1. \]

This is a simple observation, but it is convenient. In a similar way, we could have computed E[h(X)] by looking at the outcomes of the marble picking game:

\[ E[h(X)] = h(X(\text{blue}))\Pr[\text{blue}] + h(X(\text{red}))\Pr[\text{red}] + h(X(\text{green}))\Pr[\text{green}] + h(X(\text{white}))\Pr[\text{white}] = h(8) \times 0.1 + h(5) \times 0.2 + h(2) \times 0.3 + h(2) \times 0.4. \]

Indeed, these three different ways of calculating E[h(X)] correspond to different ways of summing the possible ways of getting the values of h(X): summing over the values of h(X), or the values of X, or the outcomes.
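The three ways of computing E[h(X)] described above can be compared side by side. This is a small sketch in Python rather than anything from the notes; the helper names (`distribution`, `expectation`, `via_h_values`, and so on) are made up for illustration, and the utility h is stored as a table holding only the values the text specifies.

```python
from collections import defaultdict
from fractions import Fraction

# Outcome probabilities and payoffs from the marble game.
pr = {"blue": Fraction(10, 100), "red": Fraction(20, 100),
      "green": Fraction(30, 100), "white": Fraction(40, 100)}
X = {"blue": 8, "red": 5, "green": 2, "white": 2}
h = {8: 10, 5: 0, 2: 0}  # utility values given in the text


def distribution(values):
    """Distribution {value: probability} of a random variable {outcome: value}."""
    dist = defaultdict(Fraction)
    for outcome, value in values.items():
        dist[value] += pr[outcome]
    return dict(dist)


def expectation(dist):
    """Sum of the values multiplied by their probabilities."""
    return sum(value * p for value, p in dist.items())


dist_X = distribution(X)
mean_X = expectation(dist_X)

# 1) Sum over the values of h(X), using the distribution of h(X).
h_of_X = {outcome: h[x] for outcome, x in X.items()}
via_h_values = expectation(distribution(h_of_X))
# 2) Sum over the values of X, using the distribution of X.
via_x_values = sum(h[x] * p for x, p in dist_X.items())
# 3) Sum over the outcomes of the experiment.
via_outcomes = sum(h[X[o]] * pr[o] for o in pr)

print(dist_X, mean_X, via_h_values, via_x_values, via_outcomes)
```

All three sums agree because each one merely groups the same outcome-level terms differently.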
Variance

We saw that one can describe a random variable X by its distribution. A summary of that distribution is the mean value E[X]. However, our discussion of the utility shows that this description is a bit crude and may not suffice to decide whether to play a game of chance. For instance, the expected gain of playing the lottery is negative. You would not play a game where you are certain to lose.

The mean value does not say anything about the uncertainty of X, i.e., its variability. Here, by variability we mean that if we play the game many times, we observe a variety of values of X. The variance is a one-number summary of variability. The variance of X is defined by

\[ \mathrm{var}[X] = E[(X - E[X])^2]. \]

The intuition is that if X is almost always close to E[X], then the variance is small; otherwise, it is large. In our marble example, E[X] = 3.2. Since X = 8, 5, or 2 with probability 0.1, 0.2, 0.7, respectively, we see that

\[ \mathrm{var}[X] = E[(X - E[X])^2] = (8 - 3.2)^2 \Pr[X = 8] + (5 - 3.2)^2 \Pr[X = 5] + (2 - 3.2)^2 \Pr[X = 2] \]
\[ = (8 - 3.2)^2 \times 0.1 + (5 - 3.2)^2 \times 0.2 + (2 - 3.2)^2 \times 0.7 = 2.304 + 0.648 + 1.008 = 3.96. \]

The square root of the variance is called the standard deviation and we denote it by \(\sigma_X\). Here, \(\sigma_X = \sqrt{3.96} \approx 1.99\).
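As a quick numerical check of the variance calculation, here is a short Python sketch. The function names are illustrative; the second formula, E[X²] − E[X]², is a standard equivalent expression that the notes do not state at this point and is included only as a cross-check.

```python
import math

# Distribution of X from the marble example: value -> probability.
dist_X = {8: 0.1, 5: 0.2, 2: 0.7}


def expectation(dist):
    """E[X]: sum of the values multiplied by their probabilities."""
    return sum(x * p for x, p in dist.items())


def variance(dist):
    """var[X] = E[(X - E[X])^2], computed from the definition."""
    mean = expectation(dist)
    return sum((x - mean) ** 2 * p for x, p in dist.items())


mean_X = expectation(dist_X)
var_X = variance(dist_X)
sigma_X = math.sqrt(var_X)

# Cross-check with the equivalent formula E[X^2] - E[X]^2.
var_X_alt = sum(x ** 2 * p for x, p in dist_X.items()) - mean_X ** 2

print(round(mean_X, 4), round(var_X, 4), round(sigma_X, 4))
```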
Figure 1: Linear Regression of Y over X (brown)

Figure 2: Quadratic Regression of Y over X (purple)

Linear Regression

Consider once again our bag of marbles. Define another random variable Y by Y(blue) = 1, Y(red) = 1, Y(green) = 3 and Y(white) = 4. Thus, each outcome (i.e., color) is assigned two numbers: X and Y. In another context, each person is associated with a height and a weight. Say that you want to guess the weight of a person from his/her height. How do you do it?

Here, we want to guess Y from the value of X, and a picture helps. Figure 1 shows the values of X and Y associated with the four possible outcomes. For instance, the blue outcome is associated with X(blue) = 8 and Y(blue) = 1. The figure also shows the probability of the different outcomes.

We want a simple formula to provide a guess of Y based on X. In fact, we want a formula of the form

\[ \hat{Y} = a + bX. \]

Here, \(\hat{Y}\) is our guess for Y based on the value of X. Also, a and b are some constants. This formula corresponds to the line shown in the figure. We choose a and b so that the guess \(\hat{Y}\) tends to be close to Y. This means that the line should be close to the actual points (X, Y) in the figure. Thus, \(\hat{Y} - Y\) should be small. We make this precise by requiring that \(E[(\hat{Y} - Y)^2]\) be as small as possible. That is, we choose a and b to minimize

\[ E[(\hat{Y} - Y)^2] = E[(a + bX - Y)^2]. \]

We explain in the lectures that the best choice of a and b is such that

\[ \hat{Y} = E[Y] + \frac{\mathrm{cov}(X, Y)}{\mathrm{var}[X]} (X - E[X]), \]

where \(\mathrm{cov}(X, Y) = E[XY] - E[X]E[Y]\).

Quadratic Regression

In the previous section, we estimated Y by using a linear function a + bX of X, as shown in Figure 1. Figure 2 suggests that a quadratic estimate \(c + dX + eX^2\) is better than a linear estimate, i.e., that it is closer to the pairs (X, Y). In the lectures, we explain how to find the best values of c, d, e.
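The linear regression formula above can be evaluated directly for the marble example. The sketch below is in Python with illustrative names (`E`, `mse`, `a`, `b`). It assumes the Y values exactly as they appear in this transcription; a sign may have been lost in extraction, so the numbers it produces should be read with that caveat.

```python
# Outcome probabilities and the two random variables X and Y.
pr = {"blue": 0.1, "red": 0.2, "green": 0.3, "white": 0.4}
X = {"blue": 8, "red": 5, "green": 2, "white": 2}
Y = {"blue": 1, "red": 1, "green": 3, "white": 4}  # as transcribed (assumption)


def E(f):
    """Expectation of f(outcome), summing over the four outcomes."""
    return sum(f(o) * pr[o] for o in pr)


mean_x = E(lambda o: X[o])
mean_y = E(lambda o: Y[o])
var_x = E(lambda o: (X[o] - mean_x) ** 2)
cov_xy = E(lambda o: X[o] * Y[o]) - mean_x * mean_y

# Y_hat = E[Y] + cov(X, Y) / var[X] * (X - E[X]) = a + b X
b = cov_xy / var_x
a = mean_y - b * mean_x


def mse(a_, b_):
    """Mean squared error E[(a_ + b_ X - Y)^2] of a candidate line."""
    return E(lambda o: (a_ + b_ * X[o] - Y[o]) ** 2)


print(round(a, 4), round(b, 4), round(mse(a, b), 4))
```

Perturbing a or b in either direction should not lower `mse`, which is one simple way to convince yourself that the formula picks the minimizing line.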
Figure 3: Conditional Expectation of Y given X (green)

Conditional Expectation

What if we could choose any function of X instead of being limited to linear or quadratic functions? Figure 3 shows the best possible function g(X) of X to estimate Y. We explain in the lectures how to calculate that function, called the conditional expectation of Y given X.

2 Flip Coins

So far, we looked at one or two random variables. In this section, we explore many random variables.

Setup

You have a coin. When you flip it, there are two possible outcomes: heads (H) and tails (T). Let p = Pr[H], so that Pr[T] = 1 − p. For instance, the coin could be biased with p = 0.6, so that heads is more likely than tails.

Independence

Say that you flip the coin twice. There are four possible outcomes for this experiment: HH, HT, TH, and TT. Here, HT means that the first flip produces H and the second T, and similarly for the other outcomes. If we recall the definition of conditional probability, we have

\[ \Pr[\text{first flip yields } H \mid \text{second flip yields } H] = \frac{\Pr[(\text{first flip yields } H) \text{ and } (\text{second flip yields } H)]}{\Pr[\text{second flip yields } H]} = \frac{\Pr[HH]}{p}. \]

In the last step, we used the fact that the probability that the second flip yields H is p. Now, it is reasonable to assume that the likelihood that the first flip yields H does not depend on the fact that the second flip yields H and that this likelihood is then p. Hence, we are led to the conclusion that p = Pr[HH]/p, so that \(\Pr[HH] = p^2\). This assumption is called the independence of the coin flips. Similar reasoning leads to the conclusion that

\[ \Pr[HT] = p(1 - p), \quad \Pr[TH] = (1 - p)p, \quad \Pr[TT] = (1 - p)^2. \]

Let X = 1 when the first flip is H and X = 0 when it is T. Also, let Y = 1 when the second flip is H and Y = 0 when it is T. Then we see that Pr[X = 1] = Pr[Y = 1] = p and Pr[X = 1, Y = 1] = Pr[X = 1] Pr[Y = 1]. Also, Pr[X = 1, Y = 0] = Pr[X = 1] Pr[Y = 0]. More generally,

\[ \Pr[X = a, Y = b] = \Pr[X = a]\Pr[Y = b] \quad \text{for all } a, b. \]

Two random variables with that property are said to be independent.
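The product property that defines independence can be tested mechanically on any joint distribution. The following Python sketch uses illustrative names (`marginals`, `are_independent`); the second joint distribution, in which both variables read the same flip, is a hypothetical counterexample added here and is not part of the notes.

```python
from collections import defaultdict
from fractions import Fraction


def marginals(joint):
    """Marginal distributions of X and Y from a joint pmf {(a, b): probability}."""
    px, py = defaultdict(Fraction), defaultdict(Fraction)
    for (a, b), q in joint.items():
        px[a] += q
        py[b] += q
    return dict(px), dict(py)


def are_independent(joint):
    """True if Pr[X = a, Y = b] = Pr[X = a] Pr[Y = b] for all a, b."""
    px, py = marginals(joint)
    return all(joint.get((a, b), 0) == px[a] * py[b] for a in px for b in py)


p = Fraction(6, 10)  # biased coin from the setup, p = 0.6

# Two flips with Pr[HH] = p^2, Pr[HT] = p(1-p), Pr[TH] = (1-p)p, Pr[TT] = (1-p)^2.
two_flips = {(1, 1): p * p, (1, 0): p * (1 - p),
             (0, 1): (1 - p) * p, (0, 0): (1 - p) * (1 - p)}

# Hypothetical counterexample: X and Y both report the same single flip.
same_flip = {(1, 1): p, (0, 0): 1 - p}

print(are_independent(two_flips), are_independent(same_flip))
```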
Variance of Sum

Let X and Y be independent random variables. We show in the lectures that

\[ \mathrm{var}[X + Y] = \mathrm{var}[X] + \mathrm{var}[Y]. \]

More generally, if \(X_1, \ldots, X_n\) are random variables such that any two of them are independent, then

\[ \mathrm{var}[X_1 + \cdots + X_n] = \mathrm{var}[X_1] + \cdots + \mathrm{var}[X_n]. \]

Moreover, we will see that \(\mathrm{var}[aX] = a^2\,\mathrm{var}[X]\) for any random variable X and any constant a. Consequently, we see that

\[ \mathrm{var}\left[\frac{X_1 + \cdots + X_n}{n}\right] = \frac{\mathrm{var}[X_1] + \cdots + \mathrm{var}[X_n]}{n^2}. \]

In particular, if \(\mathrm{var}[X_m] = \sigma^2\) for m = 1, ..., n, we have

\[ \mathrm{var}\left[\frac{X_1 + \cdots + X_n}{n}\right] = \frac{n\sigma^2}{n^2} = \frac{\sigma^2}{n}. \]

Chebyshev's Inequality

Flip a coin n times and let \(X_m = 1\) if flip m yields H and \(X_m = 0\) otherwise. Then

\[ \mathrm{var}[X_m] = E[(X_m - E[X_m])^2] = E[(X_m - p)^2] = (1 - p)^2 \Pr[X_m = 1] + (0 - p)^2 \Pr[X_m = 0] = (1 - p)^2 p + p^2 (1 - p) = p(1 - p). \]

Accordingly, in view of the previous section,

\[ \mathrm{var}\left[\frac{X_1 + \cdots + X_n}{n}\right] = \frac{p(1 - p)}{n}. \]

Thus, when n is large, the variance of \(A_n := (X_1 + \cdots + X_n)/n\) is very small. This suggests that the random variable \(A_n\) tends to be very close to its mean value, which happens to be p. Thus, we expect the fraction of heads \(A_n\) in n coin flips to be close to p.

To make this idea precise, Chebyshev developed an inequality which says that

\[ \Pr[|X - E[X]| \ge \varepsilon] \le \frac{\mathrm{var}[X]}{\varepsilon^2}. \]

We prove this inequality in the lectures. Thus, the likelihood that a random variable X differs from its mean E[X] by at least \(\varepsilon\) is small if var[X] is small. If we apply this inequality to \(A_n\), we find that

\[ \Pr[|A_n - p| \ge \varepsilon] \le \frac{p(1 - p)}{n\varepsilon^2}. \]

Note that \(p(1 - p) \le 1/4\) for any value of p. Consequently, we see that

\[ \Pr[|A_n - p| \ge \varepsilon] \le \frac{1}{4n\varepsilon^2}. \]

Confidence Interval

Say that you do not know the value of p = Pr[H]. To estimate it, you flip the coin n times and note the fraction \(A_n\) of heads. Then the last inequality holds. Let us choose \(\varepsilon\) so that the right-hand side of the inequality
is 0.05 = 1/20. That is, we choose \(\varepsilon\) so that \(4n\varepsilon^2 = 20\), i.e., \(\varepsilon^2 = 5/n\) or \(\varepsilon = \sqrt{5/n} \approx 2.24/\sqrt{n}\), which we round up to \(2.25/\sqrt{n}\). Hence, the previous inequality with that value of \(\varepsilon\) implies that

\[ \Pr\left[|A_n - p| \ge \frac{2.25}{\sqrt{n}}\right] \le 0.05, \quad \text{so that} \quad \Pr\left[|A_n - p| \le \frac{2.25}{\sqrt{n}}\right] \ge 95\%. \]

Now, since \(|A_n - p| \le \delta\) if and only if \(p \in [A_n - \delta, A_n + \delta]\), we conclude that

\[ \Pr\left[p \in \left[A_n - \frac{2.25}{\sqrt{n}},\; A_n + \frac{2.25}{\sqrt{n}}\right]\right] \ge 95\%. \]

For instance, say that \(n = 10^4\) and \(A_n = 0.31\). We then conclude that

\[ \Pr\left[p \in \left[0.31 - \frac{2.25}{100},\; 0.31 + \frac{2.25}{100}\right]\right] \ge 95\%, \quad \text{so that} \quad \Pr[p \in [0.2875, 0.3325]] \ge 95\%. \]

We say that [0.2875, 0.3325] is a 95%-confidence interval for p. As you can see, the width of the confidence interval decreases like \(1/\sqrt{n}\). This example is the basis for the estimates in public opinion surveys.

Time until first H

We flip the coin until we get the first H. How many times do we need to flip the coin, on average? Let \(\beta\) be that average number of flips. That number of flips is 1 if the first flip is H, which occurs with probability p. If the first flip is T, which occurs with probability 1 − p, then the process starts afresh and we need to flip the coin \(\beta\) more times, on average. Thus,

\[ \beta = p \times 1 + (1 - p)(1 + \beta). \]

Solving, we find \(\beta = 1/p\).

Time until two consecutive Hs

We flip the coin until we get two consecutive Hs. How many times do we need to flip the coin, on average? Let \(\beta\) be that average number of flips. Let also \(\beta(H)\) be the average number of additional flips until two consecutive Hs, given that the last flip is H. Then we claim that

\[ \beta = p(1 + \beta(H)) + (1 - p)(1 + \beta), \]
\[ \beta(H) = p \times 1 + (1 - p)(1 + \beta). \]

The first identity can be seen by noting that if the first flip is H, then after that first flip one needs \(\beta(H)\) additional flips, on average, since the last flip was H. However, if the first flip is T, then after that first flip one needs \(\beta\) additional flips, on average. The second identity can be justified similarly. Solving, one finds \(\beta = 1/p + 1/p^2\).
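The confidence interval and the two waiting-time formulas from this section can be checked numerically. This is a rough Python sketch under stated assumptions: the function names, the fixed seed, and the number of trials are all illustrative choices, and the simulation only approximates the averages that the formulas give exactly.

```python
import math
import random


def chebyshev_interval(a_n, n, constant=2.25):
    """95% interval [A_n - 2.25/sqrt(n), A_n + 2.25/sqrt(n)] from Chebyshev's bound."""
    delta = constant / math.sqrt(n)
    return a_n - delta, a_n + delta


def mean_flips_first_head(p):
    """Solution of beta = p * 1 + (1 - p) * (1 + beta)."""
    return 1 / p


def mean_flips_two_heads(p):
    """Solution of the two equations for beta and beta(H)."""
    return 1 / p + 1 / p ** 2


def simulate_mean_flips(p, run_length, trials, seed=0):
    """Average number of flips until `run_length` consecutive heads appear."""
    rng = random.Random(seed)  # fixed seed so that the run is reproducible
    total = 0
    for _ in range(trials):
        streak = flips = 0
        while streak < run_length:
            flips += 1
            streak = streak + 1 if rng.random() < p else 0
        total += flips
    return total / trials


low, high = chebyshev_interval(0.31, 10 ** 4)
sim_one = simulate_mean_flips(0.5, 1, 20000)
sim_two = simulate_mean_flips(0.5, 2, 20000)

print(round(low, 4), round(high, 4))
print(mean_flips_first_head(0.5), round(sim_one, 2))
print(mean_flips_two_heads(0.5), round(sim_two, 2))
```

For a fair coin the formulas give 2 flips until the first H and 6 flips until two consecutive Hs; the simulated averages should land close to those values, with the gap shrinking as `trials` grows.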
Basics of Probability Theory (for Theory of Computatio courses) Oded Goldreich Departmet of Computer Sciece Weizma Istitute of Sciece Rehovot, Israel. oded.goldreich@weizma.ac.il November 24, 2008 Preface.
More informationDiscrete Mathematics for CS Spring 2005 Clancy/Wagner Notes 21. Some Important Distributions
CS 70 Discrete Mathematics for CS Sprig 2005 Clacy/Wager Notes 21 Some Importat Distributios Questio: A biased coi with Heads probability p is tossed repeatedly util the first Head appears. What is the
More informationLinear Regression Demystified
Liear Regressio Demystified Liear regressio is a importat subject i statistics. I elemetary statistics courses, formulae related to liear regressio are ofte stated without derivatio. This ote iteds to
More informationData Analysis and Statistical Methods Statistics 651
Data Aalysis ad Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasii/teachig.html Suhasii Subba Rao Review of testig: Example The admistrator of a ursig home wats to do a time ad motio
More informationEstimation for Complete Data
Estimatio for Complete Data complete data: there is o loss of iformatio durig study. complete idividual complete data= grouped data A complete idividual data is the oe i which the complete iformatio of
More informationTopic 8: Expected Values
Topic 8: Jue 6, 20 The simplest summary of quatitative data is the sample mea. Give a radom variable, the correspodig cocept is called the distributioal mea, the epectatio or the epected value. We begi
More informationf X (12) = Pr(X = 12) = Pr({(6, 6)}) = 1/36
Probability Distributios A Example With Dice If X is a radom variable o sample space S, the the probability that X takes o the value c is Similarly, Pr(X = c) = Pr({s S X(s) = c}) Pr(X c) = Pr({s S X(s)
More informationUNIT 2 DIFFERENT APPROACHES TO PROBABILITY THEORY
UNIT 2 DIFFERENT APPROACHES TO PROBABILITY THEORY Structure 2.1 Itroductio Objectives 2.2 Relative Frequecy Approach ad Statistical Probability 2. Problems Based o Relative Frequecy 2.4 Subjective Approach
More informationDiscrete Mathematics and Probability Theory Summer 2014 James Cook Note 15
CS 70 Discrete Mathematics ad Probability Theory Summer 2014 James Cook Note 15 Some Importat Distributios I this ote we will itroduce three importat probability distributios that are widely used to model
More informationMATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4
MATH 30: Probability ad Statistics 9. Estimatio ad Testig of Parameters Estimatio ad Testig of Parameters We have bee dealig situatios i which we have full kowledge of the distributio of a radom variable.
More informationTopic 9: Sampling Distributions of Estimators
Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be
More informationLinear regression. Daniel Hsu (COMS 4771) (y i x T i β)2 2πσ. 2 2σ 2. 1 n. (x T i β y i ) 2. 1 ˆβ arg min. β R n d
Liear regressio Daiel Hsu (COMS 477) Maximum likelihood estimatio Oe of the simplest liear regressio models is the followig: (X, Y ),..., (X, Y ), (X, Y ) are iid radom pairs takig values i R d R, ad Y
More informationRecall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y.
Testig Statistical Hypotheses Recall the study where we estimated the differece betwee mea systolic blood pressure levels of users of oral cotraceptives ad o-users, x - y. Such studies are sometimes viewed
More information1 Review of Probability & Statistics
1 Review of Probability & Statistics a. I a group of 000 people, it has bee reported that there are: 61 smokers 670 over 5 960 people who imbibe (drik alcohol) 86 smokers who imbibe 90 imbibers over 5
More informationQuick Review of Probability
Quick Review of Probability Berli Che Departmet of Computer Sciece & Iformatio Egieerig Natioal Taiwa Normal Uiversity Refereces: 1. W. Navidi. Statistics for Egieerig ad Scietists. Chapter & Teachig Material.
More informationNO! This is not evidence in favor of ESP. We are rejecting the (null) hypothesis that the results are
Hypothesis Testig Suppose you are ivestigatig extra sesory perceptio (ESP) You give someoe a test where they guess the color of card 100 times They are correct 90 times For guessig at radom you would expect
More informationJanuary 25, 2017 INTRODUCTION TO MATHEMATICAL STATISTICS
Jauary 25, 207 INTRODUCTION TO MATHEMATICAL STATISTICS Abstract. A basic itroductio to statistics assumig kowledge of probability theory.. Probability I a typical udergraduate problem i probability, we
More informationLecture 2: Monte Carlo Simulation
STAT/Q SCI 43: Itroductio to Resamplig ethods Sprig 27 Istructor: Ye-Chi Che Lecture 2: ote Carlo Simulatio 2 ote Carlo Itegratio Assume we wat to evaluate the followig itegratio: e x3 dx What ca we do?
More informationThe Random Walk For Dummies
The Radom Walk For Dummies Richard A Mote Abstract We look at the priciples goverig the oe-dimesioal discrete radom walk First we review five basic cocepts of probability theory The we cosider the Beroulli
More informationMath 525: Lecture 5. January 18, 2018
Math 525: Lecture 5 Jauary 18, 2018 1 Series (review) Defiitio 1.1. A sequece (a ) R coverges to a poit L R (writte a L or lim a = L) if for each ǫ > 0, we ca fid N such that a L < ǫ for all N. If the
More informationSequences and Series of Functions
Chapter 6 Sequeces ad Series of Fuctios 6.1. Covergece of a Sequece of Fuctios Poitwise Covergece. Defiitio 6.1. Let, for each N, fuctio f : A R be defied. If, for each x A, the sequece (f (x)) coverges
More informationLecture 12: September 27
36-705: Itermediate Statistics Fall 207 Lecturer: Siva Balakrisha Lecture 2: September 27 Today we will discuss sufficiecy i more detail ad the begi to discuss some geeral strategies for costructig estimators.
More informationLECTURE 8: ASYMPTOTICS I
LECTURE 8: ASYMPTOTICS I We are iterested i the properties of estimators as. Cosider a sequece of radom variables {, X 1}. N. M. Kiefer, Corell Uiversity, Ecoomics 60 1 Defiitio: (Weak covergece) A sequece
More information11 Correlation and Regression
11 Correlatio Regressio 11.1 Multivariate Data Ofte we look at data where several variables are recorded for the same idividuals or samplig uits. For example, at a coastal weather statio, we might record
More informationA sequence of numbers is a function whose domain is the positive integers. We can see that the sequence
Sequeces A sequece of umbers is a fuctio whose domai is the positive itegers. We ca see that the sequece,, 2, 2, 3, 3,... is a fuctio from the positive itegers whe we write the first sequece elemet as
More informationChapter 5. Inequalities. 5.1 The Markov and Chebyshev inequalities
Chapter 5 Iequalities 5.1 The Markov ad Chebyshev iequalities As you have probably see o today s frot page: every perso i the upper teth percetile ears at least 1 times more tha the average salary. I other
More informationMath 10A final exam, December 16, 2016
Please put away all books, calculators, cell phoes ad other devices. You may cosult a sigle two-sided sheet of otes. Please write carefully ad clearly, USING WORDS (ot just symbols). Remember that the
More informationBig Picture. 5. Data, Estimates, and Models: quantifying the accuracy of estimates.
5. Data, Estimates, ad Models: quatifyig the accuracy of estimates. 5. Estimatig a Normal Mea 5.2 The Distributio of the Normal Sample Mea 5.3 Normal data, cofidece iterval for, kow 5.4 Normal data, cofidece
More informationLecture 11 and 12: Basic estimation theory
Lecture ad 2: Basic estimatio theory Sprig 202 - EE 94 Networked estimatio ad cotrol Prof. Kha March 2 202 I. MAXIMUM-LIKELIHOOD ESTIMATORS The maximum likelihood priciple is deceptively simple. Louis
More informationResampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.
Jauary 1, 2019 Resamplig Methods Motivatio We have so may estimators with the property θ θ d N 0, σ 2 We ca also write θ a N θ, σ 2 /, where a meas approximately distributed as Oce we have a cosistet estimator
More informationAxis Aligned Ellipsoid
Machie Learig for Data Sciece CS 4786) Lecture 6,7 & 8: Ellipsoidal Clusterig, Gaussia Mixture Models ad Geeral Mixture Models The text i black outlies high level ideas. The text i blue provides simple
More informationAMS570 Lecture Notes #2
AMS570 Lecture Notes # Review of Probability (cotiued) Probability distributios. () Biomial distributio Biomial Experimet: ) It cosists of trials ) Each trial results i of possible outcomes, S or F 3)
More informationCSE 527, Additional notes on MLE & EM
CSE 57 Lecture Notes: MLE & EM CSE 57, Additioal otes o MLE & EM Based o earlier otes by C. Grat & M. Narasimha Itroductio Last lecture we bega a examiatio of model based clusterig. This lecture will be
More informationIntroduction to Probability. Ariel Yadin
Itroductio to robability Ariel Yadi Lecture 2 *** Ja. 7 ***. Covergece of Radom Variables As i the case of sequeces of umbers, we would like to talk about covergece of radom variables. There are may ways
More informationECE 901 Lecture 12: Complexity Regularization and the Squared Loss
ECE 90 Lecture : Complexity Regularizatio ad the Squared Loss R. Nowak 5/7/009 I the previous lectures we made use of the Cheroff/Hoeffdig bouds for our aalysis of classifier errors. Hoeffdig s iequality
More informationVariance of Discrete Random Variables Class 5, Jeremy Orloff and Jonathan Bloom
Variace of Discrete Radom Variables Class 5, 18.05 Jeremy Orloff ad Joatha Bloom 1 Learig Goals 1. Be able to compute the variace ad stadard deviatio of a radom variable.. Uderstad that stadard deviatio
More information