Intro to probability concepts

Similar documents
Probability concepts. Math 10A. October 33, 2017

Bell-shaped curves, variance

Methods of Mathematics

z-test, t-test Kenneth A. Ribet Math 10A November 28, 2017

Methods of Mathematics

What is a random variable

STAT/SOC/CSSS 221 Statistical Concepts and Methods for the Social Sciences. Random Variables

To find the median, find the 40 th quartile and the 70 th quartile (which are easily found at y=1 and y=2, respectively). Then we interpolate:

The remains of the course

Differential Equations

success and failure independent from one trial to the next?

Math 105 Course Outline

Probability. Kenneth A. Ribet. Math 10A After Election Day, 2016

MA 1125 Lecture 15 - The Standard Normal Distribution. Friday, October 6, Objectives: Introduce the standard normal distribution and table.

Math 416 Lecture 3. The average or mean or expected value of x 1, x 2, x 3,..., x n is

Example A. Define X = number of heads in ten tosses of a coin. What are the values that X may assume?

Calculus with Algebra and Trigonometry II Lecture 21 Probability applications

Chapter 14. From Randomness to Probability. Copyright 2012, 2008, 2005 Pearson Education, Inc.

Probability Theory and Random Variables

Random variables. DS GA 1002 Probability and Statistics for Data Science.

Statistics and Econometrics I

DS-GA 1002 Lecture notes 2 Fall Random variables

Hypothesis testing and the Gamma function

Derivatives and series FTW!

This is the end. Kenneth A. Ribet. Math 10A December 1, 2016

Name: Firas Rassoul-Agha

Methods of Mathematics

Chapter 1: Revie of Calculus and Probability

MAT Mathematics in Today's World

Properties of Continuous Probability Distributions The graph of a continuous probability distribution is a curve. Probability is represented by area

EXAM. Exam #1. Math 3342 Summer II, July 21, 2000 ANSWERS

Continuous Random Variables

Statistical Methods for the Social Sciences, Autumn 2012

Statistical Preliminaries. Stony Brook University CSE545, Fall 2016

Chapter 1 Probability Theory

More on infinite series Antiderivatives and area

Lecture 10: The Normal Distribution. So far all the random variables have been discrete.

Chapter 2: Random Variables

Name: Exam 2 Solutions. March 13, 2017

Probability Density Functions

Lecture 3. Discrete Random Variables

Continuous Distributions

Lecture 2. October 21, Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University.

Probability theory for Networks (Part 1) CS 249B: Science of Networks Week 02: Monday, 02/04/08 Daniel Bilar Wellesley College Spring 2008

MATH 3510: PROBABILITY AND STATS July 1, 2011 FINAL EXAM

ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 2 MATH00040 SEMESTER / Probability

Joint Probability Distributions and Random Samples (Devore Chapter Five)

Random Variables. Saravanan Vijayakumaran Department of Electrical Engineering Indian Institute of Technology Bombay

The Central Limit Theorem

Single Maths B: Introduction to Probability

CS 361: Probability & Statistics

Discrete and continuous

Probability Theory and Simulation Methods

MATH Solutions to Probability Exercises

2.1 Lecture 5: Probability spaces, Interpretation of probabilities, Random variables

STA 111: Probability & Statistical Inference

Math 151. Rumbos Fall Solutions to Review Problems for Exam 2. Pr(X = 1) = ) = Pr(X = 2) = Pr(X = 3) = p X. (k) =

Statistical Concepts. Distributions of Data

Recitation 2: Probability

27 Binary Arithmetic: An Application to Programming

Probability and Estimation. Alan Moses

Lecture 7. Sums of random variables

review session gov 2000 gov 2000 () review session 1 / 38

What does independence look like?

Probability measures A probability measure, P, is a real valued function from the collection of possible events so that the following

Lecture 1: Probability Fundamentals

This exam is closed book and closed notes. (You will have access to a copy of the Table of Common Distributions given in the back of the text.

Brief Review of Probability

Lecture 2: Repetition of probability theory and statistics

DISCRETE RANDOM VARIABLES: PMF s & CDF s [DEVORE 3.2]

Part 3: Parametric Models

Discrete Probability distribution Discrete Probability distribution

Lecture 1. ABC of Probability

Introductory Probability

MATH Notebook 5 Fall 2018/2019

Probability Density Functions and the Normal Distribution. Quantitative Understanding in Biology, 1.2

Probability Theory for Machine Learning. Chris Cremer September 2015

Introduction to Stochastic Processes

Probability Theory. Probability and Statistics for Data Science CSE594 - Spring 2016

Random Variables Example:

M378K In-Class Assignment #1

Linear Models: Comparing Variables. Stony Brook University CSE545, Fall 2017

Lecture 10: Normal RV. Lisa Yan July 18, 2018

Ch. 2: Lec. 1. Basics: Outline. Importance. Usages. Key problems. Three ways of looking... Colbert on Equations. References. Ch. 2: Lec. 1.

Statistical Methods in Particle Physics

Random Variables. Definition: A random variable (r.v.) X on the probability space (Ω, F, P) is a mapping

Probability. Lecture Notes. Adolfo J. Rumbos

Introduction and Overview STAT 421, SP Course Instructor

STAT 430/510 Probability Lecture 7: Random Variable and Expectation

Discrete Mathematics and Probability Theory Fall 2013 Vazirani Note 12. Random Variables: Distribution and Expectation

Sociology 6Z03 Topic 10: Probability (Part I)

STAT 111 Recitation 1

Lecture 9. Expectations of discrete random variables

Review for Exam Spring 2018

Lecture 1/25 Chapter 2

Arkansas Tech University MATH 3513: Applied Statistics I Dr. Marcel B. Finan

Chapter 2. Random Variable. Define single random variables in terms of their PDF and CDF, and calculate moments such as the mean and variance.

Discrete Mathematics and Probability Theory Fall 2012 Vazirani Note 14. Random Variables: Distribution and Expectation

Why should you care?? Intellectual curiosity. Gambling. Mathematically the same as the ESP decision problem we discussed in Week 4.

Random variables, Expectation, Mean and Variance. Slides are adapted from STAT414 course at PennState

Transcription:

October 31, 2017

Serge Lang lecture This year s Serge Lang Undergraduate Lecture will be given by Keith Devlin of our main athletic rival. The title is When the precision of mathematics meets the messiness of the world of people. The lecture will be given at 4:10PM on Thursday, November 2 (this week!) in 60 Evans. Please visit the Facebook event page for the lecture as well.

Breakfast Next breakfast: Tomorrow at 9AM. There are still some places available. Please send me email to sign up.

Dinner Foothill DC dinner, Friday (Nov. 3) at 6:30PM.

The fundamental concepts Probability space Random variable Probability density function (PDF) Cumulative distribution function (CDF)

Flipping a coin ten times We take a generic ( fair ) coin and flip it ten times in a row. We record the result as a binary string, writing 0 for tails and 1 for heads. For example, we might write 01 01 11 00 10; spaces have been inserted for readability.

This slide has complex text Let Ω be the set of all possible outcomes the possible strings that we could write down. Then Ω has 2 2 2 = 2 10 = 1024 elements. Since the coin is fair, each outcome is as likely as every other outcome. The probability of getting any particular outcome is 1/1024. Because Ω is equipped with a probability function, it s a probability space. We say that the probability on Ω is uniform because all outcomes are assigned the same probability.

Random variable A random variable is a function on a probability space. As an example, consider the function X : Ω { 0, 1, 2,..., 9, 10 } that takes each string to the number of 1s in the string. For example, X(01 01 11 00 10) = 5. What is the probability that X = 5? The answer is (# of strings with five 1s)/1024 = 252/1024 0.246. The number 252 is a binomial coefficient; you ll see lots of binomial coefficients in Math 10B.

Probability density function A natural activity is to graph the function i P(X = i): 0.25 0.2 0.15 0.1 0.05 2 4 6 8 10 This function is the PDF attached to X.

What is the probability that X is between 3 and 7? We add the relevant values of the function (i.e., the PDF) on the previous slide. Call the function f. Then the desired probability is f (3) + f (4) + f (5) + f (6) + f (7) = 57/64 0.89. The probability that X lies in a given range is obtained by summing up appropriate values of f.

Cumulative distribution function Consider the function i P(X i): 1 0.8 0.6 0.4 0.2 2 4 6 8 10 This is the CDF attached to X. It might look logistic to you.

I lied, slightly A CDF is supposed to be defined for all real numbers. The CDF in this case looks like this: 1 0.8 0.6 0.4 0.2 2 4 6 8 10 It has the value 0 to the left of 0 and the value 1 to the right of 10.

PDF and CDF 0.25 1 0.2 0.8 0.15 0.6 0.1 0.4 0.05 0.2 2 4 6 8 10 2 4 6 8 10 Nothing new here I m just showing you the CDF and the PDF side by side.

PDF and CDF Let f = the PDF, F = the CDF. Then the PDF totals to 1: f (i) = 1; i the CDF is an increasing function: F(i) F (i + 1); you get from the PDF to the CDF by integrating (summing): F(i) = j i f (j); you get from the CDF to the PDF by differentiating (taking a difference): f (i) = F(i) F(i 1). Notice that the CDF and the PDF determine each other. They are different ways to package the same information.

We ve had a taste of the 10B view of the world.

But we re 10A!

More tosses Here s what happens if we toss the coin 100 times instead of 10 times: 2.5e296 1 2e296 0.8 1.5e296 0.6 1e296 0.4 5e295 0.2 200 400 600 800 1000 20 40 60 80 100 The picture begins to look continuous. More importantly, the probability of any particular number of tosses is approaching 0. The probability of getting exactly 50 heads in 100 tosses is about 0.08. Getting 500 heads in 1000 tosses has probability about 0.025.

Don t let students off the hook. They must face challenging reading head-on. For us 10A folk, the PDF will be a calculus-friendly function f (x) on (, + ) with these properties: it s non-negative ( 0); the total area under y = f (x) is 1: f (x) dx = 1. Examples: f (x) = 1 e x 2, f (x) = 1 π π 1 1 + x 2.

The point of a PDF is that integrals of f give probabilities: P(a X b) = b a f (x) dx. The probability that X has any particular value is negligible (and 0 in the limit), but the probability that X lies in a range can be computed (and is non-zero when we care about it).

A real world example Let Ω be the set of students who have ever taken at least one course at a UC, over the entire history of the University. Effectively, Ω is infinite. Let X : Ω [0, 5] be the function that assigns to each student that student s GPA, expressed as a 10-digit decimal. (I go beyond 4 because of A+ grades.) The probability that X lies in [2, 3] is substantial, but the probability that X has value 2.1234567890 is negligible. The associated PDF f (x) is 0 outside [0, 5]. What might it look like on that interval? This is an interesting thought exercise.

CDFs If X is a random variable, the CDF of X is the function F(t) = P(X t) = Then: F(t) 0 for t ; F(t) 1 for t + ; t F(t) is an increasing function of t; F (t) = f (t) (Fundamental theorem!). f (x) dx.

Example Let 0 if x < 0, f (x) = 2 e x 2 π if x 0. Then f (x) is a PDF that is concentrated on the set of non-negative real numbers. It corresponds to a Gaussian random variable X that takes no negative values. The corresponding CDF is 0 if x < 0, F(x) = x 2 e t2 dt if x 0. π 0

What these look like 1 1 0.8 0.8 0.6 0.6 0.4 0.4 0.2 0.2 0.5 1 1.5 2 2.5 3 0.5 1 1.5 2 2.5 3 On the left side, there s the right-half of a Gaussian curve. On the right side, there s the corresponding CDF F(x). The plots stop at 3. To the left of the y-axis, both functions are identically 0.

A familiar CDF 1 0.8 0.6 y = e t /(1 + e t ) 0.4 0.2-4 -2 2 4 This is the graph of y = F(x) = We ve seen it before. ex, the logistic function. 1 + ex

The corresponding PDF is F (x) = F(x)(1 F(x)). 0.25 0.2 0.15 0.1 0.05-10 -5 5 10 It s clear visually that the graph is symmetric about the vertical axis. Can you check that algebraically?

Histograms as PDFs I projected this image one year ago in. It s from the fake @nytimes article At Chipotle, How Many Calories Do People Really Eat? If you have a meal at Chipotle, what is the probability that you will eat 400 450 calories? 2000 2050 calories? You see that P(X > 2000) = 0.02.

The NY Times provides a CDF To make a true CDF, you need to scale the vertical axis so that the last bar has top at y = 1 rather than at y = 22, 500, 000.

Pareto Fix p > 1 and note that 1 1 x p dx = 1 p 1 ] 1 x p 1 = 1 1 p 1. The function 0 if x < 1, f (x) = p 1 x p if x 1 is then a PDF. The CDF is for t 1. t F(t) = (p 1) 1 1 x p dx = 1 x p 1 ] t 1 = 1 1 t p 1

What these look like 1 0.8 0.8 0.6 0.6 0.4 0.4 0.2 0.2 0 2 4 6 8 10 0 2 4 6 8 10 Here p = 2, so the PDF is the graph of y = 1 and the CDF is x 2 the graph of y = 1 1. Both functions are 0 to the left of x = 1. x

Pareto If X has f as its PDF, then X satisfies the Pareto distribution. Wikipedia: The Pareto distribution, named after the Italian civil engineer, economist, and sociologist Vilfredo Pareto, is a power law probability distribution that is used in description of social, scientific, geophysical, actuarial, and many other types of observable phenomena.