Statistics for Engineers Lecture 1 Introduction to Probability Theory

Similar documents
STAT 509 STATISTICS FOR ENGINEERS

STAT509: Probability

Statistical Theory 1

Lecture 3: Probability

STAT Chapter 3: Probability

Conditional probability

2.4. Conditional Probability

Mean, Median and Mode. Lecture 3 - Axioms of Probability. Where do they come from? Graphically. We start with a set of 21 numbers, Sta102 / BME102

Announcements. Topics: To Do:

Probabilistic models

Basic Statistics and Probability Chapter 3: Probability

Probability Review I

Lecture 1. Chapter 1. (Part I) Material Covered in This Lecture: Chapter 1, Chapter 2 ( ). 1. What is Statistics?

Probability and Probability Distributions. Dr. Mohammed Alahmed

STAT:5100 (22S:193) Statistical Inference I

ELEG 3143 Probability & Stochastic Process Ch. 1 Experiments, Models, and Probabilities

Lecture 3 - Axioms of Probability

Probability: Why do we care? Lecture 2: Probability and Distributions. Classical Definition. What is Probability?

Econ 325: Introduction to Empirical Economics

4. Conditional Probability

Probabilistic models

Chapter 2 Class Notes

AP Statistics Ch 6 Probability: The Study of Randomness

STT When trying to evaluate the likelihood of random events we are using following wording.

Lecture 2. Conditional Probability

Probability and Conditional Probability

Single Maths B: Introduction to Probability


Probability. Chapter 1 Probability. A Simple Example. Sample Space and Probability. Sample Space and Event. Sample Space (Two Dice) Probability

BASICS OF PROBABILITY CHAPTER-1 CS6015-LINEAR ALGEBRA AND RANDOM PROCESSES

ACM 116: Lecture 1. Agenda. Philosophy of the Course. Definition of probabilities. Equally likely outcomes. Elements of combinatorics

STAT 516: Basic Probability and its Applications

Statistical Inference

2.3 Conditional Probability

Topic -2. Probability. Larson & Farber, Elementary Statistics: Picturing the World, 3e 1

2. AXIOMATIC PROBABILITY

CINQA Workshop Probability Math 105 Silvia Heubach Department of Mathematics, CSULA Thursday, September 6, 2012

Lecture 2: Probability and Distributions

Chapter 6. Probability

Lecture 4. Selected material from: Ch. 6 Probability

Math 3338: Probability (Fall 2006)

Basic Concepts of Probability. Section 3.1 Basic Concepts of Probability. Probability Experiments. Chapter 3 Probability

Dept. of Linguistics, Indiana University Fall 2015

Making Hard Decision. Probability Basics. ENCE 627 Decision Analysis for Engineering

tossing a coin selecting a card from a deck measuring the commuting time on a particular morning

Chapter 1 (Basic Probability)

Section 4.2 Basic Concepts of Probability

1 INFO 2950, 2 4 Feb 10

Multiple Choice Practice Set 1

Notes for Math 324, Part 12

Homework (due Wed, Oct 27) Chapter 7: #17, 27, 28 Announcements: Midterm exams keys on web. (For a few hours the answer to MC#1 was incorrect on

Discrete Mathematics and Probability Theory Spring 2016 Rao and Walrand Note 14

4. Probability of an event A for equally likely outcomes:

Part (A): Review of Probability [Statistics I revision]

1 Probability Theory. 1.1 Introduction

Week 2: Probability: Counting, Sets, and Bayes

Probability deals with modeling of random phenomena (phenomena or experiments whose outcomes may vary)

Statistics for Business and Economics

Probability COMP 245 STATISTICS. Dr N A Heard. 1 Sample Spaces and Events Sample Spaces Events Combinations of Events...

With Question/Answer Animations. Chapter 7

Business Statistics MBA Pokhara University

F71SM STATISTICAL METHODS

STAT 201 Chapter 5. Probability

CS626 Data Analysis and Simulation

Conditional Probability & Independence. Conditional Probabilities

Lecture 01: Introduction

18.600: Lecture 3 What is probability?

DEPARTMENT OF QUANTITATIVE METHODS & INFORMATION SYSTEMS QM 120. Chapter 4: Experiment, outcomes, and sample space

3 Conditional Probability

STAT 509: Statistics for Engineers Dr. Dewei Wang. Copyright 2014 John Wiley & Sons, Inc. All rights reserved.

Mathematics Standards for High School Financial Algebra A and Financial Algebra B

Outline. Probability. Math 143. Department of Mathematics and Statistics Calvin College. Spring 2010

Review. More Review. Things to know about Probability: Let Ω be the sample space for a probability measure P.

Sixth Edition. Chapter 2 Probability. Copyright 2014 John Wiley & Sons, Inc. All rights reserved. Probability

Probability and Statistics Notes

Lecture notes for probability. Math 124

Introduction to Probability

Topic 2 Probability. Basic probability Conditional probability and independence Bayes rule Basic reliability

Preliminary Statistics Lecture 2: Probability Theory (Outline) prelimsoas.webs.com

Information Science 2

Probability and Sample space

Introduction to Probability. Experiments. Sample Space. Event. Basic Requirements for Assigning Probabilities. Experiments

Introduction and basic definitions

Lecture 2: Probability, conditional probability, and independence

Recitation 2: Probability

Introduction to Probability Theory

HW MATH425/525 Lecture Notes 1

2011 Pearson Education, Inc

High School Modeling Standards

CIVL 7012/8012. Basic Laws and Axioms of Probability

Probability Notes (A) , Fall 2010

Probability 1 (MATH 11300) lecture slides

LECTURE 1. 1 Introduction. 1.1 Sample spaces and events

ELEG 3143 Probability & Stochastic Process Ch. 1 Probability

Origins of Probability Theory

3.2 Probability Rules

Basic notions of probability theory

Lecture 3 Probability Basics

Chapter 4. Probability

1. Let A and B be two events such that P(A)=0.6 and P(B)=0.6. Which of the following MUST be true?

Transcription:

Statistics for Engineers Lecture 1 Introduction to Probability Theory Chong Ma Department of Statistics University of South Carolina chongm@email.sc.edu January 9, 2017 Chong Ma (Statistics, USC) STAT 509 Spring 2017 January 9, 2017 1 / 32

Outline 1 Introduction Deterministic Model Statistical Model 2 Probability Sample Spaces and Events Unions and Intersections Axioms of Probability Conditional Probability Probability Rules Random Variables Chong Ma (Statistics, USC) STAT 509 Spring 2017 January 9, 2017 2 / 32

What is statistics? The art and science of learning from data,i.e., the study of a collection, analysis, interpretation and organization of data. The ultimate goal is to translate data into knowledge and understanding the world around us. Partly empirical and partly mathematical involving probability theory, measure theory and other related mathematics. Nowadays statistical is more computational. Popular statistical softwares: R, SAS, Python, Julia, Perl... Broad application: machine learning(google DeepMind), Biomedical, genetics, econometrics, statistical physics, chemistry,... Chong Ma (Statistics, USC) STAT 509 Spring 2017 January 9, 2017 3 / 32

Examples 1 In a reliability(time to event) study, engineers are interested in describing the time until failure for a jet engine fan blade. 2 In a genetics study involving patients with Prostate cancer, researchers wish to identify genes that are differently expressed compared to non-prostate cancer patients. 3 In an agricultural experiment, researchers want to know which of four fertilizers varying in their nitrogen contents produces the highest corn yield. 4 In a clinical trial, physicians want to determine which of two drugs is more effective for treating HIV in the early stages of the disease. 5 In a public health study involving at-risk teenagers, epidemiologists want to know whether smoking is more common in a particular demographic class. Chong Ma (Statistics, USC) STAT 509 Spring 2017 January 9, 2017 4 / 32

continue... 6 A food scientist is interested in determining how different feeding schedules(for pigs) could affect the spread of salmonella during the slaughtering process. 7 A pharmacist is concerned that administering caffeine to premature babies will increase the incidence of necrotizing enterocolitis. 8 A research dietitian wants to determine if academic achievement is related to body mass index(bmi) among African American students in the fourth grade. Chong Ma (Statistics, USC) STAT 509 Spring 2017 January 9, 2017 5 / 32

Deterministic Model Deterministic Model is one that makes no attempt to explain variability. For example, In chemistry, the ideal gas law states that PV = nrt Where p=pressure of a gas, V=volume, n=the amount of substance of gas(number of moles), R=Boltzmann s constant, and T=temperature. In circuit analysis, Ohm s law states that V = IR Where V=voltage, I=current and R=resistant. Chong Ma (Statistics, USC) STAT 509 Spring 2017 January 9, 2017 6 / 32

Deterministic Model Remarks: In both of these models, the relationship among the variables is completely determined without ambiguity. In real life, this is rarely true for the obvious reason: there is natural variation that arises in the measurement process. For example, a common electrical engineering experiment involves setting up a simple circuit with a known resistance R. For a given current I, different students will then measure the voltage V. - With a sample of 20 students, conducting the experiment in succession, they might very well get 20 different measured voltages. - A deterministic model is too simplistic; it does not acknowledge the inherent variability that arises in the measurement process. Chong Ma (Statistics, USC) STAT 509 Spring 2017 January 9, 2017 7 / 32

Statistical Model Statistical Model is not deterministic, which incorporates variability and is used to predict future outcomes. Suppose that I am interested in predicting Y = STAT 509 final course percentage by using x 1 = SAT MATH score and x 2 = MATH 141 grade. The statistical model can be formated as Y = β 0 + β 1 x 1 + β 2 x 2 + ɛ Where ɛ is a term that accounts for not only measurement errors(e.g., incorrect information, data entry errors, grading errors, etc.) but also all of the other variables not accounted for(e.g., majoy, study habits, natural ability, etc.) and the error induced by assuming a linear relationship between Y and x 1 and x 2 when the relationship might actually not be. Chong Ma (Statistics, USC) STAT 509 Spring 2017 January 9, 2017 8 / 32

Statistical Model STAT 509 0 20 40 60 80 100 500 550 600 650 700 750 2.6 2.8 3.0 3.2 3.4 MATH 141 SAT MATH Figure 1: 3-D scatterplot of historical data Chong Ma (Statistics, USC) STAT 509 Spring 2017 January 9, 2017 9 / 32

Statistical Model Remarks: Is this sample of students representative of some larger relevant population? After all, we would like our model to be useful on a larger scale. How should we estimate β 0, β 1 and β 2 in the model? - If we can do this, then we can produce predictions of Y on a student-by-student basis. - This maybe of interest to academic advisers who are trying to model the success of their incoming students. - We can also characterize numerical uncertainty with our predictions. Probability is the mathematics of uncertainty and forms the basis for all of statistics. Chong Ma (Statistics, USC) STAT 509 Spring 2017 January 9, 2017 10 / 32

Outline 1 Introduction Deterministic Model Statistical Model 2 Probability Sample Spaces and Events Unions and Intersections Axioms of Probability Conditional Probability Probability Rules Random Variables Chong Ma (Statistics, USC) STAT 509 Spring 2017 January 9, 2017 11 / 32

Sample Spaces and Events Probability is a measure of one s belief in the occurrence of a future event. Here are some events to which we may wish to assign a probability. tomorrow s temperature belows 50 degrees manufacturing a defective part concluding one fertilizer is superior to another when it isn t the NASDAQ losing 5 percent of its value you being diagnosed with prostate/cervical cancer in next 30 years. Sample Space is the set of all possible outcomes for a given random experiment, denoted by S. The total number in S is denoted by n S. Event is a subset of the sample space, denoted by capital letters such as A, B, C,.... The total number in S is denoted by n S. Chong Ma (Statistics, USC) STAT 509 Spring 2017 January 9, 2017 12 / 32

Sample Spaces and Events Suppose a sample space S contains finite outcomes n S <, each of which is equally likely. This is called an equiprobability model. If an event A contains n A outcomes, then Examples P(A) = n A n S (a) The Michigan state lottery calls for a three-digit integer to be selected: S = {001, 002,..., 999} Let event A = winning number is a multiple of 5. Thus, we have n S = 1000, n A = 200 and P(A) = n A n S = 200 1000 = 0.2 Chong Ma (Statistics, USC) STAT 509 Spring 2017 January 9, 2017 13 / 32

Sample Spaces and Events (b) A USC undergraduate student is tested for chlamydia(0=negative and 1=positive). Thus, S = {0, 1} Here n S = 2 possible outcomes. However, is it reasonable to assume that each outcome in S is equally likely? The prevalence of chlamydia among college age students is much less than 50%. It would be illogical to assign probabilities using equiprobability model. (c) Four equally qualified applicants(a,b,c,d) are competing for two positions. If the positions are identical(so that selection order doesn t matter). Let A be the event that applicant d is selected for one of the two positions. Thus, S = {ab, ac, ad, bc, bd, cd}, A = {ad, bd, cd} And P(A) = 3/6 = 0.5 Chong Ma (Statistics, USC) STAT 509 Spring 2017 January 9, 2017 14 / 32

Sample Spaces and Events Interpretation: What does P(A) measure? There are two main interpretations: P(A) measures the likelihood that A will occure on any given experiment. If the experiment is performed many times, then P(A) can be interpreted as the percentage of times that A will occur over the long run. This is called the relative frequency interpretation. Example Suppose a new public company s initial stock price is set at $10. Assume that its average stock price on the first day is more than $21.75 with probability 0.463. The probability 0.463 can also be interpreted as the long run percentage of stock price more than $21.75. I used R to simulate the stock price on the first transaction day. Chong Ma (Statistics, USC) STAT 509 Spring 2017 January 9, 2017 15 / 32

Sample Spaces and Events stock price 10 15 20 25 0 100 200 300 400 Minutes Chong Ma (Statistics, USC) STAT 509 Spring 2017 January 9, 2017 16 / 32

Unions and Intersections Null Event denoted by, is an event that contains no outcomes. Accordingly, p( ) = 0. Union of two events A and B contains all outcomes ω in either event or in both, denoted by A B = {ω : ω A or ω B}. Intersection of two events A and B contains all outcomes ω in both events, denoted by A B = {ω : ω A and ω B}. Disjoint of two events A and B contains no common outcomes, also noted as mutually exclusive. It implies P(A B) = P( ) = 0 Example Hemophilia is a sex-linked hereditary blood defect of males characterized by delayed clotting of the blood. When a woman is a carrier of classical hemophilia, there is a 50 percent chance that a male child will inherit this disease. if a carrier gives birth to two males(not twins), what is the probability that either will have the disease? Both will have the disease? Chong Ma (Statistics, USC) STAT 509 Spring 2017 January 9, 2017 17 / 32

Unions and Intersections Consider that the process of having two male children as an experiment with sample space S = {++, +, +, } Where + means the male offspring has the disease; means that the male does not have the disease. Assume that each outcome in S is equally likely. Note that Thus, A = {first child has disease} = {++, + } B = {second child has disease} = { +, } A B = {either child has disease} = {++, +, +} A B = {both child have disease} = {++} Chong Ma (Statistics, USC) STAT 509 Spring 2017 January 9, 2017 18 / 32

Unions and Intersections The probability that either male child will have the disease is P(A B) = n A B n S = 3 4 = 0.75 The probability that both male children will have the disease is P(A B) = n A B n S = 1 4 = 0.25 Chong Ma (Statistics, USC) STAT 509 Spring 2017 January 9, 2017 19 / 32

Axioms of Probability Kolmogorov s Axioms: For any sample space S, a probability P must satisfy (1) 0 P(A) 1, for any event A (2) P(S) = 1 (3) If A 1, A 2,..., A n are pairwise mutually exclusive events, then n P( A i ) = i=1 n P(A i ) i=1 Remarks The term pairwise mutually exclusive means A i A j =, for all i j. The event n i=1 A i = A 1 A 2 A n means at least one A i occurs. Chong Ma (Statistics, USC) STAT 509 Spring 2017 January 9, 2017 20 / 32

Conditional Probability Let A and B be events in a sample space S with P(B) > 0. The conditional probability of A, given that B has occurred, is P(A B) = P(A B) P(B) Example In a company, 36% of employees have a degree from a SEC university, 22% of those employees with a degree from the SEC are engineers, and 30% of employees are engineers. An employee is selected at random. (a) Compute the probability that the employee is an engineer and is from the SEC. (b) Compute the conditional probability that the employee is from the SEC, given that he/she is an engineer. Chong Ma (Statistics, USC) STAT 509 Spring 2017 January 9, 2017 21 / 32

Conditional Probability Note that A = {employee has a degree from SEC}, P(A) = 0.36 B = {employee is an engineer}, P(B) = 0.3 And we also know from the problem P(B A) = 0.22 The probability that the employee is an engineer and is from SEC P(A B) = P(B A)P(A) = 0.22 0.36 = 0.0792 The conditional probability that the employee is from the SEC, given that he/she is an engineer P(A B) = P(A B) P(B) = 0.0792 0.3 = 0.264 Chong Ma (Statistics, USC) STAT 509 Spring 2017 January 9, 2017 22 / 32

Conditional Probability Remark:In the example, the conditional probability P(A B) and the unconditional probability P(A) are not equal. In some situation, knowledge that B has occurred has changed the likelihood that A occurs. In other situations, it might be that the occurrence(or non-occurrence) of a companion event has no effect on the probability of the event of interest. This leads to the definition of independence. When the occurrence or non-occurrence of B has no effect on whether or not A occurs, and vice-versa, we say the events A and B are independent. Mathematically, we define A and B are independent if and only if P(A B) = P(A)P(B) Equivalently, if A and B are independent, P(A B) P(A B) = = P(A)P(B) = P(A) P(B) P(B) P(B A) P(B A) = = P(B)P(A) = P(B) P(A) P(A) Chong Ma (Statistics, USC) STAT 509 Spring 2017 January 9, 2017 23 / 32

Conditional Probability Example In an engineering system, two components are placed in a series; that is, the system is functional as long as both components are. Each component is functional with probability 0.95. Define the events A 1 = {component 1 is functional} A 2 = {component 2 is functional} So that P(A 1 ) = P(A 2 ) = 0.95. Because we need both components to be functional, the probability that the system is functional is given by P(A 1 A 2 ). If the components operate independently, then A 1 and A 2 are independent events and the system reliability is P(A 1 A 2 ) = P(A 1 )P(A 2 ) = 0.95 0.95 = 0.9025 If the components do not operate independently; failure of one component wears on the other, we can not compute P(A 1 A 2 ) without additional knowledge. Chong Ma (Statistics, USC) STAT 509 Spring 2017 January 9, 2017 24 / 32

Conditional Probability Extension: The notion of independence extends to any finite collection of events A 1, A 2,..., A n. Mutually independence means that the probability of the intersection of any sub-collection of A 1, A 2,..., A n equals the product of the probabilities of the events in the sub-collection. For example, if A 1, A 2, A 3, and A 4 are mutually independent, then P(A 1 A 2 ) = P(A 1 )P(A 2 ) P(A 1 A 2 A 3 ) = P(A 1 )P(A 2 )P(A 3 ) P(A 1 A 2 A 3 A 4 ) = P(A 1 )P(A 2 )P(A 3 )P(A 4 ) Chong Ma (Statistics, USC) STAT 509 Spring 2017 January 9, 2017 25 / 32

Probability Rules Let S is a sample space and A is an event. The complement of A, denoted by A, is the collection of all outcomes in S not in A. That is, A = {ω S : ω / A} 1 Complement rule: P(A) = 1 P(A) 2 Additive law: P(A B) = P(A) + P(B) P(A B) 3 Multiplicative law: P(A B) = P(A B)P(B) = P(B A)P(A) 4 Law of Total Probability: 5 Bayes rule: P(A) = P(A B)P(B) + P(A B)P(B) P(A B) = P(A B) P(B) = P(A B) P(B A)P(A) + P(B A)P(A) Chong Ma (Statistics, USC) STAT 509 Spring 2017 January 9, 2017 26 / 32

Probability Rules Example The probability that train 1 is on time is 0.95. The probability that train 2 is on time is 0.93. The probability that both are on time is 0.90. Define the events A 1 = {train 1 is on time}, A 2 = {train 2 is on time} We are given that P(A 1 ) = 0.95, P(A 2 ) = 0.93, P(A 1 A 2 ) = 0.90 (a) What is the probability that train 1 is not on time? P(A 1 ) = 1 P(A 1 ) = 1 0.95 = 0.05 (b) What is the probability that at least one train is on time? P(A 1 A 2 ) = P(A 1 )+P(A 2 ) P(A 1 A 2 ) = 0.95+0.93 0.90 = 0.98 (c) What is the probability that train 1 is on time given that train 2 is on time? P(A 1 A 2 ) = P(A 1 A 2 ) = 0.90 P(A 2 ) 0.93 0.968 Chong Ma (Statistics, USC) STAT 509 Spring 2017 January 9, 2017 27 / 32

Probability Rules (d) What is the probability that train 2 is on time train 1 is not on time? P(A 2 A 1 ) = P(A 1 A 2 ) P(A 1 ) = P(A 2) P(A 1 A 2 ) 1 P(A 1 ) = 0.93 0.90 1 0.95 = 0.6 (d) Are A 1 and A 2 independent? They are not independent because P(A 1 A 2 ) P(A 1 ). In other words, knowledge that A 2 has occurred changes the likelihood that A 1 occurs. Example An insurance company classifies people as accident-prone and non-accident-prone. For a fixed year, the probability that an accident-prone person has an accident is 0.4, and the probability that a non-accident-prone person has an accident is 0.2. The population is estimated to be 30% accident-prone. Define the events A = {policy holder has an accident}, B = {policy holder is accident-prone} Chong Ma (Statistics, USC) STAT 509 Spring 2017 January 9, 2017 28 / 32

Probability Rules Note that P(B) = 0.3, P(A B = 0.4), P(A B) = 0.2 (a) What is the probability that a new policy-holder will have an accident? P(A) = P(A B)P(B) + P(A B)P(B) = 0.4 0.3 + 0.2 0.7 = 0.26 (b) Suppose that the policy-holder does have an accident. What is the probability that he/she was accident-prone? P(A B)P(B) P(B A) = P(A B)P(B) + P(A B)P(B) 0.4 0.3 = 0.4 0.3 + 0.2 0.7 0.46 Chong Ma (Statistics, USC) STAT 509 Spring 2017 January 9, 2017 29 / 32

Random Variables A random variables(denoted as r.v.) Y is a variable whose value is determined by chance. The distribution of a r.v.consists of two parts: an elicitation of the set of all possible values of Y (called support). a function that describes how to assign probabilities to events induced by Y. A r.v. Y can be categorized by two types: discrete If Y can assume only a finite (or countable) number of values. continuous If can envision Y as assuming in an interval off numbers. Remarks: By convention, we denote random variables by upper case letters towards the end of the alphabet, e.g., W, X, Y, Z, etc. A possible value of Y is denoted generally by its according lower case letter. In words, P(Y = y) is read, the probability that the random variable Y equals the value y. Chong Ma (Statistics, USC) STAT 509 Spring 2017 January 9, 2017 30 / 32

Random Variables Example Classify the following random variables as discrete or continuous and specify the support of each random variable. V = number of broken eggs in a randomly selected carton(dozen) W = ph of an aqueous solution X = length of time between accident at a factory Y = whether or not you pass this class Z = number of cans of beer that an adult has The r.v. V is discrete. It can assume values in {υ : υ 0, 1, 2,..., 12} The r.v. W is continuous. It can appropriately assume values in {ω : 0 ω 14} Of course, its support can assumed as {ω : < ω < } Chong Ma (Statistics, USC) STAT 509 Spring 2017 January 9, 2017 31 / 32

Random Variables The r.v. X is continuous. It can assume values in {x : 0 < x < } The key feature here is that a time cannot be negative. The r.v. Y is discrete. It can assume values in {y : y = 0, 1} Where 0 can arbitrarily label for passing and 0 for failing. R.V. that assume exactly two values are called binary. The r.v. Z is discrete. It can assume values in {z : z = 0, 1, 2,...} Here a large amount of beer is assumed. Chong Ma (Statistics, USC) STAT 509 Spring 2017 January 9, 2017 32 / 32