Probability Densities in Data Mining

Size: px
Start display at page:

Download "Probability Densities in Data Mining"

Transcription

1 Probabilit Densities in Data Mining Note to other teachers and users of these slides. Andrew would be delighted if ou found this source material useful in giving our own lectures. Feel free to use these slides verbatim, or to modif them to fit our own needs. PowerPoint originals are available. If ou make use of a significant ortion of these slides in our own lecture, lease include this message, or the following link to the source reositor of Andrew s tutorials: htt:// Comments and corrections gratefull received. Andrew W. Moore Associate Professor School of Comuter Science Carnegie Mellon Universit awm@cs.cmu.edu Coright 001, Andrew W. Moore Aug 7, 001 Probabilit Densities in Data Mining Wh we should care Notation and Fundamentals of continuous PDFs Multivariate continuous PDFs Combining continuous and discrete random variables Coright 001, Andrew W. Moore Probabilit Densities: Slide

2 Wh we should care Real Numbers occur in at least 50% of database records Can t alwas quantize them So need to understand how to describe where the come from A great wa of saing what s a reasonable range of values A great wa of saing how multile attributes should reasonabl co-occur Coright 001, Andrew W. Moore Probabilit Densities: Slide 3 Wh we should care Can immediatel get us Baes Classifiers that are sensible with real-valued data You ll need to intimatel understand PDFs in order to do kernel methods, clustering with Miture Models, analsis of variance, time series and man other things Will introduce us to linear and non-linear regression Coright 001, Andrew W. Moore Probabilit Densities: Slide 4

3 A PDF of American Ages in 000 Coright 001, Andrew W. Moore Probabilit Densities: Slide 5 A PDF of American Ages in 000 Let X be a continuous random variable. If is a Probabilit Densit Function for X then a < X b P d b a 50 < Age 50 P 30 age dage 0.36 age 30 Coright 001, Andrew W. Moore Probabilit Densities: Slide 6

4 Proerties of PDFs a < X b P d b a That means lim h 0 h P < X h h + P X Coright 001, Andrew W. Moore Probabilit Densities: Slide 7 Proerties of PDFs a < X b P d P b a X Therefore Therefore d 1 : 0 Coright 001, Andrew W. Moore Probabilit Densities: Slide 8

5 Talking to our stomach What s the gut-feel meaning of? If then and when a value X is samled from the distribution, ou are times as likel to find that X is ver close to 5.31 than that X is ver close to 5.9. Coright 001, Andrew W. Moore Probabilit Densities: Slide 9 Talking to our stomach What s the gut-feel meaning of? If then 5.31 a 0.06 and 5.9 b 0.03 when a value X is samled from the distribution, ou are times as likel to find that X is ver close to 5.31 a than that X is ver close to 5.9. b Coright 001, Andrew W. Moore Probabilit Densities: Slide 10

6 Talking to our stomach What s the gut-feel meaning of? If then 5.31 a 0.03 z and 5.9 b 0.06 z when a value X is samled from the distribution, ou are times as likel to find that X is ver close to 5.31 a than that X is ver close to 5.9. b Coright 001, Andrew W. Moore Probabilit Densities: Slide 11 Talking to our stomach What s the gut-feel meaning of? If then 5.31 a 0.03 αz and 5.9 b 0.06 z when a value X is samled from the distribution, ou are α times as likel to find that X is ver close to 5.31 a than that X is ver close to 5.9. b Coright 001, Andrew W. Moore Probabilit Densities: Slide 1

7 Talking to our stomach What s the gut-feel meaning of? If then a b α when a value X is samled from the distribution, ou are α times as likel to find that X is ver close to 5.31 a than that X is ver close to 5.9. b Coright 001, Andrew W. Moore Probabilit Densities: Slide 13 Talking to our stomach What s the gut-feel meaning of? If a b then P a h < lim h 0 P b h < α X X < a + h < b + h α Coright 001, Andrew W. Moore Probabilit Densities: Slide 14

8 Yet another wa to view a PDF A recie for samling a random age. 1. Generate a random dot from the rectangle surrounding the PDF curve. Call the dot age,d. If d < age sto and return age 3. Else tr again: go to Ste 1. Coright 001, Andrew W. Moore Probabilit Densities: Slide 15 Test our understanding True or False: : 1 : P X 0 Coright 001, Andrew W. Moore Probabilit Densities: Slide 16

9 Eectations E[X] the eected value of random variable X the average value we d see if we took a ver large number of random samles of X d Coright 001, Andrew W. Moore Probabilit Densities: Slide 17 Eectations E[age] E[X] the eected value of random variable X the average value we d see if we took a ver large number of random samles of X Coright 001, Andrew W. Moore Probabilit Densities: Slide 18 d the first moment of the shae formed b the aes and the blue curve the best value to choose if ou must guess an unknown erson s age and ou ll be fined the square of our error

10 Eectation of a function E[age ] E[age] µe[fx] the eected value of f where is drawn from X s distribution. the average value we d see if we took a ver large number of random samles of fx µ f d Note that in general: E[ f ] f E[ X ] Coright 001, Andrew W. Moore Probabilit Densities: Slide 19 Variance σ Var[X] the eected squared difference between and E[X] Var [age] σ µ d amount ou d eect to lose if ou must guess an unknown erson s age and ou ll be fined the square of our error, and assuming ou la otimall Coright 001, Andrew W. Moore Probabilit Densities: Slide 0

11 Standard Deviation σ Var[X] the eected squared difference between and E[X] Var [age] σ σ µ d amount ou d eect to lose if ou must guess an unknown erson s age and ou ll be fined the square of our error, and assuming ou la otimall σ Standard Deviation tical deviation of X from its mean σ Var[ X ] Coright 001, Andrew W. Moore Probabilit Densities: Slide 1 In dimensions, robabilit densit of random variables X,Y at location, Coright 001, Andrew W. Moore Probabilit Densities: Slide

12 In dimensions Let X,Y be a air of continuous random variables, and let R be some region of X,Y sace P X, Y R, R, dd Coright 001, Andrew W. Moore Probabilit Densities: Slide 3 In dimensions Let X,Y be a air of continuous random variables, and let R be some region of X,Y sace P X, Y R, R, dd P 0<mg<30 and 500<weight<3000 area under the -d surface within the red rectangle Coright 001, Andrew W. Moore Probabilit Densities: Slide 4

13 In dimensions Let X,Y be a air of continuous random variables, and let R be some region of X,Y sace P X, Y R, R, dd P [mg-5/10] + [weight-3300/1500] < 1 area under the -d surface within the red oval Coright 001, Andrew W. Moore Probabilit Densities: Slide 5 In dimensions Let X,Y be a air of continuous random variables, and let R be some region of X,Y sace P X, Y R, R, dd Take the secial case of region R everwhere. Remember that with robabilit 1, X,Y will be drawn from somewhere. So.., dd 1 Coright 001, Andrew W. Moore Probabilit Densities: Slide 6

14 In dimensions Let X,Y be a air of continuous random variables, and let R be some region of X,Y sace P X, Y R, R, dd, lim h 0 h P < X h + h h < Y h + Coright 001, Andrew W. Moore Probabilit Densities: Slide 7 In m dimensions Let X 1,X, X m be an n-tule of continuous random variables, and let R be some region of R m P X, X,..., X, 1 1 m R...,..., m R, 1,..., m d m,,... d, d 1 Coright 001, Andrew W. Moore Probabilit Densities: Slide 8

15 Indeendence X Y iff, :, If X and Y are indeendent then knowing the value of X does not hel redict the value of Y mg,weight NOT indeendent Coright 001, Andrew W. Moore Probabilit Densities: Slide 9 Indeendence X Y iff, :, If X and Y are indeendent then knowing the value of X does not hel redict the value of Y the contours sa that acceleration and weight are indeendent Coright 001, Andrew W. Moore Probabilit Densities: Slide 30

16 Multivariate Eectation µ X E [ X] d E[mg,weight] 4.5,600 The centroid of the cloud Coright 001, Andrew W. Moore Probabilit Densities: Slide 31 Multivariate Eectation E [ f X] f d Coright 001, Andrew W. Moore Probabilit Densities: Slide 3

17 Test our understanding Question : When if ever does E [ X + Y] E[ X ] + E[ Y]? All the time? Onl when X and Y are indeendent? It can fail even if X and Y are indeendent? Coright 001, Andrew W. Moore Probabilit Densities: Slide 33 Bivariate Eectation E [ f, ] f,, dd if f, then E[ f X, Y ], dd if f, then E[ f X, Y], dd if f, + then E[ f X, Y ] +, dd E [ X + Y] E[ X ] + E[ Y] Coright 001, Andrew W. Moore Probabilit Densities: Slide 34

18 σ Bivariate Covariance Cov[ X, Y ] E[ X µ Y µ ] σ σ σ σ Cov[ X, X ] Var[ X ] E[ X µ Cov[ Y, Y] Var[ Y ] E[ Y µ ] ] Coright 001, Andrew W. Moore Probabilit Densities: Slide 35 σ Bivariate Covariance Cov[ X, Y ] E[ X µ Y µ ] σ σ σ Cov[ X] σ Cov[ X, X ] Var[ X ] E[ X µ Cov[ Y, Y] Var[ Y ] E[ Y µ X Write X, then Y ] σ σ T E[ X µ X µ ] Σ σ σ Coright 001, Andrew W. Moore Probabilit Densities: Slide 36 ]

19 Covariance Intuition σ weight 700 E[mg,weight] 4.5,600 σ weight 700 σ mg 8 σ mg 8 Coright 001, Andrew W. Moore Probabilit Densities: Slide 37 Covariance Intuition Princial Eigenvector of Σ σ weight 700 E[mg,weight] 4.5,600 σ weight 700 σ mg 8 σ mg 8 Coright 001, Andrew W. Moore Probabilit Densities: Slide 38

20 Cov[ X] Covariance Fun Facts σ σ T E[ X µ X µ ] Σ σ σ True or False: If σ 0 then X and Y are indeendent True or False: If X and Y are indeendent then σ 0 True or False: If σ σ σ then X and Y are deterministicall related True or False: If X and Y are deterministicall related then σ σ σ How could ou rove or disrove these? Coright 001, Andrew W. Moore Probabilit Densities: Slide 39 General Covariance Let X X 1,X, X k be a vector of k continuous random variables T Cov [ X] E [ X µ X µ ] Σ Σ ij Cov[ X, X ] σ i j i j S is a k k smmetric non-negative definite matri If all distributions are linearl indeendent it is ositive definite If the distributions are linearl deendent it has determinant zero Coright 001, Andrew W. Moore Probabilit Densities: Slide 40

21 Question : When if Test our understanding ever doesvar [ X + Y] Var[ X ] + Var[ Y]? All the time? Onl when X and Y are indeendent? It can fail even if X and Y are indeendent? Coright 001, Andrew W. Moore Probabilit Densities: Slide 41 Marginal Distributions, d Coright 001, Andrew W. Moore Probabilit Densities: Slide 4

22 mg weight 4600 Conditional Distributions mg weight 300 mg weight 000.d.f. of X when Y Coright 001, Andrew W. Moore Probabilit Densities: Slide 43 mg weight 4600 Conditional Distributions, Wh?.d.f. of X when Y Coright 001, Andrew W. Moore Probabilit Densities: Slide 44

23 Coright 001, Andrew W. Moore Probabilit Densities: Slide 45 Indeendence Revisited It s eas to rove that these statements are equivalent, :, iff Y X :, :,, :, Coright 001, Andrew W. Moore Probabilit Densities: Slide 46 More useful stuff Baes Rule These can all be roved from definitions on revious slides 1 d,, z z z

24 Coright 001, Andrew W. Moore Probabilit Densities: Slide 47 Miing discrete and continuous variables h v A h X h P v A + <, lim 0 h 1, 1 n A v d v A Baes Rule Baes Rule A P A P A A P A A P Coright 001, Andrew W. Moore Probabilit Densities: Slide 48 Miing discrete and continuous variables PEduYears,Wealth

25 Miing discrete and continuous variables PEduYears,Wealth PWealth EduYears Coright 001, Andrew W. Moore Probabilit Densities: Slide 49 Miing discrete and continuous variables PEduYears,Wealth PEduYears Wealth PWealth EduYears Renormalized Aes Coright 001, Andrew W. Moore Probabilit Densities: Slide 50

26 What ou should know You should be able to la with discrete, continuous and mied joint distributions You should be ha with the difference between and PA You should be intimate with eectations of continuous and discrete random variables You should smile when ou meet a covariance matri Indeendence and its consequences should be second nature Coright 001, Andrew W. Moore Probabilit Densities: Slide 51 Discussion Are PDFs the onl sensible wa to handle analsis of real-valued variables? Wh is covariance an imortant concet? Suose X and Y are indeendent real-valued random variables distributed between 0 and 1: What is [minx,y]? What is E[minX,Y]? Prove that E[X] is the value u that minimizes E[X-u ] What is the value u that minimizes E[ X-u ]? Coright 001, Andrew W. Moore Probabilit Densities: Slide 5

Cross-validation for detecting and preventing overfitting

Cross-validation for detecting and preventing overfitting Cross-validation for detecting and preventing overfitting A Regression Problem = f() + noise Can we learn f from this data? Note to other teachers and users of these slides. Andrew would be delighted if

More information

Random Vectors. 1 Joint distribution of a random vector. 1 Joint distribution of a random vector

Random Vectors. 1 Joint distribution of a random vector. 1 Joint distribution of a random vector Random Vectors Joint distribution of a random vector Joint distributionof of a random vector Marginal and conditional distributions Previousl, we studied probabilit distributions of a random variable.

More information

UNIVERSITY OF DUBLIN TRINITY COLLEGE. Faculty of Engineering, Mathematics and Science. School of Computer Science & Statistics

UNIVERSITY OF DUBLIN TRINITY COLLEGE. Faculty of Engineering, Mathematics and Science. School of Computer Science & Statistics UNIVERSI OF DUBLIN RINI COLLEGE Facult of Engineering, Mathematics and Science School of Comuter Science & Statistics BA (Mod) Maths, SM rinit erm 04 SF and JS S35 Probabilit and heoretical Statistics

More information

Probabilistic and Bayesian Analytics

Probabilistic and Bayesian Analytics Probabilistic and Bayesian Analytics Note to other teachers and users of these slides. Andrew would be delighted if you found this source material useful in giving your own lectures. Feel free to use these

More information

Copyright, 2008, R.E. Kass, E.N. Brown, and U. Eden REPRODUCTION OR CIRCULATION REQUIRES PERMISSION OF THE AUTHORS

Copyright, 2008, R.E. Kass, E.N. Brown, and U. Eden REPRODUCTION OR CIRCULATION REQUIRES PERMISSION OF THE AUTHORS Copright, 8, RE Kass, EN Brown, and U Eden REPRODUCTION OR CIRCULATION REQUIRES PERMISSION OF THE AUTHORS Chapter 6 Random Vectors and Multivariate Distributions 6 Random Vectors In Section?? we etended

More information

Review of Probability

Review of Probability Review of robabilit robabilit Theor: Man techniques in speech processing require the manipulation of probabilities and statistics. The two principal application areas we will encounter are: Statistical

More information

Conditioning and Independence

Conditioning and Independence Discrete Random Variables: Joint PMFs Conditioning and Indeendence Berlin Chen Deartment of Comuter Science & Information ngineering National Taiwan Normal Universit Reference: - D. P. Bertsekas J. N.

More information

VC-dimension for characterizing classifiers

VC-dimension for characterizing classifiers VC-dimension for characterizing classifiers Note to other teachers and users of these slides. Andrew would be delighted if you found this source material useful in giving your own lectures. Feel free to

More information

Covariance and Correlation Class 7, Jeremy Orloff and Jonathan Bloom

Covariance and Correlation Class 7, Jeremy Orloff and Jonathan Bloom 1 Learning Goals Covariance and Correlation Class 7, 18.05 Jerem Orloff and Jonathan Bloom 1. Understand the meaning of covariance and correlation. 2. Be able to compute the covariance and correlation

More information

VC-dimension for characterizing classifiers

VC-dimension for characterizing classifiers VC-dimension for characterizing classifiers Note to other teachers and users of these slides. Andrew would be delighted if you found this source material useful in giving your own lectures. Feel free to

More information

Introduction to Probability for Graphical Models

Introduction to Probability for Graphical Models Introduction to Probability for Grahical Models CSC 4 Kaustav Kundu Thursday January 4, 06 *Most slides based on Kevin Swersky s slides, Inmar Givoni s slides, Danny Tarlow s slides, Jaser Snoek s slides,

More information

V-0. Review of Probability

V-0. Review of Probability V-0. Review of Probabilit Random Variable! Definition Numerical characterization of outcome of a random event!eamles Number on a rolled die or dice Temerature at secified time of da 3 Stock Market at close

More information

11. Regression and Least Squares

11. Regression and Least Squares 11. Regression and Least Squares Prof. Tesler Math 186 Winter 2016 Prof. Tesler Ch. 11: Linear Regression Math 186 / Winter 2016 1 / 23 Regression Given n points ( 1, 1 ), ( 2, 2 ),..., we want to determine

More information

INF Introduction to classifiction Anne Solberg Based on Chapter 2 ( ) in Duda and Hart: Pattern Classification

INF Introduction to classifiction Anne Solberg Based on Chapter 2 ( ) in Duda and Hart: Pattern Classification INF 4300 151014 Introduction to classifiction Anne Solberg anne@ifiuiono Based on Chapter 1-6 in Duda and Hart: Pattern Classification 151014 INF 4300 1 Introduction to classification One of the most challenging

More information

University of California, Los Angeles Department of Statistics. Joint probability distributions

University of California, Los Angeles Department of Statistics. Joint probability distributions Universit of California, Los Angeles Department of Statistics Statistics 100A Instructor: Nicolas Christou Joint probabilit distributions So far we have considered onl distributions with one random variable.

More information

Information Gain. Andrew W. Moore Professor School of Computer Science Carnegie Mellon University.

Information Gain. Andrew W. Moore Professor School of Computer Science Carnegie Mellon University. te to other teachers and users of these slides. Andrew would be delighted if you found this source material useful in giving your own lectures. Feel free to use these slides verbatim, or to modify them

More information

CHAPTER 5. Jointly Probability Mass Function for Two Discrete Distributed Random Variables:

CHAPTER 5. Jointly Probability Mass Function for Two Discrete Distributed Random Variables: CHAPTER 5 Jointl Distributed Random Variable There are some situations that experiment contains more than one variable and researcher interested in to stud joint behavior of several variables at the same

More information

f X, Y (x, y)dx (x), where f(x,y) is the joint pdf of X and Y. (x) dx

f X, Y (x, y)dx (x), where f(x,y) is the joint pdf of X and Y. (x) dx INDEPENDENCE, COVARIANCE AND CORRELATION Independence: Intuitive idea of "Y is independent of X": The distribution of Y doesn't depend on the value of X. In terms of the conditional pdf's: "f(y x doesn't

More information

Biostatistics in Research Practice - Regression I

Biostatistics in Research Practice - Regression I Biostatistics in Research Practice - Regression I Simon Crouch 30th Januar 2007 In scientific studies, we often wish to model the relationships between observed variables over a sample of different subjects.

More information

Correlation analysis 2: Measures of correlation

Correlation analysis 2: Measures of correlation Correlation analsis 2: Measures of correlation Ran Tibshirani Data Mining: 36-462/36-662 Februar 19 2013 1 Review: correlation Pearson s correlation is a measure of linear association In the population:

More information

Inference about the Slope and Intercept

Inference about the Slope and Intercept Inference about the Slope and Intercept Recall, we have established that the least square estimates and 0 are linear combinations of the Y i s. Further, we have showed that the are unbiased and have the

More information

Experimental Uncertainty Review. Abstract. References. Measurement Uncertainties and Uncertainty Propagation

Experimental Uncertainty Review. Abstract. References. Measurement Uncertainties and Uncertainty Propagation Experimental Uncertaint Review Abstract This is intended as a brief summar of the basic elements of uncertaint analsis, and a hand reference for laborator use. It provides some elementar "rules-of-thumb"

More information

36-720: Latent Class Models

36-720: Latent Class Models 36-720: Latent Class Models Brian Junker October 17, 2007 Latent Class Models and Simpson s Paradox Latent Class Model for a Table of Counts An Intuitive E-M Algorithm for Fitting Latent Class Models Deriving

More information

2. This problem is very easy if you remember the exponential cdf; i.e., 0, y 0 F Y (y) = = ln(0.5) = ln = ln 2 = φ 0.5 = β ln 2.

2. This problem is very easy if you remember the exponential cdf; i.e., 0, y 0 F Y (y) = = ln(0.5) = ln = ln 2 = φ 0.5 = β ln 2. FALL 8 FINAL EXAM SOLUTIONS. Use the basic counting rule. There are n = number of was to choose 5 white balls from 7 = n = number of was to choose ellow ball from 5 = ( ) 7 5 ( ) 5. there are n n = ( )(

More information

INF Anne Solberg One of the most challenging topics in image analysis is recognizing a specific object in an image.

INF Anne Solberg One of the most challenging topics in image analysis is recognizing a specific object in an image. INF 4300 700 Introduction to classifiction Anne Solberg anne@ifiuiono Based on Chapter -6 6inDuda and Hart: attern Classification 303 INF 4300 Introduction to classification One of the most challenging

More information

Algorithms for Playing and Solving games*

Algorithms for Playing and Solving games* Algorithms for Playing and Solving games* Andrew W. Moore Professor School of Computer Science Carnegie Mellon University www.cs.cmu.edu/~awm awm@cs.cmu.edu 412-268-7599 * Two Player Zero-sum Discrete

More information

General Random Variables

General Random Variables Chater General Random Variables. Law of a Random Variable Thus far we have considered onl random variables whose domain and range are discrete. We now consider a general random variable X! defined on the

More information

Chapter 7: Special Distributions

Chapter 7: Special Distributions This chater first resents some imortant distributions, and then develos the largesamle distribution theory which is crucial in estimation and statistical inference Discrete distributions The Bernoulli

More information

INF Introduction to classifiction Anne Solberg

INF Introduction to classifiction Anne Solberg INF 4300 8.09.17 Introduction to classifiction Anne Solberg anne@ifi.uio.no Introduction to classification Based on handout from Pattern Recognition b Theodoridis, available after the lecture INF 4300

More information

Today. CS-184: Computer Graphics. Introduction. Some Examples. 2D Transformations

Today. CS-184: Computer Graphics. Introduction. Some Examples. 2D Transformations Toda CS-184: Comuter Grahics Lecture #3: 2D Transformations Prof. James O Brien Universit of California, Berkele V2005F-03-1.0 2D Transformations Primitive Oerations Scale, Rotate, Shear, Fli, Translate

More information

THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA

THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA THE ROYAL STATISTICAL SOCIETY 4 EXAINATIONS SOLUTIONS GRADUATE DIPLOA PAPER I STATISTICAL THEORY & ETHODS The Societ provides these solutions to assist candidates preparing for the examinations in future

More information

3.0 PROBABILITY, RANDOM VARIABLES AND RANDOM PROCESSES

3.0 PROBABILITY, RANDOM VARIABLES AND RANDOM PROCESSES 3.0 PROBABILITY, RANDOM VARIABLES AND RANDOM PROCESSES 3.1 Introduction In this chapter we will review the concepts of probabilit, rom variables rom processes. We begin b reviewing some of the definitions

More information

5. Zeros. We deduce that the graph crosses the x-axis at the points x = 0, 1, 2 and 4, and nowhere else. And that s exactly what we see in the graph.

5. Zeros. We deduce that the graph crosses the x-axis at the points x = 0, 1, 2 and 4, and nowhere else. And that s exactly what we see in the graph. . Zeros Eample 1. At the right we have drawn the graph of the polnomial = ( 1) ( 2) ( 4). Argue that the form of the algebraic formula allows ou to see right awa where the graph is above the -ais, where

More information

MATH 829: Introduction to Data Mining and Analysis Consistency of Linear Regression

MATH 829: Introduction to Data Mining and Analysis Consistency of Linear Regression 1/9 MATH 829: Introduction to Data Mining and Analysis Consistency of Linear Regression Dominique Guillot Deartments of Mathematical Sciences University of Delaware February 15, 2016 Distribution of regression

More information

Expected value of r.v. s

Expected value of r.v. s 10 Epected value of r.v. s CDF or PDF are complete (probabilistic) descriptions of the behavior of a random variable. Sometimes we are interested in less information; in a partial characterization. 8 i

More information

Chapter 18 Quadratic Function 2

Chapter 18 Quadratic Function 2 Chapter 18 Quadratic Function Completed Square Form 1 Consider this special set of numbers - the square numbers or the set of perfect squares. 4 = = 9 = 3 = 16 = 4 = 5 = 5 = Numbers like 5, 11, 15 are

More information

Multiple Random Variables

Multiple Random Variables Multiple Random Variables This Version: July 30, 2015 Multiple Random Variables 2 Now we consider models with more than one r.v. These are called multivariate models For instance: height and weight An

More information

Monte Carlo Studies. Monte Carlo Studies. Sampling Distribution

Monte Carlo Studies. Monte Carlo Studies. Sampling Distribution Monte Carlo Studies Do not let yourself be intimidated by the material in this lecture This lecture involves more theory but is meant to imrove your understanding of: Samling distributions and tests of

More information

The data can be downloaded as an Excel file under Econ2130 at

The data can be downloaded as an Excel file under Econ2130 at 1 HG Revised Sept. 018 Supplement to lecture 9 (Tuesda 18 Sept) On the bivariate normal model Eample: daughter s height (Y) vs. mother s height (). Data collected on Econ 130 lectures 010-01. The data

More information

Classical AI and ML research ignored this phenomena Another example

Classical AI and ML research ignored this phenomena Another example Wat is tis? Classical AI and ML researc ignored tis enomena Anoter eamle you want to catc a fligt at 0:00am from Pitt to SF, can I make it if I leave at 8am and take a Marta at Gatec? artial observability

More information

Review (probability, linear algebra) CE-717 : Machine Learning Sharif University of Technology

Review (probability, linear algebra) CE-717 : Machine Learning Sharif University of Technology Review (probability, linear algebra) CE-717 : Machine Learning Sharif University of Technology M. Soleymani Fall 2012 Some slides have been adopted from Prof. H.R. Rabiee s and also Prof. R. Gutierrez-Osuna

More information

Bayesian Networks: Independencies and Inference

Bayesian Networks: Independencies and Inference Bayesian Networks: Independencies and Inference Scott Davies and Andrew Moore Note to other teachers and users of these slides. Andrew and Scott would be delighted if you found this source material useful

More information

MAHALAKSHMI ENGINEERING COLLEGE TIRUCHIRAPALLI

MAHALAKSHMI ENGINEERING COLLEGE TIRUCHIRAPALLI MAHALAKSHMI ENGINEERING COLLEGE TIRUCHIRAPALLI QUESTION BANK - ANSWERS SEMESTER: IV MA - PROBABILITY AND QUEUEING THEORY UNIT II: TWO DIMENSIONAL RANDOM VARIABLES PART-A Question : AUC M / J If the joint

More information

Review: mostly probability and some statistics

Review: mostly probability and some statistics Review: mostly probability and some statistics C2 1 Content robability (should know already) Axioms and properties Conditional probability and independence Law of Total probability and Bayes theorem Random

More information

Section 2: Wave Functions and Probability Solutions

Section 2: Wave Functions and Probability Solutions Phsics 43a: Quantum Mechanics I Section : Wave Functions and Probabilit Solutions Spring 5, Harvard Here is a summar of the most important points from the second week with a few of m own tidbits), relevant

More information

10.2 Polar Equations and Graphs

10.2 Polar Equations and Graphs SECTIN 0. Polar Equations and Grahs 77 Elaining Concets: Discussion and Writing 85. In converting from olar coordinates to rectangular coordinates, what formulas will ou use? 86. Elain how ou roceed to

More information

3. Several Random Variables

3. Several Random Variables . Several Random Variables. Two Random Variables. Conditional Probabilit--Revisited. Statistical Independence.4 Correlation between Random Variables. Densit unction o the Sum o Two Random Variables. Probabilit

More information

Linear regression Class 25, Jeremy Orloff and Jonathan Bloom

Linear regression Class 25, Jeremy Orloff and Jonathan Bloom 1 Learning Goals Linear regression Class 25, 18.05 Jerem Orloff and Jonathan Bloom 1. Be able to use the method of least squares to fit a line to bivariate data. 2. Be able to give a formula for the total

More information

ME 597: AUTONOMOUS MOBILE ROBOTICS SECTION 2 PROBABILITY. Prof. Steven Waslander

ME 597: AUTONOMOUS MOBILE ROBOTICS SECTION 2 PROBABILITY. Prof. Steven Waslander ME 597: AUTONOMOUS MOBILE ROBOTICS SECTION 2 Prof. Steven Waslander p(a): Probability that A is true 0 pa ( ) 1 p( True) 1, p( False) 0 p( A B) p( A) p( B) p( A B) A A B B 2 Discrete Random Variable X

More information

Moment Generating Function. STAT/MTHE 353: 5 Moment Generating Functions and Multivariate Normal Distribution

Moment Generating Function. STAT/MTHE 353: 5 Moment Generating Functions and Multivariate Normal Distribution Moment Generating Function STAT/MTHE 353: 5 Moment Generating Functions and Multivariate Normal Distribution T. Linder Queen s University Winter 07 Definition Let X (X,...,X n ) T be a random vector and

More information

0.24 adults 2. (c) Prove that, regardless of the possible values of and, the covariance between X and Y is equal to zero. Show all work.

0.24 adults 2. (c) Prove that, regardless of the possible values of and, the covariance between X and Y is equal to zero. Show all work. 1 A socioeconomic stud analzes two discrete random variables in a certain population of households = number of adult residents and = number of child residents It is found that their joint probabilit mass

More information

Random vectors X 1 X 2. Recall that a random vector X = is made up of, say, k. X k. random variables.

Random vectors X 1 X 2. Recall that a random vector X = is made up of, say, k. X k. random variables. Random vectors Recall that a random vector X = X X 2 is made up of, say, k random variables X k A random vector has a joint distribution, eg a density f(x), that gives probabilities P(X A) = f(x)dx Just

More information

Summary of Random Variable Concepts March 17, 2000

Summary of Random Variable Concepts March 17, 2000 Summar of Random Variable Concepts March 17, 2000 This is a list of important concepts we have covered, rather than a review that devives or eplains them. Tpes of random variables discrete A random variable

More information

2: Distributions of Several Variables, Error Propagation

2: Distributions of Several Variables, Error Propagation : Distributions of Several Variables, Error Propagation Distribution of several variables. variables The joint probabilit distribution function of two variables and can be genericall written f(, with the

More information

MTH234 Chapter 15 - Multiple Integrals Michigan State University

MTH234 Chapter 15 - Multiple Integrals Michigan State University MTH24 Chater 15 - Multile Integrals Michigan State University 6 Surface Area Just as arc length is an alication of a single integral, surface area is an alication of double integrals. In 15.6 we comute

More information

Probability Theory Refresher

Probability Theory Refresher Machine Learning WS24 Module IN264 Sheet 2 Page Machine Learning Worksheet 2 Probabilit Theor Refresher Basic Probabilit Problem : A secret government agenc has developed a scanner which determines whether

More information

LESSON 35: EIGENVALUES AND EIGENVECTORS APRIL 21, (1) We might also write v as v. Both notations refer to a vector.

LESSON 35: EIGENVALUES AND EIGENVECTORS APRIL 21, (1) We might also write v as v. Both notations refer to a vector. LESSON 5: EIGENVALUES AND EIGENVECTORS APRIL 2, 27 In this contet, a vector is a column matri E Note 2 v 2, v 4 5 6 () We might also write v as v Both notations refer to a vector (2) A vector can be man

More information

Chapter 5 continued. Chapter 5 sections

Chapter 5 continued. Chapter 5 sections Chapter 5 sections Discrete univariate distributions: 5.2 Bernoulli and Binomial distributions Just skim 5.3 Hypergeometric distributions 5.4 Poisson distributions Just skim 5.5 Negative Binomial distributions

More information

Introduction to Computational Finance and Financial Econometrics Probability Theory Review: Part 2

Introduction to Computational Finance and Financial Econometrics Probability Theory Review: Part 2 Introduction to Computational Finance and Financial Econometrics Probability Theory Review: Part 2 Eric Zivot July 7, 2014 Bivariate Probability Distribution Example - Two discrete rv s and Bivariate pdf

More information

Dependence and scatter-plots. MVE-495: Lecture 4 Correlation and Regression

Dependence and scatter-plots. MVE-495: Lecture 4 Correlation and Regression Dependence and scatter-plots MVE-495: Lecture 4 Correlation and Regression It is common for two or more quantitative variables to be measured on the same individuals. Then it is useful to consider what

More information

Numerical Linear Algebra

Numerical Linear Algebra Numerical Linear Algebra Numerous alications in statistics, articularly in the fitting of linear models. Notation and conventions: Elements of a matrix A are denoted by a ij, where i indexes the rows and

More information

Review (Probability & Linear Algebra)

Review (Probability & Linear Algebra) Review (Probability & Linear Algebra) CE-725 : Statistical Pattern Recognition Sharif University of Technology Spring 2013 M. Soleymani Outline Axioms of probability theory Conditional probability, Joint

More information

Section 3.1. ; X = (0, 1]. (i) f : R R R, f (x, y) = x y

Section 3.1. ; X = (0, 1]. (i) f : R R R, f (x, y) = x y Paul J. Bruillard MATH 0.970 Problem Set 6 An Introduction to Abstract Mathematics R. Bond and W. Keane Section 3.1: 3b,c,e,i, 4bd, 6, 9, 15, 16, 18c,e, 19a, 0, 1b Section 3.: 1f,i, e, 6, 1e,f,h, 13e,

More information

7. Two Random Variables

7. Two Random Variables 7. Two Random Variables In man eeriments the observations are eressible not as a single quantit but as a amil o quantities. or eamle to record the height and weight o each erson in a communit or the number

More information

Statistical Distribution Assumptions of General Linear Models

Statistical Distribution Assumptions of General Linear Models Statistical Distribution Assumptions of General Linear Models Applied Multilevel Models for Cross Sectional Data Lecture 4 ICPSR Summer Workshop University of Colorado Boulder Lecture 4: Statistical Distributions

More information

Polar Coordinates; Vectors

Polar Coordinates; Vectors Polar Coordinates; Vectors Earth Scientists Use Fractals to Measure and Predict Natural Disasters Predicting the size, location, and timing of natural hazards is virtuall imossible, but now earth scientists

More information

Review of Elementary Probability Lecture I Hamid R. Rabiee

Review of Elementary Probability Lecture I Hamid R. Rabiee Stochastic Processes Review o Elementar Probabilit Lecture I Hamid R. Rabiee Outline Histor/Philosoph Random Variables Densit/Distribution Functions Joint/Conditional Distributions Correlation Important

More information

Bivariate distributions

Bivariate distributions Bivariate distributions 3 th October 017 lecture based on Hogg Tanis Zimmerman: Probability and Statistical Inference (9th ed.) Bivariate Distributions of the Discrete Type The Correlation Coefficient

More information

Probability. Paul Schrimpf. January 23, Definitions 2. 2 Properties 3

Probability. Paul Schrimpf. January 23, Definitions 2. 2 Properties 3 Probability Paul Schrimpf January 23, 2018 Contents 1 Definitions 2 2 Properties 3 3 Random variables 4 3.1 Discrete........................................... 4 3.2 Continuous.........................................

More information

Physics Gravitational force. 2. Strong or color force. 3. Electroweak force

Physics Gravitational force. 2. Strong or color force. 3. Electroweak force Phsics 360 Notes on Griffths - pluses and minuses No tetbook is perfect, and Griffithsisnoeception. Themajorplusisthat it is prett readable. For minuses, see below. Much of what G sas about the del operator

More information

INTRODUCTION TO DIOPHANTINE EQUATIONS

INTRODUCTION TO DIOPHANTINE EQUATIONS INTRODUCTION TO DIOPHANTINE EQUATIONS In the earl 20th centur, Thue made an important breakthrough in the stud of diophantine equations. His proof is one of the first eamples of the polnomial method. His

More information

Multivariate Random Variable

Multivariate Random Variable Multivariate Random Variable Author: Author: Andrés Hincapié and Linyi Cao This Version: August 7, 2016 Multivariate Random Variable 3 Now we consider models with more than one r.v. These are called multivariate

More information

Bayesian Networks Structure Learning (cont.)

Bayesian Networks Structure Learning (cont.) Koller & Friedman Chapters (handed out): Chapter 11 (short) Chapter 1: 1.1, 1., 1.3 (covered in the beginning of semester) 1.4 (Learning parameters for BNs) Chapter 13: 13.1, 13.3.1, 13.4.1, 13.4.3 (basic

More information

MA Game Theory 2005, LSE

MA Game Theory 2005, LSE MA. Game Theor, LSE Problem Set The problems in our third and final homework will serve two purposes. First, the will give ou practice in studing games with incomplete information. Second (unless indicated

More information

6. Linear transformations. Consider the function. f : R 2 R 2 which sends (x, y) (x, y)

6. Linear transformations. Consider the function. f : R 2 R 2 which sends (x, y) (x, y) Consider the function 6 Linear transformations f : R 2 R 2 which sends (x, ) (, x) This is an example of a linear transformation Before we get into the definition of a linear transformation, let s investigate

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analysis of Variance and Design of Exeriment-I MODULE II LECTURE -4 GENERAL LINEAR HPOTHESIS AND ANALSIS OF VARIANCE Dr. Shalabh Deartment of Mathematics and Statistics Indian Institute of Technology Kanur

More information

What is the limit of a function? Intuitively, we want the limit to say that as x gets closer to some value a,

What is the limit of a function? Intuitively, we want the limit to say that as x gets closer to some value a, Limits The notion of a limit is fundamental to the stud of calculus. It is one of the primar concepts that distinguishes calculus from mathematical subjects that ou saw prior to calculus, such as algebra

More information

Mathematical Statistics. Gregg Waterman Oregon Institute of Technology

Mathematical Statistics. Gregg Waterman Oregon Institute of Technology Mathematical Statistics Gregg Waterman Oregon Institute of Technolog c Gregg Waterman This work is licensed under the Creative Commons Attribution. International license. The essence of the license is

More information

Probability Theory Review Reading Assignments

Probability Theory Review Reading Assignments Probability Theory Review Reading Assignments R. Duda, P. Hart, and D. Stork, Pattern Classification, John-Wiley, 2nd edition, 2001 (appendix A.4, hard-copy). "Everything I need to know about Probability"

More information

ES.182A Problem Section 11, Fall 2018 Solutions

ES.182A Problem Section 11, Fall 2018 Solutions Problem 25.1. (a) z = 2 2 + 2 (b) z = 2 2 ES.182A Problem Section 11, Fall 2018 Solutions Sketch the following quadratic surfaces. answer: The figure for part (a) (on the left) shows the z trace with =

More information

Lecture 14: Multivariate mgf s and chf s

Lecture 14: Multivariate mgf s and chf s Lecture 14: Multivariate mgf s and chf s Multivariate mgf and chf For an n-dimensional random vector X, its mgf is defined as M X (t) = E(e t X ), t R n and its chf is defined as φ X (t) = E(e ıt X ),

More information

Next is material on matrix rank. Please see the handout

Next is material on matrix rank. Please see the handout B90.330 / C.005 NOTES for Wednesday 0.APR.7 Suppose that the model is β + ε, but ε does not have the desired variance matrix. Say that ε is normal, but Var(ε) σ W. The form of W is W w 0 0 0 0 0 0 w 0

More information

Chapter 4 Multiple Random Variables

Chapter 4 Multiple Random Variables Review for the previous lecture Definition: n-dimensional random vector, joint pmf (pdf), marginal pmf (pdf) Theorem: How to calculate marginal pmf (pdf) given joint pmf (pdf) Example: How to calculate

More information

1.1 The Equations of Motion

1.1 The Equations of Motion 1.1 The Equations of Motion In Book I, balance of forces and moments acting on an component was enforced in order to ensure that the component was in equilibrium. Here, allowance is made for stresses which

More information

Business Analytics and Data Mining Modeling Using R Prof. Gaurav Dixit Department of Management Studies Indian Institute of Technology, Roorkee

Business Analytics and Data Mining Modeling Using R Prof. Gaurav Dixit Department of Management Studies Indian Institute of Technology, Roorkee Business Analytics and Data Mining Modeling Using R Prof. Gaurav Dixit Department of Management Studies Indian Institute of Technology, Roorkee Lecture - 04 Basic Statistics Part-1 (Refer Slide Time: 00:33)

More information

Review of probability

Review of probability Review of probability Computer Sciences 760 Spring 2014 http://pages.cs.wisc.edu/~dpage/cs760/ Goals for the lecture you should understand the following concepts definition of probability random variables

More information

Review of Probability Theory II

Review of Probability Theory II Review of Probability Theory II January 9-3, 008 Exectation If the samle sace Ω = {ω, ω,...} is countable and g is a real-valued function, then we define the exected value or the exectation of a function

More information

Business Statistics 41000: Homework # 2 Solutions

Business Statistics 41000: Homework # 2 Solutions Business Statistics 4000: Homework # 2 Solutions Drew Creal February 9, 204 Question #. Discrete Random Variables and Their Distributions (a) The probabilities have to sum to, which means that 0. + 0.3

More information

Advanced Econometrics II (Part 1)

Advanced Econometrics II (Part 1) Advanced Econometrics II (Part 1) Dr. Mehdi Hosseinkouchack Goethe University Frankfurt Summer 2016 osseinkouchack (Goethe University Frankfurt) Review Slides Summer 2016 1 / 22 Distribution For simlicity,

More information

Lecture 1.2 Units, Dimensions, Estimations 1. Units To measure a quantity in physics means to compare it with a standard. Since there are many

Lecture 1.2 Units, Dimensions, Estimations 1. Units To measure a quantity in physics means to compare it with a standard. Since there are many Lecture. Units, Dimensions, Estimations. Units To measure a quantity in hysics means to comare it with a standard. Since there are many different quantities in nature, it should be many standards for those

More information

1.2 Relations. 20 Relations and Functions

1.2 Relations. 20 Relations and Functions 0 Relations and Functions. Relations From one point of view, all of Precalculus can be thought of as studing sets of points in the plane. With the Cartesian Plane now fresh in our memor we can discuss

More information

Joint Distribution of Two or More Random Variables

Joint Distribution of Two or More Random Variables Joint Distribution of Two or More Random Variables Sometimes more than one measurement in the form of random variable is taken on each member of the sample space. In cases like this there will be a few

More information

San Francisco State University ECON 851 Summer Problem Set 1

San Francisco State University ECON 851 Summer Problem Set 1 San Francisco State University Michael Bar ECON 85 Summer 05 Problem Set. Suose that the wea reference relation % on L is transitive. Prove that the strict reference relation is transitive. Let A; B; C

More information

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA, 2016 MODULE 1 : Probability distributions Time allowed: Three hours Candidates should answer FIVE questions. All questions carry equal marks.

More information

Vector Basics. Lecture 1 Vector Basics

Vector Basics. Lecture 1 Vector Basics Lecture 1 Vector Basics Vector Basics We will be using vectors a lot in this course. Remember that vectors have both magnitude and direction e.g. a, You should know how to find the components of a vector

More information

Uncertainty and Parameter Space Analysis in Visualization -

Uncertainty and Parameter Space Analysis in Visualization - Uncertaint and Parameter Space Analsis in Visualiation - Session 4: Structural Uncertaint Analing the effect of uncertaint on the appearance of structures in scalar fields Rüdiger Westermann and Tobias

More information

Let X denote a random variable, and z = h(x) a function of x. Consider the

Let X denote a random variable, and z = h(x) a function of x. Consider the EE385 Class Notes 11/13/01 John Stensb Chapter 5 Moments and Conditional Statistics Let denote a random variable, and z = h(x) a function of x. Consider the transformation Z = h(). We saw that we could

More information

CONTINUOUS SPATIAL DATA ANALYSIS

CONTINUOUS SPATIAL DATA ANALYSIS CONTINUOUS SPATIAL DATA ANALSIS 1. Overview of Spatial Stochastic Processes The ke difference between continuous spatial data and point patterns is that there is now assumed to be a meaningful value, s

More information

4. Distributions of Functions of Random Variables

4. Distributions of Functions of Random Variables 4. Distributions of Functions of Random Variables Setup: Consider as given the joint distribution of X 1,..., X n (i.e. consider as given f X1,...,X n and F X1,...,X n ) Consider k functions g 1 : R n

More information