Part I: Background on the Binomial Distribution

Similar documents
Econometric Methods. Review of Estimation

Chapter 14 Logistic Regression Models

Parameter, Statistic and Random Samples

Bayes (Naïve or not) Classifiers: Generative Approach

Generative classification models

CS 2750 Machine Learning Lecture 5. Density estimation. Density estimation

Continuous Distributions

Maximum Likelihood Estimation

Lecture 7. Confidence Intervals and Hypothesis Tests in the Simple CLR Model

Special Instructions / Useful Data

( ) = ( ) ( ) Chapter 13 Asymptotic Theory and Stochastic Regressors. Stochastic regressors model

Multivariate Transformation of Variables and Maximum Likelihood Estimation

STK4011 and STK9011 Autumn 2016

STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. x, where. = y - ˆ " 1

Ordinary Least Squares Regression. Simple Regression. Algebra and Assumptions.

Law of Large Numbers

Objectives of Multiple Regression

Chapter 3 Sampling For Proportions and Percentages

Statistical modelling and latent variables (2)

Simulation Output Analysis

Overview. Basic concepts of Bayesian learning. Most probable model given data Coin tosses Linear regression Logistic regression

STA302/1001-Fall 2008 Midterm Test October 21, 2008

STK3100 and STK4100 Autumn 2017

Chapter 5 Properties of a Random Sample

Lecture 3. Sampling, sampling distributions, and parameter estimation

Chapter 8: Statistical Analysis of Simulated Data

9.1 Introduction to the probit and logit models

X ε ) = 0, or equivalently, lim

ENGI 4421 Propagation of Error Page 8-01

Chapter 2 Supplemental Text Material

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS

STK3100 and STK4100 Autumn 2018

Simple Linear Regression

Point Estimation: definition of estimators

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS

ENGI 3423 Simple Linear Regression Page 12-01

The number of observed cases The number of parameters. ith case of the dichotomous dependent variable. the ith case of the jth parameter

{ }{ ( )} (, ) = ( ) ( ) ( ) Chapter 14 Exercises in Sampling Theory. Exercise 1 (Simple random sampling): Solution:

Probability and Statistics. What is probability? What is statistics?

Logistic regression (continued)

i 2 σ ) i = 1,2,...,n , and = 3.01 = 4.01

å 1 13 Practice Final Examination Solutions - = CS109 Dec 5, 2018

Estimation of the Loss and Risk Functions of Parameter of Maxwell Distribution

Random Variables and Probability Distributions

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE

Parametric Density Estimation: Bayesian Estimation. Naïve Bayes Classifier

Classification : Logistic regression. Generative classification model.

Unsupervised Learning and Other Neural Networks

TESTS BASED ON MAXIMUM LIKELIHOOD

Summary of the lecture in Biostatistics

CS 3710 Advanced Topics in AI Lecture 17. Density estimation. CS 3710 Probabilistic graphical models. Administration

CS 2750 Machine Learning. Lecture 8. Linear regression. CS 2750 Machine Learning. Linear regression. is a linear combination of input components x

Chapter 13, Part A Analysis of Variance and Experimental Design. Introduction to Analysis of Variance. Introduction to Analysis of Variance

Line Fitting and Regression

Lecture 02: Bounding tail distributions of a random variable

Lecture Notes Types of economic variables

THE ROYAL STATISTICAL SOCIETY 2009 EXAMINATIONS SOLUTIONS GRADUATE DIPLOMA MODULAR FORMAT MODULE 2 STATISTICAL INFERENCE

Median as a Weighted Arithmetic Mean of All Sample Observations

CHAPTER 3 POSTERIOR DISTRIBUTIONS

hp calculators HP 30S Statistics Averages and Standard Deviations Average and Standard Deviation Practice Finding Averages and Standard Deviations

THE ROYAL STATISTICAL SOCIETY 2010 EXAMINATIONS SOLUTIONS GRADUATE DIPLOMA MODULE 2 STATISTICAL INFERENCE

: At least two means differ SST

Introduction to local (nonparametric) density estimation. methods

THE ROYAL STATISTICAL SOCIETY 2016 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE MODULE 5

Discrete Mathematics and Probability Theory Fall 2016 Seshia and Walrand DIS 10b

best estimate (mean) for X uncertainty or error in the measurement (systematic, random or statistical) best

Linear Regression Linear Regression with Shrinkage. Some slides are due to Tommi Jaakkola, MIT AI Lab

Part 4b Asymptotic Results for MRR2 using PRESS. Recall that the PRESS statistic is a special type of cross validation procedure (see Allen (1971))

Study of Correlation using Bayes Approach under bivariate Distributions

Third handout: On the Gini Index

Linear Regression with One Regressor

Lecture Notes 2. The ability to manipulate matrices is critical in economics.

Module 7. Lecture 7: Statistical parameter estimation

Bayes Estimator for Exponential Distribution with Extension of Jeffery Prior Information

Bayesian Classification. CS690L Data Mining: Classification(2) Bayesian Theorem: Basics. Bayesian Theorem. Training dataset. Naïve Bayes Classifier

Chapter 5 Transformation and Weighting to Correct Model Inadequacies

Class 13,14 June 17, 19, 2015

7. Joint Distributions

LINEAR REGRESSION ANALYSIS

Qualifying Exam Statistical Theory Problem Solutions August 2005

Construction and Evaluation of Actuarial Models. Rajapaksha Premarathna

Recall MLR 5 Homskedasticity error u has the same variance given any values of the explanatory variables Var(u x1,...,xk) = 2 or E(UU ) = 2 I

Section 2 Notes. Elizabeth Stone and Charles Wang. January 15, Expectation and Conditional Expectation of a Random Variable.

Mean is only appropriate for interval or ratio scales, not ordinal or nominal.

Likelihood Ratio, Wald, and Lagrange Multiplier (Score) Tests. Soccer Goals in European Premier Leagues

THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA

SMALL AREA ESTIMATION USING NATURAL EXPONENTIAL FAMILIES WITH QUADRATIC VARIANCE FUNCTION (NEF-QVF) FOR BINARY DATA 1

Multiple Regression. More than 2 variables! Grade on Final. Multiple Regression 11/21/2012. Exam 2 Grades. Exam 2 Re-grades

Example: Multiple linear regression. Least squares regression. Repetition: Simple linear regression. Tron Anders Moger

To use adaptive cluster sampling we must first make some definitions of the sampling universe:

CHAPTER VI Statistical Analysis of Experimental Data

ρ < 1 be five real numbers. The

Random Variables. ECE 313 Probability with Engineering Applications Lecture 8 Professor Ravi K. Iyer University of Illinois

GOALS The Samples Why Sample the Population? What is a Probability Sample? Four Most Commonly Used Probability Sampling Methods

THE ROYAL STATISTICAL SOCIETY 2016 EXAMINATIONS SOLUTIONS GRADUATE DIPLOMA MODULE 2

Chapter 13 Student Lecture Notes 13-1

Chapter 4 Multiple Random Variables

STA 105-M BASIC STATISTICS (This is a multiple choice paper.)

Bayesian belief networks

9 U-STATISTICS. Eh =(m!) 1 Eh(X (1),..., X (m ) ) i.i.d

Transcription:

Part I: Bacgroud o the Bomal Dstrbuto A radom varable s sad to have a Beroull dstrbuto f t taes o the value wth probablt "p" ad the value wth probablt " - p". The umber of "successes" "" depedet Beroull trals s sad to have a bomal dstrbuto. The probablt mass fucto for the bomal dstrbuto s: probablt(observg successes, p) = choose(, ) * p^ * (-p)^(-) Let's cosder a cocrete example. Suppose we wat to ow the probablt of observg "" heads "" tosses of a co, where the probablt of ladg o heads s "p" = /2. The co tosses are depedet, because the outcome for the prevous co toss does ot affect the outcome for the ext co toss. The co tosses are detcall dstrbuted, because the probablt of ladg o heads s the same for all co tosses. There are eght possble outcomes: tals, tals, tals: probablt of ths outcome = ( - p) * ( - p) * ( - p) = /8 tals, tals, heads: probablt of ths outcome = ( - p) * ( - p) * p = /8 tals, heads, tals: probablt of ths outcome = ( - p) * p * ( - p) = /8 tals, heads, heads: probablt of ths outcome = ( - p) * p * p = /8 heads, tals, tals: probablt of ths outcome = p * ( - p) * ( - p) = /8 heads, tals, heads: probablt of ths outcome = p * ( - p) * p = /8 heads, heads, tals: probablt of ths outcome = p * p * ( - p) = /8 heads, heads, heads: probablt of ths outcome = p * p * p = /8

Part II: Logstc Regresso Usg Smulated Data As ou ma recall, the bomal dstrbuto s our source of ucertat for logstc regresso. Ths s aalogous to usg the Gaussa (ormal) dstrbuto as the source of ucertat for lear regresso.

Part III: Estmatg a Bomal Proporto Suppose we have a set of Beroull outcomes, ad we wat to estmate the bomal proporto from data (.e. we wat to estmate "p"). We ca geerate a "maxmum lelhood estmate" for the bomal proporto b settg the dervatve of the log lelhood to zero ad solvg for "p". probablt, so tag the log of both sdes gves us log, log probablt log log log ad tag the dervatve wth respect to theta gves us log probablt, log log log log log log ad settg the dervatve equal to zero ad solvg for theta gves us

Ths estmate s sometmes called a "frequetst" estmate of the bomal proporto. The problem wth ths estmate occurs whe the actual bomal proporto s ether ver small or ver large, or whe s small. Suppose the actual bomal proporto s p =. ad we ol have observatos. The expected umber of "successes" ( * p = *. =.) s less tha oe. Ths meas our frequetst estmate s lel to be, eve though the actual bomal proporto s.. Ths ca cause trouble a doma where observed probabltes ca be ver small or ver large; e.g. whe estmatg clc-through rates for ole advertsg or purchase probabltes for tems the "tal" of a ecommerce merchat's vetor. We would le to avod degeerate estmates that express the certat of ether zero or oe. Baesa estmato ca be used to adjust the frequetst estmate. Recall that posteror = pror * lelhood / evdece. pror lelhood p, /,,, dbeta probablt evdece p,,, posteror p,,, / d / / d /,, dbeta

The expected value (mea) of the posteror dstrbuto s smpl ( + alpha ) / ( + alpha + (-) + alpha ). We are essetall adjustg the frequetst estmate b addg alpha ad alpha pseudo couts to the ad - observed couts. Usg the posteror mea for our estmate of the bomal proporto s the same as usg a weghted mea of ( / ) ad (alpha / (alpha + alpha )), so we ll wat to eep the pseudo couts relatvel small to allow the observed couts to overtae the pseudo couts relatvel qucl ˆ We re usg a Baesa pror to avod the certat of usg ether zero or oe for our estmate of the bomal proporto. I practce, fols ofte use ether alpha = alpha = (a Laplace pror) or alpha = alpha =.5 (a Jeffres pror). We ca obta a 95% cofdece terval for the bomal proporto wth the followg R commad qbeta(c(.25,.975), + alpha., - + alpha.) Note: the R bom.test() commad uses the followg slghtl more coservatve (larger) terval c(qbeta(.25,, - + ), qbeta(.975, +, - ))