STA 114: Statistics. Notes 2. Statistical Models and the Likelihood Function

Similar documents
Lecture 2-1 Kinematics in One Dimension Displacement, Velocity and Acceleration Everything in the world is moving. Nothing stays still.

20. Applications of the Genetic-Drift Model

Vehicle Arrival Models : Headway

KINEMATICS IN ONE DIMENSION

5. Stochastic processes (1)

On Measuring Pro-Poor Growth. 1. On Various Ways of Measuring Pro-Poor Growth: A Short Review of the Literature

Math 2142 Exam 1 Review Problems. x 2 + f (0) 3! for the 3rd Taylor polynomial at x = 0. To calculate the various quantities:

5.1 - Logarithms and Their Properties

Lecture Notes 2. The Hilbert Space Approach to Time Series

Comparing Means: t-tests for One Sample & Two Related Samples

Macroeconomic Theory Ph.D. Qualifying Examination Fall 2005 ANSWER EACH PART IN A SEPARATE BLUE BOOK. PART ONE: ANSWER IN BOOK 1 WEIGHT 1/3

Diebold, Chapter 7. Francis X. Diebold, Elements of Forecasting, 4th Edition (Mason, Ohio: Cengage Learning, 2006). Chapter 7. Characterizing Cycles

Matlab and Python programming: how to get started

Biol. 356 Lab 8. Mortality, Recruitment, and Migration Rates

Fishing limits and the Logistic Equation. 1

Chapter 2. Models, Censoring, and Likelihood for Failure-Time Data

Explaining Total Factor Productivity. Ulrich Kohli University of Geneva December 2015

Time series Decomposition method

Lab #2: Kinematics in 1-Dimension

Physics 20 Lesson 5 Graphical Analysis Acceleration

The Arcsine Distribution

Guest Lectures for Dr. MacFarlane s EE3350 Part Deux

t is a basis for the solution space to this system, then the matrix having these solutions as columns, t x 1 t, x 2 t,... x n t x 2 t...

Stochastic models and their distributions

4.1 - Logarithms and Their Properties

Random Processes 1/24

State-Space Models. Initialization, Estimation and Smoothing of the Kalman Filter

72 Calculus and Structures

Chapter 15. Time Series: Descriptive Analyses, Models, and Forecasting

Kinematics Vocabulary. Kinematics and One Dimensional Motion. Position. Coordinate System in One Dimension. Kinema means movement 8.

A Dynamic Model of Economic Fluctuations

FITTING EQUATIONS TO DATA

Phys1112: DC and RC circuits

Exponential Smoothing

SMT 2014 Calculus Test Solutions February 15, 2014 = 3 5 = 15.

2. Nonlinear Conservation Law Equations

Physics 127b: Statistical Mechanics. Fokker-Planck Equation. Time Evolution

Problem Set 5. Graduate Macro II, Spring 2017 The University of Notre Dame Professor Sims

Problem 1 / 25 Problem 2 / 20 Problem 3 / 10 Problem 4 / 15 Problem 5 / 30 TOTAL / 100

Predator - Prey Model Trajectories and the nonlinear conservation law

Bias in Conditional and Unconditional Fixed Effects Logit Estimation: a Correction * Tom Coupé

Final Spring 2007

Sub Module 2.6. Measurement of transient temperature

Nature Neuroscience: doi: /nn Supplementary Figure 1. Spike-count autocorrelations in time.

23.2. Representing Periodic Functions by Fourier Series. Introduction. Prerequisites. Learning Outcomes

Semi-Competing Risks on A Trivariate Weibull Survival Model

Two Coupled Oscillators / Normal Modes

Echocardiography Project and Finite Fourier Series

Decimal moved after first digit = 4.6 x Decimal moves five places left SCIENTIFIC > POSITIONAL. a) g) 5.31 x b) 0.

Week 1 Lecture 2 Problems 2, 5. What if something oscillates with no obvious spring? What is ω? (problem set problem)

Econ107 Applied Econometrics Topic 7: Multicollinearity (Studenmund, Chapter 8)

Final Exam. Tuesday, December hours

The Contradiction within Equations of Motion with Constant Acceleration

1 Review of Zero-Sum Games

The average rate of change between two points on a function is d t

Calculus Tricks #1. So if you understand derivatives, you ll understand the course material much better. a few preliminaries exponents

Some Basic Information about M-S-D Systems

(π 3)k. f(t) = 1 π 3 sin(t)

Innova Junior College H2 Mathematics JC2 Preliminary Examinations Paper 2 Solutions 0 (*)

Mathcad Lecture #8 In-class Worksheet Curve Fitting and Interpolation

ACE 562 Fall Lecture 5: The Simple Linear Regression Model: Sampling Properties of the Least Squares Estimators. by Professor Scott H.

Analyze patterns and relationships. 3. Generate two numerical patterns using AC

Ground Rules. PC1221 Fundamentals of Physics I. Kinematics. Position. Lectures 3 and 4 Motion in One Dimension. A/Prof Tay Seng Chuan

WEEK-3 Recitation PHYS 131. of the projectile s velocity remains constant throughout the motion, since the acceleration a x

A Bayesian Approach to Spectral Analysis

Introduction to Probability and Statistics Slides 4 Chapter 4

Inventory Analysis and Management. Multi-Period Stochastic Models: Optimality of (s, S) Policy for K-Convex Objective Functions

2.7. Some common engineering functions. Introduction. Prerequisites. Learning Outcomes

Physics 180A Fall 2008 Test points. Provide the best answer to the following questions and problems. Watch your sig figs.

Zürich. ETH Master Course: L Autonomous Mobile Robots Localization II

Traveling Waves. Chapter Introduction

Today: Graphing. Note: I hope this joke will be funnier (or at least make you roll your eyes and say ugh ) after class. v (miles per hour ) Time

Types of Exponential Smoothing Methods. Simple Exponential Smoothing. Simple Exponential Smoothing

Section 3.5 Nonhomogeneous Equations; Method of Undetermined Coefficients

MASSACHUSETTS INSTITUTE OF TECHNOLOGY Department of Physics Spring Experiment 9: Faraday s Law of Induction

Math 106: Review for Final Exam, Part II. (x x 0 ) 2 = !

Exponential Weighted Moving Average (EWMA) Chart Under The Assumption of Moderateness And Its 3 Control Limits

Inequality measures for intersecting Lorenz curves: an alternative weak ordering

Kriging Models Predicting Atrazine Concentrations in Surface Water Draining Agricultural Watersheds

Summer Term Albert-Ludwigs-Universität Freiburg Empirische Forschung und Okonometrie. Time Series Analysis

Unit Root Time Series. Univariate random walk

Math 10B: Mock Mid II. April 13, 2016

Sections 2.2 & 2.3 Limit of a Function and Limit Laws

Solutions to Odd Number Exercises in Chapter 6

A First Course on Kinetics and Reaction Engineering. Class 19 on Unit 18

( ) a system of differential equations with continuous parametrization ( T = R + These look like, respectively:

Phys 221 Fall Chapter 2. Motion in One Dimension. 2014, 2005 A. Dzyubenko Brooks/Cole

Notes for Lecture 17-18

LabQuest 24. Capacitors

T L. t=1. Proof of Lemma 1. Using the marginal cost accounting in Equation(4) and standard arguments. t )+Π RB. t )+K 1(Q RB

The Simple Linear Regression Model: Reporting the Results and Choosing the Functional Form

Physics for Scientists & Engineers 2

Article from. Predictive Analytics and Futurism. July 2016 Issue 13

Financial Econometrics Jeffrey R. Russell Midterm Winter 2009 SOLUTIONS

Answers to QUIZ

) were both constant and we brought them from under the integral.

Dynamic Econometric Models: Y t = + 0 X t + 1 X t X t k X t-k + e t. A. Autoregressive Model:

Bernoulli numbers. Francesco Chiatti, Matteo Pintonello. December 5, 2016

R t. C t P t. + u t. C t = αp t + βr t + v t. + β + w t

- Graphing: Position Velocity. Acceleration

Transcription:

STA 114: Saisics Noes 2. Saisical Models and he Likelihood Funcion Describing Daa & Saisical Models A physicis has a heory ha makes a precise predicion of wha s o be observed in daa. If he daa doesn mach he predicion, hen he heory is falsified. A saisician only has an imprecise descripion. This could be eiher because he heory is imprecise, or because random errors are inroduced in collecing he daa, or a combinaion of he wo. Therefore a saisician s daa, from he perspecive of her heory + daa collecion mehod, is an uncerain quaniy X. Any uncerain quaniy can be bes described by a se of values S he quaniy may assume, wih a pdf/pmf f() on S. The pdf/pmf is o be inerpreed as follows: f( 1 )/f( 2 ) = r means ha X = 1 is r-imes as plausible as X = 2. If he daa can be described by a single pmf/pdf hen here is no need of saisical analysis. Saisics is needed when a muliude of compeing heories lead o a muliude of pmfs/pdfs. When all hese pmfs/pdfs are colleced ogeher, we have a saisical model for our analysis. If θ denoes he quaniy by which he consiuen pmfs/pdfs of he model differ from each oher, hen we can wrie each pmf/pdf as f( θ). The quaniy θ is a parameer of his model. The se Θ of all possible values of θ is called he parameer space of he model. Eample (Opinion Poll). Take for eample a sudy where one wans o know wha percenage of sudens in a cerain universiy are in favor of a recen governmen policy. For a large universiy, soliciing every suden s opinion is impossible. The researcher may wan o draw a random lis of n = 5 sudens and quiz hem on heir opinion regarding he policy. A random lis gives he bes chance of guarding agains sysemaic biases in obaining a represenaive sample of sudens. The daa here is he number X of sudens in he sample who are in favor. If he researcher hinks ha a fracion p of he sudens, among a oal of N universiy sudens are in favor of he policy, hen X can be described as hyper-geomeric pmf f( p) given by { ( m )( N m n ) for =, 1, 2,, min(n, m) f( p) = ( N n) oherwise where m = Np is he oal number of sudens in he universiy who are in favor of he policy. The fracion p represens he researcher s heory abou he populariy of he policy among college sudens. If she considers all possibiliies p 1, hen here saisical model for X is {f( p) : p [, 1]} wih f( p) given as above. 1

HypGeo Binom f( p)..2.4 f( p)..2.4 4 4 Figure 1: X = number of sudens favoring he policy in a sample of 5 sudens. Descripion of X under hypergeomeric (lef) and binomial disribuions (righ) for hree possible values of p =.25,.5,.75. When N is very large compared o n, we can also represen X by he binomial pmf { ( n ) p (1 p) n for =, 1, 2,, n f( p) = oherwise Now he researcher s model is {f( p) : p [, 1]} wih f( p) given by he binomial pmf above. Figure 1 below shows wha he researcher epecs o see as daa X under he hypergeomeric or he binomial disribuion for hree possible values of p, namely, p = 1/4 (solid line), p = 1/2 (broken line) and p = 3/4 (doed line). Eample (Trend of TC couns). A climae researcher wans o sudy wheher hurricane aciviy is inensifying wih ime. One way o do i is o sudy he annual couns of ropical cyclones (TC) in an ocean basin, say he norh Alanic basin, for he pas 1 years. The daa is hen of he form X = (X 1, X 2,, X 1 ), wih X giving he TC coun in year. To describe his daa, we can firs focus on describing one X. Since X is a coun, we can describe i by a Poisson pmf: { e µ µ f ( µ ) =! for =, 1, 2, oherwise where µ represens he epeced coun for year. Now o describe, X = (X 1, X 2,, X 1 ) we can rea he componen X s as independen and wrie f( {µ }) = f 1 ( 1 µ 1 ) f 2 ( 2 µ 2 ) f 1 ( 1 µ 1 ) which gives he join pmf of X a = ( 1, 2,, 1 ). Alhough he above gives a descripion of X, i is no clear how o sudy he climae researcher s quesion wihin his framework. To achieve his, we now need o say somehing 2

4 6 8 1 5 1 3 4 5 1 3 4 5 4 3 1 4 6 8 1 4 6 8 1 Figure 2: X = annual TC couns for 1 consecuive years. Descripion of X under Poisson disribuions wih mean µ in year. Three possible linear specificaions µ = µ + β( 1) are considered. abou how he differen µ compare o each oher, and in paricular, how hey evolve over ime. One possible descripion is he following: µ = α + β( 1), = 1, 2,, 1 which says ha he epeced annual couns are increasing linearly in ime, wih slope β. The research quesion of wheher TC aciviy is increasing can now be represened by various values of (α, β). In paricular, a posiive sign of β means ha TC couns have an upward rend, wih larger β indicaing faser growh. On he oher hand, a zero or a negaive value of β indicaes no or downward rend. Therefore a saisical model for X is given by {f ( µ, β) : α (a, b), β (c, d)} for well chosen limis a, b, c, d, where f ( α, β) = f1 (1 α) f2 (2 α + β) f1 (1 α + 99β). Figure 2 shows he descripion of X under hree choices of (α, β): (,.2), (1,.25) and (, ). Noe ha unlike he previous eample, he he choice of model for his eample was a lo less obvious. Indeed, one could use many disribuions, insead of a Poisson pmf, o describe each X. Furhermore, he evoluion of µ over ime, could also be described in many differen ways. Wha we have buil here is a descripion of he daa, wheher here is a beer descripion can always be debaed. The Likelihood Funcion Suppose a saisical model {f ( θ) : θ Θ} has been consruced for daa X, wih each θ represening a differen heory. When we observed daa X =, we can compare wo parameer values (i.e., wo heories) θ = θ1 and θ = θ2 by looking a he raio f ( θ1 )/f ( θ2 ). If his raio equals 2, hen he daa X = is wice as likely o be observed under θ = θ1 han i is under θ = θ2. Such comparisons can be done based on he likelihood funcion L (θ) := f ( θ), θ Θ. 3

Likelihood..2 Log-likelihood -1-6 -..4.8 p..4.8 p Figure 3: Likelihood and log-likelihood funcions in he opinion poll eample. The observed daa is X =. Noe ha L (θ) is a funcion over he variable θ aking values in he se Θ. For all echnical purposes, one can work wih L (θ) in he log-scale. Tha is, define he log-likelihood funcion l (θ) = log L (θ) = log f( θ). Log-scale comparisons beween heories are hen done by differences l (θ 1 ) l (θ 2 ). Eample (Opinion Poll, Cond). For he opinion poll eample wih he saisical model {Binomial(n, p) : p [.1]}, he likelihood funcion in he parameer p is given by ( ) n L (p) = p (1 p) n, p [, 1] and he log-likelihood funcion is l (p) = log L (p) = log ( ) n + log p + (n ) log(1 p), p [, 1] Noe ha he firs erm on he righ hand side does no involve he funcion argumen p. So we can wrie l (p) = cons + log p + (n ) log(1 p), no caring abou he eac value of his addiive consan. Indeed, he consan disappears when we look a differences l (p 1 ) l (p 2 ). For daa X = he heories p =.25, p =.5 and p =.75 receive likelihood scores 6.45 1 14, 1.54 1 6 and 1.25 1 61. Figure 3 shows he likelihood funcion L (p) and he log-likelihood funcion l (p) over he grid p {.,.1,, 1.}. These funcions indicae ha heories wih p close o.4 fare well in eplaining he daa X =. The heory p =.4 eplains he daa nearly 1 8 imes beer han he heory p =.8. 4

1 3 4 5 6 1 -.5.5.15.25 5 1 15 Figure 4: TC couns beween 198 and 7 and he corresponding log-likelihood funcion shown as a conour. Eample (TC couns, Cond.). For he saisical model we discussed before, he loglikelihood funcion is given by: 1 1 l (α, β) = cons (α + β( 1)) + log(α + β( 1)). =1 Figure 4 shows he observed annual TC couns beween 198 and 7 (on he lef superimposed on f( 7,.5)). A conour plo of he he log-likelihood funcion over (α, β) is shown on he righ. Posiive slope values (β > ) fare beer in eplaining he daa han negaive slopes. A Word of Cauion The likelihood funcion gives a numerical comparison of he posulaed heories once daa X = is observed. Bu be clear on wha L (θ 1 )/L (θ 2 ) = 2 means. I does NOT mean ha given he observed daa, heory θ 1 is wice more likely han heory θ 1. We don ye have a plaform for discussing likeliness or relaive plausibiliy of heories. Formalizing his concep is he focus of saisical inference. =1 5