Spatio-Temporal Modelling of Credit Default Data
|
|
- Emery Thornton
- 5 years ago
- Views:
Transcription
1 1/20 Spatio-Temporal Modelling of Credit Default Data Sathyanarayan Anand Advisor: Prof. Robert Stine The Wharton School, University of Pennsylvania April 29, 2011
2 2/20 Outline 1 Background 2 Conditional Autoregressive (CAR) Model 3 Results on Credit Default Data 4 Simulation Based Analysis 5 Work In Progress
3 3/20 Problem Statement & Data Description To predict credit default rates aggregated over some geographical region, and understand the underlying spatial and temporal structures
4 3/20 Problem Statement & Data Description To predict credit default rates aggregated over some geographical region, and understand the underlying spatial and temporal structures Quarterly Data from the Fed County, state & national levels 71 quarters from 1992 to 2009 County Level Default rates of many different kinds & time limits. Ex: mortgage, revolving, etc. Economic indicators, such as poverty, income, unemployment, etc. Time series 3000 x 71 per default rate; plus covariates State & National Levels All the variables at county level; plus many more rates & economic indicators State level 50 x 71, National level 1 x 71 per default rate; plus covariates
5 4/20 Dependencies in Default Rates REPB60M Medians (Smoothed) 1.0 REPB60M Mississippi Georgia North Carolina National California Minnesota Correlation CreditType Installment 60-Day Mortgage 60-Day Revolving 60-Day time Time
6 5/20 Issues to Think About Revolving Default Rate (2006.1) REPB60M Spatial Structure? Markovian? How many layers to use? Geographic distance? Or in covariate space? Outliers? Overfitting and predictive power Bob s motto: regression work s best
7 6/20 A Naive Model We focus on 60-day Revolving Debt Default Rate for Pennsylvania from here on Let p st be the default rate for location s at time t
8 6/20 A Naive Model We focus on 60-day Revolving Debt Default Rate for Pennsylvania from here on Let p st be the default rate for location s at time t A simple OLS model can be written as logit(p st) = ψ + φ s + α t + J β jt x jst + ɛ st j=1 iid ( ) ɛ st N 0, σ 2 e Fixed effects, φ s for space and α t for time For each covariate, one slope coefficient β jt across space at a given time t No spatial structure captured Why logit? keeps predicted default rates to between 0 and 1 Is it actually logit?
9 7/20 Is It Logit? Inverse response plot from the logit model looks linear (Simon Sheather) Fitted Observed
10 8/20 CAR Specification The CAR model is based on the logit model previously specified. logit(p st) N ( µ st, σs 2 ) ; s = 1, 2,..., S J µ st = ψ + φ s + α t + β jt x jst j=1 T s.t. α t = 0 and S φ s = 0 t=1 s=1
11 8/20 CAR Specification The CAR model is based on the logit model previously specified. logit(p st) N ( µ st, σ 2 s ) ; s = 1, 2,..., S µ st = ψ + φ s + α t + s.t. T α t = 0 t=1 Spatial and temporal prior distributions: J β jt x jst j=1 and S φ s = 0 s=1 φ i φ i N ρ c w ij σc 2 φ j, w j i+ w i+ α i α i N ρ τ v ij στ 2 α j, v j i+ v i+
12 9/20 Interpreting the CAR Model φ i φ i N ( ρ c j w ij w i+ φ j, σ 2 c w i+ ) w ij = 1 if nodes i and j are neighbors, and 0 otherwise Node mean is a scaled average of its neighbors values Node variance is inversly proportional to the number of neighbors
13 9/20 Interpreting the CAR Model φ i φ i N ( ρ c j w ij w i+ φ j, σ 2 c w i+ ) w ij = 1 if nodes i and j are neighbors, and 0 otherwise Node mean is a scaled average of its neighbors values Node variance is inversly proportional to the number of neighbors Let W be the adjacency matrix of the graph. Then, φ N S ( 0, σ 2 c (E w ρ c W ) 1) where E w = diagonal matrix of row sums of W Note: the covariance matrix is invertible if ρ c (λ 1 n, λ 1 1 ), where λn (negative) and λ 1 (positive) are the minimum and maximum eigenvalues of W.
14 10/20 Estimating the CAR Model We derive the CAR prior specification under the zero-sum constraint to get w ij φ j σc φ i ψ, φ i N ρ 2 c + ψ(ρ c 1), w i+ j i These terms are now contrasts from the global intercept ψ, with φ 1 = 0. w i+
15 10/20 Estimating the CAR Model We derive the CAR prior specification under the zero-sum constraint to get w ij φ j σc φ i ψ, φ i N ρ 2 c + ψ(ρ c 1), w i+ j i These terms are now contrasts from the global intercept ψ, with φ 1 = 0. Estimation notes: Estimate models separately for each time point t and predict for t + 1 Each spatial point s gets its own variance term, σ 2 s Each covariate j gets its own variance, σ 2 jb CAR scaling parameter ρ c is fixed to its highest permissible value i.e. λ 1 1 This is the commonly used method in other papers Adaptive rejection metropolis sampling proposed but computationally intensive (Olivera 2008) Solution: good old friend, MLE (in a few slides, time permitting) w i+ All posteriors have closed form expressions Gibbs sampling Number of parameters to be estimated O(S + J T ), for S spatial locations, J covariates, and T time points
16 11/20 Prediction of Default Rates: RMSEs CAR Model Blue: Fitted RMSE, Pink: Predicted RMSE Logit Model Red: Fitted RMSE, Green: Predicted RMSE Model RMSE 0.3 Logit Fitted Logit Predicted CAR Fitted CAR Predicted TimePoint
17 12/20 Simulation Experiments With real data, we can compare different models but no baseline truths Understand how the CAR model behaves under lower signal to noise ratios Full scale simulated datasets of Pennsylvania 67 counties with the same neighborhood structure Covariates (income, unemployment, etc.) kept the same Generate slopes for covariates, one for each time point Generate the spatial and temporal CAR-based intercepts Add in noise Noise levels 2-5 times higher than the actual dataset Both models estimated in the same fashion
18 Quarter 13/20 Estimation for a Random Time Point Estimated & True Covariate Slopes Beta
19 Quarter Quarter Estimation for a Random Time Point Estimated & True Covariate Slopes Beta Estimated & True α t Alpha 13/20
20 Quarter Quarter Estimation for a Random Time Point Estimated & True Covariate Slopes Estimated vs. True φ s Beta True Phi Estimated & True α t Alpha Estimated Phi 13/20
21 Quarter Quarter Actual Estimation for a Random Time Point Estimated & True Covariate Slopes Estimated vs. True φ s Beta True Phi Estimated & True α t Alpha Estimated Phi Predicted vs. Actual Response Predicted 13/20
22 14/20 Prediction of Default Rates: RMSEs RMSE Knowing True Model: Pink CAR Model Dark Green: Fitted RMSE, Blue: Predicted RMSE Logit Model Red: Fitted RMSE, Olive: Predicted RMSE Model Logit Fitted RMSE 2.5 Logit Predicted CAR Fitted CAR Predicted True TimePoint Predictive performace is much worse than the RMSE of fitted values
23 Type Predicted True /20 Predicting the Temporal CAR Parameter The temporal CAR term is specified as ( α t α t N ρ τ v tu α u, v u t+ When predicting for time T + 1, the only neighbor is time T, and hence ˆα T +1 = ρ c α T Compare predicted ˆα T +1 from model built for time T, and fitted α from model built using the entire dataset σ 2 τ v t+ ) Alpha Absolute Difference Period Period
24 16/20 Understanding the Temporal CAR Parameter 3.5 Blue Bars: Alpha Difference RMSE Model a CAR Fitted a CAR Predicted a True TimePoint Difference in α explains about 64% of the difference between fitted and predicted RMSEs Need a predictive model for the temporal intercepts on top of the CAR structure
25 17/20 Work In Progress: MLE for CAR Covariance Let X i N S (0, Σ) ; i = 1, 2,..., T The log-likelihood is given by log(l(σ Y )) = C T 2 log(det(σ)) 1 2 T i=1 X i Σ 1 X i Under the CAR structure, Σ = σ 2 (E w ρw ) 1, and log-likehood reduces to log(l(σ 2, ρ Y )) = C+ T 2 log(det(ew ρw )) ST 2 log(σ2 ) 1 tr((ew ρw )V ) 2σ2 where V = T equations. i=1 X i X i. Solving for the MLE, we get the following set of tr ( ) (ˆσ 2 T (E w ˆρW ) 1 V ) W = 0 ˆσ 2 = 1 tr ((Ew ˆρW ) V ) ST
26 18/20 Work In Progress: Utilizing the Spatial CAR Structure Spatial structure is useful if the future can be partially observed Knowing future values for k counties, update predictions of remaining counties Plots: X-axis = percentage of known values, Y-axis = updated RMSE TimePoint 1.6 RMSE 2.0 TP21 TP35 TP43 TP61 RMSE TimePoint TP33 TP59 TP Peek Peek Seems to work for most time points, but no improvement for some Could be the basis for a test of changing spatial structure
27 19/20 Work In Progress MLE for CAR covariance structure Likelihood ratio test for CAR structure with power calculations under two alternative spatial structures, robustness to the multivariate t distribution, and scalability with graph size Predictive layers on the CAR model Compare to Bob and Dean s α-investing procedure Spatial clustering of counties using Hierarchical DP Summary statistic for amount of spatial correlation : extending Geary s C to spatio-temporal data A second application: Understanding patient access to care in the face of reduction or closure of hospital service lines Extension of previous work, The Effect of California s Safe-Staffing Law (AB394) on Hospital Behavior, to be submitted to the Journal of Health Economics, May With Emi Terasawa and Scott Harrington, Healthcare Management Department
28 20/20 Thank You
Spatial Temporal Models for Retail Credit
Spatial Temporal Models for Retail Credit Bob Stine The School, University of Pennsylvania stat.wharton.upenn.edu/~stine Introduction Exploratory analysis Trends and maps Outline Measuring spatial association
More informationMulti-level Models: Idea
Review of 140.656 Review Introduction to multi-level models The two-stage normal-normal model Two-stage linear models with random effects Three-stage linear models Two-stage logistic regression with random
More informationFrailty Modeling for Spatially Correlated Survival Data, with Application to Infant Mortality in Minnesota By: Sudipto Banerjee, Mela. P.
Frailty Modeling for Spatially Correlated Survival Data, with Application to Infant Mortality in Minnesota By: Sudipto Banerjee, Melanie M. Wall, Bradley P. Carlin November 24, 2014 Outlines of the talk
More informationECE521 week 3: 23/26 January 2017
ECE521 week 3: 23/26 January 2017 Outline Probabilistic interpretation of linear regression - Maximum likelihood estimation (MLE) - Maximum a posteriori (MAP) estimation Bias-variance trade-off Linear
More informationSpatial inference. Spatial inference. Accounting for spatial correlation. Multivariate normal distributions
Spatial inference I will start with a simple model, using species diversity data Strong spatial dependence, Î = 0.79 what is the mean diversity? How precise is our estimate? Sampling discussion: The 64
More informationBayesian Areal Wombling for Geographic Boundary Analysis
Bayesian Areal Wombling for Geographic Boundary Analysis Haolan Lu, Haijun Ma, and Bradley P. Carlin haolanl@biostat.umn.edu, haijunma@biostat.umn.edu, and brad@biostat.umn.edu Division of Biostatistics
More informationPropensity Score Weighting with Multilevel Data
Propensity Score Weighting with Multilevel Data Fan Li Department of Statistical Science Duke University October 25, 2012 Joint work with Alan Zaslavsky and Mary Beth Landrum Introduction In comparative
More information[y i α βx i ] 2 (2) Q = i=1
Least squares fits This section has no probability in it. There are no random variables. We are given n points (x i, y i ) and want to find the equation of the line that best fits them. We take the equation
More informationCommunity Health Needs Assessment through Spatial Regression Modeling
Community Health Needs Assessment through Spatial Regression Modeling Glen D. Johnson, PhD CUNY School of Public Health glen.johnson@lehman.cuny.edu Objectives: Assess community needs with respect to particular
More informationLabor-Supply Shifts and Economic Fluctuations. Technical Appendix
Labor-Supply Shifts and Economic Fluctuations Technical Appendix Yongsung Chang Department of Economics University of Pennsylvania Frank Schorfheide Department of Economics University of Pennsylvania January
More informationAreal data models. Spatial smoothers. Brook s Lemma and Gibbs distribution. CAR models Gaussian case Non-Gaussian case
Areal data models Spatial smoothers Brook s Lemma and Gibbs distribution CAR models Gaussian case Non-Gaussian case SAR models Gaussian case Non-Gaussian case CAR vs. SAR STAR models Inference for areal
More informationBayesian Hierarchical Models
Bayesian Hierarchical Models Gavin Shaddick, Millie Green, Matthew Thomas University of Bath 6 th - 9 th December 2016 1/ 34 APPLICATIONS OF BAYESIAN HIERARCHICAL MODELS 2/ 34 OUTLINE Spatial epidemiology
More informationAsymptotic standard errors of MLE
Asymptotic standard errors of MLE Suppose, in the previous example of Carbon and Nitrogen in soil data, that we get the parameter estimates For maximum likelihood estimation, we can use Hessian matrix
More informationBayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes
Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Alan Gelfand 1 and Andrew O. Finley 2 1 Department of Statistical Science, Duke University, Durham, North
More informationPart 8: GLMs and Hierarchical LMs and GLMs
Part 8: GLMs and Hierarchical LMs and GLMs 1 Example: Song sparrow reproductive success Arcese et al., (1992) provide data on a sample from a population of 52 female song sparrows studied over the course
More informationLearning Multiple Tasks with a Sparse Matrix-Normal Penalty
Learning Multiple Tasks with a Sparse Matrix-Normal Penalty Yi Zhang and Jeff Schneider NIPS 2010 Presented by Esther Salazar Duke University March 25, 2011 E. Salazar (Reading group) March 25, 2011 1
More informationNearest Neighbor Gaussian Processes for Large Spatial Data
Nearest Neighbor Gaussian Processes for Large Spatial Data Abhi Datta 1, Sudipto Banerjee 2 and Andrew O. Finley 3 July 31, 2017 1 Department of Biostatistics, Bloomberg School of Public Health, Johns
More informationHierarchical Modelling for Univariate Spatial Data
Hierarchical Modelling for Univariate Spatial Data Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. 2 Department
More informationA Random-effects construction of EMV models - a solution to the Identification problem? Peter E Clarke, Deva Statistical Consulting
A Random-effects construction of EMV models - a solution to the Identification problem? Peter E Clarke, Deva Statistical Consulting Credit Scoring and Credit Control Conference XV Edinburgh, August 2017
More information4 Bias-Variance for Ridge Regression (24 points)
Implement Ridge Regression with λ = 0.00001. Plot the Squared Euclidean test error for the following values of k (the dimensions you reduce to): k = {0, 50, 100, 150, 200, 250, 300, 350, 400, 450, 500,
More informationLinear Regression (9/11/13)
STA561: Probabilistic machine learning Linear Regression (9/11/13) Lecturer: Barbara Engelhardt Scribes: Zachary Abzug, Mike Gloudemans, Zhuosheng Gu, Zhao Song 1 Why use linear regression? Figure 1: Scatter
More informationHierarchical Modeling for Univariate Spatial Data
Hierarchical Modeling for Univariate Spatial Data Geography 890, Hierarchical Bayesian Models for Environmental Spatial Data Analysis February 15, 2011 1 Spatial Domain 2 Geography 890 Spatial Domain This
More informationBayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes
Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota,
More informationPhysician Performance Assessment / Spatial Inference of Pollutant Concentrations
Physician Performance Assessment / Spatial Inference of Pollutant Concentrations Dawn Woodard Operations Research & Information Engineering Cornell University Johns Hopkins Dept. of Biostatistics, April
More informationHierarchical Nearest-Neighbor Gaussian Process Models for Large Geo-statistical Datasets
Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geo-statistical Datasets Abhirup Datta 1 Sudipto Banerjee 1 Andrew O. Finley 2 Alan E. Gelfand 3 1 University of Minnesota, Minneapolis,
More informationA Fully Nonparametric Modeling Approach to. BNP Binary Regression
A Fully Nonparametric Modeling Approach to Binary Regression Maria Department of Applied Mathematics and Statistics University of California, Santa Cruz SBIES, April 27-28, 2012 Outline 1 2 3 Simulation
More informationGibbs Sampling in Linear Models #2
Gibbs Sampling in Linear Models #2 Econ 690 Purdue University Outline 1 Linear Regression Model with a Changepoint Example with Temperature Data 2 The Seemingly Unrelated Regressions Model 3 Gibbs sampling
More informationHierarchical Modelling for Univariate Spatial Data
Spatial omain Hierarchical Modelling for Univariate Spatial ata Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A.
More informationChapter 11. Regression with a Binary Dependent Variable
Chapter 11 Regression with a Binary Dependent Variable 2 Regression with a Binary Dependent Variable (SW Chapter 11) So far the dependent variable (Y) has been continuous: district-wide average test score
More informationBayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes
Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Andrew O. Finley 1 and Sudipto Banerjee 2 1 Department of Forestry & Department of Geography, Michigan
More informationMachine learning comes from Bayesian decision theory in statistics. There we want to minimize the expected value of the loss function.
Bayesian learning: Machine learning comes from Bayesian decision theory in statistics. There we want to minimize the expected value of the loss function. Let y be the true label and y be the predicted
More informationMetropolis-Hastings Algorithm
Strength of the Gibbs sampler Metropolis-Hastings Algorithm Easy algorithm to think about. Exploits the factorization properties of the joint probability distribution. No difficult choices to be made to
More informationBayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes
Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Andrew O. Finley Department of Forestry & Department of Geography, Michigan State University, Lansing
More informationBayesian Linear Models
Bayesian Linear Models Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. 2 Department of Forestry & Department
More informationWeb Appendices: Hierarchical and Joint Site-Edge Methods for Medicare Hospice Service Region Boundary Analysis
Web Appendices: Hierarchical and Joint Site-Edge Methods for Medicare Hospice Service Region Boundary Analysis Haijun Ma, Bradley P. Carlin and Sudipto Banerjee December 8, 2008 Web Appendix A: Selecting
More informationSimple and Multiple Linear Regression
Sta. 113 Chapter 12 and 13 of Devore March 12, 2010 Table of contents 1 Simple Linear Regression 2 Model Simple Linear Regression A simple linear regression model is given by Y = β 0 + β 1 x + ɛ where
More informationMaking rating curves - the Bayesian approach
Making rating curves - the Bayesian approach Rating curves what is wanted? A best estimate of the relationship between stage and discharge at a given place in a river. The relationship should be on the
More informationStructural Interpretation of Vector Autoregressions with Incomplete Identification: Revisiting the Role of Oil Supply and Demand Shocks
Structural Interpretation of Vector Autoregressions with Incomplete Identification: Revisiting the Role of Oil Supply and Demand Shocks Christiane Baumeister, University of Notre Dame James D. Hamilton,
More informationMedical GIS: New Uses of Mapping Technology in Public Health. Peter Hayward, PhD Department of Geography SUNY College at Oneonta
Medical GIS: New Uses of Mapping Technology in Public Health Peter Hayward, PhD Department of Geography SUNY College at Oneonta Invited research seminar presentation at Bassett Healthcare. Cooperstown,
More informationTHE UNIVERSITY OF CHICAGO Graduate School of Business Business 41912, Spring Quarter 2008, Mr. Ruey S. Tsay. Solutions to Final Exam
THE UNIVERSITY OF CHICAGO Graduate School of Business Business 41912, Spring Quarter 2008, Mr. Ruey S. Tsay Solutions to Final Exam 1. (13 pts) Consider the monthly log returns, in percentages, of five
More informationLinear Models A linear model is defined by the expression
Linear Models A linear model is defined by the expression x = F β + ɛ. where x = (x 1, x 2,..., x n ) is vector of size n usually known as the response vector. β = (β 1, β 2,..., β p ) is the transpose
More informationDS-GA 1002 Lecture notes 12 Fall Linear regression
DS-GA Lecture notes 1 Fall 16 1 Linear models Linear regression In statistics, regression consists of learning a function relating a certain quantity of interest y, the response or dependent variable,
More informationMachine Learning Linear Classification. Prof. Matteo Matteucci
Machine Learning Linear Classification Prof. Matteo Matteucci Recall from the first lecture 2 X R p Regression Y R Continuous Output X R p Y {Ω 0, Ω 1,, Ω K } Classification Discrete Output X R p Y (X)
More informationSpatial Lasso with Application to GIS Model Selection. F. Jay Breidt Colorado State University
Spatial Lasso with Application to GIS Model Selection F. Jay Breidt Colorado State University with Hsin-Cheng Huang, Nan-Jung Hsu, and Dave Theobald September 25 The work reported here was developed under
More informationModule 11: Linear Regression. Rebecca C. Steorts
Module 11: Linear Regression Rebecca C. Steorts Announcements Today is the last class Homework 7 has been extended to Thursday, April 20, 11 PM. There will be no lab tomorrow. There will be office hours
More informationSpatial Inference of Nitrate Concentrations in Groundwater
Spatial Inference of Nitrate Concentrations in Groundwater Dawn Woodard Operations Research & Information Engineering Cornell University joint work with Robert Wolpert, Duke Univ. Dept. of Statistical
More informationChapter 11: Robust & Quantile regression
Chapter 11: Robust & Adapted from Timothy Hanson Department of Statistics, University of South Carolina Stat 704: Data Analysis I 1 / 13 11.3: Robust regression Leverages h ii and deleted residuals t i
More informationHierarchical Modeling for Multivariate Spatial Data
Hierarchical Modeling for Multivariate Spatial Data Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. 2 Department
More informationWeek 3: Linear Regression
Week 3: Linear Regression Instructor: Sergey Levine Recap In the previous lecture we saw how linear regression can solve the following problem: given a dataset D = {(x, y ),..., (x N, y N )}, learn to
More informationMultivariate Statistics
Multivariate Statistics Chapter 4: Factor analysis Pedro Galeano Departamento de Estadística Universidad Carlos III de Madrid pedro.galeano@uc3m.es Course 2017/2018 Master in Mathematical Engineering Pedro
More informationProbabilistic Graphical Models
Probabilistic Graphical Models Brown University CSCI 2950-P, Spring 2013 Prof. Erik Sudderth Lecture 12: Gaussian Belief Propagation, State Space Models and Kalman Filters Guest Kalman Filter Lecture by
More informationA Bayesian Perspective on Residential Demand Response Using Smart Meter Data
A Bayesian Perspective on Residential Demand Response Using Smart Meter Data Datong-Paul Zhou, Maximilian Balandat, and Claire Tomlin University of California, Berkeley [datong.zhou, balandat, tomlin]@eecs.berkeley.edu
More informationEconometrics of Panel Data
Econometrics of Panel Data Jakub Mućk Meeting # 2 Jakub Mućk Econometrics of Panel Data Meeting # 2 1 / 26 Outline 1 Fixed effects model The Least Squares Dummy Variable Estimator The Fixed Effect (Within
More informationBayesian spatial hierarchical modeling for temperature extremes
Bayesian spatial hierarchical modeling for temperature extremes Indriati Bisono Dr. Andrew Robinson Dr. Aloke Phatak Mathematics and Statistics Department The University of Melbourne Maths, Informatics
More informationLinear regression. DS GA 1002 Statistical and Mathematical Models. Carlos Fernandez-Granda
Linear regression DS GA 1002 Statistical and Mathematical Models http://www.cims.nyu.edu/~cfgranda/pages/dsga1002_fall15 Carlos Fernandez-Granda Linear models Least-squares estimation Overfitting Example:
More informationThe linear regression model: functional form and structural breaks
The linear regression model: functional form and structural breaks Ragnar Nymoen Department of Economics, UiO 16 January 2009 Overview Dynamic models A little bit more about dynamics Extending inference
More informationFactor Models for Asset Returns. Prof. Daniel P. Palomar
Factor Models for Asset Returns Prof. Daniel P. Palomar The Hong Kong University of Science and Technology (HKUST) MAFS6010R- Portfolio Optimization with R MSc in Financial Mathematics Fall 2018-19, HKUST,
More informationApplied Economics. Regression with a Binary Dependent Variable. Department of Economics Universidad Carlos III de Madrid
Applied Economics Regression with a Binary Dependent Variable Department of Economics Universidad Carlos III de Madrid See Stock and Watson (chapter 11) 1 / 28 Binary Dependent Variables: What is Different?
More informationMCMC algorithms for fitting Bayesian models
MCMC algorithms for fitting Bayesian models p. 1/1 MCMC algorithms for fitting Bayesian models Sudipto Banerjee sudiptob@biostat.umn.edu University of Minnesota MCMC algorithms for fitting Bayesian models
More informationApplied Regression. Applied Regression. Chapter 2 Simple Linear Regression. Hongcheng Li. April, 6, 2013
Applied Regression Chapter 2 Simple Linear Regression Hongcheng Li April, 6, 2013 Outline 1 Introduction of simple linear regression 2 Scatter plot 3 Simple linear regression model 4 Test of Hypothesis
More informationFactor Analysis and Kalman Filtering (11/2/04)
CS281A/Stat241A: Statistical Learning Theory Factor Analysis and Kalman Filtering (11/2/04) Lecturer: Michael I. Jordan Scribes: Byung-Gon Chun and Sunghoon Kim 1 Factor Analysis Factor analysis is used
More informationSUPERVISED LEARNING: INTRODUCTION TO CLASSIFICATION
SUPERVISED LEARNING: INTRODUCTION TO CLASSIFICATION 1 Outline Basic terminology Features Training and validation Model selection Error and loss measures Statistical comparison Evaluation measures 2 Terminology
More information6.867 Machine Learning
6.867 Machine Learning Problem Set 2 Due date: Wednesday October 6 Please address all questions and comments about this problem set to 6867-staff@csail.mit.edu. You will need to use MATLAB for some of
More informationComputer Vision Group Prof. Daniel Cremers. 10a. Markov Chain Monte Carlo
Group Prof. Daniel Cremers 10a. Markov Chain Monte Carlo Markov Chain Monte Carlo In high-dimensional spaces, rejection sampling and importance sampling are very inefficient An alternative is Markov Chain
More informationA Brief and Friendly Introduction to Mixed-Effects Models in Linguistics
A Brief and Friendly Introduction to Mixed-Effects Models in Linguistics Cluster-specific parameters ( random effects ) Σb Parameters governing inter-cluster variability b1 b2 bm x11 x1n1 x21 x2n2 xm1
More informationVector Autoregressive Model. Vector Autoregressions II. Estimation of Vector Autoregressions II. Estimation of Vector Autoregressions I.
Vector Autoregressive Model Vector Autoregressions II Empirical Macroeconomics - Lect 2 Dr. Ana Beatriz Galvao Queen Mary University of London January 2012 A VAR(p) model of the m 1 vector of time series
More informationBasics of Experimental Design. Review of Statistics. Basic Study. Experimental Design. When an Experiment is Not Possible. Studying Relations
Basics of Experimental Design Review of Statistics And Experimental Design Scientists study relation between variables In the context of experiments these variables are called independent and dependent
More informationPart 6: Multivariate Normal and Linear Models
Part 6: Multivariate Normal and Linear Models 1 Multiple measurements Up until now all of our statistical models have been univariate models models for a single measurement on each member of a sample of
More informationST 740: Linear Models and Multivariate Normal Inference
ST 740: Linear Models and Multivariate Normal Inference Alyson Wilson Department of Statistics North Carolina State University November 4, 2013 A. Wilson (NCSU STAT) Linear Models November 4, 2013 1 /
More informationSpatio-Temporal Threshold Models for Relating UV Exposures and Skin Cancer in the Central United States
Spatio-Temporal Threshold Models for Relating UV Exposures and Skin Cancer in the Central United States Laura A. Hatfield and Bradley P. Carlin Division of Biostatistics School of Public Health University
More informationy Xw 2 2 y Xw λ w 2 2
CS 189 Introduction to Machine Learning Spring 2018 Note 4 1 MLE and MAP for Regression (Part I) So far, we ve explored two approaches of the regression framework, Ordinary Least Squares and Ridge Regression:
More informationspbayes: An R Package for Univariate and Multivariate Hierarchical Point-referenced Spatial Models
spbayes: An R Package for Univariate and Multivariate Hierarchical Point-referenced Spatial Models Andrew O. Finley 1, Sudipto Banerjee 2, and Bradley P. Carlin 2 1 Michigan State University, Departments
More informationOnline Appendices, Not for Publication
Online Appendices, Not for Publication Appendix A. Network definitions In this section, we provide basic definitions and interpretations for the different network characteristics that we consider. At the
More informationAreal Unit Data Regular or Irregular Grids or Lattices Large Point-referenced Datasets
Areal Unit Data Regular or Irregular Grids or Lattices Large Point-referenced Datasets Is there spatial pattern? Chapter 3: Basics of Areal Data Models p. 1/18 Areal Unit Data Regular or Irregular Grids
More informationSpacetime models in R-INLA. Elias T. Krainski
Spacetime models in R-INLA Elias T. Krainski 2 Outline Separable space-time models Infant mortality in Paraná PM-10 concentration in Piemonte, Italy 3 Multivariate dynamic regression model y t : n observations
More informationEmpirical Market Microstructure Analysis (EMMA)
Empirical Market Microstructure Analysis (EMMA) Lecture 3: Statistical Building Blocks and Econometric Basics Prof. Dr. Michael Stein michael.stein@vwl.uni-freiburg.de Albert-Ludwigs-University of Freiburg
More informationFor more information about how to cite these materials visit
Author(s): Kerby Shedden, Ph.D., 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution Share Alike 3.0 License: http://creativecommons.org/licenses/by-sa/3.0/
More informationItem Reliability Analysis
Item Reliability Analysis Revised: 10/11/2017 Summary... 1 Data Input... 4 Analysis Options... 5 Tables and Graphs... 5 Analysis Summary... 6 Matrix Plot... 8 Alpha Plot... 10 Correlation Matrix... 11
More informationLecture 7 Autoregressive Processes in Space
Lecture 7 Autoregressive Processes in Space Dennis Sun Stanford University Stats 253 July 8, 2015 1 Last Time 2 Autoregressive Processes in Space 3 Estimating Parameters 4 Testing for Spatial Autocorrelation
More informationMultivariate Analysis
Multivariate Analysis Chapter 5: Cluster analysis Pedro Galeano Departamento de Estadística Universidad Carlos III de Madrid pedro.galeano@uc3m.es Course 2015/2016 Master in Business Administration and
More informationMS&E 226: Small Data
MS&E 226: Small Data Lecture 3: More on linear regression (v3) Ramesh Johari ramesh.johari@stanford.edu 1 / 59 Recap: Linear regression 2 / 59 The linear regression model Given: n outcomes Y i, i = 1,...,
More informationStatistics: A review. Why statistics?
Statistics: A review Why statistics? What statistical concepts should we know? Why statistics? To summarize, to explore, to look for relations, to predict What kinds of data exist? Nominal, Ordinal, Interval
More informationOn Gaussian Process Models for High-Dimensional Geostatistical Datasets
On Gaussian Process Models for High-Dimensional Geostatistical Datasets Sudipto Banerjee Joint work with Abhirup Datta, Andrew O. Finley and Alan E. Gelfand University of California, Los Angeles, USA May
More informationIndependent Component (IC) Models: New Extensions of the Multinormal Model
Independent Component (IC) Models: New Extensions of the Multinormal Model Davy Paindaveine (joint with Klaus Nordhausen, Hannu Oja, and Sara Taskinen) School of Public Health, ULB, April 2008 My research
More informationGOV 2001/ 1002/ E-200 Section 7 Zero-Inflated models and Intro to Multilevel Modeling 1
GOV 2001/ 1002/ E-200 Section 7 Zero-Inflated models and Intro to Multilevel Modeling 1 Anton Strezhnev Harvard University March 23, 2016 1 These section notes are heavily indebted to past Gov 2001 TFs
More informationRegression, Ridge Regression, Lasso
Regression, Ridge Regression, Lasso Fabio G. Cozman - fgcozman@usp.br October 2, 2018 A general definition Regression studies the relationship between a response variable Y and covariates X 1,..., X n.
More informationRegression. Machine Learning and Pattern Recognition. Chris Williams. School of Informatics, University of Edinburgh.
Regression Machine Learning and Pattern Recognition Chris Williams School of Informatics, University of Edinburgh September 24 (All of the slides in this course have been adapted from previous versions
More informationStatistics & Data Sciences: First Year Prelim Exam May 2018
Statistics & Data Sciences: First Year Prelim Exam May 2018 Instructions: 1. Do not turn this page until instructed to do so. 2. Start each new question on a new sheet of paper. 3. This is a closed book
More informationW-BASED VS LATENT VARIABLES SPATIAL AUTOREGRESSIVE MODELS: EVIDENCE FROM MONTE CARLO SIMULATIONS
1 W-BASED VS LATENT VARIABLES SPATIAL AUTOREGRESSIVE MODELS: EVIDENCE FROM MONTE CARLO SIMULATIONS An Liu University of Groningen Henk Folmer University of Groningen Wageningen University Han Oud Radboud
More informationStatistics 352: Spatial statistics. Jonathan Taylor. Department of Statistics. Models for discrete data. Stanford University.
352: 352: Models for discrete data April 28, 2009 1 / 33 Models for discrete data 352: Outline Dependent discrete data. Image data (binary). Ising models. Simulation: Gibbs sampling. Denoising. 2 / 33
More informationFrom Practical Data Analysis with JMP, Second Edition. Full book available for purchase here. About This Book... xiii About The Author...
From Practical Data Analysis with JMP, Second Edition. Full book available for purchase here. Contents About This Book... xiii About The Author... xxiii Chapter 1 Getting Started: Data Analysis with JMP...
More informationBayesian SAE using Complex Survey Data Lecture 4A: Hierarchical Spatial Bayes Modeling
Bayesian SAE using Complex Survey Data Lecture 4A: Hierarchical Spatial Bayes Modeling Jon Wakefield Departments of Statistics and Biostatistics University of Washington 1 / 37 Lecture Content Motivation
More informationLattice Data. Tonglin Zhang. Spatial Statistics for Point and Lattice Data (Part III)
Title: Spatial Statistics for Point Processes and Lattice Data (Part III) Lattice Data Tonglin Zhang Outline Description Research Problems Global Clustering and Local Clusters Permutation Test Spatial
More informationCSC 411: Lecture 09: Naive Bayes
CSC 411: Lecture 09: Naive Bayes Class based on Raquel Urtasun & Rich Zemel s lectures Sanja Fidler University of Toronto Feb 8, 2015 Urtasun, Zemel, Fidler (UofT) CSC 411: 09-Naive Bayes Feb 8, 2015 1
More informationNon-Stationary Time Series and Unit Root Testing
Econometrics II Non-Stationary Time Series and Unit Root Testing Morten Nyboe Tabor Course Outline: Non-Stationary Time Series and Unit Root Testing 1 Stationarity and Deviation from Stationarity Trend-Stationarity
More informationMassachusetts Institute of Technology
Massachusetts Institute of Technology 6.867 Machine Learning, Fall 2006 Problem Set 5 Due Date: Thursday, Nov 30, 12:00 noon You may submit your solutions in class or in the box. 1. Wilhelm and Klaus are
More informationGaussian Multiscale Spatio-temporal Models for Areal Data
Gaussian Multiscale Spatio-temporal Models for Areal Data (University of Missouri) Scott H. Holan (University of Missouri) Adelmo I. Bertolde (UFES) Outline Motivation Multiscale factorization The multiscale
More informationPrinciples of forecasting
2.5 Forecasting Principles of forecasting Forecast based on conditional expectations Suppose we are interested in forecasting the value of y t+1 based on a set of variables X t (m 1 vector). Let y t+1
More informationARIC Manuscript Proposal # PC Reviewed: _9/_25_/06 Status: A Priority: _2 SC Reviewed: _9/_25_/06 Status: A Priority: _2
ARIC Manuscript Proposal # 1186 PC Reviewed: _9/_25_/06 Status: A Priority: _2 SC Reviewed: _9/_25_/06 Status: A Priority: _2 1.a. Full Title: Comparing Methods of Incorporating Spatial Correlation in
More informationTechnical Vignette 5: Understanding intrinsic Gaussian Markov random field spatial models, including intrinsic conditional autoregressive models
Technical Vignette 5: Understanding intrinsic Gaussian Markov random field spatial models, including intrinsic conditional autoregressive models Christopher Paciorek, Department of Statistics, University
More information