Beyond the Average Man: The Art of Unlikelihood

Similar documents
Quantile Regression: Past and Prospects

Quantile Regression Methods for Reference Growth Charts

ECAS Summer Course. Quantile Regression for Longitudinal Data. Roger Koenker University of Illinois at Urbana-Champaign

Nonparametric Quantile Regression

ECAS Summer Course. Fundamentals of Quantile Regression Roger Koenker University of Illinois at Urbana-Champaign

Penalty Methods for Bivariate Smoothing and Chicago Land Values

Quantile Regression for Longitudinal Data

Quantile Regression for Panel/Longitudinal Data

Bayesian spatial quantile regression

Single Index Quantile Regression for Heteroscedastic Data

Quantile Regression for Dynamic Panel Data

WEIGHTED QUANTILE REGRESSION THEORY AND ITS APPLICATION. Abstract

INFERENCE APPROACHES FOR INSTRUMENTAL VARIABLE QUANTILE REGRESSION. 1. Introduction

Total Variation Regularization for Bivariate Density Estimation

Economics 536 Lecture 7. Introduction to Specification Testing in Dynamic Econometric Models

A Modern Look at Classical Multivariate Techniques

Stat 5101 Lecture Notes

Under Appropriate Regularity Conditions: And Without Loss of Generality

SEMIPARAMETRIC QUANTILE REGRESSION WITH HIGH-DIMENSIONAL COVARIATES. Liping Zhu, Mian Huang, & Runze Li. The Pennsylvania State University

Alternatives to Basis Expansions. Kernels in Density Estimation. Kernels and Bandwidth. Idea Behind Kernel Methods

Issues on quantile autoregression

Robust Predictions in Games with Incomplete Information

Quantile Regression Computation: From Outside, Inside and the Proximal

Single Index Quantile Regression for Heteroscedastic Data

A Resampling Method on Pivotal Estimating Functions

UNIVERSITY OF NOTTINGHAM. Discussion Papers in Economics CONSISTENT FIRM CHOICE AND THE THEORY OF SUPPLY

Econometrics I. Professor William Greene Stern School of Business Department of Economics 1-1/40. Part 1: Introduction

Quantile methods. Class Notes Manuel Arellano December 1, Let F (r) =Pr(Y r). Forτ (0, 1), theτth population quantile of Y is defined to be

Generalized Elastic Net Regression

Flexible Regression Modeling using Bayesian Nonparametric Mixtures

Econometrics with Observational Data. Introduction and Identification Todd Wagner February 1, 2017

Next, we discuss econometric methods that can be used to estimate panel data models.

Recovering Copulae from Conditional Quantiles

Quantile Regression: Inference

New Developments in Econometrics Lecture 16: Quantile Estimation

Machine Learning for OR & FE

The Instability of Correlations: Measurement and the Implications for Market Risk

Habilitationsvortrag: Machine learning, shrinkage estimation, and economic theory

Design of Text Mining Experiments. Matt Taddy, University of Chicago Booth School of Business faculty.chicagobooth.edu/matt.

Supplementary Material for Wang and Serfling paper

Bayesian Inference for DSGE Models. Lawrence J. Christiano

A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness

Selection endogenous dummy ordered probit, and selection endogenous dummy dynamic ordered probit models

Quantile Regression: Inference

Implementation and Application of the Curds and Whey Algorithm to Regression Problems

ECE521 lecture 4: 19 January Optimization, MLE, regularization

Comments on: Panel Data Analysis Advantages and Challenges. Manuel Arellano CEMFI, Madrid November 2006

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

Economics 536 Lecture 21 Counts, Tobit, Sample Selection, and Truncation

Unobserved Heterogeneity in Longitudinal Data An Empirical Bayes Perspective

IDENTIFICATION OF MARGINAL EFFECTS IN NONSEPARABLE MODELS WITHOUT MONOTONICITY

Bayesian estimation of bandwidths for a nonparametric regression model with a flexible error density

Independent and conditionally independent counterfactual distributions

Flexible Estimation of Treatment Effect Parameters

Fixed Effects, Invariance, and Spatial Variation in Intergenerational Mobility

Empirical Bayes Quantile-Prediction aka E-B Prediction under Check-loss;

Research Note: A more powerful test statistic for reasoning about interference between units

regression Lie Wang Abstract In this paper, the high-dimensional sparse linear regression model is considered,

Study of Causal Relationships in Macroeconomics

Field Course Descriptions

Near Ideal Behavior of a Modified Elastic Net Algorithm in Compressed Sensing

Modeling Real Estate Data using Quantile Regression

Adaptive estimation of the copula correlation matrix for semiparametric elliptical copulas

Analysing geoadditive regression data: a mixed model approach

Chapter 1. Linear Regression with One Predictor Variable

What s New in Econometrics. Lecture 1

Measuring Social Influence Without Bias

A Note on Demand Estimation with Supply Information. in Non-Linear Models

Dynamic analysis of binary longitudinal data

Approximate Bayesian Computation

Vector Auto-Regressive Models

27. SIMPLE LINEAR REGRESSION II

Stochastic Spectral Approaches to Bayesian Inference

Preliminaries The bootstrap Bias reduction Hypothesis tests Regression Confidence intervals Time series Final remark. Bootstrap inference

Applied Econometrics Lecture 1

multilevel modeling: concepts, applications and interpretations

Outline. Overview of Issues. Spatial Regression. Luc Anselin

VAR Models and Applications

ECE521 week 3: 23/26 January 2017

STA 4273H: Statistical Machine Learning

Cheap talk theoretic tools for distributed sensing in the presence of strategic sensors

Augmented and unconstrained: revisiting the Regional Knowledge Production Function

Sparse Nonparametric Density Estimation in High Dimensions Using the Rodeo

The Identification of ARIMA Models

MKTG 555: Marketing Models

Generated Covariates in Nonparametric Estimation: A Short Review.

Quantile Regression for Extraordinarily Large Data

Hakone Seminar Recent Developments in Statistics

Indirect Rule Learning: Support Vector Machines. Donglin Zeng, Department of Biostatistics, University of North Carolina

PARSIMONIOUS MULTIVARIATE COPULA MODEL FOR DENSITY ESTIMATION. Alireza Bayestehtashk and Izhak Shafran

Hierarchical Modeling for Univariate Spatial Data

Recent Developments in Compressed Sensing

Some Preliminary Market Research: A Googoloscopy. Parametric Links for Binary Response. Some Preliminary Market Research: A Googoloscopy

Chapter 1 Introduction. What are longitudinal and panel data? Benefits and drawbacks of longitudinal data Longitudinal data models Historical notes


Regularization in Cox Frailty Models

SF2930: REGRESION ANALYSIS LECTURE 1 SIMPLE LINEAR REGRESSION.

What s New in Econometrics? Lecture 14 Quantile Methods

Tobit and Selection Models

Problem Set 3: Bootstrap, Quantile Regression and MCMC Methods. MIT , Fall Due: Wednesday, 07 November 2007, 5:00 PM

Transcription:

Beyond the Average Man: The Art of Unlikelihood Roger Koenker University of Illinois, Urbana-Champaign FEMES/Taipei: July, 2007 Roger Koenker (U. of Illinois) Beyond the Average Man FEMES/Taipei: July, 2007 1 / 28

An Outline A Brief Biography of the Average Man Science Begins with Measurement (Kelvin) Convergence versus Diversity Heterogeneity and the Regression Fallacy Skepticism and Unlikelihoods Robustness: Local versus Global Models L 1 Regularization as Occam s (New) Razor Progress in (Quantile) Regression: Four Directions Additive Non-parametric Models Longitudinal (Panel) Data Nonlinear Filtering Multivariate Conditional Quantiles Roger Koenker (U. of Illinois) Beyond the Average Man FEMES/Taipei: July, 2007 2 / 28

The Father of the Average Man Adolphe Quetelet (1796-1874) Quetelet s (1835) Sur l Homme (On Man) invented the Average Man. Through systematic measurement and relentless averaging Quetelet sought to extract man s essential qualities: social, economic, aesthetic, and moral. Roger Koenker (U. of Illinois) Beyond the Average Man FEMES/Taipei: July, 2007 3 / 28

The Mother of the Average Man Florence Nightingale (1820-1910) Heroine of the Crimean War, Patron Saint of Nurses, admirer of Quetelet, and champion of the scientific, i.e. statistical, study of society. To Nightingale every piece of legislation was an experiment in the laboratory of society deserving study and demanding evaluation. Roger Koenker (U. of Illinois) Beyond the Average Man FEMES/Taipei: July, 2007 4 / 28

Portrait of the Average Man Quetelet s (1871) Anthropométrie Roger Koenker (U. of Illinois) Beyond the Average Man FEMES/Taipei: July, 2007 5 / 28

The Schoolmaster of the Average Man Francis Galton (1822-1911) Victorian adventurer, polymath and inveterate data collector and analyst. Galton s (1885) discovery of regression and correlation paved the way for the Average Man as a scientific construct. Roger Koenker (U. of Illinois) Beyond the Average Man FEMES/Taipei: July, 2007 6 / 28

Galton s Regression to Mediocrity Galton s Discovery of Regression (1885) Roger Koenker (U. of Illinois) Beyond the Average Man FEMES/Taipei: July, 2007 7 / 28

Galton s Most Likely Children Children are more mediocre than their parents: Children of midparents 3 inches taller than average are most likely to be 2 inches taller than average. Children of midparents 3 inches shorter than average are most likely to be 2 inches shorter than average. But parents are also more mediocre than their children: Midparents of children 3 inches taller than average are most likely to be 2 inches taller than average. Midparents of children 3 inches shorter than average are most likely to be 2 inches shorter than average. Roger Koenker (U. of Illinois) Beyond the Average Man FEMES/Taipei: July, 2007 8 / 28

Galton s Bivariate Gaussian Sandbox Given the bivariate Gaussian form of the joint distribution of heights: Conditional mean heights are linear in midparents height. Conditional density of height is symmetric, so Means, medians and modes (most likely heights) coalesce. Conditional scale and shape are invariant. It all seems a little too good to be true and even for heights it is, at best, only a decent first approximation. Roger Koenker (U. of Illinois) Beyond the Average Man FEMES/Taipei: July, 2007 9 / 28

Convergence versus Diversity Galton s beautiful figure should come with a health warning: Prolonged exposure to regression may cause confusion about convergence. Quetelet already exhibits symptoms of this malady: The more knowledge is diffused, so much the more do the deviations from the average disappear; and the more, consequently, do we tend to approach that which is beautiful, that which is good. [ On Man, p.108] Roger Koenker (U. of Illinois) Beyond the Average Man FEMES/Taipei: July, 2007 10 / 28

Secrist and the Regression Fallacy Stigler s (1996) paper The History of Statistics in 1933 provides a classic economic case study: In 1933 Horace Secrist published The Triumph of Mediocrity that advanced the alarming thesis that American business was inexorably tending toward mediocrity: Groups of highly profitable firms, over time, on average lost profitability, less profitable firms gained. Harold Hotelling wrote a devastating review in JASA explaining that this was simply Galton s regression effect: firms were not nor were human heights converging to their respective means. Milton Friedman, a student of Hotelling s in 1933, felt compelled to write an essay for the JEL in 1992 called Will Old Fallacies Ever Die? to combat similar foolishness in contemporary economics. Roger Koenker (U. of Illinois) Beyond the Average Man FEMES/Taipei: July, 2007 11 / 28

The Demise of the Average Man Robert Doisneau s (1952) [Representative] Consumer The Average Man, rehabilited in recent years as The Representative Agent has not aged well. Here he is down and out living in Paris. We have converged not the the ideal world of Quetelet s Average Man, but to the realization that we need to take heterogeneity more seriously. Roger Koenker (U. of Illinois) Beyond the Average Man FEMES/Taipei: July, 2007 12 / 28

The Representative Agent An Inconvenient Fiction Heterogeneity is not the inessential cloud obscuring the ideal average man, it is precisely this diversity that we need to better understand. Diversity of tastes, beliefs, endowments is crucial without it much of economic life comes to a standstill. Aggregation does not preserve economic rationality, the average man is not homo economicus: Sonnenschein, Debreu... Treatment effects are often heterogeneous, and this is crucial to policy evaluation: Heckman,... Roger Koenker (U. of Illinois) Beyond the Average Man FEMES/Taipei: July, 2007 13 / 28

Normal Skepticism and Unlikelihood Before his conversion to the Gaussian faith, Galton was a confirmed nonparamet-nik. His obsessive data analysis relied almost exclusively on: The median as a measure of location, Half the interquartile range as probable deviation. The modern version of this is Tukey s boxplot and 5-Quantile summary: Q(τ) : τ {0,.25,.50,.75, 1} which serve to roughly summarize the form of the observed distribution. Sufficiency of the mean and standard deviation require a dangerous leap of Gaussian faith, rarely justified when samples are moderately large. Roger Koenker (U. of Illinois) Beyond the Average Man FEMES/Taipei: July, 2007 14 / 28

Data Analysis versus Analysis of Data Two principles compete for econometric attention: Panzar Principle: Are there conditions, however far-fetched, under which a given statistical analysis would have been optimal? Tukey Principle: Better an approximate answer to the right question, than an exact answer to the wrong question. There should always be a healthy tension between good structural estimation and good descriptive data analysis. (Koopmans (1947)) This tension is, in effect, the interplay between Tukey s exploratory and confirmatory data analysis. Roger Koenker (U. of Illinois) Beyond the Average Man FEMES/Taipei: July, 2007 15 / 28

Likehoods versus Unlikelihoods Classical Bayesian and Fisherian statistical theory share a common faith: How do we know the likelihood? What makes a likelihood likely? Classical robustness critique scorns the Gaussian likelihood. Such doubts lead to partially specified, semiparametric models. We minimize unlikelihood instead of maximizing likelihood. ρ τ (u) τ 1 τ Quantile Unlikelihood Roger Koenker (U. of Illinois) Beyond the Average Man FEMES/Taipei: July, 2007 16 / 28

Quantile Unlikelihoods Quantiles are an easily interpretable way to characterize distributions. Defined via optimization, a wide range of generalizations are possible: ˆα(τ) = argmin α ρτ (y i α) τth sample quantile ˆβ(τ) = argmin β ρτ (y i x i β) τth regression quantile ˆγ(τ) = argmin γ ρτ (y i g(x i, γ)) τth nonlinear regression quantile Roger Koenker (U. of Illinois) Beyond the Average Man FEMES/Taipei: July, 2007 17 / 28

Nonparametric Quantile Regression One can easily do univariate locally polynomial QR by kernel weighting: min ρτ (y i γ 0 (x i x)γ 1 (x i x) p γ p )K h (x x i ), γ But this becomes difficult with higher dimensional x. An alternative is min ρτ (y i g(x i )) + λ 2 g dx. g Where the penalty term can be interpreted as total variation of g. Additive models of this type can also be easily estimated: min ρ τ (y i g j (x ij )) + λ j 2 g j dx. g i j j In R such models can be estimated with rqss in my quantreg package. Roger Koenker (U. of Illinois) Beyond the Average Man FEMES/Taipei: July, 2007 18 / 28

Chicago Land Values via TV Regularization Chicago Land Values: Based on 1194 vacant land sales and 7505 virtual sales introduced to increase the flexibility of the triangulation. Roger Koenker (U. of Illinois) Beyond the Average Man FEMES/Taipei: July, 2007 19 / 28

Quantile Regression for Panel Data Time-series and panel data offer many interesting challenges, again regularization is a critical device. Consider the panel model, which can be estimated by solving, min m n j=1 i=1 t=1 Q Yit (τ x it ) = α i + x it β(τ) T ρ τ (y it α i x it β(τ j)) + λ Dα 1. The regularization penalty shrinks the fixed effects toward a common value. Such problems are now feasible for large m, n, T thanks to recent developments in Barrier methods for linear programming, Frisch (1956),... Sparse Linear Algebra Roger Koenker (U. of Illinois) Beyond the Average Man FEMES/Taipei: July, 2007 20 / 28

Shrinkage: l 1 versus l 2 Penalties α 5 0 5 10 α 5 0 5 10 0 1 2 3 4 5 λ 0 1 2 3 4 5 λ The l 1 penalty tends to shrink most fixed effects to zero very quickly, while the l 2 penalty imposes much more gradual shrinkage. Roger Koenker (U. of Illinois) Beyond the Average Man FEMES/Taipei: July, 2007 21 / 28

Regularization Penalties and Model Selection Classical l 2 regularization methods originating in the work of Tihkonov and Stein are too gentle; the polyhedral nature of l 1 penalties are much better suited to model selection: Lasso/Lars of Tibshirani, Efron,... Basis Pursuit of Donoho,.... Dantzig Selector of Candes and Tao. l 1 regularization is an emerging growth area in statistics; promising applications to imaging, signal processing and encryption. Strongly dependent on recent developments in convex optimization. Roger Koenker (U. of Illinois) Beyond the Average Man FEMES/Taipei: July, 2007 22 / 28

Whispering Down the Line Problem: Transmit x R n over a noisey channel. Encoding: Send y = Ax for A R m n, m n, and receive either: (1) ỹ = Ax + u (2) ỹ = Ax + u + v where u Gaussian, and v is (sparsely) arbitrarily bad. Decoding: Set Q = I A(A A) 1 A and do either: (1) ˆx = (A A) 1 A ỹ (2) ˆx = (A A) 1 A (ỹ ṽ) where: ṽ = argmin{ v 1 such that Q(ỹ v) K} Roger Koenker (U. of Illinois) Beyond the Average Man FEMES/Taipei: July, 2007 23 / 28

Dantzig Decoding: An Example Let n = 256, m = 512, and entries in x, u and A be iid standard Gaussian, and let v be the mixture: v i = 0.9δ 0 + 0.1δ 2yi. 2 1 0 1 2 2 1 0 1 2 3 Gauss x x^ RMSE = 0.646 2 1 0 1 2 2 1 0 1 2 Dantzig x x^ RMSE = 0.053 2 1 0 1 2 2 1 0 1 2 Oracle x x^ RMSE = 0.042 Dantzig decoding (2) achieves almost the same accuracy as if v were known. Roger Koenker (U. of Illinois) Beyond the Average Man FEMES/Taipei: July, 2007 24 / 28

Multivariate Quantile Regression and Endogoneity The search for a satisfactory notion of multivariate quantiles has become a quest for the statistical holy grail in recent years. Even the multivariate median has several competing definitions Quantile proposals by Chaudhuri, Koltchinskii, Liu and Singh,... Wei (2007) offers a promising approach based on recursive conditioning. Antecedants in the debate over Wold s causal chain models, and Chesher s recent work on endogeneity in QR Models. Roger Koenker (U. of Illinois) Beyond the Average Man FEMES/Taipei: July, 2007 25 / 28

Generalized Growth Curves (Wei (2007)) Problem: Estimate a family of conditional quantile contours for height and weight of young children, conditioning on age, parental heights and weights, and possibly other covariates. Model: Let x denote a vector of covariates and H and W denote height and weight: Q H (τ x) = h(x; γ H (τ)) Q W (τ H, x) = g(h, x; γ W (τ)) where, for convenience, g and h are linear in parameters, after expanding the covariates in B-spline basis functions. Simulation: Fix x, and draw pairs (τ H, τ W ) from (U 1, U 2 ) U[0, 1] 2 and evaluate (Q H (U H x), Q W (U W Q H (U H x), x)) and now estimate density contours, Rosenblatt (1952). Roger Koenker (U. of Illinois) Beyond the Average Man FEMES/Taipei: July, 2007 26 / 28

Height-Weight Contours for Two Two-Year Olds Age = 2 Age = 2 E Weight (Pound) 20 25 30 35 Average parental height = 63 inch Weight (Pound) 20 25 30 35 Average parental height = 72 inch 30 32 34 36 38 Height (Inch) 30 32 34 36 38 Height (Inch) Dashed lines are reference quantile contours for τ {0.5, 0.8, 0.9} for two year olds. Red contours conditional on parents height. Ying Wei (2007). Roger Koenker (U. of Illinois) Beyond the Average Man FEMES/Taipei: July, 2007 27 / 28

Conclusions The Average Man is alive, but unwell, drinking in Paris. Fortunately, we aren t all converging to his unhappy state. Heterogeneity is essential to most economic phenomena, but we need better econometric methods to explore it. Empirical methods based on global likelihoods are often implausible and non-robust. More flexible local (unlikelihood) methods are preferable for exploratory data analysis. Some progress has been made, but there are many challenging problems that lie ahead. Roger Koenker (U. of Illinois) Beyond the Average Man FEMES/Taipei: July, 2007 28 / 28