Duration-Based Volatility Estimation

Similar documents
Vast Volatility Matrix Estimation for High Frequency Data

Volatility, Information Feedback and Market Microstructure Noise: A Tale of Two Regimes

COMPARISON OF GMM WITH SECOND-ORDER LEAST SQUARES ESTIMATION IN NONLINEAR MODELS. Abstract

Understanding Regressions with Observations Collected at High Frequency over Long Span

Subsampling Cumulative Covariance Estimator

Finite Sample Analysis of Weighted Realized Covariance with Noisy Asynchronous Observations

Introduction to Algorithmic Trading Strategies Lecture 10

Short-time expansions for close-to-the-money options under a Lévy jump model with stochastic volatility

A Modified Fractionally Co-integrated VAR for Predicting Returns

Modeling Ultra-High-Frequency Multivariate Financial Data by Monte Carlo Simulation Methods

Fractals. Mandelbrot defines a fractal set as one in which the fractal dimension is strictly greater than the topological dimension.

Lecture 7 Introduction to Statistical Decision Theory

Multiscale and multilevel technique for consistent segmentation of nonstationary time series

Econometrics. Week 4. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

THE UNIVERSITY OF CHICAGO ESTIMATING THE INTEGRATED PARAMETER OF THE LOCALLY PARAMETRIC MODEL IN HIGH-FREQUENCY DATA A DISSERTATION SUBMITTED TO

sparse and low-rank tensor recovery Cubic-Sketching

Portfolio Allocation using High Frequency Data. Jianqing Fan

Location Multiplicative Error Model. Asymptotic Inference and Empirical Analysis

Dynamic Models for Volatility and Heavy Tails by Andrew Harvey

Bootstrap tests of multiple inequality restrictions on variance ratios

Reliability of inference (1 of 2 lectures)

Small-time asymptotics of stopped Lévy bridges and simulation schemes with controlled bias

CONTAGION VERSUS FLIGHT TO QUALITY IN FINANCIAL MARKETS

On Lead-Lag Estimation

Session 5B: A worked example EGARCH model

Forecasting in the presence of recent structural breaks

Supplementary Appendix to Dynamic Asset Price Jumps: the Performance of High Frequency Tests and Measures, and the Robustness of Inference

Introduction to Rare Event Simulation

Practical unbiased Monte Carlo for Uncertainty Quantification

Statistical inference on Lévy processes

Robust Backtesting Tests for Value-at-Risk Models

Realized Covariance Tick-by-Tick in Presence of Rounded Time Stamps and General Microstructure Effects

Volatility. Gerald P. Dwyer. February Clemson University

Stochastic Processes

Econ 423 Lecture Notes: Additional Topics in Time Series 1

First-Passage Statistics of Extreme Values

1 Estimation of Persistent Dynamic Panel Data. Motivation

Empirical likelihood and self-weighting approach for hypothesis testing of infinite variance processes and its applications

Econometrics of Panel Data

MARTINGALES: VARIANCE. EY = 0 Var (Y ) = σ 2

Time Series Models for Measuring Market Risk

Asymptotic inference for a nonstationary double ar(1) model

On Realized Volatility, Covariance and Hedging Coefficient of the Nikkei-225 Futures with Micro-Market Noise

If we want to analyze experimental or simulated data we might encounter the following tasks:

Monte-Carlo MMD-MA, Université Paris-Dauphine. Xiaolu Tan

arxiv: v1 [q-fin.st] 18 Sep 2017

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

13. Parameter Estimation. ECE 830, Spring 2014

Supplement to Quantile-Based Nonparametric Inference for First-Price Auctions

Online Appendix to Correcting Estimation Bias in Dynamic Term Structure Models

Generalized Autoregressive Score Models

Non-nested model selection. in unstable environments

Gravity Models, PPML Estimation and the Bias of the Robust Standard Errors

Malliavin Calculus in Finance

LM threshold unit root tests

HETEROSKEDASTICITY, TEMPORAL AND SPATIAL CORRELATION MATTER

Research Division Federal Reserve Bank of St. Louis Working Paper Series

Heteroskedasticity in Time Series

Econometric Forecasting

Time series models in the Frequency domain. The power spectrum, Spectral analysis

The Bootstrap: Theory and Applications. Biing-Shen Kuo National Chengchi University

An estimate of the long-run covariance matrix, Ω, is necessary to calculate asymptotic

1 Linear Difference Equations

Long Memory through Marginalization

DEPARTMENT OF ECONOMICS

Detection of structural breaks in multivariate time series

A stochastic particle system for the Burgers equation.

Autoencoders and Score Matching. Based Models. Kevin Swersky Marc Aurelio Ranzato David Buchman Benjamin M. Marlin Nando de Freitas

A framework for adaptive Monte-Carlo procedures

New stylized facts in financial markets: The Omori law and price impact of a single transaction in financial markets

Symmetric btw positive & negative prior returns. where c is referred to as risk premium, which is expected to be positive.

Data Integration for Big Data Analysis for finite population inference

Nonstationary Time Series:

Introduction General Framework Toy models Discrete Markov model Data Analysis Conclusion. The Micro-Price. Sasha Stoikov. Cornell University

SHSU ECONOMICS WORKING PAPER

GMM, HAC estimators, & Standard Errors for Business Cycle Statistics

A Test of Cointegration Rank Based Title Component Analysis.


Minjing Tao and Yazhen Wang. University of Wisconsin-Madison. Qiwei Yao. London School of Economics. Jian Zou

Discussion of Tests of Equal Predictive Ability with Real-Time Data by T. E. Clark and M.W. McCracken

Dynamic Risk Measures and Nonlinear Expectations with Markov Chain noise

Econometrics I, Estimation

Drawing Inferences from Statistics Based on Multiyear Asset Returns

interval forecasting

MS&E 226: Small Data. Lecture 11: Maximum likelihood (v2) Ramesh Johari

10. Linear Models and Maximum Likelihood Estimation

Backtesting Marginal Expected Shortfall and Related Systemic Risk Measures

A nonparametric test for seasonal unit roots

Upper bound for optimal value of risk averse multistage problems

Discussion of Bootstrap prediction intervals for linear, nonlinear, and nonparametric autoregressions, by Li Pan and Dimitris Politis

Incentives Work: Getting Teachers to Come to School. Esther Duflo, Rema Hanna, and Stephen Ryan. Web Appendix

Generalized Autoregressive Score Smoothers

Pricing and Risk Analysis of a Long-Term Care Insurance Contract in a non-markov Multi-State Model

Uniform Inference for Conditional Factor Models with Instrumental and Idiosyncratic Betas

Backtesting Marginal Expected Shortfall and Related Systemic Risk Measures

DSGE Methods. Estimation of DSGE models: GMM and Indirect Inference. Willi Mutschler, M.Sc.

Brief Review on Estimation Theory

Gaussian, Markov and stationary processes

Computationally Efficient Estimation of Multilevel High-Dimensional Latent Variable Models

Variance Reduction Techniques for Monte Carlo Simulations with Stochastic Volatility Models

Transcription:

A Dual Approach to RV Torben G. Andersen, Northwestern University Dobrislav Dobrev, Federal Reserve Board of Governors Ernst Schaumburg, Northwestern Univeristy CHICAGO-ARGONNE INSTITUTE ON COMPUTATIONAL ECONOMICS University of Chicago, August 8th, 2008

Motivation The prevailing approach to high frequency volatility estimation is to measure the price change (return) per time unit We put forward the dual approach of measuring the time duration (passage time) per unit price change Our analysis fills an existing gap in the IV estimation literature by: 1 Developing a broad class of duration-based IV estimators 2 Identifying situations in which the duration-based approach is advantageous 3 Shedding new light on the microstructure properties of real data

Literature Review Magnitude-based approach to volatility estimation: 1 Discrete time (parametric): rich ARCH literature 2 Continuous time (non-parametric): rich RV literature Duration-based approach to volatility estimation: 1 Discrete time (parametric): rich ACD literature 2 Continuous time (non-parametric): lack of DV literature Cho and Frees (JF 88) consider non-parametric duration-based estimation under constant volatility but their procedure is not suitable for IV estimation

Our Contribution Develop duration-based analogues to RV, range-based RV (RRV), and other power/multipower variations Derive an asymptotic theory for our duration-based estimators showing consistency for IV and promising asymptotic efficiency Demonstrate excellent finite sample efficiency and robustness to both (finite activity) jumps and microstructure noise Document superior performance in comparison to subsampled BV both on simulated and real stock data

Main Idea Inherent duality between the increment h and corresponding passage time dt for a Brownian motion: E [ h 2 dt ] σ 2 dt E [dt h] h2 σ 2 Passage time moment subject to Jensen effect! Resolution: Use the reciprocal passage time [ ] 1 E dt h σ2 h 2 ˆσ 2 h2 h = const dt Passage time moment subject to Censoring effect! Resolution: Exploit the time reversibility of the Brownian motion Passage time moment subject to Discretization effect! Resolution: Apply discretization error theory for Brownian maxima

Integrated Variance Estimators Based On Passage Times Consider a fixed time grid 0 t 0 < t 1 <... < t N 1 consisting of N intervals with mesh size i = t i+1 t i τ + h (ti 1) τ h (ti) τ + h (ti) τ h (ti+1) t i 1 t i t i+1 From a sequence of unbiased local variance estimates ˆσ 2 h(t i ) we can construct an IV estimate Define DV as a duration-based counterpart to RV based on a sequence of reciprocal passage times instead of squared returns: DV N,h = N 1 i=0 ˆσ 2 h(t i ) i

Brownian Passage Times - A Multitude of Definitions Forward passage times for threshold h: inf {W t+θ W t = h} (first hitting time) θ>0 τ + h (t) = inf { W t+θ W t = h} (first exit time) θ>0 inf { sup W s inf W s = h} (first range time) θ>0 [t,t+θ] [t,t+θ] Backward passage times for threshold h: inf {W t θ W t = h} (first hitting time) θ>0 τ h (t) = inf { W t θ W t = h} (first exit time) θ>0 inf { sup W s inf W s = h} (first range time) θ>0 [t θ,t] [t θ,t]

Brownian Passage Times - A Multitude of Definitions 0.08 logp h -h Running Range 0.06 0.04 First Range Time First Hitting Time 0.02 0-0.02-0.04-0.06 First Exit Time

A Multitude of DV Estimators DV N,h = N 1 i=0 ˆσ 2 h(t i ) i Local estimators of σ 2 given by the scaled reciprocal passage times: ˆσ 2 h(t i ) = µ 1 1 h 2 τ h First exit time DV: based on reciprocal first exit times First range time DV: based on reciprocal first range times More generally, local estimators of σ p are given by the p/2 power of the reciprocal passage times: ˆσ p h (t i) = µ 1 p/2 τ p/2 h Can define natural DV analogues to power and multipower variations h p

Duality Theorem (Duality for magnitude and passage time functionals) Define the following standard Brownian functionals H t = sup B θ, θ [0;t] M t = sup B θ, R t = sup B θ θ [0;t] θ [0;t] inf θ [0;t] B θ and let (for h > 0) τ HT h = inf{t H t = h}, τ ET h = inf{t M t = h}, τ RT h = inf{t R t = h} be the first range time, first exit time, and first hitting time respectively. Then we have the following identities in distribution: D 1 D 1 H 1 = ( ) τ HT 1/2, M 1 = 1 ( τ ET 1 ) 1/2, R 1 D = 1 ( ) τ RT 1/2 1

Integrated Variance Estimators Based On Passage Times We establish the following main asymptotic result: ) ( 1 ) N ( DV N,h IV Mixed Normal 0, ν σ 4 u du 0, where 0.7681 (first exit time) ν 0.4073 (first range time) 2.0000 (first hitting time) is the variance factor of the individual passage time estimators at each grid point. Underlying assumptions: - No leverage - Lipschitz continuity of the volatility process - Mesh size = O ( N 1), threshold h = o(n 1/2 )

Subsampling DV Estimators In practice, the observation record is discrete and we only observe the value of the process at N grid points but not in between For a feasible version of DV consider coarser sub-grid {t i1,..., t ik } where K = o(n), K : ) ( 1 ) K ( DV K,h IV Mixed Normal 0, ν σ 4 u du 0 Convergence rate is now the slower K 1/2 Efficiency loss can be mitigated by averaging the estimator over all possible sub-grids of mesh size = K 1 The outcome is feasible DV akin to subsampled RV!

Main Challenges to IV Estimation Microstructure noise Jumps

Robustness of DV to Microstructure Noise An intuitive advantage of the passage time approach is that the threshold h can be chosen large enough to achieve noise-robustness To formalize this intuition we adopt an AR(1) noise structure: p i = p i + u i u i = ρu i 1 + ε i, where ε i N(0, (1 ρ 2 )ω 2 ), so that E[u i ] = 0, V[u i ] = ω 2 For ρ = 0 we obtain a Gaussian i.i.d. specification representative for transaction prices For ρ >> 0 we obtain a persistent autoregressive specification representative for quotes

Robustness of DV to Microstructure Noise Cont d Given a passage time transition on noisy data, it is possible to infer the expected magnitude of the latent transition AR(1) noise leads to an upward bias of our reciprocal passage time estimators We plot the upward bias factor as a function of the noise persistence ρ for two different noise-to-signal ratios: 1 λ = 0.25 (moderate noise) 2 λ = 1.00 (high noise)

Robustness of DV to Microstructure Noise Cont d Bias Fac ctor 1.20 1.18 1.16 1.14 112 1.12 1.10 1.08 1.06 1.04 1.02 1.00 0.98 Noise-to-Signal Ratio 0.25 First Exit Time Bias At Three Target Threshold Levels 3 5 7-0.999-0.99-0.95-0.9-0.75-0.5-0.4-0.25-0.15-0.1-0.05 0 0.05 0.1 0.15 0.25 0.4 0.5 0.75 0.9 0.95 0.99 0.999 Noise Persistence Negligible bias for moderate noise (λ = 0.25) at all persistence levels ρ

Robustness of DV to Microstructure Noise Cont d Bias Fac ctor 1.20 1.18 1.16 1.14 112 1.12 1.10 1.08 1.06 1.04 1.02 1.00 0.98 Noise-to-Signal Ratio 1.00 First Exit Time Bias At Three Target Threshold Levels 3 5 7-0.999-0.99-0.95-0.9-0.75-0.5-0.4-0.25-0.15-0.1-0.05 0 0.05 0.1 0.15 0.25 0.4 0.5 0.75 0.9 0.95 0.99 0.999 Noise Persistence Negligible bias for high noise (λ = 1.0) if sufficiently persistent (ρ > 0.9)

Robustness of DV to Jumps Another advantage of the passage time estimators is their inherent robustness to finite jumps Jumps above the threshold level h are effectively truncated! The lower the chosen threshold, the higher the degree of jump-robustness but... the lower the degree of noise-robustness To avoid lowering the threshold too much, we propose an asymptotically equivalent previous tick passage time estimator It utilizes the lower threshold level h corresponding to the level at one tick prior to the crossing of the target threshold h

Previous Tick Passage Times 0.07 Previous Tick Threshold Original Threshold logp 0.06 0.05 0.04 0.03 Previous Tick First Exit Time 0.02 0.01 First Exit Time 0-0.01-0.02 Better jump-robustness in finite samples than the standard passage times!

Empirical Analysis of The Robust DV Estimators Real Stock Data Simulated Stock Data

DV Analysis of The Dow Jones 30 We analyze the performance of DV on the Dow Jones 30 stocks for 601 trading days in the period January 1, 2005 to May 31, 2007 We work with mid-quotes known to have relatively low and persistent noise, so DV should be unbiased for moderate threshold levels DV is jump-robust, so we choose 2min subsampled BV as benchmark We produce DV signature plots for the mean, standard deviation, and correlation of DV with 2min subsampled BV Focus on first exit time DV and first range time DV for thresholds from 1 to 10 log-spreads

BV Signature Plot 0.00025 Ticker Symbol: (All) Subsampled BV Subsampled BV1 Subsampled RV 0.00020 Mean 0.00015 0.00010 0.00005 0.00000 3 15 30 60 120 Sample Frequency (seconds) 180 300 600 BV is downward biased at higher frequencies, so 2min is close to optimal!

DV Signature Plots: Mean 0.00025 Ticker symbol: (All) Range Time DV Exit Time DV 2min Subsampled RV 2min Subsampled BV1 2min Subsampled BV 0.00020 Mean 0.00015 0.00010 0.00005 0.00000 1 2 3 4 5 6 7 8 9 10 Threshold factor (x average log spread) DV for thresholds above 4 log-spreads has the same mean as BV!

DV Signature Plots: StdDev 0.00025 Ticker symbol: (All) Range Time DV Exit Time DV 2min Subsampled RV 2min Subsampled BV1 2min Subsampled BV 0.00020 StdDev 0.00015 0.00010 0.00005 0.00000 1 2 3 4 5 6 7 8 9 10 Threshold factor (x average log spread) DV has markedly lower standard deviation than BV!

DV Signature Plots: Correlation with BV tion sampled BV Correlat with 2min Subs 1 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 Exit Time DV Range Time DV 1 2 3 4 5 6 7 8 9 10 Threshold Factor (x Average Log Spread) DV for thresholds above 4 log-spreads is highly correlated with BV!

Monte Carlo Study of DV Very similar results across different SV models Scenario of main interest: jump days with 25% mean jump contribution to IV and U-shape volatility pattern The adopted AR(1) noise structure gives rise to five distinct cases: 1 No noise 2 Non-persistent noise (ρ = 0) at moderate level (λ = 0.25) 3 Persistent noise (ρ = 0.99) at moderate level (λ = 0.25) 4 Non-persistent noise (ρ = 0) at high level (λ = 1.00) 5 Persistent noise (ρ = 0.99) at high level (λ = 1.00) Compare the performance of DV vis-a-vis 2min subsampled BV by: (i) A relative bias measure: mean of ÎV /IV (ii) A relative MSE measure: mean of 195(ÎV IV ) 2 /IQ

Monte Carlo Study of DV 1/5: No Noise s Relative Bias 1.5 1.4 1.3 1.2 1.1 1 0.9 0.8 0.7 0.6 0.5 Model SV2A-UJ with U-Shaped Pattern and 25% Mean Jump Contribution to IV Avg Sample Freq 3sec without Microstructure Noise Robust Exit Time DV Robust Range Time DV 2min Subsampled BV 1 2 3 4 5 6 7 8 9 10 E Relative MSE 4 3.5 3 2.5 2 1.5 1 0.5 0 1 2 3 4 5 6 7 8 9 10 Threshold Factor (x Average Log Spread)

Monte Carlo Study 2/5: Moderate Nonpersistent Noise s Relative Bias 1.5 1.4 1.3 1.2 1.1 1 0.9 0.8 0.7 0.6 0.5 Model SV2A-UJ with U-Shaped Pattern and 25% Mean Jump Contribution to IV Avg Sample Freq 3sec with Non-Persistent Noise at Moderate Level Robust Exit Time DV Robust Range Time DV 2min Subsampled BV 1 2 3 4 5 6 7 8 9 10 E Relative MSE 4 3.5 3 2.5 2 1.5 1 0.5 0 1 2 3 4 5 6 7 8 9 10 Threshold Factor (x Average Log Spread)

Monte Carlo Study of DV 3/5: Moderate Persistent Noise s Relative Bias 1.5 1.4 1.3 1.2 1.1 1 0.9 0.8 0.7 0.6 0.5 Model SV2A-UJ with U-Shaped Pattern and 25% Mean Jump Contribution to IV Avg Sample Freq 3sec with Persistent Noise at Moderate Level Robust Exit Time DV Robust Range Time DV 2min Subsampled BV 1 2 3 4 5 6 7 8 9 10 E Relative MSE 4 3.5 3 2.5 2 1.5 1 0.5 0 1 2 3 4 5 6 7 8 9 10 Threshold Factor (x Average Log Spread)

Monte Carlo Study of DV 4/5: High Nonpersistent Noise s Relative Bias 1.5 1.4 1.3 1.2 1.1 1 0.9 0.8 0.7 0.6 0.5 Model SV2A-UJ with U-Shaped Pattern and 25% Mean Jump Contribution to IV Avg Sample Freq 3sec with Non-Persistent Noise at High Level Robust Exit Time DV Robust Range Time DV 2min Subsampled BV 1 2 3 4 5 6 7 8 9 10 E Relative MSE 4 3.5 3 2.5 2 1.5 1 0.5 0 1 2 3 4 5 6 7 8 9 10 Threshold Factor (x Average Log Spread)

Monte Carlo Study of DV 5/5: High Persistent Noise s Relative Bias 1.5 1.4 1.3 1.2 1.1 1 0.9 0.8 0.7 0.6 0.5 Model SV2A-UJ with U-Shaped Pattern and 25% Mean Jump Contribution to IV Avg Sample Freq 3sec with Persistent Noise at High Level Robust Exit Time DV Robust Range Time DV 2min Subsampled BV 1 2 3 4 5 6 7 8 9 10 E Relative MSE 4 3.5 3 2.5 2 1.5 1 0.5 0 1 2 3 4 5 6 7 8 9 10 Threshold Factor (x Average Log Spread)

Summary and Conclusions Novel dual approach to realized return variation measurement based on reciprocal passage times Duration-based counterparts to RV, range-based RV (RRV), and other power/multipower variations Asymptotic theory showing consistency for IV and promising asymptotic efficiency Natural robustness to both jumps and microstructure noise Promising finite sample efficiency in comparison to subsampled BV both on simulated and real stock data Exciting to do list!