Factor Investing using Penalized Principal Components
|
|
- Edith Flowers
- 5 years ago
- Views:
Transcription
1 Factor Investing using Penalized Principal Components Markus Pelger Martin Lettau 2 Stanford University 2 UC Berkeley February 8th 28 AI in Fintech Forum 28
2 Motivation Motivation: Asset Pricing with Risk Factors The Challenge of Asset Pricing Most important question in finance: Why are prices different for different assets? Fundamental insight: Arbitrage Pricing Theory: Prices of financial assets should be explained by systematic risk factors. Problem: Chaos in asset pricing factors: Over 3 potential asset pricing factors published! Fundamental question: Which factors are really important in explaining expected returns? Which are subsumed by others? Goals of this paper: Bring order into factor chaos Summarize the pricing information of a large number of assets with a small number of factors
3 Motivation Why is it important? Importance of factors for investing Optimal portfolio construction Only factors are compensated for systematic risk Optimal portfolio with highest Sharpe-ratio must be based on factor portfolios (Sharpe-ratio=expected excess return/standard deviation) Smart beta investments = exposure to risk factors 2 Arbitrage opportunities Find underpriced assets and earn alpha 3 Risk management Factors explain risk-return trade-off Factors allow to manage systematic risk exposure 2
4 Motivation Contribution of this paper Contribution This Paper: Estimation approach for finding risk factors and optimal factor investing Key elements of estimator: Statistical factors instead of pre-specified (and potentially miss-specified) factors 2 Uses information from large panel data sets: Many assets with many time observations 3 Searches for factors explaining asset prices (explain differences in expected returns) not only co-movement in the data 4 Allows time-variation in factor structure 3
5 Motivation Contribution of this paper Results Asymptotic distribution theory for weak and strong factors No blackbox approach Estimator discovers weak factors with high Sharpe-ratios high Sharpe-ratio factors important for asset pricing and investment Estimator strongly dominates conventional approach (Principal Component Analysis (PCA)) PCA does not find all high Sharpe-ratio factors Empirical results: Investment: 3 times higher Sharpe-ratio then benchmark factors (PCA) Asset Pricing: Smaller pricing errors in- and out-of sample than benchmark (PCA, 5 Fama-French factors, etc.) 4
6 Model The Model Approximate Factor Model Observe excess returns of N assets over T time periods: X t,i = F t K }{{} factors Λ i K }{{} loadings + e t,i }{{} idiosyncratic i =,..., N t =,..., T Matrix notation X }{{} T N = }{{} F }{{} Λ T K K N + e }{{} T N N assets (large) T time-series observation (large) K systematic factors (fixed) F, Λ and e are unknown 5
7 Model The Model Approximate Factor Model Systematic and non-systematic risk (F and e uncorrelated): Var(X ) = ΛVar(F )Λ + }{{} Var(e) }{{} systematic non systematic Systematic factors should explain a large portion of the variance Idiosyncratic risk can be weakly correlated Arbitrage-Pricing Theory (APT): The expected excess return is explained by the risk-premium of the factors: E[X i ] = E[F ]Λ i Systematic factors should explain the cross-section of expected returns 6
8 Model The Model: Estimation of Latent Factors Conventional approach: PCA (Principal component analysis) Apply PCA to the sample covariance matrix: T X X X X with X = sample mean of asset excess returns Eigenvectors of largest eigenvalues estimate loadings ˆΛ. Much better approach: Risk-Premium PCA (RP-PCA) Apply PCA to a covariance matrix with overweighted mean T X X + γ X X γ = risk-premium weight Eigenvectors of largest eigenvalues estimate loadings ˆΛ. ˆF estimator for factors: ˆF = N X ˆΛ = X (ˆΛ ˆΛ) ˆΛ. 7
9 Model The Model: Objective Function Conventional PCA: Objective Function Minimize the unexplained variance: min Λ,F NT N T (X ti F t Λ i ) 2 i= t= RP-PCA (Risk-Premium PCA): Objective Function Minimize jointly the unexplained variance and pricing error min Λ,F N T (X ti F t Λ i ) 2 +γ N ( X i F Λ ) 2 i NT N i= t= i= }{{}}{{} unexplained variation pricing error with X i = T T t= X t,i and F = T T t= F t and risk-premium weight γ 8
10 Model The Model: Interpretation Interpretation of Risk-Premium-PCA (RP-PCA): Combines variation and pricing error criterion functions: Select factors with small cross-sectional pricing errors (alpha s). Protects against spurious factor with vanishing loadings as it requires the time-series errors to be small as well. 2 Penalized PCA: Search for factors explaining the time-series but penalizes low Sharpe-ratios. 3 Information interpretation: PCA of a covariance matrix uses only the second moment but ignores first moment Using more information leads to more efficient estimates. RP-PCA combines first and second moments efficiently. 9
11 Model The Model: Interpretation Interpretation of Risk-Premium-PCA (RP-PCA): continued 4 Signal-strengthening: Intuitively the matrix T X X + γ X X converges to Λ ( Σ F + ( + γ)µ F µ ) F Λ + Var(e) with Σ F = Var(F ) and µ F = E[F ]. The signal of weak factors with a small variance can be pushed up by their mean with the right γ.
12 Illustration Illustration (Size and accrual) Illustration: Anomaly-sorted portfolios (Size and accrual) Factors PCA: Estimation based on PCA of correlation matrix, K = 3 2 RP-PCA: K = 3 and γ = 3 FF-long/short: market, size and accrual (based on extreme quantiles, same construction as Fama-French factors) Data Double-sorted portfolios according to size and accrual (from Kenneth French s website) Monthly return data from 7/963 to 5/27 (T = 647) for N = 25 portfolios Out-of-sample: Rolling window of 2 years (T=24) Optimal factor investing: maximum Sharpe-ratio portfolio R opt = F Σ F µ F
13 Illustration Illustration (Size and accrual) In-sample Out-of-sample SR RMS α Idio. Var. SR RMS α Idio. Var. RP-PCA PCA FF-long/short Table: Maximal Sharpe-ratios, root-mean-squared pricing errors and unexplained idiosyncratic variation. K = 3 factors and γ =. SR: Maximum Sharpe-ratio of linear combination of factors Cross-sectional pricing errors α: Pricing error α i = E[X i ] E[F ]Λ i N RMS α: Root-mean-squared pricing errors N i= α i 2 N T Idiosyncratic Variation: NT i= t= (X t,i Ft Λ i ) 2 RP-PCA significantly better than PCA and quantile-sorted factors. 2
14 Illustration Loadings for statistical factors (Size and Accrual).5. PCA factor.5 2. PCA factor.5 3. PCA factor Loadings Loadings Loadings Portfolio.5. RP-PCA factor Portfolio.5 2. RP-PCA factor Portfolio.5 3. RP-PCA factor Loadings Loadings Loadings Portfolio Portfolio Portfolio RP-PCA detects accrual factor while 3rd PCA factor is noise. 3
15 Illustration Maximal Sharpe ratio (Size and accrual) SR (In-sample) SR (Out-of-sample).35 =- = =.3 = =5 = factor 2 factors 3 factors factor 2 factors 3 factors Figure: Maximal Sharpe-ratio by adding factors incrementally. st and 2nd PCA and RP-PCA factors the same. RP-PCA detects 3rd factor (accrual) for γ >. 4
16 Weak vs. Strong Factors The Model Strong vs. weak factor models Strong factor model ( N Λ Λ bounded) Interpretation: strong factors affect most assets (proportional to N), e.g. market factor Strong factors lead to exploding eigenvalues RP-PCA always more efficient than PCA optimal γ relatively small Weak factor model (Λ Λ bounded) Interpretation: weak factors affect a smaller fraction of assets Weak factors lead to large but bounded eigenvalues RP-PCA detects weak factors which cannot be detected by PCA optimal γ relatively large In Data: Typically -2 strong factors and several weak factors with high Sharpe-ratios 5
17 Weak Factor Model Weak Factor Model Weak Factor Model Weak factors have small variance or affect small fraction of assets Λ Λ bounded (after normalizing factor variances) Statistical model: Spiked covariance from random matrix theory Eigenvalues of sample covariance matrix separate into two areas: The bulk, majority of eigenvalues The extremes, a few large outliers Bulk spectrum converges to generalized Marchenko-Pastur distrib. Large eigenvalues converge either to A biased value (characterized by the Stieltjes transform) To the bulk of the spectrum if the true eigenvalue is too small Phase transition phenomena: estimated eigenvectors orthogonal to true eigenvectors if eigenvalues too small 6
18 Weak Factor Model Weak Factor Model Intuition: Weak Factor Model Signal matrix for PCA of covariance matrix: ΛΣ F Λ K largest eigenvalues θ PCA,..., θk PCA measure strength of signal Signal matrix for RP-PCA: Λ ( Σ F + ( + γ)µ F µ ) F Λ K largest eigenvalues θ RP-PCA,..., θk RP-PCA measure strength of signal If µ F and γ > then RP-PCA signal always larger than PCA signal: θ RP-PCA i > θ PCA i 7
19 Weak Factor Model Weak Factor Model Theorem : Risk-Premium PCA under weak factor model The correlation of the estimated with the true factors is ρ Ĉorr(F, ˆF ) p }{{} Ũ ρ }{{} Ṽ rotation rotation ρ K with ρ 2 i { p +θ i B(θ i if θ )) i > θ crit otherwise Critical value θ crit and function B(.) depend only on the noise distribution and are known in closed-form Based on closed-form expression choose optimal RP-weight γ For θ i > θ crit ρ 2 i is strictly increasing in θ i. RP-PCA strictly dominates PCA 8
20 Strong Factor Model Strong Factor Model Strong Factor Model Strong factors affect most assets: e.g. market factor N Λ Λ bounded (after normalizing factor variances) Factors and loadings can be estimated consistently and are asymptotically normal distributed Assumptions more general than in weak factor model Asymptotic Efficiency Choose RP-weight γ to obtain smallest asymptotic variance of estimators RP-PCA (i.e. γ > ) always more efficient than PCA In most cases optimal γ = (smaller than in weak factor model) RP-PCA and PCA are both consistent 9
21 Time-varying loadings Time-varying loadings Model with time-varying loadings Observe panel of excess returns and L covariates Z i,t,l : X t,i = F t K g K (Z i,t,,..., Z i,t,l ) + e t,i Loadings are function of L covariates Z i,t,l with l =,..., L e.g. characteristics like size, book-to-market ratio, past returns,... Factors and loading function are latent Idea: Approximate non-linear function g by basis functions Project return data on appropriate basis functions Equivalent to apply RP-PCA to portfolio data Special case: Characteristics sorted portfolios corresponds to kernel basis functions General case: Obtain arbitrary interactions and break curse of dimensionality by conditional tree sorting projection 2
22 Empirical Results Single-sorted portfolios Portfolio Data Monthly return data from 7/963 to 2/26 (T = 638) for N = 37 portfolios Kozak, Nagel and Santosh (27) data: 37 decile portfolios sorted according to 37 anomalies Factors: RP-PCA: K = 6 and γ =. 2 PCA: K = 6 3 Fama-French 5: The five factor model of Fama-French (market, size, value, investment and operating profitability, all from Kenneth French s website). 4 Proxy factors: RP-PCA and PCA factors approximated with 5% of largest position. 2
23 Empirical Results Single-sorted portfolios In-sample Out-of-sample SR RMS α Idio. Var. SR RMS α Idio. Var. RP-PCA PCA Fama-French Table: Deciles of 37 single-sorted portfolios from 7/963 to 2/26 (N = 37 and T = 638): Maximal Sharpe-ratios, root-mean-squared pricing errors and unexplained idiosyncratic variation. K = 6 statistical factors. RP-PCA strongly dominates PCA and Fama-French 5 factors Results hold out-of-sample. 22
24 Empirical Results Single-sorted portfolios: Maximal Sharpe-ratio SR (In-sample) SR (Out-of-sample) factors 4 factors 5 factors 6 factors 7 factors RP-PCA RP-PCA Proxy PCA PCA Proxy RP-PCA RP-PCA Proxy PCA PCA Proxy Figure: Maximal Sharpe-ratios. Spike in Sharpe-ratio for 6 factors 23
25 Empirical Results Single-sorted portfolios: Pricing error.25 RMS (In-sample).25 RMS (Out-of-sample) factors 4 factors 5 factors 6 factors 7 factors..5 RP-PCA RP-PCA Proxy PCA PCA Proxy RP-PCA RP-PCA Proxy PCA PCA Proxy Figure: Root-mean-squared pricing errors. RP-PCA has smaller out-of-sample pricing errors 24
26 Empirical Results Single-sorted portfolios: Idiosyncratic Variation 5 Idiosyncratic Variation (In-sample) 5 Idiosyncratic Variation (Out-of-sample) factors 4 factors 5 factors 6 factors 7 factors 2 RP-PCA RP-PCA Proxy PCA PCA Proxy RP-PCA RP-PCA Proxy PCA PCA Proxy Figure: Unexplained idiosyncratic variation. Unexplained variation similar for RP-PCA and PCA 25
27 Empirical Results Single-sorted portfolios: Maximal Sharpe-ratio SR (In-sample) SR (Out-of-sample).7 =- = =.6 = =5 = factor 2 factors 3 factors 4 factors 5 factors 6 factors 7 factors.2. factor 2 factors 3 factors 4 factors 5 factors 6 factors 7 factors Figure: Maximal Sharpe-ratios for different RP-weights γ and number of factors K Strong increase in Sharpe-ratios for γ. 26
28 Empirical Results Optimal Portfolio with RP-PCA Composition of Stochast Discount Factor (RP-PCA) Industry Mom. Rev. Ind. Rel. Rev. (L.V.) Value-Mom-Prof. Industry Rel. Reversals High Decile Low Decile Sales Growth Value-Profitability Seasonality Value (M) Gross Profitability Momentum (2m) Idiosyncratic Volatility Long Run Reversals Asset Turnover Momentum-Reversals Sales/Price Return on Assets (A) Investment/Assets Value (A) Return on Book Equity (A) Gross Margins Composite Issuance Earnings/Price Size Momentum (6m) Accrual Cash Flows/Price Short-Term Reversals Value-Momentum Investment Growth Industry Momentum Asset Growth Dividend/Price Share Volume Net Operating Assets Price Investment/Capital Leverage Figure: Portfolio composition of highest Sharpe ratio portfolio (Stochastic Discount Factor) based on 6 RP-PCA factors. 27
29 Empirical Results Optimal Portfolio with RP-PCA (largest positions) Composition of Stochast Discount Factor (RP-PCA).3 High Decile Low Decile Industry Mom. Rev. Ind. Rel. Rev. (L.V.) Value-Mom-Prof. Industry Rel. Reversals Value-Profitability Seasonality Value (M) Gross Profitability Momentum (2m) Idiosyncratic Volatility Long Run Reversals Asset Turnover Momentum-Reversals Sales/Price Return on Assets (A) Figure: Largest portfolios in highest Sharpe ratio portfolio (Stochastic Discount Factor) based on 6 RP-PCA factors. 28
30 Empirical Results Optimal Portfolio with PCA Composition of Stochast Discount Factor (PCA) High Decile Low Decile Value-Profitability Asset Turnover Sales/Price Leverage Value-Mom-Prof. Gross Profitability Industry Mom. Rev. Earnings/Price Size Idiosyncratic Volatility Long Run Reversals Ind. Rel. Rev. (L.V.) Return on Book Equity (A) Dividend/Price Cash Flows/Price Value-Momentum Value (A) Return on Assets (A) Industry Momentum Momentum (6m) Momentum (2m) Composite Issuance Momentum-Reversals Investment/Assets Share Volume Short-Term Reversals Asset Growth Price Seasonality Gross Margins Investment/Capital Investment Growth Value (M) Industry Rel. Reversals Sales Growth Accrual Net Operating Assets Figure: Portfolio composition of highest Sharpe ratio portfolio (Stochastic Discount Factor) based on 6 PCA factors. 29
31 Empirical Results Optimal Portfolio with PCA (largest positions) Composition of Stochast Discount Factor (PCA).3 High Decile Low Decile Value-Mom-Prof. Value-Profitability Asset Turnover Sales/Price Leverage Gross Profitability Industry Mom. Rev. Earnings/Price Size Long Run Reversals Idiosyncratic Volatility Ind. Rel. Rev. (L.V.) Return on Book Equity (A) Dividend/Price Cash Flows/Price Figure: Largest portfolios in highest Sharpe ratio portfolio (Stochastic Discount Factor) based on 6 PCA factors. 3
32 Empirical Results Interpreting factors: Generalized correlations with proxies RP-PCA PCA. Gen. Corr Gen. Corr Gen. Corr Gen. Corr Gen. Corr Gen. Corr Table: Generalized correlations of statistical factors with proxy factors (portfolios of 5% of assets). Problem in interpreting factors: Factors only identified up to invertible linear transformations. Generalized correlations close to measure of how many factors two sets have in common. Proxy factors approximate statistical factors. 3
33 Empirical Results Interpreting factors: 6th proxy factor 6. Proxy RP-PCA Weights 6. Proxy PCA Weights Momentum (6m).28 Leverage.33 Momentum (6m) 2.25 Asset Turnover.25 Value (M).25 Value-Profitability.25 Value-Momentum.23 Profitability.22 Industry Momentum.2 Asset Turnover 9.22 Industry Reversals 9.9 Sales/Price.2 Industry Momentum 2.9 Sales/Price 9.8 Momentum (6m) 3.8 Size.7 Idiosyncratic Volatility Value-Momentum-Profitability -.9 Industry Mom. Reversals -.8 Profitability Value-Momentum Value-Profitability -.2 Momentum (6m) -.2 Profitability Value-Momentum Value-Profitability Value-Momentum -.23 Profitability -.23 Short-Term Reversals -.24 Idiosyncratic Volatility -.24 Industry-Momentum -.24 Profitability Industry Rel. Reversals -.28 Asset Turnover Idiosyncratic Volatility -.38 Asset Turnover
34 Empirical Results Time-stability of loadings Figure: Time-varying rotated loadings for the first six factors. Loadings are estimated on a rolling window with 24 months. 33
35 Empirical Results Time-stability of loadings of individual stocks SR (In-sample) SR (Out-of-sample).6.4 factor 2 factors 3 factors 4 factors 5 factors 6 factors RP-PCA PCA RP-PCA PCA Figure: Stock RMS price (In-sample) data (N = 27 and T = 5): RMS Maximal (Out-of-sample) Sharpe-ratios for different number of factors. RP-weight γ = Stock price data from /972 to 2/26.3 (N = 27 and T = 5).2. Out-of-sample performance collapses Constant loading model inappropriate.2. 34
36 Empirical Results Time-stability of loadings of individual stocks Figure: Stock price data: Generalized correlations between loadings estimated on the whole time horizon and a rolling window 35
37 Conclusion Conclusion Methodology Estimator for estimating priced latent factors from large data sets Combines variation and pricing criterion function Asymptotic theory under weak and strong factor model assumption Detects weak factors with high Sharpe-ratio More efficient than conventional PCA Empirical Results Strongly dominates PCA of the covariance matrix. Potential to provide benchmark factors for horse races. RP-PCA factor investing outperforms benchmark models 36
38 Generalized Correlation Generalized Correlation Time-stability: Generalized correlations RP-PCA (total vs. time-varying) st GC 2nd GC 3rd GC 4th GC 5th GC 6th GC Year PCA (total vs. time-varying) st GC 2nd GC 3rd GC 4th GC 5th GC 6th GC Year Figure: Generalized correlations between loadings estimated on the whole time horizon T = 638 and a rolling window with 24 months for K = 6 factors. A
39 Time-stability of loadings of individual stocks Generalized Correlation Generalized Correlation RP-PCA (total vs. time-varying) st GC 2nd GC.5 3rd GC 4th GC 5th GC 6th GC Time PCA (total vs. time-varying) Time Figure: Stock price data (N = 27 and T = 5): Generalized correlations between loadings estimated on the whole time horizon and a rolling window with 24 months. A 2
40 Effect of Risk-Premium Weight γ.4 SR (In-sample).4 SR (Out-of-sample) factor 2 factors SR.2 3 factors SR RMS (In-sample) RMS (Out-of-sample) Idiosyncratic Variation (In-sample) Idiosyncratic Variation (Out-of-sample) 4 4 Variation 2 Variation Figure: Maximal Sharpe-ratios, root-mean-squared pricing errors and unexplained idiosyncratic variation. RP-PCA detects 3rd factor (accrual) for γ >. A 3
41 Literature (partial list) Large-dimensional factor models with strong factors Bai (23): Distribution theory Fan et al. (26): Projected PCA for time-varying loadings Kelly et al. (27): Instrumented PCA for time-varying loadings Pelger (26), Aït-Sahalia and Xiu (25): High-frequency Large-dimensional factor models with weak factors (based on random matrix theory) Onatski (22): Phase transition phenomena Benauch-Georges and Nadakuditi (2): Perturbation of large random matrices Asset-pricing factors Harvey and Liu (25): Lucky factors Clarke (25): Level, slope and curvature for stocks Kozak, Nagel and Santosh (25): PCA based factors Bryzgalova (26): Spurious factors A 4
42 Extreme deciles of single-sorted portfolios Portfolio Data Monthly return data from 7/963 to 2/26 (T = 638) for N = 64 portfolios Kozak, Nagel and Santosh (27) data: 37 decile portfolios sorted according to 37 anomalies Here we take only the lowest and highest decile portfolio for each anomaly (N = 64). Factors: RP-PCA: K = 6 and γ =. 2 PCA: K = 6 3 Fama-French 5: The five factor model of Fama-French (market, size, value, investment and operating profitability, all from Kenneth French s website). 4 Proxy factors: RP-PCA and PCA factors approximated with 8 largest positions. A 5
43 Extreme Deciles Anomaly Mean SD Sharpe-ratio Anomaly Mean SD Sharpe-ratio Accruals - accrual Momentum (2m) - mom Asset Turnover - aturnover Momentum-Reversals - momrev Cash Flows/Price - cfp Net Operating Assets - noa Composite Issuance - ciss Price - price Dividend/Price - divp Gross Profitability - prof Earnings/Price - ep Return on Assets (A) - roaa Gross Margins - gmargins Return on Book Equity (A) - roea Asset Growth - growth Seasonality - season Investment Growth - igrowth Sales Growth - sgrowth Industry Momentum - indmom Share Volume - shvol. 6.. Industry Mom. Reversals - indmomrev Size - size Industry Rel. Reversals - indrrev Sales/Price sp Industry Rel. Rev. (L.V.) - indrrevlv Short-Term Reversals - strev Investment/Assets - inv Value-Momentum - valmom Investment/Capital - invcap Value-Momentum-Prof. - valmomprof Idiosyncratic Volatility - ivol Value-Profitability - valprof Leverage - lev Value (A) - value Long Run Reversals - lrrev Value (M) - valuem Momentum (6m) - mom Table: Long-Short Portfolios of extreme deciles of 37 single-sorted portfolios from 7/963 to 2/26: Mean, standard deviation and Sharpe-ratio. A 6
44 Extreme Deciles In-sample Out-of-sample SR RMS α Idio. Var. SR RMS α Idio. Var. RP-PCA PCA RP-PCA Proxy PCA Proxy Fama-French Table: First and last decile of 37 single-sorted portfolios from 7/963 to 2/26 (N = 64 and T = 638): Maximal Sharpe-ratios, root-mean-squared pricing errors and unexplained idiosyncratic variation. K = 6 statistical factors. RP-PCA strongly dominates PCA and Fama-French 5 factors Results hold out-of-sample. A 7
45 Extreme Deciles: Maximal Sharpe-ratio SR (In-sample) SR (Out-of-sample) factors 4 factors 5 factors 6 factors 7 factors RP-PCA RP-PCA Proxy PCA PCA Proxy RP-PCA RP-PCA Proxy PCA PCA Proxy Figure: Maximal Sharpe-ratios. Spike in Sharpe-ratio for 6 factors A 8
46 Extreme Deciles: Pricing error.35 RMS (In-sample).35 RMS (Out-of-sample) factors 4 factors 5 factors 6 factors 7 factors.5..5 RP-PCA RP-PCA Proxy PCA PCA Proxy RP-PCA RP-PCA Proxy PCA PCA Proxy Figure: Root-mean-squared pricing errors. RP-PCA has smaller out-of-sample pricing errors A 9
47 Extreme Deciles: Idiosyncratic Variation 7 Idiosyncratic Variation (In-sample) 7 Idiosyncratic Variation (Out-of-sample) factors 4 factors 5 factors 6 factors 7 factors 3 2 RP-PCA RP-PCA Proxy PCA PCA Proxy RP-PCA RP-PCA Proxy PCA PCA Proxy Figure: Unexplained idiosyncratic variation. Unexplained variation similar for RP-PCA and PCA A
48 Extreme Deciles: Maximal Sharpe-ratio SR (In-sample) =- = = = =5 = SR (Out-of-sample) factor 2 factors 3 factors 4 factors 5 factors 6 factors 7 factors.2. factor 2 factors 3 factors 4 factors 5 factors 6 factors 7 factors Figure: First and last decile of 37 single-sorted portfolios (N = 64 and T = 638): Maximal Sharpe-ratios for different RP-weights γ and number of factors K. A
49 Interpreting factors: Generalized correlations with proxies RP-PCA PCA. Gen. Corr Gen. Corr Gen. Corr Gen. Corr Gen. Corr Gen. Corr Table: Generalized correlations of statistical factors with proxy factors (portfolios of 8 assets). Problem in interpreting factors: Factors only identified up to invertible linear transformations. Generalized correlations close to measure of how many factors two sets have in common. Proxy factors approximate statistical factors. A 2
50 Interpreting factors: Cumulative absolute proxy weights RP-PCA Proxy PCA Proxy Momentum (2m).7 Idiosyncratic Volatility.28 Idiosyncratic Volatility.65 Momentum (2m).4 Industry Rel. Rev. (L.V.).2 Momentum (6m).4 Momentum (6m).2 Price.95 Price.2 Asset Turnover.83 Industry Mom. Reversals. Industry Rel. Rev. (L.V.).79 Value (M).3 Value (M).75 Industry Rel. Reversals.84 Industry Momentum.73 Share Volume.55 Industry Mom. Reversals.73 Net Operating Assets.47 Dividend/Price.67 Value-Momentum-Prof..46 Gross Profitability.64 Size.42 Sales/Price.58 Return on Book Equity (A).32 Value-Profitability.58 Industry Momentum.29 Net Operating Assets.37 Value-Momentum.27 Value (A).36 Long Run Reversals.24 Value-Momentum-Prof..33 Earnings/Price.22 Cash Flows/Price.3 A 3
51 Interpreting factors: Composition of proxies 2. Proxy (RP-PCA) 3. Proxy (RP-PCA) 4. Proxy (RP-PCA) 5. Proxy (RP-PCA) 6. Proxy (RP-PCA) indrrevlv.54 valmomprof.7 mom2.28 mom price.38 indmomrev.52 indmomrev -.2 mom.26 mom -.28 mom.36 ivol.24 ivol -.2 valuem.25 valmomprof -.29 valuem.34 Accrual -.2 mom lrrev -.24 roea -.32 indrrev.32 shvol -.22 indrrevlv -.26 mom -.3 shvol -.33 indrrev -.26 ep -.22 indmomrev -.4 valuem -.44 price -.37 valmom -.27 indrrev -.25 indrrevlv -.4 price -.45 size -.42 indmom -.29 mom ivol -.67 mom noa -.47 ivol Proxy (PCA) 3. Proxy (PCA) 4. Proxy (PCA) 5. Proxy (PCA) 6. Proxy (PCA) ivol.59 valuem.46 mom.36 divp.3 valprof.33 indrrevlv.43 price.38 indmom.35 roea -.25 Aturnover.32 indmomrev.37 divp.37 mom2.34 shvol -.27 sp.27 indrrevlv.36 value.36 valmomprof.33 size -.28 prof.24 indmomrev.36 sp.32 valmom -.3 mom -.29 valprof -.25 ivol.27 lrrev.3 indmom -.35 noa -.37 prof -.4 mom2.3 cfp.3 mom mom ivol -.42 indmom.3 valuem -.29 mom -.39 price -.57 Aturnover -.5 Table: Portfolio-composition of proxy factors for first and last decile of 37 single-sorted portfolios: First proxy factors is an equally-weighted portfolio. A 4
52 Extreme Deciles: Time-Stability Figure: Time-varying rotated loadings for the first six factors. Loadings are estimated on a rolling window with 24 months. A 5
53 Generalized Correlation Generalized Correlation Generalized Correlation Generalized Correlation Extreme Deciles: Time-Stability RP-PCA (total vs. time-varying, K=4) RP-PCA (total vs. time-varying, K=6) st GC 2nd GC st GC 3rd GC.2 2nd GC 3rd GC.2 4th GC 5th GC 4th GC 6th GC Time Time PCA (total vs. time-varying, K=4) PCA (total vs. time-varying, K=6) st GC 2nd GC st GC 3rd GC.2 2nd GC 3rd GC.2 4th GC 5th GC 4th GC 6th GC Time Time Figure: Generalized correlations between loadings estimated on the whole time horizon T = 638 and a rolling window. A 6
54 Double-sorted portfolios Double-sorted portfolios Data Factors Monthly return data from 7/963 to 5/27 (T = 647) 3 double sorted portfolios (consisting of 25 portfolios) from Kenneth French s website PCA: K = 3 2 RP-PCA: K = 3 and γ = 3 FF-Long/Short factors: market + two specific anomaly long-short factors A 7
55 Sharpe-ratios and pricing errors (in-sample) Sharpe-Ratio α RPCA PCA FF-L/S RPCA PCA FF-L/S Size and BM BM and Investment BM and Profits Size and Accrual Size and Beta Size and Investment Size and Profits Size and Momentum Size and ST-Reversal Size and Idio. Vol Size and Total Vol Profits and Invest Size and LT-Reversal A 8
56 Sharpe-ratios and pricing errors (out-of-sample) Sharpe-Ratio α RPCA PCA FF-L/S RPCA PCA FF-L/S Size and BM BM and Investment BM and Profits Size and Accrual Size and Beta Size and Investment Size and Profits Size and Momentum Size and ST-Reversal Size and Idio. Vol Size and Total Vol Profits and Invest Size and LT-Reversal A 9
57 The Model: Objective function Time-series objective function: Minimize the unexplained variance: min Λ,F NT N i= t= T (X ti F t Λ i ) 2 = min Λ NT trace ( (XM Λ ) (XM Λ ) ) s.t. F = X (Λ Λ) Λ Projection matrix M Λ = I N Λ(Λ Λ) Λ Error (non-systematic risk): e = X F Λ = XM Λ Λ proportional to eigenvectors of the first K largest eigenvalues of NT X X minimizes time-series objective function Motivation for PCA A 2
58 The Model: Objective function Cross-sectional objective function: Minimize cross-sectional expected pricing error: N = N N (Ê[Xi ] Ê[F ]Λ i i= ) 2 N ( T X i T F Λ i i= ) 2 = N trace ( ( T XM Λ ) ( T XM Λ ) ) s.t. F = X (Λ Λ) Λ is vector T of s and thus F T estimates factor mean Why not estimate factors with cross-sectional objective function? Factors not identified Spurious factor detection (Bryzgalova (26)) A 2
59 The Model: Objective function Combined objective function: Risk-Premium-PCA min Λ,F ( (( )) ( NT trace (XM Λ ) (XM Λ + γ ) ( ) ) N trace T XM Λ T XM Λ ( ( = min Λ NT trace M Λ X I + γ ) T ) XM Λ s.t. F = X (Λ Λ) Λ The objective function is minimized by the eigenvectors of the largest eigenvalues of NT X ( I T + γ T ) X. ˆΛ estimator for loadings: proportional to eigenvectors of the first K eigenvalues of NT X ( I T + γ T ) X ˆF estimator for factors: N X ˆΛ = X (ˆΛ ˆΛ) ˆΛ. Estimator for the common component C = F Λ is Ĉ = ˆF ˆΛ A 22
60 The Model: Objective function Weighted Combined objective function: Straightforward extension to weighted objective function: min Λ,F NT trace(q (X F Λ ) (X F Λ )Q) + γ ) ( N trace (X F Λ )QQ (X F Λ ) = min trace (M ( Λ Q X I + γ ) Λ T ) XQM Λ s.t. F = X (Λ Λ) Λ Cross-sectional weighting matrix Q Factors and loadings can be estimated by applying PCA to Q X ( I + γ T ) XQ. Today: Only Q equal to inverse of a diagonal matrix of standard deviations. For γ = corresponds to PCA of a correlation matrix. Optimal choice of Q: GLS type argument A 23
61 Simulation Simulation parameters N = 25 and T = 35. Factors: K = 4. Factor represent the market with N(.2, 9): Sharpe-ratio of.4 2. Factor represents an industry factors following N(., ): Sharpe-ratio of.. 3. Factor follows N(.4, ): Sharpe-ratio of Factor has a small variance but high Sharpe-ratio. It follows N(.4,.6): Sharpe-ratio of. Loadings normalized such that N Λ Λ = I K. Λ i, = and Λ i,k N(, ) for k = 2, 3, 4. Errors: Cross-sectional and time-series correlation and heteroskedasticity in the residuals. /3 of variation due to systematic risk. A 24
62 Simulation parameters Errors Residuals are modeled as e = σ e D T A T ɛa N D N : ɛ is a T N matrix and follows a multivariate standard normal distribution Time-series correlation in errors: A T creates an AR() model with parameter ρ =. Cross-sectional correlation in errors: A N is a Toeplitz-matrix with (β, β, β) on the right three off-diagonals with β =.7 Cross-sectional heteroskedasticity: D N is a diagonal matrix with independent elements following N(,.2) Time-series heteroskedasticity: D T is a diagonal matrix with independent elements following N(,.2) Signal-to-noise ratio: σ 2 e = 9 Parameters produce eigenvalues that are consistent with the data. A 25
63 Simulation 6. Factor 2 2. Factor Factor 3 4. Factor True factor RP-PCA = RP-PCA = RP-PCA = RP-PCA = PCA Figure: Sample paths of the cumulative returns of the first four factors and the estimated factor processes. N = 25 and T = 35. A 26
64 Simulation True Factor γ = γ = γ = γ = 5 γ = In-sample SR α Idio. Var Ex. Var Out-of-sample SR α Idio. Var Ex. Var Table: (i) Maximal Sharpe-ratio (SR), (ii) root-mean-squared pricing error (α), unexplained variation and proportion of variation explained by K = 4 factors. N = 25 and T = 35. A 27
65 Simulation γ = γ = γ = γ = 5 γ = In-sample. Factor Factor Factor Factor Out-of-sample. Factor Factor Factor Factor Table: Correlation between estimated and true factors. The risk-premium weight γ = corresponds to PCA. N = 25 and T = 35. A 28
66 Simulation Simulation parameters Consider only K = factor Sharpe-ratio =, i.e. µ F = σ F Different signal strength σf 2 Λ i N(, ), i.e. normalized variance σf 2 N in weak factor model. Same residuals as before A 29
67 Simulation: Correlation, N=5, T=5 Correlation with true factor (In-sample) Correlation with true factor (Out-of-sample) =- = = =5 = True factor =.3 2 =.5 2 =.8 2 =. 2 =.3 2 =.5 2 =.8 2 =. Figure: Correlation of estimated with true factor. For signal σ 2 <.8 a value of γ is optimal. A 3
68 Simulation: Sharpe-ratio, N=5, T=5.2 SR (In-sample).2 SR (Out-of-sample) =- = = =5 = True factor =.3 2 =.5 2 =.8 2 =. 2 =.3 2 =.5 2 =.8 2 =. Figure: Maximal Sharpe-ratios. For signal σ 2 <.8 a value of γ is optimal. A 3
69 Simulation: Pricing Error, N=5, T=5 RMS (In-sample) RMS (Out-of-sample) =- = = =5 = True factor =.3 2 =.5 2 =.8 2 =. 2 =.3 2 =.5 2 =.8 2 =. Figure: Root-mean-squared pricing errors. For signal σ 2 <.8 a value of γ is optimal. A 32
70 Simulation: Idiosyncratic Variation, N=5, T=5 25 Idiosyncratic Variation (In-sample) 25 Idiosyncratic Variation (Out-of-sample) =- = = =5 = True factor 5 2 =.3 2 =.5 2 =.8 2 =. 2 =.3 2 =.5 2 =.8 2 =. Figure: Unexplained idiosyncratic variation. Unexplained variation similar for RP-PCA and PCA. A 33
71 Simulation: Correlation, N=25, T=25 Correlation with true factor (In-sample) Correlation with true factor (Out-of-sample) =- = = =5 = True factor =.3 2 =.5 2 =.8 2 =. 2 =.3 2 =.5 2 =.8 2 =. Figure: Correlation of estimated with true factor. For signal σ 2 <.8 a value of γ is optimal. A 34
72 Simulation: Correlation, N=5, T=25 Correlation with true factor (In-sample) Correlation with true factor (Out-of-sample) =- = = =5 = True factor =.3 2 =.5 2 =.8 2 =. 2 =.3 2 =.5 2 =.8 2 =. Figure: Correlation of estimated with true factor. For signal σ 2 <.8 a value of γ is optimal. A 35
73 Simulation: Correlation, N=, T=25 Correlation with true factor (In-sample) Correlation with true factor (Out-of-sample) =- = = =5 = True factor =.3 2 =.5 2 =.8 2 =. 2 =.3 2 =.5 2 =.8 2 =. Figure: Correlation of estimated with true factor. For signal σ 2 <.8 a value of γ is optimal. A 36
74 Simulation: Correlation, N=25, T=5 Correlation with true factor (In-sample) Correlation with true factor (Out-of-sample) =- = = =5 = True factor =.3 2 =.5 2 =.8 2 =. 2 =.3 2 =.5 2 =.8 2 =. Figure: Correlation of estimated with true factor. For signal σ 2 <.8 a value of γ is optimal. A 37
75 Simulation: Weak factor model prediction Statistical Model.5 PCA RP-PCA ( =) RP-PCA ( =) RP-PCA ( =5) Monte-Carlo Simulation.5 PCA RP-PCA ( =) RP-PCA ( =) RP-PCA ( =5) Correlations between estimated and true factor based on the weak factor model prediction and Monte-Carlo simulations for different variances of the factor. The residuals have cross-sectional and time-series correlation and A 38 heteroskedasticity. The Sharpe-ratio of the factor is. N = 25, T = 35.
76 Simulation: Weak factor model prediction Statistical Model.5 PCA RP-PCA ( =) RP-PCA ( =) RP-PCA ( =5) Monte-Carlo Simulation.5 PCA RP-PCA ( =) RP-PCA ( =) RP-PCA ( =5) Correlations between estimated and true factor based on the weak factor model prediction and Monte-Carlo simulations for different variances of the factor. The residuals have only cross-sectional correlation. The Sharpe-ratio of the A 39 factor is. N = 25, T = 35.
77 Weak Factor Model: Dependent residuals dependent residuals i.i.d residuals Figure: Values of ρ 2 i ( if θ +θ i B(ˆθ i )) i > σcrit 2 and otherwise) for different signals θ i. The average noise level is normalized in both cases to σe 2 = and c =. For the correlated residuals we assume that Σ /2 is a Toeplitz matrix with β, β, β on the right three off-diagonals with β =.7. N = 25, T = 35. A 4
78 Weak Factor Model Weak Factor Model Weak factors either have a small variance or affect a smaller fraction of assets: Λ Λ bounded (after normalizing factor variances) Statistical model: Spiked covariance models from random matrix theory Eigenvalues of sample covariance matrix separate into two areas: The bulk, majority of eigenvalues The extremes, a few large outliers Bulk spectrum converges to generalized Marchenko-Pastur distribution (under certain conditions) A 4
79 Weak Factor Model Weak Factor Model Large eigenvalues converge either to A biased value characterized by the Stieltjes transform of the bulk spectrum To the bulk of the spectrum if the true eigenvalue is below some critical threshold Phase transition phenomena: estimated eigenvectors orthogonal to true eigenvectors if eigenvalues too small Onatski (22): Weak factor model with phase transition phenomena Problem: All models in the literature assume that random processes have mean zero RP-PCA implicitly uses non-zero means of random variables New tools necessary! A 42
80 Weak Factor Model Assumption : Weak Factor Model Rate: Assume that N T c with < c <. 2 Factors: F are uncorrelated among each other and are independent of e and Λ and have bounded first two moments. ˆµ F := T p F t µf ˆΣF := σ 2 F T T FtF t p Σ F = t= σf 2 K 3 Loadings: The column vectors of the loadings Λ are orthogonally invariant and independent of ɛ and F (e.g. Λ i,k N(, N ) and Λ Λ = I K 4 Residuals: e = ɛσ with ɛ t,i N(, ). The empirical eigenvalue distribution function of Σ converges to a non-random spectral distribution function with compact support and supremum of support b. Largest eigenvalues of Σ converge to b. A 43
81 Weak Factor Model Intuition: Weak Factor Model Signal matrix for PCA of covariance matrix: ΛΣ F Λ K largest eigenvalues θ PCA,..., θk PCA measure strength of signal Signal matrix for RP-PCA: Λ ( Σ F + ( + γ)µ F µ ) F Λ K largest eigenvalues θ RP-PCA,..., θk RP-PCA measure strength of signal If µ F and γ > then RP-PCA signal always larger than PCA signal: θ RP-PCA i > θ PCA i A 44
82 Weak Factor Model Theorem : Risk-Premium PCA under weak factor model Under Assumption the correlation of the estimated with the true factors is ρ Ĉorr(F, ˆF ) p ρ 2 }{{} Ũ }{{} Ṽ rotation rotation ρ K with ρ 2 i { p +θ i B(θ i if θ )) i > θ crit otherwise Critical value θ crit and function B(.) depend only on the noise distribution and are known in closed-form Based on closed-form expression choose optimal RP-weight γ For θ i > θ crit ρ 2 i is strictly increasing in θ i. RP-PCA strictly dominates PCA A 45
83 Strong Factor Model Strong Factor Model Strong factors affect most assets: e.g. market factor N Λ Λ bounded (after normalizing factor variances) Statistical model: Bai and Ng (22) and Bai (23) framework Factors and loadings can be estimated consistently and are asymptotically normal distributed RP-PCA provides a more efficient estimator of the loadings Assumptions essentially identical to Bai (23) A 46
84 Strong Factor Model Asymptotic Distribution (up to rotation) PCA under assumptions of Bai (23): (up to rotation) Asymptotically ˆΛ behaves like OLS regression of F on X. Asymptotically ˆF behaves like OLS regression of Λ on X. RP-PCA under slightly stronger assumptions as in Bai (23): Asymptotically ( ˆΛ behaves ) like OLS regression of FW on XW with W 2 = I T + γ T and is a T vector of s. Asymptotically ˆF behaves like OLS regression of Λ on X. Asymptotic Efficiency Choose RP-weight γ to obtain smallest asymptotic variance of estimators RP-PCA (i.e. γ > ) always more efficient than PCA Optimal γ typically smaller than optimal value from weak factor model RP-PCA and PCA are both consistent A 47
85 Simplified Strong Factor Model Assumption 2: Simplified Strong Factor Model Rate: Same as in Assumption 2 Factors: Same as in Assumption 3 Loadings: Λ Λ/N p I K and all loadings are bounded. 4 Residuals: e = ɛσ with ɛ t,i N(, ). All elements and all row sums of Σ are bounded. A 48
86 Simplified Strong Factor Model Proposition: Simplified Strong Factor Model Assumption 2 holds. Then: The factors and loadings can be estimated consistently. 2 The asymptotic distribution of the factors is not affected by γ. 3 The asymptotic distribution of the loadings is given by ) D T (ˆΛi Λ i N(, Ωi ) Ω i =σe 2 ( i ΣF + ( + γ)µ F µ ) ( F ΣF + ( + γ) 2 µ F µ ) F ( ΣF + ( + γ)µ F µ ) F, E[e 2 t,i ] = σe 2 i 4 γ = is optimal choice for smallest asymptotic variance. γ =, i.e. the covariance matrix, is not efficient. A 49
87 Time-varying loadings Model with time-varying loadings Observe panel of excess returns and L covariates Z i,t,l : X t,i = F t K g K (Z i,t,,..., Z i,t,l ) + e t,i Loadings are function of L covariates Z i,t,l with l =,..., L e.g. characteristics like size, book-to-market ratio, past returns,... Factors and loading function are latent Idea: Similar to Projected PCA (Fan, Liao and Wang (26)) and Instrumented PCA (Kelly, Pruitt, Su (27)), but we include the pricing error penalty allow for general interactions between covariates A 5
88 Time-varying loadings Projected RP-PCA (work in progress) Approximate nonlinear function g k (.) by basis functions φ m (.): g k (Z i,t ) = M m= b m,k φ m (Z i,t ) g(z t ) = }{{}}{{} B Φ(Z t ) }{{} K N K M M N Apply RP-PCA to projected data X t = X t Φ(Z t ) X t = F t B Φ(Z t )Φ(Z t ) + e t Φ(Z t ) = F t B + ẽt Special case: φ m = {Zt I m} X characteristics sorted portfolios Obtain arbitrary interactions and break curse of dimensionality by conditional tree sorting projection Intuition: Projection creates M portfolios sorted on any functional form and interaction of covariates Z t. A 5
89 Weak Factor Model Assumption : Weak Factor Model Residual matrix can be represented as e = ɛσ with ɛ t,i N(, ). The empirical eigenvalue distribution function of Σ converges to a non-random spectral distribution function with compact support. The supremum of the support is b. 2 The factors F are uncorrelated among each other and are independent of e and Λ and have bounded first two moments. ˆµ F := T p F t µf ˆΣ F := σ 2 F T T FtF t p Σ F = t= σf 2 K 3 The column vectors of the loadings Λ are orthogonally invariant and independent of ɛ and F (e.g. Λ i,k N(, N ) and Λ Λ = I K 4 Assume that N T c with < c <. A 52
90 Weak Factor Model Definition: Weak Factor Model Average idiosyncratic noise σ 2 e := trace(σ)/n Denote by λ λ 2... λ N the ordered eigenvalues of T e e. The Cauchy transform (also called Stieltjes transform) of the eigenvalues is the almost sure limit: G(z) := a.s. lim T N B-function N i= c B(z) :=a.s. lim T N z λ i N i= c =a.s. lim T N trace ( = a.s. lim T N trace (zi N ) T e e) λ i (z λ i ) 2 ( ( (zi N T e e) ) 2 ( ) ) T e e A 53
91 Weak Factor Model Estimator Risk-premium ( PCA (RP-PCA): ) Apply PCA estimation to S γ := X I T T + γ X T PCA : Apply PCA ( to estimated ) covariance matrix S := X I T T X, i.e. γ =. PCA special case of RP-PCA Signal Matrix for Covariance PCA T σf 2 + cσ 2 e M Var = Σ F + cσe 2 I K = σf 2 K + cσe 2 Intuition: Largest K true eigenvalues of S. A 54
92 Weak Factor Model Lemma: Covariance PCA Assumption holds. Define the critical value σcrit 2 = lim z b. The first K G(z) largest eigenvalues ˆλ i of S satisfy for i =,..., K ( ) p G ˆλ if σ i σ F 2 +cσ 2 F 2 i e i + cσe 2 > σcrit 2 b otherwise The correlation between the estimated and true factors converges to ϱ Ĉorr(F, ˆF ) p ϱ K with ϱ 2 i p { +(σ 2 F i +cσ 2 e )B(ˆλ i )) if σ 2 F i + cσ 2 e > σ 2 crit otherwise A 55
93 Weak Factor Model Corollary: Covariance PCA for i.i.d. errors Assumption holds, c and e t,i i.i.d. N(, σ 2 e ). The largest K eigenvalues of S have the following limiting values: ˆλ i p { σ 2 Fi + σ2 e σ 2 F i (c + + σ 2 e ) σ 2 e ( + c) 2 if σ 2 F i + cσ 2 e > σ 2 crit σ 2 F > cσ 2 e otherwise The correlation between the estimated and true factors converges to ϱ Ĉorr(F, ˆF ) p ϱ K with ϱ 2 i p cσ4 e σ 4 F i + cσ2 e σ 2 F i + σ4 e σ 4 F i (c 2 c) if σ 2 F i + cσ 2 e > σ 2 crit otherwise A 56
94 Weak Factor Model Signal Matrix for RP-PCA Signal Matrix for RP-PCA ( ) Σ M RP = F + cσe 2 Σ /2 F µ F ( + γ) µ F Σ /2 F ( + γ) ( + γ)(µ F µ F + cσ2) 2 Define γ = γ + and note that ( + γ) 2 = + γ. Projection on K demeaned factors and on mean operator. Denote by θ... θ K+ the eigenvalues of the signal matrix M RP and by Ũ the corresponding orthonormal eigenvectors : θ Ũ M RP Ũ = θ K+ Intuition: θ,..., θ K+ largest K + true eigenvalues of S γ. A 57
95 Weak Factor Model Theorem : Risk-Premium PCA under weak factor model Assumption holds. The first K largest eigenvalues ˆθ i i =,..., K of S γ satisfy { ( ) p G ˆθ i θ i if θ i > σcrit 2 = lim z b G(z) b otherwise The correlation of the estimated with the true factors converges to ρ Ĉorr(F, ˆF ) p ( I K ) ρ 2 Ũ. }{{}..... D /2 /2 K ˆΣ ˆF rotation ρ K }{{} rotation with ρ 2 i p { +θ i B( ˆθ i )) if θ i > σcrit 2 otherwise A 58
96 Weak Factor Model Theorem : continued ˆΣ ˆF =D/2 K ρ ( ρ K ρ 2 ) ρ 2 K D K =diag ((ˆθ )) ˆθK ( ) Ũ IK Ũ D /2 K ρ ρ K A 59
97 Weak Factor Model Lemma: Detection of weak factors If γ > and µ F, then the first K eigenvalues of M RP are strictly larger than the first K eigenvalues of M Var, i.e. For θ i > σ 2 crit it holds that ˆθ i θ i > θ i > σ 2 F i + cσ 2 e Thus, if γ > and µ F, then ρ i > ϱ i. ρ i θ i > i =,..., K For µ F RP-PCA always better than PCA. A 6
98 Weak Factor Model Example: One-factor model Assume that there is only one factor, i.e. K =. The signal matrix M RP simplifies to ( ) σ 2 M RP = F + cσe 2 σ F µ( + γ) µσ F ( + γ) (µ 2 + cσe 2 )( + γ) and has the eigenvalues: θ,2 = 2 σ2 F + cσe 2 + (µ 2 + cσe 2 )( + γ) ± (σf cσ2 e + (µ 2 + cσe 2 )( + γ)) 2 4( + γ)cσe 2 (σf 2 + µ2 + cσe 2 ) The eigenvector of first eigenvalue θ has the components Ũ, = Ũ,2 = µσ F ( + γ) (θ (σf 2 + cσ2 e )) 2 + µ 2 σf 2 ( + γ) θ σ 2 F + cσ 2 e (θ (σ 2 F + cσ2 e )) 2 + µ 2 σ 2 F ( + γ) A 6
99 Weak Factor Model Corollary: One-factor model The correlation between the estimated and true factor has the following limit: Ĉorr(F, ˆF ) p ρ ρ 2 + ( ρ2 ) (θ (σ 2 F +cσ2 e ))2 + µ 2 σ 2 F (+γ) A 62
100 Strong Factor Model Strong Factor Model Strong factors affect most assets: e.g. market factor N Λ Λ bounded (after normalizing factor variances) Statistical model: Bai and Ng (22) and Bai (23) framework Factors and loadings can be estimated consistently and are asymptotically normal distributed RP-PCA provides a more efficient estimator of the loadings Assumptions essentially identical to Bai (23) A 63
101 Strong Factor Model Asymptotic Distribution (up to rotation) PCA under assumptions of Bai (23): Asymptotically ˆΛ behaves like OLS regression of F on X. Asymptotically ˆF behaves like OLS regression of Λ on X. RP-PCA under slightly stronger assumptions as in Bai (23): Asymptotically ( ˆΛ behaves ) like OLS regression of FW on XW with W 2 = I T + γ T. Asymptotically ˆF behaves like OLS regression of Λ on X. Asymptotic Expansion Asymptotic expansions (under slightly stronger assumptions as in Bai (23)): ) T (H ˆΛi Λ i = ( F W 2 F ) ( T ) T T F W 2 e i + O p + o N p() ) 2 N (H ˆFt F t = ( N Λ Λ ) ( N ) N Λ et + O p + o T p() with known rotation matrix H. A 64
102 Strong Factor Model Assumption 2: Strong Factor Model Assume the same assumptions as in Bai (23) (Assumption A-G) hold and in addition ( T T t= Fte ) ( ) t,i D Ω, Ω,2 T T t= e N(, Ω) Ω = t,i Ω 2, Ω 2,2 A 65
103 Strong Factor Model Theorem 2: Strong Factor Model Assumption 2 holds and γ [, ). Then: For any choice of γ the factors, loadings and common components can be estimated consistently pointwise. then ( ) T H ˆΛi D Λ i N(, Φ) If T N ( ) ) Φ = Σ F + (γ + )µ F µ F (Ω, + γµ F Ω 2, + γω,2µ F + γ 2 µ F Ω 2,2µ F ( ) Σ F + (γ + )µ F µ F For γ = this simplifies to the conventional case Σ F Ω,Σ F. If N T then the asymptotic distribution of the factors is not affected by the choice of γ. The asymptotic distribution of the common component depends on γ if and only if N T does not go to zero. For T N T (Ĉt,i C t,i ) D N (, F t ΦF t ) A 66
Factors that Fit the Time Series and Cross-Section of Stock Returns
Factors that Fit the Time Series and Cross-Section of Stock Returns Martin Lettau Markus Pelger UC Berkeley Stanford University November 3th 8 NBER Asset Pricing Meeting Motivation Motivation Fundamental
More informationEstimating Latent Asset-Pricing Factors
Estimating Latent Asset-Pricing Factors Martin Lettau and Markus Pelger April 9, 8 Abstract We develop an estimator for latent factors in a large-dimensional panel of financial data that can explain expected
More informationEstimating Latent Asset-Pricing Factors
Estimating Latent Asset-Pricing Factors Martin Lettau Haas School of Business, University of California at Berkeley, Berkeley, CA 947, lettau@berkeley.edu,nber,cepr Markus Pelger Department of Management
More informationEstimating Latent Asset-Pricing Factors
Estimating Latent Asset-Pricing Factors Martin Lettau Haas School of Business, University of California at Berkeley, Berkeley, CA 947, lettau@berkeley.edu,nber,cepr Markus Pelger Department of Management
More informationEstimating Latent Asset-Pricing Factors
Estimating Latent Asset-Pricing actors Martin Lettau Haas School of Business, University of California at Berkeley, Berkeley, CA 947, lettau@berkeley.edu,nber,cepr Markus Pelger Department of Management
More informationDeep Learning in Asset Pricing
Deep Learning in Asset Pricing Luyang Chen 1 Markus Pelger 1 Jason Zhu 1 1 Stanford University November 17th 2018 Western Mathematical Finance Conference 2018 Motivation Hype: Machine Learning in Investment
More informationEcon671 Factor Models: Principal Components
Econ671 Factor Models: Principal Components Jun YU April 8, 2016 Jun YU () Econ671 Factor Models: Principal Components April 8, 2016 1 / 59 Factor Models: Principal Components Learning Objectives 1. Show
More informationInference on Risk Premia in the Presence of Omitted Factors
Inference on Risk Premia in the Presence of Omitted Factors Stefano Giglio Dacheng Xiu Booth School of Business, University of Chicago Center for Financial and Risk Analytics Stanford University May 19,
More informationFactor Models for Asset Returns. Prof. Daniel P. Palomar
Factor Models for Asset Returns Prof. Daniel P. Palomar The Hong Kong University of Science and Technology (HKUST) MAFS6010R- Portfolio Optimization with R MSc in Financial Mathematics Fall 2018-19, HKUST,
More informationR = µ + Bf Arbitrage Pricing Model, APM
4.2 Arbitrage Pricing Model, APM Empirical evidence indicates that the CAPM beta does not completely explain the cross section of expected asset returns. This suggests that additional factors may be required.
More informationApplications of Random Matrix Theory to Economics, Finance and Political Science
Outline Applications of Random Matrix Theory to Economics, Finance and Political Science Matthew C. 1 1 Department of Economics, MIT Institute for Quantitative Social Science, Harvard University SEA 06
More informationEstimation of large dimensional sparse covariance matrices
Estimation of large dimensional sparse covariance matrices Department of Statistics UC, Berkeley May 5, 2009 Sample covariance matrix and its eigenvalues Data: n p matrix X n (independent identically distributed)
More informationNetwork Connectivity and Systematic Risk
Network Connectivity and Systematic Risk Monica Billio 1 Massimiliano Caporin 2 Roberto Panzica 3 Loriana Pelizzon 1,3 1 University Ca Foscari Venezia (Italy) 2 University of Padova (Italy) 3 Goethe University
More informationRoss (1976) introduced the Arbitrage Pricing Theory (APT) as an alternative to the CAPM.
4.2 Arbitrage Pricing Model, APM Empirical evidence indicates that the CAPM beta does not completely explain the cross section of expected asset returns. This suggests that additional factors may be required.
More informationLarge-dimensional factor modeling based on high-frequency observations
Large-dimensional factor modeling based on high-frequency observations Markus Pelger May 21, 2018 Abstract This paper develops a statistical theory to estimate an unknown factor structure based on financial
More informationRegression: Ordinary Least Squares
Regression: Ordinary Least Squares Mark Hendricks Autumn 2017 FINM Intro: Regression Outline Regression OLS Mathematics Linear Projection Hendricks, Autumn 2017 FINM Intro: Regression: Lecture 2/32 Regression
More informationMFE Financial Econometrics 2018 Final Exam Model Solutions
MFE Financial Econometrics 2018 Final Exam Model Solutions Tuesday 12 th March, 2019 1. If (X, ε) N (0, I 2 ) what is the distribution of Y = µ + β X + ε? Y N ( µ, β 2 + 1 ) 2. What is the Cramer-Rao lower
More informationNotes on empirical methods
Notes on empirical methods Statistics of time series and cross sectional regressions 1. Time Series Regression (Fama-French). (a) Method: Run and interpret (b) Estimates: 1. ˆα, ˆβ : OLS TS regression.
More informationEmpirical properties of large covariance matrices in finance
Empirical properties of large covariance matrices in finance Ex: RiskMetrics Group, Geneva Since 2010: Swissquote, Gland December 2009 Covariance and large random matrices Many problems in finance require
More informationTheory and Applications of High Dimensional Covariance Matrix Estimation
1 / 44 Theory and Applications of High Dimensional Covariance Matrix Estimation Yuan Liao Princeton University Joint work with Jianqing Fan and Martina Mincheva December 14, 2011 2 / 44 Outline 1 Applications
More informationASSET PRICING MODELS
ASSE PRICING MODELS [1] CAPM (1) Some notation: R it = (gross) return on asset i at time t. R mt = (gross) return on the market portfolio at time t. R ft = return on risk-free asset at time t. X it = R
More informationCross-Sectional Regression after Factor Analysis: Two Applications
al Regression after Factor Analysis: Two Applications Joint work with Jingshu, Trevor, Art; Yang Song (GSB) May 7, 2016 Overview 1 2 3 4 1 / 27 Outline 1 2 3 4 2 / 27 Data matrix Y R n p Panel data. Transposable
More informationLinear Factor Models and the Estimation of Expected Returns
Linear Factor Models and the Estimation of Expected Returns Cisil Sarisoy a,, Peter de Goeij b, Bas J.M. Werker c a Department of Finance, CentER, Tilburg University b Department of Finance, Tilburg University
More informationThe Slow Convergence of OLS Estimators of α, β and Portfolio. β and Portfolio Weights under Long Memory Stochastic Volatility
The Slow Convergence of OLS Estimators of α, β and Portfolio Weights under Long Memory Stochastic Volatility New York University Stern School of Business June 21, 2018 Introduction Bivariate long memory
More informationMultivariate GARCH models.
Multivariate GARCH models. Financial market volatility moves together over time across assets and markets. Recognizing this commonality through a multivariate modeling framework leads to obvious gains
More informationPortfolio Allocation using High Frequency Data. Jianqing Fan
Portfolio Allocation using High Frequency Data Princeton University With Yingying Li and Ke Yu http://www.princeton.edu/ jqfan September 10, 2010 About this talk How to select sparsely optimal portfolio?
More informationLecture 12 Robust Estimation
Lecture 12 Robust Estimation Prof. Dr. Svetlozar Rachev Institute for Statistics and Mathematical Economics University of Karlsruhe Financial Econometrics, Summer Semester 2007 Copyright These lecture-notes
More informationEquity risk factors and the Intertemporal CAPM
Equity risk factors and the Intertemporal CAPM Ilan Cooper 1 Paulo Maio 2 1 Norwegian Business School (BI) 2 Hanken School of Economics BEROC Conference, Minsk Outline 1 Motivation 2 Cross-sectional tests
More information3.1. The probabilistic view of the principal component analysis.
301 Chapter 3 Principal Components and Statistical Factor Models This chapter of introduces the principal component analysis (PCA), briefly reviews statistical factor models PCA is among the most popular
More informationThis paper develops a test of the asymptotic arbitrage pricing theory (APT) via the maximum squared Sharpe
MANAGEMENT SCIENCE Vol. 55, No. 7, July 2009, pp. 1255 1266 issn 0025-1909 eissn 1526-5501 09 5507 1255 informs doi 10.1287/mnsc.1090.1004 2009 INFORMS Testing the APT with the Maximum Sharpe Ratio of
More informationIdentifying Financial Risk Factors
Identifying Financial Risk Factors with a Low-Rank Sparse Decomposition Lisa Goldberg Alex Shkolnik Berkeley Columbia Meeting in Engineering and Statistics 24 March 2016 Outline 1 A Brief History of Factor
More informationState-space Model. Eduardo Rossi University of Pavia. November Rossi State-space Model Fin. Econometrics / 53
State-space Model Eduardo Rossi University of Pavia November 2014 Rossi State-space Model Fin. Econometrics - 2014 1 / 53 Outline 1 Motivation 2 Introduction 3 The Kalman filter 4 Forecast errors 5 State
More informationTime Series Models for Measuring Market Risk
Time Series Models for Measuring Market Risk José Miguel Hernández Lobato Universidad Autónoma de Madrid, Computer Science Department June 28, 2007 1/ 32 Outline 1 Introduction 2 Competitive and collaborative
More informationEcon 583 Final Exam Fall 2008
Econ 583 Final Exam Fall 2008 Eric Zivot December 11, 2008 Exam is due at 9:00 am in my office on Friday, December 12. 1 Maximum Likelihood Estimation and Asymptotic Theory Let X 1,...,X n be iid random
More information17 Factor Models and Principal Components
17 Factor Models and Principal Components 17.1 Dimension Reduction High-dimensional data can be challenging to analyze. They are difficult to visualize, need extensive computer resources, and often require
More informationGMM - Generalized method of moments
GMM - Generalized method of moments GMM Intuition: Matching moments You want to estimate properties of a data set {x t } T t=1. You assume that x t has a constant mean and variance. x t (µ 0, σ 2 ) Consider
More informationSpecification Test Based on the Hansen-Jagannathan Distance with Good Small Sample Properties
Specification Test Based on the Hansen-Jagannathan Distance with Good Small Sample Properties Yu Ren Department of Economics Queen s University Katsumi Shimotsu Department of Economics Queen s University
More informationFinancial Econometrics Short Course Lecture 3 Multifactor Pricing Model
Financial Econometrics Short Course Lecture 3 Multifactor Pricing Model Oliver Linton obl20@cam.ac.uk Renmin University Financial Econometrics Short Course Lecture 3 MultifactorRenmin Pricing University
More informationTesting the APT with the Maximum Sharpe. Ratio of Extracted Factors
Testing the APT with the Maximum Sharpe Ratio of Extracted Factors Chu Zhang This version: April, 2008 Abstract. This paper develops a test of the asymptotic arbitrage pricing theory (APT) via the maximum
More informationGARCH Models. Eduardo Rossi University of Pavia. December Rossi GARCH Financial Econometrics / 50
GARCH Models Eduardo Rossi University of Pavia December 013 Rossi GARCH Financial Econometrics - 013 1 / 50 Outline 1 Stylized Facts ARCH model: definition 3 GARCH model 4 EGARCH 5 Asymmetric Models 6
More informationSolutions of the Financial Risk Management Examination
Solutions of the Financial Risk Management Examination Thierry Roncalli January 9 th 03 Remark The first five questions are corrected in TR-GDR and in the document of exercise solutions, which is available
More informationVolatility. Gerald P. Dwyer. February Clemson University
Volatility Gerald P. Dwyer Clemson University February 2016 Outline 1 Volatility Characteristics of Time Series Heteroskedasticity Simpler Estimation Strategies Exponentially Weighted Moving Average Use
More information5.1 Consistency of least squares estimates. We begin with a few consistency results that stand on their own and do not depend on normality.
88 Chapter 5 Distribution Theory In this chapter, we summarize the distributions related to the normal distribution that occur in linear models. Before turning to this general problem that assumes normal
More informationInference on Risk Premia in the Presence of Omitted Factors
Inference on Risk Premia in the Presence of Omitted Factors Stefano Giglio Yale School of Management NBER and CEPR Dacheng Xiu Booth School of Business University of Chicago First version: November 216
More informationGeneralized Hypothesis Testing and Maximizing the Success Probability in Financial Markets
Generalized Hypothesis Testing and Maximizing the Success Probability in Financial Markets Tim Leung 1, Qingshuo Song 2, and Jie Yang 3 1 Columbia University, New York, USA; leung@ieor.columbia.edu 2 City
More information9.1 Orthogonal factor model.
36 Chapter 9 Factor Analysis Factor analysis may be viewed as a refinement of the principal component analysis The objective is, like the PC analysis, to describe the relevant variables in study in terms
More informationLinear Factor Models and the Estimation of Expected Returns
Linear Factor Models and the Estimation of Expected Returns Cisil Sarisoy, Peter de Goeij and Bas Werker DP 03/2016-020 Linear Factor Models and the Estimation of Expected Returns Cisil Sarisoy a,, Peter
More informationErrata for Campbell, Financial Decisions and Markets, 01/02/2019.
Errata for Campbell, Financial Decisions and Markets, 01/02/2019. Page xi, section title for Section 11.4.3 should be Endogenous Margin Requirements. Page 20, equation 1.49), expectations operator E should
More information14 Week 5 Empirical methods notes
14 Week 5 Empirical methods notes 1. Motivation and overview 2. Time series regressions 3. Cross sectional regressions 4. Fama MacBeth 5. Testing factor models. 14.1 Motivation and Overview 1. Expected
More informationMaximum Likelihood Estimation for Factor Analysis. Yuan Liao
Maximum Likelihood Estimation for Factor Analysis Yuan Liao University of Maryland Joint worth Jushan Bai June 15, 2013 High Dim. Factor Model y it = λ i f t + u it i N,t f t : common factor λ i : loading
More informationIntroduction to Algorithmic Trading Strategies Lecture 10
Introduction to Algorithmic Trading Strategies Lecture 10 Risk Management Haksun Li haksun.li@numericalmethod.com www.numericalmethod.com Outline Value at Risk (VaR) Extreme Value Theory (EVT) References
More informationForecasting Large Realized Covariance Matrices: The Benefits of Factor Models and Shrinkage
Forecasting Large Realized Covariance Matrices: The Benefits of Factor Models and Shrinkage Diego S. de Brito Department of Economics PUC-Rio Marcelo C. Medeiros Department of Economics PUC-Rio First version:
More informationFinancial Econometrics Lecture 6: Testing the CAPM model
Financial Econometrics Lecture 6: Testing the CAPM model Richard G. Pierse 1 Introduction The capital asset pricing model has some strong implications which are testable. The restrictions that can be tested
More informationA Factor Analytical Method to Interactive Effects Dynamic Panel Models with or without Unit Root
A Factor Analytical Method to Interactive Effects Dynamic Panel Models with or without Unit Root Joakim Westerlund Deakin University Australia March 19, 2014 Westerlund (Deakin) Factor Analytical Method
More informationOptimal Portfolio Selection using Regularization
Optimal Portfolio Selection using Regularization Marine Carrasco and Nérée Noumon Université de Montréal Preliminary and incomplete Do not circulate February, 2010 Abstract The mean-variance principle
More informationUniform Inference for Conditional Factor Models with Instrumental and Idiosyncratic Betas
Uniform Inference for Conditional Factor Models with Instrumental and Idiosyncratic Betas Yuan Xiye Yang Rutgers University Dec 27 Greater NY Econometrics Overview main results Introduction Consider a
More informationPanel Data Models. Chapter 5. Financial Econometrics. Michael Hauser WS17/18 1 / 63
1 / 63 Panel Data Models Chapter 5 Financial Econometrics Michael Hauser WS17/18 2 / 63 Content Data structures: Times series, cross sectional, panel data, pooled data Static linear panel data models:
More informationModel Mis-specification
Model Mis-specification Carlo Favero Favero () Model Mis-specification 1 / 28 Model Mis-specification Each specification can be interpreted of the result of a reduction process, what happens if the reduction
More informationIn modern portfolio theory, which started with the seminal work of Markowitz (1952),
1 Introduction In modern portfolio theory, which started with the seminal work of Markowitz (1952), many academic researchers have examined the relationships between the return and risk, or volatility,
More informationNon white sample covariance matrices.
Non white sample covariance matrices. S. Péché, Université Grenoble 1, joint work with O. Ledoit, Uni. Zurich 17-21/05/2010, Université Marne la Vallée Workshop Probability and Geometry in High Dimensions
More informationRECENT DEVELOPMENTS IN LARGE DIMENSIONAL FACTOR ANALYSIS 58
RECENT DEVELOPMENTS IN LARGE DIMENSIONAL FACTOR ANALYSIS Serena Ng June 2007 SCE Meeting, Montreal June 2007 SCE Meeting, Montreal 1 / Factor Model: for i = 1,... N, t = 1,... T, x it = λ if t + e it F
More informationORTHOGO ALIZED EQUITY RISK PREMIA SYSTEMATIC RISK DECOMPOSITIO. Rudolf F. Klein a,* and K. Victor Chow b,* Abstract
ORTHOGO ALIZED EQUITY RIS PREMIA A D SYSTEMATIC RIS DECOMPOSITIO Rudolf F. lein a,* and. Victor Chow b,* Abstract To solve the dependency problem between factors, in the context of linear multi-factor
More informationIntroduction to Algorithmic Trading Strategies Lecture 3
Introduction to Algorithmic Trading Strategies Lecture 3 Pairs Trading by Cointegration Haksun Li haksun.li@numericalmethod.com www.numericalmethod.com Outline Distance method Cointegration Stationarity
More informationBacktesting Marginal Expected Shortfall and Related Systemic Risk Measures
and Related Systemic Risk Measures Denisa Banulescu, Christophe Hurlin, Jérémy Leymarie, Olivier Scaillet, ACPR Chair "Regulation and Systemic Risk" - March 24, 2016 Systemic risk The recent nancial crisis
More informationMiloš Kopa. Decision problems with stochastic dominance constraints
Decision problems with stochastic dominance constraints Motivation Portfolio selection model Mean risk models max λ Λ m(λ r) νr(λ r) or min λ Λ r(λ r) s.t. m(λ r) µ r is a random vector of assets returns
More informationThe Conditional Pricing of Systematic and Idiosyncratic Risk in the UK Equity Market. Niall O Sullivan b Francesco Rossi c. This version: July 2014
The Conditional Pricing of Systematic and Idiosyncratic Risk in the UK Equity Market John Cotter a Niall O Sullivan b Francesco Rossi c This version: July 2014 Abstract We test whether firm idiosyncratic
More informationUniform Inference for Conditional Factor Models with. Instrumental and Idiosyncratic Betas
Uniform Inference for Conditional Factor Models with Instrumental and Idiosyncratic Betas Yuan Liao Xiye Yang December 6, 7 Abstract It has been well known in financial economics that factor betas depend
More informationRegularization of Portfolio Allocation
Regularization of Portfolio Allocation Benjamin Bruder Quantitative Research Lyxor Asset Management, Paris benjamin.bruder@lyxor.com Jean-Charles Richard Quantitative Research Lyxor Asset Management, Paris
More informationModel Comparison with Sharpe Ratios
Model Comparison with Sharpe Ratios Francisco Barillas, Raymond Kan, Cesare Robotti, and Jay Shanken Barillas is from Emory University. Kan is from the University of Toronto. Robotti is from the University
More informationPrincipal Component Analysis (PCA) Our starting point consists of T observations from N variables, which will be arranged in an T N matrix R,
Principal Component Analysis (PCA) PCA is a widely used statistical tool for dimension reduction. The objective of PCA is to find common factors, the so called principal components, in form of linear combinations
More informationIntro VEC and BEKK Example Factor Models Cond Var and Cor Application Ref 4. MGARCH
ntro VEC and BEKK Example Factor Models Cond Var and Cor Application Ref 4. MGARCH JEM 140: Quantitative Multivariate Finance ES, Charles University, Prague Summer 2018 JEM 140 () 4. MGARCH Summer 2018
More informationarxiv: v2 [math.st] 2 Jul 2017
A Relaxed Approach to Estimating Large Portfolios Mehmet Caner Esra Ulasan Laurent Callot A.Özlem Önder July 4, 2017 arxiv:1611.07347v2 [math.st] 2 Jul 2017 Abstract This paper considers three aspects
More informationIf we want to analyze experimental or simulated data we might encounter the following tasks:
Chapter 1 Introduction If we want to analyze experimental or simulated data we might encounter the following tasks: Characterization of the source of the signal and diagnosis Studying dependencies Prediction
More informationQuaderni di Dipartimento. Estimation Methods in Panel Data Models with Observed and Unobserved Components: a Monte Carlo Study
Quaderni di Dipartimento Estimation Methods in Panel Data Models with Observed and Unobserved Components: a Monte Carlo Study Carolina Castagnetti (Università di Pavia) Eduardo Rossi (Università di Pavia)
More informationData Abundance and Asset Price Informativeness. On-Line Appendix
Data Abundance and Asset Price Informativeness On-Line Appendix Jérôme Dugast Thierry Foucault August 30, 07 This note is the on-line appendix for Data Abundance and Asset Price Informativeness. It contains
More informationEmpirical Methods in Finance Section 2 - The CAPM Model
Empirical Methods in Finance Section 2 - The CAPM Model René Garcia Professor of finance EDHEC Nice, March 2010 Estimation and Testing The CAPM model is a one-period model E [Z i ] = β im E [Z m] with
More informationChapter 4: Factor Analysis
Chapter 4: Factor Analysis In many studies, we may not be able to measure directly the variables of interest. We can merely collect data on other variables which may be related to the variables of interest.
More informationSome Aspects of Universal Portfolio
1 Some Aspects of Universal Portfolio Tomoyuki Ichiba (UC Santa Barbara) joint work with Marcel Brod (ETH Zurich) Conference on Stochastic Asymptotics & Applications Sixth Western Conference on Mathematical
More informationMultivariate Time Series Analysis and Its Applications [Tsay (2005), chapter 8]
1 Multivariate Time Series Analysis and Its Applications [Tsay (2005), chapter 8] Insights: Price movements in one market can spread easily and instantly to another market [economic globalization and internet
More informationCS281 Section 4: Factor Analysis and PCA
CS81 Section 4: Factor Analysis and PCA Scott Linderman At this point we have seen a variety of machine learning models, with a particular emphasis on models for supervised learning. In particular, we
More informationChapter 12: An introduction to Time Series Analysis. Chapter 12: An introduction to Time Series Analysis
Chapter 12: An introduction to Time Series Analysis Introduction In this chapter, we will discuss forecasting with single-series (univariate) Box-Jenkins models. The common name of the models is Auto-Regressive
More informationVector autoregressions, VAR
1 / 45 Vector autoregressions, VAR Chapter 2 Financial Econometrics Michael Hauser WS17/18 2 / 45 Content Cross-correlations VAR model in standard/reduced form Properties of VAR(1), VAR(p) Structural VAR,
More informationEstimating Covariance Using Factorial Hidden Markov Models
Estimating Covariance Using Factorial Hidden Markov Models João Sedoc 1,2 with: Jordan Rodu 3, Lyle Ungar 1, Dean Foster 1 and Jean Gallier 1 1 University of Pennsylvania Philadelphia, PA joao@cis.upenn.edu
More informationExam 2. Jeremy Morris. March 23, 2006
Exam Jeremy Morris March 3, 006 4. Consider a bivariate normal population with µ 0, µ, σ, σ and ρ.5. a Write out the bivariate normal density. The multivariate normal density is defined by the following
More informationFinancial Econometrics Return Predictability
Financial Econometrics Return Predictability Eric Zivot March 30, 2011 Lecture Outline Market Efficiency The Forms of the Random Walk Hypothesis Testing the Random Walk Hypothesis Reading FMUND, chapter
More informationAsymptotic distribution of the sample average value-at-risk
Asymptotic distribution of the sample average value-at-risk Stoyan V. Stoyanov Svetlozar T. Rachev September 3, 7 Abstract In this paper, we prove a result for the asymptotic distribution of the sample
More informationFixed Effects, Invariance, and Spatial Variation in Intergenerational Mobility
American Economic Review: Papers & Proceedings 2016, 106(5): 400 404 http://dx.doi.org/10.1257/aer.p20161082 Fixed Effects, Invariance, and Spatial Variation in Intergenerational Mobility By Gary Chamberlain*
More informationEstimating the Number of Common Factors in Serially Dependent Approximate Factor Models
Estimating the Number of Common Factors in Serially Dependent Approximate Factor Models Ryan Greenaway-McGrevy y Bureau of Economic Analysis Chirok Han Korea University February 7, 202 Donggyu Sul University
More informationEcon 423 Lecture Notes: Additional Topics in Time Series 1
Econ 423 Lecture Notes: Additional Topics in Time Series 1 John C. Chao April 25, 2017 1 These notes are based in large part on Chapter 16 of Stock and Watson (2011). They are for instructional purposes
More informationBacktesting Marginal Expected Shortfall and Related Systemic Risk Measures
Backtesting Marginal Expected Shortfall and Related Systemic Risk Measures Denisa Banulescu 1 Christophe Hurlin 1 Jérémy Leymarie 1 Olivier Scaillet 2 1 University of Orleans 2 University of Geneva & Swiss
More informationMultivariate Volatility, Dependence and Copulas
Chapter 9 Multivariate Volatility, Dependence and Copulas Multivariate modeling is in many ways similar to modeling the volatility of a single asset. The primary challenges which arise in the multivariate
More information= observed volume on day l for bin j = base volume in jth bin, and = residual error, assumed independent with mean zero.
QB research September 4, 06 Page -Minute Bin Volume Forecast Model Overview In response to strong client demand, Quantitative Brokers (QB) has developed a new algorithm called Closer that specifically
More informationStatistics 910, #5 1. Regression Methods
Statistics 910, #5 1 Overview Regression Methods 1. Idea: effects of dependence 2. Examples of estimation (in R) 3. Review of regression 4. Comparisons and relative efficiencies Idea Decomposition Well-known
More informationForecast comparison of principal component regression and principal covariate regression
Forecast comparison of principal component regression and principal covariate regression Christiaan Heij, Patrick J.F. Groenen, Dick J. van Dijk Econometric Institute, Erasmus University Rotterdam Econometric
More informationEconometric Forecasting
Robert M. Kunst robert.kunst@univie.ac.at University of Vienna and Institute for Advanced Studies Vienna October 1, 2014 Outline Introduction Model-free extrapolation Univariate time-series models Trend
More informationShape of the return probability density function and extreme value statistics
Shape of the return probability density function and extreme value statistics 13/09/03 Int. Workshop on Risk and Regulation, Budapest Overview I aim to elucidate a relation between one field of research
More informationFactor Analysis (10/2/13)
STA561: Probabilistic machine learning Factor Analysis (10/2/13) Lecturer: Barbara Engelhardt Scribes: Li Zhu, Fan Li, Ni Guan Factor Analysis Factor analysis is related to the mixture models we have studied.
More informationNumerical Solutions to the General Marcenko Pastur Equation
Numerical Solutions to the General Marcenko Pastur Equation Miriam Huntley May 2, 23 Motivation: Data Analysis A frequent goal in real world data analysis is estimation of the covariance matrix of some
More information