Online Appendix for Innovation and Top Income Inequality

Similar documents
What Lies Beneath: A Sub- National Look at Okun s Law for the United States.

Innovation and Top Income Inequality

Statistical Mechanics of Money, Income, and Wealth

Lecture 26 Section 8.4. Mon, Oct 13, 2008

Nursing Facilities' Life Safety Standard Survey Results Quarterly Reference Tables

Your Galactic Address

Estimating Dynamic Games of Electoral Competition to Evaluate Term Limits in U.S. Gubernatorial Elections: Online Appendix

Sample Statistics 5021 First Midterm Examination with solutions

Class business PS is due Wed. Lecture 20 (QPM 2016) Multivariate Regression November 14, / 44

Use your text to define the following term. Use the terms to label the figure below. Define the following term.

Advanced Economic Growth: Lecture 3, Review of Endogenous Growth: Schumpeterian Models

Appendix 5 Summary of State Trademark Registration Provisions (as of July 2016)

Cluster Analysis. Part of the Michigan Prosperity Initiative

Chapter 7. Endogenous Growth II: R&D and Technological Change

Dynamics of Firms and Trade in General Equilibrium. Robert Dekle, Hyeok Jeong and Nobuhiro Kiyotaki USC, Seoul National University and Princeton

Schumpeterian Growth Models

Analyzing Severe Weather Data

Online Appendix The Growth of Low Skill Service Jobs and the Polarization of the U.S. Labor Market. By David H. Autor and David Dorn

2006 Supplemental Tax Information for JennisonDryden and Strategic Partners Funds

Evolution Strategies for Optimizing Rectangular Cartograms

Forecasting the 2012 Presidential Election from History and the Polls

Simple New Keynesian Model without Capital

Parametric Test. Multiple Linear Regression Spatial Application I: State Homicide Rates Equations taken from Zar, 1984.

Annual Performance Report: State Assessment Data

The Harris-Todaro model

14.461: Technological Change, Lecture 4 Technology and the Labor Market

problem. max Both k (0) and h (0) are given at time 0. (a) Write down the Hamilton-Jacobi-Bellman (HJB) Equation in the dynamic programming

Toulouse School of Economics, M2 Macroeconomics 1 Professor Franck Portier. Exam Solution

14.461: Technological Change, Lecture 1

(a) Write down the Hamilton-Jacobi-Bellman (HJB) Equation in the dynamic programming

Neoclassical Business Cycle Model

The Ramsey Model. (Lecture Note, Advanced Macroeconomics, Thomas Steger, SS 2013)

Summary of Natural Hazard Statistics for 2008 in the United States

EXST 7015 Fall 2014 Lab 08: Polynomial Regression

1 The Basic RBC Model

14.461: Technological Change, Lecture 3 Competition, Policy and Technological Progress

14.461: Technological Change, Lecture 4 Competition and Innovation

Regional Innovation Policy in Taiwan and South Korea: Impact of Science Parks on Firm-Productivity. Ian Sheldon

Political Cycles and Stock Returns. Pietro Veronesi

Does agglomeration explain regional income inequalities?

Final Exam. 1. Definitions: Briefly Define each of the following terms as they relate to the material covered in class.

Multiway Analysis of Bridge Structural Types in the National Bridge Inventory (NBI) A Tensor Decomposition Approach

Endogenous information acquisition

Smart Magnets for Smart Product Design: Advanced Topics

Getting to page 31 in Galí (2008)

The TransPacific agreement A good thing for VietNam?

Economic Growth: Lecture 8, Overlapping Generations

Equilibrium in a Production Economy

C Further Concepts in Statistics

Cross-Country Differences in Productivity: The Role of Allocation and Selection

Advanced Macroeconomics

SUPPLEMENTAL NUTRITION ASSISTANCE PROGRAM QUALITY CONTROL ANNUAL REPORT FISCAL YEAR 2008

4- Current Method of Explaining Business Cycles: DSGE Models. Basic Economic Models

Economics 2450A: Public Economics Section 8: Optimal Minimum Wage and Introduction to Capital Taxation

Housing and the Business Cycle

Comparative Advantage and Heterogeneous Firms

Wooldridge, Introductory Econometrics, 3d ed. Chapter 16: Simultaneous equations models. An obvious reason for the endogeneity of explanatory

Business Cycles: The Classical Approach

Empirical Application of Panel Data Regression

Team Production and the Allocation of Creativity across Global and Local Sectors

Online Appendix for Investment Hangover and the Great Recession

Economic Growth: Lectures 9 and 10, Endogenous Technological Change

Mathematical Appendix. Ramsey Pricing

Labor Union and the Wealth-Income Ratio

Advanced Economic Growth: Lecture 8, Technology Di usion, Trade and Interdependencies: Di usion of Technology

SAMPLE AUDIT FORMAT. Pre Audit Notification Letter Draft. Dear Registrant:

Expanding Variety Models

Online Appendixes for \A Theory of Military Dictatorships"

Solow Growth Model. Michael Bar. February 28, Introduction Some facts about modern growth Questions... 4

Economic Growth: Lecture 9, Neoclassical Endogenous Growth

Lecture 2: Firms, Jobs and Policy

A Modern Equilibrium Model. Jesús Fernández-Villaverde University of Pennsylvania

Simple New Keynesian Model without Capital

Module 19: Simple Linear Regression

Under-Employment and the Trickle-Down of Unemployment - Online Appendix Not for Publication

Income Distribution Dynamics with Endogenous Fertility. By Michael Kremer and Daniel Chen

Modelling Czech and Slovak labour markets: A DSGE model with labour frictions

Data Abundance and Asset Price Informativeness. On-Line Appendix

ECON 581: Growth with Overlapping Generations. Instructor: Dmytro Hryshko

REGRESSION ANALYSIS BY EXAMPLE

Political Economy of Institutions and Development. Lecture 4. Economic Institutions under Elite Domination

Structural Change, Demographic Transition and Fertility Di erence

Economic Growth: Lectures 10 and 11, Endogenous Technological Change

Does Pleasing Export-Oriented Foreign Investors Help Your. Balance of Payments? A General Equilibrium Analysis. (Available on Request Appendix)

Macroeconomics Theory II

New Notes on the Solow Growth Model

A Tale of Two Tails: Wage Inequality and City Size

Test of Convergence in Agricultural Factor Productivity: A Semiparametric Approach

Corporate Profits Tax and Innovative Investments

Small Open Economy RBC Model Uribe, Chapter 4

The Aggregate Implications of Innovative Investment. September 2017

The economy is populated by a unit mass of infinitely lived households with preferences given by. β t u(c Mt, c Ht ) t=0

u(c t, x t+1 ) = c α t + x α t+1

PhD/MA Econometrics Examination January 2012 PART A

Imperfect Credit Markets, Household Wealth Distribution, and Development. (Prepared for Annual Review of Economics)

Assumption 5. The technology is represented by a production function, F : R 3 + R +, F (K t, N t, A t )

Optimal Insurance of Search Risk

More on Roy Model of Self-Selection

14.770: Introduction to Political Economy Lectures 13 and 14: Economic Policy under Nondemocratic Institutions

2008/73. On the Golden Rule of Capital Accumulation Under Endogenous Longevity. David DE LA CROIX Grégory PONTHIERE

Transcription:

Online Appendix for Innovation and Top Income Inequality Not for publication unless requested Philippe Aghion Ufuk Akcigit Antonin Bergeaud Richard Blundell David Hémous February 23, 208 A Theoretical appendix A. Proofs of Proposition 2 The only claim we have not formally proved in the text is that 2 θ K z z) x E > 0 which immediately implies that the positive impact of an increase in R&D productivity on growth, entrepreneurial share and social mobility is attenuated when barriers to entry are high). Differentiating first with respect to θ E, we get: z) x E θ E z) x E = ), θ E L z)2 which is increasing in z since x E and z) both decrease in z and the denominator θ E + [ L z)2 respect to θ I gives: ] increases in z recall that z) x E θ I = L ) > 0). Similarly, differentiating with z) 2 θ E L z)2 ) x I θ I, which is increasing in z since x I θ I < 0, and z and the denominator both decrease in z. This establishes the proposition. A.2 Entrepreneurial share of income net of innovation costs So far we have computed gross shares of income ignoring innovation expenditures. If we discount these expenditures, the ratio between net entrepreneurial income and labor income

can be written as: rel net share = = x 2 E Entrepreneur share t θ E 2 θ I π L + π H π L x I + 2 ) x 2 I / 2 ) wt L Ỹ t πh 2 + w ) t π L z) x E 2Ỹt ) / ) wt L Ỹ t A) where we used 5), 7), the equilibrium values 9) and 0) knowing that there are M/+L) product lines. This expression shows that a higher rate of incumbent innovation will raise the net entrepreneur share of income, whereas a higher rate of entrant innovation will only raise the net entrepreneurial share of income if 2 π HỸt + 2 w t π L Ỹ t > 0 which occurs in particular if π H > 2π L ). This in turn relates to the creative destruction nature of entrant s innovation: a successful entrant gains π H Ỹ t w t by innovating but she destroys the rents π L Ỹ t of the incumbent. Formally, we can show: Proposition An increase in incumbent R&D productivity lower θ I ) leads to an increase in the relative shares of net entrepreneurial income over labor income. An increase in entrant R&D productivity lower θ E ) also leads to an increase in the relative shares of net entrepreneurial income over labor income whenever π 2 H + w t π 2 Ỹ L > 0. t On the other hand, we find that when L is large and π H is close enough to π L, then an increase in the productivity of entrant R&D will shift income towards workers instead of entrepreneurs, and therefore will contribute to a reduction in inequality. This result is in the vein of Jones and Kim 207). Proof. Using 6), we rewrite: We then obtain ) w t /Ỹt x I w t Ỹ t = L π L π H π L ) x I + z) x E)) = L π H π L ) and ) w t /Ỹt x E = z L π H π L ). Using A), we get: rel net share x I rel net share x E Note that = = 2 π H π L ) w t + π H π L πl + 2 π H π L ) x I Ỹ t L + 2 π ) H π L z) x E z)π H π L ) L ) 2 π H + w t 2 π Ỹ L z) wt + t Ỹ t πl + 2 π H π L ) x I + 2 π ) ) H π L z) x E )) Ỹ t A = π L + ) 2 π H π L ) x I + 2 π H π L z) x E = π L ) 2 z) x E + 2 π H π L ) x I + z) x E) w t ) 2 L t ) ) 2 Ỹt w t L t 2

is positive since z) x rel net share rel net share E <. Therefore > 0 and > 0 if π x I x E 2 H + w t π 2 Ỹ L > 0. t We know that an increase in θ E has no impact on x I but decreases x E, therefore we get that it reduces the relative net shares whenever π 2 H + w t π 2 Ỹ L > 0. An increase in θ I affects t both x I but also x E, as we have: We can then write rel net share x E θ I = π L H π L ) x I θ E, L z)2 π H π L ) θ I = = θ I rel net share x I x I θ I + rel net share x E x E θ E ) π H π L ) w θ E t L z)2 π L wt Ỹ t 2Ỹt θ E L z)2 π H π L ) + A θ E π H π L ) L θ E L z)2 π H π L ) Ỹt w t ) 2 L t x I θ I Note that x E <, requires π H wt Ỹ t ) z) < θ E. Moreover as L >, we must have θ E L z)2 π L w ) t > Ỹ t L z)2 π H π L ). Hence the relative net share is always decreasing in θ I. Finally assume that L is large such that w t /Ỹt is small relative to π H, then we have therefore w t πh π L π L π H π L ) + z) π )) H Ỹ t L θ I θ E rel net share x E ) 2 π wt H π L + π H π L ) πl + 2 π )) ) 2 H π L ) x I Ỹ t L + 2 π ) Ỹ t z H π L z) x E w t L 2 π ) )) H π L π L π H π L ) πh π L θ I + z) π ) H 2 θe + π H π L ) π L + π H π L ) 2 2 θ I + 2 π ) ) Ỹt z) H π L z) π H w t L θe ) ) ) 2 π H π L π L ) + π H π L ) π L + π L π H π L ) 2 2 Ỹt z) 2 w t L θ I Then, for L large enough, ) π H2 π L πl )+π H π L ) π L + π Lπ H π L ) 2 2θ I > 0 is a necessary and sufficient condition under which a decrease in θ E increases the relative net share. 3

A.3 Proofs for subsection 2.3.2 From 9), we have: x I = η 2 L θ I x E = z) < 0, whereas: [ 2x I θ ) E z) 2 π H x I ) x I η 2 L ) z) 2 ] )) θ E z) 2 )) 2, whose sign is ambiguous. Indeed, entrant innovation depends on the wage rate, and a higher directly reduces wages but also decreases incumbent innovation which increases wages. Yet, if θ E = θ I, the overall effect of a higher on aggregate innovation is negative: x I + x E = ηl 2 θ + z) x I ) )) ηl 2 θ z) 2 )) x I θ z) 2 + = z) < 0. η 2 L η 2 L )) θ z) 2 θ + z) + z) x I z θ Overall, we therefore have: θ z) 2 ) π H x I ) x I z) 2 θ z) 2 )) 2 ))+ η H θ z) 2 )) )) π H x I) x I entrepreneur share t = z) x ηl 2 E x I) + ) z) 2 ) z) x E + x η I), L where the second term is dominated by the first term for θ large enough. A.4 Shared rents A.4. Profit sharing between inventor and developer Here, we assume that once an innovation has been researched, it still needs to be implemented and that this development phase depends on a CEO s effort. Since we are separating the firm owner from the firm manager, we now consider that a firm s owner does not have the outside option of working as a production worker if her firm does not produce. For simplicity 4

we assume that M = + L, so that the economy is populated by a mass L of workers and a mass of firm owners who own both the incumbent firm but also the potential entrant firm). For simplicity, the CEO is assumed to be a worker who gets the opportunity to be CEO for a potential entrant or the incumbent in addition to his work as a production workers. Hence for the owner of an incumbent firm, expected income net of research spending and CEO wages) is given by: Π inc x I, e I, R I,H, R I,L ) = e I x I π H R I,H ) Y t + e I x I z) e Ex E) π L Y t e I ) x I R I,L Y t θ I x 2 I 2 Y t, where e I denotes the likelihood that the CEO succeeds in ensuring that the company implements the new technology and similarly e E is the equilibrium likelihood that the CEO of an entrant company manages to set-up a new firm. R I,H Y t is the income that the CEO obtains in case of a success, and R I,L Y t, his income if he fails. To obtain a success rate e I, a CEO has to incur a utility effort cost ψ e2 I 2 Y t. The CEOs outside option is 0 we assume that he can always reject a negative payment). A CEO of an incumbent firm will then solve the following program: Max e I { e I R I,H Y t + e I ) R I,L ψ e2 I 2 Y t We then obtain that the constraint R I,L 0 will bind. As a result the CEO will choose a success probability: e I = R I,H/ψ. This implies that the firm s owner will decide on a payment R I,H = π H π L ) /2. Therefore, in case of a success, the CEO obtains half of the gains from innovation. Similarly for an entrant firm owner, we find that her expected income is given by: }. Π ent x E, e E, R E,H, R E.L ) = z) e E x E π H R E,H ) Y t z) x E e E ) R E,L Y t θ E x 2 E 2 Y t. e E is now the likelihood that the CEO succeeds in setting up a new firm here we assumed that the CEO effort is undertaken after the innovation has been potentially blocked, this is without loss of generality). As above the constraint that R E,L = 0 binds must be satisfied. We then obtain that e E = R E,H /ψ as before, which now leads to R E,H = π H /2. Here as well the CEO gets half of the gains from innovation in case of success. The gains from an innovation for the owner of an entrant firm is π H Y t, while it was π H Y t w t when she had the outside option of becoming a worker. 5

We obtain that as a share of gross output, CEOs income is given by CEO share = x Ie IR I,H + z) x Ee ERE.H = π H π L ) 4 z)2 πh 4 + θ I 6ψ 2 θ E 6ψ. 2 Therefore it decreases with both entrant and incumbent innovation costs. As long as the labor force is large enough, top income earners will be the owners and the CEO. As a share of gross output, their joint income net of innovation costs) will be given by: T op share = π H µ + π L µ ) θ Ex 2 E 2 where the share of high-mark up sectors satisfies: θ Ix 2 I 2, A2) µ = x Ie I + z) x Ee E. It is then straightforward to show that this top share decreases with the incumbent innovation costs θ I, whereas the labor share increases with both entrant and incumbent innovation costs. Furthermore, a decrease in entrant innovation cost θ E shifts income towards top earners relative to workers i.e. it increases T op share/wage share) if and only if 3π H π 4π L +π L π H +π H π L ) 4 L 8θ I > 0, which is satisfied if profits of innovative firms are large enough ψ 2 relative to the non-innovative ones. Indeed, entrant innovation can potentially reduce the owner share for the same reasons as above. This establishes: Proposition 2 A reduction in incumbents innovation costs favors top income earners. A reduction in entrant s innovation costs favors top income earners if and only if 3π H 4π L + π L π H + π L π H π L ) 4 8θ I ψ 2 > 0. Proof. Solving for the innovation decision we obtain that incumbents invest: x I = θ I π H π L ) 2 4ψ = 4ψθ I ) 2. Entrants invest x E = z πh 2 = z ) 2. 4ψθ E 4ψθ E We can then express the share of high mark-up sector as: Since the wage share is given by µ = π H π L ) 3 z)2 + π θ I 8ψ 2 8ψ 2 θ H. 3 E w t L Y t = π L π H π L ) µ = π L π H π L ) 6 π H π L ) 3 + θ I 8ψ 2 ) z)2 π 8ψ 2 H 3, θ E

both innovation costs increase the labor share of gross output. The top earners share using A2) and the values for the innovation rates) can then be expressed as: T op share = w tl Y t π H π L ) 4 32θ I ψ 2 + z)2 π 4 H 32θ E ψ 2 = π L + 3 π H π L ) 4 32θ I ψ 2 + z)2 π 3 H 3π H 4π L ) 32ψ 2 θ E. Hence we get that T op share is decreasing in θ I. Further, we get that ) T op share = ) ) z)2 π 3 2 H Yt π H π L ) 3π 4 θ E w t L/Y t ) 32ψ 2 θe 2 H 4π L + π L π H + π L w t L 8θ I ψ 2 Hence an increase in θ E shifts income towards workers to the detriment of the top earners if π 3π H 4π L + π L π H + π H π L ) 4 L > 0 which is satisfied if π H /π L is large enough). A.4.2 8θ I ψ 2 Profit sharing between firm owner and inventor To distinguish between the firm owner and the innovator we now consider that the set of potential firm owners is given. For simplicity we assume that M = + L. There is a mass of capitalists who inherit incumbent firms and can each set up an entrant firm, while innovators are drawn from the population, and there is a mass L of potential workers. Workers are identical when in production but differ in the quantity of human capital they can produce in innovation each worker can produce h units of human capital and h is distributed uniformly over [ 0, h ] ). To innovate with probability x an incumbent firm needs to hire θe 2 /2 units of human capital. Similarly an entrant firm needs to hire θe 2 /2 units of human capital. 2 Denoting by v the price of unit of innovative human capital normalized by Y t, we obtain that there will be a threshold ĥ, such that individuals whose h is below ĥ will be production workers and those above will be innovators. That threshold obeys w Y = vĥ. Solving for the profit maximization problem, we find the optimal innovation rates as: x I = π H π L θv and x E = π H z θv, ), A3) A4) for the incumbent and the entrant respectively. These rates are similar to those in the baseline model, except that they depend on the wage rate v and the entrant rate does not depend on w as a firm owner does not have the possibility to become a worker if he fails). 2 We assume that the innovation cost is the same for entrants and incumbents. Without this assumption a reduction in entrant s cost could lead to a reduction in overall innovation through its impact on the price of human capital for some extreme parameter assumptions. 7

Market clearing for human capital implies: ) x 2 I θ 2 + x 2 E 2 = L h ĥ hdh π H π L ) 2 + πh 2 z) 2 = θv 2 L h2 ĥ2. A5) h This equation establishes the demand for innovative human capital as a function of the wage rate and the cost of innovation. The supply-side equation can be determined by combining A3) with the production labor share equation: wlĥ Y h = µ + µ, as Lĥ is the labor force in production. We then obtain: vlĥ2 h = π L + π L π H πh π L + π H z) 2). A6) θv Plugging A6) into A5), we obtain that the wage rate for innovative human capital is uniquely defined by: z) 2 vlh = π L + π L π H. A7) θv Hence v is decreasing in θ i.e. the lower is the cost of innovation, the higher is the level of wage per unit of human capital). As shown below, a decrease in the innovation cost boosts innovation both by entrants and incumbents. In addition, the threshold ĥ decreases, so that when innovation costs go down, more workers end up working as innovators. Two measures of inequality can be derived here: the share of income going to the firm owners here we implicitly assume that firm ownership is concentrated at the top of the income distribution) and a measure of top labor income inequality. The income share of innovators can be derived as: Innov share = h ĥ ) vlhdh = vl h 2 ĥ2 / 2h ). A8) One can show that this expression is decreasing in θ hence lower innovation costs increase the share of income going to innovators). We show below that the owner share of GDP must satisfy: Owner share = π L µ) + π H µ Innov share A9) = π L + πh π L ) 2 + π H 2π L ) π H z) 2). 2θv Hence a reduction in innovation costs will increase the owner share of income as long as π H π L ) 2 + π H 2π L ) π H z) 2 > 0 the intuition is still that entrant innovations may 8

decrease overall owner s net share of income by suppressing the rents of an incumbent). If firm owners are disproportionately concentrated in the top of the income distribution, this predicts that a reduction in innovation will increase top income inequality. The labor income share going to individuals above some ratio h/h can be expressed as T oplincome h) = = h h vhdh w ĥ + h Y h ĥ vhdh = w h Y h w ĥ + h Y h h 2 h 2 ĥ 2 + h 2 if h ĥ ĥ vhdh = 2ĥ h ĥ 2 + h 2 if h ĥ. In both cases, T oplincome is decreasing in can then prove the following proposition. ĥ and therefore also in innovation costs. One Proposition 3 A reduction in innovation costs leads to an increase in innovation, an increase in top labor income inequality and an increase in the owners share of income if π H π L ) 2 + π H 2π L ) π H z) 2 > 0. Proof: Using A7) we have: Hence we get: dv dθ = v z) π L π 2 H θv 2. θ z) Lh + π L π 2 H θv 2 d θv) dθ Lh = v > 0. z) Lh + π L π 2 H θv 2 Using A4) we then obtain that both entrant innovation x and incumbent innovation x I decrease with θ. Differentiating A5) we get: dĥ dθ = h2 ĥ2 2θ = h2 ĥ2 2θ = + 2 θ ) dv v dθ Lh π L π H z) 2 θv 2 Lh + π L π H z) 2 θv 2 h 2 ĥ2 Lh + π L π H z) 2 θv 2 π L 2θv where we used A7) to obtain the latter equality. Using A5) in A8), we obtain that the share of income that goes to innovators obeys: > 0, Innov share = π H π L ) 2 + πh 2 z)2, 2θv which is decreasing in θ since θv is increasing in θ. 9

To compute the owner share we use the previous equation and A4) in A9) to obtain: Owner share = π L + π H π L ) x I + z) x E) Innov share ) πh π L z = π L + π H π L ) + z) π H π H π L ) 2 + πh 2 z)2 θv θv 2θv = π L + πh π L ) 2 + π H 2π L ) π H z) 2). 2θv Therefore the owner share is increasing in θ if and only if π H π L ) 2 +π H 2π L ) π H z) 2 > 0, which establishes the Proposition. A.5 CES production technology For simplicity we assume that M = + L, and we change the production function to: Y t = 0 ) σ y σ σ σ it di, A0) with y it = q it l it and σ >. A competitive fringe has access at the beginning of the period) to productivity level q it /. We focus only on productive innovations here, so that an when a firm innovates, q it increases by a factor and the fringe gets access to the previous) q it. We assume that is small enough that the firm is forced to limit pricing. We take the final good as the numeraire. Then we get that the equilibrium prices are: Moreover y it = p σ it Y t so that π it = p L it = w t q it in sectors without innovation p H it = w t q it in sectors with innovation ) p it wt q it p σ it Y t. Hence: A) A2) π L it = η σ L wt q it ) σ Y t = η σ L wt q 0 it ) σ Y t, πit H = η ) σ H wt Y ηh σ t = η ) σ H wt Y q it qit 0 t. Here the superscript 0 indicates productivities pre-innovation. A natural assumption e.g. Aghion and Howitt, 998, Chapter 9) is that pre-innovation, all agents in the economy have access to the technology qit 0 = q 0 it )di = Q t. Then so that Π H t > Π L t. Π L t = η σ L wt Q t ) σ Y t and Π H t = η ) σ H wt Y t Q t 0

Note that Π H t /Π L t = / is bigger than in the Cobb-Douglas case. Another difference with the Cobb-Douglas case is the term t ηl σ ) σ w Q t which reflects a competition effect whereby innovation by others increases the wage and therefore raises the price of my own good because the production cost has increased. To express the entrepreneur share of income, we need to solve for the equilibrium wage w t. A), A2) and the dynamics of q it give p L t = w t q t and p H t which together with the price normalization = w t q t, µ t p σ t,h + µ t) p σ t,l =, immediately yields: ) σ ) σ wt w t µ t + µ t ) =, q t q t so that: ) σ wt = q t µ t + µ t ) η σ. A3) L The entrepreneur share of income can then be written as: entrepreneur share t = µ tπ H,t + µ t ) Π L,t = + Y t ) µ t µ t + µ t ) η σ L A4) This expression is increasing in µ t, which is still given by µ t = x It + z) x Et. In addition, we know that social mobility is still equal to Ψ t = x Et z) /L. Therefore, we still get: Proposition 4 i) A higher rate of innovation by a potential entrant, x Et, is associated with a higher entrepreneur share of income and a higher rate of social mobility, but less so the higher the entry barriers z are; ii) A higher rate of innovation by an incumbent, x It, is associated with a higher entrepreneur share of income but has no direct impact on social mobility. We now turn to the endogenous determination of the innovation rates of entrants and incumbents. We use the same innovation function as in the baseline model. The maximization problem of the incumbent is: max x I { x I wt q t ) σ + x I z) x E) wt q t ) σ + z) x w t x 2 I E θ I Y t 2 We then obtain that the optimal innovation decision is simply x I,t = x I = η σ L )) ) σ wt. A5) θ I q t } Y t.

A potential entrant in sector i solves the following maximization problem: { ) } σ wt max z) x E + x E z)) w t x 2 E θ E Y t, x E q t Y t 2 Therefore, we get ) ) σ x E,t = x wt E = w t q t Y t z) θ E. A6) Using A4), we get: w t = entrepreneur share t Y t L = L µ t + µ t ) η σ L µ t + µ t ) η σ L Plugging this expression and A3) into A5) and A6) we obtain: ) x I,t = η σ L θ I x E,t = x E = µ t η σ L L [ µ t ) + η σ L + µ t ) η σ L µ t + µ t ) η σ L., A7) ] z θ E. A8) The above expression shows that a change in µ t for instance because of a change in incumbent innovation x I ) has an ambiguous effect on entrant innovation x Et. On the one hand as in the Cobb-Douglas case, an increase in µ t reduces w t /Y t and therefore makes the outside option of the entrant less appealing, which leads to higher innovation by the entrant. On the other hand, an increase in µ t also increases w t /q t. This is the competition effect mentioned above which decreases entrant innovation. As a result, a reduction in incumbent innovation costs θ I ), which increases incumbent innovation may reduces entrant innovation and thereby social mobility. Overall, we obtain: Proposition 5 An increase in entrant innovation costs θ E reduces entrant innovation x E, incumbent innovation x I and social mobility. An increase in incumbent innovation costs θ I reduces incumbent innovation x I and total innovation µ but has an ambiguous impact on entrant innovation and social mobility. Proof. To solve for the the total number of innovations, combine A7) and A8) to get: ) [ ] µ t = η σ L µ L t + µ t ) η σ L + θ I µ t η σ L ) + η σ L This expression can be rewritten as: ) η σ L ) ) θ I µt η σ L + η σ + L µt L ηl σ µ t + µ t ) η σ L ) µ t L [ µ t + µ t ) η σ L 2 η σ L ] z) 2 z) 2 θ E. θ E = A9)

As > 0 which results from assuming that for any µ L ηl σ t, Π H > w), the LHS is decreasing in µ t. Therefore this expression defines µ t uniquely, and we get that total innovation µ t decreases in the entrant and incumbent innovation costs θ E and θ I. Since µ is decreasing in θ E and x I is decreasing in µ, we have that x I is increasing in θ E. Assume by contradiction that x E is also increasing in θ E, then µ is increasing in θ E which is impossible. Therefore x Et and so Ψ social mobility) are decreasing in θ E. Rewrite A9) as: x It µ t + L ηl σ ) µ t L [ µ t + µ t ) η σ L η σ L ] z) 2 θ E =, A20) an increase in θ I decreases µ t which increases the LHS, therefore it must decrease x It. Hence x It is decreasing in θ I. Yet, since x Et is ambiguous in µ, it is also ambiguous in θ I, and so is Ψ t. 3

B Empirical Appendix B. Industry composition Our regressions so far have mostly) abstracted from industry composition. Here, we look more closely at two sectors Finance and Natural resources) which are likely to have a large impact on top income shares. Patents as a tool to appropriate innovation The role played by patents in appropriating innovation is heterogeneous across sectors. In particular Cohen et al. 2000) conducted a survey analysis to assess the role played by patents versus other means such as trade secrecy in protecting innovation. Based on their results specifically Tables and 2), we measure the use of patents as an appropriability mechanisms for product and process innovations at the SIC level. From this, and using the state-industry composition, we are able to compute a state level measure of the use of patents in appropriating innovation weighting the sector specific score by the share of the sector in the state GDP. We then split states into two groups of equal size each year based on this measure. Table C5 reports the results: we find that the correlation is positive for both groups of states, but it is stronger in states which use patents more intensively to appropriate innovation. That is, the effect of patents on top income inequality is stronger when they are more used to protect innovation. Role of the financial sector The financial sector is heavily represented in the top % income share: Bakija et al. 2008) find that 3.2% of primary tax payers belonging to the top % worked in the financial sector in 2005. The above regressions already controlled for the share of the financial sector in state GDP. Tables C6 and C7 perform additional tests in OLS and IV regressions respectively. First, we control for the average employee compensation in the financial sector to capture any direct effect of this variable on the top % income share column ). Second, we exclude states in which financial activities account for a large fraction of GDP, namely New York, Connecticut, Delaware and Massachusetts column 2). Third, we exclude financial innovations patents belonging to the class 705: Financial, Business Practice ) in column 3. In each case, the effect of innovation on the top % income share is significant and positive, showing very stable values when moving from one specification to another innovation is measured as the number of citations within 5 years per capita). Relatedly financial development may impact both innovation by providing easier access to credit to potential innovators) and income inequality at the top by boosting high wages). We build a specific variable to control for this channel. We map patents to 6 NAICS industries and for each state we compute the share of patents in each industry. Then, knowing the industry-level of external financial dependence, we compute the average level of external financial dependence of innovations in each state. 3 This variable denoted 3 We use the mapping between technological classes and NAICS codes from the http://www.uspto. gov/web/offices/ac/ido/oeip/taf/data/naics_conc/uspto website. To measure external financial dependence at the industry level, we use the numbers computed by Kneer 203) and averaged over the period 980-989 external financial dependence is defined as the ratio of capital expenditure minus cash flow divided by capital expenditure as in Rajan and Zingales, 998). 4

EFD) should capture a variation in innovation at state-level driven by a sector that is highly dependent on external finance. Regression results are presented in column 4: the effect of innovation remains significant with a slightly lower coefficient than in the baseline regression. Role of the oil industry Natural resources, notably oil extraction, represent a large share of GDP in certain states in Wyoming, West Virginia and particularly Alaska, oil extraction activities account for almost 30% of total GDP in 200), so that in these states the top % income share is likely to be affected by these sectors which are quite volatile. To address this issue, we control for the share of natural resources and oil extraction in GDP in column 5 of Table C6 and C7. Moreover, we remove patents from class 208 Mineral oils: process and production) and 96 Mineral oils: Apparatus) in column 6. Here again, our results remain significant. Role of other sectors Finally, we check whether our results are driven by certain sectors which are particularly innovative. We use the mapping between patent technological classes and NAICS sectors to remove patents related to category 334: Computer and Electronic Products to exclude the fast-growing computer industries. Similarly, we remove patents from the pharmaceutical sector NAICS 3254) and from the electrical equipment sector NAICS 335). Next, we add controls for the share of these three sectors. 4 Then, we use the COMTRADE database to look at the extent to which our effect of innovation on top income inequality is driven more by more export-intensive sectors. Over the period from 976 to 203, we identify three such sectors Transportation, Machinery and Electrical Machinery), and we check whether our results are robust to excluding them. The results are shown in Table C8, where we conduct OLS regressions using the number of citations within five years to measure innovation. Innovation remains positively and significantly correlated with the top % income share, and the coefficient remains stable across specifications. B.2 Moving inventors and agglomeration effects Moving inventors. Talented and rich inventors may decide to move to states that are more innovative or to benefit from lower taxes. This could enhance the positive correlation between top income inequality and innovation but through a very different mechanism from the one in our model. However, using disambiguated information on the inventors of patents from the USPTO, we are able to identify the location of successive patents by a same inventor. This in turn allows us to delete patent from inventors that patented in various states. Our results still hold with slightly lower coefficients than in our baseline see Table C9 and C20 for OLS and IV results using only patents by single-state inventors). Agglomeration effects. One may wonder whether our results do not reflect agglomeration effects: for example, suppose that some exogenous investment taking place in one particular location say the Silicon Valley), makes that location more attractive to skilled/talented 4 In order to obtain complete series, we replace the pharmaceutical sector by the whole chemistry manufacturing sector NAICS 325). 5

individuals from other parts of the US. The resulting increased agglomeration of high-skill individuals should lead to both a higher top % income share and a higher level of innovation in the corresponding US state, but without the former necessarily resulting from the latter. Glaeser et al. 2009) and Baum-Snow and Pavan 203) point to higher density cities displaying more income inequality while Carlino et al. 2007) and Gyourko et al. 203) point at a positive correlation between population density and innovation and the importance of superstar cities. Figure 2 in the introduction suggests that this should not be such a big concern for our analysis: neither California nor Massachusetts are among the states that show the fastest increase in both innovation and top income inequality over the period we analyze. To address the agglomeration objection head on, we need a measure of density that is not distorted by large rural areas like upstate New York. In the spirit of Ciccone and Hall 996), we proceed as follows: in any state, we consider the 0 percent most populated counties in 970, in 205, and on average over the period 970-205 from the BEA regional accounts. We then compute the population density in these counties every year. This yields three indicators of urban density that are meant to capture the fact that some states were more attractive over the corresponding periods. Running our previous regressions with these additional control variables does not affect our results as seen in Table C2 OLS regression results in columns, 2, 3 and IV results in columns 4, 5, 6). 6

C Additional empirical results Table C: Top 0.% and Top 0.0% income share and innovation from incumbents and entrants Dependent variable Log of Top 0.% Income Share Log of Top 0.0% Income Share ) 2) 3) 4) 5) 6) Measure of innovation Cit5 Cit5 Cit5 Cit5 Cit5 Cit5 Innovation by entrants 0.030*** 0.025** 0.037*** 0.030** 0.00) 0.00) 0.04) 0.04) by incumbents 0.04*** 0.036*** 0.059*** 0.053*** 0.009) 0.009) 0.02) 0.02) Gdppc 0.063 0.024 0.025 0.048-0.00-0.008 0.094) 0.097) 0.095) 0.3) 0.33) 0.3) Popgrowth 4.047*** 3.823*** 3.920*** 5.49*** 5.69*** 5.305***.382).36).378).923).895).96) Finance 0.39*** 0.78*** 0.73*** 0.39** 0.95*** 0.89*** 0.048) 0.050) 0.050) 0.064) 0.068) 0.067) Government -0.07-0.025-0.04 0.0 0.002 0.06 0.032) 0.032) 0.032) 0.04) 0.04) 0.04) Unemployment -0.007-0.00* -0.009* -0.0-0.06** -0.05* 0.006) 0.006) 0.006) 0.008) 0.008) 0.008) TaxK -0.057*** -0.058*** -0.059*** -0.074*** -0.076*** -0.076*** 0.008) 0.007) 0.007) 0.00) 0.00) 0.00) TaxL 0.026*** 0.027*** 0.029*** 0.028** 0.029** 0.032** 0.00) 0.00) 0.00) 0.03) 0.03) 0.03) R 2 0.826 0.827 0.829 0.785 0.787 0.789 Observations 377 377 377 377 377 377 Notes: Variable description is given in Table. Innovation is taken in log and lagged by two years. Panel data OLS regressions with state and year fixed effects. Time span for innovation: 980-2006. Autocorrelation and heteroskedasticity robust standard errors using the Newey-West variance estimator are presented in parentheses. ***, ** and * respectively indicate 0.0, 0.05 and 0. levels of significance. 7

Table C2: Distribution of patents located by inventors and assignees State Inventors Assignees State Inventors Assignees AK 0.% 0.0% MT 0.% 0.% AL 0.4% 0.3% NC 2.3%.4% AR 0.2% 0.% ND 0.% 0.0% AZ.8% 0.7% NE 0.2% 0.2% CA 22.8% 24.0% NH 0.7% 0.4% CO 2.2%.% NJ 4.% 5.% CT 2.0% 2.9% NM 0.4% 0.3% DC 0.%.2% NV 0.4% 0.6% DE 0.4%.9% NY 7.%.6% FL 2.8%.9% OH 3.5% 3.8% GA.7%.0% OK 0.5% 0.3% HI 0.% 0.0% OR.8% 0.7% IA 0.7% 0.7% PA 3.5% 2.9% ID.8% 2.2% RI 0.3% 0.2% IL 3.9% 5.0% SC 0.6% 0.4% IN.5% 0.9% SD 0.% 0.0% KS 0.5% 0.3% TN 0.8% 0.6% KY 0.5% 0.4% TX 7.0% 6.6% LA 0.5% 0.2% UT 0.8% 0.6% MA 4.4% 4.% VA.4% 0.9% MD.7%.% VT 0.4% 0.% ME 0.2% 0.% WA 2.9% 2.6% MI 4.2% 4.8% WI 2.0%.8% MN 3.0% 3.0% WV 0.% 0.0% MO.0% 0.8% WY 0.% 0.0% MS 0.2% 0.% Notes: Distribution granted patent with an application year equal to 2000 by state whether the allocation is based on the address of the inventors or the address of the assignees. 8

Dependent variable Table C3: Top % income share and innovation allocation by assignee Log of Top % Income Share ) 2) 3) 4) 5) 6) Measure of innovation Patents Cit5 Claims Generality Top5 Top Innovation 0.06** 0.025*** 0.09*** 0.06** 0.020*** 0.022*** 0.007) 0.006) 0.006) 0.007) 0.005) 0.004) Gdppc 0.06** 0.2** 0.07** 0.** 0.093** 0.097** 0.04) 0.045) 0.043) 0.045) 0.044) 0.044) Popgrowth 0.850 0.923 0.90 0.929 0.98 0.937 0.640) 0.678) 0.646) 0.664) 0.688) 0.666) Finance 0.07** 0.084** 0.072** 0.07** 0.085** 0.089** 0.034) 0.035) 0.034) 0.034) 0.035) 0.036) Government -0.020* -0.025** -0.02* -0.02* -0.023** -0.025** 0.0) 0.0) 0.0) 0.0) 0.0) 0.0) Unemployment -0.005* -0.005-0.005-0.005-0.005-0.005 0.003) 0.003) 0.003) 0.003) 0.003) 0.003) TaxK -0.038*** -0.038*** -0.038*** -0.038*** -0.037*** -0.038*** 0.004) 0.004) 0.004) 0.004) 0.004) 0.004) TaxL 0.07*** 0.04** 0.07*** 0.07*** 0.03** 0.03** 0.006) 0.006) 0.006) 0.006) 0.006) 0.006) R 2 0.889 0.895 0.889 0.889 0.895 0.895 Observations 734 58 734 734 58 58 Notes: Innovation is taken in log and lagged by two years and is assigned to a state using the assignee location. Panel data OLS regressions with state and year fixed effects. Time span for innovation: 976-2009 columns, 3 and 4) and 976-2006 columns 2, 5 and 6). Autocorrelation and heteroskedasticity robust standard errors using the Newey-West variance estimator are presented in parentheses. ***, ** and * respectively indicate 0.0, 0.05 and 0. levels of significance. 9

Table C4: Descriptive statistics by state in two distinctive years 980 2005 980 2005 Top % Innovation Top % Innovation Top % Innovation Top % Innovation AK 5.33 29 2.47 44 MT 8.02 46 6.35 44 AL 0.0 40 8.48 38 NC 9.03 68 7.04 428 AR 0.05 37 6.68 69 ND 9.62 58 3.23 303 AZ 8.56 74 22.66 5 NE 9.33 46 5.24 92 CA 9.9 252 24.20 57 NH 8.48 229 8.0 853 CO 9.3 209 9.56 855 NJ 9.83 475 20.77 68 CT 2.24 47 3.02 05 NM 8.90 55 5.63 30 DC 4.48 00 23.94 60 NV.09 8 33.30 574 DE 0.9 588 2.38 406 NY 2.08 229 30.25 549 FL 2.23 04 3.78 246 OH 8.98 208 5.86 582 GA 8.95 64 9. 30 OK.44 228 7.74 357 HI 7.52 29 6.47 25 OR 8.25 09 6.9 25 IA 8.24 3 2.92 38 PA 9.37 28 8.7 373 ID 7.68 86 8.08 483 RI 0.25 33 7.36 389 IL 9.63 220 2.67 462 SC 8.6 76 7.74 62 IN 8.44 79 5.52 346 SD 8.58 32 6.94 3 KS 0.7 74 6.09 40 TN 0.09 75 8.76 29 KY 9.69 83 5.76 29 TX 2.8 69 2.90 562 LA.22 63 7.65 82 UT 7.79 24 8.49 768 MA 0.03 324 23.79 392 VA 7.97 9 7.2 295 MD 8.3 83 7.34 44 VT 7.97 80 6.3 64 ME 8.55 73 5.66 40 WA 8.37 34 9.69 689 MI 8.9 233 6.2 678 WI 8.2 67 6.48 529 MN 9.3 260 8.24 54 WV 9.54 8 4.97 49 MO 9.96 85 7. 228 WY 9.00 27 28.52 6 MS 0.48 2 5.8 52 Notes: Number of citations within a five-year window per million of inhabitants and top % income share for all 5 states in 980 and 2005. 20

Table C5: Top % income share and innovation with clustered standard errors Dependent variable Log of Top % Income Share ) 2) 3) 4) 5) 6) Measure of innovation Patents Cit5 Claims Generality Top5 Top Innovation 0.03 0.049*** 0.07 0.024 0.026*** 0.020*** 0.020) 0.04) 0.04) 0.06) 0.007) 0.005) Gdppc 0.089 0.063 0.096 0.093 0.074 0.087 0.086) 0.084) 0.088) 0.087) 0.084) 0.088) Popgrowth 0.943.089* 0.943 0.934 0.990.074* 0.637) 0.66) 0.640) 0.645) 0.648) 0.647) Finance 0.080 0.09* 0.072 0.078 0.098 0.094 0.064) 0.063) 0.065) 0.064) 0.06) 0.06) Government -0.08-0.09-0.08-0.08-0.08-0.06 0.027) 0.026) 0.028) 0.028) 0.025) 0.025) Unemployment -0.006-0.006-0.005-0.006-0.006-0.005 0.004) 0.005) 0.004) 0.004) 0.005) 0.005) TaxK -0.038*** -0.039*** -0.038*** -0.038*** -0.038*** -0.037*** 0.008) 0.008) 0.008) 0.008) 0.008) 0.008) TaxL 0.07 0.04 0.07 0.08 0.03 0.03 0.02) 0.02) 0.02) 0.02) 0.02) 0.02) R 2 0.889 0.896 0.889 0.889 0.895 0.895 Observations 734 58 734 734 58 58 Notes: Variable description is given in Table. Innovation is taken in log and lagged by two years. The dependent variable is the log of the top % income share. Panel data OLS regressions with state and year fixed effects. Time span for innovation: 976-2009 columns, 3 and 4) and 976-2006 columns 2, 5 and 6). Heteroskedasticity robust standard errors clustered at the state level are presented in parenthesis. ***, ** and * respectively indicate 0.0, 0.05 and 0. levels of significance. 2

Dependent variable Table C6: Top % income share and innovation Log of Top % Income Share ) 2) 3) 4) 5) Measure of innovation Cit5 Cit5 Cit5 Cit5 Cit5 Innovation 0.039*** 0.036*** 0.043*** 0.045*** 0.049*** 0.009) 0.009) 0.009) 0.009) 0.009) Gdppc 0.075 0.055 0.052 0.063 0.054) 0.053) 0.052) 0.044) Popgrowth.46.255* 0.864.089 0.720) 0.73) 0.739) 0.700) Finance 0.0*** 0.8*** 0.09*** 0.04) 0.043) 0.036) Government -0.007-0.00-0.09* 0.03) 0.03) 0.0) Unemployment -0.006* -0.006* 0.004) 0.003) TaxK -0.039*** 0.004) TaxL 0.04** 0.006) R 2 0.880 0.882 0.884 0.884 0.896 Observations 58 58 58 58 58 Notes: Variable description is given in Table. Innovation is taken in log and lagged by two years. The dependent variable is the log of the top % income share. Panel data OLS regressions with state and year fixed effects. Time span for innovation: 976-2006. Autocorrelation and heteroskedasticity robust standard errors using the Newey-West variance estimator are presented in parentheses. ***, ** and * respectively indicate 0.0, 0.05 and 0. levels of significance. 22

Table C7: Top % income share and number of entrepreneurs Dependent variable Measure of innovation Log of Top % Income Share ) 2) 3) 4) Log of entrepreneurs per capita Innovation 0.02* 0.09* 0.024** 0.022** 0.0) 0.00) 0.00) 0.00) Gdppc 0.94** 0.47 0.27 0.087) 0.090) 0.095) Popgrowth 3.369*** 2.09** 2.090** 0.92) 0.848) 0.839) Finance -0.045-0.052 0.062) 0.062) Government -0.027-0.024 0.045) 0.046) Unemployment -0.026*** -0.024*** 0.005) 0.005) TaxK -0.005 0.005) TaxL -0.009 0.007) R 2 0.868 0.878 0.887 0.889 Observations 507 507 507 507 Notes: Variable description is given in Table. The number of entrepreneurs is taken from Guzman and Stern 206) and is available for 34 states from 988. Panel data OLS regressions with state and year fixed effects. Autocorrelation and heteroskedasticity robust standard errors using the Newey-West variance estimator are presented in parentheses. ***, ** and * respectively indicate 0.0, 0.05 and 0. levels of significance. 23

Table C8: Top % income share and innovation by entrants and incumbents - alternative definition of entrants Dependent variable Log of Top % Income Share ) 2) 3) 4) 5) 6) Measure of innovation Patents Patents Patents Cit5 Cit5 Cit5 Innovation by entrants 0.02 0.006 0.09*** 0.05** 0.009) 0.009) 0.007) 0.007) by incumbents 0.02*** 0.09*** 0.025*** 0.022*** 0.007) 0.007) 0.005) 0.006) Gdppc 0.23** 0.076 0.00* 0.086 0.049 0.055 0.053) 0.056) 0.054) 0.059) 0.060) 0.059) Popgrowth 2.085*** 2.0*** 2.55*** 2.233*** 2.82*** 2.92*** 0.75) 0.752) 0.760) 0.833) 0.823) 0.84) Finance 0.095*** 0.4*** 0.08*** 0.08*** 0.36*** 0.3*** 0.032) 0.032) 0.032) 0.033) 0.033) 0.033) Government -0.022-0.024-0.02-0.02-0.028-0.02 0.02) 0.02) 0.02) 0.020) 0.02) 0.02) Unemployment -0.00-0.003-0.003-0.00-0.003-0.003 0.003) 0.003) 0.003) 0.004) 0.004) 0.004) TaxK -0.037*** -0.039*** -0.038*** -0.038*** -0.039*** -0.039*** 0.005) 0.004) 0.004) 0.005) 0.005) 0.005) TaxL 0.025*** 0.027*** 0.027*** 0.023*** 0.023*** 0.024*** 0.006) 0.006) 0.006) 0.007) 0.007) 0.007) R 2 0.85 0.85 0.853 0.859 0.86 0.862 Observations 530 530 530 377 377 377 Notes: Variable description is given in Table. Innovation by entrants is a count of innovation that restricts to patents whose assignee first patented less than 5 years ago. Other patents enter in the count of Innovation by incumbents. Both these measures of innovation are taken in log and lagged by two years. Panel data OLS regressions with state and year fixed effects. Time span for innovation: 980-2009 columns to 3) and 980-2006 columns 4 to 6). Autocorrelation and heteroskedasticity robust standard errors using the Newey-West variance estimator are presented in parentheses. ***, ** and * respectively indicate 0.0, 0.05 and 0. levels of significance. 24

Table C9: Descriptive statistics on the Senate appropriation committee composition. Number of years with Number of years with Senator 2 Senators Senator 2 Senators AK 28 0 MT 22 0 AL 4 0 NC 2 0 AR 29 0 ND 25 0 AZ 20 0 NE 7 0 CA 4 0 NH 32 0 CO 7 0 NJ 27 0 CT 2 0 NM 37 0 DE 3 0 NV 32 FL 2 0 NY 4 0 GA 0 0 OH 6 0 HI 33 6 OK 6 0 IA 20 2 OR 25 0 ID 24 0 PA 36 0 IL 2 0 RI 0 IN 9 0 SC 34 0 KS 7 0 SD 7 0 KY 26 0 TN 20 0 LA 33 0 TX 20 0 MA 8 0 UT 27 0 MD 29 VA 0 0 ME 3 0 VT 30 2 MI 0 WA 2 0 MN 0 0 WI 3 8 MO 30 0 WV 39 0 MS 3 8 WY 7 0 Notes: The table gives the number of years between 970 and 2008 with exactly one resp. 2) senator seating in the appropriation committee. The exact composition can be found in http://www.gpo.gov/fdsys/pkg/cdoc-0sdoc4/pdf/ CDOC-0sdoc4.pdfthe appropriation committee official website. 25

Table C0: Innovation and various measures of inequality - IV results Dependent Variable Top % Avgtop Top 0 % Overall Gini G99 Atkinson ) 2) 3) 4) 5) 6) Measure of innovation Cit5 Cit5 Cit5 Cit5 Cit5 Cit5 Innovation 0.85** 0.076* 0.033-0.07-0.047 0.072* 0.078) 0.046) 0.033) 0.028) 0.036) 0.037) Gdppc -0.079-0.030-0.032 0.00-0.005 0.066 0.093) 0.053) 0.045) 0.034) 0.044) 0.047) Popgrowth.663* 0.85 0.46-0.274-0.467* 0.587 0.969) 0.578) 0.455) 0.203) 0.28) 0.370) Finance 0.23*** 0.09*** 0.045 0.000-0.044 0.080** 0.068) 0.039) 0.03) 0.023) 0.03) 0.033) Government -0.078*** -0.046*** -0.022* -0.003 0.04-0.038*** 0.024) 0.05) 0.02) 0.00) 0.02) 0.0) Unemployment -0.02** -0.004-0.000-0.000 0.003-0.004* 0.005) 0.003) 0.002) 0.002) 0.002) 0.002) TaxK -0.039*** -0.06*** -0.00-0.006*** -0.00-0.08*** 0.005) 0.003) 0.003) 0.002) 0.002) 0.002) TaxL 0.06** 0.007* -0.00 0.004** 0.00 0.03*** 0.007) 0.004) 0.003) 0.002) 0.003) 0.003) Highways 0.5 0.97-0.044-0.009-0.27 0.203 0.464) 0.265) 0.82) 0.3) 0.70) 0.243) Military -0.004-0.008-0.00** -0.009** -0.009** -0.007** 0.008) 0.005) 0.004) 0.004) 0.004) 0.003) R 2 0.874 0.807 0.427 0.865 0.74 0.933 F-stat on excluded instruments 4.2 4.2 4.2 4.2 4.2 4.2 Observations 550 550 550 550 550 550 Notes: Variable description is given in Table. Innovation is taken in log and lagged by two years. The dependent variables are also taken in log. Panel data IV 2SLS regressions with state and year fixed effects. Innovation is instrumented by the number of senators that seat on the appropriation committee. The lag between the instrument and the endogenous variable is set to 3 years. Time span for innovation: 976-2009 for columns, 3 and 4 and 976-2006 for columns 2, 5 and 6. DC is removed from the sample because it has no senators. Autocorrelation and heteroskedasticity robust standard errors using the Newey-West variance estimator are presented in parentheses. ***, ** and * respectively indicate 0.0, 0.05 and 0. levels of significance. 26

Table C: Top % income share and innovation at different lags - IV results Dependent variable Log of Top % Income Share ) 2) 3) 4) 5) Measure of innovation Cit5 Cit5 Cit5 Cit5 Cit5 Lag of innovation 2 years 3 years 4 years 5 years 6 years Innovation at t 2 0.2*** 0.077) Innovation at t 3 0.43* 0.075) Innovation at t 4 0.42* 0.077) Innovation at t 5 0.034 0.070) Innovation at t 6-0.004 0.075) Gdppc -0.33-0.098-0.099 0.025 0.073 0.099) 0.06) 0.07) 0.094) 0.098) Popgrowth 2.684** 2.784*** 2.730*** 2.570*** 2.435***.085).038) 0.972) 0.920) 0.90) Finance 0.283*** 0.22*** 0.202*** 0.3*** 0.0** 0.074) 0.066) 0.059) 0.049) 0.047) Government -0.084*** -0.088*** -0.088*** -0.09*** -0.5*** 0.030) 0.029) 0.029) 0.028) 0.029) Unemployment -0.04** -0.008-0.008-0.003-0.00 0.006) 0.005) 0.005) 0.005) 0.005) TaxK -0.037*** -0.036*** -0.036*** -0.035*** -0.035*** 0.006) 0.005) 0.005) 0.005) 0.005) TaxL 0.023*** 0.023*** 0.022*** 0.07** 0.05** 0.008) 0.008) 0.008) 0.007) 0.007) Highways 2.029***.355***.342*** 0.928* 0.783 0.606) 0.497) 0.53) 0.490) 0.50) Military -0.02-0.00-0.00-0.007-0.006 0.008) 0.008) 0.008) 0.007) 0.007) R 2 0.867 0.88 0.875 0.880 0.864 F-stat on excluded instruments 6.3 3.2 2.8 3.4.8 Observations 500 500 450 400 350 Notes: Variable description is given in Table. Innovation is taken in log and lagged by 2 to 6 years. Panel data IV 2SLS regressions with state and year fixed effects. Innovation is instrumented by the number of senators that seat on the appropriation committee. The lag between the instrument and the endogenous variable is set to 3 years. Time span for innovation: 979-2006. DC is removed from the sample because it has no senators. Autocorrelation and heteroskedasticity robust standard errors using the Newey-West variance estimator are presented in parentheses. ***, ** and * respectively indicate 0.0, 0.05 and 0. levels of significance. 27

Table C2: Robustness : regression of innovation on top % income share using two instruments Dependent variable Log of Top % Income Share ) 2) 3) 4) 5) 6) Measure of Innovation Patents Cit5 Claims Generality Top5 Top Innovation 0.97*** 0.63*** 0.25*** 0.35*** 0.24*** 0.5*** 0.055) 0.035) 0.035) 0.035) 0.028) 0.038) Gdppc -0.09-0.08-0.079-0.064-0.08-0.08 0.090) 0.076) 0.083) 0.078) 0.085) 0.08) Popgrowth 2.97*** 2.675*** 3.05*** 2.80*** 2.627*** 2.934***.07).024) 0.954) 0.95).02).07) Finance 0.89*** 0.23*** 0.49*** 0.58*** 0.20*** 0.253*** 0.043) 0.04) 0.037) 0.038) 0.042) 0.057) Government -0.09*** -0.088*** -0.0*** -0.7*** -0.038-0.027 0.026) 0.027) 0.026) 0.026) 0.033) 0.037) Unemployment -0.02*** -0.02*** -0.009** -0.009** -0.0** -0.007 0.004) 0.004) 0.004) 0.004) 0.004) 0.005) TaxK -0.034*** -0.033*** -0.032*** -0.033*** -0.034*** -0.03*** 0.005) 0.005) 0.004) 0.004) 0.005) 0.006) TaxL 0.05*** 0.02*** 0.03*** 0.04*** 0.00*** 0.02*** 0.004) 0.004) 0.004) 0.004) 0.004) 0.005) Highways.32***.598***.060**.085***.467***.8*** 0.440) 0.444) 0.42) 0.4) 0.450) 0.529) Military 0.004 0.00 0.002-0.00-0.000 0.0 0.008) 0.008) 0.008) 0.008) 0.009) 0.00) R 2 0.836 0.85 0.844 0.847 0.86 0.75 F-stat on the excluded 8. 26.9 34.7 42.4 22.9 4.0 instruments Sargan-Hansen J-stat 0.256 0.420 0.54 0.20 0.533 0.797 p-value) Observations 500 350 500 500 350 350 Notes: Variable description is given in Table. Innovation is taken in log and lagged by two years. Panel data IV 2SLS regressions with state and year fixed effects. Innovation is instrumented by the number of senators that seat on the appropriation committee and by a measure of spillover as described in section6.. The lag between the first instrument and the endogenous variable is set to 3 years while the lag between the second instrument and the endogenous variable is year. Two additional controls for demand shocks are included, as explained in subsection 6.. Time span: 983-20 for columns 983-2008 for columns 2 to 6. DC is removed from the sample because it has no senators. Autocorrelation and heteroskedasticity robust standard errors using the Newey-West variance estimator are presented in parentheses. ***, ** and * respectively indicate 0.0, 0.05 and 0. levels of significance. 28

Table C3: Regression of innovation on Top % income share using only the spillover instrument Dependent variable Log of Top % Income Share ) 2) 3) 4) 5) 6) Measure of innovation Patents Cit5 Claims Generality Top5 Top Innovation 0.20*** 0.68*** 0.27*** 0.38*** 0.26*** 0.65*** 0.059) 0.039) 0.036) 0.037) 0.029) 0.047) Gdppc -0.0-0.07-0.067-0.050-0.080-0.03 0.089) 0.073) 0.080) 0.074) 0.076) 0.082) Popgrowth 2.530*** 2.544** 2.538*** 2.297*** 2.240** 3.54*** 0.937) 0.999) 0.865) 0.823) 0.976).056) Finance 0.92*** 0.237*** 0.49*** 0.58*** 0.208*** 0.276*** 0.044) 0.042) 0.037) 0.038) 0.043) 0.066) Government -0.023-0.04-0.030-0.030 0.06 0.043 0.020) 0.020) 0.020) 0.020) 0.022) 0.029) Unemployment -0.03*** -0.02*** -0.00** -0.00*** -0.02*** -0.007 0.005) 0.004) 0.004) 0.004) 0.004) 0.005) TaxK -0.036*** -0.034*** -0.033*** -0.035*** -0.034*** -0.03*** 0.004) 0.005) 0.004) 0.004) 0.005) 0.006) TaxL 0.09*** 0.05*** 0.07*** 0.07*** 0.02*** 0.05*** 0.004) 0.004) 0.004) 0.004) 0.004) 0.005) R 2 0.824 0.838 0.834 0.836 0.804 0.70 F-stat on the excluded 3.0 4.4 6.7 73.0 39.2 9.6 instruments Observations 530 377 530 530 377 377 Notes: Variable description is given in Table. Innovation is taken in log and lagged by 2 years. Panel data IV 2SLS regressions with state and year fixed effects. Innovation is instrumented by a measure of spillover as described in section 6.. The lags between the instruments and the endogenous variable is set to year. Control for spatial correlation involves adding two additional controls for demand shocks as explained in subsection 6.. Time span for innovation: 98-2006. Autocorrelation and heteroskedasticity robust standard errors using the Newey-West variance estimator are presented in parentheses. ***, ** and * respectively indicate 0.0, 0.05 and 0. levels of significance. 29

Table C4: Innovation and social mobility at the Commuting Zone level. Entrants and incumbents innovation Dependent variable AM25 P-5 P2-5 AM25 P-5 P2-5 AM25 ) 2) 3) 4) 5) 6) 7) Measure of innovation Cit5 Cit5 Cit5 Cit5 Cit5 Cit5 Cit5 Innovation by entrants 0.023** 0.*** 0.048** 0.09* 0.009) 0.039) 0.022) 0.00) by incumbents 0.06** 0.075** 0.034* 0.006 0.008) 0.033) 0.020) 0.007) Gdppc -0.08-0.02-0.37-0.048 0.45-0.072-0.086 0.057) 0.235) 0.43) 0.064) 0.270) 0.46) 0.058) Popgrowth -.770** -4.074-7.770*** -.849** -4.476-7.948*** -.825** 0.82) 3.550) 2.222) 0.838) 3.670) 2.30) 0.863) Finance 0.08-0.05 0.049 0.07-0.02 0.046 0.08 0.08) 0.070) 0.053) 0.09) 0.073) 0.054) 0.09) Government 0.035 0.20 0.08 0.039 0.23 0.090 0.035 0.033) 0.36) 0.094) 0.034) 0.45) 0.096) 0.033) Unemployment -0.225-0.4-0.805-0.99-0.028-0.747-0.203 0.208) 0.866) 0.549) 0.27) 0.900) 0.564) 0.20) Tax -0.00-0.003-0.003-0.00-0.004-0.003-0.00 0.002) 0.006) 0.005) 0.002) 0.006) 0.005) 0.002) School Expenditure 0.009 0.035 0.028 0.007 0.024 0.023 0.009 0.009) 0.033) 0.024) 0.009) 0.036) 0.025) 0.009) Employment Manuf -0.334*** -.400*** -.037*** -0.385*** -.640*** -.46*** -0.358*** 0.09) 0.394) 0.323) 0.3) 0.43) 0.339) 0.3) R 2 0.97 0.225 0.24 0.85 0.207 0.209 0.20 Observations 662 670 670 662 670 670 662 Notes: Variable description is given in Table. The number of citations per inhabitants is averaged over the period 998-2008 and social mobility measures are taken when the child is 30 between 20 and 202 compared to his parents during the period 996-2000, all these measures are taken in logs. Cross section OLS regressions with CZs weighted by population. Regressions also include a dummy for being an urban CZs. Heteroskedasticity robust standard errors clustered at the state level are reported in parentheses. ***, ** and * respectively indicate 0.0, 0.05 and 0. levels of significance. 30

Table C5: The effectiveness of appropriability mechanisms for innovation Dependent variable Log of Top % Income Share ) 2) 3) States Above Median Below Median All Measure of innovation Cit5 Cit5 Cit5 Innovation 0.065*** 0.034*** 0.044*** 0.06) 0.009) 0.008) Innovation Above Median 0.06* 0.008) Above Median 0.0 0.068) Gdppc 0.54 0.006 0.063 0.098) 0.039) 0.045) Popgrowth.203 0.683.42.345) 0.552) 0.709) Finance 0.7** 0.62*** 0.0*** 0.047) 0.053) 0.036) Government -0.035-0.007-0.08* 0.055) 0.0) 0.0) Unemployment -0.002-0.03*** -0.006* 0.004) 0.004) 0.003) TaxK -0.027*** -0.02*** -0.024*** 0.004) 0.003) 0.002) TaxL 0.020*** -0.007** 0.0*** 0.004) 0.003) 0.003) R 2 0.88 0.99 0.898 Observations 805 772 58 Notes: Variable description is given in Table. Innovation is taken in log and lagged by two years. Panel data OLS regressions with state and year fixed effects. Time span for innovation: 980-2006. Above Median is a dummy variable equal to if the state is above the yearly median in terms of its effectiveness of appropriability mechanisms, based on Cohen et al. 2000). Autocorrelation and heteroskedasticity robust standard errors using the Newey-West variance estimator are presented in parentheses. ***, ** and * respectively indicate 0.0, 0.05 and 0. levels of significance. 3

Dependent variable Table C6: Robustness 2: financial sector and natural resources Log of Top % Income Share ) 2) 3) 4) 5) 6) Measure of innovation Cit5 Cit5 Cit5 Cit5 Cit5 Cit5 Innovation 0.058*** 0.057*** 0.049*** 0.03*** 0.053*** 0.049*** 0.000) 0.000) 0.000) 0.00) 0.000) 0.000) Gdppc -0.07** 0.026 0.067 0.058 0.48*** 0.068 0.04) 0.545) 0.30) 0.63) 0.000) 0.23) Popgrowth.072.07.7.064.266*.6 0.39) 0.2) 0.09) 0.2) 0.053) 0.09) Finance 0.050 0.53*** 0.4*** 0.0*** 0.43*** 0.3*** 0.32) 0.00) 0.00) 0.00) 0.000) 0.002) Government 0.07-0.05-0.02* -0.04-0.05-0.020* 0.40) 0.5) 0.060) 0.88) 0.47) 0.077) Unemployment -0.02*** -0.00*** -0.007** -0.008*** -0.006* -0.007** 0.000) 0.003) 0.04) 0.009) 0.088) 0.043) TaxK -0.035*** -0.03*** -0.036*** -0.037*** -0.029*** -0.036*** 0.000) 0.000) 0.000) 0.000) 0.000) 0.000) TaxL 0.006** 0.003 0.007** 0.008** 0.00*** 0.007** 0.049) 0.344) 0.023) 0.04) 0.00) 0.024) RemunFinance 0.339*** 0.000) EFD 0.66*** 0.000) Mining+Oil 0.036*** 0.000) R 2 0.905 0.896 0.897 0.900 0.903 0.896 Observations 58 457 58 58 58 58 Notes: Variable Mining+oil measure the share of oil related and natural resources extraction activities in GDP, variable RemunFinance measures the compensation per employee in the financial sector and variable EFD measures the financial dependence of innovation. Other variables description is given in Table. Innovation is taken in log and lagged by two years. Column controls for average compensation in the financial sector, column 2 drops NY, CT, DE and MA the state with the largest financial sectors), column 3 removes finance-related patents, column 4 controls for financial dependence in the state as explained in section B., column 5 controls for the size of oil and mining sectors and column 6 removes oil-related patents from the count of citations. Time Span: 976-2008. Panel data OLS regressions with state and year fixed effects. Innovation as well as the top % income share are taken in log. Autocorrelation and heteroskedasticity robust standard errors using the Newey-West variance estimator are presented in parentheses. ***, ** and * respectively indicate 0.0, 0.05 and 0. levels of significance. 32

Table C7: Robustness 2: financial sector and natural resources - IV results Dependent variable Log of Top % Income Share ) 2) 3) 4) 5) 6) Measure of innovation Cit5 Cit5 Cit5 Cit5 Cit5 Cit5 Innovation 0.90*** 0.260*** 0.89** 0.9* 0.55** 0.88** 0.074) 0.096) 0.080) 0.0) 0.070) 0.080) Gdppc -0.245** -0.69-0.079-0.08 0.039-0.075 0.098) 0.) 0.093) 0.098) 0.08) 0.09) popgrowth.594.764.608.687*.667*.667* 0.980).05) 0.978).007) 0.857) 0.970) Finance 0.27** 0.226*** 0.28*** 0.26*** 0.27*** 0.25*** 0.060) 0.068) 0.070) 0.077) 0.062) 0.069) Government -0.028-0.064*** -0.080*** -0.079*** -0.07*** -0.078*** 0.02) 0.024) 0.024) 0.025) 0.02) 0.024) Unemployment -0.06*** -0.09*** -0.02** -0.02** -0.009* -0.02** 0.005) 0.006) 0.005) 0.005) 0.005) 0.005) TaxK -0.037*** -0.034*** -0.039*** -0.039*** -0.032*** -0.039*** 0.005) 0.006) 0.005) 0.005) 0.005) 0.005) TaxL 0.009 0.00 0.06** 0.06** 0.09*** 0.05** 0.006) 0.009) 0.007) 0.007) 0.006) 0.007) Highways -0.80 0.309 0.535 0.53 0.674* 0.505 0.437) 0.523) 0.476) 0.57) 0.385) 0.467) Military 0.00-0.004-0.004-0.004-0.002-0.004 0.007) 0.008) 0.008) 0.008) 0.007) 0.008) RemunFinance 0.420*** 0.049) EFD -0.27 0.509) Mining+Oil 0.040*** 0.007) R 2 0.886 0.838 0.872 0.872 0.892 0.873 F-stat on the excluded 4. 0.7 3.5 0.5 5.0 3.9 instruments Observations 550 426 550 550 550 550 Notes: Variable Mining+oil measure the share of oil related and natural resources extraction activities in GDP, variable RemunFinance measures the compensation per employee in the financial sector and variable EFD measures the financial dependence of innovation. Other variables description is given in Table. Innovation is taken in log and lagged by two years. Column controls for average compensation in the financial sector, column 2 drops NY, CT, DE and MA the state with the largest financial sectors), column 3 removes finance-related patents, column 4 controls for financial dependence in the state as explained in section B., column 5 controls for the size of oil and mining sectors and column 6 removes oil-related patents from the count of citations. Time Span for innovation: 976-2008. Panel data IV 2SLS regressions with state and year fixed effects. Innovation is instrumented by the number of senators that seat on the appropriation committee. The lag between the instrument and the endogenous variable is set to 3 years. Time span for innovation: 979-2006. DC is removed from the sample because it has no senators. Autocorrelation and heteroskedasticity robust standard errors using the Newey-West variance estimator are presented in parentheses. ***, ** and * respectively indicate 0.0, 0.05 and 0. levels of significance. 33

Table C8: Robustness 3: controlling for industry composition Dependent variable Log of Top % Income Share ) 2) 3) 4) 5) Measure of innovation Cit5 Cit5 Cit5 Cit5 Cit5 Innovation 0.035*** 0.049*** 0.048*** 0.049*** 0.047*** 0.00) 0.009) 0.009) 0.009) 0.007) Gdppc 0.087* 0.062 0.064 0.069 0.058 0.045) 0.044) 0.044) 0.043) 0.043) Popgrowth.076.094.088.249*.082 0.697) 0.700) 0.700) 0.702) 0.697) Finance 0.089** 0.0*** 0.09*** 0.3*** 0.2*** 0.036) 0.036) 0.036) 0.044) 0.036) Government -0.02* -0.09* -0.09* -0.023** -0.09* 0.0) 0.0) 0.0) 0.0) 0.0) Unemployment -0.005-0.006** -0.006* -0.007** -0.007** 0.003) 0.003) 0.003) 0.003) 0.003) TaxK -0.037*** -0.039*** -0.039*** -0.037*** -0.038*** 0.004) 0.004) 0.004) 0.004) 0.004) TaxL 0.03** 0.04** 0.04** 0.03** 0.03** 0.006) 0.006) 0.006) 0.006) 0.006) Size of Sector Computer and Electronic 0.50 0.484) Chemistry -0.642*** 0.86) Electrical Component 3.884* 2.002) R 2 0.894 0.896 0.896 0.898 0.897 Observations 58 58 58 578 58 Notes: Variable description is given in Table. Innovation is taken in log and lagged by 2 years. Column excludes patents from the computer sectors NAICS: 334), column 2 excludes patents from the pharmaceutical sectors NAICS: 3254) and column 3 excludes patents from the electrical equipment sectors NAICS: 335), column 4 adds the share of three sectors as additional controls and column 5) excludes citations to patents belonging to three highly exporting sectors: Transportation, Machinery and Electrical Machinery. The size of a sector see column 4) is defined as the share of GDP from the corresponding sector. Panel data OLS regressions with state and year fixed effects. Time span for innovation: 979-2006. Autocorrelation and heteroskedasticity robust standard errors using the Newey-West variance estimator are presented in parentheses. ***, ** and * respectively indicate 0.0, 0.05 and 0. levels of significance. 34

Table C9: Top % income share and innovation for single state inventors Dependent variable Log of Top % Income Share ) 2) 3) 4) 5) 6) Measure of innovation Patents Cit5 Claims Generality Top5 Top Innovation 0.020* 0.040*** 0.025** 0.022** 0.020*** 0.09*** 0.02) 0.009) 0.00) 0.0) 0.005) 0.004) Gdppc 0.099** 0.072 0.087** 0.096** 0.090** 0.096** 0.043) 0.044) 0.044) 0.043) 0.043) 0.043) Popgrowth 0.95.075 0.956 0.947 0.902 0.895 0.65) 0.70) 0.650) 0.649) 0.70) 0.68) Finance 0.075** 0.0*** 0.080** 0.079** 0.092** 0.093*** 0.035) 0.036) 0.035) 0.035) 0.036) 0.036) Government -0.08-0.020* -0.08-0.08-0.06-0.08 0.0) 0.0) 0.0) 0.0) 0.0) 0.0) Unemployment -0.005* -0.006* -0.006* -0.005* -0.006* -0.006* 0.003) 0.003) 0.003) 0.003) 0.003) 0.003) TaxK -0.038*** -0.039*** -0.038*** -0.038*** -0.038*** -0.038*** 0.004) 0.004) 0.004) 0.004) 0.004) 0.004) TaxL 0.07*** 0.04** 0.07*** 0.07*** 0.02** 0.02** 0.006) 0.006) 0.006) 0.006) 0.006) 0.006) R 2 0.889 0.895 0.889 0.889 0.894 0.894 Observations 734 58 734 734 58 58 Notes: This table shows similar results as the one from Table 4 but patents from inventors that have changed its state of residence over the period are removed. All the innovation measures as well as the dependent variable are taken in log. Time span for innovation: 976-2009 columns, 3 and 4) and 976-2008 columns 2, 5 and 6). Variable description is given in Table. Autocorrelation and heteroskedasticity robust standard errors using the Newey-West variance estimator are presented in parentheses. ***, ** and * respectively indicate 0.0, 0.05 and 0. levels of significance. 35

Table C20: Regression of innovation on top % income share using instrument based on Appropriation Committee composition in the Senate. Single state inventors Dependent variable Top % Income Share ) 2) 3) 4) 5) 6) Measure of innovation Patents Cit5 Claims Generality Top5 Top Innovation 0.22** 0.67** 0.72** 0.25** 0.6** 0.04** 0.097) 0.068) 0.076) 0.00) 0.049) 0.043) Gdppc -0.096-0.060-0.6-0.9-0.06 0.037 0.05) 0.085) 0.0) 0.6) 0.07) 0.054) Popgrowth.987**.689* 2.0** 2.204** 0.778 0.723 0.966) 0.968) 0.95).059) 0.92) 0.744) Finance 0.76*** 0.96*** 0.78*** 0.207*** 0.80*** 0.8*** 0.060) 0.06) 0.059) 0.072) 0.059) 0.058) Government -0.097*** -0.079*** -0.090*** -0.*** -0.042-0.055** 0.025) 0.024) 0.024) 0.028) 0.029) 0.026) Unemployment -0.02** -0.00** -0.0** -0.02** -0.0** -0.00** 0.005) 0.005) 0.005) 0.005) 0.005) 0.004) TaxK -0.042*** -0.040*** -0.040*** -0.04*** -0.037*** -0.037*** 0.006) 0.005) 0.005) 0.005) 0.005) 0.005) TaxL 0.023*** 0.06** 0.02*** 0.023*** 0.009 0.00 0.008) 0.007) 0.007) 0.008) 0.007) 0.006) Highways 0.304 0.359 0.328 0.424 0.468 0.453 0.46) 0.408) 0.42) 0.440) 0.439) 0.404) Military -0.004-0.005-0.006-0.007-0.006-0.002 0.007) 0.007) 0.007) 0.008) 0.007) 0.008) R 2 0.866 0.878 0.87 0.858 0.863 0.866 F-stat on the excluded 7.6 8.2 2.6 3.5 7.3 6.7 instruments Observations 700 550 700 700 550 550 Notes: This table shows similar results as the one from Table but patents from inventors that have changed its state of residence over the period are removed. Time span for innovation: 976-2009 columns, 3 and 4) and 976-2006 columns 2, 5 and 6). Autocorrelation and heteroskedasticity robust standard errors using the Newey-West variance estimator are presented in parentheses. ***, ** and * respectively indicate 0.0, 0.05 and 0. levels of significance. 36

Table C2: Robustness 4: controlling for agglomeration effect - OLS and IV results. Dependent variable Log of Top % Income Share ) 2) 3) 4) 5) 6) Measure of innovation Cit5 Cit5 Cit5 Cit5 Cit5 Cit5 Innovation 0.046*** 0.047*** 0.046*** 0.96** 0.94** 0.95** 0.008) 0.008) 0.008) 0.076) 0.076) 0.076) Gdppc 0.048 0.046 0.044-0.098-0.097-0.00 0.044) 0.044) 0.044) 0.092) 0.092) 0.092) Popgrowth.64.62.63.709*.706*.70* 0.73) 0.74) 0.72).004).002).002) Finance 0.098*** 0.02*** 0.0*** 0.24*** 0.25*** 0.26*** 0.035) 0.035) 0.035) 0.067) 0.067) 0.067) Government -0.055*** -0.054*** -0.055*** -0.072*** -0.073*** -0.073*** 0.0) 0.0) 0.0) 0.023) 0.023) 0.023) Unemployment -0.008** -0.008** -0.008** -0.04*** -0.03*** -0.04*** 0.003) 0.003) 0.003) 0.005) 0.005) 0.005) TaxK -0.037*** -0.037*** -0.037*** -0.040*** -0.040*** -0.040*** 0.004) 0.004) 0.004) 0.005) 0.005) 0.005) TaxL 0.06*** 0.06*** 0.06*** 0.020*** 0.020*** 0.020*** 0.006) 0.006) 0.006) 0.007) 0.007) 0.007) Agglomeration 0.90*** 0.79*** 0.83*** 0.53*** 0.45*** 0.46*** 0.09) 0.08) 0.08) 0.02) 0.02) 0.022) Highways 0.343 0.346 0.34 0.480) 0.476) 0.480) Military -0.003-0.003-0.003 0.008) 0.007) 0.008) R 2 0.903 0.902 0.903 0.873 0.874 0.873 F-stat on the excluded 4.4 4.3 4.3 instruments Observations 58 58 58 550 550 550 Notes: Variable description is given in Table. Innovation is taken in log and lagged by 2 years. We look at the effect of agglomeration as captured by the variable Agglo. Agglo is the log of the number of firms in the most columns and 4), the two most columns 2 and 5), and the three most columns 3 and 6) innovative sectors for each state and year. Time span: 976-2008. Variable description is given in Table. Panel data OLS columns to 3) and IV 2SLS columns 4 to 6) regressions with state and year fixed effects. DC is removed from the sample in columns 4, 5 and 6 because it has no senators. Innovation is instrumented by the number of senators that seat on the appropriation committee. The lag between the instrument and the endogenous variable is set to 3 years. t/z statistics in parentheses, computed with autocorrelation and heteroskedasticity robust standard errors using the Newey-West variance estimator. ***, ** and * respectively indicate 0.0, 0.05 and 0. levels of significance. 37

Figure C: Distribution of the lag between patent s application and patent s grant D Details on the calibration In this section we explain how the 6 moments in the data determine the 6 parameters of the model θ I, θ E, φ, L,, ). For convenience, we introduce the following notations: t = M denotes the top % income share, R = M 2 denotes the ratio of entrant to incumbent innovations, e = M 3 denotes the elasticity of the top % share with respect to innovation, η = M 4 denotes the average mark-up, E = M 5 denotes the entrants employment share and g = M 6 denotes the growth rate. By definition the innovation ratio obeys x E = Rx I,the growth rate of the economy is given by: φx I + x E ) ln = g, A2) and the average mark-up by: µ + µ) = η. A22) Provided that µ/ + L) < /00 < / + L) which ex-post ends up being the relevant range), 2) implies that the top % share obeys: µ ) + + L ) = t, A23) 00 and, using 3), the semi-elasticity of the top % share with respect to innovation is given by: µ ) = et. A24) The semi-elasticity of innovation with respect to the top % share depends on the innovation rate µ and on the extra-profit share made by innovating entrepreneurs, namely / /. 38

Finally, using that labor costs equal revenues divided by the mark-up, we obtain that the entrant employment share is given by: E = x E µ + µ). A25) In the above equations, we have that µ = x I + x E since z = 0), x I is given by 9) and, using ), x E is given by: ) 2 L + L θ I x E = θ E L ). A26) Assume that one knows µ, and. Then one obtains x E = R µ and x +R I = Given the innovation rates and the innovation step-size, the rate of productive incumbent innovations, φ, is identified by the measured growth rate in the economy, following A2). Given the extra profits made by innovative entrepreneurs µ )), the top % share in A23) determines the number of non-innovative entrepreneurs that are in the top % share and thereby identifies the ratio of workers to entrepreneurs L. Given x I, x E, and, the R&D parameters θ I and then θ E are directly given by 9) and A26). In return, the innovation rate µ and the mark-ups and are determined solely by 3 equations. Equation A22) gives the average mark-up. Equation A24) gives a relationship between the innovation rate and the difference in inverse mark-ups. A25) can be rewritten as R + R µ = E µ + µ) ). A27) +R µ. Given the harmonic average of mark-ups, a high innovation rate increases the entrant employment share but a high innovation step reduces it. In fact, we can go a bit further, combining A24) and A22), we get: = η/ et + ). Using this equation and A24), in A27), we get: ) ) etηh + ηh et + ) et = R et η η + R E A28) A29) The left-hand side is an increasing function of for η and for = η, it is equal to et, which is lower than the right-hand side since R > E. Therefore this equation identifies +R uniquely. And everything else equal, a higher E and a higher R lead to a lower. It is then easy to obtain through A28), where a higher η and lower et and lead to a higher. Finally given and, the innovation rate is determined by A24): a higher semi-elasticity of innovation et, leads to a higher innovation rate e. All parameters can then be identified as explained above and it can be checked that we are indeed in the case where µ/ + L) < /00 < / + L). 39

Calibrating the shocks for the regression. We assume that the shocks to innovation are normally distributed with ε θ,i,t N 0.72 2 /2, 0.72 2 ) and ε θ,i N 0.62 2 /2, 0.62 2 ) this implies that the R&D shock to x I,i,t = ) θ I,i,t,i,t has mean 0). The shocks to non-innovator mark-ups are uniformly distributed with ε η,i,t U 0.6g, 0.6g) and ε η,i,t U 0.4g, 0.4g) where g = min, ) so that we always have >,i,t > ). With the different shocks it is possible that x I,i,t or x E,i,t are greater than which, within the model, makes no sense, therefore we censor the two variables at. The measurement error shocks on top income inequality are normally distributed with ε δ,t N 0.325 2 /2, 0.325 2 ), ε δ,i,t N 0.07 2 /2, 0.07 2 ) and ε δ,i = 0 in the specific regressions we report on. The measurement error to innovation ε µ,i,t is also normally distributed with ε µ,i,t N 0.47 2 /2, 0.47 2 ). The standard deviations are chosen so that the simulated data an average of 500 draws) are in line with the actual data regarding the empirical moments reported in Table D below. 5 Table D: Simulation results Moment Empirics Simulation Affected by the s.d. of: s.d. of the state fixed effects 0.73 0.63 ε δ,i, ε θ,i and ε η,i s.d. of the year fixed effects 0.325 0.39 ε δ,t s.d. of log Cit5.348.07 ε θ,i,t, ε θ,i, ε η,i,t, ε η,i and ε µ.i,t s.d.of log Cit5 controlling for 0.744 0.889 ε θ,i,t, ε η,i,t and ε µ.i,t state fixed effects s.d. of predicted log Cit5.057 0.959 ε θ,i,t, ε θ,i, and ε η,i s.d.of predicted log Cit5 controlling 0.702 0.697 ε θ,i,t for state fixed effects s.d. of top income inequality 0.046 0.056 all except ε µ,i,t OLS coefficient 0.049 0.05 with the s.d. of ε θ,i,t with the s.d. of ε η,i,t and ε µ,i,t Notes: OLS coefficient and standard deviation of various variables in the data and on average in 500 draws of our simulated data. This shows that realistic shocks in terms of the deviations observed in the data can reproduce the gap between the OLS and IV coefficient. The major deviation in the table above is that in our simulations, the standard deviation of the unpredicted) innovation variable log Cit5) is too large when one controls for state fixed effects and too small when one does not. This could be corrected for instance by introducing state-specific measurement errors on innovation. 5 We do not try to identify what the standard deviations of the shocks are and therefore did not try to choose standard deviations so as to minimize the distance between empirical moments and simulated moments here. 40

Figure D: Distribution of the IV coefficients Notes: This figure plots the whole distribution of the IV coefficient of innovation obtained with 000 draws. Average value is 0.83 and standard deviation is 0.078. E Computing top income shares at the CZ level To compute top income shares at the CZ level, we need to estimate the expected income for individuals whose income is censored. To do so we expend on the methodology of Clemens et al. 207) see also Armour et al., 206). The census reports separately individuals labor income, capital income and business income and each income source is censored separately. We assume that above a certain total income level x, income is Pareto distributed. Denote by l, c and b the levels above which the census data are censored. Then for an individual i, denote by l i her labor income, c i her capital income, and b i her business income as reported in the data with l i = l if labor income is censored and similarly b i = b or c i = c if another source of income is censored. Denote x i the true total income of individual i. Then there are two cases. First, if l i < l, c i < c and b i < b, then we know that her total income is x i = l i + b i + c i. Conditional on having x i >x, the conditional probability density function of observing an income x i is given by αx α /x α+ i where α is the shape parameter of the Pareto distribution of income. Denote by N unc the corresponding set of observations for which no information is censored and where x i >x, and by N unc the cardinal of that set. On the other hand, if one observation or more are censored, then we only know that her total income x i x i l i + b i + c i. Conditional on having x i x, then the probability of observing x i x i is given by x/x i ) α. Denote by N cens the corresponding set of observations for which at least one source of income is censored and with x i x we choose x low enough so that this is always the case when an observation is censored). We can then write the likelihood function as P = i N unc α ) α x x i 4 x i i N cens ) α x. x i