[go: up one dir, main page]

Academia.eduAcademia.edu
DEPARTMENT OF ECONOMICS WORKING PAPER 2005 Department of Economics Tufts University Medford, MA 02155 (617) 627 – 3560 http://ase.tufts.edu/econ How Fiscal Decentralization Flattens Progressive Taxes Roland Hodler‡ Harvard University Kurt Schmidheiny§ Tufts University February 2005 Abstract We study the tension between fiscal decentralization and progressive taxation. We present a multi-community model in which the local income tax rate is determined by an exogenous progressive tax schedule and a tax shifter that can differ across communities. The progressivity of the tax schedule induces a self-sorting process that results in substantial though imperfect income sorting. Rich households are more likely to locate themselves in low tax communities than poor households. The actual tax structure is thus less progressive than the exogenous tax schedule. To investigate the quantitative implications of our model, we calibrate a fully-specified version to the largest metropolitan area in Switzerland. The equilibrium values of the simulation show the same pattern across communities as we observe in this area. The theoretical result is challenged by estimating the actual tax structure faced by the households in this area. We find that the actual tax structure is indeed substantially less progressive than the fixed tax schedule. Key words: Progressive Taxation, Fiscal Decentralization, Income Segregation ‡ Department of Economics, email: rhodler@fas.harvard.edu. Department of Economics, Braker Hall, Medford, kurt.schmidheiny@tufts.edu. § 1 MA 02155, email: 1 Introduction In this paper, we study the tension between fiscal decentralization and progressive taxation. We investigate to what extent fiscal decentralization reduces the progressivity of a common tax schedule in a federation in which the level of the tax rates can differ across communities. We show that progressive taxation and fiscal decentralization lead to income sorting, which undermines the progressivity of the tax schedule. We base our analysis on a multi-community model, in which the income tax rate is determined by an exogenous progressive tax schedule and a local tax shifter that can differ across communities. Local tax revenue is used to finance a local public good. In the basic version of the model, the mobile individual households differ only in their incomes. In equilibrium, no household wants to move, local housing markets clear and the communities’ budgets are balanced. It follows that the local tax shifters must be higher in communities in which housing prices are lower. The progressivity of the tax schedule then induces a self-sorting process that results in perfect income sorting. Poor households locate themselves in high tax communities while rich households locate themselves in low tax communities. Different from most of the previous literature,1 sorting is a direct result of the progressive tax schedule and does not require strong assumptions on the preferences for either public goods or housing. The spatial segregation of the population by incomes implies that the actual tax structure must be less progressive than the exogenous tax schedule if progressive at all. In an extension, we assume that the individual households differ not only in their incomes, but also in their preferences for housing. Since each house1 See e.g. Ellickson (1971), Westhoff (1977), Epple and Romer (1991) and the literature surveyed in Ross and Yinger (1999). 2 hold’s location choice depends now on its income and its preferences, the income sorting is imperfect in equilibrium: households with the same income are found in different communities, though rich households are still more likely to locate in low tax communities than poor households. To investigate the quantitative implications of this model, we calibrate a fully-specified version to the Zurich area, the largest Swiss metropolitan area. Swiss metropolitan areas offer an excellent laboratory for the analysis of fiscal decentralization. In Switzerland, each community can individually set the level of income taxes by a local tax shifter while the cantons (states) fix the (progressive) schedule of income taxes. The equilibrium values of this simulation show the same pattern across communities as we observe in the Zurich area: A high tax shifter and low average incomes in the center, and low tax shifters and high average incomes in the fringe. We then use data on the spatial distribution of incomes in the Zurich area to estimate the actual tax structure, i.e. average tax rates as a function of income as faced by the households in this area. We find that the actual tax structure is substantially less progressive than the tax schedule implemented by the canton because rich households are more likely to live in low tax communities than poor households. This finding is in line with the predictions of our theoretical model. This paper is probably most closely related to Feldstein and Wrobel (1998), Epple and Platt (1998) and Schmidheiny (2004). Feldstein and Wrobel show that a shift in a single US state’s tax progressivity has no redistributive effects since migration leads to an adjustment of the net wages and the employment structure. Complementary, we show that migration undermines the redistributive effect of progressive taxation in presence of fiscal decentralization even if wages do not adjust. Our theoretical model shares the formal 3 structure with Epple and Platt. Schmidheiny shares the location choice part of our model and shows empirically that rich households are more likely to move to low tax communities than poor households. It is a well known normative principle of the literature on fiscal federalism that income redistribution should be centralized.2 Our paper relates to this literature as it shows that fiscal decentralization undermines the redistributive effect of progressive taxes. The paper is organized as follows: Section 2 briefly informs about fiscal decentralization and progressive taxation in Switzerland. Section 3 presents the theoretical model and some results concerning the agents’ location choice. It further proves that an (asymmetric) equilibrium exists. Section 4 presents the simulation of a fully-specified version of our model, which is calibrated to the Zurich metropolitan area. Section 5 estimates the actual tax structure faced by the households in this area. Section 6 concludes. 2 Fiscal Decentralization and Progressive Taxation in Switzerland Switzerland is an exemplary federal fiscal system. The Swiss federation comprises 26 states, the so-called cantons. The cantons are divided into roughly 3000 municipalities of varying size and population. All three state levels finance their expenditures essentially by their own taxes and fees. The total tax revenue of all three levels was 93 billion CHF in 2001, of which 46% is imposed by the federation, 32% by the cantons and 22% by the municipalities.3 While the federal government is mainly financed by indirect taxes (61% of 2 See Musgrave (1959) and Oates (1972). For a recent survey of the literature on fiscal federalism see Oates (1999). 3 All figures in this paragraph apply to 2001. Source: Swiss Federal Tax Administration (2002), Öffentliche Finanzen der Schweiz 2001, Neuchâtel: Swiss Federal Statistical Office. 4 federal tax revenue) such as the VAT, the cantons and municipalities largely rely on direct taxes. Income taxes account for 60% of cantonal and 84% of municipal tax revenue. The cantons organize their tax systems autonomously. For example, they decide upon the level of income and corporate taxes and the degree of tax progression. The individual municipalities in turn can generally set a tax shifter for income and corporate taxes. The municipal tax is then the cantonal tax rate multiplied by the municipal tax shifter. Federal and cantonal systems of fiscal equalization limit the tax differences across cantons and across municipalities within the same canton to some extent, but still leave room for considerable variation. The above outlined federal system leads to ample differences of income taxes across Swiss municipalities. For example, for a two-child family with a gross income of 80,000 Swiss francs (CHF) combined cantonal and municipal income taxes ranged from 3.6% to 11.3% in the year 1997. The federal income tax for this household was 0.7%. With an income of 500,000 CHF a two-child family faced much higher tax rates due to the progressivity of the tax schedules. Combined cantonal and municipal income taxes ranged from 10.9% to 28.7% for this household and its federal income tax was 9.4%. Within metropolitan areas the (municipality) tax differences are smaller but still differ by a factor of 1.5 in e.g. the Zurich area. 3 The Model In this section, we introduce and solve the model. After presenting the general setting, we characterize the preferences and derive the resulting allocation of households across distinct communities. We then prove the existence of an asymmetric equilibrium. Finally, we introduce heterogeneity in the 5 preferences and discuss how this affects our results. 3.1 The Setting Given is a metropolitan area with J communities. This area is populated by a continuum of households, which differ in their income y ∈ [y, y]. Income follows a distribution function f (y) > 0. There are three goods in the economy: private consumption b, housing h and a local public good g. The housing h is provided by absentee landlords, and the housing market is competitive. Hence, the price for housing pi equates the housing supply HSi with the aggregate housing demand HDi . We assume that the housing supply HSi = HS(Li , pi , ) is a non-decreasing function of the land area Li and the price pi . Each community i spends the amount ni g to provide the local public good g, where ni is the measure of households living in community i. The communities levy income taxes to finance the public good. In each community i, the tax rate consists of two parts, a local tax shifter ti and a progressive tax rate structure r(y). We assume r(y) continuous and increasing in y, r(y) > 0, the average tax rate t·r(y) ∈ [0, 1) and the marginal tax rate t[r +yr′ (y)] ∈ [0, 1). The quantity of the local public good g and the tax rate structure r(y) are both exogenous (to the communities) and identical across communities. In each community i, the tax shifter ti is then determined by budget balance. Each household can move costlessly and chooses the community maximizing its utility as place of residence. 3.2 Preferences and Location Choice The preferences of the households are described by the utility function U (h, b) , 6 (1) where h is the consumption of housing and b the consumption of the private good. We assume the utility function to be strictly increasing, strictly quasiconcave, twice continuously differentiable in h and b and homothetic.4 Households face the budget constraint (omitting community indices) ph + b ≤ yd = y[1 − t · r(y)] , (2) where p is the price of housing; the price of the private good is set to unity. Disposable income yd depends on the local tax shifter t and the tax rate structure r(y). Maximization of the utility function (1) with respect to h and b subject to constraint (2) yields housing demand h∗ = h(p, yd ) = h(t, p, y), demand for the private good b∗ = y(1 − t) − ph(t, p, y), and indirect utility V (t, p, y) = U (h∗ , b∗ ) . (3) For later use note that V is continuous in t, p and y. We assume that the elasticity of housing with respect to the disposable income is smaller or equal to unity, i.e., εh,yd := ∂h∗ yd ≤ 1 for all yd and p. ∂yd h∗ (4) Next, we present two properties of the households’ indifference curves that will lead to segregation of the population by incomes: Property 1 dt M (t, p, y) := dp dV =0 h∗ ∂V /∂p =− <0 =− ∂V /∂t y · r(y) 4 Since the local public good g is constant across communities and not of primary interest for our considerations, we assume for simplicity that it does not enter the utility function. Equivalently, we could assume that it enters separably. 7 Property 1 follows from the strictly increasing utility function after applying the implicit function theorem and the envelope theorem. It implies that a household can be made indifferent towards an increase in the tax shifter t when it is compensated by decreased housing prices p, and vice versa. Property 2 ∂M ∂h∗ yd ∂yd y ∂r(y) h∗ h∗ = [1 − + > 0 for all y, t and p. ] ∂y ∂yd h∗ ∂y yd y 2 r(y) ∂y y 2 r2 (y) Proof: By assumption, (∂h∗ /∂yd )(yd /h∗ ) ≤ 1. Our assumptions about the bounds of the average and the marginal tax rate guarantee (∂yd /∂y)(y/yd ) = [1 − tr − tyr′ (y)]/[1 − tr(y)] ∈ [0, 1). The assumption that r(y) increases in y, implying ∂r(y)/∂y > 0, concludes the proof. ✷ Property 2 implies that the decrease in housing prices p which compensates a household for a higher tax shifter t has to be larger for poor households than for rich ones. Given a set of community characteristics, (pi , ti ) for i = 1..J, a household prefers community i if and only if V (pi , ti , y) ≥ V (pj , tj , y) for all j 6= i . (5) From this, the following proposition directly follows: Proposition 1 (Order of community characteristics) If any two populated communities differ in their characteristics (pi , ti ), then the community with the higher housing prices pi must impose a lower tax shifter ti . Proof: Suppose the opposite, i.e., that the housing prices pi and the tax shifter ti are both higher in one community. In this case, no household would 8 choose to live in this community (for the same reason that leads to property 1). This is a contradiction. ✷ In the remaining part of this section, we show how households allocate themselves across distinct communities. Distinct communities differ in both tax shifters and prices. Note that our model allows for groups of communities with identical community characteristics (ti , pi ). Such groups appear as one community in our notation. Lemma 1 (Boundary indifference) There is a ‘border’ household between any two communities i and j that is indifferent between these two communities. That is, if a household with income y ′ prefers to live in i and another household with income y ′′ > y ′ prefers to live in j, then there exists a household with income ŷij = ŷ(pi , ti , pj , tj ), y ′ ≤ ŷij ≤ y ′′ , such that V (pi , ti , ŷij ) = V (pj , tj , ŷij ). Proof: Let Vi (y) := V (pi , ti , y) be a household’s utility in i and Vj (y) := V (pj , tj , y) in j. The household with income y ′ prefers i to j, hence Vi (y ′ ) − Vj (y ′ ) ≥ 0. The opposite is true for a household with income y ′′ : Vi (y ′′ ) − Vj (y ′′ ) ≤ 0. From the continuity of V in y follows the continuity of Vi (y) − Vj (y) in y. The intermediate value theorem proves that there is at least one ŷ between y ′ and y ′′ such that Vi (ŷ) − Vj (ŷ) = 0. ✷ Lemma 2 (Two-community income segregation) Given two populated communities i and j with distinct characteristics (ti , pi ) 6= (tj , pj ), where ti < tj , then any household in i is richer than any household in j. That is, if a household with income ŷ is indifferent between i and j, then any household y ′ < ŷ strictly prefers j and any household y ′′ > ŷ strictly prefers i. 9 t tj y' ' ti ŷ y' pj pi p Figure 1: Indifference curves in the (t, p) space Proof: The proof uses figure 1, which shows the indifference curves in the (t, p)-space for three different income levels y ′ < ŷ < y ′′ . The indifference curves represent all (t, p) combinations that households consider as good as community j’s (pj , tj )-pair. Each household prefers pairs south-west of its indifference curve. It follows from property 1 that the indifference curves decrease in the (t, p)-space and from property 2 that they become flatter as income rises. Imagine now a community i, characterized by ti < tj and pi > pj , where household ŷ is indifferent to j. All poorer households, e.g. y ′ , prefer j to i and all richer households, e.g. y ′′ , prefer i to j. ✷ Proposition 2 (Multi-community income segregation) Given J populated communities with distinct characteristics (ti , pi ), then it holds for any two communities i and j with ti < tj that any household in i is richer than any household in j. Proof: The proposition implies that [y, y] must be partitioned into J non-empty and non-overlapping intervals. Suppose the opposite, i.e., y ′ as 10 well as y ′′ prefer community i, but y ′′′ , y ′ < y ′′′ < y ′′ , strictly prefers another community j. Then it follows from lemma 1 that there is a ŷij , y ′ ≤ ŷij < y ′′′ . Lemma 2 implies that y ′′ > yˆij strictly prefers j to i, which is a contradiction. ✷ 3.3 Equilibrium In this section, we prove that an asymmetric equilibrium exists. That is, we show that an allocation in which communities exhibit different characteristics (pi , ti ) can be an equilibrium. An equilibrium requires that each household is located in the community that maximizes its utility, that each household maximizes its utility within the given community, that the housing market clears in each community, that each community has a balanced public budget and that each community has a positive population. There always exists a symmetric equilibrium in which all communities have identical characteristics (pi , ti ) and in which the households allocate themselves such that all communities show the same income distribution.5 However, we are interested in the case in which at least some communities differ in their characteristics (pi , ti ). We therefore show that an asymmetric equilibrium, i.e., an equilibrium in which (pi , ti ) 6= (pj , tj ) for some i and j, exists too. For simplicity, we focus thereby on the case of two distinct communities, 1 and 2. We assume with no loss of generality that t1 > t2 . Hence, any household in 2 must be richer than any household in 1, as lemma 2 implies. We define ∆V (ŷ) = V1 (p1 (ŷ), t1 (ŷ), ŷ) − V2 (p2 (ŷ), t2 (ŷ), ŷ), 5 (6) Other equilibria in which all communities have identical characteristics (pi , ti ) might exist as well. 11 where pi (ŷ) and ti (ŷ) are the equilibrium housing price and the equilibrium tax shifter, respectively, in i given that households with y < ŷ live in 1 and households with y > ŷ in 2. Hence, Vi (pi (ŷ), ti (ŷ), ŷ) is the indirect utility of a household with ŷ in i given this allocation of households. In addition, we assume:6 (i) The housing supply HS(Li , pi ) satisfies HS(Li , 0) = Li > 0 for i = 1, 2. (ii) The minimum income y > g. (iii) If hi → ∞, bi > 0, hj < ∞ and bj < ∞, then U (hi , bi ) > U (hj , bj ). Proposition 3 (Existence of an asymmetric equilibrium) There exists an equilibrium in which the communities 1 and 2 exhibit different characteristics, i.e. t1 > t2 and p1 < p2 . Proof: We prove proposition 3 by showing (1) that ∆V (ŷ) is continuous and (2) that ∆V (ŷ) > 0 as ŷ → y and that ∆V (ŷ) < 0 as ŷ → y. It follows then from the intermediate value theorem that there is at least one ŷ, y < ŷ < y, such that ∆V (ŷ) = 0. This implies - from the definition of ∆V that the border household ŷ is indifferent between the two communities, the prices p1 and p2 clear the local housing markets and the tax shifters t1 and t2 balance the community budgets. (1) The equilibrium housing price pi is determined by HS(Li , pi ) = HDi . It follows from lemma 2 that HDi = Z yi h(pi , ti ; y)f (y)dx, (7) yi where y i and y i are the highest and lowest incomes in community i. The hereby implicitly defined pi is continuous in y i and y i given continuity of 6 As it will become evident in section 4, these assumptions are sufficient, but not necessary for the existence of an asymmetric equilibrium. 12 HS(·), h(·) and f (·). The balanced budget requirement and lemma 2 imply that the equilibrium tax shifter in community i is ni g ti = R yi r(y)f (y)dx yi where ni = Z , (8) yi f (y)dx. (9) yi Given continuity of r(·) and f (·), ti is continuous in y i and y i . Since the indirect utility Vi is continuous in pi , ti and y and since pi and ti are continuous in y, ∆V (ŷ) is continuous in ŷ. (2) If follows from equations (7) and (9) that HD1 → 0 and n1 → 0 as ŷ → y. Since assumption (i) guarantees that HS(L1 , 0) = L1 > 0 (and since ∂HS(Li , pi )/∂pi ≥ 0), it holds that h∗ (p1 , t1 ; y) → ∞ and p1 → 0 as ŷ → y. Hence, b∗ → y − g > 0, where the strict inequality follows from assumption (ii). Assumption (iii) then guarantees that ∆V (ŷ) > 0 as ŷ → y. Analogously, it can be shown that ∆V (ŷ) < 0 as ŷ → y.7 ✷ 3.4 Adding Heterogeneous Preferences So far, we have assumed that households differ only in their incomes y. In this section, we extend the model by assuming that households differ in their preferences as well. The household preferences are now represented by the utility function U (h, b; α), where the parameter α describes the taste for housing. The higher α, the more a household is, ceteris paribus, willing to spend on housing. Hence, the housing demand increase in α, i.e. ∂h∗ ∂h(t, p; y, α) = > 0 for all t, p, y and α. ∂α ∂α 7 The only difference is that b∗ → y − g, which exceeds y − g. 13 (10) Housing taste (α) 1 2 ... j ... J Income (y) Figure 2: Simultaneous income and preference segregation. The areas denoted by j = 1, ..., J show the attributes of the households that prefer community j. Income and preferences are jointly distributed according to the density function f (y, α). It follows that income segregation holds, but only within the subpopulation of households with identical preferences. Preference segregation occurs as well: That is, among the subpopulation of households with the same income y, households with a high α, i.e. a strong taste for housing, tend to allocate themselves to communities with higher tax shifters ti than households with a low α. Simultaneous heterogeneity by incomes and tastes leads to a more realistic pattern of household segregation. Although income groups tend to gather, the segregation is not perfect. Figure 2 shows the resulting allocation of household types to communities. The households on the borders are indifferent between neighboring communities j. Community 1 with the lowest housing prices is populated by the poorest households with strong taste for housing, while the richest households with low housing taste are situated in community J with the lowest tax rate and the highest housing price. 14 However, rich households with strong taste for housing prefer lower-priced communities and poor households with weak taste for housing group with relatively rich households in the lower-tax communities. 4 A Specified Version of the Model To investigate the qualitative and quantitative properties of the model we construct a fully specified example in this section. The specification is kept as simple as possible but still captures all mechanisms of the model. The example is calibrated to the Zurich area, the largest Swiss metropolitan area. The common tax schedule is taken from Young (1990) r(y) = r0 [1 − (1 + r2 y r1 )−1/r1 ] . with parameters r0 > 0, r1 > 0 and r2 > 0. The average local tax rate t r(y) and the local marginal tax rate t[y ∂r(y)/∂y + r(y)] is increasing in income y. The marginal tax rate is always above the marginal tax rate; both asymptotically approach a maximum t r0 . Household preferences are described by a Cobb-Douglas utility function: U = hα b1−α , where 0 < α < 1 stands for the taste parameter of the general model. Utility function and tax schedule satisfy our properties 1 and 2. The locus of indifferent households between two communities i and j for any given taste α is (" ŷij = 1− pαj − pαi r0 pαj ti − pαi tj We adopt the housing supply function  !−r1 HSi = Li (pi )θ 15 # 1 −1 r2 )1/r1 . Figure 3: Center and periphery in the metropolitan area of Zurich. from Epple and Romer (1991).8 We calibrate the above outlined model to the metropolitan area of Zurich in Switzerland. The area around the city of Zurich forms the biggest Swiss metropolitan area. The city of Zurich has about 330 thousand inhabitants and is the capital of the canton (state) of Zurich. The canton of Zurich counts 1.2 million inhabitants in 171 individual communities. As described in section 2, each of these communities can choose its own tax shifter. The analysis is restricted to the city of Zurich and a ring of the most integrated communities around the center. This ring is formed by all communities in the canton of Zurich with more than 1/3 of the working population commuting to the center.9 Figure 3 shows a map with the city of Zurich and the thus defined ring of 40 communities. This agglomeration is 8 Epple and Romer derive this housing supply function from an explicit production function, where 0 ≤ θ ≤ 1 is the ratio of non-land to land input. 9 The number of commuters to the city of Zurich and the size of the working population in the communities is based on the 1990 Census. This definition of the urban area is chosen to justify the model’s assumption that households income is exogenous, i.e. that they choose their place of residence independent of where they work. It results in a set of communities closest to the central business district. 16 Table 1: Equilibrium values of the specified model. homogeneous preferences harmonized center periphery L: area 1 0.4 0.6 p: rent 11.7 6.6 13.1 t: tax shifter 1 5.17 0.91 n: inhabitants 1 0.12 0.88 Ey: mean income 78’547 30’771 85’010 heterogeneous preferences center periphery 0.4 0.6 9.9 12.5 2.30 0.87 0.23 0.77 47’755 87’703 The calibrated model parameters: g = 5000, E(ln y) = 11.1, SD(ln y) = 0.55, ymin = 23, 000, ymax = 500, 000, E(α) = 0.25, S.A.(α) = 0 (homogeneous preferences), S.A.(α) = 0.11 (heterogeneous preferences), θ = 3, r0 = 0.132, r1 = 1 and r2 = 0.00001. modelled as two distinct jurisdictions, which we call center and periphery. The details of the calibration are described in the appendix. The parameters are summarized at the bottom of table 1. 4.1 Simulated Equilibrium The equilibrium values pi and ti in both communities satisfy equations (7) and (8) and guarantee that no households wants to move. As there is no closed form solution to this nonlinear system of four equations and four unknowns, we solve numerically for the equilibrium values of the model.10 Table 1 shows in column 2 and 3 the equilibrium values for the case of homogeneous tastes. There are large differences in both taxes and prices between the two communities. The tax rate in the center community is almost 10 The aggregation of individual behavior requires double integrals over the community population. These integrals cannot be calculated analytically. We use Gauss-Legendre Quadrature with 40 nodes in each dimension to approximate the various double integrals. We numerically solve for the equilibrium values by minimizing the sum of squared deviations from the equilibrium conditions with the Gauss-Newton method. 17 0.8 households prefer center 0.6 indifferent households → 0.4 households prefer periphery 0.2 0 10.5 11 11.5 12 log income 12.5 ← center population density housing taste (α) 1 ← periphery ← whole population 10.5 13 11 11.5 12 12.5 log income 13 Figure 4: Income and taste segregation in equilibrium. The left figure shows the preferred community for all household types. The right figure shows the resulting income distributions in both communities. six times higher than in the periphery while housing prices in the center are halve the ones in the periphery.11 Households are perfectly segregated: All households in the high-tax center are poorer than all households in the lowtax periphery. The mean income in the periphery is therefore almost three times the mean income in the center. Column 1 in table 1 gives the equilibrium values for the hypothetical case that the two communities merged or harmonized their taxes. These predictions of segregation and the implied differences of community characteristics are extreme. The consideration of heterogeneous housing tastes leads to a more realistic situation. Table 1 shows in column 4 and 5 the equilibrium values allowing for heterogeneous tastes. The differences between the two communities are still substantial but smaller than with homogeneous tastes: The center exhibits now 2.5 times higher taxes and 20 % lower housing prices. The left graph in figure 4 shows the segregation 11 The labels ‘center’ and ‘periphery’ are arbitrary. There is always a second equilibrium with lower taxes in the center. 18 Local income tax shifter (1997) Share of households with income above CHF 75’000 (1997/98) 131.00 0.56 125.58 0.52 0.40 101.00 0.28 0.25 85.98 85.00 Figure 5: Taxes and incomes in the Zurich metropolitan area. pattern in the income-taste space. The population is now imperfectly sorted by incomes: While it is still true that more rich households are found in the low-tax periphery, rich households with a strong taste for housing prefer the low-price high-tax center and poor households with a low taste for housing prefer the periphery. The right graph in figure 4 shows the resulting income distributions in the two communities. The mean income in the center is about half the one in the periphery. Figure 5 shows the actual local tax levels and the spatial income distribution in the calibrated area. The left map visualizes the considerable tax differentials in the Zurich area. The right map demonstrates the striking relationship between income taxation and spatial income distribution: the local share of rich households is almost an inverted picture of the local tax levels.12 Our simple two-community model captures this empirical pattern 12 Data from the following sources: Commuter: Swiss Federal Statistical Office, Census 1990. Tax rates: Statistisches Amt des Kantons Zürich, Steuerfüsse 1997. Income distribution: Swiss Federal Tax Administration. Considered are all communities where more than 1/3 of the working population is commuting to the center community. 19 homogeneous tastes heterogenous tastes 0.25 0.5 0.4 average tax rate average tax rate 0.6 center → 0.3 0.2 0.1 0.0 ← mean ↑ periphery 10.5 11 11.5 12 log income 12.5 0.20 mean ↓ 0.10 0.05 0.00 13 center → 0.15 ← periphery 10.5 11 11.5 12 log income 12.5 13 Figure 6: Mean average tax rate by income in the case of homogeneous (left) and heterogenous (right) tastes. well. 4.2 The Resulting Tax Schedule The average tax rate ti r(y) depends not only on the individual household’s income but also on its place of residence. As the model shows the place of residence is not random and rich households are more likely to reside in low-tax communities. In this section, we ask what tax schedule is realized after considering the sorting of the population. In other words, we ask what tax rate a household with income y pays on average. In the case of homogeneous taste this question is trivial. All households with income below the indifferent household (ŷ = 37, 000) face the average tax rate in the high-tax community; rich households the one in the low-tax community. The left graph in figure 6 shows the resulting tax schedule. While progressive within the communities, it is actually regressive as the richest households face lower average tax rates than the poorest households. In general, the expected or mean average tax for a household with in20 come y is E[t r(y)|y] = X i [P (i|y) · ti r(y)] , (11) where ti r(y) is the average tax rate for a household with income y in community i. The probability that a household with income y lives in community i, f (y|i)P (i) , f (y) P (i|y) = (12) is calculated from the income density f (y|i) in community i, the probability P (i) that an arbitrary household resides in community i and the income distribution f (y) of the whole area. In the case of heterogenous tastes, the marginal income distribution f (y|i) in a community i (shown in the right figure 4) is calculated by integrating over tastes in community i: f (y|i) = Z αi f (y, α)dα , αi where αi and αi are the lowest and highest tastes in community i. The right graph in figure 6 shows the mean average tax rate in the case of heterogenous tastes. The realized tax schedule is still progressive, though, much flatter than the tax schedule implemented by the canton. 5 Evidence In this section, we estimate the mean average tax rates that households with a given income face in the Zurich metropolitan area. We then compare our estimates to the results obtained in the previous section. 5.1 Method In principle, the mean average tax rate can be estimated from a random sample of households in the studied area. Knowing each households’ income 21 and community tax rate allows to directly estimate the mean average tax rate with e.g. a kernel regression. The random sampling automatically accounts for the sorting of the population by incomes. Unfortunately, we do not have such microdata with tax information. Furthermore, available survey data suffers from small sample sizes and stratified sampling over communities. We therefore follow an alternative estimation strategy. The mean average tax rate of a household with income y can be estimated from equation (11): Ê[t r(y)|y] = X i [P̂ (i|y) · ti r(y)] As the canton sets the tax structure r(y) and the individual communities their tax shifters ti , the average tax rate ti r(y) for any income y in any community i is known. The estimated probability that a household with income y lives in community i is given by equation (12): P̂ (i|y) = fˆ(y|i)P̂ (i) fˆ(y|i)ni =P , ˆ(y|j)nj ] fˆ(y) [ f j where ni is the known number of households living in community i. It remains, therefore, estimating the income density fˆ(y|i) of each community i in the area. We estimate fˆ(y|i) from publicly available local income distribution data. The federal tax administration publishes the number of households with taxable income in seven different income classes.13 We assume that incomes are log-normally distributed and estimate mean and variance of this distribution using maximum likelihood.14 We estimate a truncated log-normal distribution as the first reported income interval is empty 13 Swiss Federal Tax Administration, Steuerbelastung in der Schweiz, Natürliche Personen nach Gemeinden 1997, Neuchâtel: Swiss Federal Statistical Office. 14 Note that this maximum likelihood estimator corresponds to an ordered probit with known thresholds. 22 for technical reasons. The log likelihood function for any community i is     c −µ  i 6 Φ k+1σi i − Φ ckσ−µ X i ,   log Li = sk · log  c1 −µi 1 − Φ k=1 σi where µi and σi2 are mean and variance of log income in community i. sk is the number of households in income class k with lower interval limit ck ∈ {log(15000), log(20000), log(30000), log(40000), log(50000), log(75000), ∞}. Φ(.) is the cdf of the standard normal distribution. The income density in community i is then estimated as " 1 exp − fˆ(y|i) = 2 σ̂i y 2π 1 √ 5.2  log(y) − µ̂i σ̂i 2 # . Results Figure 7 shows that the average tax rates that households with income y face in the Zurich metropolitan area. The top line is the average tax rate ti r(y) of households living in the community with the highest tax shifter. The bottom line is the average tax rate of households in the community with the lowest tax shifter. The middle line is the estimated mean average tax rate that households in this area face, Ê[t r(y)|y]. This is the expected unconditional, i.e. not conditioned on the place of residence, average tax rate. As one can see, the average poor household faces almost the average tax rate in the highest-tax community (or in the city of Zurich).15 This is, of course, because most poor households live in high tax communities. As households become richer, they live more often in the low-tax communities and thus face an average tax rate that is on average substantially smaller than in high-tax communities. The mean average tax rate of households 15 The tax shifter is 131 in the highest-tax community and 130 in the city of Zurich. 23 average tax rate 0.15 highest tax community → mean ↓ 0.10 0.05 ↑ lowest tax community 0.00 10.5 11 11.5 12 log income 12.5 13 Figure 7: Estimated mean average tax rate by income. with very high incomes y is even relatively close to the average tax rate of very rich households living in the lowest-tax community. The results from the estimation (figure 7) are very similar to the predictions of the calibrated model with taste heterogeneity (figure 6). There are though two noteworthy differences: First, the difference between the highest and the lowest tax shifters is in reality smaller than our model predicts. Second, the mean average tax rate of very rich households remains in reality above the average tax rate of very rich households in the lowest-tax community, unlike in our simulation. While polito-economical considerations may account for the first difference,16 the second might indicate that the location choice depends also on preference characteristics other than the taste for housing. 6 Conclusions We have focused on the tension between fiscal decentralization and progressive taxation. We have presented a multi-community model in which the 16 The threat of a so-called tax harmonization often prevents low-tax communities from further lowering their tax shifters. 24 local income tax rate is determined by an exogenous progressive tax schedule and a tax shifter that can differ across communities. The progressivity of the tax schedule has been shown to induce a self-sorting process that results in substantial though imperfect income sorting. Rich households are found to be more likely to locate themselves in low tax communities than poor households such that the actual tax structure becomes less progressive than the exogenous tax schedule. To investigate the quantitative implications of our model, we have calibrated a fully-specified version to the largest metropolitan area in Switzerland. The equilibrium values of the simulation have shown the same pattern across communities as we observe in this area. We have further estimated the actual tax structure faced by the households in this area. We have found that the actual tax structure is indeed significantly less progressive than the fixed tax schedule. Hence, progressive taxes should be implemented at the state, the national or even the supranational level rather than at the community level given that one wants them to unfold their full redistributive effect. 25 Appendix: Calibration Land Area: The whole area has a physical size of 349km2 , of which 88km2 (25%) form the city of Zurich. 140km2 are dedicated to development, 53km2 (38%) in the inner city and 87km2 in the fringe communities. In 1998, the whole area was populated by around 628’000 inhabitants, of whom 334,000 lived in the city and 294,000 in the fringe communities.17 This agglomeration is modelled as two distinct jurisdictions with land area L1 = 0.4 and L2 = 0.6 respectively. Tax schedule: The parameters r0 = 0.132, r1 = 1 and r2 = 0.00001 almost perfectly approximate the tax scheme of the canton of Zurich.18 Income Distribution: The income distribution is calibrated with data from the Swiss labor force survey.19 The 1995 cross-section contains detailed information on 1124 households in the above defined region. These households had average income (after state and federal taxes) of CHF 92,000, median income of CHF 66,700 and a quartile distance of 47,700.20 We use a log-normal distribution to approximate this right-skewed distribution. A log-normal distribution with mean E(ln y) = 11.1 and standard deviation SD(ln y) = 0.55 matches the observed median and quartile distance. For numerical tractability, the model distribution is truncated at a minimum income of ymin = 23, 000 and a maximum income ymax = 500, 000.21 Taste Distribution: The Swiss labor force survey also contains the monthly 17 Source: Statistisches Amt des Kantons Zürich, Gemeindedaten per 31.12.1998. Tax scheme according to Steuergesetz vom 8. Juni 1997. 19 Swiss Federal Statistical Office, Schweizerische Arbeitskräfterhebung (SAKE) 1995. 20 State and Federal taxes were deducted from net household income (after social security contribution) assuming a two-child family. 21 The minimum income is subsistence level for a one-person-household as defined by the Schweizerische Konferenz für Sozialhilfe (SKOS) and adjusted for inflation. The maximum income is chosen arbitrarily, but has no influence on the numerical simulation due to the low weight on high incomes. 18 26 housing expenditure of renters which allows to calibrate the distribution of tastes for housing.22 Note that the taste parameter α in the Cobb-Douglas utility function is the share of housing in a utility maximizing household. We therefore estimate each household taste parameter as α = (ph)/yd , where ph is observed households housing expenditure and yd is observed household income minus federal, state and communal taxes. A beta distribution with mean E(α) = 0.25 and standard deviation SD(α) = 0.11 describes the distribution of the so calculated taste parameter well. Taste and income are assumed to be uncorrelated. Housing and Public Good Production: The price elasticity of housing supply is θ = 3 as in Epple and Romer (1991) and Goodspeed (1989). The targeted public goods provision is set to 5000. 22 Of course, there is a selection bias by only considering renters. This seems nevertheless justified because the proportion of renters is very high in Switzerland (65% in the data set used). 27 References [1] Ellickson (1971), Jurisdictional Fragmentation and Residential Choice, American Economic Review, 61, 334-339. [2] Epple, Dennis and Glenn J. Platt (1998), Equilibrium and Local Redistribution in an Urban Economy when Households Differ in both Preferences and Income, Journal of Urban Economics, 43, 23-51. [3] Epple, Dennis and Thomas Romer (1991), Mobility and Redistribution, Journal of Political Economy, 99(4), 828-858. [4] Feldstein, Martin and Marian V. Wrobel (1998), Can State Taxes Redistribute Income?, Journal of Public Economics, 68(3), 369-396. [5] Goodspeed, Timothy J. (1989), A Re-Examination of the Use of Ability to Pay Taxes by Local Governments, Journal of Public Economics, 38, 319-342. [6] Musgrave, Richard M. (1959), The Theory of Public Finance, New York: McGraw-Hill. [7] Oates, Wallace E. (1972), Fiscal Federalism, New York: Harcourt Brace Jovanovich. [8] Oates, Wallace E. (1999), An Essay on Fiscal Federalism, Journal of Economic Literature, 37(3), 1120-1149. [9] Ross, Stephen and John Yinger (1999), Sorting and Voting: A Review of the Literature on Urban Public Finance. In: Paul Ceshire and Edwin S. Mills (eds.), Handbook of Regional and Urban Economics, Vol. 3, Amsterdam: North-Holland. 28 [10] Schmidheiny, Kurt (2004), Income Segregation and Local Progressive Taxation: Empirical Evidence from Switzerland, CESifo Working Paper 1313. [11] Westhoff, F. (1977), Existence of Equilibria in Economies with a Local Public Good, Journal of Economic Theory, 14, 84-112. [12] Young, Peyton H. (1990), Progressive Taxation and Equal Sacrifice, American Economic Review, 80(1), 253-266. 29 WORKING PAPER SERIES 2005 http://ase.tufts.edu/econ/papers/papers.html 2005-01 EGGLESTON, Karen, Keqin RAO and Jian WANG; “From Plan to Market in the Health Sector? China's Experience.” 2005-02 SHIMSHACK Jay; “Are Mercury Advisories Effective? Information, Education, and Fish Consumption.” 2005-03 KIM, Henry and Jinill KIM; “Welfare Effects of Tax Policy in Open Economies: Stabilization and Cooperation.” 2005-04 KIM, Henry, Jinill KIM and Robert KOLLMANN; “Applying Perturbation Methods to Incomplete Market Models with Exogenous Borrowing Constraints.” 2005-05 KIM, Henry, Jinill KIM, Ernst SCHAUMBURG and Christopher A. SIMS; “Calculating and Using Second Order Accurate Solutions of Discrete Time Dynamic Equilibrium Models.” 2005-06 KIM, Henry, Soyoung KIM and Yunjong WANG; “International Capital Flows and Boom-Bust Cycles in the Asia Pacific Region.” 2005-07 KIM, Henry, Soyoung KIM and Yunjong WANG; “Fear of Floating in East Asia.” 2005-08 SCHMIDHEINY, Kurt; “How Fiscal Decentralization Flattens Progressive Taxes.” 2005-09 SCHMIDHEINY, Kurt; “Segregation from Local Income Taxation When Households Differ in Both Preferences and Incomes.” 2005-10 DURLAUF, Steven N., Andros KOURTELLOS, and Chih Ming TAN; “How Robust Are the Linkages between Religiosity and Economic Growth?” 2005-11 KEELY, Louise C. and Chih Ming TAN; “Understanding Preferences For Income Redistribution.” 2005-12 TAN, Chih Ming; “No One True Path: Uncovering the Interplay between Geography, Institutions, and Fractionalization in Economic Development.” 2005-13 IOANNIDES, Yannis and Esteban ROSSI-HANSBERG; “Urban Growth.” 2005-14 PATERSON, Robert W. and Jeffrey E. ZABEL; “The Effects of Critical Habitat Designation on Housing Supply: An Analysis of California Housing Construction Activity.” 2005-15 KEELY, Louise C. and Chih Ming TAN; “Understanding Divergent Views on Redistribution Policy in the United States.” 2005-16 DOWNES, Tom and Shane GREENSTEIN; “Understanding Why Universal Service Obligations May Be Unnecessary: The Private Development of Local Internet Access Markets.” 2005-17 CALVO-ARMENGOL, Antoni and Yannis M. IOANNIDES; “Social Networks in Labor Markets.” 2005-18 IOANNIDES, Yannis M.; “Random Graphs and Social Networks: An Economics Perspective.” 2005-19 METCALF, Gilbert E.; “Tax Reform and Environmental Taxation.” 2005-20 DURLAUF, Steven N., Andros KOURTELLOS, and Chih Ming TAN; “Empirics of Growth and Development.” 2005-21 IOANNIDES, Yannis M. and Adriaan R. SOETEVENT; “Social Networking and Individual Outcomes Beyond the Mean Field Case.” 2005-22 CHISHOLM, Darlene and George NORMAN; “When to Exit a Product: Evidence from the U.S. Motion-Pictures Exhibition Market.” 2005-23 CHISHOLM, Darlene C., Margaret S. McMILLAN and George NORMAN; “Product Differentiation and Film Programming Choice: Do First-Run Movie Theatres Show the Same Films?” 2005-24 METCALF, Gilbert E. and Jongsang PARK; “A Comment on the Role of Prices for Excludable Public Goods.”