DEPARTMENT OF ECONOMICS
WORKING PAPER
2005
Department of Economics
Tufts University
Medford, MA 02155
(617) 627 – 3560
http://ase.tufts.edu/econ
How Fiscal Decentralization Flattens
Progressive Taxes
Roland Hodler‡
Harvard University
Kurt Schmidheiny§
Tufts University
February 2005
Abstract
We study the tension between fiscal decentralization and progressive
taxation. We present a multi-community model in which the local income tax rate is determined by an exogenous progressive tax schedule
and a tax shifter that can differ across communities. The progressivity of the tax schedule induces a self-sorting process that results
in substantial though imperfect income sorting. Rich households are
more likely to locate themselves in low tax communities than poor
households. The actual tax structure is thus less progressive than
the exogenous tax schedule. To investigate the quantitative implications of our model, we calibrate a fully-specified version to the largest
metropolitan area in Switzerland. The equilibrium values of the simulation show the same pattern across communities as we observe in
this area. The theoretical result is challenged by estimating the actual
tax structure faced by the households in this area. We find that the
actual tax structure is indeed substantially less progressive than the
fixed tax schedule.
Key words: Progressive Taxation, Fiscal Decentralization, Income Segregation
‡
Department of Economics, email: rhodler@fas.harvard.edu.
Department of Economics, Braker Hall, Medford,
kurt.schmidheiny@tufts.edu.
§
1
MA
02155,
email:
1
Introduction
In this paper, we study the tension between fiscal decentralization and progressive taxation. We investigate to what extent fiscal decentralization reduces the progressivity of a common tax schedule in a federation in which
the level of the tax rates can differ across communities. We show that progressive taxation and fiscal decentralization lead to income sorting, which
undermines the progressivity of the tax schedule.
We base our analysis on a multi-community model, in which the income
tax rate is determined by an exogenous progressive tax schedule and a local
tax shifter that can differ across communities. Local tax revenue is used to
finance a local public good. In the basic version of the model, the mobile
individual households differ only in their incomes.
In equilibrium, no household wants to move, local housing markets clear
and the communities’ budgets are balanced. It follows that the local tax
shifters must be higher in communities in which housing prices are lower.
The progressivity of the tax schedule then induces a self-sorting process that
results in perfect income sorting. Poor households locate themselves in high
tax communities while rich households locate themselves in low tax communities. Different from most of the previous literature,1 sorting is a direct result
of the progressive tax schedule and does not require strong assumptions on
the preferences for either public goods or housing. The spatial segregation
of the population by incomes implies that the actual tax structure must be
less progressive than the exogenous tax schedule if progressive at all.
In an extension, we assume that the individual households differ not only
in their incomes, but also in their preferences for housing. Since each house1
See e.g. Ellickson (1971), Westhoff (1977), Epple and Romer (1991) and the literature
surveyed in Ross and Yinger (1999).
2
hold’s location choice depends now on its income and its preferences, the
income sorting is imperfect in equilibrium: households with the same income
are found in different communities, though rich households are still more
likely to locate in low tax communities than poor households.
To investigate the quantitative implications of this model, we calibrate
a fully-specified version to the Zurich area, the largest Swiss metropolitan
area. Swiss metropolitan areas offer an excellent laboratory for the analysis
of fiscal decentralization. In Switzerland, each community can individually
set the level of income taxes by a local tax shifter while the cantons (states)
fix the (progressive) schedule of income taxes. The equilibrium values of this
simulation show the same pattern across communities as we observe in the
Zurich area: A high tax shifter and low average incomes in the center, and
low tax shifters and high average incomes in the fringe.
We then use data on the spatial distribution of incomes in the Zurich area
to estimate the actual tax structure, i.e. average tax rates as a function of
income as faced by the households in this area. We find that the actual tax
structure is substantially less progressive than the tax schedule implemented
by the canton because rich households are more likely to live in low tax communities than poor households. This finding is in line with the predictions
of our theoretical model.
This paper is probably most closely related to Feldstein and Wrobel
(1998), Epple and Platt (1998) and Schmidheiny (2004). Feldstein and Wrobel show that a shift in a single US state’s tax progressivity has no redistributive effects since migration leads to an adjustment of the net wages and the
employment structure. Complementary, we show that migration undermines
the redistributive effect of progressive taxation in presence of fiscal decentralization even if wages do not adjust. Our theoretical model shares the formal
3
structure with Epple and Platt. Schmidheiny shares the location choice part
of our model and shows empirically that rich households are more likely to
move to low tax communities than poor households.
It is a well known normative principle of the literature on fiscal federalism
that income redistribution should be centralized.2 Our paper relates to this
literature as it shows that fiscal decentralization undermines the redistributive effect of progressive taxes.
The paper is organized as follows: Section 2 briefly informs about fiscal
decentralization and progressive taxation in Switzerland. Section 3 presents
the theoretical model and some results concerning the agents’ location choice.
It further proves that an (asymmetric) equilibrium exists. Section 4 presents
the simulation of a fully-specified version of our model, which is calibrated
to the Zurich metropolitan area. Section 5 estimates the actual tax structure
faced by the households in this area. Section 6 concludes.
2
Fiscal Decentralization and Progressive Taxation in Switzerland
Switzerland is an exemplary federal fiscal system. The Swiss federation comprises 26 states, the so-called cantons. The cantons are divided into roughly
3000 municipalities of varying size and population. All three state levels finance their expenditures essentially by their own taxes and fees. The total
tax revenue of all three levels was 93 billion CHF in 2001, of which 46% is imposed by the federation, 32% by the cantons and 22% by the municipalities.3
While the federal government is mainly financed by indirect taxes (61% of
2
See Musgrave (1959) and Oates (1972). For a recent survey of the literature on fiscal
federalism see Oates (1999).
3
All figures in this paragraph apply to 2001. Source: Swiss Federal Tax Administration
(2002), Öffentliche Finanzen der Schweiz 2001, Neuchâtel: Swiss Federal Statistical Office.
4
federal tax revenue) such as the VAT, the cantons and municipalities largely
rely on direct taxes. Income taxes account for 60% of cantonal and 84% of
municipal tax revenue.
The cantons organize their tax systems autonomously. For example, they
decide upon the level of income and corporate taxes and the degree of tax
progression. The individual municipalities in turn can generally set a tax
shifter for income and corporate taxes. The municipal tax is then the cantonal tax rate multiplied by the municipal tax shifter. Federal and cantonal
systems of fiscal equalization limit the tax differences across cantons and
across municipalities within the same canton to some extent, but still leave
room for considerable variation.
The above outlined federal system leads to ample differences of income
taxes across Swiss municipalities. For example, for a two-child family with a
gross income of 80,000 Swiss francs (CHF) combined cantonal and municipal
income taxes ranged from 3.6% to 11.3% in the year 1997. The federal
income tax for this household was 0.7%. With an income of 500,000 CHF
a two-child family faced much higher tax rates due to the progressivity of
the tax schedules. Combined cantonal and municipal income taxes ranged
from 10.9% to 28.7% for this household and its federal income tax was 9.4%.
Within metropolitan areas the (municipality) tax differences are smaller but
still differ by a factor of 1.5 in e.g. the Zurich area.
3
The Model
In this section, we introduce and solve the model. After presenting the general setting, we characterize the preferences and derive the resulting allocation of households across distinct communities. We then prove the existence
of an asymmetric equilibrium. Finally, we introduce heterogeneity in the
5
preferences and discuss how this affects our results.
3.1
The Setting
Given is a metropolitan area with J communities. This area is populated by
a continuum of households, which differ in their income y ∈ [y, y]. Income
follows a distribution function f (y) > 0.
There are three goods in the economy: private consumption b, housing
h and a local public good g. The housing h is provided by absentee landlords, and the housing market is competitive. Hence, the price for housing
pi equates the housing supply HSi with the aggregate housing demand HDi .
We assume that the housing supply HSi = HS(Li , pi , ) is a non-decreasing
function of the land area Li and the price pi .
Each community i spends the amount ni g to provide the local public good
g, where ni is the measure of households living in community i. The communities levy income taxes to finance the public good. In each community i, the
tax rate consists of two parts, a local tax shifter ti and a progressive tax rate
structure r(y). We assume r(y) continuous and increasing in y, r(y) > 0, the
average tax rate t·r(y) ∈ [0, 1) and the marginal tax rate t[r +yr′ (y)] ∈ [0, 1).
The quantity of the local public good g and the tax rate structure r(y) are
both exogenous (to the communities) and identical across communities. In
each community i, the tax shifter ti is then determined by budget balance.
Each household can move costlessly and chooses the community maximizing its utility as place of residence.
3.2
Preferences and Location Choice
The preferences of the households are described by the utility function
U (h, b) ,
6
(1)
where h is the consumption of housing and b the consumption of the private
good. We assume the utility function to be strictly increasing, strictly quasiconcave, twice continuously differentiable in h and b and homothetic.4
Households face the budget constraint (omitting community indices)
ph + b ≤ yd = y[1 − t · r(y)] ,
(2)
where p is the price of housing; the price of the private good is set to unity.
Disposable income yd depends on the local tax shifter t and the tax rate
structure r(y).
Maximization of the utility function (1) with respect to h and b subject
to constraint (2) yields housing demand h∗ = h(p, yd ) = h(t, p, y), demand
for the private good b∗ = y(1 − t) − ph(t, p, y), and indirect utility
V (t, p, y) = U (h∗ , b∗ ) .
(3)
For later use note that V is continuous in t, p and y.
We assume that the elasticity of housing with respect to the disposable
income is smaller or equal to unity, i.e.,
εh,yd :=
∂h∗ yd
≤ 1 for all yd and p.
∂yd h∗
(4)
Next, we present two properties of the households’ indifference curves
that will lead to segregation of the population by incomes:
Property 1
dt
M (t, p, y) :=
dp
dV =0
h∗
∂V /∂p
=−
<0
=−
∂V /∂t
y · r(y)
4
Since the local public good g is constant across communities and not of primary
interest for our considerations, we assume for simplicity that it does not enter the utility
function. Equivalently, we could assume that it enters separably.
7
Property 1 follows from the strictly increasing utility function after applying
the implicit function theorem and the envelope theorem. It implies that a
household can be made indifferent towards an increase in the tax shifter t
when it is compensated by decreased housing prices p, and vice versa.
Property 2
∂M
∂h∗ yd ∂yd y
∂r(y) h∗
h∗
= [1 −
+
> 0 for all y, t and p.
]
∂y
∂yd h∗ ∂y yd y 2 r(y)
∂y y 2 r2 (y)
Proof: By assumption, (∂h∗ /∂yd )(yd /h∗ ) ≤ 1. Our assumptions about the
bounds of the average and the marginal tax rate guarantee (∂yd /∂y)(y/yd )
= [1 − tr − tyr′ (y)]/[1 − tr(y)] ∈ [0, 1). The assumption that r(y) increases
in y, implying ∂r(y)/∂y > 0, concludes the proof. ✷
Property 2 implies that the decrease in housing prices p which compensates a household for a higher tax shifter t has to be larger for poor households
than for rich ones.
Given a set of community characteristics, (pi , ti ) for i = 1..J, a household
prefers community i if and only if
V (pi , ti , y) ≥ V (pj , tj , y) for all j 6= i .
(5)
From this, the following proposition directly follows:
Proposition 1 (Order of community characteristics)
If any two populated communities differ in their characteristics (pi , ti ), then
the community with the higher housing prices pi must impose a lower tax
shifter ti .
Proof: Suppose the opposite, i.e., that the housing prices pi and the tax
shifter ti are both higher in one community. In this case, no household would
8
choose to live in this community (for the same reason that leads to property
1). This is a contradiction. ✷
In the remaining part of this section, we show how households allocate
themselves across distinct communities. Distinct communities differ in both
tax shifters and prices. Note that our model allows for groups of communities
with identical community characteristics (ti , pi ). Such groups appear as one
community in our notation.
Lemma 1 (Boundary indifference)
There is a ‘border’ household between any two communities i and j that is
indifferent between these two communities. That is, if a household with income y ′ prefers to live in i and another household with income y ′′ > y ′ prefers
to live in j, then there exists a household with income ŷij = ŷ(pi , ti , pj , tj ),
y ′ ≤ ŷij ≤ y ′′ , such that V (pi , ti , ŷij ) = V (pj , tj , ŷij ).
Proof: Let Vi (y) := V (pi , ti , y) be a household’s utility in i and Vj (y) :=
V (pj , tj , y) in j. The household with income y ′ prefers i to j, hence Vi (y ′ ) −
Vj (y ′ ) ≥ 0. The opposite is true for a household with income y ′′ : Vi (y ′′ ) −
Vj (y ′′ ) ≤ 0. From the continuity of V in y follows the continuity of Vi (y) −
Vj (y) in y. The intermediate value theorem proves that there is at least one
ŷ between y ′ and y ′′ such that Vi (ŷ) − Vj (ŷ) = 0. ✷
Lemma 2 (Two-community income segregation)
Given two populated communities i and j with distinct characteristics (ti , pi ) 6=
(tj , pj ), where ti < tj , then any household in i is richer than any household
in j. That is, if a household with income ŷ is indifferent between i and j,
then any household y ′ < ŷ strictly prefers j and any household y ′′ > ŷ strictly
prefers i.
9
t
tj
y' '
ti
ŷ
y'
pj
pi
p
Figure 1: Indifference curves in the (t, p) space
Proof: The proof uses figure 1, which shows the indifference curves in the
(t, p)-space for three different income levels y ′ < ŷ < y ′′ . The indifference
curves represent all (t, p) combinations that households consider as good as
community j’s (pj , tj )-pair. Each household prefers pairs south-west of its
indifference curve. It follows from property 1 that the indifference curves
decrease in the (t, p)-space and from property 2 that they become flatter
as income rises. Imagine now a community i, characterized by ti < tj and
pi > pj , where household ŷ is indifferent to j. All poorer households, e.g. y ′ ,
prefer j to i and all richer households, e.g. y ′′ , prefer i to j. ✷
Proposition 2 (Multi-community income segregation)
Given J populated communities with distinct characteristics (ti , pi ), then it
holds for any two communities i and j with ti < tj that any household in i
is richer than any household in j.
Proof: The proposition implies that [y, y] must be partitioned into J
non-empty and non-overlapping intervals. Suppose the opposite, i.e., y ′ as
10
well as y ′′ prefer community i, but y ′′′ , y ′ < y ′′′ < y ′′ , strictly prefers another
community j. Then it follows from lemma 1 that there is a ŷij , y ′ ≤ ŷij < y ′′′ .
Lemma 2 implies that y ′′ > yˆij strictly prefers j to i, which is a contradiction.
✷
3.3
Equilibrium
In this section, we prove that an asymmetric equilibrium exists. That is, we
show that an allocation in which communities exhibit different characteristics
(pi , ti ) can be an equilibrium.
An equilibrium requires that each household is located in the community
that maximizes its utility, that each household maximizes its utility within
the given community, that the housing market clears in each community, that
each community has a balanced public budget and that each community has
a positive population.
There always exists a symmetric equilibrium in which all communities
have identical characteristics (pi , ti ) and in which the households allocate
themselves such that all communities show the same income distribution.5
However, we are interested in the case in which at least some communities
differ in their characteristics (pi , ti ). We therefore show that an asymmetric
equilibrium, i.e., an equilibrium in which (pi , ti ) 6= (pj , tj ) for some i and
j, exists too. For simplicity, we focus thereby on the case of two distinct
communities, 1 and 2.
We assume with no loss of generality that t1 > t2 . Hence, any household
in 2 must be richer than any household in 1, as lemma 2 implies. We define
∆V (ŷ) = V1 (p1 (ŷ), t1 (ŷ), ŷ) − V2 (p2 (ŷ), t2 (ŷ), ŷ),
5
(6)
Other equilibria in which all communities have identical characteristics (pi , ti ) might
exist as well.
11
where pi (ŷ) and ti (ŷ) are the equilibrium housing price and the equilibrium
tax shifter, respectively, in i given that households with y < ŷ live in 1 and
households with y > ŷ in 2. Hence, Vi (pi (ŷ), ti (ŷ), ŷ) is the indirect utility of
a household with ŷ in i given this allocation of households.
In addition, we assume:6
(i) The housing supply HS(Li , pi ) satisfies HS(Li , 0) = Li > 0 for i = 1, 2.
(ii) The minimum income y > g.
(iii) If hi → ∞, bi > 0, hj < ∞ and bj < ∞, then U (hi , bi ) > U (hj , bj ).
Proposition 3 (Existence of an asymmetric equilibrium)
There exists an equilibrium in which the communities 1 and 2 exhibit different characteristics, i.e. t1 > t2 and p1 < p2 .
Proof: We prove proposition 3 by showing (1) that ∆V (ŷ) is continuous
and (2) that ∆V (ŷ) > 0 as ŷ → y and that ∆V (ŷ) < 0 as ŷ → y. It
follows then from the intermediate value theorem that there is at least one ŷ,
y < ŷ < y, such that ∆V (ŷ) = 0. This implies - from the definition of ∆V that the border household ŷ is indifferent between the two communities, the
prices p1 and p2 clear the local housing markets and the tax shifters t1 and
t2 balance the community budgets.
(1) The equilibrium housing price pi is determined by HS(Li , pi ) = HDi .
It follows from lemma 2 that
HDi =
Z
yi
h(pi , ti ; y)f (y)dx,
(7)
yi
where y i and y i are the highest and lowest incomes in community i. The
hereby implicitly defined pi is continuous in y i and y i given continuity of
6
As it will become evident in section 4, these assumptions are sufficient, but not necessary for the existence of an asymmetric equilibrium.
12
HS(·), h(·) and f (·). The balanced budget requirement and lemma 2 imply
that the equilibrium tax shifter in community i is
ni g
ti = R yi
r(y)f (y)dx
yi
where
ni =
Z
,
(8)
yi
f (y)dx.
(9)
yi
Given continuity of r(·) and f (·), ti is continuous in y i and y i . Since the indirect utility Vi is continuous in pi , ti and y and since pi and ti are continuous
in y, ∆V (ŷ) is continuous in ŷ.
(2) If follows from equations (7) and (9) that HD1 → 0 and n1 → 0
as ŷ → y. Since assumption (i) guarantees that HS(L1 , 0) = L1 > 0 (and
since ∂HS(Li , pi )/∂pi ≥ 0), it holds that h∗ (p1 , t1 ; y) → ∞ and p1 → 0 as
ŷ → y. Hence, b∗ → y − g > 0, where the strict inequality follows from
assumption (ii). Assumption (iii) then guarantees that ∆V (ŷ) > 0 as ŷ → y.
Analogously, it can be shown that ∆V (ŷ) < 0 as ŷ → y.7 ✷
3.4
Adding Heterogeneous Preferences
So far, we have assumed that households differ only in their incomes y. In
this section, we extend the model by assuming that households differ in their
preferences as well.
The household preferences are now represented by the utility function
U (h, b; α), where the parameter α describes the taste for housing. The higher
α, the more a household is, ceteris paribus, willing to spend on housing.
Hence, the housing demand increase in α, i.e.
∂h∗
∂h(t, p; y, α)
=
> 0 for all t, p, y and α.
∂α
∂α
7
The only difference is that b∗ → y − g, which exceeds y − g.
13
(10)
Housing taste (α)
1
2
...
j
...
J
Income (y)
Figure 2: Simultaneous income and preference segregation. The areas denoted by j = 1, ..., J show the attributes of the households that prefer community j.
Income and preferences are jointly distributed according to the density function f (y, α).
It follows that income segregation holds, but only within the subpopulation of households with identical preferences. Preference segregation occurs
as well: That is, among the subpopulation of households with the same income y, households with a high α, i.e. a strong taste for housing, tend to
allocate themselves to communities with higher tax shifters ti than households with a low α.
Simultaneous heterogeneity by incomes and tastes leads to a more realistic pattern of household segregation. Although income groups tend to
gather, the segregation is not perfect. Figure 2 shows the resulting allocation of household types to communities. The households on the borders are
indifferent between neighboring communities j. Community 1 with the lowest housing prices is populated by the poorest households with strong taste
for housing, while the richest households with low housing taste are situated in community J with the lowest tax rate and the highest housing price.
14
However, rich households with strong taste for housing prefer lower-priced
communities and poor households with weak taste for housing group with
relatively rich households in the lower-tax communities.
4
A Specified Version of the Model
To investigate the qualitative and quantitative properties of the model we
construct a fully specified example in this section. The specification is kept
as simple as possible but still captures all mechanisms of the model. The
example is calibrated to the Zurich area, the largest Swiss metropolitan area.
The common tax schedule is taken from Young (1990)
r(y) = r0 [1 − (1 + r2 y r1 )−1/r1 ] .
with parameters r0 > 0, r1 > 0 and r2 > 0. The average local tax rate
t r(y) and the local marginal tax rate t[y ∂r(y)/∂y + r(y)] is increasing in
income y. The marginal tax rate is always above the marginal tax rate; both
asymptotically approach a maximum t r0 .
Household preferences are described by a Cobb-Douglas utility function:
U = hα b1−α ,
where 0 < α < 1 stands for the taste parameter of the general model. Utility
function and tax schedule satisfy our properties 1 and 2.
The locus of indifferent households between two communities i and j for
any given taste α is
("
ŷij =
1−
pαj − pαi
r0 pαj ti − pαi tj
We adopt the housing supply function
!−r1
HSi = Li (pi )θ
15
#
1
−1
r2
)1/r1
.
Figure 3: Center and periphery in the metropolitan area of Zurich.
from Epple and Romer (1991).8
We calibrate the above outlined model to the metropolitan area of Zurich
in Switzerland. The area around the city of Zurich forms the biggest Swiss
metropolitan area. The city of Zurich has about 330 thousand inhabitants
and is the capital of the canton (state) of Zurich. The canton of Zurich
counts 1.2 million inhabitants in 171 individual communities. As described
in section 2, each of these communities can choose its own tax shifter.
The analysis is restricted to the city of Zurich and a ring of the most
integrated communities around the center. This ring is formed by all communities in the canton of Zurich with more than 1/3 of the working population commuting to the center.9 Figure 3 shows a map with the city of
Zurich and the thus defined ring of 40 communities. This agglomeration is
8
Epple and Romer derive this housing supply function from an explicit production
function, where 0 ≤ θ ≤ 1 is the ratio of non-land to land input.
9
The number of commuters to the city of Zurich and the size of the working population
in the communities is based on the 1990 Census. This definition of the urban area is
chosen to justify the model’s assumption that households income is exogenous, i.e. that
they choose their place of residence independent of where they work. It results in a set of
communities closest to the central business district.
16
Table 1: Equilibrium values of the specified model.
homogeneous
preferences
harmonized
center periphery
L: area
1
0.4
0.6
p: rent
11.7
6.6
13.1
t: tax shifter
1
5.17
0.91
n: inhabitants
1
0.12
0.88
Ey: mean income
78’547
30’771
85’010
heterogeneous
preferences
center periphery
0.4
0.6
9.9
12.5
2.30
0.87
0.23
0.77
47’755
87’703
The calibrated model parameters: g = 5000, E(ln y) = 11.1, SD(ln y) = 0.55,
ymin = 23, 000, ymax = 500, 000, E(α) = 0.25, S.A.(α) = 0 (homogeneous preferences), S.A.(α) = 0.11 (heterogeneous preferences), θ = 3, r0 = 0.132, r1 = 1 and
r2 = 0.00001.
modelled as two distinct jurisdictions, which we call center and periphery.
The details of the calibration are described in the appendix. The parameters
are summarized at the bottom of table 1.
4.1
Simulated Equilibrium
The equilibrium values pi and ti in both communities satisfy equations (7)
and (8) and guarantee that no households wants to move. As there is no
closed form solution to this nonlinear system of four equations and four
unknowns, we solve numerically for the equilibrium values of the model.10
Table 1 shows in column 2 and 3 the equilibrium values for the case of
homogeneous tastes. There are large differences in both taxes and prices between the two communities. The tax rate in the center community is almost
10
The aggregation of individual behavior requires double integrals over the community
population. These integrals cannot be calculated analytically. We use Gauss-Legendre
Quadrature with 40 nodes in each dimension to approximate the various double integrals. We numerically solve for the equilibrium values by minimizing the sum of squared
deviations from the equilibrium conditions with the Gauss-Newton method.
17
0.8
households prefer center
0.6
indifferent
households →
0.4
households prefer
periphery
0.2
0
10.5
11
11.5 12
log income
12.5
← center
population density
housing taste (α)
1
← periphery
← whole
population
10.5
13
11
11.5 12 12.5
log income
13
Figure 4: Income and taste segregation in equilibrium. The left figure shows
the preferred community for all household types. The right figure shows the
resulting income distributions in both communities.
six times higher than in the periphery while housing prices in the center are
halve the ones in the periphery.11 Households are perfectly segregated: All
households in the high-tax center are poorer than all households in the lowtax periphery. The mean income in the periphery is therefore almost three
times the mean income in the center. Column 1 in table 1 gives the equilibrium values for the hypothetical case that the two communities merged or
harmonized their taxes.
These predictions of segregation and the implied differences of community characteristics are extreme. The consideration of heterogeneous housing
tastes leads to a more realistic situation. Table 1 shows in column 4 and
5 the equilibrium values allowing for heterogeneous tastes. The differences
between the two communities are still substantial but smaller than with homogeneous tastes: The center exhibits now 2.5 times higher taxes and 20
% lower housing prices. The left graph in figure 4 shows the segregation
11
The labels ‘center’ and ‘periphery’ are arbitrary. There is always a second equilibrium
with lower taxes in the center.
18
Local income tax shifter (1997)
Share of households with income
above CHF 75’000 (1997/98)
131.00
0.56
125.58
0.52
0.40
101.00
0.28
0.25
85.98
85.00
Figure 5: Taxes and incomes in the Zurich metropolitan area.
pattern in the income-taste space. The population is now imperfectly sorted
by incomes: While it is still true that more rich households are found in the
low-tax periphery, rich households with a strong taste for housing prefer the
low-price high-tax center and poor households with a low taste for housing
prefer the periphery. The right graph in figure 4 shows the resulting income
distributions in the two communities. The mean income in the center is
about half the one in the periphery.
Figure 5 shows the actual local tax levels and the spatial income distribution in the calibrated area. The left map visualizes the considerable tax
differentials in the Zurich area. The right map demonstrates the striking
relationship between income taxation and spatial income distribution: the
local share of rich households is almost an inverted picture of the local tax
levels.12 Our simple two-community model captures this empirical pattern
12
Data from the following sources: Commuter: Swiss Federal Statistical Office, Census
1990. Tax rates: Statistisches Amt des Kantons Zürich, Steuerfüsse 1997. Income distribution: Swiss Federal Tax Administration. Considered are all communities where more
than 1/3 of the working population is commuting to the center community.
19
homogeneous tastes
heterogenous tastes
0.25
0.5
0.4
average tax rate
average tax rate
0.6
center →
0.3
0.2
0.1
0.0
← mean
↑ periphery
10.5
11 11.5 12
log income
12.5
0.20
mean
↓
0.10
0.05
0.00
13
center →
0.15
← periphery
10.5
11 11.5 12
log income
12.5
13
Figure 6: Mean average tax rate by income in the case of homogeneous (left)
and heterogenous (right) tastes.
well.
4.2
The Resulting Tax Schedule
The average tax rate ti r(y) depends not only on the individual household’s
income but also on its place of residence. As the model shows the place
of residence is not random and rich households are more likely to reside in
low-tax communities. In this section, we ask what tax schedule is realized
after considering the sorting of the population. In other words, we ask what
tax rate a household with income y pays on average.
In the case of homogeneous taste this question is trivial. All households
with income below the indifferent household (ŷ = 37, 000) face the average
tax rate in the high-tax community; rich households the one in the low-tax
community. The left graph in figure 6 shows the resulting tax schedule.
While progressive within the communities, it is actually regressive as the
richest households face lower average tax rates than the poorest households.
In general, the expected or mean average tax for a household with in20
come y is
E[t r(y)|y] =
X
i
[P (i|y) · ti r(y)] ,
(11)
where ti r(y) is the average tax rate for a household with income y in community i. The probability that a household with income y lives in community i,
f (y|i)P (i)
,
f (y)
P (i|y) =
(12)
is calculated from the income density f (y|i) in community i, the probability
P (i) that an arbitrary household resides in community i and the income
distribution f (y) of the whole area.
In the case of heterogenous tastes, the marginal income distribution f (y|i)
in a community i (shown in the right figure 4) is calculated by integrating
over tastes in community i:
f (y|i) =
Z
αi
f (y, α)dα ,
αi
where αi and αi are the lowest and highest tastes in community i.
The right graph in figure 6 shows the mean average tax rate in the case
of heterogenous tastes. The realized tax schedule is still progressive, though,
much flatter than the tax schedule implemented by the canton.
5
Evidence
In this section, we estimate the mean average tax rates that households with
a given income face in the Zurich metropolitan area. We then compare our
estimates to the results obtained in the previous section.
5.1
Method
In principle, the mean average tax rate can be estimated from a random
sample of households in the studied area. Knowing each households’ income
21
and community tax rate allows to directly estimate the mean average tax rate
with e.g. a kernel regression. The random sampling automatically accounts
for the sorting of the population by incomes. Unfortunately, we do not have
such microdata with tax information. Furthermore, available survey data
suffers from small sample sizes and stratified sampling over communities.
We therefore follow an alternative estimation strategy. The mean average
tax rate of a household with income y can be estimated from equation (11):
Ê[t r(y)|y] =
X
i
[P̂ (i|y) · ti r(y)]
As the canton sets the tax structure r(y) and the individual communities
their tax shifters ti , the average tax rate ti r(y) for any income y in any
community i is known.
The estimated probability that a household with income y lives in community i is given by equation (12):
P̂ (i|y) =
fˆ(y|i)P̂ (i)
fˆ(y|i)ni
=P
,
ˆ(y|j)nj ]
fˆ(y)
[
f
j
where ni is the known number of households living in community i.
It remains, therefore, estimating the income density fˆ(y|i) of each community i in the area. We estimate fˆ(y|i) from publicly available local income
distribution data. The federal tax administration publishes the number of
households with taxable income in seven different income classes.13 We assume that incomes are log-normally distributed and estimate mean and variance of this distribution using maximum likelihood.14 We estimate a truncated log-normal distribution as the first reported income interval is empty
13
Swiss Federal Tax Administration, Steuerbelastung in der Schweiz, Natürliche Personen nach Gemeinden 1997, Neuchâtel: Swiss Federal Statistical Office.
14
Note that this maximum likelihood estimator corresponds to an ordered probit with
known thresholds.
22
for technical reasons. The log likelihood function for any community i is
c −µ
i
6
Φ k+1σi i − Φ ckσ−µ
X
i
,
log Li =
sk · log
c1 −µi
1
−
Φ
k=1
σi
where µi and σi2 are mean and variance of log income in community i. sk
is the number of households in income class k with lower interval limit ck ∈
{log(15000), log(20000), log(30000), log(40000), log(50000), log(75000), ∞}.
Φ(.) is the cdf of the standard normal distribution. The income density
in community i is then estimated as
"
1
exp −
fˆ(y|i) =
2
σ̂i y 2π
1
√
5.2
log(y) − µ̂i
σ̂i
2 #
.
Results
Figure 7 shows that the average tax rates that households with income y
face in the Zurich metropolitan area. The top line is the average tax rate
ti r(y) of households living in the community with the highest tax shifter.
The bottom line is the average tax rate of households in the community
with the lowest tax shifter. The middle line is the estimated mean average
tax rate that households in this area face, Ê[t r(y)|y]. This is the expected
unconditional, i.e. not conditioned on the place of residence, average tax
rate. As one can see, the average poor household faces almost the average
tax rate in the highest-tax community (or in the city of Zurich).15 This is,
of course, because most poor households live in high tax communities. As
households become richer, they live more often in the low-tax communities
and thus face an average tax rate that is on average substantially smaller
than in high-tax communities. The mean average tax rate of households
15
The tax shifter is 131 in the highest-tax community and 130 in the city of Zurich.
23
average tax rate
0.15
highest tax community →
mean
↓
0.10
0.05
↑ lowest tax community
0.00
10.5
11 11.5 12
log income
12.5
13
Figure 7: Estimated mean average tax rate by income.
with very high incomes y is even relatively close to the average tax rate of
very rich households living in the lowest-tax community.
The results from the estimation (figure 7) are very similar to the predictions of the calibrated model with taste heterogeneity (figure 6). There are
though two noteworthy differences: First, the difference between the highest and the lowest tax shifters is in reality smaller than our model predicts.
Second, the mean average tax rate of very rich households remains in reality
above the average tax rate of very rich households in the lowest-tax community, unlike in our simulation. While polito-economical considerations may
account for the first difference,16 the second might indicate that the location choice depends also on preference characteristics other than the taste
for housing.
6
Conclusions
We have focused on the tension between fiscal decentralization and progressive taxation. We have presented a multi-community model in which the
16
The threat of a so-called tax harmonization often prevents low-tax communities from
further lowering their tax shifters.
24
local income tax rate is determined by an exogenous progressive tax schedule and a tax shifter that can differ across communities. The progressivity
of the tax schedule has been shown to induce a self-sorting process that results in substantial though imperfect income sorting. Rich households are
found to be more likely to locate themselves in low tax communities than
poor households such that the actual tax structure becomes less progressive
than the exogenous tax schedule. To investigate the quantitative implications of our model, we have calibrated a fully-specified version to the largest
metropolitan area in Switzerland. The equilibrium values of the simulation
have shown the same pattern across communities as we observe in this area.
We have further estimated the actual tax structure faced by the households
in this area. We have found that the actual tax structure is indeed significantly less progressive than the fixed tax schedule. Hence, progressive taxes
should be implemented at the state, the national or even the supranational
level rather than at the community level given that one wants them to unfold
their full redistributive effect.
25
Appendix: Calibration
Land Area: The whole area has a physical size of 349km2 , of which 88km2
(25%) form the city of Zurich. 140km2 are dedicated to development, 53km2
(38%) in the inner city and 87km2 in the fringe communities. In 1998, the
whole area was populated by around 628’000 inhabitants, of whom 334,000
lived in the city and 294,000 in the fringe communities.17 This agglomeration
is modelled as two distinct jurisdictions with land area L1 = 0.4 and L2 = 0.6
respectively.
Tax schedule: The parameters r0 = 0.132, r1 = 1 and r2 = 0.00001 almost
perfectly approximate the tax scheme of the canton of Zurich.18
Income Distribution: The income distribution is calibrated with data
from the Swiss labor force survey.19 The 1995 cross-section contains detailed
information on 1124 households in the above defined region. These households had average income (after state and federal taxes) of CHF 92,000,
median income of CHF 66,700 and a quartile distance of 47,700.20 We use
a log-normal distribution to approximate this right-skewed distribution. A
log-normal distribution with mean E(ln y) = 11.1 and standard deviation
SD(ln y) = 0.55 matches the observed median and quartile distance. For
numerical tractability, the model distribution is truncated at a minimum
income of ymin = 23, 000 and a maximum income ymax = 500, 000.21
Taste Distribution: The Swiss labor force survey also contains the monthly
17
Source: Statistisches Amt des Kantons Zürich, Gemeindedaten per 31.12.1998.
Tax scheme according to Steuergesetz vom 8. Juni 1997.
19
Swiss Federal Statistical Office, Schweizerische Arbeitskräfterhebung (SAKE) 1995.
20
State and Federal taxes were deducted from net household income (after social security
contribution) assuming a two-child family.
21
The minimum income is subsistence level for a one-person-household as defined by the
Schweizerische Konferenz für Sozialhilfe (SKOS) and adjusted for inflation. The maximum
income is chosen arbitrarily, but has no influence on the numerical simulation due to the
low weight on high incomes.
18
26
housing expenditure of renters which allows to calibrate the distribution of
tastes for housing.22 Note that the taste parameter α in the Cobb-Douglas
utility function is the share of housing in a utility maximizing household. We
therefore estimate each household taste parameter as α = (ph)/yd , where ph
is observed households housing expenditure and yd is observed household
income minus federal, state and communal taxes. A beta distribution with
mean E(α) = 0.25 and standard deviation SD(α) = 0.11 describes the distribution of the so calculated taste parameter well. Taste and income are
assumed to be uncorrelated.
Housing and Public Good Production: The price elasticity of housing
supply is θ = 3 as in Epple and Romer (1991) and Goodspeed (1989). The
targeted public goods provision is set to 5000.
22
Of course, there is a selection bias by only considering renters. This seems nevertheless
justified because the proportion of renters is very high in Switzerland (65% in the data set
used).
27
References
[1] Ellickson (1971), Jurisdictional Fragmentation and Residential Choice,
American Economic Review, 61, 334-339.
[2] Epple, Dennis and Glenn J. Platt (1998), Equilibrium and Local Redistribution in an Urban Economy when Households Differ in both Preferences
and Income, Journal of Urban Economics, 43, 23-51.
[3] Epple, Dennis and Thomas Romer (1991), Mobility and Redistribution,
Journal of Political Economy, 99(4), 828-858.
[4] Feldstein, Martin and Marian V. Wrobel (1998), Can State Taxes Redistribute Income?, Journal of Public Economics, 68(3), 369-396.
[5] Goodspeed, Timothy J. (1989), A Re-Examination of the Use of Ability
to Pay Taxes by Local Governments, Journal of Public Economics, 38,
319-342.
[6] Musgrave, Richard M. (1959), The Theory of Public Finance, New York:
McGraw-Hill.
[7] Oates, Wallace E. (1972), Fiscal Federalism, New York: Harcourt Brace
Jovanovich.
[8] Oates, Wallace E. (1999), An Essay on Fiscal Federalism, Journal of
Economic Literature, 37(3), 1120-1149.
[9] Ross, Stephen and John Yinger (1999), Sorting and Voting: A Review
of the Literature on Urban Public Finance. In: Paul Ceshire and Edwin
S. Mills (eds.), Handbook of Regional and Urban Economics, Vol. 3,
Amsterdam: North-Holland.
28
[10] Schmidheiny, Kurt (2004), Income Segregation and Local Progressive
Taxation: Empirical Evidence from Switzerland, CESifo Working Paper
1313.
[11] Westhoff, F. (1977), Existence of Equilibria in Economies with a Local
Public Good, Journal of Economic Theory, 14, 84-112.
[12] Young, Peyton H. (1990), Progressive Taxation and Equal Sacrifice,
American Economic Review, 80(1), 253-266.
29
WORKING PAPER SERIES 2005
http://ase.tufts.edu/econ/papers/papers.html
2005-01
EGGLESTON, Karen, Keqin RAO and Jian WANG; “From Plan
to Market in the Health Sector? China's Experience.”
2005-02
SHIMSHACK Jay; “Are Mercury Advisories Effective?
Information, Education, and Fish Consumption.”
2005-03
KIM, Henry and Jinill KIM; “Welfare Effects of Tax Policy in
Open Economies: Stabilization and Cooperation.”
2005-04
KIM, Henry, Jinill KIM and Robert KOLLMANN; “Applying
Perturbation Methods to Incomplete Market Models with
Exogenous Borrowing Constraints.”
2005-05
KIM, Henry, Jinill KIM, Ernst SCHAUMBURG and Christopher
A. SIMS; “Calculating and Using Second Order Accurate
Solutions of Discrete Time Dynamic Equilibrium Models.”
2005-06
KIM, Henry, Soyoung KIM and Yunjong WANG; “International
Capital Flows and Boom-Bust Cycles in the Asia Pacific Region.”
2005-07
KIM, Henry, Soyoung KIM and Yunjong WANG; “Fear of
Floating in East Asia.”
2005-08
SCHMIDHEINY, Kurt; “How Fiscal Decentralization Flattens
Progressive Taxes.”
2005-09
SCHMIDHEINY, Kurt; “Segregation from Local Income Taxation
When Households Differ in Both Preferences and Incomes.”
2005-10
DURLAUF, Steven N., Andros KOURTELLOS, and Chih Ming
TAN; “How Robust Are the Linkages between Religiosity and
Economic Growth?”
2005-11
KEELY, Louise C. and Chih Ming TAN; “Understanding
Preferences For Income Redistribution.”
2005-12
TAN, Chih Ming; “No One True Path: Uncovering the Interplay
between Geography, Institutions, and Fractionalization in
Economic Development.”
2005-13
IOANNIDES, Yannis and Esteban ROSSI-HANSBERG; “Urban
Growth.”
2005-14
PATERSON, Robert W. and Jeffrey E. ZABEL; “The Effects of
Critical Habitat Designation on Housing Supply: An Analysis of
California Housing Construction Activity.”
2005-15
KEELY, Louise C. and Chih Ming TAN; “Understanding
Divergent Views on Redistribution Policy in the United States.”
2005-16
DOWNES, Tom and Shane GREENSTEIN; “Understanding Why
Universal Service Obligations May Be Unnecessary: The Private
Development of Local Internet Access Markets.”
2005-17
CALVO-ARMENGOL, Antoni and Yannis M. IOANNIDES;
“Social Networks in Labor Markets.”
2005-18
IOANNIDES, Yannis M.; “Random Graphs and Social Networks:
An Economics Perspective.”
2005-19
METCALF, Gilbert E.; “Tax Reform and Environmental
Taxation.”
2005-20
DURLAUF, Steven N., Andros KOURTELLOS, and Chih Ming
TAN; “Empirics of Growth and Development.”
2005-21
IOANNIDES, Yannis M. and Adriaan R. SOETEVENT; “Social
Networking and Individual Outcomes Beyond the Mean Field
Case.”
2005-22
CHISHOLM, Darlene and George NORMAN; “When to Exit a
Product: Evidence from the U.S. Motion-Pictures Exhibition
Market.”
2005-23
CHISHOLM, Darlene C., Margaret S. McMILLAN and George
NORMAN; “Product Differentiation and Film Programming
Choice: Do First-Run Movie Theatres Show the Same Films?”
2005-24
METCALF, Gilbert E. and Jongsang PARK; “A Comment on the
Role of Prices for Excludable Public Goods.”