Optimal Capital Structure and Project Financing : Salman Shah

JOURNAL
OF ECONOMIC
THEORY
42, 209-243
(1987)
Optimal
Capital
Structure
SALMAN
and Project
SHAH
Financing*
Faculty of Management Studies, Unicersiry 246 Bloor St. West, Toronto, Ontario MSS
of Toronto, 1 V4, Canada
AND
ANJAN School
V. THAKOR
of Business, Indiana University. Bloomington, Indiana 47405 August 1, 1984; revised May 5, 1986
Received
We examine the financing and incorporation modes for new projects. There are two objectives. The first is to provide a theory of optimal capital structure that links risk, leverage, and value and is particularly applicable to large firms. Counter to conventional wisdom, we show riskier firms acquire more debt, pay higher interest rates, and have higher values in equilibrium. Second, we provide an economic rationale for project financing which entails organizing a new project legally distinct from the ftrms other assets. We explain why project tinancing involves higher leverage than conventional financing and why highly risky assets are projectClassitication Numbers: 022, 521. financed. Journal qf Economic Literature
( 1987 Academtc Press. Inc
I. INTRODUCTION
This paper is concerned with how an investment should be financed and whether it should be incorporated separately or within the firm. Our specific objectives are twofold. The first is to develop a theory of optimal capital structure, and the second is to show how the theory can explain a rapidly growing institutional practice, namely project financing. Notwithstanding the seminal Modigliani and Miller [23, 241 theorem, a firms capital structure appears to be a key financial decision that is not yet completely understood. One strand of the literature, exemplified by Bulow
* Without implicating them, Shee Chart, Rob Heinkel, Dan helpful comments. Revisions of department at the J. L. Kellogg Shah was at Indiana University we would like to thank David Besanko, David Brown, YukSiegel, an anonymous referee, and an associate editor for this paper were made while Thakor was visiting the tinance Graduate School of Management, Northwestern University. during the early phase of the research on this paper.
209
0022-0531/87 $3.00
Copyright IiJ 1987 by Acedemlc Press, Inc. All rights of reproduction ,n any lorm reserved
210
SHAH
AND
THAKOR
and Shoven [3], Kim [1.5], and Scott [33], suggests that an interior optimal capital structure results from the tradeoff between the tax advantages and exogenous bankruptcy costs of leverage. More recent papers within this group have demonstrated optimal capital structure without relying on exogenous leverage-related costs by suggesting that the presence of firm-specific tax shields unrelated to leverage (DeAngelo and Masulis [8] and Kim [16]) increases the measure of the set of states characterized by debt-tax shield redundance. They show that this induces a concavity with respect to leverage in the firms after-tax expected profit and thus results in an interior optimal capital structure. Another strand, pioneered by Ross [30], and Leland and Pyle [lS], and subsequently developed by Heinkel [ 121, and Lee, Thakor and Vora [ 173, assumes asymmetric information between managers and investors and shows that a firms capital structure may be important because it acts as a signal of an a priori unknown parameter of the probability distribution of its future cash flows. In related papers, Jensen and Meckling [ 141, Hellwig [13], and Chaney and Thakor [4] propose capital structure relevance under moral hazard. Recently, Darrough and Stoughton [S] have integrated moral hazard and adverse selection considerations in a capital structure model. While these papers and others highlight the multifaceted role of leverage, they do not explain why there are significant leverage differences between conventionally financed ventures (those incorporated as integral components of the sponsoring firms) and project-financed ventures (those incorporated as legally segregated entities), and why project financing has grown so rapidly in the corporate taxation era. Thus, to build a theory of project financing we need an alternative model of optimal capital structure. In this paper we develop such a model, based on corporate taxes and asymmetric information. We demonstrate that, in raising the required investment outlay for a given project, each firm will use a unique (interior) mix of debt and equity-despite corporate taxation and the absence of bankruptcy costs-and show that there are some projects whose values are maximized with separate incorporation; moreover, our model predicts systematic leverage differences between such projects and those which are conventionally financed. In contrast to many existing papers, our theory does not rely on risk aversion, exogenously given signaling cost structures, bankruptcy costs (or any other exogenous costs associated with debt), or
I Actually, this induced concavity in DeAngelo and Masulis [S] can be obtained even without nondebt related tax shields. See Talmor, Haugen, and Barnea 1351. A commonly stated reason for developing capital structure models without taxes is that the tax deductibility of corporate interest payments is a post-1913 development. while interior capital structures were observed even prior to 1913. However, we are not concerned with this justification for excluding taxes because project financing is a relatively recent development and taxes, we think, are an important factor in its growing popularity.
CAPITAL
STRUCTURE
211
nonpecuniary moral hazard in loan contracts. (Specific distinctions between our research and the existing literature are outlined in Sect. III.) The reliance is on asymmetric information, which is in the spirit of Arrows [l] conjecture about the importance of informational imperfections in understanding financing decisions in a securities market without Arrow-Debreu-type contingent dealings. Our principal finding on capital structure differs from traditional wisdom. We show that in equilibrium riskier firms choose higher debt levels and pay higher interest rates. Moreover, riskier firms also have higher equilibrium values. (The intuition is provided in the discussion following Theorem 3 in Sect. III.) These results, which play a pivotal role in our examination of project financing, are of interest in their own right because they provide a possible explanation for documented differences in leverage ratios across industries. As Miller [22] notes, these differences have yet to be well understood,
On the other hand, many questions clearly still remain to be answered. What about cross-sectional variations in debt ratios. for example-a subject on which surprisingly little work has yet been done?
Our study of project financing is motivated by the scant attention paid to it in the literature.3 Yet, over the past decade, this has become an increasingly important financing tool for marshalling capital resources for a wide array of natural resource projects both in the United States and abroad. Project financing is currently a multibillion dollar activity involving the construction of docks for supertankers, supermarkets, massive iron ore pellet processing facilities, drilling rigs for deep sea oil and gas exploration, power generation facilities, etc. Project financing is defined as an arrangement whereby a sponsor or group of sponsors incorporates a project as a legally separate entity, with project cash flows kept segregated for financing purposes from its sponsors, thereby permitting an appraisal independent of any direct support from the participants themselves. Project financing usually involves the sponsors providing equity and management for the project and issuing debt that is nonrecourse to the sponsors. That is, creditors must rely exclusively on the ability of the project for repayment of project-related debt obligations. Thus, from the creditors perspective, a key difference between conventional financing and project financing is as follows. If conventional financing is used and the project is organized as a component of the firm, the project cash flows are mingled with the firms other cash flows. The lenders evaluation of the project is then based on the post-project-adoption
A The sole exception is an excellent practitioner-oriented tures that the ultimate economic rationale for project imperfections. paper by Mao 1201 which conjecfinancing may lie in informational
212
SHAH
AND
THAKOR
creditworthiness of the total firm-and, of course, the seniority of the project lenders claims in case of default-rather than the cash flow distribution of the project per se. On the other hand, if project financing is resorted to, the creditors evaluation of the project is linked solely to the projects own cash flow generation capability. Two project financing-related issues appear to deserve attention. First, we would like an economic rationale for project financing. And second, we would like the theory to explain two stylized facts:4 (i) project financing involves higher leverage ratios than conventional financing, and (ii) many investments utilizing project financing appear to be highly risky. Our theory sheds light on the above stylized facts. We show that, for a variety of projects, the method of incorporation of the project affects both its leverage and economic value; project financing enhances the values of some of these projects by permitting higher optimal leverage than with conventional financing. The conditions that guarantee this constitute sufficiency conditions for the strict Pareto dominance of project financing and supply the first economic rationale for the practice.5 Our research is related to the work of Myerson [26] and others in the theory of resource allocation mechanisms. As in that work, we begin with a description of the economic environment at a primitive level by specifying agents preferences, endowments, exchange technology, and the initial information structure. Then, assuming that economic agents are limited to debt and equity contracts, but abstaining from any fixed resource allocation scheme, we examine a broad class of schemes to discover the constraints on resource allocation implied by asymmetric information. Properties of the Pareto optimal outcomes are subsequently derived by solving the usual programming problem. An interior optimal capital structure as well as an economic rationale for project financing emerge endogenously as Pareto optimal resource allocation outcomes. Our results can be seen as buttressing Townsends [36] idea that financial structures-viewed there as all payment devices-are communication systems. The paper is organized as follows. Section II contains a description of the economy and a development of the symmetric information equilibrium. In
4 See McGuinty [21] and Nevitt 1271 for discussions of these stylized facts. 5 Casual explanations for project financing can be found in the popular literature (see, e.g., the Wall Street Journal). These usually revolve around the conjecture that firms like project financing because it allows them to obtain off-balance sheet debt and thus be relatively highly leveraged while maintaining apparently conservative financial statements and financial ratios that do not run afoul of existing loan covenants. This is, however, an unsatisfactory explanation-ven in a world with informational limitations-because it requires lenders to systematically ignore highly visible off-balance sheet investments by their borrowers. This criticism is further underscored by the fact that most loan covenants these days impose explicit restrictions even on activities that do not completely appear on borrowers balance sheets, such as project financing.
CAPITALSTRUCTURE
213
Section III it is assumed that project sponsors have information about their projects payoff distributions that outsiders do not have. Adopting Rileys [29] reactive equilibrium concept, we characterize the asymmetric information capital structure equilibrium. The model is extended in Section IV to provide an economic justification for project financing. Section V contains all the formal proofs.
II. DESCRIPTION
OF THE ECONOMY
The economy consists of two types of economic agents (project sponsors and creditors), two points in time, and a single commodity. All agents are risk neutral. A firm is defined as a collection of project sponsors. There are many firms in the economy and each has the option to access a production process that is an indivisible point-input, point-output project requiring an investment of 1 units of the commodity at the beginning of the period (the first point-in-time) and yielding a random gross return of R at the end of the period (the second point-in-time). Thus, a project sponsor can either consume its endowment at the beginning of the period or invest it for later consumption. Similarly, a creditor can consume its endowment now or lend it to a firm to obtain a claim to a risky, end-of-period payoff, The marginal rate of substitution (the number of end-of-period units of the commodity that must be offered in exchange for one unit of the commodity now) is p. We assume that all agents have the option to forego a unit of the commodity at the beginnning of the period in exchange for a riskless claim to p units of the commodity at the end of the period. Let iw, E [0, co]. We assume R takes its values in Iw+ . The randomness in R is described by a probability measure, F(R, 0), defined on a sigmaalgebra of the subsets of Iw,, where 8 is a parameter that indexes the riskiness of R. The value of B varies across firms and we normalize it to take values in a compact subset, [e,, @J, of [O, 11. We also assume that there exists a density function, f(R, 0), associated with F(R, 0). Thus, the project to which a firm has access varies from one firm to the next, and this cross-sectional variation is described by the continuum [S, , 0, J. Increasing values of B imply increasing risk in the Rothschild and Stiglitz [32] sense. That is,
i' R/(R,a,dR=J^ Rf(R,O)dR=R Rt R+

and for 0<gand x>O,
s0
(1)
XF(R,&dR2j-*F(R,8)dR
0
(2)
214
SHAH
AND
THAKOR
with strict inequality for I> 0 and 8 and t? in the interior of [0,, @,I. Thus, firms have projects with the same mean return, K, but different risks. Henceforth, for any two density functionsf(R, 0) andf(R, 8) satisfying (1) and (2) we shall use the notation f(R, g) @ f(R, (3) to indicate that f(R, &I) is riskier than (but has the same mean as) f(R, 0) in the above sense. For now we assume 8 is common knowledge. In this framework, project sponsors are shareholders and creditors are bondholders. (We temporarily assume that the firm has no risky endowment other than the option to invest in the risky project.) Let c(1 be the amount of initial investment a firm finances with debt, and [l -a] I the amount it finances with the shareholders equity. In exchange for every unit of the commodity the creditors lend at the beginning of the period, they require a payment of K units at the end of the period. We shall assume that each unit of the commodity is worth $1, so that the single commodity can be called (consumable) cash. A central taxing authority levies a wealth tax at the rate t on the end-of-period cash flow accruing to the shareholders, and creditors operate in a competitive environment. Denote by P the total expected end-of-period after-tax cash flow generated by the project, by B the expected end-of-period cash flow accruing to the creditors, and by l? the expected end-of-period cash flow accruing to the shareholders. Discounted back to the present, these variables would represent the total value of the project, the value of debt, and the value of equity, respectively. It is notationally convenient to partition R, as follows,
Z-s (R(R<aK, RE[W+)
and
~E(RIR>~~K,REF%+~.
Thus.
p=Zj.
F
Rf(R,O)dR+j
P
Z[R-crK][l-z]f(R,O)dR (3) e)dR, f3)dR. (4) (5)
+IP IcrKf (R, 6) dR,

B=ZR-Z Z?=j IP [R-aKlf(R, -z]f(R, Ff Z[R-aK][l
Note
p= B+l?. (6)
CAPITALSTRUCTURE
215
The net expected end-of-period therefore, is E?=P-[l-a]@ =

!r
cash flow accruing to the shareholders,
Z[R-aK][l
-z]f(R,
O)dR-
[l-a]
Ip.
(7)
The prqject sponsors choose 01to maximize (7) and then K is determined competitively to yield the creditors an expected return of p. That is,
B = aIp.
(8)
Combining
(4) and (8) gives us sr

Z[R-aK]f(R,O)dR=IR-a/p
(9)
and substituting
(9) in (7) produces P=Z{[l-T]

R-p).+dpt,
(10)
which is maximized mation equilibrium leverage proposition
at a = 1. Thus, not surprisingly, our perfect inforreplicates the Modigliani and Miller [24] extreme in the presence of corporate taxes.
III.
CAPITAL
STRUCTURE
EQUILIBRIUM
UNDER ASYMMETRIC
INFORMATION
Suppose now that each firm knows its own 0, but creditors know only the cross-sectional dispersion of probability distributions with the same expected return R. That is, the project sponsors within the firm know the i? and 0 for their project while creditors (and other firms) know i? but are a priori uninformed about 0. Let g(U) denote the cross-sectional probability density function over 0. We assume g(8) is common knowledge and that its support is [O,, 8?]. To simplify the analysis, suppose S( R, 0) is decomposable as follows
f(R 0) =f(R) + W-(R) -.fW)l,
(11)
where f(R) @f(R) and F(R) and P(R) are the associated cumulative distribution functions. We now establish that this decomposition preserves the property that increasing 0 implies increasing risk.
216
SHAHANDTHAKOR
THEOREM 1. Letf(R, 0) be described by (11) with 0~ [0,, 0,] c [0, 11. Then,for o>O with 0, QE [e,, 0,], we havef(R) @>(R, 0) @f(R,8) @
f(R).
From now on, f (R, 8) will be assumed to be of the form specified in ( 11). It is easy to see that, if there were no leverage, firms would be indifferent to their 8s because project sponsors are risk neutral. This, of course, is no longer true when debt is introduced, and creditors themselves care about the 8 of the project they help to finance. This observation is formalized in the lemma below.
LEMMA 1. For any pair (a, K) E [0, l] x [w+ , the expected cash j7ow accruing to the creditor declines monotonically as 13increases.
Thus, market participants have an incentive to develop information transmission channels to prevent market failure. One solution is costly information production. For the present, however, we assume it is prohibitively costly to do so and look for other alternatives. The subsequent analysis of optimal capital structure is tailored to enable examination of project financing. It is for this reason that we constrain the firm to raise capital no greater than the required investment. The major concern in project financing is the structuring of a financial package designed to procure the funds necessary to get the project on line; the issue is not to decide whether to sell the entire project to outsiders. In fact, retention of some interest in the project by the initial sponsors is usually essential for access to the project. Thus, our focus is on analyzing how the required investment is composed of debt and equity. We now present some notation needed to define equilibrium. We consider an atomless economy in which lenders compete on the basis of contracts. A capital structure contract, w, is defined as a pair (a, K) = o E [0, 1 ] x R + When a firm selects such a pair, it binds itself to financing a fraction cxof its initial investment with debt and paying an interest factor of K. The contract a firm chooses is costlessly observable, which precludes secret leverage. Let ~~(0) be the capital structure contract designed by creditor i for a firm reporting 0. The set therefore describes the strategic policy of creditor i. Let N = { l,..., n} denote the set of all possible (competing) creditors (the counting measure of N could be infinity) and let t,(Q, ,..., Q,) be the net expected profit of creditor i when the vector of strategic policies being offered is (Q, ,..., 52,). A creditors net expected profit is the expected cash flow accruing to it from its lending less the oportunity cost of not being able to invest the same amount at the riskless rate. We can now formally define equilibrium.
CAPITAL STRUCTURE
217
DEFINITION. A reactive capital structure equilibrium (RCSE) is a set of feasible strategic policies, Q* = (52: ,..., a,*), for the n creditors if for any in N and any feasible strategic policy sZi such that
t,ta:,
I.., n,,..., Q,*) > 5,(Q2*)
3 another creditor je N and another feasible strategic policy Q, such that (i) (ii 1 (iii) (iv) t,(QT, .... Q,,..., Q,*) d t,(Q*h <,(QF ,..., Ri )...) Qi ,..., a;, > t,(n: )..., Qi )...) sz: ,...) Q,T), ti(Q:,..., Q; ,..., a,, -.., SZZ) < 5;(Q*), Vm E N, m # i.j, and all feasible Q,,, &(Q:: ,..., Q ,,...Q,, .... Q,, ...I Q,T) 3 i,(s2: )...) ni )...) i-2: . ...( n; )...) ii?,*,.
This definition of the RCSE is due to Riley [29]. Rileys motivation for developing it came from the (by now well-known) problem with the existence of pure strategy Nash equilibria with a continuum of types.h To characterize the properties of the RCSE, we now examine the design of capital structure contracts. We can view each creditor as publicly announcing its strategic policy in the form of a complete menu of capital structure contracts, with firms allowed to select their own contracts. Alternatively, following the revelation principle developed in the literature on resource allocation mechanisms (see, e.g., Myerson [26]), we can think of every creditor asking each firm that approaches it to directly report its true Q to the creditor, and then awarding the firm an o(O) contingent on that report. Thus, a capital structure contract is a mapping w: [O,,O,]-[O, l]xK!+.
For a firm that reports t? when its true attribute is 8, let E(t?l6) be the difference between the net expected end-of-period cash flow accruing to the shareholders if there is leverage and the net expected end-of-period cash flow accruing to the shareholders if there is no leverage. Similarly, define B((?lO) as the expected end-of-period cash flow accruing to the creditors when a firm with a true attribute 0 reports 0. When there is truthful reporting, we set E(O) 0) = E(B) and B(#J 8) = B(8). In what follows, we characterize the properties of feasible capital structure allocations before proving the existence of the RCSE. The next result forms the basis of our analysis.
6 Dasgupta and Maskin [6, 71 have recently shown that the existence of mixed strategy equilibria can usually be proved in market games in which pure strategy Nash equilibria fail to exist.
218
SHAHANDTHAKOR
LEMMA 2. Assume the RCSE exists. Then, the RCSE contracts are a solution to the following constrained maximization program
maximize subject to
i 01
E(O)g(8)d6
(12)
E(fqe)=j
r(e)
Z[R - CC(~) K(@][
1 - r]f(R,
0) dR
(13) (14)
we) = 40) ZP,
ve E ce, , e,i, veE w,, e,i,
(15) (16) (17) (18)
we) 2 0, 06+w 1, zwe)3E(Qe)
t/k [e,, 61,

ve, & [e,, e,].
Thus, this lemma says that creditors compete on the basis of the capital structure contracts they offer, and competition induces each creditor to design its offer set to maximize the cross-sectionally weighted expected utilities of its borrowers. The constraints are straightforward; (13) and (14) are definitional constraints, (15) is a competitive consistency condition which says that the creditor must earn zero net expected profit on every borrowing firm in equilibrium, (16) is an individual rationality constraint (no firm will accept a capital structure contract that yields it lower welfare than using all equity), (17) says that the leverage assumed by a firm cannot be negative and it cannot exceed the initial financing required, and finally, (18) is an incentive compatibility constraint which guarantees that all feasible capital structure contracts induce truth-telling. The next lemma shows that this maximization problem can be further simplified.
LEMMA 3. Assuming the RCSE exists, the RCSE contracts are a solution to the following constrained maximization program
maximize
* s81E(e) g(e) de,
(12)
As is always the case with reactive equilibria, the RCSE contracts here are independent of g(0). That is, despite the fact that the reactive equilibrium is the Pareto dominating member of the set of informationally consistent equilibria-as Lemma 2 claims-the allocations under the reactive equilibrium do not depend on the welfare weights used.
CAPITALSTRUCTURE
219
subject to
E(B) = Ia(B) pz,

Z[R - L(O)]f(R, sl-(O) 0) dR + IL(e) = a(@ Zp,
(19)
(20) (21) (22)
L(B) E a(8) K(B), E(6) a 0, 06a(@)6 1,

E(8)= [1 -r][
F(B)
(23)
dR,
Z[R-L(8)]{f(R)-f(R)}
for almost every (a.e.) 8 E [Q,, t!I,], E(8) 3 0, E(8) 2 0, A sequence of lemmata RCSE.
LEMMA
(24) (240
wherever E(B) exists.
are next presented to examine the nature of the
4. For a fixed c(, the value of K that satisfies (20) increases monotonically as e increases.
LEMMA 5. For a fixed 8, the value of K that satisfies (20) increases monotonically as a increases.
Our next result shows that, for any feasible allocation, almost every type of firm gets a level of debt financing that falls short of its first best level. A feasible allocation is defined as one that satisfies the informational consistency requirements (24) and (24) as well as the other constraints in Lemma 3.
THEOREM 2. For any feasible capital structure schedule, all firms with 8 E [t?,, t&) that obtain nonzero leverage under that schedule have interior capital structures.
Thus, if one thinks of a firms capital structure as a signal of its true risk exists-is attribute, the signaling equilibrium-if an equilibrium dissipative. There is a loss in tax shield, the value of which does not accrue to the creditors. This occurs for all firms, except possibly the riskiest. Whether the riskiest firms obtain their first best level of financing depends on some parametric restrictions, as we show later. The principal message of this theorem, however, is that almost all firm types that are able to borrow
s The existence of an equilibrium sipative only relative to the private is proved sector. in Theorem 5. Clearly, the equilibrium is dis-
220
SHAH
AND
THAKOR
have interior optimal capital structures. 9 This contrasts sharply with the conventional notion that firms should be all-debt financed when there is corporate taxation, personal taxes are inconsequential to the corporate capital structure decision, and leverage entails neither exogenously stipulated deadweight losses such as bankruptcy costs nor any endogenous tradeoff of the DeAngelo and Masulis [ 81 type. lo Given next is a description of the dependence of a firms leverage, debt interest rate, equity value, and total asset value on its risk attribute.
THEOREM 3. For any feasible capital structure schedule, debt financing available and the interest rate paid are functions of 0. Moreover, the firms debt repayment obligation debt), the net (per share) value of its equity and the value of are also nondecreasing functions of 8.
the amount of nondecreasing (face value of its total assets
This is a strong and empirically testable prediction about the capital structure policies of firms and about the admissible relationships between leverage, value and risk. As in Ross [30], a cross-sectional regression of leverage and firm values will, according to this theory, have a positive slope. But the causality here is reversed. Unlike Ross the feasible allocation does not involve the higher valued firms using more leverage. Rather, firms that use higher leverages have higher values! Moreover, Rosss model is nondissipative whereas ours is dissipative, and the underlying resource allocation scheme that generates the informational equilibrium there is exogenous-Ross takes the linearity restriction on managerial incentive contracts as given-whereas it is endogenous in our model. Our modeling and results also differ from Leland and Pyles [ 183. In their model, riskier firms choose lower optimal debt levels (even without bankruptcy costs) and the project sponsors (entrepreneurs) in these firms enjoy lower expected utilities. * The Leland and Pyle model, in contrast to
As we show later, incentive compatibility may dictate that a subset of [0,, 0,] should be denied credit. Hence, the interior capital structure finding applies only to these firms in [0,, @,I that obtain credit. We do prove later, though, that the Lebesgue measure of firm types that obtain credit in equilibrium is strictly positive. lo Such a tradeoff is obviously absent in our model or else we would have encountered an interior optimum with perfect information. There is also a side payments issue in Ross [30] that is absent here. Because truthful reporting in that model is predicated upon a managerial incentive contract, managers-whose wealths could generally be expected to be small relative to the values of the firms they manage<ould potentially be offered side payments by the shareholders to induce them to emit false signals. * Actually, Proposition III in Leland and Pyle [IS] states that an increase in risk increases the entrepreneurs expected utility. However, it can be shown (see Diamond 191) that this proposition is erroneous. Fortunately, it does not affect the crux of the Leland and Pyle analysis.
CAPITAL
STRUCTURE
221
ours, depends critically on the assumption that project sponsors are risk averse and thus requires knowledge of sponsors preferences in designing an informationally consistent equilibrium valuation schedule. I3 Further, the debt in that model is riskless and the parameter that is asymmetrically known is the mean project return. In our model, debt is risky and the mean return is common knowledge. Because of its assumptions, the Leland and Pyle model seems suited for relatively small, closely held (manager controlled) firms with low leverage ratios. By contrast, our model is particularly applicable to large firms in which entrepreneurial risk preferences are relatively unimportant. Under the maintained assumption of universal risk neutrality, our findings apply to firms of any size. The Leland and Pyle conclusion that optimal leverage should be a decreasing function of risk can be found elsewhere too, particularly in the literature that assigns a central role to bankruptcy costs to explain capital structure (see Scott [33] and Kim Cl.51 for the effects of bankruptcy costs on the capital structure decision). The essence of the argument is that, since expected backruptcy costs increase with the firms riskiness and its marginal tax rate is invariant to risk, an optimal tradeoff between the bankruptcy costs and tax advantages of debt would imply an inverse association between leverage and risk. For example, Lintner [19] writes
For all such companies, the optimal debt level will vary strongly and inversely with assessments of the companys unlevered bankruptcy risk-one measure of business as distinct from financial risk assessed at any amount of debt commitment).
The inverse relationship between leverage and risk is also implicit in Stiglitzs [34] analysis. In Stiglitzs model, the optimal amount of debt to issue is a level such that any further increment would make the debt risky, i.e., every firm issues only riskless debt. Since a higher level of business risk in the firms assets implies a lower maximum level of riskless debt, Stiglitzs prediction is similar to Lintners [ 191. In view of this, Theorem 3 seems surprising. Note also that our finding does not depend on arbitrageimpeding transactions costs as, for example, in Baumol and Malkiel [2] or moral hazard as in Grossman and Hart [lo] and Hellwig [ 131. It does, of course, depend on the absence of bankruptcy costs, but then their importance in capital structure theory has been challenged on both theoretical and empirical grounds. l4
One may argue that it is more likely that there will be an informational asymmetry about project sponsors preferences-if sponsors are indeed risk averse-than about the attributes of the projects they seek funding for. I4 See Haugen and Senbet [ 111 and Warner [37]. The only other paper we know of that also finds that riskier lirms use more debt and have higher values is Heinkel [ 121. Unlike the endogeneity of our result, however, Heinkels result stems from an exogenous restriction on the cross-sectional correlation between risk and value. Also related to our work is the paper
642:42,2-i
222
SHAH
AND
THAKOR
A paper related to ours, although it assumes symmetric information, is DeAngelo and Masulis [S]. It also relies only on the tax code-without appealing to exogenous debt-related costs-to show an optimal capital structure. Unlike our work, however, that paper does not predict any direct relationship between leverage and risk. Rather, it predicts that firms with larger nondebt related tax shields will employ lesser leverage. The intuition behind our main result is as follows. High risk firms have an incentive to understate their os because doing so results in lower interest rates. To offset this misrepresentation proclivity, they are offered higher levels of debt financing as a reward for reporting higher 8s. Low risk firms, on the other hand, are induced not to perversely mimic the high risk firms by being offered low interest rates as a reward for forgoing high leverage levels. The riskier firm, with a greater likelihood of extreme realizations, has a larger set of states in which its income is very high and thus a relatively high tax liability ceteris paribus. Consequently, the high leverage-high interest rate combination is appealing for such a firm because it permits greater sheltering of its relatively more likely large income. On the other hand, the less risky firm attaches a smaller probability to the occurrence of very high income states and thus prefers a lower interest rate to a larger tax shelter. These disparate preferences sustain a perfectly separating equilibrium in which a distinct capital structure contract is optimally chosen for each 8. We have thus far discussed feasible capital structure allocations without having proved the existence of equilibrium. Doing so is the next main task. En route, however, we shall establish a number of intermediate results for which additional notation is necessary. Define L E [w+ as the first crossover point of the cumulative distribution functions associated with f(R) and f(R). That is, t satisfies oL/.(R) dR = jLfo(R)
0
dR
by Myers and Majluf [25] which touches briefly upon how a firm may wish to issue debt and equity simultaneously when there is asymmetric information. However, their analysis is aimed at establishing a hierarchial ordering for tinancing methods and most of it seems to indicate a strict dominance for debt. Recently, Ross [31] has developed a symmetric information model with taxes and uncertainty to explore the relationship between capital structure and risk. He finds that, in a cross section of firms with identical total variance of earnings, those with higher (positive) cash flow betas will have lower debt levels. In a recent paper, Talmor, Haugen, and Barnea [35] show that the DeAngelo and Masulis result is an outcome of their assumption that debt principal is tax deductible. Assuming only interest is tax deductible, they show that the firms after-tax pro& function is usually conrex with respect to leverage, ruling out interior optima.
CAPITAL STRUCTURE
223
and [)/1(R)
0
dR > [f(R)
0
dR,
vy E (0, LJ).
Clearly, (0, 1) has strictly positive Lebesgue measure on [w+ . If this were not true, there would exist an F >O small enough such that (2) would be violated with E = x. By Theorem 3 we know that the riskiest project gets the maximum leverage. Suppose ~(0~) = 1. Then define L, as the value of L(8,) that satisfies (20) with a(0,) = 1. That is,
Ln i I[R
0
- L,,]f(R,
02) dR + IL,,, = Zp.
(25)
Let K,(Q2) satisfy (25) so that L,= of c[ satisfying L s0
1. K,,,(8,). Next define Cras the value
I[ R - t] f( R, 6J dR + IT = Zip.
(26)
It is clear that Cc 0. > We can now state a simple but useful fact.
LEMMA
6. For any feasible capital structure schedule, L(N 6 L, V9E [O,, CL].
Our next result is concerned with the maximum admissible debt face value and an identification of the condition under which the riskiest project is granted its first best leverage. Notationally, A designates the min operator.
THEOREM 4. If the set of informationally consistent schedules is nonempty, then the Pareto dominant member of that set satisfies (stars denote Pareto dominance) L*(%,) = L, A E. If L, < E, then cx*(O,) = 1.
We now establish the main existence result. The basic idea in the proof is to show that the set of informationally consistent schedules is nonempty.
THEOREM
5.
The RCSE exists and is unique.
In the context of the Spencian signaling model, a necessary requirement fo,r informational consistency-assuming smoothness-is that JE( 6 IO)/ 30 I H= 4 = 0, and the sufficiency condition is that 8*E($l0)/# Id= t) d 0. If
224
SHAH AND THAKOR
these two conditions are met, then a firms equity value is maximized when it signals truthfully. It is easy to verify that analogous conditions hold in our model. We now summarize the properties of the RCSE. Stars are used to designate RCSE values, and p(. ) denotes the Lebesgue measure on Iw+ . For some 0~ 0,, define O* G [P, 19,] n [0,, 0,].
THEOREM
6. In the RCSE,
cr*(B)>O, a*(0)>,0, VIE [0,,0,], and 30*~[0,,8,] such
that
I ()
=o, @*(@I 1 E (03@*(w),
v/8$0* VoEO* n [O,, &),
cr*(&)=
[l A c?] E (0, 1-J
p(o*) > 0; (ii) (iii) t*(0) > 0, V/8E [0,, 0,]; K*(o)>O,ME [IO,, e,].
Thus, whenever incentive compatibility permits it and the exclusionary risk parameter, 9, equals 8,, firms of all types are granted positive leverage. It is possible for some firms with 9 E [0,, 0,) to be denied leverage, but the theorem asserts that not alf firm types are precluded from borrowing. The theorem also says that the riskiest firm obtains strictly higher leverage than any other firm a lower risk parameter. This is a stronger statement than Theorem 3 which only claims that ~(0) is nondecreasing. I6 We conclude this section with comments on four points. First, we have defined a capital structure contract as a combination of interest rate and leverage. We could have introduced a third instrument, namely the probability that debt would be made available to the firm. However, we have verified that the equilibrium value of this probability is always either zero or one. Thus, there is little to gain by formally modeling this probability, since a zero probability is equivalent to ~$0) = 0. Second, we have modeled a firm as a collection of shareholders. It is important to note, therefore, that shareholder unanimity obtains. The reason is that in equilibrium each firms net equity value is maximized subject to informational constraints and this is synonymous with the maximization of each shareholders personal welfare. Intuitively, unanimity is the consequence of each firms initial shareholders being homogeneous with respect to their preferences and information structures.
Although we show in the proof of Theorem 2 that u*(8) -c a*(fl,), VfJ< 0,. that finding is based on the assun~prion that at least some firm types are granted positive leverage.
CAPITALSTRUCTURE
225
Third, although we have restricted the firm to raising no more external capital than the required investment outlay, lifting that restriction would be easy. As long as the sponsors retain some-no matter how small-residual interest the capital structure equilibrium will be qualitatively unaltered even if external capital in excess of the required outlay can be raised. Finally, we have not formally dealt with personal taxes. The implicit assumption has been that personal income from stocks is taxed at the same rate as personal income from bonds, so that personal taxes are allocationally neutral. Thus, we differ from Miller [22] who assumes income from stocks is tax-exempt whereas income from corporate bonds is taxed. Ix
IV. A RATIONALE
FOR PROJECT FINANCING
In the previous section we assumed that creditors could not profitably produce costly information about the return distributions of firms, or equivalently that designing a revelation game was the Pareto dominant alternative. Suppose now that creditors can feasibly produce information about a project at a cost Q(p), which is a positive, real valued function of a vector, p, of various publicly observable factors related to the project, such as its size, its technical complexity, etc. We assume that information
This condition rules out an (absurd) equilibrium in which the debt face value is set at infinity. That is, the entire project is sold to the bondholders and there is no residual equity claim. Because there is now only one class of claims, the risk parameter H is irrelevant given risk neutrality, and all projects will be priced contingent on the commonly known mean return. However, this equilibrium is practically absurd because, if the IRS were to allow such things, every firm would be all-equity financed and would simply relabel its equity as debt. No taxes would ever be paid and there would be no bankruptcies. From a theoretical standpoint too, this is unacceptable since it makes little sense to refer to a class of claims as debt unless there is at least a small residual equity claim. * Another important difference is that the debt in Millers [22] model is riskless whereas it is risky in ours. Risky debt affects Millers results because the bonds of companies in default will not, in general, yield the issuing stockholders their full tax shield. But even if one ignores this effect and assumes that personal income from stocks is taxed at a lower rate than personal income from bonds, our capital structure equilibrium under asymmetric information will not generally be the same as Millers, This is because the extreme leverage propensity on the part of tirms in Millers model makes it plausible to argue that eventually a suflicient number of low tax bracket investors will be exhausted by an ever increasing corporate debt supply to move p[ 1 - r] <p. to an equality, where p. is the riskless tax-exempt yield and p the riskless taxable yield. In our imperfect information model, however, every firm (except those with 0 = Oz) has an inrerior optimal capital structure. This implies an aggregate leverage constraint in our model that is absent in Millers Consequently, the aggregate supply of taxable corporate bonds may be small enough to ensure p[I -r] <p., and as long as that is true, our tindings in this section will be qualitatively unaffected.
226
SHAH
AND
THAKOR
production permits perfect ex ante identification of the unknown risk parameter. Consequently, first-best leverage follows information production. Although it is creditors who produce information, in equilibrium the cost of producing information will be borne by the firm itself because the price of credit will impound the information production cost. Thus, the firm now has a choice of playing a revelation game or having creditors produce information at a cost to itself. So far we have also assumed that the firm starts out as an empty shell in the absence of the new project. We now amend that assumption by assuming that the firm currently has J distinct divisions, i.e., it is a portfolio of J different projects. It has available a new project calling for an investment of I. If this project is conventionally financed and incorporated as an additional (J+ 1)th division within the firm, creditors who provide part of the financing must produce information about all the divisions in the firm. The reason is that the firm can comingle the cash flows of its various divisions, thereby rendering dubious the usefulness of credit terms based solely on the cash flow distribution of the new project. From the lenders perspective, another problem with multidivisional firms is that even significant alterations in the composition of the firms assets may be difficult to detect and prevent. Jensen and Meckling [14} argued that, despite bond covenants, firms could engage in asset substitution and change the riskiness of their underlying assets to the bondholders detriment subsequent to the issuance of the bonds. To capture this notion in a simple fashion, we assume that the firm has private information about a change in its return distribution that will occur after borrowing has taken place. This change will require no additional financing.) That is, the firm knows that some signal, 6 E { 6,) 6, ,..., 6 ,) z A, will arrive after it has financed the new project, and this signal will affect the firms return distribution. This signal is irrelevant to the project itself only if it is separately incorporated, because we assume it does not affect the projects 6 per se. The best way to think about A is to view it as the firms feasible investment opportunity set in addition to the project in question. The firm knows precisely which element from this set will become available to it to possibly substitute for some of its existing assets, whereas creditors can only assign a subjective prior density function over the set A. Alternatively, creditors can find out 6 by producing information about every 6j~ A, at a total cost of C6,Ed Q1(6i).1g In a perfect information world these problems would clearly be absent. Subsequent discussion shows that our theory predicts no advantage for project financing in this case. Project financing is advantageous whenever it
I9 It is not essential as Q( ) is increasing
to our theory
in the counting
that Q(. ) be additive. measure of A.
Subadditivity
would
suffice
as fang
CAPITAL
STRUCTURE
227
increases the projects net equity value; under perfect information, every project receives complete debt financing-and thus enjoys its maximum potential value--even when conventionally incorporated because perfect ex post monitoring by creditors can be used to ensure compliance with bond covenants that specify precisely how project cash flows should be utilized. The ensuing analysis is aimed at establishing conditions under which project financing is beneficial when there is imperfect information. The intuition behind the formal analysis is as follows. Whenever the net gains from information production are sufficiently high, project financing is beneficial because creditors incur lower screening costs in evaluating a separately incorporated project. On the other hand, when a revelation game is preferred to information production, project financing is value enhancing if it can attract higher leverage. From the previous section we know that this occurs when the post-project-adoption risk parameter of the firm-considered relevant by the creditors with conventional financing-.---is lower than the projects own risk parameter (the relevant parameter for project financing). Suppose the firms total investment (in the J divisions) currently is I,, and its total end-of-period payoff is IJRJ, a random variable. The firms total existing debt obligation (principal plus interest) is for an amount D,, payable at the end of the period. Define P,,r I,R,D, and let h,(P,, e,(S)) be the density function of P,. BJ is expressed as a function of 6 to indicate that the firms risk parameter depends also on the signal it knows for sure it is going to receive. Also, the density function h,( ., . ) is a mixture of a low risk density function and a high risk density function as in (11) with 0, as the risk parameter. Without the new project, the end-ofperiod expected cash flow accruing to the firms shareholders is
we,) =I 4; p,c 1- tl h,(P,, e,(s)) dP,,

where
A;= (P,] P,>,O, I,R,e [w, )..
(27
Now suppose the new project is conventionally financed and incorporated within the firm. If the firm avoids information production and decides to engage in a revelation game as in Section III, it will be asked to report 8,+ ,(S), the (post-signal) risk parameter for the density function of the random variable P J+ 1 E I,R, - D, + ZR, where Z is still the investment required in the new project and R is the return on the new project. In this case, the incremental expected net cash flow from the new project accruing
228
SHAH
AND
THAKOR
to the shareholders of a firm which has an overall (post-project-adoption) risk parameter 6,+ ,(S) and a project risk parameter 0 is
4e,.l(@J)=[A<
C~J+,-~(~,+,)l~,+,(~,+,,~,+,(~))~~,+, 1+1 - EJ(O,)- ZCl- a*te,+ ,)I P,
(28)
where
Note that @*GA+,(W, K*Ub+ ,(W) is the capital structure contract

awarded the firm in the RCSE when it reports e,+,(s). We assume, for simplicity, that the creditors who lend funds for the project have the lowest seniority among all of the firms creditors. We also assume that the density functions h,, ,(. , . ) can be cross-sectionally ordered by the @ relationship stipulated in (11). Now suppose the firm opts for conventional financing but decides to have costly information produced. Creditors will produce information about the new project as well as about all the J divisions and the 6;s at a total cost of Q(fl,+ 1) + CaiEd Q(si). Since information production reveals the necessary information to creditors ex ante, the firm will be given its first best leverage subsequent to information production. Now using the fact that, in a competitive equilibrium, information production costs must be borne by the firm itself, the incremental net expected end-of-period cash flow accruing to the shareholders from adopting the new project and having information produced is ~(e,+,(6),e)=zp~+~(e,+,(6))-Qo(p,+,)1 e%u 6,~A (29)
where t+b( is the incremental benefit to the firm as a whole from having .) information produced. Because information is produced about all the assets of the firm, its capital structure can now be rearranged to attain the first best leverage for the whole firm. $(. ) then measures the cash flow increment resulting from an increase in the firms leverage-for existing assets-from its previous level to the new first best level. We shall assume that J and the counting measure of A are sufficiently large so that -Qe,+ ,(a 8 >&e,+ ,(a e),
ve, 6.
(30)
To us this seems a sensible asshmption for large firms that have a rich variety of investment options available. For such firms we should expect that inducing revelation of a single relevant parameter through a self-selec-
CAPITAL
STRUCTURE
229
tion mechanism would be less costly than having creditors produce information not only about the firms existing assets but also about possible post-contract events that could affect the return distribution. Of course, this also presumes that the gain to the firm from an increase in leverage for its other assets is not large relative to information production costs, and implies that information production is never optimal with conventional financing. Now suppose the firm utilizes project financing and sets up the new project as a legally segregated entity. Creditors are no longer concerned about the attributes of the firms other assets. Only the projects risk parameter matters. If the firm participates in a revelation game in this case, the net end-of-period expected cash flow accruing to the shareholders from the new project will be E,(e) = h*(e) pt. (31)
If the firm decides in favor of information production, creditors need produce information only about the new project. The cost of doing so is Q(B). Because Q( . ) is assumed to be an increasing function of size as well as the number of divisions information is being produced about, Q(/I) < Q(fl,+ ,). The net end-of-period expected project cash flow accruing to the shareholders in this case is B,(e) = zpr - Q(p). (32)
Figure 1 is a graph of ES(e) and E,(0) as functions of 0. Both functions are sketched as broken lines. Tke full curve is the upper envelope of the two functions, E;(8) = E,(e) v E,(B), where v is the max operator. Condirional on project financing being the chosen mode of incorporation, E;(0) is the value of the net end-of-period expected cash flow accruing to the
RISK
PARAMETER
FIG.
1.
Equity
values
as functions
of the risk
parameter
for all possible
risk
parameter
values.
230
SHAH
AND
THAKOR
shareholders, and this value is a functon of 8. To exclude the uninteresting case in which E;(0) is constant for all 8, we assume E,(e,) = zcl*(ez) pz > zpz- QO(fl). This assumption is obviously dispensible when a*(C),) = 1. Now, a necessary and sufficient condition for project financing to be preferred to conventional financing by a given firm is that
E;(e) > &t,+ ,(S), 0)

for that firm. Define
(33)
where @(A ) is the set of all possible values of O,, I (6). Figure 2 is a graph of B,(S) and g,(O) with 8 restricted to take values in 0. Let
and
It is apparent that 0 ~ is a connected set and so is 0 +. and that B,(e) is convex and nondecreasing over 0. For a project with a risk parameter 8, let 8 be its pseudo risk, defined through the relationship E(O,+ ,(S), 0) = E,(O). We can now state the following result.
,---
@--
-,-0*-y
RISK PARAMETER
-9
FIG. 2. Equity values as functions which project financing is preferred.
of the risk parameter
for those risk parameter
values for
CAPITALSTRUCTURE
231
THEOREM 7. Whenever project financing is adopted for a project, the ,following observations hold.
(i) The project has a higher value than it would bcith conventional financing.
(ii)
The project has a higher leverage than it would with conventional
-financing. (iii) A project with a relatively low 6,E 0 is more likel)) to involve information production, whereas a product with a relatively high 0 E 0 is more likely to involve a revelation game.
(iv) The projrct gets higher leverage if information production is optimal than it would if information production is not optimal.
Moreover,aslongas~({~E~*~E,~(H)<~S(~)))>0,3auniqrreBEO*su~I~ that for all projects with pseudo risks 8 E [d, O,], conventional financing is Pareto dominant if 8 < 0 and project ,financing is Pareto dominant {f 0 3 0. Regardless qf their pseudo risks, hoM,ever, the riskiest projects (those with 0 = 0,) alwa)v utilize project financing. Finally, 3 a number r E [0, 1] such that, !f ~(0 + ) < r, a cross-sectional test will ,find that the mean leverage in investments utilizing preject financing is higher than the mean leverage in conventionally financed prqjects. This theorem has the following empirical connotations. In project financing ventures that employ very high leverage, one should find creditors quite involved in numerous phasesof the project since our theory predicts information production by creditors will accompany very high leverage. Moreover, such projects should also be less risky than those project-financed ventures that use somewhat lower leverage and appear to have lesser creditor involvement. The two strongest implications of the theorem are that the riskiest investments will always use project financing and that, at least for projects that have relatively high pseudo risks, a project whose risk exceeds its pseudo risk will be project financed. In combination these two implications suggestthe optimality of project financing for highly risky investments and provide a prediction that seemsconsistent with observed firm behavior.
? The contemporary theory of financial intermediation holds the view that costly information production is an essential feature of financial intermediaries; see, for example, Ramakrishnan and Thakor [28]. The theorem says nothing about projects with pseudo risks 0 E [H, . 81. For such projects, project financing is always optimal, sometimes with information production and sometimes with a revelation game. The important point, however, is that for at least a subset of projects one can assert that project financing is Pareto dominant for a project that is riskier-in a sense-when separately incorporated than when it is incorporated as a part of the firm.
232
SHAHANDTHAKOR
If information production is disallowed, matters become much simpler. The following result is immediate from the preceding analysis and is thus presented without proof.
COROLLARY. Suppose information production is prohibitively costly. Then, any project that is riskier if separately incorporated than it is if adopted within the firm will be organized as a project financing venture.
This corollary is a more direct prediction. It suggests that if one makes an ex post comparision of project financed ventures with conventionally financed ventures, one should expect the former to be riskier and more highly levered on average. In closing then, our theory suggests that the economic motivation for project financing stems from a rational attempt to minimize the allocational distortions of asymmetric information rather than the popularly assumed inability on the part of the creditors to detect the offbalance sheet debt associated with project linancing.2
V. PROOFS Proof of Theorem 1. We know that

j Rf(R,O)dR= j Rf(R)dR+G j
R+
R+
R+
Rf(R)dR-tli f(R)@)(R)
R+
Rf(R)dR and (1) (34)
=j R+
Rf(R)dR
using
= R. Similarly, replacing 6 by e gives
.r
J4+ Combining
R=j
Rf (R, 4) dR = i?.
(35)
(34) and (35) we have, for 0 > 8,

Rfl(R)dR=I
R+
R+
Rf(R,O^)dR=i^
R*
Rf(R,O)dR=
R+
Rf(R)dR. (36)
* For 1983.
an example
of the popular
viewpoint,
see the Wall
Street
Journal
of December
13,
CAPITAL
STRUCTURE
233
Now integrating
(11) for any 8 E [I@,, @,I.

(37)
0 0
IF(R,R)dR=[;~(R)dR+O{jF(R)dR-jp(R)dR}.
0
Since j; F(R) dR 3 16 p(R)
dR, (37) implies that (38)
{F(RJ?)dR~[%(R)dR. 0 0
Thus, (36) and (38) imply that, for 0 E [IO,, 0,], .f(R @W(R). Next, add and subtract s; F(R) to obtain
~F(R,H)dR=~F(R)dR-[l-0] 0 0 dR from the right-hand (39)
side (r.h.s.) of (37)
[F(R)dR-[+(R)dR
0 0
(40)
Since H < 1 and j; F(R)
dR > 1; p(R)
dR, (36) and (40) jointly
imply that
(41)
.f(R) Q.f(R 0).

Since (41) holds for any 0~ [e,, Hz], it must be true that f(R) Of(R, 6).
(42)
Further, from (40) we see that for 0~4, I0 Combining

F(R, 0) dR < 1 F(R, 0) dR. 0 (43)
(36) and (43) means that for d<o^, .f(R 0) @.f(R 0).
(44)
Joining (39) (41) (42), and (44) yields the desired result.
Proof elf Lemma 1. The
Q.E.D.
expected end-of-period
cash flow to the
creditors is
B(c&,O)=[ r I[R-crK]f(R,fI)dR+IctK. (45 1
234 And using (11) we can write

B(aK, %)=I
SHAH
AND
THAKOR
I-
Z[R-uK][f(R)+%{f(R)-f(R)}]
dR+orKZ.
(46)
Differentiating
(46) with respect to %, holding c( and K fixed, produces

iiB(ctK, %)jr;%= j Z[R-aK][f(R)-f(R)] dR.
(47)
Rearranging
(47) we obtain,
s
F
I-Z[R - c4U-W) -.fWH dR

= j IRfl(R)dR-1 r I IRf(R)dR-aKj I .f(R)dR+aK/ If(R)dR.
Integrating
the above equation by parts, we get

dR=j
I-
j Z[R-clK][f(R)-,f(R)]
P(R)dR-j
I-
F(R)dR<O.
Q.E.D.
Proof of Lemma 2. From Riley 129, Theorem 71, it follows immediately that the reactive equilibrium is the Pareto dominant member of the set of informationally consistent contracts. Riley has also proved that the reactive equilibrium is always fully separating, which justifies constraint (15). Note that ( 18) ensures informational consistency. Q.E.D. Proqf of Lemma 3. We need to show that (19) is equivalent to (13) with %=%, (20) and (21) are equivalent to (14) and (15), and (24) and (24) are jointly equivalent to (18). To see that (19) is equivalent to (13) with %= 9, simplify (13) to get
E(%)=ZR[l
-r]-J
I-(H)
ZR[l-t]f(R,%)dR
4 PIO) ZL(%)[l
- [l -a(%)] Combining (14) and (15) we see that
-t]f(R,%)dR
zp-ZR[l
-T]
+zp.
(48)
d%)zp= jr,tilZRfW,WR+j
r(0)
IU%)f(R,%)dR.
(49)
CAPITAL
STRUCTURE
235
Thus,
P(O) =
[ 1 - s] ZL(0)f( R, 0) dR
[l -71
sI-IO,
ZRf(R, O)dR-
[l -
~1 de IP.
(50)
Substituting (50) into (48) shows the desired equivalence. To see that (14) and (15) are equivalent to (20) and (21) note that (49) can be written as
a(B)Ip=j r(B) IRf(R,O)dR+IL(O)-j ZL(fl),f(R,l?)dR. l-10,
Finally, we will prove that (18) is equivalent to (24) and (24). First, we show that (18) implies (24) and (24). Using (13) and (18) we obtain
= -Wb + 11- ~1s,-,,j, CR- U&l,f(R
H^) dR
- Cl - 71I,,t,j, I[R- UH^)lf(R 0) dR.

Thus, using (11) and rearranging
E(8)4(8)>Z[l -r-j
we get
dR.
1 _ [R-I@)][&s][f(R)-,f(R)] f-co,
(51)
Reversing the roles of 8 and g produces E(B)-@)<I[1 Taking

-r] [R-L@)][&e][f(R)-f(R)] j /-((II dR. (52)
0 < 8 and combining V(Q) 6 [ad)
(51) and (52) gives us - wzbi [e - 01~~1 6 ff(e), (53)
where
q(e)=Z[l -z] [R-a(B)][f(R)-f(R)] 1 I-(O) dR>O.
Since O-C 0, (53) implies that the function q(0) is nondecreasing in 8. Thus, it is Riemann integrable, continuous a.e., and differentiable a.e. in 0. Taking limits as 4 -+ 8 we get E(0) = q(0) for a.e. 8 E [0,, e,]. Moreover, because ~(6) is nondecreasing, we have E(O) 2 0 wherever E(B) exists.
236
SHAH
AND
THAKOR
We will now prove that (24) and (24) together imply (18). Note that E(B)-E(B(8)
= ~(8) - ~(8) - [ 1- T] ?*,I,,[R - Whlf(R

+[1-r]j
=E(8)-E(d)r(B) [R-L(&]f(RJ)dR [
4) dR
[l -z] [0-e]
_ Z[R-L(8)][8-8][f(Rj-f(R)] r(n)
dR
=E(B)-E(8)-
E(d)
fora.e. 8~[0,,tI,] fora.e. 8~[0,,0,].
(using(24)) (54)
= E(z)dz-[H-g]E(d) 5ri By (24) we know that
11 s ti
E(z)dz>
[e-t?]
E(8)
for
0 > d and a.e. 8,ee [0,, fI,].
Thus, the r.h.s. of (54) is nonnegative, implying that (18) holds for a.e. 8. Of course, over the set of Lebesgue measure zero that (24) does not hold, ( 18) must hold. Q.E.D.
Proof of Lemma 4.
Differentiating
(20) with respect to 8, holding
tl
fixed, we obtain
-ctZ[3K/M] ! [f(R)+e(f(R)-f(R)}] r(o) I[R-cK][f(R)-f(R)] dR=O. dR+Ia[lK/iM]
+I
Thus,
r(u)
f(R,tl)dR
=I = j
f-(H) r(e)
Z[R-aK][f(R)-f(R)] E.l(R)dR-j p(R)dR,
dR
r(B)
which implies frro,F1(R)dR-j
aK/ae =
r(8)
E(R)dR
Ml - fr(e,.f(R 0) dR1
> 0.
Q.E.D.
CAPITAL
STRUCTURE
237 8
Proof of Lemma 5. Differentiating fixed, and rearranging gives
(20) with respect to ct, holding
From (20) we also obtain p= K+cr- Substituting j r(n) Rf(R.QdR-Kj f(R,QdR. l-(H) (56)
(56) in (55) yields
f (R, 0) dR
f(8)
Q.E.D.
Proof of Theorem 2. Any feasible capital structure schedule satisfies (19), (20), and (24) simultaneously. Combining (19) and (24) we obtain
h(~)~T=~/XX(6,)1-
[l -11 j j I[R-L(,-)][f(R)-f](R)] n r(z)
dRdz. (58)
Assuming that a solution
to (57) and (20) exists, denote the solution
by (59)
(60) (61)
L(0) = L*(8), a(8) = LY*(tl), K(0) = K*(0) = L*(8)/a*(0). Substituting (59), (60), and (61) into (58) yields h*(e)
PT
= @M*(&) T - [l -T] x [.f( R) -f(R)]
j,:I jJ-e,r, I[R - L*(z)] dR dz.

(62)
Now for any t3~ [e,, e,), the second term on the r.h.s. of (62) is strictly positive since T*(Z) has positive measure because of our assumption that the firms under consideration have nonzero leverage. To see this, note that since L>O and L, >O, we know that L*(B,)>O. Thus, p(f*(fl,)) >O. It is also obvious then that we can find a 8 = o2 -E, where E> 0 is arbitrarily small, such that p(r*(W)) > 0. Thus, h*(6) which means
b42/4212-4 PT
<@X*(0,)
1, Q.E.D.
cc*(e) < a*(B,) d lV8~ [G,, 19,).
238
Proof of Theorem 3.
SHAH
AND
THAKOR
Differentiating
(19) with respect to 8 gives

(63 1
E(B) = Za(0) p. Joining (24) and (63) yields

Za(B)pz=
[l -s]
Z[R-L(tl)][f(R)-f(R)] 110)
dR>O.
Thus, ~((3) 3 0. Now we have shown in Lemma 4 that for any CX, higher 8 a implies a higher K, and we have shown in Lemma 5 that for any 8, a higher a implies a higher K. Therefore, for 19~ 8,, we have >
which means
We,) 2 Me,).
Hence, ,!,(e)>O, ME [e,, ez]. To prove that the net equity value is nondecreasing integrate the r.h.s. of (24) to obtain
Fl(R)dR-1 p(R)dR
in 8, we partially
l-(H)
1
dR.
20.
Q.E.D.
Proof of Lemma 6.
Differentiating
(24) with respect to I3 gives us

(f(R) -.f(R)}
~(8) = [ 1 - s] L(B) Z jr,,,
From (24) and Theorem 3, we know that E( 19)2 0 and L( (3)> 0. Thus,
s
which implies L(B) 6 E.
Proof qf Theorem 4.
rlH, {f(R)
-f(R)
i dR 2 O,
(64) Q.E.D.
Integrating H l-l=,
(24) produces (65)
~(~=~(e~)-[l-r]j~~j-
Z[R-L(=)lCf(R)-f(R)ldRd,-.
CAPITAL
STRUCTURE
239
Substituting
[Yqe2)g(e)de-[l f)l
(65) into (12) yields the following modified

-T]je2[ 0 f-l:;)
objective function
z[R-L(z)][fO(R)-f(R)]dRg(z)d~. (66)
Clearly, (66) is maximized
when E(f??) is mvimized.

E(k) = /cc(&) pt.
From (19)
Therefore, in the Pareto dominant informationally consistent schedule, ~(8~) takes on the maximum value that it can while still satisfying (20) and (64). Combining this with the assertion of Lemma 5 that a higher K accompanies a higher CI, we have a*(&)= 1Ar and
L*(k) = L,, A E.
It is transparent that if L, d 1, then cc*(BZ) = 1.
Q.E.D.
Proof of Theorem 5. Riley [29] has shown in his Theorem 7 that. if the set of informationally consistent schedules is nonempty, then there exists a unique reactive equilibrium which is the Pareto dominant member of that set. Thus, we only need to prove that the set of informationally consistent capital structure schedules in our model is nonempty, and that the Pareto dominant member of that set is sustainable as an equilibrium in the sense that it cannot be upset by a pooling allocation. We know from the proof of Theorem 2 that an informationally consistent schedule must satisfy (58). We also know from Lemma 6 that the Pareto dominant informationally consistent schedule must satisfy
a*(e,) = 1 A 1. Further, the global incentive compatibility requirement (18) calls for continuity of E(8) everywhere on its domain, and hence continuity of ~(0) through (19). It is also easy to verify that L(8) is continuous. The Pareto dominant informationally consistent schedule must, therefore, satisfy
pcP(8) IT + q(O) = zcc*(e~) pz, (67)
where
q(tl= [l--T] Z[R-L(z)]{f(R)-f(R)j dRdz
is a nonincreasing function of 0 that is continuous because E(B) is continuous. Assuming for now that p(r(13,)) > 0 (we shall verify shortly that this is indeed true), we see that, because of the continuity of L(O),
q(8)=-[I-r]j I[R-L(B)]{f(R)-f(R);dR
F(B)
240
SHAH
AND
THAKOR
-0
RISK PARAMETER
FIG.
3.
Equilibrium
when
some firm
types are not granted
leverage.
is strictly negative for all 8 d O2 at least in a neighborhood of 8,. In fact, q(8) < OV8 such that L(B) > 0 and q(8) = 0, VtI such that L(O) = 0. Thus, q(8,) = 0 and q(8) increases as 0 decreases. This implies 38 satisfying q(@) = hX*(e,)
PT.
From (67) then, a*(P) = 0. There are two possibilities. One is that 8 lies in the interior of [e,, f?,]. In this case, (67) implies that a(0) < 0, V0 < 6. However, since negative values of ~(0) are infeasible, we set u*(e) = OV0 < 8. This situation is depicted in Fig. 3. The second possibility, sketched in Fig. 4, is that 8 is strictly less than 8,) in which case E*(O) > 0, Ve E [e, ,0,]. It is easy to see now that q(e) <z@*(b) PT, VOE CQ,, W n Co, W.
Hence, +x*(~)E (0, a*(t),)) for a set of Bs with positive Lebesgue measure such that (67) holds for that set. Substituting this value of CC*(~) in (20) produces
s
r*(o)
Z[R-L*(B)]f(R,8)dR+ZLL*(8)=Zcz*(8)p,
(68)
RISK
PARAMETER
FIG.
4.
Equilibrium
when
all firm types
are granted
positive
leverage.
CAPITAL
STRUCTURE
241
which yields the value of L*(8).
Finally,
K*(e) = L*(e)/a*(e). Denote Q* = [e,, 0,] A (P, 19~1 as the set of firm types granted positive leverage in equilibrium. To complete the proof, we need to verify that p(O*)-and hence p(r(8,))-is indeed positive and that the second order condition for global incentive compatibility, E(B) > 0, is met. Now we know from the proof of Lemma 6 that this second order condition is met if
-f(R)} b 0, s if(R) dR
r*(O) We have
L*(O) 6 L*(b)
t/eE Q*.
(69) L, A L. Thus,
proved that L(B)>0 G e, V0E O*. Further, .r, {f(R) -So(W)
and
L*(&)=
dR = 0,
(70)
and
I./(R)
0
dR > S,f(
0
R) dR,
VJI, (0, L). E
(71)
Combining (70) and (71) yields (69). Thus, E(8) > 0, VB E O*. Note also that, because (0, E) has positive Lebesgue measure on Iw+ , (60) can be satisfied for a range Bs with positive measure. That is, p(O*) > 0. To verify that the candidate reactive equilibrium schedule cannot be upset by a pooling allocation, note that in c(- K space, indifference curves of firms with different 8s have the appropriate stacking property with respect to 0 required in Rileys [29] condition (A-5). Thus, no lender can profitably deviate from the candidate equilibrium with a pooling contract, given the anticipated reactions of other lenders, and we have a reactive equilibrium. Q.E.D.
Proof of Theorem 6. The only item that needs to be proved is that E*(I~) E (0, u*(e,)) VOE O*. In Theorem 2 we proved this fact under the assumption that p(O*) >O. This assumption has now been proven in Theorem 5. Q.E.D. Proof of Theorem 7. Claim (i) follows from since project financing is used for the project, that Es(e) > E,(e). Now if E;(e) = Es(e), the complete debt financing. And if E$(@) = E,(e), project tinanced venture has higher leverage.
(33). To show (ii) note that the definition of B shows project financed venture has then 8> B and again the This establishes claim (ii).
242
SHAH
AND
THAKOR
Claim (iii) follows from the [act that E,(O) in (31) is a nondecreasing, convex function of 8 whereas E,(8) in (32) is constant for all 8. Thus, there exists a 8 such that 0 E 0 -, V/e< 9 and 0 E 0 +, V/e> 6. This proves (iii). Claim (iv) follows from the fact that information production is always accompanied by complete debt financing. Next, consider a project with a risk parameter 0 that produces a postproject-adoption risk pframeter of Q,, ,(6) for the whole firm. Because E,(O,) > Z~T- e(p) = ES(02) by assumption, we know that as long as E,(B) > E,(G) over a set of nonzero Lebesgue measure, the monotonicity and convexityof E,(O) ensyres that 3 a unique f3 in thz interior of O* such that ,!?,(S)<E,(O), VB<B and E:,(Q)>E,(O), V838. Now suppose the projects 8 E [8, Oz]. Looking at Fig. 1 we see that if 0 < 8, then E!J 0) = i,(e) = 1~7 - Q(p) < E,(fI) if 0 d 4 and E;(O) = E,(O) < B,(O) if I3> 8. Thus, coFventiona1 financing is better if 6 < 8. By similar logic, when 0 >, 8 > 0, we have E!JO) = g:,(O) > E,(O). Thus, project financing is better if 0 2 8. The riskiest projects have Eg(O,) = Zpz[ 1 A a]. Since conventional linancing cannot produce an equity value exceeding Zpz[l A a], but could produce a lower equity value for a project with 0 = 62, it follows that the values of the riskiest projects will be maximized with project financing. Finally, if ~(0 + ) 6 r for r small enough, most of the ventures involving project financing will have complete debt financing, leading to the crossQ.E.D. sectional claim.
REFERENCES
I. K. J. ARROW, Limited knowledge and economic analysis. Amer. Econ. Rev. 64 (1974), l-10. 2. W. BAUM~L AND B. MALKIEL, The firms optimal debt-equity combination and the cost of capital, Quart. J. Econ. 81 (l967), 547-578. 3. J. I. BULOW AND J. B. SHOVEN. The bankruptcy decision, Bell J. Econ. 9 (1978). 437456. 4. P. K. CHANEY ANL) A. V. THAKOR. Incentive effects of benevolent intervention: The case of government loan guarantees, J. Public Econ. 26 (1985). 169-189. 5. M. V. DARROUGH AND N. M. STOUGHTON. Moral Hazard and Adverse Selection: The Question of Financial Structure, manuscript, Univ. of British Columbia. Vancouver. August 1984. 6. P. DASGUPTA AND E. MASKIN, The existence of equilibrium in discontinuous economic games, I: Theory, Rev. Gon. Stud. 53 (1986), l-26. 7. P. DASGUPTA ANU E. MASKIN, The existence of equilibrium in discontinuous economic games. II. Applications, Rec. Econ. Stud. 53 (1986), 2742. 8. H. DEANGELO AND R. W. MASULIS. Optimal capital structure under corporate and personal taxation, J. Finan. Econ. 8 (1980). 39-65. 9. D. DIAMOND, Financial intermediation and delegated monitoring, Rev. Econ. Stud. 51 (1984), 393415.
CAPITAL
STRUCTURE
243
10. S. J. GROSSMAN AND 0. D. HART, Corporate Financial Structure and Managerial Incentives, Working Paper No. 21-79, Rodney L. White Center for Financial Research, Univ. of Pennsylvania, Philadelphia, 1979. Il. R. A. HAUGEN AND L. W. SENBET. The irrelevance of bankruptcy costs to the theory of optimal capital structure, J. Finance 33 (1978). 383-394. 12. R. HEIN~~EL. A theory of capital structure relevance under imperfect information, J. Finance 37 (1982). 1141-1150. 13. M. F. HELLWIG, Bankruptcy, limited liability, and the Modighani-Miller theorem. Amer. Econ. Rev. 71 ( 198 1). 155-l 70. 14. M. C. JENSEN AND W. H. MECKLING, Theory of the hrm: Managerial behavior, agency costs, and ownership structure. 1. Finw. Econ. 3 (1976) 305-367. 15. E. H. KIM, A mean-variance theory of optimal capital structure and corporate debt capacity. J. Finance 33 (1978), 45-64. 16. E. H. KIM. Millers equilibrium, shareholder leverage clienteles, and optimal capital structure, J. Finunce 37 (1982), 301-319. 17. W. Y. LEE, A. V. THAKOK, ANL) G. VORA, Screening, market signalling and capital structure theory, J. Finunca 38 (1983) 150771518. 18. H. E. LELAND AND D. H. PYLE, Information asymmetries, financial structure and linancial intermediaries, J. Finance 32 (1977) 371-387. 19. J. LINTNEK. Bankruptcy risk, market segmentation. and optimal capita1 structure, in Risk and Return in Finance (I. Friend and J. Bicksler, Eds.). Sect. 1, pp. I-128, Ballinger. Cambridge, 1977. 20. J. C. T. MAO, Project linancing: Fundmg the future, Financid Execuriue (1982), 23-28. Project Organization and Finance. The Cleveland-Cliffs Iron 21. T. E. McG~ivu. Company, Cleveland, Ohio, 198 1. 22. M. H. MILLER, Debt and taxes, J. Finance 32 (1977). 261-274. 23. F. MODIGLIANI AND M. H. MILLER, The cost of capital, corporation tinance, and the theory of investment, Amer. Eeon. Rer;. 48 (1958), 261-297. 24. F. MOUIGLIANI AND M. H. MILLEK, Corporate income taxes and the cost of capital: A correction, Amer. Econ. Rev. 53 (1963), 433443. 25. S. MYERS AND N. S. MAJLUF, Corporate Iinancing and investment decisions when lirms have information that investors do not have. J. Finun. Econ. 13-2 (1984), 187-221. 26. R. B. MYEKSOS, Incentive compatibility and the bargaining problem, Econometrics 47
(1979), 27. 28. 29. 61-74.
30. 31. 32.
P. K. NEVITT. Project Financing. Euromoney Publications, London. 1979. R. T. S. RAMAKRISHNAN AND A. V. THAKOR, Information reliability and a theory of financial intermediation, Rev. Econ. Sd. 51 (1984). 415432. J. G. RILEY, Informational equilibrium, Econometrica 47 (1979), 331-359. S. A. Ross, The determination of financial structure: The incentive signalling approach, Bell J. Econ. 8 ( 1977). 2340. S. A. ROSS. Debt and taxes and uncertainty, J. Finance 40 (1985). 637-656. M. A. ROTHSCHILD ANI) J. E. STIGLITZ, Increasing risk: A defmition, J. Econ. Theory 2
of optimal capital structure. Be//J. Econ. 7 (1976), 33-53. 34. J. E. STIGLITZ, Some aspects of the pure theory of corporate finance: Bankruptcies and takeovers, Bell J. Econ. Munage. Sri. 3 (1972). 458482. 35. E. TALMOR, R. HAUCEN AND A. BARPI;F.A, The value of the tax subsidy on risky debt. J. Bus. 58-2 (1985). 191-202. 36. R. M. TOWNSENU, Financial Structures as Communication Systems, manuscript. February 1984. 37. J. B. WARNER. Bankruptcy costs: Some evidence. J. Finance 32 (1977). 333-347.
(1970), 225-243. 33. J. H. Scar-r. JR., A theory

Optimal Capital Structure and Project Financing : Salman Shah

Uploaded by

Document Informationclick to expand document information

Copyright:

Available Formats

Optimal Capital Structure and Project Financing : Salman Shah

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Optimal Capital Structure and Project Financing : Salman Shah

Uploaded by

Copyright:

Available Formats

JOURNAL

of Toronto, 1 V4, Canada

i' R/(R,a,dR=J^ Rf(R,O)dR=R Rt R+

Z[R-crK][l-z]f(R,O)dR (3) e)dR, f3)dR. (4) (5)

+IP IcrKf (R, 6) dR,

The net expected end-of-period therefore, is E?=P-[l-a]@ =

cash flow accruing to the shareholders,

(4) and (8) gives us sr

(9) in (7) produces P=Z{[l-T]

which is maximized mation equilibrium leverage proposition

f(R 0) =f(R) + W-(R) -.fW)l,

I.., n,,..., Q,*) > 5,(Q2*)

Z[R - CC(~) K(@][

we) = 40) ZP,

ve E ce, , e,i, veE w,, e,i,

(15) (16) (17) (18)

we) 2 0, 06+w 1, zwe)3E(Qe)

t/k [e,, 61,

* s81E(e) g(e) de,

E(B) = Ia(B) pz,

L(B) E a(8) K(B), E(6) a 0, 06a(@)6 1,

wherever E(B) exists.

are next presented to examine the nature of the

the amount of nondecreasing (face value of its total assets

02) dR + IL,,, = Zp.

Let K,(Q2) satisfy (25) so that L,= of c[ satisfying L s0

1. K,,,(8,). Next define Cras the value

The RCSE exists and is unique.

SHAH AND THAKOR

=o, @*(@I 1 E (03@*(w),

v/8$0* VoEO* n [O,, &),

[l A c?] E (0, 1-J

FOR PROJECT FINANCING

that Q(. ) be additive. measure of A.

we,) =I 4; p,c 1- tl h,(P,, e,(s)) dP,,

C~J+,-~(~,+,)l~,+,(~,+,,~,+,(~))~~,+, 1+1 - EJ(O,)- ZCl- a*te,+ ,)I P,

Note that @*GA+,(W, K*Ub+ ,(W) is the capital structure contract

for all possible

E;(e) > &t,+ ,(S), 0)

FIG. 2. Equity values as functions which project financing is preferred.

of the risk parameter

for those risk parameter

The project has a higher leverage than it would with conventional

V. PROOFS Proof of Theorem 1. We know that

Rf(R)dR and (1) (34)

= R. Similarly, replacing 6 by e gives

(34) and (35) we have, for 0 > 8,

see the Wall

(11) for any 8 E [I@,, @,I.

Since j; F(R) dR 3 16 p(R)

dR, (37) implies that (38)

side (r.h.s.) of (37)

Since H < 1 and j; F(R)

dR, (36) and (40) jointly

.f(R) Q.f(R 0).

Further, from (40) we see that for 0~4, I0 Combining

cash flow to the

234 And using (11) we can write

I.., n,,..., Q,) > 5,(Q2)

=o, @(@I 1 E (03@(w),

Note that @GA+,(W, KUb+ ,(W) is the capital structure contract

cc(e) < a(B,) d lV8~ [G,, 19,).